BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 006140
         (659 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255564100|ref|XP_002523048.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223537731|gb|EEF39352.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 644

 Score =  719 bits (1856), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/633 (57%), Positives = 453/633 (71%), Gaps = 33/633 (5%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
           +LYP SID+ELWL+AE+R QEIL  +QP+  SE+KRKEVI+YIQRLI  +Y  EVFPFGS
Sbjct: 22  DLYPLSIDSELWLMAEKRTQEILWVLQPSSSSEQKRKEVIDYIQRLIKHHYATEVFPFGS 81

Query: 95  VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC 154
           VPLKTYLPDGDIDLT L HQN+EEDLAR VC IL   +Q+   E+KDV+Y+ AQVK+VKC
Sbjct: 82  VPLKTYLPDGDIDLTALSHQNMEEDLAREVCDILTYAEQNLESEVKDVRYIQAQVKVVKC 141

Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
           SV+NI VDISFNQMAGL ALCFLE+VD+LIGKDHL K S+ILIKAWC+YESRILGAH+GL
Sbjct: 142 SVKNISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKHSIILIKAWCFYESRILGAHHGL 201

Query: 215 ISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIV 257
           +STYALE++VL+                 RFL+YY+TFDWDNYC++INGPVAISSLPEI+
Sbjct: 202 LSTYALEILVLYIVNVFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAISSLPEIM 261

Query: 258 AETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
            E   ++ +ELLL+PEFLK+C+E +SVPI A+EN GHEF+IKHLNI+DPLKD+NNLGRSV
Sbjct: 262 TEAPYSNRNELLLTPEFLKRCKERFSVPIKAVENGGHEFSIKHLNILDPLKDSNNLGRSV 321

Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAF 377
           SKGNFHRI+CALSYGAQRLGEIL LPG  +G GLE FFINTL+RNG+G RPD  VP+ AF
Sbjct: 322 SKGNFHRIKCALSYGAQRLGEILMLPGENMGAGLENFFINTLDRNGRGERPDALVPVPAF 381

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYV 437
           G+G+SEA+DLSGDYD+YYNG+L GQ YH Y+LPV+ Q +PPSSPSQI     WD L Q +
Sbjct: 382 GTGRSEASDLSGDYDNYYNGILQGQWYHSYSLPVSPQLTPPSSPSQIQQGCTWDTLSQLL 441

Query: 438 QGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEI 497
             K N + Q GT VFVPR+P+ HPYA++V A+  STDE  KSRGTGTYIP  +  P  + 
Sbjct: 442 WWKQNFLSQRGTNVFVPRVPYHHPYAAKVYATASSTDEMGKSRGTGTYIPHVSHHPCRDK 501

Query: 498 NRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGC---SLNVENGCSLNMENSCS 554
               R +NP+SL +  ++  RRK +K E  P   +    C   SL+       N EN   
Sbjct: 502 MSCFRVKNPDSLTYSSINS-RRKAEKVEGGPEAKSDNECCLNLSLDQNKAREENSENGSC 560

Query: 555 LILSLEEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSPVVTTIMFGTYEHSLKPSV 614
             LSL++FPLL           + Q      +S Q  + S+  + +I FGT+ H   P  
Sbjct: 561 PDLSLDQFPLL-----------SYQASHRILQSPQAKECSNS-IRSIPFGTFRHIPSPLA 608

Query: 615 LPSPSARMKVSGASTTLDTEPGLTIMERQKQQE 647
           LPS    ++     TT +  P +      KQQ+
Sbjct: 609 LPSSVTGVQTESVPTTSEIIPAIRETGMLKQQK 641


>gi|255559667|ref|XP_002520853.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223539984|gb|EEF41562.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 655

 Score =  716 bits (1848), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/634 (57%), Positives = 457/634 (72%), Gaps = 24/634 (3%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
           +L+P SID+ELWL+AE+R QEIL  +QP++ SE+KRK VI+YIQRLI  ++  EV PFGS
Sbjct: 22  DLFPLSIDSELWLMAEKRAQEILWILQPSLASEQKRKVVIDYIQRLIKHHFATEVLPFGS 81

Query: 95  VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC 154
           VPLKTYLPDGDIDLT L HQN+EEDL R +C IL  E+Q+S  E+KDV+Y+ AQVKIVKC
Sbjct: 82  VPLKTYLPDGDIDLTALSHQNMEEDLVREICNILTYEEQNSESEVKDVRYIQAQVKIVKC 141

Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
           SV+NI VDISFNQMAGL ALCFLE+VD+LIGKDHL K S+ILIKAWC+YESRILGAH+GL
Sbjct: 142 SVKNISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKCSIILIKAWCFYESRILGAHHGL 201

Query: 215 ISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIV 257
           +STYALE++VL+                 RFL+YY+TFDWDNYC++INGPVA+SSLPEI+
Sbjct: 202 LSTYALEILVLYIINAFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAVSSLPEIM 261

Query: 258 AETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
            E+  N+G+ELLL PEFLK+C+E +SVPI A+EN GHEF+IKHLNI+DPLKDNNNLGRSV
Sbjct: 262 TESPYNNGNELLLCPEFLKRCKEKFSVPIKAVENGGHEFSIKHLNILDPLKDNNNLGRSV 321

Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAF 377
           SKGNFHRI+CALSYGAQRLGEIL LPG  +G GLE FFINTL+RNG+G RPD  VP+  F
Sbjct: 322 SKGNFHRIKCALSYGAQRLGEILALPGENMGAGLEIFFINTLDRNGRGERPDTLVPVPTF 381

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYV 437
           G+G+SEA DLSGDYD++Y+GLL GQ YH Y+LPV+ + +PPSSPSQI     WD L Q +
Sbjct: 382 GTGRSEAFDLSGDYDNHYSGLLQGQWYHKYSLPVSPEMTPPSSPSQIQQSFTWDRLSQLL 441

Query: 438 QGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEI 497
           + K N + Q GT VFVPR+P  HPYA +V A+  ++DE  KS+GTGTYIP  +  P  ++
Sbjct: 442 RCKQNFLSQRGTNVFVPRVPHRHPYAPKVYATASTSDEKGKSQGTGTYIPNVSHHPRKDM 501

Query: 498 NRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGC---SLNVENGCSLNMENSCS 554
              +R++NP+SL H  ++   RK +  E  P   +    C   S +       N EN   
Sbjct: 502 MSWMRSKNPDSLTHSSLNS-PRKAEIVEGGPGAKSDNYCCLDLSFDQYKAGQENRENGSC 560

Query: 555 LILSLEEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSPVVTTIMFGTYEHSLKPSV 614
            +LSL++FPLLP  +    S  NQ       +ST   + S+     I  GT+ H   P  
Sbjct: 561 TVLSLDQFPLLPSLEKSVTSEINQS-SHPILQSTHAKEYSTS-SRNIPTGTFRHIPFPVA 618

Query: 615 LPSP-SARMKVSGASTTLDTEPGLTIMERQKQQE 647
           LPS  + R   SG STTL+  P  + M   KQQ+
Sbjct: 619 LPSSVTGRQTESGVSTTLEIMPASSEMGMLKQQK 652


>gi|224146203|ref|XP_002325920.1| predicted protein [Populus trichocarpa]
 gi|222862795|gb|EEF00302.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  609 bits (1570), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 304/467 (65%), Positives = 363/467 (77%), Gaps = 20/467 (4%)

Query: 39  FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLK 98
             ID ELWL+AE+R QEIL TIQP   SE KR EVINYIQ LI  Y+ +EVF FGSVPLK
Sbjct: 26  LQIDPELWLMAEKRTQEILYTIQPTFASEHKRMEVINYIQSLIKYYFTVEVFAFGSVPLK 85

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
           TYLPDGDIDL VL HQN+EE+LAR VC +L+ E+ D  F++ DVQY+ AQVK+VKCSV+N
Sbjct: 86  TYLPDGDIDLMVLSHQNMEEELARGVCTLLQREELDPEFQVNDVQYIHAQVKLVKCSVKN 145

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           I VDISFNQMAG SALCFLE+VD+LIG+DHLFKRS+ILIKAWC+YESRILGAH+GLISTY
Sbjct: 146 ISVDISFNQMAGPSALCFLEQVDQLIGQDHLFKRSIILIKAWCFYESRILGAHHGLISTY 205

Query: 219 ALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
           AL+++                 VL++FLDYY+ FDWDNYC+SINGP+ ISS P+   ++ 
Sbjct: 206 ALQILVLNIINVFHSSLPDPLAVLYKFLDYYSAFDWDNYCVSINGPIPISSFPQ--TDST 263

Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
            N+G+E L+S EFL+  RE ++ P+  LEN  HEF IKHLNIVDPLK +NNLGRSV+KGN
Sbjct: 264 HNNGNESLISQEFLRNFREKFAFPMKELENGAHEFPIKHLNIVDPLKSSNNLGRSVNKGN 323

Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGK 381
           FHRIR ALSYGAQRLGEI+ LPG  +G  LEKFF+NTL+RNG+G RPD  VP+ AFG+G+
Sbjct: 324 FHRIRGALSYGAQRLGEIIALPGEAMGGRLEKFFMNTLDRNGRGQRPDADVPVPAFGTGR 383

Query: 382 SEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKG 441
           SEA+DL+GDYD YY+GLL+GQ YH Y LPV  Q S PSSPSQI  +SA DVL Q +Q K 
Sbjct: 384 SEASDLNGDYDKYYSGLLHGQWYHSYALPVPPQPSSPSSPSQIKQKSARDVLPQLLQSKQ 443

Query: 442 NLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE 488
           +++ Q GTEVF PR   CHPYASQV  +    D   KSRGTGTYIP+
Sbjct: 444 DILSQRGTEVFFPRQK-CHPYASQVHVAISGIDTMRKSRGTGTYIPD 489


>gi|42565972|ref|NP_191191.2| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
           thaliana]
 gi|30725328|gb|AAP37686.1| At3g56320 [Arabidopsis thaliana]
 gi|110736147|dbj|BAF00045.1| hypothetical protein [Arabidopsis thaliana]
 gi|332645988|gb|AEE79509.1| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
           thaliana]
          Length = 603

 Score =  513 bits (1320), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 270/532 (50%), Positives = 353/532 (66%), Gaps = 39/532 (7%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           IDA+ W++AEER  EILCTIQPA+VS++ R E+I+Y++ LI  + GIEVF FGSVPLKTY
Sbjct: 35  IDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKTY 94

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
           LPDGDIDLTVL  QN+++D    +C  L+NE+++S F   DVQ++PAQVK++KC+++NI 
Sbjct: 95  LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEERESEFHATDVQFIPAQVKVIKCNIRNIA 154

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           VDISFNQ AGL ALCFLE+VD+L G+DHLFKRS+IL+KAWCYYESRILGA+ GLISTYAL
Sbjct: 155 VDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLISTYAL 214

Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
            ++                 VL++FLDYY +FDW+NYCIS+NGPV ISSLPE+ A + EN
Sbjct: 215 AVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAASPEN 274

Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
            G ELLL  +FL+ C E+YS P  A+++ G EF IKHLNIVDPLK +NNLG+SV++GN  
Sbjct: 275 -GHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNNLGKSVTQGNVQ 333

Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
           RIR A + GA++L ++L+LPG  +G  LEKFF N+LERNGKG R DV  P+ AFG+G+SE
Sbjct: 334 RIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVNDPVTAFGTGRSE 393

Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNL 443
            ++LSGD+DS +  L+YGQ YH ++LP T Q       S +   S WD++   V  + N 
Sbjct: 394 LSELSGDFDSSFGRLVYGQMYHGHSLPGTFQHGYIPVSSHL---SGWDIVRHLVTYRKNE 450

Query: 444 VYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRA 503
            +     V     PF  P  S             K+RGTGTYIP+ ++    +  R    
Sbjct: 451 FHLRSLNVSTSMQPF--PLHSLPNG----CQNMRKTRGTGTYIPDMSQQVYSDRFRE-SG 503

Query: 504 RNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLN--VENGCSLNMENSC 553
               S NH+  S     ID        D   S C+L+  V   C+ N    C
Sbjct: 504 TGTSSTNHLEASS--EAIDN-------DGASSCCNLSGEVSTSCTSNKGEEC 546


>gi|356570171|ref|XP_003553264.1| PREDICTED: uncharacterized protein LOC100797780 [Glycine max]
          Length = 644

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 288/582 (49%), Positives = 376/582 (64%), Gaps = 60/582 (10%)

Query: 12  DLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRK 71
           D QE LL S          S P  L   SID ELW +AEER QEIL TIQP ++SE  RK
Sbjct: 2   DRQENLLPS----------SMPSQL--LSIDKELWQMAEERAQEILWTIQPNVLSEVNRK 49

Query: 72  EVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
           +VI+Y+QRLI GYYG EV PFGSVPLKTYLPDGDIDLT L H++ EEDLA+ VC +L++ 
Sbjct: 50  DVIDYVQRLIRGYYGAEVLPFGSVPLKTYLPDGDIDLTALSHEDAEEDLAQAVCYVLQSG 109

Query: 132 DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFK 191
           D D  +++KD++Y+ AQV++VKC+V+NI VDISFNQMAG+  L FLE+VD+L+GK+H+FK
Sbjct: 110 D-DPEYQVKDIKYIRAQVRLVKCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFK 168

Query: 192 RSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR-----------------FLDYYNT 234
           RS+ILIKAWCYYESR+LG H+GL+STYA+E++VL+                  FLDYY +
Sbjct: 169 RSIILIKAWCYYESRLLGGHHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGS 228

Query: 235 FDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGH 294
           FDWD+  +SI GP  +SS PEI AET+E D  E LL  EFL+  R + S P  A +   H
Sbjct: 229 FDWDHNYVSIWGPKPLSSFPEI-AETLECDHGEFLLQKEFLRNYRNMCSFPSRATKTMTH 287

Query: 295 EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKF 354
           EF +K +NI+DPL+++NNLGRSV+  + HR R ALSYGA+RL +ILTLPG  +G  LEKF
Sbjct: 288 EFPVKFMNILDPLRNDNNLGRSVNIASLHRFRFALSYGARRLKQILTLPGETMGAALEKF 347

Query: 355 FINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQ 414
           F +TL+RNGKG R DV VP+  FG G+S  + L GD +SY+ GL Y Q Y +Y LPVT  
Sbjct: 348 FFSTLDRNGKGERADVDVPVSPFGIGRSIESVLCGDCESYFGGLQYVQLYRNYALPVTVH 407

Query: 415 FSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTD 474
            S PSSPSQ       D+L        ++ YQ GT+V++P     HP        T+S +
Sbjct: 408 SSSPSSPSQD------DILASSTHQNWSMFYQGGTDVYIPAQTLYHP--------TYSLE 453

Query: 475 EGEKSRGTGTYIPERTRPPPIEI-NRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNL 533
              KSRGTGTYIP+       ++  +P R R   S  +  + K   K  +AE + S  ++
Sbjct: 454 GRGKSRGTGTYIPDLNYNSYWDMRTKPSRPRRFASSKYNALPKSPPKKLQAEEVHSETDI 513

Query: 534 ESGCSLNVENGCSLNMENSCSLILSLEEFPLLPVT-KNLPPS 574
            +             + +S    LS E+FPLLP   K++PP+
Sbjct: 514 NA-------------ISDSRLFELSSEDFPLLPCNCKDIPPT 542


>gi|356561857|ref|XP_003549193.1| PREDICTED: uncharacterized protein LOC100787145 [Glycine max]
          Length = 684

 Score =  505 bits (1301), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 276/536 (51%), Positives = 357/536 (66%), Gaps = 46/536 (8%)

Query: 12  DLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRK 71
           D QE LL          ++S P  L   SID ELW +AEER QEIL TI+P ++SE  RK
Sbjct: 2   DRQENLL----------AFSMPSQL--LSIDEELWRMAEERAQEILWTIEPIVLSEVNRK 49

Query: 72  EVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
           +VI+Y+QRLI GYYG EV PFGSVPLKTYLPDGDIDLT L H++ EEDLA+ VC IL++ 
Sbjct: 50  DVIDYVQRLIRGYYGAEVLPFGSVPLKTYLPDGDIDLTALSHEDAEEDLAQAVCNILQSG 109

Query: 132 DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFK 191
           D D  +++KD+QY+ AQV++VKC+V+NI VDISFNQMAG+  L FLE+VD+L+GK+H+FK
Sbjct: 110 D-DPEYQVKDIQYIRAQVRLVKCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFK 168

Query: 192 RSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR-----------------FLDYYNT 234
            S+ILIKAWCYYESR+LG H+GL+STYA+E++VL+                  FLDYY +
Sbjct: 169 HSIILIKAWCYYESRLLGGHHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGS 228

Query: 235 FDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGH 294
           FDWD+  +SI GP  +SSLPEI AET E D  E LL  EFL+  R + S P  A E   H
Sbjct: 229 FDWDHNYVSIWGPKPLSSLPEI-AETPECDQGEFLLQKEFLRNYRNMCSFPSRASETMTH 287

Query: 295 EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKF 354
           EF +K +NI+DPL+++NNLGRSV+  N HR+R ALSYGA+RL +ILTLPG  +G  LEKF
Sbjct: 288 EFPVKFMNILDPLRNDNNLGRSVNIANLHRVRFALSYGARRLKQILTLPGENMGAALEKF 347

Query: 355 FINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQ 414
           F +TL+RNGKG R DV VP+  FG G+SE + L GD +SY  GL Y Q Y +Y +P+   
Sbjct: 348 FFSTLDRNGKGERADVAVPVSPFGIGRSEESVLRGDCESYCGGLQYVQLYRNYAMPIAVH 407

Query: 415 FSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTD 474
            S PSSPSQ       D+L    Q   ++ YQ GT+V++P     HP        T++ +
Sbjct: 408 SSSPSSPSQD------DILAPSTQQNWSMFYQGGTDVYIPGQTLYHP--------TYNLE 453

Query: 475 EGEKSRGTGTYIPERTRPPPIEIN-RPVRARNPESLNHVPVSKLRRKIDKAERLPS 529
              KSRGTGTYIP+       +++ +P R R   S  +  + K   K  +AE + S
Sbjct: 454 GRGKSRGTGTYIPDLNYNSYWDMHTKPSRPRRFASSKYNALPKSPSKKQQAEEVHS 509


>gi|7572930|emb|CAB87431.1| putative protein [Arabidopsis thaliana]
          Length = 614

 Score =  505 bits (1300), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 270/543 (49%), Positives = 353/543 (65%), Gaps = 50/543 (9%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           IDA+ W++AEER  EILCTIQPA+VS++ R E+I+Y++ LI  + GIEVF FGSVPLKTY
Sbjct: 35  IDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKTY 94

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
           LPDGDIDLTVL  QN+++D    +C  L+NE+++S F   DVQ++PAQVK++KC+++NI 
Sbjct: 95  LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEERESEFHATDVQFIPAQVKVIKCNIRNIA 154

Query: 161 VDISFNQMAGLSALCFLEK-----------VDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           VDISFNQ AGL ALCFLE+           VD+L G+DHLFKRS+IL+KAWCYYESRILG
Sbjct: 155 VDISFNQTAGLCALCFLEQVLSAIQNQAPEVDQLFGRDHLFKRSIILVKAWCYYESRILG 214

Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
           A+ GLISTYAL ++                 VL++FLDYY +FDW+NYCIS+NGPV ISS
Sbjct: 215 ANTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISS 274

Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
           LPE+ A + EN G ELLL  +FL+ C E+YS P  A+++ G EF IKHLNIVDPLK +NN
Sbjct: 275 LPELTAASPEN-GHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNN 333

Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
           LG+SV++GN  RIR A + GA++L ++L+LPG  +G  LEKFF N+LERNGKG R DV  
Sbjct: 334 LGKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVND 393

Query: 373 PIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDV 432
           P+ AFG+G+SE ++LSGD+DS +  L+YGQ YH ++LP T Q       S +   S WD+
Sbjct: 394 PVTAFGTGRSELSELSGDFDSSFGRLVYGQMYHGHSLPGTFQHGYIPVSSHL---SGWDI 450

Query: 433 LCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRP 492
           +   V  + N  +     V     PF  P  S             K+RGTGTYIP+ ++ 
Sbjct: 451 VRHLVTYRKNEFHLRSLNVSTSMQPF--PLHSLPNG----CQNMRKTRGTGTYIPDMSQQ 504

Query: 493 PPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLN--VENGCSLNME 550
              +  R        S NH+  S     ID        D   S C+L+  V   C+ N  
Sbjct: 505 VYSDRFRE-SGTGTSSTNHLEASS--EAIDN-------DGASSCCNLSGEVSTSCTSNKG 554

Query: 551 NSC 553
             C
Sbjct: 555 EEC 557


>gi|356518940|ref|XP_003528133.1| PREDICTED: uncharacterized protein LOC100815787 [Glycine max]
          Length = 680

 Score =  504 bits (1298), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 282/604 (46%), Positives = 379/604 (62%), Gaps = 77/604 (12%)

Query: 10  MEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKK 69
           M+D QE LLT           S P  L   SID ELW +AE+R+QEIL TI+P ++SE  
Sbjct: 1   MDDKQENLLT----------LSLPSQL--VSIDEELWRMAEDRVQEILWTIEPNVLSEVN 48

Query: 70  RKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILE 129
           RK+VI+Y+QRLI GYYG +V PFGSVPLKTYLPDGD+DLT L H++ EEDLA+ +C IL+
Sbjct: 49  RKDVIDYVQRLIKGYYGAKVLPFGSVPLKTYLPDGDVDLTTLIHEDAEEDLAQAICNILK 108

Query: 130 NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHL 189
           + D DS +++KD+QY+ AQV++VKC+V+NI VDISFNQMAG+  L FLE+VD+L+GK+H+
Sbjct: 109 SGD-DSEYQVKDIQYIRAQVRLVKCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHI 167

Query: 190 FKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR-----------------FLDYY 232
           FKRS+ILIKAWCYY+SR+LG HYGL+STYA+E++VL+                  FLDYY
Sbjct: 168 FKRSIILIKAWCYYDSRLLGGHYGLLSTYAVEILVLYIINRFHSVVRGPLEVLYIFLDYY 227

Query: 233 NTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENT 292
           ++FDWD+  +SI GP ++SSLPEI   T E D  E LL  EFL   + + S P  A E  
Sbjct: 228 SSFDWDHNYVSIWGPKSLSSLPEI---TPECDQGEFLLQKEFLTNYKNMCSYPTRASETL 284

Query: 293 GHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLE 352
            HEF +K +NI+DPL+++NNLGRSVS  + HR+R A +Y AQ+L +I TLPG  +G  LE
Sbjct: 285 THEFPVKFMNILDPLRNDNNLGRSVSIASLHRLRFAFAYSAQKLKQIFTLPGENMGAALE 344

Query: 353 KFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVT 412
           KFF +TLERNGKG R DV VP+  FG+G+ E   L+GD ++Y  G  Y Q YH+Y +PV 
Sbjct: 345 KFFFSTLERNGKGERADVGVPVAPFGTGRFEEPVLNGDCENYCGGSQYVQLYHNYAMPVI 404

Query: 413 AQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFS 472
              S  + P         D+L    Q   +L Y  GT+V++PR    HP        T++
Sbjct: 405 VDSSSLALPD--------DILASSTQQNWSLFYHGGTDVYIPRHTLYHP--------TYN 448

Query: 473 TDEGEKSRGTGTYIPERTRPP--PIEINRPVRARNPESL-NHVPVSKLRRKIDKAERLPS 529
            + G KSRGTGTYIP+        +    P   R P S  N  P S+L++K  + E + S
Sbjct: 449 LEGGGKSRGTGTYIPDLNYNSYWDMSTKAPRPRRFPSSKHNAFPRSRLKKK--QVEEVHS 506

Query: 530 RDNLESGCSLNVENGCSLNMENSCSLILSL--EEFPLLPV-TKNLPPSATN-----QQFD 581
             ++ +               NS S +     E+FP+LP  +K +PP+        ++F 
Sbjct: 507 EIDINA---------------NSDSRLFEFLNEDFPVLPCNSKAIPPTQAEESTPLEKFH 551

Query: 582 QSTD 585
             TD
Sbjct: 552 SETD 555


>gi|297820390|ref|XP_002878078.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323916|gb|EFH54337.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 602

 Score =  503 bits (1295), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/478 (52%), Positives = 330/478 (69%), Gaps = 35/478 (7%)

Query: 31  SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
           ++ P      I+ E W++AEER  EILCTIQPA+VS+K R E+I+Y++ LI  + GIEVF
Sbjct: 25  TAQPKADSLLIEEESWMIAEERAHEILCTIQPALVSDKSRNEIIDYVRTLIKSHDGIEVF 84

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVK 150
            FGSVPLKTYLPDGDIDLTVL  QN+++D    +C  L+NE+++S F   DVQ++PAQVK
Sbjct: 85  SFGSVPLKTYLPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEERESEFHATDVQFIPAQVK 144

Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
           ++KC+++NI VDISFNQ AGL ALCFLE+VD+L G+DHLFKRS+IL+KAWCYYESRILGA
Sbjct: 145 VIKCNIRNIAVDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGA 204

Query: 211 HYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSL 253
           + GLISTYAL ++                 VL++FLDYY +FDW+NYCIS+NGPV ISSL
Sbjct: 205 NTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSL 264

Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNL 313
           PE+ A + EN G ELLL  +FL+ C E++S P  A+++ G +F IKHLNIVDPLK +NNL
Sbjct: 265 PELTAASPEN-GHELLLDEKFLRNCVELFSAPTKAVDSNGLDFPIKHLNIVDPLKYSNNL 323

Query: 314 GRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           G+SV++GN  RIR A + GA++L ++L+LPG  +G  LEKFF N+LERNGKG R DV  P
Sbjct: 324 GKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVNDP 383

Query: 374 IHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVL 433
           + AFG+G+SE ++LSGD++ Y+  L+YGQ  H Y+LP T Q       SQ+   S WD++
Sbjct: 384 VTAFGTGRSELSELSGDFEGYFGRLVYGQI-HGYSLPGTFQHGYIPVSSQVTDPSGWDIV 442

Query: 434 CQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTR 491
              V  + N  Y     V       C                    RGTGTYIP+ ++
Sbjct: 443 RHLVTYRKNEFYLKSLNVSTSLPNGCQ----------------NMRRGTGTYIPDMSQ 484


>gi|356570173|ref|XP_003553265.1| PREDICTED: uncharacterized protein LOC100798838 [Glycine max]
          Length = 626

 Score =  498 bits (1283), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 288/597 (48%), Positives = 369/597 (61%), Gaps = 71/597 (11%)

Query: 10  MEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKK 69
           M+D QE LL S          S P  L   SID ELW + EER QEIL TIQP ++SE  
Sbjct: 1   MDDKQENLLPS----------SLPSQL--LSIDEELWRMIEERAQEILWTIQPNVLSEVN 48

Query: 70  RKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILE 129
           RK +I+Y+QRLI  Y G +VFPFGS PLKTYLPDGDIDLT L H++ EEDL R VC IL+
Sbjct: 49  RKNIIDYVQRLIGEYCGAQVFPFGSFPLKTYLPDGDIDLTALSHEDEEEDLVRAVCNILK 108

Query: 130 NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHL 189
           +ED DS +++KD++++ AQV++VKC+V+NIPVDISFNQMAGL  L FLE+VD+L+GK+H+
Sbjct: 109 SED-DSEYQVKDIEHIRAQVQVVKCTVKNIPVDISFNQMAGLYTLFFLEQVDQLVGKNHI 167

Query: 190 FKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFLDYY 232
           FKRSVILIK+WCYYESRILGAH GL+STYA E++VL+                  FLDYY
Sbjct: 168 FKRSVILIKSWCYYESRILGAHCGLLSTYATEILVLYIINRFHSSVRGPLAVLYVFLDYY 227

Query: 233 NTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENT 292
           ++FDW++  ISI GP  +SSLPEIV +T E D  E LL  EFLK  R++ S    A E  
Sbjct: 228 SSFDWEHNYISIWGPKVLSSLPEIV-DTPEYDQGEFLLQKEFLKNYRDMCSSKAKASETM 286

Query: 293 GHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLE 352
            + F +KH+NI+DPL++NNNLGRSV+ GN  RIR A S G+QRL +ILTL G  +G  LE
Sbjct: 287 TNAFPVKHMNILDPLRNNNNLGRSVNIGNLSRIRLAFSLGSQRLKQILTLAGENMGAALE 346

Query: 353 KFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVT 412
           KFF NTLE NGKG R DV VP+  FG+G+ E + LSGDYDSYY    Y Q Y +Y +PVT
Sbjct: 347 KFFFNTLENNGKGERADVGVPVSPFGTGRFEESVLSGDYDSYYGWSQYVQQYPNYAMPVT 406

Query: 413 AQFSP-PSSPSQ-----IPSRSAW---DVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYA 463
              S  PS P Q     + ++  W   D      Q   ++ YQ GT V++P     HP  
Sbjct: 407 TVPSHFPSPPHQDNILALSTQLNWSEGDFPALSAQQNWSMFYQSGTNVYIPGQTLFHP-- 464

Query: 464 SQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRRKIDK 523
                 T+S  +G +SRGTGTYIP+                    L++     +  K ++
Sbjct: 465 ------TYSLGKGGRSRGTGTYIPD--------------------LSYNCYWDIHAKANR 498

Query: 524 AERLP-SRDNLESGCSLNVENGCSLNMENSCSLILSLEEFPLLP-VTKNLPPSATNQ 578
             R P ++ N     S   +     + E      LS E+FPL+P V K  PP+   +
Sbjct: 499 PRRFPYAKHNALLKSSPKSQQEKEFHTETK-PFELSDEDFPLIPNVYKATPPTQAQE 554


>gi|356507300|ref|XP_003522406.1| PREDICTED: uncharacterized protein LOC100813790 [Glycine max]
          Length = 692

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 268/569 (47%), Positives = 357/569 (62%), Gaps = 70/569 (12%)

Query: 39  FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLK 98
            SID ELW +AE+R+QEIL TIQP ++SE  RK+VI+Y+QRLI  YYG EV PFGSVPLK
Sbjct: 18  LSIDEELWQMAEDRVQEILWTIQPNVLSEVNRKDVIDYVQRLIRDYYGAEVLPFGSVPLK 77

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
           TYLPDGD+DLT L H++ E+DLA+ +C +L++ D DS +++KD+QY+ AQV++VKC+V+N
Sbjct: 78  TYLPDGDVDLTTLIHEDAEDDLAQAICNVLKSGD-DSEYQVKDIQYIRAQVRLVKCTVKN 136

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           I VDISFNQMAG+  L FLE+VD+L+GK+H+FKRS+ILIK WCYY+SR+LG H+GL+STY
Sbjct: 137 IAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKGWCYYDSRLLGGHHGLLSTY 196

Query: 219 ALEMMVLHR-----------------FLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
           A+E++VL+                  FLDYY +FDWD+  ISI GP ++SSLPEI AE  
Sbjct: 197 AVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYISIWGPKSLSSLPEI-AEAP 255

Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
           E D  E LL  EFL   + + S P  A E   HEF +K +NI+DPL+++NNLGRSVS  +
Sbjct: 256 ECDQGEFLLQKEFLGNYKNMCSYPAGASETLTHEFPVKFMNILDPLRNDNNLGRSVSIAS 315

Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGK 381
            HR+R A SYG Q+L +I TLPG  +G  LEKFF +TL RNGKG R DV VP+  FG+G+
Sbjct: 316 LHRLRFAFSYGVQKLKQIFTLPGENMGAALEKFFSSTLNRNGKGERADVSVPVAPFGTGR 375

Query: 382 SEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQI--PSRSAWDVLCQYVQG 439
            E   L GD ++Y  GL Y Q YH+Y +PV    + P+SP  I  P++  W         
Sbjct: 376 FEEPVLYGDCENYCGGLQYVQLYHNYAMPVIVDSNSPASPDDILAPTQQNW--------- 426

Query: 440 KGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINR 499
             +L YQ  T+V++P     HP        T+S + G KSRGTGTYIP+           
Sbjct: 427 --SLFYQGATDVYIPAQKLYHP--------TYSLEGGGKSRGTGTYIPD----------- 465

Query: 500 PVRARNPESLNHVPVSKLRRKIDKAERLPS-------RDNLESGCSLNVENGCSLNMENS 552
                    LN+     +R K  +  R  S       R  L+      V +   +   NS
Sbjct: 466 ---------LNYNSYWDMRTKASRPRRFASSKYNAFPRSPLKKQQVEEVHSEIDIINANS 516

Query: 553 CSLI--LSLEEFPLLPVTKN-LPPSATNQ 578
            S +   S E+FP+ P   N +PP+   +
Sbjct: 517 DSRLFEFSSEDFPVRPCNSNAIPPTQAQE 545


>gi|356518706|ref|XP_003528019.1| PREDICTED: uncharacterized protein LOC100788864 [Glycine max]
          Length = 721

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 265/559 (47%), Positives = 358/559 (64%), Gaps = 62/559 (11%)

Query: 39  FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLK 98
            SID ELW + EERIQEIL TIQP ++SE  RK V+NY+Q+LI  YY  +VFPFGS PLK
Sbjct: 18  LSIDEELWRMTEERIQEILWTIQPNVLSEMNRKNVLNYVQKLIGDYYDTKVFPFGSFPLK 77

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
           TYLPDGDIDLTV+ H++ EE+LA+ +C ILE  + D ++++KD++++ AQV++VKC+V+N
Sbjct: 78  TYLPDGDIDLTVINHEDEEENLAKEICTILECAN-DLIYQVKDIEHIRAQVQVVKCTVKN 136

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           IP+DI+FNQM GL  LCFLE+VD+L GK+H+FKRS+ILIKAWC Y+SR+LG+ +GL+STY
Sbjct: 137 IPIDITFNQMTGLCTLCFLEQVDQLAGKNHIFKRSIILIKAWCCYDSRLLGSQHGLLSTY 196

Query: 219 ALEMMVLHR-----------------FLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
           A E++VL+                  F DYY TFDW++  +SI GP A+SSLPEIV +  
Sbjct: 197 ATEVLVLYIINRFHASVRDPLEVLYIFFDYYGTFDWEHNYMSIWGPKALSSLPEIV-DRP 255

Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
           E D DE LL  EFL   R+I+S    + E T + F +KH+NI+DPL+++NNLGRSV++ +
Sbjct: 256 ECDQDEFLLHKEFLINYRDIFSSKAKSSETTTNTFPVKHINILDPLRNDNNLGRSVNEAS 315

Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGK 381
           FHRIR ALSYGA++  +I TL G  +G  LEKFF +TL+RNGKG R DV VP+  FG+G+
Sbjct: 316 FHRIRFALSYGAKKFKQIFTLAGENMGEALEKFFFDTLQRNGKGERADVDVPVSPFGTGR 375

Query: 382 SEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFS-PPSSPSQ-----IPSRSAW---DV 432
            E + L GD DSYY G  Y Q Y +Y +P T   S  PSSPSQ     + ++  W   D+
Sbjct: 376 YEKSVLDGDCDSYYGGSQYAQQYPNYAMPTTTIHSNSPSSPSQDDILALSTQQNWSEGDL 435

Query: 433 LCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRP 492
           L    Q   ++ YQ     ++P     HP        T++ +EG +SRGTGTYIP+    
Sbjct: 436 LALSTQQNWSMYYQSVYNGYIPGQTLFHP--------TYNFEEGGRSRGTGTYIPD---- 483

Query: 493 PPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPS--RDNLESGCSLNVEN---GCSL 547
                           LN+     +R K+++  R PS   + L   C    E        
Sbjct: 484 ----------------LNYNCYWDIRAKVNRPRRFPSAKHNALLKSCPKRKEAEEVHFET 527

Query: 548 NM-ENSCSLILSLEEFPLL 565
           NM +NS    LS E+FPL+
Sbjct: 528 NMNDNSKPFELSNEDFPLI 546


>gi|449449962|ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
          Length = 898

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 228/369 (61%), Positives = 269/369 (72%), Gaps = 19/369 (5%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P  I  + W  AEE  Q I+  +QP +VSE++RK VI+Y+QRLI G    EVFPFGSVPL
Sbjct: 36  PTPIGVDYWRRAEEATQAIISQVQPTVVSERRRKAVIDYVQRLIRGRLRCEVFPFGSVPL 95

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDLT LG  NVEE LA  VC +L +EDQ+    F +KDVQ + A+VK+VKC 
Sbjct: 96  KTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQNGAAEFVVKDVQLIRAEVKLVKCL 155

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           VQNI VDISFNQ+ GL  LCFLEK+DR IGKDHLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 156 VQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 215

Query: 216 STYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE                 + VL++FLDY++ FDWDNYCIS+NGPV ISSLPE+VA
Sbjct: 216 STYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGPVRISSLPELVA 275

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           ET +N G +LLLS +FL+ C E +SVP    E     F IKHLNIVDPLK+NNNLGRSVS
Sbjct: 276 ETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPLKENNNLGRSVS 335

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFG 378
           KGNF+RIR A SYGA++LG IL+ P   +   + KFF NTL+R+G G RPDVQ P    G
Sbjct: 336 KGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRHGGGQRPDVQDPAPVSG 395

Query: 379 SGKSEAADL 387
             +S AA L
Sbjct: 396 GYESCAALL 404



 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 86/173 (49%), Gaps = 15/173 (8%)

Query: 343 PGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQ 402
           P H + +  E F+ ++  R  + L  +V  P  AF +     +DL+GDY+S+ N L  G+
Sbjct: 600 PLHSVSLSSEDFYPSS--RGYRFLTSNVGPP-EAFNA----LSDLNGDYESHCNSLQIGR 652

Query: 403 CYHDYTLPVTA-QFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHP 461
            Y++Y L   A    PP  PSQ P+++ WD++ + VQ K N   Q  +   + R P  +P
Sbjct: 653 WYYEYALSAAALSPIPPPLPSQYPNKNPWDIIRRSVQVKQNAFAQINSNGLLAR-PAFYP 711

Query: 462 YASQV--RASTFSTDEGEKSRGTGTYIPE----RTRPPPIEINRPVRARNPES 508
             S +    +T + +E  K RGTGTY P     R RP        V  R+P +
Sbjct: 712 MPSPILPGGATLAMEEMPKPRGTGTYFPNMNHYRDRPASARGRNQVSVRSPRN 764


>gi|359486339|ref|XP_002274554.2| PREDICTED: uncharacterized protein LOC100253615 [Vitis vinifera]
          Length = 755

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 270/606 (44%), Positives = 361/606 (59%), Gaps = 60/606 (9%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
           N +  S+ AE W + +  IQEILC IQP IVSE++RKE+I+Y+QRLI   +G EV PFGS
Sbjct: 21  NPHLMSVTAECWSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEVLPFGS 80

Query: 95  VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ-DSVFEIKDVQYVPAQVKIVK 153
           +PLKTYLPDGDIDLT L  +N EED AR VC +LE E Q  S F ++D+ Y+ A+VKIVK
Sbjct: 81  MPLKTYLPDGDIDLTALCPENDEEDFARDVCTLLEGERQMGSEFRVEDISYIRAKVKIVK 140

Query: 154 CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYG 213
           C VQ+I VDISFNQ  GLS LCFLE++D LIGKDHLFKRSVILIKAWCYYE RILG+H G
Sbjct: 141 CMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCG 200

Query: 214 LISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
           L+STYALE++                 VL+RFLDYY+TFDW+ + +S+ GPV+ISSL   
Sbjct: 201 LLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSLLTG 260

Query: 257 VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
             ET +     LL++ EFL  C+E ++V I A E T   F +KH+NI DPL+D NNLGRS
Sbjct: 261 APETADK---PLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRS 317

Query: 317 VSKGNFHRIRCALSYGAQRLGEI-LTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIH 375
           +S GN +R R A+S GAQRL EI L LP   +  GL++FF NTL+RNG G   D +  + 
Sbjct: 318 ISLGNSYRFRYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGAD-EGDLV 376

Query: 376 AFGSGKSEAADLSGDYDSYYNGLLYGQCYHDY---------TLPVTAQ---FSPPSSP-- 421
            FG G S+  +L GDY    + L YGQ YH           +L V  Q   F+  + P  
Sbjct: 377 PFGPGSSKFCNLQGDYYGCLSNLHYGQFYHANLSSFSLHPGSLDVPFQWMYFNQSNVPWG 436

Query: 422 ------SQIPSRSAWDVLC---QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFS 472
                 S +PS++ +       Q+ + + N+        +VP LPFC+  ASQ+    + 
Sbjct: 437 YTNHFASMLPSQNTFQFYSPSFQFEEQESNM------NAYVPSLPFCYETASQLSGLAYH 490

Query: 473 TDEGEKSRGTGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDN 532
            +E +  +GTGT+IP   +    ++NR    ++PE   H P S + +     +R P   +
Sbjct: 491 FEE-KVPQGTGTFIPNVAQIFHNDLNRWEHGQDPE---HEPRSGMPQTRWAVKRNPV--S 544

Query: 533 LESGCSLNVENGCSLNMENSCSLILSL--EEFPLLPVTKNLPPSATNQQFDQSTDKSTQV 590
           + S   L  E     N E +  ++     +E  ++  +KN  PS   +Q  QS  +S   
Sbjct: 545 MASSVLLEPEQEGGKNSEPTLQVLSPKPEDEGGMVSKSKNQAPSLKPEQERQSNQESKAQ 604

Query: 591 DDVSSP 596
           D V  P
Sbjct: 605 DLVLKP 610


>gi|224145449|ref|XP_002325647.1| predicted protein [Populus trichocarpa]
 gi|222862522|gb|EEF00029.1| predicted protein [Populus trichocarpa]
          Length = 533

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 243/490 (49%), Positives = 312/490 (63%), Gaps = 33/490 (6%)

Query: 27  SSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG 86
           S S+SS P+  P+SI  E W  AEE  +EI+  I P + S  KRK++I Y+QRLI    G
Sbjct: 41  SPSFSSNPD--PWSIVEENWERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKSSLG 98

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQY 144
            EVFP+GSVPLKTYLPDGDIDLT +    +EE L   +  +L  E+  +DS FE+KDV  
Sbjct: 99  FEVFPYGSVPLKTYLPDGDIDLTSISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHC 158

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           + A+VK++KC VQN  VDISFNQ+ GL  LCFLE+VDRL+GK+HLFKRS+ILIKAWCYYE
Sbjct: 159 IDAEVKLIKCIVQNTVVDISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYE 218

Query: 205 SRILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGP 247
           SRILGAH+GLISTYALE +                 VL+RFL+Y++ FDW+NYCIS+NGP
Sbjct: 219 SRILGAHHGLISTYALETLILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGP 278

Query: 248 VAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
           V  SSLP IVAE +EN   ELLLS EFLK C + +SVP    E     F  KHLNIVDPL
Sbjct: 279 VCKSSLPNIVAEPLENGQGELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPL 338

Query: 308 KDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLR 367
           K+NNNLGRSV++GNF RIR A  YGA++LG+IL LP   +   L+ FF NTL+R+G    
Sbjct: 339 KENNNLGRSVNRGNFFRIRSAFKYGARKLGQILLLPKERIADELKIFFANTLDRHGSDYW 398

Query: 368 PDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQ-CY-HDYTLPVTAQFSPPSSPSQIP 425
            +V     A G+  S+ +   GD++ +   L Y Q C+ H  + P+      P  PS +P
Sbjct: 399 TEVGNSELASGARSSDNSVSRGDHNGHLQSLAYSQYCHMHAVSAPI------PPCPSMLP 452

Query: 426 ---SRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGT 482
              +++ W+ + Q +Q K N   Q  T        +C       RA+T  ++E +  RGT
Sbjct: 453 LSENKNRWETVQQSLQLKQNGHSQMNTNHIFGTQLYCVNPGGPFRAAT-DSEEKKIRRGT 511

Query: 483 GTYIPERTRP 492
           GTYIP    P
Sbjct: 512 GTYIPNMVFP 521


>gi|297735556|emb|CBI18050.3| unnamed protein product [Vitis vinifera]
          Length = 824

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 216/359 (60%), Positives = 262/359 (72%), Gaps = 20/359 (5%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P +I A  W  AE  +QEI+C +QP  VSE++RKEV++Y+Q LI    G EVFPFGSVPL
Sbjct: 31  PPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPL 90

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDLT  G   VE+ LA  V  +LE EDQ+    F +KDVQ + A+VK+VKC 
Sbjct: 91  KTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCL 150

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           VQNI VDISFNQ+ GL  LCFLE++DRLIGKDHLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210

Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE +VL+                 +FLDY++ FDWDNYC+S+NGPV ISSLPE++A
Sbjct: 211 STYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIA 270

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           ET EN G + LL+ + L+ C + +SVP   LE     F  KH NIVDPLK+NNNLGRSVS
Sbjct: 271 ETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVS 330

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ-VPIHA 376
           KGNF+RIR A +YGA++LG IL  P   +   L KFF NTLER+G+G RPDV  +P+ A
Sbjct: 331 KGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDVDLIPLDA 389



 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 57/113 (50%), Gaps = 4/113 (3%)

Query: 379 SGKSEA----ADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
           SG  EA    ADLSGDYDS++N L YG   +DY     A   P + PSQ  S ++WD + 
Sbjct: 554 SGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ 613

Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
           Q    + N+  Q      +PR PF       +  + F  +E  K RGTGTY P
Sbjct: 614 QSAHIRRNIFPQITANGIIPRPPFYPLNPPMISGTGFGVEEMPKPRGTGTYFP 666


>gi|359481238|ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
          Length = 884

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 214/354 (60%), Positives = 259/354 (73%), Gaps = 19/354 (5%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P +I A  W  AE  +QEI+C +QP  VSE++RKEV++Y+Q LI    G EVFPFGSVPL
Sbjct: 31  PPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPL 90

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDLT  G   VE+ LA  V  +LE EDQ+    F +KDVQ + A+VK+VKC 
Sbjct: 91  KTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCL 150

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           VQNI VDISFNQ+ GL  LCFLE++DRLIGKDHLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210

Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE +VL+                 +FLDY++ FDWDNYC+S+NGPV ISSLPE++A
Sbjct: 211 STYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIA 270

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           ET EN G + LL+ + L+ C + +SVP   LE     F  KH NIVDPLK+NNNLGRSVS
Sbjct: 271 ETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVS 330

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
           KGNF+RIR A +YGA++LG IL  P   +   L KFF NTLER+G+G RPDV +
Sbjct: 331 KGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDVDL 384



 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 57/113 (50%), Gaps = 4/113 (3%)

Query: 379 SGKSEA----ADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
           SG  EA    ADLSGDYDS++N L YG   +DY     A   P + PSQ  S ++WD + 
Sbjct: 614 SGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ 673

Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
           Q    + N+  Q      +PR PF       +  + F  +E  K RGTGTY P
Sbjct: 674 QSAHIRRNIFPQITANGIIPRPPFYPLNPPMISGTGFGVEEMPKPRGTGTYFP 726


>gi|297823987|ref|XP_002879876.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325715|gb|EFH56135.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 516

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 225/468 (48%), Positives = 303/468 (64%), Gaps = 50/468 (10%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           I  ELWL+AEER QEIL  IQP  +SE+ R E+IN++Q L+    GIEVF FGSVPLKTY
Sbjct: 28  IGEELWLIAEERAQEILFAIQPMYLSERSRNEIINHLQTLMRERLGIEVFLFGSVPLKTY 87

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
           LPDGDIDLTVL    +EE+ A+ +  ILE E  +S F++ DVQY+ AQVK++KC+++N+ 
Sbjct: 88  LPDGDIDLTVLTPYGMEENCAKALRNILEAERGESDFQVTDVQYIHAQVKVIKCTIRNVA 147

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           +DISFNQMAGLSALCFLE+VDR  G+DHLFKRS+ILIKAWC+YESRILGA+ GLISTYAL
Sbjct: 148 LDISFNQMAGLSALCFLEQVDRAFGRDHLFKRSIILIKAWCFYESRILGANNGLISTYAL 207

Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
            ++                 VL++F+D+Y +FDW+NYCI++ G V ISS P+I     E 
Sbjct: 208 AILVLNIVNMSYSSVSGPLAVLYKFMDFYGSFDWENYCITVTGLVPISSFPDI----TET 263

Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
              E+ L  +F ++C E YS P   +E     F +KH NI+DPLK +NNLGRSVS+GN  
Sbjct: 264 RNHEVFLDEKFFRECIESYSGPANVVEANRKYFPVKHYNILDPLKHSNNLGRSVSEGNAI 323

Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
           R+R     GAQ+L ++LT PG  +G  LE FF N+L+RNGKG R DV+ P+  FG+GK++
Sbjct: 324 RLRHCFRRGAQKLRDVLTFPGETVGWKLEDFFGNSLDRNGKGQRQDVEEPVIPFGTGKAD 383

Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSR---SAWDVLCQYVQGK 440
            ++L GD+  Y+   +YG+ +H  +L            + IP     S+W+ +  ++  +
Sbjct: 384 YSNLRGDFYGYFLCFIYGKWFHRESL-----------HNWIPQGLDISSWETVRWFMTRR 432

Query: 441 GNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE 488
            N        VF  R            +++ S     KS+GTGTYIPE
Sbjct: 433 RN--------VFNGR-------NLNGGSTSTSMPNKRKSKGTGTYIPE 465


>gi|30688308|ref|NP_850331.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|145330711|ref|NP_001078031.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|186506897|ref|NP_001118485.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|186506900|ref|NP_001118486.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|60547743|gb|AAX23835.1| hypothetical protein At2g40520 [Arabidopsis thaliana]
 gi|330254746|gb|AEC09840.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|330254747|gb|AEC09841.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|330254748|gb|AEC09842.1| nucleotidyltransferase protein [Arabidopsis thaliana]
 gi|330254749|gb|AEC09843.1| nucleotidyltransferase protein [Arabidopsis thaliana]
          Length = 502

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 219/468 (46%), Positives = 302/468 (64%), Gaps = 41/468 (8%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           I+AE+WL+AE R QEILC IQP  ++E+ R ++I+ +Q L+    GIEV+ FGS+PLKTY
Sbjct: 28  IEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKTY 87

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
           LPDGDIDLTVL H   EED AR VC +LE E  +S  ++  VQYV A+VK++KCS++++ 
Sbjct: 88  LPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGNSDLQVTGVQYVQAKVKVIKCSIRDVA 147

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            DISFNQ+AGL ALCFLE+VD+  G+DHLFK+S+IL+KAWC+YESRILGA+ GLISTYAL
Sbjct: 148 FDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLISTYAL 207

Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
            ++                 VL++F++YY +FDW NYC+++ GPV ISSLP+I     E 
Sbjct: 208 AILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDI----TET 263

Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
              E+ L  +F ++C E+YS     +E +   F +K+ NI+DPLK +NNLGRSV+KGN  
Sbjct: 264 GNHEVFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTKGNMV 323

Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
           R+R     G Q+L ++LTLPG  +G  LEKFF  +LERNGKG R DV+ P+ AFG+G ++
Sbjct: 324 RLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPVVAFGTGAAD 383

Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNL 443
            + L GD+D Y N L+YG+ +H  +L     + PPS  +     ++W  +  Y+    N 
Sbjct: 384 YSQLKGDFDRYRNNLIYGKWFHGESL---HNWLPPSLDT-----TSWANISFYMSRVRN- 434

Query: 444 VYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTR 491
               G        P   P   Q       +    +S GTGTYIP  ++
Sbjct: 435 ----GFNGRNQEGPTSMPNIKQ-------SPNNRRSNGTGTYIPGMSK 471


>gi|21805733|gb|AAM76764.1| hypothetical protein [Arabidopsis thaliana]
          Length = 502

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 218/468 (46%), Positives = 302/468 (64%), Gaps = 41/468 (8%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           I+AE+WL+AE R QEILC +QP  ++E+ R ++I+ +Q L+    GIEV+ FGS+PLKTY
Sbjct: 28  IEAEVWLIAEARAQEILCAVQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKTY 87

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
           LPDGDIDLTVL H   EED AR VC +LE E  +S  ++  VQYV A+VK++KCS++++ 
Sbjct: 88  LPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGNSDLQVTGVQYVQAKVKVIKCSIRDVA 147

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            DISFNQ+AGL ALCFLE+VD+  G+DHLFK+S+IL+KAWC+YESRILGA+ GLISTYAL
Sbjct: 148 FDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLISTYAL 207

Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
            ++                 VL++F++YY +FDW NYC+++ GPV ISSLP+I     E 
Sbjct: 208 AILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDI----TET 263

Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
              E+ L  +F ++C E+YS     +E +   F +K+ NI+DPLK +NNLGRSV+KGN  
Sbjct: 264 GNHEVFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTKGNMV 323

Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
           R+R     G Q+L ++LTLPG  +G  LEKFF  +LERNGKG R DV+ P+ AFG+G ++
Sbjct: 324 RLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPVVAFGTGVAD 383

Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNL 443
            + L GD+D Y N L+YG+ +H  +L     + PPS  +     ++W  +  Y+    N 
Sbjct: 384 YSQLKGDFDRYRNNLIYGKWFHGESL---HNWLPPSLDT-----TSWANISFYMSRVRN- 434

Query: 444 VYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTR 491
               G        P   P   Q       +    +S GTGTYIP  ++
Sbjct: 435 ----GFNGRNQEGPTSMPNIKQ-------SPNNRRSNGTGTYIPGMSK 471


>gi|224124740|ref|XP_002319410.1| predicted protein [Populus trichocarpa]
 gi|222857786|gb|EEE95333.1| predicted protein [Populus trichocarpa]
          Length = 681

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 234/462 (50%), Positives = 295/462 (63%), Gaps = 23/462 (4%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGD 105
           W  AEE   EI+  I P + S  KRK+VI+Y+QRLI    G EVFP+GSVPLKTYLPDGD
Sbjct: 58  WERAEEVATEIVYRIHPTVESSFKRKQVIDYVQRLIRYSLGFEVFPYGSVPLKTYLPDGD 117

Query: 106 IDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           IDLT +    +EE L   V  +L  E+  +D+++E+KDV  + A+VK++KC VQN  VDI
Sbjct: 118 IDLTAISSPAIEEALVSDVYTVLRGEELNEDALYEVKDVHCIDAEVKLIKCIVQNTVVDI 177

Query: 164 SFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM 223
           SFNQ+ GL  LCFLE+VDRL+GK+HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +
Sbjct: 178 SFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 237

Query: 224 -----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGD 266
                            VL++FLDY++ FDW+NYCIS+NGPV  SSLP IVA+  EN   
Sbjct: 238 ILYIFHLFHSSLNGPLAVLYKFLDYFSKFDWENYCISLNGPVCKSSLPNIVAKPPENVSG 297

Query: 267 ELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIR 326
           ELLLS EFLK C + + VP    E     F  KHLNIVDPLK+NNNLGRSV++GNF RIR
Sbjct: 298 ELLLSDEFLKDCVDRFYVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRIR 357

Query: 327 CALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAAD 386
            A  YG ++LG IL LP   +   L+ FF NTL+R+G     DVQ    A  +  S+ + 
Sbjct: 358 SAFKYGGRKLGRILLLPREKIADELKTFFANTLDRHGSDYWSDVQNSELASVARSSDNSV 417

Query: 387 LSGDYDSYYNGLLYGQCYHDYTLPVTAQFSP-PSSPSQIPSRSAWDVLCQYVQGKGNLVY 445
             GD+D +   L Y Q  H Y   V+A   P PS   Q  + + W+ + Q +Q K N   
Sbjct: 418 SLGDHDDHLQSLAYSQYCHMYA--VSAPIPPCPSMSPQSENNNRWETVRQSLQLKRNGHS 475

Query: 446 QWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
           Q  T        +C    +  RA+T S ++ E+ RGTGTYIP
Sbjct: 476 QMNTNHVYGTQFYCVNPVAPFRAATNSEEKKER-RGTGTYIP 516


>gi|297736507|emb|CBI25378.3| unnamed protein product [Vitis vinifera]
          Length = 893

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 270/657 (41%), Positives = 360/657 (54%), Gaps = 108/657 (16%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE------ 88
           N +  S+ AE W + +  IQEILC IQP IVSE++RKE+I+Y+QRLI   +G E      
Sbjct: 105 NPHLMSVTAECWSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEREIEES 164

Query: 89  ---------------------------------------------VFPFGSVPLKTYLPD 103
                                                        V PFGS+PLKTYLPD
Sbjct: 165 SRASTFQSGDERPLFKQAMLERRNSRSIEVLLQVYFEEPKNIKLYVLPFGSMPLKTYLPD 224

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQ-DSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           GDIDLT L  +N EED AR VC +LE E Q  S F ++D+ Y+ A+VKIVKC VQ+I VD
Sbjct: 225 GDIDLTALCPENDEEDFARDVCTLLEGERQMGSEFRVEDISYIRAKVKIVKCMVQDISVD 284

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           ISFNQ  GLS LCFLE++D LIGKDHLFKRSVILIKAWCYYE RILG+H GL+STYALE+
Sbjct: 285 ISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCGLLSTYALEI 344

Query: 223 M-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG 265
           +                 VL+RFLDYY+TFDW+ + +S+ GPV+ISSL     E  E   
Sbjct: 345 LVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSLLTGAPEAAETAD 404

Query: 266 DELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRI 325
             LL++ EFL  C+E ++V I A E T   F +KH+NI DPL+D NNLGRS+S GN +R 
Sbjct: 405 KPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRF 464

Query: 326 RCALSYGAQRLGEI-LTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEA 384
           R A+S GAQRL EI L LP   +  GL++FF NTL+RNG G   D +  +  FG G S+ 
Sbjct: 465 RYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGAD-EGDLVPFGPGSSKF 523

Query: 385 ADLSGDYDSYYNGLLYGQCYHDY---------TLPVTAQ---FSPPSSP--------SQI 424
            +L GDY    + L YGQ YH           +L V  Q   F+  + P        S +
Sbjct: 524 CNLQGDYYGCLSNLHYGQFYHANLSSFSLHPGSLDVPFQWMYFNQSNVPWGYTNHFASML 583

Query: 425 PSRSAWDVLC---QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRG 481
           PS++ +       Q+ + + N+        +VP LPFC+  ASQ+    +  +E +  +G
Sbjct: 584 PSQNTFQFYSPSFQFEEQESNM------NAYVPSLPFCYETASQLSGLAYHFEE-KVPQG 636

Query: 482 TGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNV 541
           TGT+IP   +    ++NR    ++PE   H P S + +     +R P   ++ S   L  
Sbjct: 637 TGTFIPNVAQIFHNDLNRWEHGQDPE---HEPRSGMPQTRWAVKRNPV--SMASSVLLEP 691

Query: 542 ENGCSLNMENSCSLILSL--EEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSP 596
           E     N E +  ++     +E  ++  +KN  PS   +Q  QS  +S   D V  P
Sbjct: 692 EQEGGKNSEPTLQVLSPKPEDEGGMVSKSKNQAPSLKPEQERQSNQESKAQDLVLKP 748


>gi|359478494|ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
          Length = 854

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 202/350 (57%), Positives = 259/350 (74%), Gaps = 19/350 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           I  + W  AE   QEI+  +QP + S ++R+EVI+Y+QRLI    G EVFP+GSVPLKTY
Sbjct: 35  IAGDSWAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTY 94

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQN 158
           L DGDIDLT L   NVEE LA  V  +L+ E+Q+  + FE+KD+Q++ A+VK+VKC V++
Sbjct: 95  LLDGDIDLTALCSSNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKD 154

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           I +DISFNQ+ GLS LCFLE+VDRLIGKDHLFKRS+ILIK+WCYYESRILGAH+GLISTY
Sbjct: 155 IVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKSWCYYESRILGAHHGLISTY 214

Query: 219 ALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
           ALE++VL+                 RFLDY++ FDWDNYCIS+NGPV  SSLP+IVAE  
Sbjct: 215 ALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELP 274

Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
           EN  D+LLLS EFL+ C +++SVP   LE     F +KHLNI+DPL++NNNLGRSV+KGN
Sbjct: 275 ENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGN 334

Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
           F+RIR A  YG+ +LG+IL+LP   +   L+ FF +TLER+      ++Q
Sbjct: 335 FYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERHRSKYMAEIQ 384



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 71/137 (51%), Gaps = 9/137 (6%)

Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVY 445
           DLSGDYDS+   L YGQC + + LP     SPP SPSQ+   + WD + Q++Q   NL  
Sbjct: 597 DLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPLSPSQLQINTPWDKVRQHLQFTQNLHS 656

Query: 446 QWGTE--VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRA 503
           Q  +   +     P  HP  S    + F  ++ +K RGTGTY P  +  P    +RPV  
Sbjct: 657 QMDSNGVILGNHFPVKHPARS---ITAFGLEDKQKPRGTGTYFPNMSHLP--NRDRPVGQ 711

Query: 504 RNPESLNHVPVSKLRRK 520
           R  ++L     S+L R+
Sbjct: 712 RRNQALE--SHSQLHRR 726


>gi|297816424|ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321933|gb|EFH52354.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 829

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 204/346 (58%), Positives = 247/346 (71%), Gaps = 19/346 (5%)

Query: 44  ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
           E W+  EE  +EI+  + P +VSE +R++VI Y+Q+LI    G EV  FGSVPLKTYLPD
Sbjct: 32  EFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLPD 91

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
           GDIDLT  G    EE+LA  V  +LE E+ +  S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92  GDIDLTAFGGLYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           DISFNQ+ G+  LCFLEK+D LIGKDHLFKRS+ILIKAWCYYESRILGA +GLISTYALE
Sbjct: 152 DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211

Query: 222 MMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
            +VL+                 +FLDY++ FDWDNYCIS+NGPV +SSLPEIV ET EN 
Sbjct: 212 TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPENG 271

Query: 265 GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHR 324
           G++ LL+ EFLK+C E+YSVP    E     F  KHLNIVDPLK+ NNLGRSVSKGNF+R
Sbjct: 272 GEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331

Query: 325 IRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           IR A +YGA++LG+I       +   L KFF N L R+G G RPDV
Sbjct: 332 IRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRHGSGQRPDV 377



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 64/130 (49%), Gaps = 8/130 (6%)

Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
           +DLSGDY+S  N L +G+ + DY    P++   SPP  P Q+P+ ++W+V+   +  + N
Sbjct: 578 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVIRHALPFRRN 635

Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
                     VPR  F H     +    F+ +E  K RGTGTY P     R RP      
Sbjct: 636 APTPVNANGVVPRQVFFHVNPQMIPGPGFAIEELPKPRGTGTYFPNANHYRDRPFSPRGR 695

Query: 499 RPVRARNPES 508
              +AR+P +
Sbjct: 696 SSHQARSPRN 705


>gi|79597803|ref|NP_850678.2| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
 gi|332645293|gb|AEE78814.1| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
          Length = 829

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 204/350 (58%), Positives = 250/350 (71%), Gaps = 19/350 (5%)

Query: 44  ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
           ELW+  EE  +EI+  + P +VSE +R++VI Y+Q+LI    G EV  FGSVPLKTYLPD
Sbjct: 32  ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
           GDIDLT  G    EE+LA  V  +LE E+ +  S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92  GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           DISFNQ+ G+  LCFLEK+D LIGKDHLFKRS+ILIKAWCYYESRILGA +GLISTYALE
Sbjct: 152 DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211

Query: 222 MMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
            +VL+                 +FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN 
Sbjct: 212 TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENG 271

Query: 265 GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHR 324
           G++LLL+ EFLK+C E+YSVP    E     F  KHLNIVDPLK+ NNLGRSVSKGNF+R
Sbjct: 272 GEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331

Query: 325 IRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
           IR A +YGA++LG++       +   L KFF N L R+G G RPDV   I
Sbjct: 332 IRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 381



 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)

Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
           +DLSGDY+S  N L +G+ + DY    P++   SPP  P Q+P+ ++W+V+   +  + N
Sbjct: 578 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 635

Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
                     VPR  F H     +    F  +E  K RGTGTY P     R RP      
Sbjct: 636 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 695

Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
              +AR+P + N   +S+   +++     P R+  E       +   S +M ++     S
Sbjct: 696 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 746

Query: 559 LEEFPLLPVTKNLP 572
           L+ FP    + N P
Sbjct: 747 LDSFPDTNGSTNHP 760


>gi|449526634|ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
          Length = 816

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 204/320 (63%), Positives = 239/320 (74%), Gaps = 19/320 (5%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQY 144
           ++VFPFGSVPLKTYLPDGDIDLT LG  NVEE LA  VC +L +EDQ+    F +KDVQ 
Sbjct: 3   VQVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQNGAAEFVVKDVQL 62

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           + A+VK+VKC VQNI VDISFNQ+ GL  LCFLEK+DR IGKDHLFKRS+ILIKAWCYYE
Sbjct: 63  IRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYE 122

Query: 205 SRILGAHYGLISTYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGP 247
           SRILGAH+GLISTYALE                 + VL++FLDY++ FDWDNYCIS+NGP
Sbjct: 123 SRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGP 182

Query: 248 VAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
           V ISSLPE+VAET +N G +LLLS +FL+ C E +SVP    E     F IKHLNIVDPL
Sbjct: 183 VRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPL 242

Query: 308 KDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLR 367
           K+NNNLGRSVSKGNF+RIR A SYGA++LG IL+ P   +   + KFF NTL+R+G G R
Sbjct: 243 KENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRHGGGQR 302

Query: 368 PDVQVPIHAFGSGKSEAADL 387
           PDVQ P    G  +S AA L
Sbjct: 303 PDVQDPAPVSGGYESCAALL 322



 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 86/173 (49%), Gaps = 15/173 (8%)

Query: 343 PGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQ 402
           P H + +  E F+ ++  R  + L  +V  P  AF +     +DL+GDY+S+ N L  G+
Sbjct: 518 PLHSVSLSSEDFYPSS--RGYRFLTSNVGPP-EAFNA----LSDLNGDYESHCNSLQIGR 570

Query: 403 CYHDYTLPVTA-QFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHP 461
            Y++Y L   A    PP  PSQ P+++ WD++ + VQ K N   Q  +   + R P  +P
Sbjct: 571 WYYEYALSAAALSPIPPPLPSQYPNKNPWDIIRRSVQVKQNAFAQINSNGLLAR-PAFYP 629

Query: 462 YASQV--RASTFSTDEGEKSRGTGTYIPE----RTRPPPIEINRPVRARNPES 508
             S +    +T + +E  K RGTGTY P     R RP        V  R+P +
Sbjct: 630 MPSPILPGGATLAMEEMPKPRGTGTYFPNMNHYRDRPASARGRNQVSVRSPRN 682


>gi|449443945|ref|XP_004139736.1| PREDICTED: uncharacterized protein LOC101209112 [Cucumis sativus]
          Length = 1341

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 190/350 (54%), Positives = 247/350 (70%), Gaps = 20/350 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D+E W  AEER  E++  IQP   SE++R  V +Y+QRLI   +  +VF FGSVPLKTY
Sbjct: 31  LDSERWSKAEERTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVFTFGSVPLKTY 90

Query: 101 LPDGDIDLTVLG-HQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDIDLT    +QN++E  A  V  +LE+E  ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 91  LPDGDIDLTAFSKNQNLKETWAHQVRDMLESEEKNENAEFRVKEVQYIKAEVKIIKCLVE 150

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISF+Q+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 151 NIVVDISFDQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHHGLIST 210

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDWDN+C+S+ GPV ISSLP++ AE 
Sbjct: 211 YALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPDVTAEP 270

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              DG ELLLS  FL+ C  +Y+V     EN G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 271 PRKDGGELLLSKLFLEACSAVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 330

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           NF RIR A ++GA+RL  +   P   +   L +FF+NT ER+G G RPDV
Sbjct: 331 NFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERHGSGQRPDV 380


>gi|356500940|ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814626 [Glycine max]
          Length = 780

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 189/333 (56%), Positives = 243/333 (72%), Gaps = 20/333 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLT 109
           E    EIL  I+P + ++++R+EV++Y+QRLI      EVFP+GSVPLKTYLPDGDIDLT
Sbjct: 45  ERNTAEILRRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104

Query: 110 VLGHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
            L  +N+E+ L   V  +L  E  ++ + +E+KDV+++ A+VK+VKC VQ+I VDISFNQ
Sbjct: 105 ALSCENIEDGLVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQ 164

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
           + GLS LCFLEKVDRL+ KDHLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE +    
Sbjct: 165 LGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYI 224

Query: 224 -------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLL 270
                        VL+RFLDY++ FDWDNYC+S+ GPV+ +SLP IVAE  EN G+  LL
Sbjct: 225 FHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVPENGGN-TLL 283

Query: 271 SPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALS 330
           + EF++ C E +SVP    +     F  KHLNI+DPLK+NNNLGRSV+KGNF+RIR A  
Sbjct: 284 TEEFIRSCVESFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFK 343

Query: 331 YGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
           YGA++LG IL LP   +   L +FF NTLER+G
Sbjct: 344 YGARKLGWILRLPEDRIAEELIRFFANTLERHG 376



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 62/130 (47%), Gaps = 26/130 (20%)

Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYV-------- 437
           DL+GDYDS+   L YG   + Y + +    SPP SP + P+R+ W+ + Q V        
Sbjct: 515 DLTGDYDSHIGNLQYGHMCNGYPVSLVVP-SPPRSP-KFPNRNPWETVHQCVPINHSIRS 572

Query: 438 QGKGNLVYQWGTEVFV---PRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPP 494
           Q   N V   G +V+V   P LP           ++F ++E  K RGTG Y P  T   P
Sbjct: 573 QANSNCV--MGQQVYVINHPTLPM----------TSFGSEEKRKVRGTGAYFPNMT-SRP 619

Query: 495 IEINRPVRAR 504
              NRP+  R
Sbjct: 620 YRDNRPIPGR 629


>gi|356553166|ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816328 [Glycine max]
          Length = 779

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 190/333 (57%), Positives = 242/333 (72%), Gaps = 20/333 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLT 109
           E+   EIL  I+P + ++++R+EV++Y+QRLI      EVFP+GSVPLKTYLPDGDIDLT
Sbjct: 45  EKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104

Query: 110 VLGHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
            L  QN+E+ L   V  +L  E  ++ S +E+KDV+++ A+VK+VKC VQ+I VDISFNQ
Sbjct: 105 ALSCQNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQ 164

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
           + GLS LCFLEKVDRL+ KDHLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE +    
Sbjct: 165 LGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYI 224

Query: 224 -------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLL 270
                        VL+RFLDY++ FDWDNYC+S+ GPV  SS P IVAE  EN G+  LL
Sbjct: 225 FHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVGKSSPPNIVAEVPENGGN-TLL 283

Query: 271 SPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALS 330
           + EF++ C E +S+P    +     F  KHLNI+DPLK+NNNLGRSV+KGNF+RIR A  
Sbjct: 284 TEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFK 343

Query: 331 YGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
           YGA++LG IL LP   +   L +FF NTLER+G
Sbjct: 344 YGARKLGWILMLPEDRITEELIRFFTNTLERHG 376



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 22/128 (17%)

Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQ------G 439
           DL+GDYDS+   L YG   + Y +             + P+R+ W+ + Q VQ       
Sbjct: 515 DLTGDYDSHIGNLQYGHMCNGYPVSPVVPSP--PRSPKFPNRNPWETVRQCVQINHSIRS 572

Query: 440 KGNLVYQWGTEVFV---PRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIE 496
           + N     G +V+V   P LP           ++F ++E  K RGTG Y P  T   P  
Sbjct: 573 QANSNSVMGQQVYVINHPSLPM----------TSFGSEEKRKVRGTGAYFPNMT-SRPYR 621

Query: 497 INRPVRAR 504
            NRP+  R
Sbjct: 622 DNRPMPGR 629


>gi|293332253|ref|NP_001168029.1| uncharacterized protein LOC100381756 [Zea mays]
 gi|223945595|gb|ACN26881.1| unknown [Zea mays]
 gi|413924674|gb|AFW64606.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
          Length = 833

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 201/362 (55%), Positives = 250/362 (69%), Gaps = 21/362 (5%)

Query: 31  SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
           S P +  P S+  + W   E     ++  IQP   SE  R  V++Y+QRL       +VF
Sbjct: 13  SVPAHPDPASVSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVF 72

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
           PFGSVPLKTYLPDGDIDLT+ G    +E+LA  VC IL++E+  +DS FE+KDVQYVPA+
Sbjct: 73  PFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAE 132

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VK+VKC VQNI VDIS NQ+ GL  LCFLEKVD+  GKDHLFK+S+ILIK WCYYESRIL
Sbjct: 133 VKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRIL 192

Query: 209 GAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAIS 251
           GAH+GLISTYALE +                 VL+RFLDYY+ FDWDN  IS+ GPV++S
Sbjct: 193 GAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLS 252

Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
           SLPE+V +  +   D+ L   EFLK+C E +SV     E     F+ + LNIVDPLK +N
Sbjct: 253 SLPELVTDPPDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSN 312

Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMG-LEKFFINTLERNGKGLRPDV 370
           NLGRSVSKGNF+RIR A  +GA++LG+IL +P  CL +G + +FF NTL+RN  GLRPDV
Sbjct: 313 NLGRSVSKGNFYRIRSAFDFGARKLGKILQVPS-CLTVGEVNQFFRNTLKRNRTGLRPDV 371

Query: 371 QV 372
            V
Sbjct: 372 LV 373


>gi|413924673|gb|AFW64605.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
          Length = 815

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 201/362 (55%), Positives = 250/362 (69%), Gaps = 21/362 (5%)

Query: 31  SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
           S P +  P S+  + W   E     ++  IQP   SE  R  V++Y+QRL       +VF
Sbjct: 13  SVPAHPDPASVSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVF 72

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
           PFGSVPLKTYLPDGDIDLT+ G    +E+LA  VC IL++E+  +DS FE+KDVQYVPA+
Sbjct: 73  PFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAE 132

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VK+VKC VQNI VDIS NQ+ GL  LCFLEKVD+  GKDHLFK+S+ILIK WCYYESRIL
Sbjct: 133 VKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRIL 192

Query: 209 GAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAIS 251
           GAH+GLISTYALE +                 VL+RFLDYY+ FDWDN  IS+ GPV++S
Sbjct: 193 GAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLS 252

Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
           SLPE+V +  +   D+ L   EFLK+C E +SV     E     F+ + LNIVDPLK +N
Sbjct: 253 SLPELVTDPPDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSN 312

Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMG-LEKFFINTLERNGKGLRPDV 370
           NLGRSVSKGNF+RIR A  +GA++LG+IL +P  CL +G + +FF NTL+RN  GLRPDV
Sbjct: 313 NLGRSVSKGNFYRIRSAFDFGARKLGKILQVPS-CLTVGEVNQFFRNTLKRNRTGLRPDV 371

Query: 371 QV 372
            V
Sbjct: 372 LV 373


>gi|357153090|ref|XP_003576335.1| PREDICTED: uncharacterized protein LOC100826374, partial
           [Brachypodium distachyon]
          Length = 769

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 205/391 (52%), Positives = 260/391 (66%), Gaps = 31/391 (7%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P SI AE W   EE    ++  IQP+  SE +R  V++Y+QRL+    G EVFPFGSVPL
Sbjct: 3   PASISAERWRAFEEAALGVVGRIQPSAPSEGRRAAVVHYVQRLVRHAVGCEVFPFGSVPL 62

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDLT  G  + +E+LA  V  +LE+E+  +D+ FE+KDVQY+ A+VK+VKC 
Sbjct: 63  KTYLPDGDIDLTAFGSISSDENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCL 122

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           VQNI VDISFNQ+ GL  LCFLE+VD+  GK+HLFK+S++LIKAWCYYESRILGAH+GLI
Sbjct: 123 VQNIVVDISFNQIGGLCTLCFLEQVDQRFGKEHLFKKSIMLIKAWCYYESRILGAHHGLI 182

Query: 216 STYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE++                 VL+RFLDYY+ FDWDN  IS+ GPV +SSLPE+V+
Sbjct: 183 STYALEILVLCIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVLLSSLPELVS 242

Query: 259 ET-VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
           +  V +DGD  L   EFL++C + ++VP    E     F+ K LNIVDPLK NNNLGRSV
Sbjct: 243 DAPVTHDGD-FLKREEFLRECAQTFTVPPRNSEKNTRLFSRKFLNIVDPLKQNNNLGRSV 301

Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAF 377
           SKGNF RIR A   GA++LG+IL          + +FF NTL+RN   +RPDVQ      
Sbjct: 302 SKGNFFRIRSAFDLGARKLGKILKEASSSAVPEVNQFFRNTLKRNRTMVRPDVQ------ 355

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYHDYT 408
                 A D S + D+  +  LY   Y D +
Sbjct: 356 ----DIALDFSVERDNEDHTPLYSNSYGDLS 382


>gi|255564741|ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis]
 gi|223537453|gb|EEF39081.1| hypothetical protein RCOM_0719270 [Ricinus communis]
          Length = 1334

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/350 (53%), Positives = 245/350 (70%), Gaps = 20/350 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D+E W  AEER  E++  I+P   SE++R  V +Y++RLI   +   VF FGSVPLKTY
Sbjct: 24  LDSERWAKAEERTAELIDCIKPNEPSERRRNAVADYVERLITKCFPCRVFTFGSVPLKTY 83

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDIDLT     Q+++E  A  V  +LENE+++    F +K+VQY+ A+VKI+KC V+
Sbjct: 84  LPDGDIDLTAFSEGQSMKETWAHQVRDVLENEEKNENAEFRVKEVQYIQAEVKIIKCLVE 143

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISF+Q+ GL  LCFLE+VD LI +DHLFK+S+ILIKAWCYYESRILGAH+GLIST
Sbjct: 144 NIVVDISFDQLGGLCTLCFLEEVDHLINQDHLFKKSIILIKAWCYYESRILGAHHGLIST 203

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDWDN+C+S+ GPV ISSLP++ AE 
Sbjct: 204 YALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPDVTAEP 263

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              DG ELLLS  FLK C  +Y+V     E+ G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 264 PRKDGGELLLSKLFLKACGAVYAVSPGGPESQGQTFTSKHFNVIDPLRVNNNLGRSVSKG 323

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           NF RIR A ++GA+RL  +L  P   +   + +FF+NT +R+G GLRPD 
Sbjct: 324 NFFRIRSAFAFGAKRLARLLDCPKEDIHFEVNQFFMNTWDRHGSGLRPDA 373


>gi|302143676|emb|CBI22537.3| unnamed protein product [Vitis vinifera]
          Length = 1359

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/354 (52%), Positives = 244/354 (68%), Gaps = 20/354 (5%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           ++D E   LAEER ++++  IQP   SE++R+ V +Y++ LI   +  +VFPFGSVPLKT
Sbjct: 29  ALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSVPLKT 88

Query: 100 YLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
           YLPDGDIDLT      N+++  A  V  ILE E++  D+ F +K+VQY+ A+VKI+KC V
Sbjct: 89  YLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKIIKCLV 148

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
           +NI VDISFNQ+ GL  LCFLE+VD LI + HLFKRS+ILIKAWCYYESRILGAH+GLIS
Sbjct: 149 ENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHHGLIS 208

Query: 217 TYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           TYALE +VL+                 RFL++++ FDW+NYC+S+ GPV ISSLP++ A+
Sbjct: 209 TYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPDVTAD 268

Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
               D  ELLLS  FL  C  +Y+V     EN    F  K+ N++DPL+ NNNLGRSVSK
Sbjct: 269 PPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGRSVSK 328

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           GNF RIR A ++GAQRL  +L  P   +   + +FF+NT ER+GKG RPD   P
Sbjct: 329 GNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERHGKGDRPDAPSP 382


>gi|225462743|ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera]
          Length = 1353

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/354 (52%), Positives = 244/354 (68%), Gaps = 20/354 (5%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           ++D E   LAEER ++++  IQP   SE++R+ V +Y++ LI   +  +VFPFGSVPLKT
Sbjct: 29  ALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSVPLKT 88

Query: 100 YLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
           YLPDGDIDLT      N+++  A  V  ILE E++  D+ F +K+VQY+ A+VKI+KC V
Sbjct: 89  YLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKIIKCLV 148

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
           +NI VDISFNQ+ GL  LCFLE+VD LI + HLFKRS+ILIKAWCYYESRILGAH+GLIS
Sbjct: 149 ENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHHGLIS 208

Query: 217 TYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           TYALE +VL+                 RFL++++ FDW+NYC+S+ GPV ISSLP++ A+
Sbjct: 209 TYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPDVTAD 268

Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
               D  ELLLS  FL  C  +Y+V     EN    F  K+ N++DPL+ NNNLGRSVSK
Sbjct: 269 PPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGRSVSK 328

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           GNF RIR A ++GAQRL  +L  P   +   + +FF+NT ER+GKG RPD   P
Sbjct: 329 GNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERHGKGDRPDAPSP 382


>gi|297745424|emb|CBI40504.3| unnamed protein product [Vitis vinifera]
          Length = 1229

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 184/350 (52%), Positives = 244/350 (69%), Gaps = 21/350 (6%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E WL+AEER  E++  IQP   SE+ R  V +Y+QR++   +  +VF FGSVPLKTY
Sbjct: 30  LDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSVPLKTY 89

Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDIDLT    +QN+++  A  V  +L++E  ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 90  LPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKIIKCLVE 149

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 150 NIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHHGLIST 209

Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +                 VL+RFL+++++FDWDN+C+S+ GPV ISSLP++ AE 
Sbjct: 210 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPDVTAEP 269

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLLS  FL  C  +Y+V     E  G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 270 PRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGRSVSKG 329

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           NF RIR A ++GA+RL  +L  P   +   + + F+NT ER+G G RPD 
Sbjct: 330 NFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERHGSGHRPDT 378


>gi|225454502|ref|XP_002277075.1| PREDICTED: uncharacterized protein LOC100241322 [Vitis vinifera]
          Length = 1295

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 184/350 (52%), Positives = 244/350 (69%), Gaps = 21/350 (6%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E WL+AEER  E++  IQP   SE+ R  V +Y+QR++   +  +VF FGSVPLKTY
Sbjct: 30  LDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSVPLKTY 89

Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDIDLT    +QN+++  A  V  +L++E  ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 90  LPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKIIKCLVE 149

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 150 NIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHHGLIST 209

Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +                 VL+RFL+++++FDWDN+C+S+ GPV ISSLP++ AE 
Sbjct: 210 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPDVTAEP 269

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLLS  FL  C  +Y+V     E  G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 270 PRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGRSVSKG 329

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           NF RIR A ++GA+RL  +L  P   +   + + F+NT ER+G G RPD 
Sbjct: 330 NFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERHGSGHRPDT 378


>gi|413956606|gb|AFW89255.1| hypothetical protein ZEAMMB73_893455 [Zea mays]
          Length = 1316

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 185/353 (52%), Positives = 240/353 (67%), Gaps = 20/353 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W +AE+R  E++  IQP + SE +R  V +Y+QRLI      +VF FGSVPLKTY
Sbjct: 20  LDPERWAVAEDRTAELIACIQPNVYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T   + + ++E  A  V   LE E  D+++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80  LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKDENAEFHVKEVQYIQAEVKIIKCLVE 139

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 199

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDW+ +C+S+ GPV ISSLP++ A  
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAIP 259

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLL+  FL  C   Y V     EN G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRMDSGELLLNKSFLDTCSSAYGVVPHTQENQGQPFVSKHFNVIDPLRTNNNLGRSVSKG 319

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           NF RIR A +YGA+RLG++L  P   L   L +FF NT  R+G G RPDV +P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEALIPELNQFFTNTWIRHGSGSRPDVPIP 372


>gi|356520288|ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max]
          Length = 1331

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 189/352 (53%), Positives = 246/352 (69%), Gaps = 22/352 (6%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY--GIEVFPFGSVPLK 98
           +D+E WL AE+R  E++  IQP   SE++R  V +Y+QRLI   +   + VF FGSVPLK
Sbjct: 31  LDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVGVFTFGSVPLK 90

Query: 99  TYLPDGDIDLTVLG-HQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
           TYLPDGDIDLT    +QN+++  A  V  +LENE+++    F +K+VQY+ A+VKI+KC 
Sbjct: 91  TYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVKIIKCL 150

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           V+NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGAHHGLI 210

Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE +VL+                 RFL++++ FDW+N+C+S+ GPV ISSLP++ A
Sbjct: 211 STYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSLPDVTA 270

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           E    DG +LLLS  FL  C  +Y+V     EN G  F  KH N++DPL+ NNNLGRSVS
Sbjct: 271 EPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 330

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           KGNF RIR A ++GA++L  +L  P   L   + +FF NT ER+G G RPDV
Sbjct: 331 KGNFFRIRSAFAFGAKKLARLLDCPEEELFSEVNQFFFNTWERHGSGERPDV 382


>gi|242036527|ref|XP_002465658.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
 gi|241919512|gb|EER92656.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
          Length = 1333

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 185/353 (52%), Positives = 239/353 (67%), Gaps = 20/353 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W +AE+R  E++  IQP   SE +R  V +Y+QRLI      +VF FGSVPLKTY
Sbjct: 20  LDPERWAVAEDRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T   + + ++E  A  V   LE E  ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80  LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISRNHLFKRSIILIKAWCFYESRILGAHHGLIST 199

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDW+ +C+S+ GPV ISSLP++ AE 
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLL+  FL  C   Y V     EN G  F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRMDSGELLLNKSFLDTCSSAYGVVPRTQENQGQPFVSKHFNVIDPLRANNNLGRSVSKG 319

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           NF RIR A +YGA+RLG++L  P   L   L +FF NT  R+G G RPDV  P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIAELNQFFTNTWIRHGSGSRPDVPTP 372


>gi|356560284|ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max]
          Length = 1337

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/352 (53%), Positives = 245/352 (69%), Gaps = 22/352 (6%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY--GIEVFPFGSVPLK 98
           +D+E WL AE+R  E++  IQP   SE++R  V +Y+QRLI   +   + VF FGSVPLK
Sbjct: 31  LDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVRVFTFGSVPLK 90

Query: 99  TYLPDGDIDLTVLG-HQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
           TYLPDGDIDLT    +QN+++  A  V  +LENE+++    F +K+VQY+ A+VKI+KC 
Sbjct: 91  TYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVKIIKCL 150

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           V+NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGAHHGLI 210

Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE +VL+                 RFL++++ FDW+N+C+S+ GPV ISSLP++ A
Sbjct: 211 STYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSLPDVTA 270

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           E    DG +LLLS  FL  C  +Y+V     EN G  F  KH N++DPL+ NNNLGRSVS
Sbjct: 271 EPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 330

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           KGNF RIR A ++GA+RL  +L      L   + +FF NT ER+G G RPDV
Sbjct: 331 KGNFFRIRSAFAFGAKRLARLLDCSEDELFSEVNQFFFNTWERHGSGERPDV 382


>gi|357155485|ref|XP_003577136.1| PREDICTED: uncharacterized protein LOC100840351 [Brachypodium
           distachyon]
          Length = 739

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 202/387 (52%), Positives = 254/387 (65%), Gaps = 29/387 (7%)

Query: 10  MEDLQE-ELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEK 68
           M D++E  L   P H        +P N  P S+  E+W   E     ++  IQP I SE 
Sbjct: 1   MVDIREVSLAPEPKH--------APANPDPSSVSPEVWEPLEAAALAVIGRIQPTIPSEG 52

Query: 69  KRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
            R  V++YIQRL+    G +VFPFGSVPLKTYLPDGDIDLT  G    +E LA  V  IL
Sbjct: 53  LRASVVDYIQRLVRCSVGCQVFPFGSVPLKTYLPDGDIDLTAFGSTYSDESLANEVRAIL 112

Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
           E E++     FE+KDVQY+ A+VK+VKC VQNI VDISFNQM GL  LCFLE+VD+   K
Sbjct: 113 EAEERREDAEFEVKDVQYIHAEVKLVKCFVQNIVVDISFNQMGGLCTLCFLEQVDQRFEK 172

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
           +HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+                 RFL
Sbjct: 173 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFL 232

Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
           DYY+ FDWDN  IS+ GPV++SSLPE+V E      D  L   EFLK+C ++++VP    
Sbjct: 233 DYYSKFDWDNKGISLYGPVSLSSLPELVTEPTGTHDDSFLQREEFLKECAKMFTVPPRLN 292

Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
           E     F  K+ NIVDPLK +NNLGRSVSKGNF+RIR A   GA++LG+IL +P +    
Sbjct: 293 EKNTRPFYQKYFNIVDPLKQSNNLGRSVSKGNFYRIRSAFDLGARKLGKILQMPANSTVD 352

Query: 350 GLEKFFINTLERNGKGLRPDVQ-VPIH 375
            + +FF +TL+RN   +RPD+Q +P++
Sbjct: 353 EVNQFFKSTLKRNHSMVRPDIQDIPVN 379


>gi|147780178|emb|CAN75522.1| hypothetical protein VITISV_043595 [Vitis vinifera]
          Length = 733

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 246/588 (41%), Positives = 327/588 (55%), Gaps = 91/588 (15%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ-DSVFEIKDVQYV 145
           IEV PFGS+PLKTYLPDGDIDLT L  +N EED AR VC +LE E Q  S F ++D+ Y+
Sbjct: 10  IEVLPFGSMPLKTYLPDGDIDLTALCPENDEEDFARDVCTLLEGERQMGSEFRVEDISYI 69

Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
            A+VKIVKC VQ+I VDISFNQ  GLS LCFLE++D LIGKDHLFKRSVILIKAWCYYE 
Sbjct: 70  RAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEG 129

Query: 206 RILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPV 248
           RILG+H GL+STYALE++                 VL+RFLDYY+TFDW+ + +S+ GPV
Sbjct: 130 RILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPV 189

Query: 249 AISSL-------------------------------PE---IVAETVENDGDELLLSPEF 274
           +ISSL                               P+    VAE  E     LL++ EF
Sbjct: 190 SISSLLTGARESCLIMWLCLMVCFFRLIGLPFYLIFPDFVLFVAEAAETADKPLLINEEF 249

Query: 275 LKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQ 334
           L  C+E ++V I A E T   F +KH+NI DPL+D NNLGRS+S GN +R R A+S GAQ
Sbjct: 250 LWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRFRYAISVGAQ 309

Query: 335 RLGEI-LTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDS 393
           RL EI L LP   +  GL++FF NTL+RNG G   D +  +  FG G S+  +L GDY  
Sbjct: 310 RLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGAD-EGDLVPFGPGSSKFCNLQGDYYG 368

Query: 394 YYNGLLYGQCYHDY---------TLPVTAQ---FSPPSSP--------SQIPSRSAWDVL 433
             + L YGQ YH           +L V  Q   F+  + P        S +PS++ +   
Sbjct: 369 CLSNLHYGQFYHANLSSFSLHPGSLDVPFQWMYFNQSNVPWGYTNHFASMLPSQNTFQFY 428

Query: 434 C---QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERT 490
               Q+ + + N+        +VP LPFC+  ASQ+    +  +E +  +GTGT+IP   
Sbjct: 429 SPSFQFEEQESNM------NAYVPSLPFCYETASQLSGLAYHFEE-KVPQGTGTFIPNVA 481

Query: 491 RPPPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNME 550
           +    ++NR    ++PE   H P S + +     +R P   ++ S   L  E     N E
Sbjct: 482 QIFHNDLNRWEHGQDPE---HEPRSGMPQTRWAVKRNPV--SMASSVLLEPEQEGGKNSE 536

Query: 551 NSCSLILSL--EEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSP 596
            +  ++     +E  ++  +KN  PS   +Q  QS  +S   D V  P
Sbjct: 537 PTLQVLSPKPEDEGGMVSKSKNQAPSLKPEQERQSNQESKAQDLVLKP 584


>gi|242069725|ref|XP_002450139.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
 gi|241935982|gb|EES09127.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
          Length = 835

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 198/362 (54%), Positives = 248/362 (68%), Gaps = 21/362 (5%)

Query: 31  SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
           S P +  P S+  + W   E     ++  IQP   SE+ R  VI Y+QRL       +VF
Sbjct: 13  SVPAHPDPASVSPDAWRRFETAALAVVNKIQPTAASEQLRAAVIEYVQRLFWFQARYQVF 72

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
           PFGSVPLKTYLPDGDIDLT+ G    +E+LA  VC IL++E+  +DS FE+KDV YVPA+
Sbjct: 73  PFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCAILKSEERRKDSEFEVKDVHYVPAE 132

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VK+VKC VQNI VDIS NQ+ GL  LCFLEKVD+  GK+HLFKRS++L+K WCYYESRIL
Sbjct: 133 VKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQNFGKNHLFKRSIMLVKDWCYYESRIL 192

Query: 209 GAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAIS 251
           GAH+GLISTYALE +VL+                 RFLDYY+ FDWDN  IS+ GPV++S
Sbjct: 193 GAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLS 252

Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
           SLPE+V +  +   D+ L   EFLK+C E +SV     E     F+ + LNIVDPLK +N
Sbjct: 253 SLPELVTDPPDTQDDDFLQREEFLKECTESFSVLPRNSETNPRVFSRRFLNIVDPLKQSN 312

Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMG-LEKFFINTLERNGKGLRPDV 370
           NLGRSVSKGNF+RIR A  +GA++LG+IL +P  CL +  + +FF NTL+RN  GLRPDV
Sbjct: 313 NLGRSVSKGNFYRIRSAFDFGARKLGKILQVPS-CLTVSEVNQFFRNTLKRNRTGLRPDV 371

Query: 371 QV 372
            V
Sbjct: 372 WV 373


>gi|255554485|ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223542501|gb|EEF44041.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 821

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 187/345 (54%), Positives = 239/345 (69%), Gaps = 19/345 (5%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P  I  E W  AE+   +I+  I P + ++  RK V+ Y+Q LI    G +VFP+GSVPL
Sbjct: 43  PALISEENWERAEQATLQIVYRIHPTVEADCNRKHVVEYVQSLIQSSLGFQVFPYGSVPL 102

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDLT + +    +     V  +L  E+Q  D+ +++KDV ++ A+VK++KC 
Sbjct: 103 KTYLPDGDIDLTAIINPAGVDASVSDVHAVLRREEQNRDAPYKVKDVHFIDAEVKLIKCI 162

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           V +I VDISFNQ+ GLS LCFLE+VD+LIGK HLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 163 VHDIVVDISFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWCYYESRILGAHHGLI 222

Query: 216 STYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE                 +MVL+RFLDY++ FDWDNYCIS+NGPV  SSLP+IVA
Sbjct: 223 STYALETLILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPKIVA 282

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           E  E     LLL  EFL+   ++ SVP  + E     F  KHLNIVDPL++NNNLGRSV+
Sbjct: 283 EPPETGRGNLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIVDPLRENNNLGRSVN 342

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
           +GNF+RIR A  YGA++LG IL+L    +   L+KFF NTL+R+G
Sbjct: 343 RGNFYRIRSAFKYGARKLGHILSLQSDRMINELDKFFANTLDRHG 387



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 12/160 (7%)

Query: 386 DLSGDYDSYYNGLLYGQ--CYHDYTLPVTAQFSPPSSPSQIPSRSA--WDVLCQYVQGKG 441
           DL+GDYDS+   + +GQ  C+   + PV      P SP+   S++   W+ + Q +Q K 
Sbjct: 551 DLTGDYDSHLKSVKFGQGCCFFAVSAPVL-----PCSPTAPHSKNKNPWETVRQSLQLKR 605

Query: 442 NLVYQWGTE-VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRP 500
           N+  Q  T  +F  +  F +       A  FS++E  K RGTGTYIP  +     E    
Sbjct: 606 NVHSQINTNGIFGHQQHFLNHLVPFTTA--FSSEEKRKQRGTGTYIPNMSYHSNRERPSS 663

Query: 501 VRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLN 540
            R +N  + N+  + +  R    A   P  ++ + G  L+
Sbjct: 664 ERRKNHVTANNGDLHRRTRDNGLAATRPGINSYQHGHELS 703


>gi|414882102|tpg|DAA59233.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
          Length = 875

 Score =  370 bits (950), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 193/360 (53%), Positives = 244/360 (67%), Gaps = 19/360 (5%)

Query: 32  SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
           +P +  P SI  + W   E     IL TIQP+  SE  R  +I+Y+QRL+  + G++VFP
Sbjct: 15  APAHPDPSSIPRDAWRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFP 74

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
           FGSVPLKTYLPDGDIDLT  G    +E LA  VC IL++E+  +DS F++KDVQY+ A+V
Sbjct: 75  FGSVPLKTYLPDGDIDLTTFGPAISDEKLANEVCAILKSEEHRKDSEFDVKDVQYIHAEV 134

Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K+VKC VQNI VDIS NQ+ GL  LCFLEKVD   GK HLFKRSV+LIK WCYYE+RILG
Sbjct: 135 KLVKCVVQNIIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILG 194

Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
           AH+GLISTYALE++                 VL+RFLDYY+ FDWD   IS+ GPV++SS
Sbjct: 195 AHHGLISTYALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSS 254

Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
           LP++V +      D  LL  +FL++C + +SVP    E     F+ K LNIVDPLK +NN
Sbjct: 255 LPDLVTDPPVIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNN 314

Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
           LGRSVS+GNF+RIR A  +GA++LG+IL  P       + +FF NTL+RN  G R DV V
Sbjct: 315 LGRSVSRGNFYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRNRIGFRQDVLV 374


>gi|414882101|tpg|DAA59232.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
          Length = 906

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 193/360 (53%), Positives = 244/360 (67%), Gaps = 19/360 (5%)

Query: 32  SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
           +P +  P SI  + W   E     IL TIQP+  SE  R  +I+Y+QRL+  + G++VFP
Sbjct: 15  APAHPDPSSIPRDAWRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFP 74

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
           FGSVPLKTYLPDGDIDLT  G    +E LA  VC IL++E+  +DS F++KDVQY+ A+V
Sbjct: 75  FGSVPLKTYLPDGDIDLTTFGPAISDEKLANEVCAILKSEEHRKDSEFDVKDVQYIHAEV 134

Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K+VKC VQNI VDIS NQ+ GL  LCFLEKVD   GK HLFKRSV+LIK WCYYE+RILG
Sbjct: 135 KLVKCVVQNIIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILG 194

Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
           AH+GLISTYALE++                 VL+RFLDYY+ FDWD   IS+ GPV++SS
Sbjct: 195 AHHGLISTYALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSS 254

Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
           LP++V +      D  LL  +FL++C + +SVP    E     F+ K LNIVDPLK +NN
Sbjct: 255 LPDLVTDPPVIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNN 314

Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
           LGRSVS+GNF+RIR A  +GA++LG+IL  P       + +FF NTL+RN  G R DV V
Sbjct: 315 LGRSVSRGNFYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRNRIGFRQDVLV 374


>gi|357112328|ref|XP_003557961.1| PREDICTED: uncharacterized protein LOC100823912 [Brachypodium
           distachyon]
          Length = 1051

 Score =  369 bits (947), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 182/330 (55%), Positives = 236/330 (71%), Gaps = 19/330 (5%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
           +QP   SE++R EVI+Y +R++    G EVF FGSVPLKTYLPDGDIDLTVLG+ + +  
Sbjct: 42  VQPTEASERRRAEVIDYARRIVGTALGCEVFAFGSVPLKTYLPDGDIDLTVLGNASCDST 101

Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
           L   V  IL + +Q  D+ FE+KD++++ A+VK++KC+++NI VDISFNQ  G+ ALCFL
Sbjct: 102 LIDDVYCILGSGEQNSDAEFEVKDLEHIDAEVKLIKCTIENIIVDISFNQTGGICALCFL 161

Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM-------------- 223
           E VDR IGK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE +              
Sbjct: 162 ELVDRKIGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALETLILYIFNLFHKSLHG 221

Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
              VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP ++ E      D+LL   EFL    E
Sbjct: 222 PLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEGTNIPVDDLLFDKEFLHSSVE 281

Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
             SVP    +    +F +KHLNI+DPLK+ NNLGRSV+K NF RIR A SYGA++LG+ L
Sbjct: 282 KASVPPRDSDARCTKFRVKHLNIIDPLKECNNLGRSVNKANFSRIRTAFSYGARKLGQYL 341

Query: 341 TLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
            LP   +   +  FF NTL+RNG+G+R D+
Sbjct: 342 MLPSERISGEIFGFFKNTLKRNGRGVRADI 371


>gi|414865289|tpg|DAA43846.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
          Length = 1348

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 184/353 (52%), Positives = 237/353 (67%), Gaps = 20/353 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W +AE R  E++  IQP   SE +R  V +Y+QRLI      +VF FGSVPLKTY
Sbjct: 20  LDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T   + + ++E  A  V   LE E  ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80  LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHGLIST 199

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDW+ +C+S+ GPV ISSLP++ AE 
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLL+  FL  C   Y V     EN    F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRSVSKG 319

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           NF RIR A +YGA+RLG++L  P   L   L +FF NT  R+G G RPDV  P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRHGSGSRPDVPTP 372


>gi|414865287|tpg|DAA43844.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
 gi|414865288|tpg|DAA43845.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
          Length = 1332

 Score =  368 bits (944), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 184/353 (52%), Positives = 237/353 (67%), Gaps = 20/353 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W +AE R  E++  IQP   SE +R  V +Y+QRLI      +VF FGSVPLKTY
Sbjct: 20  LDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T   + + ++E  A  V   LE E  ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80  LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHGLIST 199

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDW+ +C+S+ GPV ISSLP++ AE 
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLL+  FL  C   Y V     EN    F  KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRSVSKG 319

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           NF RIR A +YGA+RLG++L  P   L   L +FF NT  R+G G RPDV  P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRHGSGSRPDVPTP 372


>gi|224118186|ref|XP_002317752.1| predicted protein [Populus trichocarpa]
 gi|222858425|gb|EEE95972.1| predicted protein [Populus trichocarpa]
          Length = 353

 Score =  367 bits (943), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 183/353 (51%), Positives = 248/353 (70%), Gaps = 20/353 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           ++ E W +AEER  E++  IQP   SE++R  V+ Y+QRLI   +  +VF FGSVPLKTY
Sbjct: 1   LELERWAIAEERTAELIACIQPNQPSEERRTAVLGYVQRLIMKCFPCQVFTFGSVPLKTY 60

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+TV    Q++++  A  V  IL++E+  +++ F +K+VQY+ A+VKI+KC V+
Sbjct: 61  LPDGDIDITVFTESQDLKKTWADEVKDILQHEEKSENAEFHVKEVQYIQAEVKIIKCLVE 120

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE+VD+LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 121 NIVVDISFNQLGGLCTLCFLEEVDQLISQNHLFKRSIILIKAWCYYESRILGAHHGLIST 180

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL++++ FDW+++CIS+ GPV ISSLP + A +
Sbjct: 181 YALETLVLYIFHVFNNRFAGPLEVLYRFLEFFSKFDWEHFCISLWGPVPISSLPNVTALS 240

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              DG ++LLS  FL+ C  +Y+V  +  EN    F  K+ N++DPL+ NNNLGRSVSKG
Sbjct: 241 PREDGGQILLSQLFLEVCSSVYAVFPSQQENQEQSFVSKYFNVIDPLRTNNNLGRSVSKG 300

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           NF+RIR A ++GAQRL  +L  P   L     +FF+NT +R+ KG RPD   P
Sbjct: 301 NFYRIRSAFAFGAQRLARLLDCPKENLLAEFNQFFMNTWDRHCKGHRPDAPSP 353


>gi|222616508|gb|EEE52640.1| hypothetical protein OsJ_34991 [Oryza sativa Japonica Group]
          Length = 801

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 190/359 (52%), Positives = 245/359 (68%), Gaps = 20/359 (5%)

Query: 32  SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
           +P N  P SI  E W   E     ++  IQP   SE +R  VI Y+Q L+    G +VFP
Sbjct: 16  TPSNPDPSSISQEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLLRFNVGCQVFP 75

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
           FGSVPLKTYLPDGDIDLT  GH + +E LA+ V  +LE+E+  +D+ FE+KDVQY+ A+V
Sbjct: 76  FGSVPLKTYLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEV 134

Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K+VKC VQNI VDISFNQ  GL  LCFLEKVD+   K+HLFKRS++LIKAWCYYESRILG
Sbjct: 135 KLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYESRILG 194

Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
           AH+GLISTYALE++                 VL+RFLDYY+ FDWDN  IS+ GP+++SS
Sbjct: 195 AHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSS 254

Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
           LPE+V ++ +   D+  +  +FLK+C + ++V     E     F  K  NIVDPLK +NN
Sbjct: 255 LPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNN 314

Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
           LGRSVSKGNF RIR A  +GA++LG+IL +P +     + +FF NTL+R+   +RPDVQ
Sbjct: 315 LGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRNTLKRHCSRVRPDVQ 373



 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 9/114 (7%)

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPP--SSPSQIPSRSAWDVLC 434
           G+  S  +DL+GDY + +N LLY Q +H DY +     F P     PS  P+R   +   
Sbjct: 531 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYQMPSPPPARYQ-NRRS 589

Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFST-DEGEKSRGTGTYIP 487
                + N+    GT    P    C P    +R  T+S  D+  ++RGTGTY P
Sbjct: 590 SNGHSRNNVYGYAGTNGIGPAP--CPPGYLIMR--TYSQIDDSHRARGTGTYFP 639


>gi|326531888|dbj|BAK01320.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 702

 Score =  367 bits (942), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 187/353 (52%), Positives = 242/353 (68%), Gaps = 19/353 (5%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P SI  + W         ++  IQP + SE +R  V++Y+QRL+    G  VFPFGSVPL
Sbjct: 24  PSSISPDAWAPFGAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVFPFGSVPL 83

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDL   G    +E +A  V  ILE+E+  +D+ FEIKDVQY+ A+VK+VKC 
Sbjct: 84  KTYLPDGDIDLAAFGSTCSDESIANEVRAILESEERRKDAEFEIKDVQYINAEVKLVKCF 143

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           VQNI VDISFNQ+ GL  LCFLE+VD+   K+HLFKRS++LIKAWCYYESRILGAH+GLI
Sbjct: 144 VQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGAHHGLI 203

Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE +VL+                 RFLDYY+ FDWDN  IS++GP+++SSLP++V 
Sbjct: 204 STYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSLPDLVT 263

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           +      D  L   EFL++C ++++VP    E T   F  K LNIVDPLK +NNLGRSVS
Sbjct: 264 DPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNLGRSVS 323

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
           KGNF+RIR A   GA++LG+IL +P + +   + +FF +TL+RN   +RPDVQ
Sbjct: 324 KGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRNRSRVRPDVQ 376


>gi|218192316|gb|EEC74743.1| hypothetical protein OsI_10487 [Oryza sativa Indica Group]
          Length = 1316

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 181/351 (51%), Positives = 239/351 (68%), Gaps = 21/351 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W  AE R  E++  IQP   SE++R+ V +Y++RLI      +VF FGSVPLKTY
Sbjct: 18  LDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVPLKTY 77

Query: 101 LPDGDIDLTVLG-HQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T     + +++  A  V   LE+E+  +++ F +K+VQY+ A+VKI+KC V 
Sbjct: 78  LPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIKCLVD 137

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 197

Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +                 VL+RFL++++ FDW+ +C+S++GPV ISSLP++ AE 
Sbjct: 198 YALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDMTAEP 257

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLGRSVSK 319
              D  ELLLS  FL KC   Y+V     E+ G + F  KH N++DPL+ NNNLGRSVSK
Sbjct: 258 PRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRSVSK 317

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           GNF RIR A S+GA+RL ++L  P   L   + +FF NT  R+G G RPD 
Sbjct: 318 GNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 368


>gi|108706800|gb|ABF94595.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
           sativa Japonica Group]
          Length = 1316

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 181/351 (51%), Positives = 239/351 (68%), Gaps = 21/351 (5%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W  AE R  E++  IQP   SE++R+ V +Y++RLI      +VF FGSVPLKTY
Sbjct: 18  LDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVPLKTY 77

Query: 101 LPDGDIDLTVLG-HQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T     + +++  A  V   LE+E+  +++ F +K+VQY+ A+VKI+KC V 
Sbjct: 78  LPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIKCLVD 137

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 197

Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +                 VL+RFL++++ FDW+ +C+S++GPV ISSLP++ AE 
Sbjct: 198 YALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDMTAEP 257

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLGRSVSK 319
              D  ELLLS  FL KC   Y+V     E+ G + F  KH N++DPL+ NNNLGRSVSK
Sbjct: 258 PRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRSVSK 317

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           GNF RIR A S+GA+RL ++L  P   L   + +FF NT  R+G G RPD 
Sbjct: 318 GNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 368


>gi|242041009|ref|XP_002467899.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
 gi|241921753|gb|EER94897.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
          Length = 1046

 Score =  365 bits (937), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 177/335 (52%), Positives = 238/335 (71%), Gaps = 19/335 (5%)

Query: 55  EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
           E++  ++P   SE++R +V++Y +RL+    G EVF FGSVPLKTYLPDGDIDLTVLG+ 
Sbjct: 35  EVVRRVRPTEASERRRADVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNT 94

Query: 115 NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
           + +  L   V  ILE+E+Q  D+ F +K+++ + A+V+++KC++ NI +DISFNQ  G+ 
Sbjct: 95  SYDSTLVNDVYCILESEEQNSDAEFIVKNLERIDAEVRLIKCTIGNIIIDISFNQTGGIC 154

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
           ALCFLE VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE++         
Sbjct: 155 ALCFLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYIFNLFH 214

Query: 224 --------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFL 275
                   VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP +  E       +LL   EFL
Sbjct: 215 KSLHSPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLTVEATITHTSDLLFDKEFL 274

Query: 276 KKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQR 335
           K   +  +VP    ++    F  KHLNIVDPLK++NNLGRSV++ +F+RIR A  YGA++
Sbjct: 275 KSSMDKATVPPKNSDSCYTRFRPKHLNIVDPLKEHNNLGRSVNRASFNRIRTAFLYGARK 334

Query: 336 LGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           LG IL LP   +   +  FF NTLERNG G+RPD+
Sbjct: 335 LGHILMLPSEVIPDEIYGFFKNTLERNGIGVRPDI 369


>gi|326492351|dbj|BAK01959.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 724

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 188/360 (52%), Positives = 243/360 (67%), Gaps = 26/360 (7%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-------IEVF 90
           P SI  + W   E     ++  IQP + SE +R  V++Y+QRL+    G         VF
Sbjct: 24  PSSISPDAWAPFEAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVPVTPFPVF 83

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
           PFGSVPLKTYLPDGDIDL   G    +E +A  V  ILE+E+  +D+ FEIKDVQY+ A+
Sbjct: 84  PFGSVPLKTYLPDGDIDLAAFGSTCSDESIANEVRAILESEERRKDAEFEIKDVQYINAE 143

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VK+VKC VQNI VDISFNQ+ GL  LCFLE+VD+   K+HLFKRS++LIKAWCYYESRIL
Sbjct: 144 VKLVKCFVQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRIL 203

Query: 209 GAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAIS 251
           GAH+GLISTYALE +VL+                 RFLDYY+ FDWDN  IS++GP+++S
Sbjct: 204 GAHHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLS 263

Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
           SLP++V +      D  L   EFL++C ++++VP    E T   F  K LNIVDPLK +N
Sbjct: 264 SLPDLVTDPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSN 323

Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
           NLGRSVSKGNF+RIR A   GA++LG+IL +P + +   + +FF +TL+RN   +RPDVQ
Sbjct: 324 NLGRSVSKGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRNRSRVRPDVQ 383


>gi|414866687|tpg|DAA45244.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
          Length = 1050

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 182/351 (51%), Positives = 244/351 (69%), Gaps = 22/351 (6%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
           ++P   SE++R EV++Y +RL+    G EVF FGSVPLKTYLPDGDIDLTVLG+ + +  
Sbjct: 38  VRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDST 97

Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
           L   V  ILE+E+Q  D+ F +KD++ + A+V+++KC++ NI VDISFNQ  G+ ALCFL
Sbjct: 98  LVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCFL 157

Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM-------------- 223
           E VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE++              
Sbjct: 158 ELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSLHS 217

Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
              VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP ++ E       +LL   EFLK   +
Sbjct: 218 PVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSSMD 277

Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
             +VP    ++    F  KHLNIVDPLK+ NNLGRSV++ +F+RIR A  YGA++LG I+
Sbjct: 278 KATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGHIV 337

Query: 341 TLPGHCLGMGLEKFFINTLERNGKGLRPDV--QVPIH-AFGSGKSEAADLS 388
           TLP   +   + +FF NTL RN  G RPD+      H +FG+ ++   D+S
Sbjct: 338 TLPSEVIPDEIYEFFKNTLGRNELGARPDIDSNYAFHSSFGTAETILEDIS 388


>gi|414866686|tpg|DAA45243.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
          Length = 1056

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 182/351 (51%), Positives = 244/351 (69%), Gaps = 22/351 (6%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
           ++P   SE++R EV++Y +RL+    G EVF FGSVPLKTYLPDGDIDLTVLG+ + +  
Sbjct: 38  VRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDST 97

Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
           L   V  ILE+E+Q  D+ F +KD++ + A+V+++KC++ NI VDISFNQ  G+ ALCFL
Sbjct: 98  LVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCFL 157

Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM-------------- 223
           E VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE++              
Sbjct: 158 ELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSLHS 217

Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
              VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP ++ E       +LL   EFLK   +
Sbjct: 218 PVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSSMD 277

Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
             +VP    ++    F  KHLNIVDPLK+ NNLGRSV++ +F+RIR A  YGA++LG I+
Sbjct: 278 KATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGHIV 337

Query: 341 TLPGHCLGMGLEKFFINTLERNGKGLRPDV--QVPIH-AFGSGKSEAADLS 388
           TLP   +   + +FF NTL RN  G RPD+      H +FG+ ++   D+S
Sbjct: 338 TLPSEVIPDEIYEFFKNTLGRNELGARPDIDSNYAFHSSFGTAETILEDIS 388


>gi|42566126|ref|NP_191728.2| nucleotidyltransferase [Arabidopsis thaliana]
 gi|332646720|gb|AEE80241.1| nucleotidyltransferase [Arabidopsis thaliana]
          Length = 1303

 Score =  362 bits (929), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 186/381 (48%), Positives = 250/381 (65%), Gaps = 33/381 (8%)

Query: 23  HTSSSSSYSSPPNLYPFSI------------DAELWLLAEERIQEILCTIQPAIVSEKKR 70
           H S ++S  SP  L+P  +            DAE W  AE+R  +++  IQP   SE +R
Sbjct: 4   HESWAASPPSPSGLHPNGLLPGKAASVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRR 63

Query: 71  KEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKIL 128
             V +Y++RLI   +  +++F FGSVPLKTYLPDGDIDLT    +QN+++  A  V  +L
Sbjct: 64  NAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDML 123

Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
           E E+++    F +K+VQY+ A+VKI+KC V+NI VDISFNQ+ GL  LCFLE+VD  I +
Sbjct: 124 EKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQ 183

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
           +HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+                 RFL
Sbjct: 184 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFL 243

Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
           ++++ FDW N+C+S+ GPV +SSLP++ AE    D  EL +S  F + C  +Y+V I   
Sbjct: 244 EFFSKFDWQNFCLSLWGPVPVSSLPDVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQ 303

Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
           E  G  F  KH N++DPL++NNNLGRSVSKGNF RIR A + GA++L  +L  P   L  
Sbjct: 304 EIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIH 363

Query: 350 GLEKFFINTLERNGKGLRPDV 370
            + +FF+NT ER+G G RPD 
Sbjct: 364 EVNQFFMNTWERHGSGRRPDA 384


>gi|77548394|gb|ABA91191.1| nucleotidyltransferase family protein, putative, expressed [Oryza
           sativa Japonica Group]
          Length = 783

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 186/351 (52%), Positives = 240/351 (68%), Gaps = 20/351 (5%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           SI  E W   E     ++  IQP   SE +R  VI Y+Q L+    G +VFPFGSVPLKT
Sbjct: 24  SISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKT 83

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
           YLPDGDIDLT  GH + +E LA+ V  +LE+E+  +D+ FE+KDVQY+ A+VK+VKC VQ
Sbjct: 84  YLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIVQ 142

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ  GL  LCFLEKVD+   K HLFKRS++LIKAWCYYESRILGAH+GLIST
Sbjct: 143 NIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLIST 202

Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE++                 VL+RFLDYY+ FDWDN  IS+ GP+++SSLPE+V ++
Sbjct: 203 YALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTDS 262

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
            +   D+  +  +FLK+C + ++V     E     F  K  NIVDPLK +NNLGRSVSKG
Sbjct: 263 PDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSKG 322

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
           NF RIR A  +GA++LG+I+ +P +     + +FF NTL+R+   +RPDVQ
Sbjct: 323 NFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRHCSRVRPDVQ 373



 Score = 43.1 bits (100), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 3/111 (2%)

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPPSSPSQIPSRSAWDVLCQY 436
           G+  S  +DL+GDY + +N LLY Q +H DY +     F P   P   P  + +      
Sbjct: 531 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYPIPSPPPARYQNRRSS 590

Query: 437 VQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
                N VY +     +   P C P    +R  +   D+  ++RGTGTY P
Sbjct: 591 NGHSRNNVYGYAGTNGIGPAP-CPPGYLIMRPYS-QIDDSNRARGTGTYFP 639


>gi|6850860|emb|CAB71099.1| putative protein [Arabidopsis thaliana]
          Length = 1388

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 186/381 (48%), Positives = 250/381 (65%), Gaps = 33/381 (8%)

Query: 23  HTSSSSSYSSPPNLYPFSI------------DAELWLLAEERIQEILCTIQPAIVSEKKR 70
           H S ++S  SP  L+P  +            DAE W  AE+R  +++  IQP   SE +R
Sbjct: 4   HESWAASPPSPSGLHPNGLLPGKAASVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRR 63

Query: 71  KEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKIL 128
             V +Y++RLI   +  +++F FGSVPLKTYLPDGDIDLT    +QN+++  A  V  +L
Sbjct: 64  NAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDML 123

Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
           E E+++    F +K+VQY+ A+VKI+KC V+NI VDISFNQ+ GL  LCFLE+VD  I +
Sbjct: 124 EKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQ 183

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
           +HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+                 RFL
Sbjct: 184 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFL 243

Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
           ++++ FDW N+C+S+ GPV +SSLP++ AE    D  EL +S  F + C  +Y+V I   
Sbjct: 244 EFFSKFDWQNFCLSLWGPVPVSSLPDVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQ 303

Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
           E  G  F  KH N++DPL++NNNLGRSVSKGNF RIR A + GA++L  +L  P   L  
Sbjct: 304 EIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIH 363

Query: 350 GLEKFFINTLERNGKGLRPDV 370
            + +FF+NT ER+G G RPD 
Sbjct: 364 EVNQFFMNTWERHGSGRRPDA 384


>gi|115483835|ref|NP_001065579.1| Os11g0114700 [Oryza sativa Japonica Group]
 gi|77548393|gb|ABA91190.1| nucleotidyltransferase family protein, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113644283|dbj|BAF27424.1| Os11g0114700 [Oryza sativa Japonica Group]
 gi|215694848|dbj|BAG90039.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218185112|gb|EEC67539.1| hypothetical protein OsI_34858 [Oryza sativa Indica Group]
 gi|222615390|gb|EEE51522.1| hypothetical protein OsJ_32709 [Oryza sativa Japonica Group]
          Length = 801

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 186/351 (52%), Positives = 240/351 (68%), Gaps = 20/351 (5%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           SI  E W   E     ++  IQP   SE +R  VI Y+Q L+    G +VFPFGSVPLKT
Sbjct: 24  SISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKT 83

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
           YLPDGDIDLT  GH + +E LA+ V  +LE+E+  +D+ FE+KDVQY+ A+VK+VKC VQ
Sbjct: 84  YLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIVQ 142

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ  GL  LCFLEKVD+   K HLFKRS++LIKAWCYYESRILGAH+GLIST
Sbjct: 143 NIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLIST 202

Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE++                 VL+RFLDYY+ FDWDN  IS+ GP+++SSLPE+V ++
Sbjct: 203 YALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTDS 262

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
            +   D+  +  +FLK+C + ++V     E     F  K  NIVDPLK +NNLGRSVSKG
Sbjct: 263 PDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSKG 322

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
           NF RIR A  +GA++LG+I+ +P +     + +FF NTL+R+   +RPDVQ
Sbjct: 323 NFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRHCSRVRPDVQ 373



 Score = 43.1 bits (100), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 3/111 (2%)

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPPSSPSQIPSRSAWDVLCQY 436
           G+  S  +DL+GDY + +N LLY Q +H DY +     F P   P   P  + +      
Sbjct: 531 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYPIPSPPPARYQNRRSS 590

Query: 437 VQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
                N VY +     +   P C P    +R  +   D+  ++RGTGTY P
Sbjct: 591 NGHSRNNVYGYAGTNGIGPAP-CPPGYLIMRPYS-QIDDSNRARGTGTYFP 639


>gi|297817502|ref|XP_002876634.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
 gi|297322472|gb|EFH52893.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
          Length = 1302

 Score =  361 bits (926), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 186/381 (48%), Positives = 250/381 (65%), Gaps = 33/381 (8%)

Query: 23  HTSSSSSYSSPPNLYPFSI------------DAELWLLAEERIQEILCTIQPAIVSEKKR 70
           H S ++S  SP  LYP  +            DAE W  AE+R  +++  IQP   SE +R
Sbjct: 4   HESWAASPPSPTLLYPNGLLPGKAASVTRQLDAERWAKAEDRTAKLIACIQPNPPSEDRR 63

Query: 71  KEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKIL 128
             V +Y++RLI   +  +++F FGSVPLKTYLPDGDIDLT    +QN+++  A  V  +L
Sbjct: 64  NAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDML 123

Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
           E E+++    F +K+VQY+ A+VKI+KC V+NI VDISFNQ+ GL  LCFLE+VD  I +
Sbjct: 124 EKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQ 183

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
           +HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+                 RFL
Sbjct: 184 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFL 243

Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
           ++++ FDW N+C+S+ GPV +SSLP++ A     D  EL +S  F + C ++Y+V I   
Sbjct: 244 EFFSKFDWQNFCLSLWGPVPVSSLPDVTAAPPRKDVGELRVSEAFYRACSKVYAVNIAPQ 303

Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
           E  G  F  KH N++DPL++NNNLGRSVSKGNF RIR A + GA++L  +L  P   L  
Sbjct: 304 EIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLARLLECPKENLIH 363

Query: 350 GLEKFFINTLERNGKGLRPDV 370
            + +FF+NT ER+G G RPD 
Sbjct: 364 EVNQFFMNTWERHGSGRRPDA 384


>gi|357113459|ref|XP_003558520.1| PREDICTED: uncharacterized protein LOC100841269 [Brachypodium
           distachyon]
          Length = 1305

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 181/369 (49%), Positives = 237/369 (64%), Gaps = 24/369 (6%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W +AE R  E++  IQP   SE +R  V NY++RLI      EVF FGSVPLKTY
Sbjct: 18  LDPERWAVAESRTAELIARIQPNAHSEGRRLAVYNYVRRLIMNCLSCEVFTFGSVPLKTY 77

Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
           LPDGDID+T   + + +++  A  V   LE+E+  +++ F +K+VQY+ A+VKI+KC V 
Sbjct: 78  LPDGDIDVTAFSNSEELKDTWANLVRDALEHEEKSENAEFCVKEVQYIQAEVKIIKCLVD 137

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI VDISFNQ+ GL  LCFLE+VD LI   HLFKRS+IL+KAWC+YESRILGAH+GLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDNLINHSHLFKRSIILVKAWCFYESRILGAHHGLIST 197

Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
           YALE +VL+                 RFL+++  FDW+ +C+S+ GPV ISSLP++ AE 
Sbjct: 198 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFGNFDWEKFCLSLWGPVPISSLPDMTAEP 257

Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
              D  ELLL   FL  C + Y V     E  G  F  KH N++DPL+ NNNLGRSV KG
Sbjct: 258 PRMDTGELLLGKPFLDNCNQAYGVMPRTQETQGQPFVSKHFNVIDPLRTNNNLGRSVGKG 317

Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ----VPIHA 376
           N+ RIR A  +GA++L ++L  P   +   + +FF NTL R+G G RPD      VP H 
Sbjct: 318 NYFRIRSAFCFGAKKLAKLLECPKEDIITEVNQFFTNTLTRHGSGNRPDAPTPNLVPEHT 377

Query: 377 FGSGKSEAA 385
                +EA+
Sbjct: 378 LKVAPAEAS 386


>gi|302802985|ref|XP_002983246.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
 gi|300148931|gb|EFJ15588.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
          Length = 351

 Score =  358 bits (918), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 184/354 (51%), Positives = 241/354 (68%), Gaps = 31/354 (8%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E WL AE R  E++  IQP   SE +R+ V +Y++RLI   +  EVF FGSVPL+TY
Sbjct: 1   LDDERWLQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60

Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV-KIVKCSV 156
           LPDGDIDLT   GHQ+++E  A  V  +LE E+  +D+ F +K+VQY+ A+V KI+KC V
Sbjct: 61  LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
           +NI VDISFNQ+ GL  LCFLE+VDRLIG+DHLFKRS+IL+KAWCYYESRILGAH+GLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180

Query: 217 TYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           TYALE +                 VL+RFL++++ FDWD YC+S+ GP+ +S+LP++   
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237

Query: 260 TVENDGDELLLSPEFLKKCREIYSV----PITALENTGHEFAIKHLNIVDPLKDNNNLGR 315
               DG  LLL+  FL  C   Y+V     I         F  K+LN+VDPLK  NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294

Query: 316 SVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPD 369
           SV+KGNF+RIR A  +GA++L  IL  P   +   ++KFF+NT +R+G G RPD
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRHGSGRRPD 348


>gi|115452887|ref|NP_001050044.1| Os03g0336700 [Oryza sativa Japonica Group]
 gi|108708028|gb|ABF95823.1| expressed protein [Oryza sativa Japonica Group]
 gi|113548515|dbj|BAF11958.1| Os03g0336700 [Oryza sativa Japonica Group]
          Length = 1035

 Score =  357 bits (915), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 177/341 (51%), Positives = 234/341 (68%), Gaps = 24/341 (7%)

Query: 49  AEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDL 108
           AEE   E++  ++P   SE++R  V+ Y +RL+    G EVF +GSVPLKTYLPDGD+DL
Sbjct: 36  AEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVDL 95

Query: 109 TVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFN 166
           TVLG+ +    L   +  IL++E+Q  D+ FE+KD+Q + A+V+++KC+++NI VDISFN
Sbjct: 96  TVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVVDISFN 155

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--- 223
           Q  G+ ALCFLE VDR +GK+HL K S+ILIKAWCYYESR+LGAH+GLISTYALE +   
Sbjct: 156 QTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALETLILY 215

Query: 224 --------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL 269
                         VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP  + E     G +LL
Sbjct: 216 IFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTPGSDLL 275

Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCAL 329
              EFL       SV  T       EF  K+LNI+DPLK++NNLGRSV+K +F+RIR A 
Sbjct: 276 FDKEFLNN-----SVQKTDSNACNTEFRSKYLNIIDPLKEHNNLGRSVNKASFNRIRTAF 330

Query: 330 SYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           SYGAQ+LG++L L    +   +  FF NTL R G G+RPD+
Sbjct: 331 SYGAQKLGQVLLLQPELIPDEIYGFFKNTLNRIGSGVRPDI 371


>gi|108708029|gb|ABF95824.1| expressed protein [Oryza sativa Japonica Group]
          Length = 1004

 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 177/341 (51%), Positives = 234/341 (68%), Gaps = 24/341 (7%)

Query: 49  AEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDL 108
           AEE   E++  ++P   SE++R  V+ Y +RL+    G EVF +GSVPLKTYLPDGD+DL
Sbjct: 36  AEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVDL 95

Query: 109 TVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFN 166
           TVLG+ +    L   +  IL++E+Q  D+ FE+KD+Q + A+V+++KC+++NI VDISFN
Sbjct: 96  TVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVVDISFN 155

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--- 223
           Q  G+ ALCFLE VDR +GK+HL K S+ILIKAWCYYESR+LGAH+GLISTYALE +   
Sbjct: 156 QTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALETLILY 215

Query: 224 --------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL 269
                         VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP  + E     G +LL
Sbjct: 216 IFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTPGSDLL 275

Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCAL 329
              EFL       SV  T       EF  K+LNI+DPLK++NNLGRSV+K +F+RIR A 
Sbjct: 276 FDKEFLNN-----SVQKTDSNACNTEFRSKYLNIIDPLKEHNNLGRSVNKASFNRIRTAF 330

Query: 330 SYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           SYGAQ+LG++L L    +   +  FF NTL R G G+RPD+
Sbjct: 331 SYGAQKLGQVLLLQPELIPDEIYGFFKNTLNRIGSGVRPDI 371


>gi|302755776|ref|XP_002961312.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
 gi|300172251|gb|EFJ38851.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
          Length = 351

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 183/354 (51%), Positives = 241/354 (68%), Gaps = 31/354 (8%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D E W+ AE R  E++  IQP   SE +R+ V +Y++RLI   +  EVF FGSVPL+TY
Sbjct: 1   LDDERWVQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60

Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV-KIVKCSV 156
           LPDGDIDLT   GHQ+++E  A  V  +LE E+  +D+ F +K+VQY+ A+V KI+KC V
Sbjct: 61  LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
           +NI VDISFNQ+ GL  LCFLE+VDRLIG+DHLFKRS+IL+KAWCYYESRILGAH+GLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180

Query: 217 TYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           TYALE +                 VL+RFL++++ FDWD YC+S+ GP+ +S+LP++   
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237

Query: 260 TVENDGDELLLSPEFLKKCREIYSV----PITALENTGHEFAIKHLNIVDPLKDNNNLGR 315
               DG  LLL+  FL  C   Y+V     I         F  K+LN+VDPLK  NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294

Query: 316 SVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPD 369
           SV+KGNF+RIR A  +GA++L  IL  P   +   ++KFF+NT +R+G G RPD
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRHGSGRRPD 348


>gi|326490774|dbj|BAJ90054.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1030

 Score =  351 bits (901), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 172/329 (52%), Positives = 231/329 (70%), Gaps = 19/329 (5%)

Query: 61  QPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL 120
           QP   S+++R EV+++ +R++    G EVF FGSVPLKTYLPDGDIDLTV+G+ +    L
Sbjct: 43  QPTQASDRRRAEVVDHARRIVGTALGCEVFVFGSVPLKTYLPDGDIDLTVIGNTSCGSTL 102

Query: 121 ARFVCKILEN--EDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLE 178
              V  ILE+  E+ D+ FE+KD++++ A+V+++KC++ NI VDISFNQ  G+ A+ FLE
Sbjct: 103 IDDVYHILESGEENGDAEFEVKDLEHIDAEVRLIKCTIGNIIVDISFNQTGGICAVSFLE 162

Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------------- 223
            VDR +GK+HLFKRS+ILIK WCYYESR+LGAH+GLISTYALE +               
Sbjct: 163 LVDRKVGKNHLFKRSIILIKGWCYYESRLLGAHHGLISTYALETLILYVFNLFHKSLHGP 222

Query: 224 --VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
             VL+RFL+Y++ FDWD YCIS+NGPVA+SSLP ++ E +   GD+LL   EFL    E 
Sbjct: 223 LEVLYRFLEYFSKFDWDKYCISLNGPVALSSLPNLIVEGLNVPGDDLLFDREFLDNSVEK 282

Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
            S P    +    +F +K LNI+DPLK+ NNLGRSV++ NFHRIR A S+GA++LG+IL 
Sbjct: 283 ASAPPRNSDARCSKFRVKCLNIIDPLKECNNLGRSVNRANFHRIRTAFSFGARKLGQILM 342

Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           LP   +   +  FF NTLERN  G+R D+
Sbjct: 343 LPPELIPDDIFAFFKNTLERNENGVRSDI 371


>gi|414591190|tpg|DAA41761.1| TPA: hypothetical protein ZEAMMB73_453733 [Zea mays]
          Length = 918

 Score =  337 bits (864), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 173/358 (48%), Positives = 237/358 (66%), Gaps = 23/358 (6%)

Query: 32  SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
           +PP   P +I  +   +AE    E+L  + P   +E++R++VI Y+ RLI    G EVF 
Sbjct: 61  APP---PSAIRRDAVRVAEAAAGEVLLRVHPTREAERRRQDVIAYLTRLIGSSLGCEVFA 117

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQV 149
           FGSVPL+TYLPDGD+D+TVLG+  +   L   V  +L++E +  D+  ++  + ++ A+V
Sbjct: 118 FGSVPLRTYLPDGDVDITVLGNTWLNSTLIDDVRSMLQSEQENCDAELKLTGLHFIDAEV 177

Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K++KC ++NI VD+SFNQ+ G+S  CFLE VDR +GK+HLFKRS++L KAWCY+ESRILG
Sbjct: 178 KLIKCVIENIIVDVSFNQIGGVSTFCFLELVDRQVGKNHLFKRSIMLTKAWCYHESRILG 237

Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
           AH+GLISTYALE +                 VL++FL+Y++ FDWD Y IS+NGPV +SS
Sbjct: 238 AHHGLISTYALETLVLYIFNMFHKSLHGPLEVLYKFLEYFSKFDWDRYGISLNGPVDLSS 297

Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
           LP +  E  E  G ELLL  +F +   +   V     +    +F  K LNIVDPLK NNN
Sbjct: 298 LPSLTVEPTEVQG-ELLLGKDFHQGSLDRLVVIPNEFDGCDTQFRQKFLNIVDPLKANNN 356

Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           LGRSVSK NF+RIR A S+GAQ+LG+IL LP   +   +  FF NTL+R+GKG R D+
Sbjct: 357 LGRSVSKANFYRIRSAFSFGAQKLGQILLLPSEYICDEIYGFFSNTLKRHGKGERLDI 414


>gi|326517667|dbj|BAK03752.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 334

 Score =  331 bits (848), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 170/304 (55%), Positives = 214/304 (70%), Gaps = 19/304 (6%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P SI A  W   E+    ++  IQP++ SE +R  V++Y+QRLI    G EVFPFGSVPL
Sbjct: 27  PSSISAGAWRPFEDAAAAVVGRIQPSVSSEDRRAAVVHYVQRLIRCSVGCEVFPFGSVPL 86

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCS 155
           KTYLPDGDIDLT  G  + +E+LA  V  +LE+E+  +D+ FE+KDVQY+ A+VK+VKC 
Sbjct: 87  KTYLPDGDIDLTAFGSASSDENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCL 146

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
           VQNI VDISFNQ+ GL  LCFLE+VD   GK HLFK+S++LIKAWCYYESRILGAH+GLI
Sbjct: 147 VQNIVVDISFNQIGGLCTLCFLEQVDERFGKKHLFKKSIMLIKAWCYYESRILGAHHGLI 206

Query: 216 STYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           STYALE++                 VL+RFLDYY+ FDWDN  IS+ GPV +SSLPE+V+
Sbjct: 207 STYALEILVLYIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVPLSSLPELVS 266

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           +T +    + L   EFLK+  ++++VP  + E     F  K LNIVDPLK NNNLGRSVS
Sbjct: 267 DTPDTHDVDFLKREEFLKEFAQMFTVPPRSFERNNRLFLRKFLNIVDPLKQNNNLGRSVS 326

Query: 319 KGNF 322
           KG F
Sbjct: 327 KGFF 330


>gi|115488182|ref|NP_001066578.1| Os12g0283100 [Oryza sativa Japonica Group]
 gi|77554657|gb|ABA97453.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649085|dbj|BAF29597.1| Os12g0283100 [Oryza sativa Japonica Group]
 gi|222616913|gb|EEE53045.1| hypothetical protein OsJ_35772 [Oryza sativa Japonica Group]
          Length = 989

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 166/334 (49%), Positives = 230/334 (68%), Gaps = 30/334 (8%)

Query: 56  ILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQN 115
           +L  + P   +E++R++V+ Y++RL+    G EV  FGSVPLK+YLPDGD+D+TVLG+  
Sbjct: 56  VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGNTA 115

Query: 116 VEEDLARFVCKILENEDQDS--VFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSA 173
           ++      V  ILE+E+QDS    EIK + ++ A+VK++KC ++NI VDISFNQ+ G+S 
Sbjct: 116 LDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVST 175

Query: 174 LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV--------- 224
           LCFLE  DR +GK+HLFKRS++LIKAWCY+ESRILGAH+GL+STYALE +V         
Sbjct: 176 LCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFHK 235

Query: 225 --------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
                   L++FL+Y++ FDWD YCIS+NGPV +SSLP    E   +  DELL   + L 
Sbjct: 236 SLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-SSIQDELLFGKKTLP 294

Query: 277 KCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
           +  +  ++           F +KHLNI+DPLK +NNLGRSVS+G+F+RIR ALS+GAQ+L
Sbjct: 295 EVSDGSNI----------NFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQKL 344

Query: 337 GEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           G+IL L    +   +  FF NTL+R+G+G R DV
Sbjct: 345 GQILMLHSDLIPTEIFGFFANTLKRHGRGERSDV 378


>gi|218186672|gb|EEC69099.1| hypothetical protein OsI_37998 [Oryza sativa Indica Group]
          Length = 989

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 166/334 (49%), Positives = 230/334 (68%), Gaps = 30/334 (8%)

Query: 56  ILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQN 115
           +L  + P   +E++R++V+ Y++RL+    G EV  FGSVPLK+YLPDGD+D+TVLG+  
Sbjct: 56  VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGNTA 115

Query: 116 VEEDLARFVCKILENEDQDS--VFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSA 173
           ++      V  ILE+E+QDS    EIK + ++ A+VK++KC ++NI VDISFNQ+ G+S 
Sbjct: 116 LDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVST 175

Query: 174 LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV--------- 224
           LCFLE  DR +GK+HLFKRS++LIKAWCY+ESRILGAH+GL+STYALE +V         
Sbjct: 176 LCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFHK 235

Query: 225 --------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
                   L++FL+Y++ FDWD YCIS+NGPV +SSLP    E   +  DELL   + L 
Sbjct: 236 SLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-SSIQDELLFGKKTLP 294

Query: 277 KCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
           +  +  ++           F +KHLNI+DPLK +NNLGRSVS+G+F+RIR ALS+GAQ+L
Sbjct: 295 EVSDGSNI----------NFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQKL 344

Query: 337 GEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           G+IL L    +   +  FF NTL+R+G+G R DV
Sbjct: 345 GQILMLHSDLIPTEIFGFFANTLKRHGRGERSDV 378


>gi|414888115|tpg|DAA64129.1| TPA: hypothetical protein ZEAMMB73_121752 [Zea mays]
          Length = 942

 Score =  325 bits (833), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 166/335 (49%), Positives = 226/335 (67%), Gaps = 23/335 (6%)

Query: 55  EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
           E++  + P   +E++R++VI Y++RLI    G EVF FGSVPL+TYLPDGD+D+TVLG+ 
Sbjct: 69  EVVLRVHPTREAERRRQDVIAYLRRLIGSCLGCEVFAFGSVPLRTYLPDGDVDITVLGNT 128

Query: 115 NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
            +       V  +L++E +  D+ F++  +Q++ A+VK++KC ++NI VD+SFNQ+ G+S
Sbjct: 129 WLNSTFIDDVRSMLQSEQENCDAEFKLTGLQFINAEVKLIKCVIENIIVDVSFNQIGGVS 188

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV-------- 224
             CFLE VDR IG++HLFKRS++LIKAWCY+ESRILGAH+GLISTYALE +V        
Sbjct: 189 TFCFLELVDRQIGQNHLFKRSIMLIKAWCYHESRILGAHHGLISTYALETLVLYIFNMFH 248

Query: 225 ---------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFL 275
                    L+RFL+Y++ FDWD Y IS+NG V +SSL     E  +  G E LL  E  
Sbjct: 249 KSLHGPLEALYRFLEYFSKFDWDRYGISLNGQVDLSSL---TVEPTDVQG-ESLLGKELQ 304

Query: 276 KKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQR 335
           +   +   V     +  G +F  K LNI+DPLK NNNLGRSVSK NF+RIR A S+GAQ+
Sbjct: 305 QGYLDRLVVIPNEFDGCGTQFRQKFLNIIDPLKANNNLGRSVSKANFYRIRSAFSFGAQK 364

Query: 336 LGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           LG+IL LP   +   +  FF NTL+R+G G RPDV
Sbjct: 365 LGQILLLPSEYIRDEIYGFFANTLKRHGNGERPDV 399


>gi|242082774|ref|XP_002441812.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
 gi|241942505|gb|EES15650.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
          Length = 546

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 212/532 (39%), Positives = 281/532 (52%), Gaps = 93/532 (17%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE--------- 88
           P SI  + W   E     ++  IQP + SE  R  VI+Y++RL+    G++         
Sbjct: 26  PSSIPPDAWRRFESAALGVVNKIQPTVASENFRSAVIDYLKRLLGSRAGVQSWLLPFLPF 85

Query: 89  ------------------------------------VFPFGSVPLKTYLPDGDIDLTVLG 112
                                               VFPFGSVPLKTYLPDGDIDLT   
Sbjct: 86  HFYVFFGAKPVRDYEYKCVTVWIYFVGCALESLCDLVFPFGSVPLKTYLPDGDIDLTAFS 145

Query: 113 HQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAG 170
               +E+LA  V  IL +E   +DS F++KDVQY+ A+VK+VKC VQNI VDIS NQ+ G
Sbjct: 146 PAISDENLANQVYAILSSEQHRKDSEFDVKDVQYIHAEVKLVKCLVQNIVVDISVNQIGG 205

Query: 171 LSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM------- 223
           LS LCFLEKVD   GK HL KRS++LIK WCYYESRILGA  GL+STYALE++       
Sbjct: 206 LSTLCFLEKVDENFGKKHLLKRSIVLIKDWCYYESRILGAQNGLLSTYALEVLVLYVFLI 265

Query: 224 ----------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE--TVENDGDELLLS 271
                     VL+RFLD+Y+ FDWD+  IS+ GPV++SSLP +V +      D D  +  
Sbjct: 266 FHRSLGGPLAVLYRFLDFYSKFDWDSKGISLFGPVSLSSLPNLVTDPHLPAIDDDFFVPR 325

Query: 272 PEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSY 331
            + L+K  E +S P    E     F+ K LNIVDPLK +NNLGRSV+KGNF+RIR A  +
Sbjct: 326 EKILRKYAEDFSAPPRNSERDAQVFSRKFLNIVDPLKQSNNLGRSVNKGNFYRIRSAFDF 385

Query: 332 GAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRP-DVQVPIHAFGSG------KSEA 384
           GA++LG+IL +P       + +FF NTL+RN  G RP D   P+++   G       +  
Sbjct: 386 GARKLGKILQMPVCYTVNEVNQFFSNTLKRNHTGFRPDDGSSPLYSNPYGDLSYQFNNIL 445

Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTLPVTA----QFSPPSSPSQIPSRSAWDVLCQYVQGK 440
            DL+GD++  +  L Y Q +     P  +    + +PP  P Q  +    D      QG+
Sbjct: 446 NDLTGDFEINFRNLQYAQGFQQVN-PANSFYYGRLAPP--PLQYQNTHPSD-----SQGR 497

Query: 441 GNLVYQWGTEVFVPRLPFCHPYASQVRA--STFSTDE--GEKSRGTGTYIPE 488
            N          +P    C PY S   A       D+    ++RG GTY P+
Sbjct: 498 KNPHAYASMNGAIP----CPPYPSGCLAWRPFIQPDDPMAMRARGIGTYFPD 545


>gi|413924678|gb|AFW64610.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
          Length = 474

 Score =  323 bits (827), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 172/342 (50%), Positives = 222/342 (64%), Gaps = 20/342 (5%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLK 98
           SI  + W   E     ++C I P + S+  R  VI+Y+QRL   ++ G +V  FGSVPLK
Sbjct: 66  SISPDDWRRLEGATFSVMCKIHPTVSSQHLRARVIDYVQRLFRLHHDGYQVISFGSVPLK 125

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
           TYLPDGDIDLT+L     +E+L   VC IL++E+Q  DS FE+KDV+YVPA+VK+VKC V
Sbjct: 126 TYLPDGDIDLTLLCAAISDENLENEVCAILKSEEQRKDSEFEVKDVKYVPAEVKLVKCKV 185

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
           QNI VDIS NQ+ G + + FLEKVD+ +GK++L +RS++LIK WCYYES ILGA  GL+S
Sbjct: 186 QNIAVDISVNQIGGPNKVYFLEKVDQNLGKNNLLRRSIMLIKHWCYYESCILGAQRGLVS 245

Query: 217 TYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           TYALE +                 VL+RFLDYY+ FDWDN  IS+ GP+++SSLPE+V E
Sbjct: 246 TYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPISLSSLPELVTE 305

Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
                 D  L    FLK C + +SVP    E     F+ K +NIVDPLK +NNLGRS+SK
Sbjct: 306 PPYTRDDGFLSREAFLKDCAKAFSVPPINSEENPQVFSKKFVNIVDPLKQSNNLGRSISK 365

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLER 361
           GN  RIR    +GA +LG+IL  P       + +FF NTL R
Sbjct: 366 GNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRNTLSR 407


>gi|30693508|ref|NP_190730.2| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
 gi|332645292|gb|AEE78813.1| NT domain of poly(A) polymerase and terminal uridylyl
           transferase-containing protein [Arabidopsis thaliana]
          Length = 755

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 167/333 (50%), Positives = 209/333 (62%), Gaps = 46/333 (13%)

Query: 44  ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
           ELW+  EE  +EI+  + P +VSE +R++VI Y+Q+LI    G EV  FGSVPLKTYLPD
Sbjct: 32  ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
           GDIDLT  G    EE+LA  V  +LE E+ +  S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92  GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           DISFNQ+ G+  LCFLEK                                          
Sbjct: 152 DISFNQIGGICTLCFLEK------------------------------------------ 169

Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
             VL++FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN G++LLL+ EFLK+C E+
Sbjct: 170 --VLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEM 227

Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
           YSVP    E     F  KHLNIVDPLK+ NNLGRSVSKGNF+RIR A +YGA++LG++  
Sbjct: 228 YSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFL 287

Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
                +   L KFF N L R+G G RPDV   I
Sbjct: 288 QSDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 320



 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)

Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
           +DLSGDY+S  N L +G+ + DY    P++   SPP  P Q+P+ ++W+V+   +  + N
Sbjct: 517 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 574

Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
                     VPR  F H     +    F  +E  K RGTGTY P     R RP      
Sbjct: 575 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 634

Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
              +AR+P + N   +S+   +++     P R+  E       +   S +M ++     S
Sbjct: 635 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 685

Query: 559 LEEFPLLPVTKNLP 572
           L+ FP    + N P
Sbjct: 686 LDSFPDTNGSTNHP 699


>gi|147867191|emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]
          Length = 1388

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 165/354 (46%), Positives = 218/354 (61%), Gaps = 51/354 (14%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           ++D E   LAEER ++++  IQP   SE++R+ V +Y++ LI   +  +VFPFGSVPLKT
Sbjct: 29  ALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSVPLKT 88

Query: 100 YLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
           YLPDGDIDLT      N+++  A  V  ILE E++  D+ F +K+VQY+ A+V       
Sbjct: 89  YLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEV------- 141

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
                                   D LI + HLFKRS+ILIKAWCYYESRILGAH+GLIS
Sbjct: 142 ------------------------DHLISQKHLFKRSIILIKAWCYYESRILGAHHGLIS 177

Query: 217 TYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           TYALE +VL+                 RFL++++ FDW+NYC+S+ GPV ISSLP++ A+
Sbjct: 178 TYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPDVTAD 237

Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
               D  ELLLS  FL  C  +Y+V     EN    F  K+ N++DPL+ NNNLGRSVSK
Sbjct: 238 PPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGRSVSK 297

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
           GNF RIR A ++GAQRL  +L  P   +   + +FF+NT ER+GKG RPD   P
Sbjct: 298 GNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERHGKGDRPDAPSP 351


>gi|6572083|emb|CAB63026.1| putative protein [Arabidopsis thaliana]
          Length = 764

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 167/333 (50%), Positives = 209/333 (62%), Gaps = 46/333 (13%)

Query: 44  ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
           ELW+  EE  +EI+  + P +VSE +R++VI Y+Q+LI    G EV  FGSVPLKTYLPD
Sbjct: 32  ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
           GDIDLT  G    EE+LA  V  +LE E+ +  S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92  GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           DISFNQ+ G+  LCFLEK                                          
Sbjct: 152 DISFNQIGGICTLCFLEK------------------------------------------ 169

Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
             VL++FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN G++LLL+ EFLK+C E+
Sbjct: 170 --VLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEM 227

Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
           YSVP    E     F  KHLNIVDPLK+ NNLGRSVSKGNF+RIR A +YGA++LG++  
Sbjct: 228 YSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFL 287

Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
                +   L KFF N L R+G G RPDV   I
Sbjct: 288 QSDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 320



 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)

Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
           +DLSGDY+S  N L +G+ + DY    P++   SPP  P Q+P+ ++W+V+   +  + N
Sbjct: 517 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 574

Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
                     VPR  F H     +    F  +E  K RGTGTY P     R RP      
Sbjct: 575 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 634

Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
              +AR+P + N   +S+   +++     P R+  E       +   S +M ++     S
Sbjct: 635 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 685

Query: 559 LEEFPLLPVTKNLP 572
           L+ FP    + N P
Sbjct: 686 LDSFPDTNGSTNHP 699


>gi|218200261|gb|EEC82688.1| hypothetical protein OsI_27344 [Oryza sativa Indica Group]
          Length = 1001

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 164/337 (48%), Positives = 223/337 (66%), Gaps = 27/337 (8%)

Query: 55  EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
           E++  +QP   +E+ R+ +I Y++ L     G EVF FGSVPLKTYLPDGDID+T+LG+ 
Sbjct: 54  EVVLRVQPTEEAERTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113

Query: 115 NVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
             +      V  ILE E+Q+  +   I  +Q++ A+VK++KC + NI VDISFNQ+ G++
Sbjct: 114 APDSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGVT 173

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
            LC LE VD  +G DHLFKRS++LIKAWCY+ES ILGAH GLISTYALE++         
Sbjct: 174 TLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIFH 233

Query: 224 --------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG--DELLLSPE 273
                   VL++FL+Y++ FDWD YCIS+NGPV +SSLP +   TVE  G  DELL  P 
Sbjct: 234 KSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNL---TVEPSGIHDELLFGPN 290

Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGA 333
               C  +  +   + + +   F  K+LNI+DP+K +NNLGRSVSKG+F+RIR A S+GA
Sbjct: 291 --GSCDRLIVLKKDS-DGSNMNFRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGA 347

Query: 334 QRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           Q L +IL LP   +   +  FF+NTL+ +G+G R DV
Sbjct: 348 QNLSQILMLPTDLIPTEIFGFFVNTLKSHGRGKRSDV 384


>gi|222637691|gb|EEE67823.1| hypothetical protein OsJ_25591 [Oryza sativa Japonica Group]
          Length = 1001

 Score =  314 bits (805), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 163/337 (48%), Positives = 223/337 (66%), Gaps = 27/337 (8%)

Query: 55  EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
           E++  +QP   +++ R+ +I Y++ L     G EVF FGSVPLKTYLPDGDID+T+LG+ 
Sbjct: 54  EVVLRVQPTEEADRTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113

Query: 115 NVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
             +      V  ILE E+Q+  +   I  +Q++ A+VK++KC + NI VDISFNQ+ G++
Sbjct: 114 APDSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGVT 173

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
            LC LE VD  +G DHLFKRS++LIKAWCY+ES ILGAH GLISTYALE++         
Sbjct: 174 TLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIFH 233

Query: 224 --------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG--DELLLSPE 273
                   VL++FL+Y++ FDWD YCIS+NGPV +SSLP +   TVE  G  DELL  P 
Sbjct: 234 KSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNL---TVEPSGIHDELLFGPN 290

Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGA 333
               C  +  +   + + +   F  K+LNI+DP+K +NNLGRSVSKG+F+RIR A S+GA
Sbjct: 291 --GSCDRLIVLKKDS-DGSNMNFRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGA 347

Query: 334 QRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           Q L +IL LP   +   +  FF+NTL+ +G+G R DV
Sbjct: 348 QNLSQILMLPTDLIPTEIFGFFVNTLKSHGRGKRSDV 384


>gi|147820621|emb|CAN67650.1| hypothetical protein VITISV_005081 [Vitis vinifera]
          Length = 1572

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 157/301 (52%), Positives = 195/301 (64%), Gaps = 44/301 (14%)

Query: 88  EVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP 146
            VF FGSVPLKTYLPDGDIDLT    +QN+++  A                         
Sbjct: 227 RVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWAN------------------------ 262

Query: 147 AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            QVKI+KC V+NI VDISFNQ+ GL  LCFLE+VD LI ++HLFKRS+ILIKAWCYYESR
Sbjct: 263 -QVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESR 321

Query: 207 ILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVA 249
           ILGAH+GLISTYALE +VL+                 RFL+++++FDWDN+C+S+ GPV 
Sbjct: 322 ILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVP 381

Query: 250 ISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKD 309
           ISSLP++ AE    D  ELLLS  FL  C  +Y+V     E  G  F  KH N++DPL+ 
Sbjct: 382 ISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRV 441

Query: 310 NNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPD 369
           NNNLGRSVSKGNF RIR A ++GA+RL  +L  P   +   + + F+NT ER+G G RPD
Sbjct: 442 NNNLGRSVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERHGSGHRPD 500

Query: 370 V 370
            
Sbjct: 501 T 501


>gi|326521958|dbj|BAK04107.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1031

 Score =  302 bits (773), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 150/335 (44%), Positives = 227/335 (67%), Gaps = 22/335 (6%)

Query: 56  ILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQN 115
           +L  + P   +E++R ++I+Y + LI   +G EVF FGSVPLKTYLPDGD+D+T+L + N
Sbjct: 157 VLLRLHPTEEAERRRHKIIDYAKNLIGTTFGCEVFAFGSVPLKTYLPDGDVDITILTNVN 216

Query: 116 VEEDLARFVCKIL--ENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSA 173
           ++ +  + VC +L  E  ++ + F +K++Q + A+VKI+KC + N+ +DISFNQ+ G+S 
Sbjct: 217 LDNNFVQDVCCLLAAEQSNEAAEFALKEIQVINAKVKIIKCVIDNLVMDISFNQVGGVST 276

Query: 174 LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV--------- 224
           LCFLE  ++ IGKDHLFKRS+ILIKAWCY+E  I G+++ L+STYALE+++         
Sbjct: 277 LCFLEMANKEIGKDHLFKRSIILIKAWCYHEGSIHGSNHWLMSTYALEVLILYIFNLFHT 336

Query: 225 --------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
                   L++FL+YY+ FDWDN C+++NGPV +SSL    A    ++ +ELLLS E L+
Sbjct: 337 VLHGPLQALYKFLEYYSKFDWDNQCLTLNGPVPLSSLRNYTAGPTGSN-EELLLSKEPLE 395

Query: 277 -KCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQR 335
              R ++ +P  + +  G EF +K+LNI+DPLK  NNLG S+S+ N   IR A + GA++
Sbjct: 396 PSLRRLFDLPAGS-DGRGPEFRLKYLNIIDPLKGGNNLGTSISEANSRVIRDAFAAGAEK 454

Query: 336 LGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           LG+IL LP   +   +  FF +TL ++G+G R D+
Sbjct: 455 LGQILKLPCELIAEQVYVFFTHTLGKHGRGERQDL 489


>gi|357463851|ref|XP_003602207.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
 gi|355491255|gb|AES72458.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
          Length = 768

 Score =  301 bits (770), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 157/348 (45%), Positives = 225/348 (64%), Gaps = 22/348 (6%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
           P  ++   W   E+R  E+L  ++P   SE  R  +++YI+ LI  +  ++VF FGSVPL
Sbjct: 19  PKVLERSKWSQVEDRTIELLQFLEPNPKSETLRNNIVSYIKGLIISHVPVKVFEFGSVPL 78

Query: 98  KTYLPDGDIDLTVLGHQNV-EEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKC 154
           KTYL DGDIDLT+ G+  +  E     + +ILE+E  ++ S F +K+VQ V A+VKI+KC
Sbjct: 79  KTYLRDGDIDLTIFGNNELFPEIFIPHIQQILESEMNNEFSKFRVKEVQLVNAEVKIIKC 138

Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
            V+   +DISFNQ++GL +LCFL++VD LI ++H+FKRSVILIKAWCY+ESR+LG+  GL
Sbjct: 139 LVEKFVIDISFNQLSGLCSLCFLDEVDYLISRNHIFKRSVILIKAWCYHESRLLGSKSGL 198

Query: 215 ISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIV 257
            STYALE++VL+                 RFL++++ FDW NYCIS++GPV + SLP + 
Sbjct: 199 FSTYALEILVLYLFNLYNNEFVGPLEVLFRFLEFFSKFDWGNYCISLSGPVPLDSLPNMT 258

Query: 258 AETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
           A+    D  +LLL+  FL   +  Y       +N    F  KH+NI+DPL++NNNLG S+
Sbjct: 259 ADCPRKDRQDLLLTESFLIASKFCYG--WRNQKNREKHFVSKHINIIDPLQENNNLGHSI 316

Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKG 365
           S+GNF RI+ A++YGA+++  IL      L    + FF NT  R+G G
Sbjct: 317 SRGNFFRIKSAIAYGAEQMMRILDCTDEYLISEFDHFFENTWNRHGNG 364


>gi|357491471|ref|XP_003616023.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
 gi|355517358|gb|AES98981.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
          Length = 387

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 168/354 (47%), Positives = 215/354 (60%), Gaps = 83/354 (23%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI---------- 87
           P S+  E    AE+   EIL  IQP + ++++R+EV++Y+QRLI   YG           
Sbjct: 33  PSSVTDEASSAAEQTTAEILRRIQPTLAADRRRREVVDYVQRLIR--YGARCEKLLPNVW 90

Query: 88  -------------EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD 134
                        +VFP+GSVPLKTYLPDGDIDLT L  QN+E+ L   V  +L  E+ +
Sbjct: 91  RKLDFEVRIFRIGKVFPYGSVPLKTYLPDGDIDLTALSPQNIEDGLVSDVHAVLRGEENN 150

Query: 135 SV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKR 192
               +E+KDV+++ A+         NI VDISFNQ+ GLS LCFLEKVDRL+ KDH+FKR
Sbjct: 151 DAAEYEVKDVRFIDAE---------NIVVDISFNQLGGLSTLCFLEKVDRLVAKDHIFKR 201

Query: 193 SVILIKAWCYYESRILGAHYGLISTYALE------------------------------M 222
           S+ILIKAWCYYESRILGAH+GLISTYALE                              M
Sbjct: 202 SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHRFHVSLDGPLAEKERKRNLNHIM 261

Query: 223 MVLH----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGD 266
           +V+H                RFLDY++ FDWDNYC+S+ GPVA SS P++VAE +EN G+
Sbjct: 262 LVMHPFNKHFMHPALFQVLYRFLDYFSKFDWDNYCVSLKGPVAKSSPPDVVAEALENGGN 321

Query: 267 ELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
             LL+ EF++ C E +SVP   L+     F  KHLNI+DPLK+NNNLGRSV+KG
Sbjct: 322 T-LLTDEFIRSCVESFSVPPRGLDLNLRAFPHKHLNIIDPLKENNNLGRSVNKG 374


>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 2035

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 162/392 (41%), Positives = 227/392 (57%), Gaps = 62/392 (15%)

Query: 44  ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEV-------------- 89
           + W  AE +  E++ +++P  +SE++R  V  +++RLI   +  EV              
Sbjct: 585 DWWTRAEGQTAELIDSLKPTRLSEERRTAVTGFVERLIRDRFECEVSALPHELNGFIVRS 644

Query: 90  ----------FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILEN--EDQDSVF 137
                       FGSVPLKTYLPDGDIDL +    +++E  A+ V K L+   +D D+ F
Sbjct: 645 SAGAVRYSAVIRFGSVPLKTYLPDGDIDLYIFARNDLKETWAQDVLKALKQAEDDADAEF 704

Query: 138 EIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILI 197
            +K+VQY+ A+VK++KC V+NI VDISFNQ+ GLS LCFLE+VD  +G +HLFKRSVIL+
Sbjct: 705 RVKEVQYIQAEVKLIKCLVENIVVDISFNQIGGLSTLCFLERVDEEVGLNHLFKRSVILV 764

Query: 198 KAWCYYESRILGAHYGLISTYALEMM-------------------VLHRFLDYYNTFDWD 238
           KAWCYYESRILGAH+GLIST+ALE +                   VL+ FL Y+  FDWD
Sbjct: 765 KAWCYYESRILGAHHGLISTFALETLVLYIFHVFHSMRSLHGPLEVLYLFLTYFCNFDWD 824

Query: 239 NYCISINGPVAISSLPEIVAETVEND-------------GDELLLSPEFLKKCREIYS-V 284
            YC+SI GPV +  +P+  +E  + D             G +L  S EF+++C   YS V
Sbjct: 825 QYCLSIWGPVPLDHIPKNSSELSQKDGGWRTVARSPWEVGGKLYFSEEFIEECINRYSDV 884

Query: 285 PITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPG 344
              +  + G  F  K+LN++DP++  NNLGRSV+ G+F RIR A   GA+ LGE+   P 
Sbjct: 885 RAGSESSQGRIFNPKYLNVLDPIRHTNNLGRSVNVGSFKRIRSAFGLGARTLGEVFECPK 944

Query: 345 HCLGMGLEKFFINTLERNGK---GLRPDVQVP 373
             +    + FF  T +  G+     RPD  +P
Sbjct: 945 DQITEKFKSFFSCTFKSLGRYRSAGRPDSGIP 976


>gi|168035287|ref|XP_001770142.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678668|gb|EDQ65124.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1504

 Score =  281 bits (720), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 173/444 (38%), Positives = 248/444 (55%), Gaps = 63/444 (14%)

Query: 92  FGSVPLKTYLPDGDIDLTVLG-HQNVEEDLARFVCKILEN--EDQDSVFEIKDVQYVPAQ 148
           FGSVPLKTYLPDGDIDL+      +V+    +     L+   ++ +S F +K+VQ + A+
Sbjct: 144 FGSVPLKTYLPDGDIDLSAFTPSPDVKRTWIQDTYNALQKAKDNPNSEFRVKEVQLIHAE 203

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VKIVKC V+NI VD+SF+Q+ GL  LCFL +VD+LIG+DHLFKRS+IL+KAWCYYESRIL
Sbjct: 204 VKIVKCFVENILVDVSFDQLGGLGTLCFLVEVDKLIGEDHLFKRSIILVKAWCYYESRIL 263

Query: 209 GAHYGLISTYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGPVAIS 251
           GAH GL+STYA+E                 + VL+ FL+++++FDWDNYC+S++ P+ + 
Sbjct: 264 GAHCGLMSTYAVEALVLYIFDKFHASLRGPLQVLYLFLEFFSSFDWDNYCVSLSSPIPLK 323

Query: 252 SLP---------EIVAETVENDGDELLLSPEFLKKCREIYS-VPITALENTGHEFAIKHL 301
           SL          + +A +   DG EL  + EFL  C   Y  VP++ +  + ++F +K L
Sbjct: 324 SLSKDSEKLEDLQKLALSTRRDGGELFFTKEFLVACETEYGVVPVSQITKS-NKFTVKCL 382

Query: 302 NIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFIN-TLE 360
           NI DPL+ +NNLGRSV++GNF RIR A  +GA+ L  +L+     +   LE+FF N  L 
Sbjct: 383 NISDPLRSSNNLGRSVNQGNFARIRRAFDFGARTLRRVLSCTEEDVPAELEQFFKNCNLR 442

Query: 361 RNGKGLRPDVQVP----------IHAFGSGKSEA-----ADLSGDYDSYYNGLLYGQCYH 405
            +G   RPDV  P              G  K EA     +DL+     + NG  + +   
Sbjct: 443 LHGGYQRPDVASPRPCRTKASNAAQKIGCAKREAMGNERSDLTSHSTPFQNGNQFLRTEP 502

Query: 406 DYTLPVTA-QFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYAS 464
           ++ LPV    F   + P      S+ ++LC  +Q  G+ V QW            HP  S
Sbjct: 503 NHALPVEQIHFDSNALPG---GESSQEILCNDMQCTGDSV-QW-----------THPRCS 547

Query: 465 QVRASTFSTDEGEKSRGTGTYIPE 488
            ++ +      G +  GT   + E
Sbjct: 548 SLKTTPCREKNGSQFVGTPKSVAE 571


>gi|358347363|ref|XP_003637727.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
 gi|355503662|gb|AES84865.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
          Length = 827

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 133/239 (55%), Positives = 167/239 (69%), Gaps = 17/239 (7%)

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VK+VKC V+NI VDISFNQ+ GL  LCFLE+VD LI  +HLFKRS+ILIKAWCYYESRIL
Sbjct: 109 VKLVKCLVENIVVDISFNQLGGLCTLCFLEEVDGLINHNHLFKRSIILIKAWCYYESRIL 168

Query: 209 GAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAIS 251
           GAH+GLISTYALE +VL+                 RFL++++ FDWDN+C+S+ GPV IS
Sbjct: 169 GAHHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPIS 228

Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
           SLP++ AE    D  ELLL   FL  C  +Y+V     EN G  F  KH N++DPL+ NN
Sbjct: 229 SLPDVTAEPPRKDAGELLLHKSFLDACSTVYAVFPGGPENQGQPFVSKHFNVIDPLRVNN 288

Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           NLGRSVSKGNF RIR A ++GA++L  +L  P   L + + +FF+NT +R+G G RPD 
Sbjct: 289 NLGRSVSKGNFFRIRSAFAFGAKKLARLLDCPKDELFLEVNQFFLNTWDRHGSGQRPDA 347


>gi|147817122|emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]
          Length = 1147

 Score =  265 bits (676), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 129/211 (61%), Positives = 155/211 (73%), Gaps = 17/211 (8%)

Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH------------ 226
           K+DRLIGKDHLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+            
Sbjct: 405 KIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSLLNGP 464

Query: 227 -----RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
                +FLDY++ FDWDNYC+S+NGPV ISSLPE++AET EN G + LL  + L+ C + 
Sbjct: 465 LAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLGNDXLRDCLDR 524

Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
           +SVP   LE     F  KH NIVDPLK+NNNLGRSVSKGNF+RIR A +YGA++LG IL 
Sbjct: 525 FSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILL 584

Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
            P   +   L KFF NTLER+G+G RPDV +
Sbjct: 585 QPEDKISEELCKFFTNTLERHGRGQRPDVDL 615



 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/117 (56%), Positives = 79/117 (67%), Gaps = 10/117 (8%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVP 146
           VFPFGSVPLKTYLPDGDIDLT  G   VE+ LA  V  +LE EDQ+    F +KDVQ + 
Sbjct: 186 VFPFGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIH 245

Query: 147 AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKV--------DRLIGKDHLFKRSVI 195
           A+VK+VKC VQNI VDISFNQ+ GL  LCFLE+         +R + +  L+KR  I
Sbjct: 246 AEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQQKAIWDGVEERFLKRLSLWKRQYI 302



 Score = 76.6 bits (187), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 44/113 (38%), Positives = 57/113 (50%), Gaps = 4/113 (3%)

Query: 379 SGKSEA----ADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
           SG  EA    ADLSGDYDS++N L YG   +DY     A   P + PSQ  S ++WD + 
Sbjct: 845 SGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ 904

Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
           Q    + N+  Q      +PR PF       +  + F  +E  K RGTGTY P
Sbjct: 905 QSAHIRRNIFPQITANGIIPRPPFYPMNPPMISGTGFGVEEMPKPRGTGTYFP 957



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/57 (49%), Positives = 37/57 (64%), Gaps = 2/57 (3%)

Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE 88
          S PN  P +I A  W  AE  +QEI+C +QP  VSE++RKEV++Y+Q LI    G E
Sbjct: 27 SHPN--PPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCE 81


>gi|297745772|emb|CBI15828.3| unnamed protein product [Vitis vinifera]
          Length = 929

 Score =  261 bits (668), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 132/241 (54%), Positives = 166/241 (68%), Gaps = 24/241 (9%)

Query: 155 SVQNIPVDISFNQMAGLSALCFLE-------KVDRLIGKDHLFKRSVILIKAWCYYESRI 207
           SVQN   +  +  M G+  LC           VDRLIGKDHLFKRS+ILIK+WCYYESRI
Sbjct: 219 SVQNHKRENPWFHMYGVQFLCNYSFKPLLSVIVDRLIGKDHLFKRSIILIKSWCYYESRI 278

Query: 208 LGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAI 250
           LGAH+GLISTYALE++VL+                 RFLDY++ FDWDNYCIS+NGPV  
Sbjct: 279 LGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCK 338

Query: 251 SSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDN 310
           SSLP+IVAE  EN  D+LLLS EFL+ C +++SVP   LE     F +KHLNI+DPL++N
Sbjct: 339 SSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLREN 398

Query: 311 NNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           NNLGRSV+KGNF+RIR A  YG+ +LG+IL+LP   +   L+ FF +TLER+      ++
Sbjct: 399 NNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERHRSKYMAEI 458

Query: 371 Q 371
           Q
Sbjct: 459 Q 459



 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 78/141 (55%), Positives = 104/141 (73%), Gaps = 2/141 (1%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           I  + W  AE   QEI+  +QP + S ++R+EVI+Y+QRLI    G EVFP+GSVPLKTY
Sbjct: 35  IAGDSWAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTY 94

Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQN 158
           L DGDIDLT L   NVEE LA  V  +L+ E+Q  ++ FE+KD+Q++ A+VK+VKC V++
Sbjct: 95  LLDGDIDLTALCSSNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKD 154

Query: 159 IPVDISFNQMAGLSALCFLEK 179
           I +DISFNQ+ GLS LCFLE+
Sbjct: 155 IVIDISFNQLGGLSTLCFLEQ 175



 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 71/137 (51%), Gaps = 9/137 (6%)

Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVY 445
           DLSGDYDS+   L YGQC + + LP     SPP SPSQ+   + WD + Q++Q   NL  
Sbjct: 672 DLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPLSPSQLQINTPWDKVRQHLQFTQNLHS 731

Query: 446 QWGTE--VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRA 503
           Q  +   +     P  HP  S    + F  ++ +K RGTGTY P  +  P    +RPV  
Sbjct: 732 QMDSNGVILGNHFPVKHPARS---ITAFGLEDKQKPRGTGTYFPNMSHLP--NRDRPVGQ 786

Query: 504 RNPESLNHVPVSKLRRK 520
           R  ++L     S+L R+
Sbjct: 787 RRNQALE--SHSQLHRR 801


>gi|110738268|dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]
          Length = 660

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/212 (59%), Positives = 152/212 (71%), Gaps = 17/212 (8%)

Query: 180 VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH------------- 226
           +D LIGKDHLFKRS+ILIKAWCYYESRILGA +GLISTYALE +VL+             
Sbjct: 1   IDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPL 60

Query: 227 ----RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIY 282
               +FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN G++LLL+ EFLK+C E+Y
Sbjct: 61  AVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEMY 120

Query: 283 SVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTL 342
           SVP    E     F  KHLNIVDPLK+ NNLGRSVSKGNF+RIR A +YGA++LG++   
Sbjct: 121 SVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFLQ 180

Query: 343 PGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
               +   L KFF N L R+G G RPDV   I
Sbjct: 181 SDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 212



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)

Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
           +DLSGDY+S  N L +G+ + DY    P++   SPP  P Q+P+ ++W+V+   +  + N
Sbjct: 409 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 466

Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
                     VPR  F H     +    F  +E  K RGTGTY P     R RP      
Sbjct: 467 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 526

Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
              +AR+P + N   +S+   +++     P R+  E       +   S +M ++     S
Sbjct: 527 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 577

Query: 559 LEEFPLLPVTKNLP 572
           L+ FP    + N P
Sbjct: 578 LDSFPDTNGSTNHP 591


>gi|357116041|ref|XP_003559793.1| PREDICTED: uncharacterized protein LOC100830879 [Brachypodium
           distachyon]
          Length = 899

 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 139/313 (44%), Positives = 194/313 (61%), Gaps = 37/313 (11%)

Query: 41  IDAELWLLAEE----------RIQE-----ILCTIQPAIVSEKKRKEVINYIQRLINGYY 85
           +D +LW  A E          R+ E     +L  + P   +E++R++V ++ +RLI   +
Sbjct: 16  VDGKLWEPATEKTAIQVPEQMRVAEAAAAGVLRCLLPTEEAERRRRQVTDHARRLIGTNF 75

Query: 86  GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQ 143
           G +V  +GSVPLKTYLPDGDID+T+L H+ ++  +   V  +L  E++  D+ F ++  +
Sbjct: 76  GCQVLTYGSVPLKTYLPDGDIDVTILTHKPLDSTIIDDVRNLLNAEEKNTDAEFVLESRR 135

Query: 144 YVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYY 203
           YV AQVK+ KC++ NI VDISFNQ+ G+S LCFLE VD  +GKDHLFKRS+ILIKAWCY 
Sbjct: 136 YVDAQVKVFKCNIANIDVDISFNQIGGVSTLCFLELVDTEVGKDHLFKRSIILIKAWCYN 195

Query: 204 ESRILGAHYGLISTYALEMMV-----------------LHRFLDYYNTFDWDNYCISING 246
           E+RI G+   L+STYALE+++                 L+ FL+YY+ FDW  YC++++G
Sbjct: 196 EARIQGSDQWLLSTYALEILILYIFNMFHNSLHGPFEALYMFLEYYSKFDWGKYCVTLDG 255

Query: 247 PVAISSLPEIVAE-TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVD 305
           PV +SSL    AE  V N  DELLL  E L    +   V     +    EF  K LNI+D
Sbjct: 256 PVPLSSLANFTAEPAVAN--DELLLGKESLSASSDRLLVLPKGSDRHDPEFRPKILNIID 313

Query: 306 PLKDNNNLGRSVS 318
           PLK +NNLGRS+S
Sbjct: 314 PLKGDNNLGRSIS 326


>gi|384253068|gb|EIE26543.1| hypothetical protein COCSUDRAFT_39611 [Coccomyxa subellipsoidea
           C-169]
          Length = 1155

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 136/293 (46%), Positives = 182/293 (62%), Gaps = 24/293 (8%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARF-VCKILENEDQDSV--FEIKDV 142
           +E + FGSVPLKTYLPDGDIDL V  G      D+  + +  +LE E ++++    +KDV
Sbjct: 4   VEAYMFGSVPLKTYLPDGDIDLAVFQGKGPRLRDVWTYELSALLEAEGRNALNPHRVKDV 63

Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
           Q + A+VK++KC V NI VDISF+ + GL  + FLE +DR IGK HLFKRSVIL+KAWCY
Sbjct: 64  QIINAEVKLLKCLVDNIVVDISFDTLGGLCTVAFLESIDRHIGKQHLFKRSVILVKAWCY 123

Query: 203 YESRILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISIN 245
           YESR+LGAH+GL+STYALE MVL+                 +FL  ++ FDWD + +S+ 
Sbjct: 124 YESRLLGAHHGLLSTYALETMVLYIFNMYHHELQSPLKVLRKFLVVFSKFDWDGHALSLQ 183

Query: 246 GPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVD 305
           GP+ +SS P+   E V       LL  + LK   E+YS P+   +  G  F IK++NI+D
Sbjct: 184 GPIPLSSFPDPQVEPVAGAEGGALLRGDVLKTMLEMYS-PVQ--QGPGKAFTIKNMNIMD 240

Query: 306 PLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINT 358
           PL   NNLGRSV+K +  RIR AL++G   L  I    G      ++ FF NT
Sbjct: 241 PLLPTNNLGRSVNKASKARIRKALAHGCHMLDSIFDKVGQEATEAVDGFFRNT 293


>gi|159471748|ref|XP_001694018.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277185|gb|EDP02954.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 633

 Score =  244 bits (624), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 150/395 (37%), Positives = 218/395 (55%), Gaps = 48/395 (12%)

Query: 27  SSSYSSPPNLYPFSIDAELWLLAEE-------RIQEILCTIQPAIVSEKKRKEVINYIQR 79
           S S+S P   YP S  A L  L+ E       R   ++  I+P  +S ++R  +  ++ +
Sbjct: 2   SQSFSQP---YP-SAPAALRKLSPEFGNDLLSRTDTLISRIRPTTLSLQRRFVITEHVTQ 57

Query: 80  LINGYYG---IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL--------ARFVCKIL 128
           L+   +    +   PFGSVPLKTYLPDGDIDL++  + +  + L        A  +   L
Sbjct: 58  LVKRCFAPHDVTAVPFGSVPLKTYLPDGDIDLSIYSYSSRAQSLKDQLRDTWATTLQLCL 117

Query: 129 ENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
           E+E  +  + F++ +VQ + A+VK++KC V NI VDISF Q+ GL+   FLE VD  + K
Sbjct: 118 EDEANNPHAAFKVANVQVIHAEVKLLKCLVDNIVVDISFFQIGGLNTYNFLEDVDAFVDK 177

Query: 187 -----DHLFKRSVILIKAWCYYESRILGAHYGLISTYALE-----------------MMV 224
                 HLFK S+IL+K WCYYESR+LGAH+GLISTYALE                 + V
Sbjct: 178 AITARKHLFKDSIILVKGWCYYESRVLGAHHGLISTYALETLVLYVINLYHRELSNPLQV 237

Query: 225 LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSV 284
           L++FL   + FDW+ YC+++ GP+ ++S P  V ET E    E LL+  F+ +    Y+ 
Sbjct: 238 LYKFLVECSGFDWERYCLTLQGPIPLASFPNPVVETPEPLQREPLLTEHFMTRAYNKYTA 297

Query: 285 P-ITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLP 343
           P + A+      FAIK LN++DP+  NNNLGRSVSK ++ RIR A  +GA+ L  I    
Sbjct: 298 PQVAAMGGEVKPFAIKQLNVMDPILPNNNLGRSVSKASYLRIRRAFEHGARMLAAIAEQT 357

Query: 344 GHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFG 378
                +   + F N   +     RP+ + P+ A G
Sbjct: 358 KELGAVVASRNFDNFFGKVWNAQRPNRK-PLAATG 391


>gi|255083767|ref|XP_002508458.1| predicted protein [Micromonas sp. RCC299]
 gi|226523735|gb|ACO69716.1| predicted protein [Micromonas sp. RCC299]
          Length = 1269

 Score =  239 bits (611), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 137/354 (38%), Positives = 206/354 (58%), Gaps = 47/354 (13%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE---VFPFGSVPLKTYLPDGDIDLT 109
           + E++  ++P   S+++R+ V  +I  L++G +  E   V  FGSVPL+TYLPDGDID+ 
Sbjct: 28  VNELIDVLRPTEQSDRRRRGVFRHIASLVDGCFAGENVLVTAFGSVPLRTYLPDGDIDVC 87

Query: 110 VLG-HQNVEED-----LARFVCKILENEDQDSV--------FEIKDVQYVPAQVKIVKCS 155
           +LG H+ +  D     L   V +      + S+        F + ++  + A+VK++K  
Sbjct: 88  LLGPHELLSRDDWTVRLRAHVERAEAAAAEASIELGSPVAEFAVSEIHIIHAEVKLMKLI 147

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
              + VD+S NQ  GL+AL FLE+V+  IGK  +FKRS++LIKAW +YE R+LGAH+ LI
Sbjct: 148 CDGVVVDVSANQFGGLAALGFLEEVNAFIGKGEIFKRSIVLIKAWGFYEGRLLGAHHALI 207

Query: 216 STYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEI-- 256
           STYALE +                 VLH+FL ++  FDWD + +S++GPV +  L ++  
Sbjct: 208 STYALETLVLYILNRFHKELSTPLEVLHKFLVFFADFDWDKFAVSVHGPVPLEDLHKVTG 267

Query: 257 -VAETVENDGDELLLSPEFLKKCREIY---SVPIT---ALENTGHEFAIKHLNIVDPLKD 309
            + +  E   +  LL+P+F+ +  + Y   SV        ++T    A K+LN+VDPL  
Sbjct: 268 PIGKRPEVHAEGALLTPDFMWRMMDKYGNESVSAKLGGGADSTPRPMARKYLNVVDPLLS 327

Query: 310 NNNLGRSVSKGNFHRIRCALSYGAQRLGEIL--TLPGHCLGM--GLEKFFINTL 359
           +NNLGRSVS+GN  RIR AL+ GAQRL  +   +  G C G    LE+FF NT+
Sbjct: 328 SNNLGRSVSQGNAKRIRKALALGAQRLTALRESSTGGECFGAVRMLEQFFGNTM 381


>gi|302125450|emb|CBI35537.3| unnamed protein product [Vitis vinifera]
          Length = 398

 Score =  238 bits (606), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 126/206 (61%), Positives = 153/206 (74%), Gaps = 20/206 (9%)

Query: 80  LINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVF 137
           LI    G EVFP+GSVPLK YL DGDIDLTVL   NVEE LA  V  +L+ E Q+  + F
Sbjct: 55  LIRCCLGCEVFPYGSVPLKIYLLDGDIDLTVLCSSNVEEALASDVHAVLKGERQNENAEF 114

Query: 138 EIKDVQY-VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           E+K+VQ+ +  +VK VKC V++I +DISFNQ+ GLS LCFL++VDRLIGKDHLFKRS+IL
Sbjct: 115 EVKNVQFNIIVEVKPVKCLVKDIVIDISFNQLGGLSTLCFLKQVDRLIGKDHLFKRSIIL 174

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL-----------------HRFLDYYNTFDWDN 239
           IK+ CYYESRILGA++GLISTYALE++VL                 +RFLDY++ FDWDN
Sbjct: 175 IKSRCYYESRILGAYHGLISTYALEILVLYIFHLFHSSLDGPLAVGYRFLDYFSKFDWDN 234

Query: 240 YCISINGPVAISSLPEIVAETVENDG 265
           YCIS+NG V  SSLP+IVAE  EN G
Sbjct: 235 YCISLNGSVCKSSLPDIVAELPENGG 260


>gi|145341816|ref|XP_001415999.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576222|gb|ABO94291.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 904

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 126/334 (37%), Positives = 191/334 (57%), Gaps = 32/334 (9%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE---VFPFGSVPLKTYLPDGDI 106
           E    E++ +++P  +SE +R+ V  +I++L    +G     V  +GSVPL+ YLPDGDI
Sbjct: 36  ETLTNELVASLRPTEMSEIRRRAVFEHIKQLAQECFGTAHTLVSAYGSVPLRAYLPDGDI 95

Query: 107 DLTVLGHQNVEEDLARFVCKILEN-------EDQDSVFEIKDVQYVPAQVKIVKCSVQNI 159
           D+ +LG   V  D A++  K  ++        D    F + +V  + A+V+++KC V  +
Sbjct: 96  DVCLLGDHRVI-DKAQWTTKFRKHIEKAEAEADPPHEFAVSEVSVINAEVRLMKCIVDGM 154

Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
            VD+S NQ  GL++L FLE+++  IG+D LF RS+IL+KAW +YE RILGAH+ LISTYA
Sbjct: 155 MVDVSANQFGGLASLGFLEEMNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALISTYA 214

Query: 220 LEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE 262
           LE +                 VLH+ L  +  FDW+ Y ++I+GPVAI  +     E +E
Sbjct: 215 LETLVLYIINKYHADLTCPLSVLHKLLSVFAEFDWEGYALTIHGPVAIEGIATPPDECLE 274

Query: 263 NDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNF 322
                 L++ EF++     YS        +     +K++NI+DPL  NNNLGRSVS GN+
Sbjct: 275 GG----LITEEFMRTMLSTYSCEFMRAAASSAPVTVKYMNIIDPLLPNNNLGRSVSCGNY 330

Query: 323 HRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFI 356
            R+R AL  GAQRL  ++        +G+ + F+
Sbjct: 331 RRVRAALKLGAQRLDALMASSAARDDVGVLRGFV 364


>gi|302835555|ref|XP_002949339.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
           nagariensis]
 gi|300265641|gb|EFJ49832.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
           nagariensis]
          Length = 3433

 Score =  234 bits (598), Expect = 8e-59,   Method: Composition-based stats.
 Identities = 146/355 (41%), Positives = 206/355 (58%), Gaps = 45/355 (12%)

Query: 25  SSSSSYSSPP----NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRL 80
           S +  YSS P    NL P  +D    LLA  R   ++  I+P  +S ++R  +  ++  +
Sbjct: 7   SPNQPYSSNPRSLKNLQPKFVDE---LLA--RTDTLISRIRPTGLSLQRRWVITEHVTSI 61

Query: 81  INGYYG---IEVFPFGSVPLKTYLPDGDIDLTVLGH----QNVEEDLARF------VCKI 127
           +   +    +   PFGSVPLKTYLPDGDIDL++       Q ++E L         VC  
Sbjct: 62  VKRCFAPHDVTAIPFGSVPLKTYLPDGDIDLSIYSESPRAQALKEALRDTWATQLQVCLE 121

Query: 128 LENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIG-- 185
            E  +  +VF + +VQ + A+VK++KC V NI VDISF Q+ GL+   FLE VDR +   
Sbjct: 122 EEANNPTAVFRVANVQVIHAEVKLLKCLVDNIVVDISFFQVGGLNTYNFLEDVDRFVDQC 181

Query: 186 ---KDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE-----------------MMVL 225
              + HLFK S+IL+K WCYYESR+LGAH+GLISTYALE                 + VL
Sbjct: 182 IPVRKHLFKDSIILVKGWCYYESRVLGAHHGLISTYALETLVLYVINLYHRELTNPLQVL 241

Query: 226 HRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVP 285
           ++FL   + FDW+NYC+S+ GP+ +SS P+ V ET E    + LL+ +F+ +    Y+ P
Sbjct: 242 YKFLVECSCFDWENYCLSLEGPIPLSSFPKPVVETPEALQRDALLTKDFMARAYFKYTEP 301

Query: 286 -ITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEI 339
            + A       FAIK LN++DP+   NNLGRSVSK ++ RIR A  +GA+ L +I
Sbjct: 302 QLRAQGGEPKPFAIKQLNVMDPILPGNNLGRSVSKASYLRIRRAFEHGARMLADI 356


>gi|290976573|ref|XP_002671014.1| predicted protein [Naegleria gruberi]
 gi|284084579|gb|EFC38270.1| predicted protein [Naegleria gruberi]
          Length = 763

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 141/378 (37%), Positives = 210/378 (55%), Gaps = 77/378 (20%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV-- 110
           IQ++L  IQP+  SEK RKEV + I  ++     ++ + +GSV  KTYLPDGDIDL+V  
Sbjct: 169 IQQLLYRIQPSSESEKHRKEVFDIIAAVLE-LANLKTYLYGSVAFKTYLPDGDIDLSVFV 227

Query: 111 -------LGHQNVEEDLAR----------FVCKILENE---------------------- 131
                  L  QNV   L+           +V  +L                         
Sbjct: 228 SNEEYLELSSQNVNNLLSHQPQVNDSTISYVHNVLLKNMHIGLKQQLADPSIPWYNKARS 287

Query: 132 -----DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
                 ++++  I+D+ ++ A+VK++KC+V NIP+D+S  Q+ GLS LCFL +VD  I  
Sbjct: 288 LFSEIQRNNLAYIEDMTFINAEVKLIKCTVNNIPIDMSSGQIGGLSTLCFLHEVDDKIAD 347

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV---------------LHRFLDY 231
           +HLFKRS+IL+K+W YYESRILG+H+GL+STY L +++               L+RFL+Y
Sbjct: 348 NHLFKRSIILMKSWSYYESRILGSHHGLVSTYGLTVLLMYMFRLYKIETPLQALYRFLNY 407

Query: 232 YNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL-------LSPEFLKKCREIYSV 284
           Y+TFDW N+ ISI GP+ + ++ +   +++E+   E L       L+  FL+ C+  Y  
Sbjct: 408 YSTFDWTNFGISIYGPIPLGAIND--HKSIEDFYYENLPPERHDSLTSSFLQSCKSKYGT 465

Query: 285 PITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPG 344
                 ++   F IK+LNIVDPL+D NNLGRSV+  NF RIR A+  G++ + +IL    
Sbjct: 466 V-----DSSKTFTIKNLNIVDPLRDFNNLGRSVNYNNFLRIRRAIKKGSKTITDILISND 520

Query: 345 HCLGMGLEK-FFINTLER 361
                 + K FF N +E+
Sbjct: 521 LMESEKILKLFFKNVVEK 538


>gi|308799699|ref|XP_003074630.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
 gi|116000801|emb|CAL50481.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
          Length = 875

 Score =  221 bits (563), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 127/331 (38%), Positives = 189/331 (57%), Gaps = 33/331 (9%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE---VFPFGSVPLKTYLPDGDI 106
           E    E++ +++P   SE +R+ V  +I+ L  G +G     V  +GSVPL+ YLPDGDI
Sbjct: 32  ETLTNELVESLRPTAKSEMRRRAVFEHIKELAQGCFGTAHTLVSVYGSVPLRAYLPDGDI 91

Query: 107 DLTVLGHQNVEEDLARFVCKI---LENEDQDSVFE--IKDVQYVPAQVKIVKCSVQNIPV 161
           D+ +LG   V  D A +  K    +E  + +S FE  + +V  + A+V+++KC V  + V
Sbjct: 92  DVCLLGDHRVI-DKASWTTKFQKHIEKVEAESDFEFAVSEVSVINAEVRLMKCIVDGMMV 150

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           D+S NQ  GL++L FLE+ +  IG+D LF RS+IL+KAW +YE RILGAH+ LI+TYALE
Sbjct: 151 DVSANQFGGLASLGFLEETNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALIATYALE 210

Query: 222 MM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
            +                 VLH+ L  +  FDW+ Y ++I+GPVA+     I    +E  
Sbjct: 211 TLVLYIINKYYAELTCPLSVLHKLLRVFGDFDWEGYVLTIHGPVALEDANNIPPGCLEGG 270

Query: 265 GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHR 324
               LL+ EF++     Y    T+         +K++NI+DPL  NNNLGRSVS GN+ R
Sbjct: 271 ----LLTEEFMQSMLCQYGQIETS---NSAPVVVKYMNIIDPLVPNNNLGRSVSCGNYRR 323

Query: 325 IRCALSYGAQRLGEILTLPGHCLGMGLEKFF 355
           +R AL  GA+ L +++        +G+ K F
Sbjct: 324 VRAALRLGARHLDKLMERSEARDDLGVLKAF 354


>gi|414866688|tpg|DAA45245.1| TPA: hypothetical protein ZEAMMB73_273182, partial [Zea mays]
          Length = 260

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 103/171 (60%), Positives = 137/171 (80%), Gaps = 2/171 (1%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
           ++P   SE++R EV++Y +RL+    G EVF FGSVPLKTYLPDGDIDLTVLG+ + +  
Sbjct: 38  VRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDST 97

Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
           L   V  ILE+E+Q  D+ F +KD++ + A+V+++KC++ NI VDISFNQ  G+ ALCFL
Sbjct: 98  LVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCFL 157

Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRF 228
           E VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE+++L+ F
Sbjct: 158 ELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVF 208


>gi|297612542|ref|NP_001065982.2| Os12g0114200 [Oryza sativa Japonica Group]
 gi|255669984|dbj|BAF29001.2| Os12g0114200, partial [Oryza sativa Japonica Group]
          Length = 178

 Score =  212 bits (540), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 107/167 (64%), Positives = 132/167 (79%), Gaps = 12/167 (7%)

Query: 88  EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYV 145
           +VFPFGSVPLKTYLPDGDIDLT  GH + +E LA+ V  +LE+E+  +D+ FE+KDVQY+
Sbjct: 1   QVFPFGSVPLKTYLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYI 59

Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
            A+VK+VKC VQNI VDISFNQ  GL  LCFLEKVD+   K+HLFKRS++LIKAWCYYES
Sbjct: 60  HAEVKLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYES 119

Query: 206 RILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
           RILGAH+GLISTYALE++VL+ F  ++ T D         GP+A+SS
Sbjct: 120 RILGAHHGLISTYALEILVLYIFHLFHGTLD---------GPLAVSS 157


>gi|428171015|gb|EKX39935.1| hypothetical protein GUITHDRAFT_113927 [Guillardia theta CCMP2712]
          Length = 632

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/369 (35%), Positives = 193/369 (52%), Gaps = 52/369 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE------VFPFGSVPLKTYLPDG 104
           E+  EI+  +QP   +E+ R  V  Y+++LI      E      V  FGSVPLKTYLP G
Sbjct: 17  EQADEIVRQLQPHRRAERHRLTVFEYVKKLIKHVADEENKTEIYVHRFGSVPLKTYLPHG 76

Query: 105 DIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP----AQ------------ 148
           D+D+T     ++   L R   K+ +   ++ ++ +  V  VP    AQ            
Sbjct: 77  DLDVTAFAANDLW--LERLKAKLEDEAKKNDMYVVSGVHSVPRDLRAQSREELGKKDQGP 134

Query: 149 ---VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
              VK+VKC V  I VDI+ N + G+  LCFLEKVD ++ +DHLFKR+ IL+K+WCY+ES
Sbjct: 135 VEIVKVVKCQVNGISVDITANALGGMCNLCFLEKVDTMLKRDHLFKRATILVKSWCYFES 194

Query: 206 RILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPV 248
            IL +  GL+STYALE +                 VL RFL+YY  FDW N+C+++ GPV
Sbjct: 195 HILSSQNGLLSTYALETLVLCIVNIFHEELQTPLDVLKRFLEYYANFDWRNHCLTMRGPV 254

Query: 249 AISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLK 308
             S++P        ++    LL+   L++   +  + ++ L++    F  K++NI DPL 
Sbjct: 255 NRSNIPPGGEVPHLDNEPSYLLNDAILQEDSHLQFL-MSGLQDDNRGFQWKYMNICDPLS 313

Query: 309 DNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLP-GHCLGMGLEK------FFINTLER 361
             NN+GRSVS+ + +RI  A  +G Q L  +L     H   + L K       F +   +
Sbjct: 314 TRNNIGRSVSRSSAYRIASAFRHGWQSLSGLLYCSLHHNSSISLTKSEKSARAFFHFTGK 373

Query: 362 NGKGLRPDV 370
              G RPDV
Sbjct: 374 TLTGHRPDV 382


>gi|307104056|gb|EFN52312.1| hypothetical protein CHLNCDRAFT_58914 [Chlorella variabilis]
          Length = 740

 Score =  205 bits (522), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 125/328 (38%), Positives = 189/328 (57%), Gaps = 59/328 (17%)

Query: 68  KKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVL---------------- 111
           ++R+  +  +Q  +     +E F FGSVPL+  LPDGDID++                  
Sbjct: 376 EQRQAAVALVQECLQ----VEAFMFGSVPLRAVLPDGDIDISFFATAATTPSSPSGNGGE 431

Query: 112 --GHQ-------NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIP 160
             GH+       ++ +  A  + + LE E    D+ F+I+DVQ + A+VK+VKC V ++ 
Sbjct: 432 QPGHRAGASPPGDLRDTWASQLLRALEREAVRPDAPFKIRDVQIIQAEVKLVKCVVHDVV 491

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           VD+SF+ + GL  + FLE  DR IG+ HLFKRS++L+KAWCYYESR+LGAH+GLIS+YAL
Sbjct: 492 VDVSFDTVGGLCTVAFLEAADRRIGRQHLFKRSILLLKAWCYYESRLLGAHHGLISSYAL 551

Query: 221 EMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE--TV 261
           E++VL+                 RFL    +FDW+ YC+++ GP+ I+ L ++  +   +
Sbjct: 552 EVLVLYIFNLHHAELHTPLDVLRRFLAVLGSFDWERYCLALQGPLPIADLHKLHVDRTAL 611

Query: 262 ENDGDELLLSPEFLKKCREIYSV---------PITALENTGHEFAIKHLNIVDPLKDNNN 312
            + G E LL  +F++   + YSV             ++     F +KHLNIVDPL  +NN
Sbjct: 612 VSSGTEPLLDADFMRGVLQHYSVQHLSQQQQQEAAGMQLVAPRFPLKHLNIVDPLLPSNN 671

Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEIL 340
           LGRSVSK ++ R++ AL+ G + L E L
Sbjct: 672 LGRSVSKASYARVKKALALGNRMLEEAL 699


>gi|77553482|gb|ABA96278.1| nucleotidyltransferase family protein, putative, expressed [Oryza
           sativa Japonica Group]
 gi|215769169|dbj|BAH01398.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 622

 Score =  195 bits (495), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 98/194 (50%), Positives = 131/194 (67%), Gaps = 17/194 (8%)

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDW 237
           +LIKAWCYYESRILGAH+GLISTYALE++VL+                 RFLDYY+ FDW
Sbjct: 1   MLIKAWCYYESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDW 60

Query: 238 DNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFA 297
           DN  IS+ GP+++SSLPE+V ++ +   D+  +  +FLK+C + ++V     E     F 
Sbjct: 61  DNKGISLYGPISLSSLPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFP 120

Query: 298 IKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFIN 357
            K  NIVDPLK +NNLGRSVSKGNF RIR A  +GA++LG+IL +P +     + +FF N
Sbjct: 121 RKFFNIVDPLKQSNNLGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRN 180

Query: 358 TLERNGKGLRPDVQ 371
           TL+R+   +RPDVQ
Sbjct: 181 TLKRHCSRVRPDVQ 194



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 9/114 (7%)

Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPP--SSPSQIPSRSAWDVLC 434
           G+  S  +DL+GDY + +N LLY Q +H DY +     F P     PS  P+R   +   
Sbjct: 352 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYQMPSPPPARYQ-NRRS 410

Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFST-DEGEKSRGTGTYIP 487
                + N+    GT    P    C P    +R  T+S  D+  ++RGTGTY P
Sbjct: 411 SNGHSRNNVYGYAGTNGIGPAP--CPPGYLIMR--TYSQIDDSHRARGTGTYFP 460


>gi|168035607|ref|XP_001770301.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678518|gb|EDQ64976.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1631

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 107/247 (43%), Positives = 138/247 (55%), Gaps = 43/247 (17%)

Query: 171 LSALCFLEKVDRLIGKD-------HLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM 223
           LS L   E VDR I ++       HLFKRSVIL+KAWCYYESRILGAH+GLISTYALE +
Sbjct: 242 LSNLATRELVDREIDRNDFELKQNHLFKRSVILVKAWCYYESRILGAHHGLISTYALETL 301

Query: 224 -------------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEI-------- 256
                              VL+ FL Y+  FDWD YC+++ GPV ++ + EI        
Sbjct: 302 VLYIFHVFHPKRRLRGPLEVLYLFLVYFCNFDWDKYCVTMWGPVPLARITEISSGSARKT 361

Query: 257 -----VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDN 310
                 AE    D  +LLLS EFL++C + YS      E++    F  K LN++DP++D 
Sbjct: 362 FRISDFAEAPRKDRGKLLLSKEFLERCIDSYSDAKGGQESSQRRNFITKFLNVLDPIRDT 421

Query: 311 NNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFF---INTLERNGKGLR 367
           NNLGRSV+ G+F RIR A   GA+ LGE+L  P   +    + FF     +LER   G R
Sbjct: 422 NNLGRSVNVGSFKRIRSAFGLGARTLGEVLECPTDQINEKFKSFFSCTFKSLERYRIGGR 481

Query: 368 PDVQVPI 374
           PD   P+
Sbjct: 482 PDTGNPL 488



 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 70/147 (47%), Positives = 97/147 (65%), Gaps = 6/147 (4%)

Query: 39  FSIDAEL----WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
           FS  A+L    W   E    E++ +I+P   SE++R  V  ++QRLI   +  +V  FGS
Sbjct: 25  FSSAAKLEDGWWSRVEGHTAELIDSIKPTRSSEERRTAVTAFVQRLIRDRFDCKVVKFGS 84

Query: 95  VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILEN--EDQDSVFEIKDVQYVPAQVKIV 152
           VPLKTYLPDGDIDLT+    +++E  A+ V K L+   ED ++ F +K+VQY+ A+VK++
Sbjct: 85  VPLKTYLPDGDIDLTIFARNDLKETWAQDVVKALKQAEEDTNAEFRVKEVQYIQAEVKLI 144

Query: 153 KCSVQNIPVDISFNQMAGLSALCFLEK 179
           KC V+NI VDISFNQ  GLS  CFLE+
Sbjct: 145 KCLVENIVVDISFNQTGGLSTFCFLEE 171


>gi|218192781|gb|EEC75208.1| hypothetical protein OsI_11468 [Oryza sativa Indica Group]
          Length = 860

 Score =  179 bits (454), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 103/234 (44%), Positives = 131/234 (55%), Gaps = 62/234 (26%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVP 146
           VF +GSVPLKTYLPDGD+DLTVLG+ +    L   +  IL++E+Q  D+ FE+KD+Q + 
Sbjct: 20  VFAYGSVPLKTYLPDGDVDLTVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLIN 79

Query: 147 AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
           A+V                               DR +GK+HL K S+ILIKAWCYYESR
Sbjct: 80  AEV-------------------------------DRKVGKNHLVKNSIILIKAWCYYESR 108

Query: 207 ILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVA 249
           +LGAH+GLISTYALE +                 VL+RFL+Y++ FDWDNYCIS+NGPVA
Sbjct: 109 LLGAHHGLISTYALETLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVA 168

Query: 250 ISSLPEIVAE----------TVENDGDELLLSPEFLKKCREIYSVPITALENTG 293
           +SSLP  +A             +  G  LLL PE +    EIY      L   G
Sbjct: 169 LSSLPNQIASFNRIRTAFSYGAQKLGQVLLLQPELIPD--EIYGFFKNTLNRIG 220


>gi|2651305|gb|AAB87585.1| hypothetical protein [Arabidopsis thaliana]
          Length = 384

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 166/324 (51%), Gaps = 82/324 (25%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEV----------- 89
           I+AE+WL+AE R QEILC IQP  ++E+ R ++I+ +Q L+    GIEV           
Sbjct: 28  IEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVRTFLLLLDELS 87

Query: 90  -----------FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFE 138
                      + FGS+PLKTYLPDGDIDLTVL H   EED AR VC +LE E  +S  +
Sbjct: 88  FSLQRIRNAKVYLFGSMPLKTYLPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGNSDLQ 147

Query: 139 IKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIK 198
           +  VQYV A                               KVD+  G+DHLFK+S+IL+K
Sbjct: 148 VTGVQYVQA-------------------------------KVDKAFGRDHLFKKSIILVK 176

Query: 199 AWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
           AWC+YESRILGA+ GLISTYAL ++VL+     Y+         S++GP+A   L +++ 
Sbjct: 177 AWCFYESRILGANSGLISTYALAILVLNIVNMSYS---------SLSGPLA--KLRDVLT 225

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
              EN G           K  + ++V   +LE  G     +  ++ +P+          S
Sbjct: 226 LPGENVG----------WKLEKFFNV---SLERNGKG---QRQDVEEPVVAFGTGAADYS 269

Query: 319 --KGNFHRIRCALSYGAQRLGEIL 340
             KG+F R R  L YG    GE L
Sbjct: 270 QLKGDFDRYRNNLIYGKWFHGESL 293



 Score = 90.1 bits (222), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 59/166 (35%), Positives = 84/166 (50%), Gaps = 20/166 (12%)

Query: 328 ALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADL 387
           +LS    +L ++LTLPG  +G  LEKFF  +LERNGKG R DV+ P+ AFG+G ++ + L
Sbjct: 212 SLSGPLAKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPVVAFGTGAADYSQL 271

Query: 388 SGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQW 447
            GD+D Y N L+YG+ +H  +L     + PPS  +     ++W  +  Y+    N     
Sbjct: 272 KGDFDRYRNNLIYGKWFHGESL---HNWLPPSLDT-----TSWANISFYMSRVRN----- 318

Query: 448 GTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPP 493
           G        P   P   Q       +    +S GTGTYIP     P
Sbjct: 319 GFNGRNQEGPTSMPNIKQ-------SPNNRRSNGTGTYIPGMVHSP 357


>gi|226506494|ref|NP_001141604.1| uncharacterized protein LOC100273722 [Zea mays]
 gi|194705246|gb|ACF86707.1| unknown [Zea mays]
 gi|413924676|gb|AFW64608.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
          Length = 251

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/184 (48%), Positives = 111/184 (60%), Gaps = 17/184 (9%)

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDW 237
           +LIK WCYYES ILGA  GL+STYALE +VL+                 RFLDYY+ FDW
Sbjct: 1   MLIKHWCYYESCILGAQRGLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDW 60

Query: 238 DNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFA 297
           DN  IS+ GP+++SSLPE+V E      D  L    FLK C + +SVP    E     F+
Sbjct: 61  DNKGISLFGPISLSSLPELVTEPPYTRDDGFLSREAFLKDCAKAFSVPPINSEENPQVFS 120

Query: 298 IKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFIN 357
            K +NIVDPLK +NNLGRS+SKGN  RIR    +GA +LG+IL  P       + +FF N
Sbjct: 121 KKFVNIVDPLKQSNNLGRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRN 180

Query: 358 TLER 361
           TL R
Sbjct: 181 TLSR 184


>gi|303287038|ref|XP_003062808.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455444|gb|EEH52747.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 781

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 92/212 (43%), Positives = 123/212 (58%), Gaps = 28/212 (13%)

Query: 152 VKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAH 211
           +KC    + VDIS NQ  GL+ L FLE+VD  I +D +FKRS+ILIKAW +YE R+LGAH
Sbjct: 1   MKCIADGVVVDISANQFGGLATLGFLEEVDAFIARDGIFKRSIILIKAWGFYEGRVLGAH 60

Query: 212 YGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLP 254
           + LISTYALE +                 VLH+FL Y+  F+WD Y +SI+GPV + +L 
Sbjct: 61  HALISTYALETLVLYVLNAYHEELSTPLEVLHKFLTYFADFEWDAYAVSIHGPVRLDALE 120

Query: 255 EIVAETVENDGDELLLSPEFLKKCREIYS--VPITALENTG--------HEFAIKHLNIV 304
           + V +  +      LL+P F K+  + Y     I A +                KHLN++
Sbjct: 121 KGVRD-ADAPARGPLLTPAFTKRVLDKYGNDAIINAEKGQAGPGGGGNRRAMQPKHLNVI 179

Query: 305 DPLKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
           DPL  +NNLGRSVS+GN  RI+ AL  GA +L
Sbjct: 180 DPLLPSNNLGRSVSQGNAKRIQKALRLGAAKL 211


>gi|412992209|emb|CCO19922.1| predicted protein [Bathycoccus prasinos]
          Length = 1318

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 142/299 (47%), Gaps = 69/299 (23%)

Query: 136 VFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
           + E+KD+  + A V+++KC V  I VD+S NQ  GL+ L FL++V+  IGK+ LFKRSVI
Sbjct: 288 LLEVKDIVVIHADVRLLKCVVDGIVVDVSANQFGGLATLAFLKEVNSKIGKNDLFKRSVI 347

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMM------------------------------VL 225
           L+KAW +YESRILGA Y L+STYAL+ +                              VL
Sbjct: 348 LVKAWAFYESRILGAPYALLSTYALKTLIICALRRFNKKESKSDATKTKKREIATPLDVL 407

Query: 226 HRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDEL----------------- 268
             F +Y + F W+ + ++I G V +  L ++      +                      
Sbjct: 408 RIFFEYVSDFPWETHAVTIFGDVPVEKLDKVSVREFSSSSKSEKNKNKNNDDEREEKDDE 467

Query: 269 ------LLSPEFLKKCREIY---SVP-ITALENTGH---------EFAIKHLNIVDPLKD 309
                 LL   F+    + Y   S P    L N G+             KHL+I+DPL +
Sbjct: 468 EAEEDPLLDDTFVDTILKSYGPDSRPDANVLLNIGNGKKAPFRRRAIGAKHLHILDPLSE 527

Query: 310 NNNLGRSVSKGNFHRIRCALSYGAQRLG--EILTLPGHCLGMGLEKFFINTLERNGKGL 366
            NNLGRSVS GNF R+R A   GA+RL   E+ + P + +  G E FF   L   G  L
Sbjct: 528 TNNLGRSVSLGNFARVRAAFRLGAERLKRLEMESEPEN-ITRGFEYFFKVALANRGGKL 585



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 14/83 (16%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE--------------VFPFG 93
           + E   +E++ +++P+  SEK+R+ V   ++ LI   +  E              V  FG
Sbjct: 127 MLEALTEELIASLRPSKQSEKRRRMVFRKMESLIRECFEKEFEGEGVNEKKNTIVVSAFG 186

Query: 94  SVPLKTYLPDGDIDLTVLGHQNV 116
           SVP  TYLPDGDID+ +LG   V
Sbjct: 187 SVPFGTYLPDGDIDVCILGDHEV 209


>gi|301093296|ref|XP_002997496.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110638|gb|EEY68690.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 782

 Score =  145 bits (367), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 168/369 (45%), Gaps = 89/369 (24%)

Query: 62  PAIVSEKKRKEVINYIQRLINGYYGIEVFPF----GSVPLKTYLPDGDIDLTVLGHQNVE 117
           P+ V+++ R++V++++QR+I  ++ +   P     GS P+KTYLP  D+D+ +L  Q +E
Sbjct: 257 PSDVADRVRQQVLSFVQRVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLLVPQELE 316

Query: 118 ED----LARFVCKILENEDQDSVFE--------------------------------IKD 141
                 + + +C    +    +V +                                +++
Sbjct: 317 SSWYYIVTQALCVAGGSGGAGTVLDLGNSASSDVSGSSSPSGPAAASGGGPLLLTNTVRN 376

Query: 142 VQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
           V ++ A V++VKC+V NIPVD + N++  L A+  L+ +   +G+ HLFK+S+ILIKAWC
Sbjct: 377 VTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAARVGRQHLFKKSLILIKAWC 436

Query: 202 YYESR---------------------ILGAHYGLISTYALEMMV---------------- 224
            +ESR                     ++GA +G +STYA+  +V                
Sbjct: 437 THESRPFMQRASNEAGGSVPGSTPASVMGASHGALSTYAVNTIVMALFNQHGDALTHPLQ 496

Query: 225 -LHRFLDYYNTFDWDNYCISINGPVAISSLPEI----------VAETVENDGDELLLSPE 273
            L+ FLD    F W    ++++GPV +S L               +T + D  ++    +
Sbjct: 497 ALYLFLDRLAEFPWHECALTLHGPVPLSRLASTPLNGTTSYRSKLKTAKLDASDVEAIRD 556

Query: 274 FLKKCREIYSVPITALENT-GHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYG 332
            L      +   + + + T    F I+  NIVDPL D NNL RSVS   F  ++ A    
Sbjct: 557 TLADQFGAFDAALKSSKGTPTGLFPIRACNIVDPLDDKNNLARSVSAEGFPVMKRAFRLA 616

Query: 333 AQRLGEILT 341
             +L  +L 
Sbjct: 617 RDQLAAMLA 625


>gi|325189429|emb|CCA23919.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1193

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/419 (27%), Positives = 191/419 (45%), Gaps = 78/419 (18%)

Query: 50   EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY----GIEVFPFGSVPLKTYLPDGD 105
            E  +++++  + P   +++ R  ++ Y++ L+   +     I  FP GS P KTYLPD D
Sbjct: 723  ETSVKKLIHALSPTHEADQARCNILAYLRHLLELQFPRSSSILFFPTGSFPCKTYLPDAD 782

Query: 106  IDLTVLGHQNVEEDLARFVCKIL------------------------ENEDQDSVFEIKD 141
            +D+ +L  +++E      V ++L                         N    +   +++
Sbjct: 783  LDVCLLVPRSMEPTWFFSVVQMLCFAATNDVHAEPKHSLESVQAPSWMNSTSSTGNTVRN 842

Query: 142  VQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
            V ++ A V++VKC++ N+ VDI+ N++  L AL  L+  D  +G+ HLFK+S++LIKAWC
Sbjct: 843  VTFINADVRVVKCTIDNVAVDITVNRVGALGALVLLDTFDLRVGRHHLFKQSLVLIKAWC 902

Query: 202  YYESR-------ILGAHYGLISTYALEMMV-----------------LHRFLDYYNTFDW 237
              +         +LG+  G  STYA+  MV                 LH FLD    F W
Sbjct: 903  ALDCLEGGQGCGVLGSKNGAFSTYAVNTMVMTLFNRWGYRIQHPLEALHLFLDIMTQFPW 962

Query: 238  DNYCISINGPVAISSL-----PEIVAETVENDGDELLLSPEFLKKCR----EIY-SVPIT 287
                 +I GPV  + L       IV    E      L++ E +++ R    E + S  ++
Sbjct: 963  QECAWTIFGPVLFTQLYQNLSSRIVPPGWETASANCLITREDIEQIRVCLNEYFGSFDVS 1022

Query: 288  ALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCL 347
                T   F ++  N++DPL+  NNL RSV    F  ++     G  RL  +L+     +
Sbjct: 1023 LGTETNAVFPLRSFNMIDPLQLGNNLARSVLPEIFPSLQVMFRDGRDRLDRVLSEEKTVM 1082

Query: 348  GMGLEKFFINT--LERNGKGLRPDV-----QVPIHAFGSGKSEAADLSGDYDSYYNGLL 399
                 +FF ++  L   G G RPD+     Q+ + A G  +    D SG  D+ +  L+
Sbjct: 1083 -----EFFKHSWKLYGRGDGWRPDLLLHPRQLWVQANGRDEK---DESGS-DTKWKSLI 1132


>gi|403357215|gb|EJY78230.1| hypothetical protein OXYTRI_24618 [Oxytricha trifallax]
          Length = 831

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 164/339 (48%), Gaps = 55/339 (16%)

Query: 56  ILCTIQPAIVSEKKRKEVINYIQRLINGYYG----IEVFPFGSVPLKTYLPDGDIDLTVL 111
            L  I P   SE+KR ++   ++ LI    G    + V  +GS PLKTYLPD DID+TV+
Sbjct: 28  FLNKIGPTQESERKRVKIFEQVKFLIEKALGGKSQVMVIRYGSDPLKTYLPDSDIDITVI 87

Query: 112 GHQNVEEDLA---------RFVCKILE--NEDQDSVFEIKDVQYV-PAQVKIVKCSVQNI 159
               ++ +           + + K +E   E Q+    +K +  +  A V+I+K + QN 
Sbjct: 88  RRDYLQGNQTNQLTALTQLKLIKKEIEIFGETQNGKNFVKSMVLIDQADVEIIKLNFQNT 147

Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
            VDIS  Q+ G+  L F+  + + IGK  L K+S+IL+KAW  Y++ ILG+    ++TYA
Sbjct: 148 FVDISIKQVGGICTLYFMNYMAKRIGKQQLLKKSIILLKAWFTYDASILGSQAACMATYA 207

Query: 220 LEMMV---LHRFLDYYNT--------------FDWDNYCISINGPVAISSLPEIVAETVE 262
           L +MV   L+ F D  N+              FDW+N  ++I GP+  S   E + E  +
Sbjct: 208 LYVMVLFILNNFYDELNSPMDVIMMFFKVWSHFDWENNIVTIFGPIKSSGFYERLKE-CQ 266

Query: 263 NDGDELLLSPEFLKKCREIYSVPIT-------ALENTG--------------HEFAIKHL 301
            D D L +    L +  +   + +T        L+ +G                F  K+ 
Sbjct: 267 FDIDRLTMLDRSLHQEYQYRKLLVTPDELSFLNLQFSGVRLSDVSSYNLANKKSFNTKYF 326

Query: 302 NIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
           NI+DP    NNLG+S+SK N  RI+  L     ++ +I 
Sbjct: 327 NIIDPTFSKNNLGKSISKLNSSRIKQVLRLQNMKMRQIF 365


>gi|401410712|ref|XP_003884804.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
 gi|325119222|emb|CBZ54776.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
          Length = 3449

 Score =  136 bits (342), Expect = 4e-29,   Method: Composition-based stats.
 Identities = 72/194 (37%), Positives = 113/194 (58%), Gaps = 29/194 (14%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNV------EEDLARFVCKILENEDQDSV---- 136
           I V+ +GS PL+T+LPDGD+D+ ++ +         EE+    +  +L+   ++ V    
Sbjct: 356 ITVYRYGSFPLRTFLPDGDLDIGIISYNRRTGVVEGEEESDALLAVLLDKFQREDVKTHK 415

Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
            F +++   V A+V+I+KC V  I VD+S N++ G  +L FLE  DR IG+ HLFKRSV+
Sbjct: 416 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRHHLFKRSVL 475

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
           LIK+W  YES +LG+  GL++TY +E +VLH                   F  YY++F W
Sbjct: 476 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVLPASLLPTPLHLLYHFFSYYSSFHW 535

Query: 238 DNYCISINGPVAIS 251
           D Y ++  GP+ ++
Sbjct: 536 DRYAVTACGPLPLT 549



 Score = 44.3 bits (103), Expect = 0.21,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFF 355
           F  + +N+VDPL + NNL RSVS+  F+R+  A+  G Q L  IL          L  F 
Sbjct: 890 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTHILATGDAARFRTL--FL 947

Query: 356 INTLERNGKGLRPDVQVPI 374
            N+ +   +   PDV  P+
Sbjct: 948 PNSYQLLDRIKSPDVAYPV 966


>gi|221502484|gb|EEE28211.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 3297

 Score =  134 bits (337), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 74/194 (38%), Positives = 113/194 (58%), Gaps = 29/194 (14%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVL------GHQNVEEDLARFVCKILENEDQDSV---- 136
           I V+ +GS PL+T+LPDGD+D+ V+      G    EE+    +  +LE   +  V    
Sbjct: 222 IAVYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHK 281

Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
            F +++   V A+V+I+KC V  I VD+S N++ G  +L FLE  DR IG++HLFKRSV+
Sbjct: 282 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVL 341

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
           LIK+W  YES +LG+  GL++TY +E +VLH                  +F  YY++F W
Sbjct: 342 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHW 401

Query: 238 DNYCISINGPVAIS 251
           D Y ++  G + ++
Sbjct: 402 DRYAVTACGALPLT 415



 Score = 44.3 bits (103), Expect = 0.24,   Method: Composition-based stats.
 Identities = 20/45 (44%), Positives = 30/45 (66%)

Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
           F  + +N+VDPL + NNL RSVS+  F+R+  A+  G Q L ++L
Sbjct: 739 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 783


>gi|221482136|gb|EEE20497.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 3441

 Score =  134 bits (337), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 74/194 (38%), Positives = 113/194 (58%), Gaps = 29/194 (14%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVL------GHQNVEEDLARFVCKILENEDQDSV---- 136
           I V+ +GS PL+T+LPDGD+D+ V+      G    EE+    +  +LE   +  V    
Sbjct: 367 IAVYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHK 426

Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
            F +++   V A+V+I+KC V  I VD+S N++ G  +L FLE  DR IG++HLFKRSV+
Sbjct: 427 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVL 486

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
           LIK+W  YES +LG+  GL++TY +E +VLH                  +F  YY++F W
Sbjct: 487 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHW 546

Query: 238 DNYCISINGPVAIS 251
           D Y ++  G + ++
Sbjct: 547 DRYAVTACGALPLT 560



 Score = 44.3 bits (103), Expect = 0.24,   Method: Composition-based stats.
 Identities = 20/45 (44%), Positives = 30/45 (66%)

Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
           F  + +N+VDPL + NNL RSVS+  F+R+  A+  G Q L ++L
Sbjct: 884 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 928


>gi|237843045|ref|XP_002370820.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
 gi|211968484|gb|EEB03680.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
          Length = 3436

 Score =  134 bits (337), Expect = 2e-28,   Method: Composition-based stats.
 Identities = 74/194 (38%), Positives = 113/194 (58%), Gaps = 29/194 (14%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVL------GHQNVEEDLARFVCKILENEDQDSV---- 136
           I V+ +GS PL+T+LPDGD+D+ V+      G    EE+    +  +LE   +  V    
Sbjct: 361 IAVYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHK 420

Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
            F +++   V A+V+I+KC V  I VD+S N++ G  +L FLE  DR IG++HLFKRSV+
Sbjct: 421 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVL 480

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
           LIK+W  YES +LG+  GL++TY +E +VLH                  +F  YY++F W
Sbjct: 481 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHW 540

Query: 238 DNYCISINGPVAIS 251
           D Y ++  G + ++
Sbjct: 541 DRYAVTACGALPLT 554



 Score = 44.3 bits (103), Expect = 0.24,   Method: Composition-based stats.
 Identities = 20/45 (44%), Positives = 30/45 (66%)

Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
           F  + +N+VDPL + NNL RSVS+  F+R+  A+  G Q L ++L
Sbjct: 878 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 922


>gi|348683529|gb|EGZ23344.1| hypothetical protein PHYSODRAFT_485178 [Phytophthora sojae]
          Length = 793

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 110/446 (24%), Positives = 186/446 (41%), Gaps = 115/446 (25%)

Query: 49  AEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPF----GSVPLKTYLPDG 104
           A  +   ++  + P+  +++ R++V++++Q++I  ++ +   P     GS P+KTYLP  
Sbjct: 248 AARQADALIEWMGPSDAADRVRQQVLSFVQQVITAHFPLAAAPLFFATGSYPMKTYLPGS 307

Query: 105 DIDLTVLGHQNVEED----LARFVCKILENEDQDSVFEI--------------------- 139
           D+D+ +L  Q +E      + + +C    +    +V ++                     
Sbjct: 308 DLDICLLVPQELESSWYFIVTQALCIAGGSGGAGTVLDVGNPGGSVDGSGSSSPSGPAVG 367

Query: 140 -------------KDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
                        ++V ++ A V++VKC+V NIPVD + N++  L A+  L+ +   +G+
Sbjct: 368 SGSSGALLLTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAVRVGR 427

Query: 187 DHLFKRSVILIKAWCYYES-------------------------RILGAHYGLISTYALE 221
            HLFK+S+ILIKAWC +ES                          ++GA +G +STYA+ 
Sbjct: 428 QHLFKKSLILIKAWCTHESSPFMQAASVECGGLGPSVVPGSTPTSVMGASHGALSTYAVN 487

Query: 222 MMV-----------------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
            +V                 L+ FLD    F W    ++++G V +S L           
Sbjct: 488 TIVMALFNQHGDALTHPLQALYLFLDRLAEFPWHEAALTLHGAVPLSRLATTPLNGTTPS 547

Query: 265 GDEL---LLSPEFLKKCREIYSVPITALE---NTGHE-----FAIKHLNIVDPLKDNNNL 313
             +L    L    ++  R+  S    A +    +G       F I+  NIVDPL D NNL
Sbjct: 548 KSKLKAAKLDAGDVEAIRDTLSDQFGAFDAGLRSGKSAPTGLFPIRACNIVDPLDDKNNL 607

Query: 314 GRSVSKGNFHRIRCALSYGAQRLGEILT---------------LPGHCLGMGLEKFFIN- 357
            RSVS   F  ++ A      +L  +L                  G  +GM    FF   
Sbjct: 608 ARSVSAEGFPVMKRAFRLARDQLAAMLAPRTSHRDDDAELLSEETGSDVGMA---FFSRC 664

Query: 358 -TLERNGKGLRPDVQVPIHAFGSGKS 382
             L   G G RPD+ +       GK+
Sbjct: 665 WQLYGRGDGWRPDLLIHPRQIWHGKA 690


>gi|242051292|ref|XP_002463390.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
 gi|241926767|gb|EER99911.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
          Length = 208

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 56/124 (45%), Positives = 89/124 (71%), Gaps = 2/124 (1%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
           + P   +E++R++VI+Y++RLI    G EVF FGSVPL+TYLPDGD+D+TVLG+  +   
Sbjct: 84  VHPTQEAERRRQDVISYLRRLIGSSLGCEVFAFGSVPLRTYLPDGDVDITVLGNTWLNST 143

Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
               V  +LE+E +  D+ F++  + ++ A+VK++KC ++NI VD+SFNQ+ G+S  CFL
Sbjct: 144 FIDDVRSMLESEQENCDAEFKLTGLHFINAEVKLIKCIIENIIVDVSFNQIGGVSTFCFL 203

Query: 178 EKVD 181
           E ++
Sbjct: 204 ELIN 207


>gi|308163112|gb|EFO65472.1| Hypothetical protein GLP15_5146 [Giardia lamblia P15]
          Length = 719

 Score =  121 bits (304), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 134/273 (49%), Gaps = 44/273 (16%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           +R   I+  + P   SE+ R ++  +I R+I        + P+GS   K YLP  D+D+ 
Sbjct: 48  QRTDYIVSLVSPDKASEEFRLKIFTFISRVIEAVLPNTLIVPYGSFISKIYLPSSDLDIC 107

Query: 110 VLGH--------QNVEEDLARF---------------VCKILENE-DQDSVFEIKDVQYV 145
              H        Q + E L  F               V +++ +    +   E+++++++
Sbjct: 108 CYNHGLDEIPLLQKILEALTIFSDPSLRPTGVRVSPAVSQLINSRISAEERLELENIEFI 167

Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
            A+V ++KC+V  + VDIS  Q   L     +EK+ + IG+++L KRS +LI++WC YE+
Sbjct: 168 MAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEA 227

Query: 206 RILGAHYGLISTYALEMMV----------------LHRFLDYYNTFDWDNYCISINGPVA 249
           RI+G H  ++S+YAL +MV                L+ FL YY+TFD+D   +  +GP  
Sbjct: 228 RIVGGHSQMLSSYALRVMVINILLNCKDIYTPFQALYVFLAYYSTFDYDKNIVHPSGPFP 287

Query: 250 ISSLPEIVAETVENDGDELL---LSPEFLKKCR 279
                E +  T  +   E+L   +SP +L   R
Sbjct: 288 KPPAHESLTMTQLSRSMEVLESHISPSYLNHLR 320



 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 18/114 (15%)

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           E V ND D LLL    LK+   + + P   + +    F   +++IVDPL+  NNLGRSVS
Sbjct: 556 EPVINDHDILLL----LKRYFSMGTFP--DVLSDSRVFLPSYISIVDPLQVINNLGRSVS 609

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGL---------EKFFINTLERNG 363
           + NF RI  +       L +I+ +   C+   +         + FFI+TL   G
Sbjct: 610 EPNFMRITRSFQTAHIVLSDIVQM---CITGSMTITEALAEYDCFFISTLSIFG 660


>gi|159108047|ref|XP_001704297.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
 gi|157432356|gb|EDO76623.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
          Length = 512

 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 116/199 (58%), Gaps = 20/199 (10%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKI 127
           R ++I YI+  ++  +  +++ P+GS   + +LPDGDIDL ++ G  +  + LA+F   +
Sbjct: 55  RYQIIKYIRDKLHSLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAADVLAQFYIYL 114

Query: 128 LE--NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIG 185
            E     +D+ F++ ++  + A+V I++  +  + +DIS  +  GL    +L+ ++  IG
Sbjct: 115 KEVAASHEDTPFKLTNLSKIQAEVPIIRLVINGVFIDISSARPVGLVTSLYLQLLNDAIG 174

Query: 186 KDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM----------------VLHRFL 229
           +++L KRSVILI+AWC YE+ ILG+H  ++++YAL +M                VL +F 
Sbjct: 175 RNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTTFILTNSPELVHPLQVLFKFF 234

Query: 230 DYYNTFDWDNYCISINGPV 248
            +Y+ FD+ N  I+  G V
Sbjct: 235 AFYSAFDFTNNTITAFGVV 253


>gi|253742434|gb|EES99267.1| Hypothetical protein GL50581_3482 [Giardia intestinalis ATCC 50581]
          Length = 711

 Score =  119 bits (297), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 77/276 (27%), Positives = 134/276 (48%), Gaps = 41/276 (14%)

Query: 14  QEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEV 73
           + EL  S  H +S     +   +   +  A+      +R   I+  + P   SE+ R ++
Sbjct: 11  RNELHKSRDHKASEEGEDTEEAIVEAARVAQALEALSQRTDYIISLVSPDRASEEFRLKI 70

Query: 74  INYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH--------QNVEEDLARF- 123
             +I ++I+       + P+GS   K YLP  D+D+    H        Q + E L  F 
Sbjct: 71  FTFISKVIDVVLPNTLIVPYGSFISKIYLPSSDLDICCYNHSIDEIPLLQKILEALMVFS 130

Query: 124 --------------VCKILENE-DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQM 168
                         V +++ +    D   E+++++++ A+V ++KC+V  + VDIS  Q 
Sbjct: 131 DPNLQSTGTRVSPVVSQLINSHISADERLELENIEFIMAKVSLIKCTVCGLGVDISAAQP 190

Query: 169 AGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM----- 223
             L     +EK+ + IG+++L KRS +LI++WC YE+RI+G H  ++S+YAL +M     
Sbjct: 191 GSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEARIVGGHSQMLSSYALRVMIINIL 250

Query: 224 -----------VLHRFLDYYNTFDWDNYCISINGPV 248
                      VL+ FL YY+ FD+D   I  +GP+
Sbjct: 251 INCKDIYTPFQVLYVFLAYYSNFDYDRNIIHPSGPL 286



 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 62/135 (45%), Gaps = 19/135 (14%)

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
           E V ND D L L    LK  R   +   T + +    F   +++IVDPL+  NNLGRSVS
Sbjct: 546 EPVLNDHDVLFL----LK--RYFSTSTFTDVLSDSQVFLPSYISIVDPLQVTNNLGRSVS 599

Query: 319 KGNFHRIRCALSYGAQRLGEILTL---PGHCLGMGLEK---FFINTLERNG-------KG 365
           + NF RI  +       L +I+ +    G  +   L +   FFI+TL   G        G
Sbjct: 600 EPNFMRITRSFQTAHTVLSDIVQMCITGGTTITEALAEYDCFFISTLSIFGDAHAEKLHG 659

Query: 366 LRPDVQVPIHAFGSG 380
             P++ + I    +G
Sbjct: 660 DVPNLSINISVLEAG 674


>gi|308159127|gb|EFO61675.1| Topoisomerase I-related protein [Giardia lamblia P15]
          Length = 512

 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 66/217 (30%), Positives = 121/217 (55%), Gaps = 20/217 (9%)

Query: 52  RIQEILCTIQPAIVSEKKRKEVINYIQ-RLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV 110
           R+  +   + P   S   R ++I YI+ +L + +  +++ P+GS   + +LPDGDIDL +
Sbjct: 37  RLNVLYHILAPTEDSITYRYQIIKYIRDKLHDLFPELQLIPYGSFVTRIFLPDGDIDLAI 96

Query: 111 L-GHQNVEEDLARFVCKI--LENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
           + G  +  + L +F   +  +    +D+ F + ++  + A+V I++  +  I +DIS  +
Sbjct: 97  IVGEDDAADVLTQFYIHLKDIVASQEDTPFRVTNLSKIQAEVPIIRLVINGIFIDISSAR 156

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
             GL    +L+ ++  IG+++L KRSVILI+AWC YE+ ILG+H  ++++YAL +M    
Sbjct: 157 PVGLVTSLYLQLLNDAIGRNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTIFI 216

Query: 224 ------------VLHRFLDYYNTFDWDNYCISINGPV 248
                       VL +F  +Y+ FD+ N  I+  G +
Sbjct: 217 LTNSPELVHPLQVLFKFFAFYSAFDFTNNTITAFGVI 253


>gi|159115240|ref|XP_001707843.1| Hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
 gi|157435951|gb|EDO80169.1| hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
          Length = 731

 Score =  118 bits (296), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 134/273 (49%), Gaps = 44/273 (16%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           +R   I+  + P   SE+ R ++  +I ++I        + P+GS   K YLP  D+D+ 
Sbjct: 63  QRTDYIVSLVSPDKASEEFRLKIFTFISKVIEAVLPNTLIVPYGSFISKIYLPSSDLDIC 122

Query: 110 VLGH--------QNVEEDLARF---------------VCKILENE-DQDSVFEIKDVQYV 145
              H        Q + E L  F               V +++ +    +   E+++++++
Sbjct: 123 CFNHGLDEIPLLQKILEALTVFSDPSLRPTGVRVPPAVSQLINSRIPTEERLELENIEFI 182

Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
            A+V ++KC+V  + VDIS  Q   L     +EK+ + IG+++L KRS +LI++WC YE+
Sbjct: 183 MAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEA 242

Query: 206 RILGAHYGLISTYALEMMV----------------LHRFLDYYNTFDWDNYCISINGPVA 249
           RI+G H  ++S+YAL +MV                L+ FL YY++FD+D   +  +GP+ 
Sbjct: 243 RIVGGHSQMLSSYALRVMVINILLNCRDIYTPFQALYVFLAYYSSFDYDRDIVHPSGPLP 302

Query: 250 ISSLPEIVAETVENDGDELL---LSPEFLKKCR 279
                E +     +   E+L   +SP +L   R
Sbjct: 303 KPPAHESLTAAQLSGSMEVLESHISPSYLNHLR 335


>gi|253744327|gb|EET00549.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
          Length = 511

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 124/217 (57%), Gaps = 20/217 (9%)

Query: 52  RIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTV 110
           R+  +   + P+  S   R ++I YI+  ++  +  +++ P+GS   + +LPDGD+DL++
Sbjct: 37  RLNVLYHILAPSEDSISCRYQIIKYIRDELHSIFPELQLIPYGSFVTRIFLPDGDVDLSI 96

Query: 111 LGHQNVEEDL-ARFVCKILE--NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
           +  ++   D+ ++F   + E  +  + + F+I ++  + A++ I++  +  I +DIS  +
Sbjct: 97  IVAEDDANDVFSQFYTHLKEIASSQEHATFKITNLSKIQAEMSIIRLVINGIFIDISAAR 156

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
             GL    +++ ++  IG+++L KRSVIL++AW  YE+ ILG+H  ++++YAL +M    
Sbjct: 157 PTGLVTSLYIQLLNDSIGRNNLLKRSVILVQAWSLYEAHILGSHSQMLNSYALRVMTAFI 216

Query: 224 ------------VLHRFLDYYNTFDWDNYCISINGPV 248
                       VL +F  +Y+TFD+ N  I+  G +
Sbjct: 217 LTNSPELVHPLQVLFKFFAFYSTFDFTNNTITAFGVI 253


>gi|297600524|ref|NP_001049344.2| Os03g0210800 [Oryza sativa Japonica Group]
 gi|255674303|dbj|BAF11258.2| Os03g0210800 [Oryza sativa Japonica Group]
          Length = 871

 Score =  112 bits (281), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 72/116 (62%), Gaps = 1/116 (0%)

Query: 256 IVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLG 314
           + AE    D  ELLLS  FL KC   Y+V     E+ G + F  KH N++DPL+ NNNLG
Sbjct: 1   MTAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60

Query: 315 RSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           RSVSKGNF RIR A S+GA+RL ++L  P   L   + +FF NT  R+G G RPD 
Sbjct: 61  RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 116


>gi|222624434|gb|EEE58566.1| hypothetical protein OsJ_09878 [Oryza sativa Japonica Group]
          Length = 1064

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/116 (50%), Positives = 72/116 (62%), Gaps = 1/116 (0%)

Query: 256 IVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLG 314
           + AE    D  ELLLS  FL KC   Y+V     E+ G + F  KH N++DPL+ NNNLG
Sbjct: 1   MGAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60

Query: 315 RSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           RSVSKGNF RIR A S+GA+RL ++L  P   L   + +FF NT  R+G G RPD 
Sbjct: 61  RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 116


>gi|298707565|emb|CBJ30149.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
          Length = 1301

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 78/292 (26%), Positives = 122/292 (41%), Gaps = 79/292 (27%)

Query: 33  PPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPF 92
           P N  P +ID         R+ ++L  ++PA  +E  R+ V  ++ R +    G + FP 
Sbjct: 607 PSNKQPLAID--------RRVDDLLRLLRPAPRAEGYRRSVFRFVTRQVKRALGAQCFPV 658

Query: 93  GSVPLKTYLPDGDIDLTV-LGHQNVEEDLARF---VCK---------------------- 126
           G   ++ YLPD ++ ++  L H   +    R    +CK                      
Sbjct: 659 GGYAIQAYLPDEEVGISAFLCHGQEKSWFVRVNETLCKVSSEASEEAAEEEGSGTSVGGC 718

Query: 127 ------------------------ILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQN-IP 160
                                   I + E       + +V ++   +V+ +KC V N + 
Sbjct: 719 PEKITAVGEGATPKQEGGDGGGAAIRKEEGSSYRHRLSNVNFINMGRVQKIKCVVDNQVA 778

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL--ISTY 218
           VDI  NQ+  ++ +  LE+ D+L+GKDHLFKRS++LIK+W  YESR       L  I+  
Sbjct: 779 VDIGANQVGDIATVALLEETDQLLGKDHLFKRSLLLIKSWWVYESRAYTGSNMLSRITES 838

Query: 219 ALEMMVL-----HR------------FLDYYNTFDWDNYCISINGPVAISSL 253
           AL  MVL     H             F   ++ FDW  YC  I GP  + +L
Sbjct: 839 ALATMVLAVVNQHHARLHTPLQVMALFFQMHSHFDWSRYCWCIEGPRRLDTL 890


>gi|298710234|emb|CBJ26309.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1317

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 80/270 (29%), Positives = 118/270 (43%), Gaps = 71/270 (26%)

Query: 138 EIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILI 197
           EI++V  + A+  IV   V N+ VD++ NQ   ++A   LE+ D LI ++HLFKRS++L+
Sbjct: 116 EIRNVSLINARTPIVTMVVGNVVVDLTENQGGSVAASALLEEADNLIQRNHLFKRSLLLL 175

Query: 198 KAWCYYES------RILGAHYGLISTYALEMMVLH-------------------RFLDYY 232
           KAW + E+      R+LGA  G +++Y L +MVLH                   RF + Y
Sbjct: 176 KAWAWCETPRLVGNRVLGARKGGLTSYGLSVMVLHLFAASASADALVHPLDVLIRFFEVY 235

Query: 233 NTFDWDNYCISINGPVAISSL--PEIVAETVENDGDELLLSPEFLKKCREIYSVPIT--- 287
           + FDW  YC++++GPV + S+           N G      P   K   E++ VP     
Sbjct: 236 SEFDWARYCLTLDGPVPLESVRGSHPYEGWHGNGGTSSRFEPLVKKVLAELFRVPEKDRE 295

Query: 288 ----------------------------------ALENTGH-------EFAIKHLNIVDP 306
                                             AL  T          F  +  NI DP
Sbjct: 296 KEREKAKEKDKDKAGRRGRRASRVGRLSDRHAEDALPETPRFAAGAMPHFPRRDCNIQDP 355

Query: 307 LKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
           L   NNLG SV+K N   ++ AL  G ++L
Sbjct: 356 LNALNNLGHSVTKNNLKALKRALQQGQKKL 385


>gi|224064218|ref|XP_002301405.1| predicted protein [Populus trichocarpa]
 gi|222843131|gb|EEE80678.1| predicted protein [Populus trichocarpa]
          Length = 141

 Score = 90.1 bits (222), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 51/111 (45%), Positives = 73/111 (65%), Gaps = 3/111 (2%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D+E W  AEER  E++  IQP   SE+ R  V +Y+QRLI   +  +VF FGSVPLKTY
Sbjct: 31  LDSERWSKAEERTAELIACIQPNQPSEELRNAVADYVQRLIAKCFPCQVFTFGSVPLKTY 90

Query: 101 LPDGDIDLTVLGHQ-NVEEDLARFVCKIL--ENEDQDSVFEIKDVQYVPAQ 148
           LPDGDIDLT      N+++  A  V  +L  E +++++ F +K+VQY+ A+
Sbjct: 91  LPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAE 141


>gi|342184813|emb|CCC94295.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 1108

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/265 (25%), Positives = 109/265 (41%), Gaps = 80/265 (30%)

Query: 72  EVINYIQRLINGYYGIE-VFPFGSVPLKTYLPDGDIDLTV-----LGHQNVE---EDLAR 122
           E IN +       YG +  + FGS+   T LPDGD D+T+     L    +E   E    
Sbjct: 362 EAINSVAWNAGIKYGEKRYYVFGSLATGTVLPDGDNDITIEVDGLLNPAKIEIQGEAQNS 421

Query: 123 FVCKILENEDQDSVF-----------------EIKDV-----------QYVPAQVKIVKC 154
           F      +   DS+                  EI D              V A+V+++K 
Sbjct: 422 FPNGAATSSCNDSISATSSSHATAIAGGELLSEIADYLRENNASVYVDTVVVAEVRVLKL 481

Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
            +     D++  Q+ G+S + FL ++D  +G +HL KR+++L+KAWC YE+ +L    G 
Sbjct: 482 VMDGSSYDVTVGQLGGVSCIRFLHEMDMRVGCEHLLKRTLLLMKAWCCYEAHVLSGQGGY 541

Query: 215 ISTYALEMMVL-------------------------------------------HRFLDY 231
           +S+YA  +M++                                            RFL Y
Sbjct: 542 MSSYAATVMLITMINTVEFLEDVEAEGSDGKTCSNCPEGHKSEGHVQISPLQLFARFLKY 601

Query: 232 YNTFDWDNYCISINGPVAISSLPEI 256
           Y+ FD+D YC+++ GPV    + +I
Sbjct: 602 YSYFDFDRYCLTLFGPVPCDRVNQI 626


>gi|340057832|emb|CCC52183.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 1145

 Score = 85.5 bits (210), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 67/262 (25%), Positives = 109/262 (41%), Gaps = 88/262 (33%)

Query: 72  EVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTV----------LGHQNVEEDL 120
           E IN++ R     YG +  + FGS+  +T LPDGD D+T+          +G Q   +  
Sbjct: 398 EAINHVARRKGVKYGKVRYYVFGSLATRTVLPDGDNDITIDIDGLLDPVKVGPQGDTQST 457

Query: 121 AR-----FVC------------------KILEN------EDQDSVFEIKDVQYVPAQVKI 151
           ++       C                  ++L N      E+ DSVF    V       K+
Sbjct: 458 SQDGGEASGCSSEFSGASPAQAAAIAGGELLSNIADYLRENNDSVFVDAVVVAEVRVAKL 517

Query: 152 VKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAH 211
           +   +     D++  Q+ G+S + FL +VD  IG  HL KR+++L+KAWC YE+ +L   
Sbjct: 518 I---MDGNSYDVTVGQLGGVSCIRFLHQVDTKIGCGHLLKRTLLLMKAWCCYEAHVLSGQ 574

Query: 212 YGLISTYALEMMVL---------------------------------------------H 226
            G +S+YA  +M++                                              
Sbjct: 575 GGYMSSYAATVMLIAMINTIEFLEDAESEACTELEEPARTHALEGRLGALNGVSPLQLFA 634

Query: 227 RFLDYYNTFDWDNYCISINGPV 248
           RFL Y++ FD++ YC+++ GPV
Sbjct: 635 RFLKYFSCFDFERYCVTLFGPV 656


>gi|261333426|emb|CBH16421.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 1120

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 73/147 (49%), Gaps = 43/147 (29%)

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           V A+V+++K  +     DI+  Q+ G+S + FL ++D  IG +HL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542

Query: 205 SRILGAHYGLISTYALEMMVLH-------------------------------------- 226
           + +L    G IS+YA  +M++                                       
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLDERQRGEYQHISP 602

Query: 227 -----RFLDYYNTFDWDNYCISINGPV 248
                RFL Y++ FD+++YC+++ GPV
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPV 629


>gi|71748824|ref|XP_823467.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70833135|gb|EAN78639.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 1120

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 73/147 (49%), Gaps = 43/147 (29%)

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           V A+V+++K  +     DI+  Q+ G+S + FL ++D  IG +HL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542

Query: 205 SRILGAHYGLISTYALEMMVLH-------------------------------------- 226
           + +L    G IS+YA  +M++                                       
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLEERQRGEYQHISP 602

Query: 227 -----RFLDYYNTFDWDNYCISINGPV 248
                RFL Y++ FD+++YC+++ GPV
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPV 629


>gi|224135259|ref|XP_002322023.1| predicted protein [Populus trichocarpa]
 gi|222869019|gb|EEF06150.1| predicted protein [Populus trichocarpa]
          Length = 85

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 40/67 (59%), Positives = 50/67 (74%), Gaps = 9/67 (13%)

Query: 186 KDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISIN 245
           ++HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+ F  + N F          
Sbjct: 1   QNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNKF---------A 51

Query: 246 GPVAISS 252
           GP+ +S+
Sbjct: 52  GPLEVST 58


>gi|224127915|ref|XP_002320195.1| predicted protein [Populus trichocarpa]
 gi|222860968|gb|EEE98510.1| predicted protein [Populus trichocarpa]
          Length = 145

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/112 (44%), Positives = 72/112 (64%), Gaps = 3/112 (2%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +D + W  AEER  E++  IQP   SE+ R  V +Y+QRLI   +  +VF FGSVPLKTY
Sbjct: 31  LDLDRWSKAEERTAELIDCIQPNQPSEELRNAVADYVQRLILKCFPCQVFTFGSVPLKTY 90

Query: 101 LPDGDIDLTVLGHQ-NVEEDLARFVCKIL--ENEDQDSVFEIKDVQYVPAQV 149
           LPDGDIDLT      N+++  A  V  +L  E +++++ F +K+VQY+ A+ 
Sbjct: 91  LPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAEA 142


>gi|157868001|ref|XP_001682554.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68126008|emb|CAJ04245.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 1964

 Score = 79.0 bits (193), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 55/81 (67%)

Query: 145  VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
            V A+V+++K +++    DI+  Q  G++ + FL ++D +IG  H+ KR+++L+KAWC YE
Sbjct: 971  VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1030

Query: 205  SRILGAHYGLISTYALEMMVL 225
            + ILG   G I +YA  +M++
Sbjct: 1031 AHILGGQAGYIGSYAATVMLI 1051


>gi|401419332|ref|XP_003874156.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
 gi|322490390|emb|CBZ25650.1| conserved hypothetical protein [Leishmania mexicana
            MHOM/GT/2001/U1103]
          Length = 2020

 Score = 79.0 bits (193), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 55/81 (67%)

Query: 145  VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
            V A+V+++K +++    DI+  Q  G++ + FL ++D +IG  H+ KR+++L+KAWC YE
Sbjct: 1051 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1110

Query: 205  SRILGAHYGLISTYALEMMVL 225
            + ILG   G I +YA  +M++
Sbjct: 1111 AHILGGQAGYIGSYAATVMLI 1131


>gi|389601018|ref|XP_001564070.2| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|322504611|emb|CAM38122.2| conserved hypothetical protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 2016

 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 55/81 (67%)

Query: 145  VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
            V A+V+++K +++    DI+  Q  G++ + FL ++D +IG  H+ KR+++L+KAWC YE
Sbjct: 1055 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1114

Query: 205  SRILGAHYGLISTYALEMMVL 225
            + ILG   G I +YA  +M++
Sbjct: 1115 AHILGGQAGYIGSYAATVMLI 1135


>gi|224114896|ref|XP_002316887.1| predicted protein [Populus trichocarpa]
 gi|222859952|gb|EEE97499.1| predicted protein [Populus trichocarpa]
          Length = 199

 Score = 72.0 bits (175), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 42/79 (53%), Positives = 56/79 (70%), Gaps = 3/79 (3%)

Query: 104 GDIDLTVLGHQ-NVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIP 160
           GDIDLT      N+++  A  VC +LENE+ +    F +K+V+Y+ A+VKI+KC V+NI 
Sbjct: 18  GDIDLTAFSENPNLKDTWAPQVCDMLENEENNENAEFGVKEVEYIQAEVKIIKCLVENIV 77

Query: 161 VDISFNQMAGLSALCFLEK 179
           VDISFNQ+ GL  LCFLEK
Sbjct: 78  VDISFNQLGGLFTLCFLEK 96



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 8/72 (11%)

Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG--------N 321
           LS  FL+ C  IY+V     +N G  F  KH N++DPL+ NNNLG SV+K          
Sbjct: 106 LSKLFLEACSAIYAVLPAGQDNQGQPFLSKHFNVIDPLRINNNLGHSVNKDIQIELCYVV 165

Query: 322 FHRIRCALSYGA 333
            H+++C L+  A
Sbjct: 166 CHQVKCHLNTSA 177


>gi|398013931|ref|XP_003860157.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322498376|emb|CBZ33450.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 2047

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/81 (38%), Positives = 55/81 (67%)

Query: 145  VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
            V A+V+++K +++    DI+  Q  G++ + FL ++D +IG  H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117

Query: 205  SRILGAHYGLISTYALEMMVL 225
            + ILG   G I +YA  +M++
Sbjct: 1118 AHILGGQAGYIGSYAATVMLI 1138


>gi|339897903|ref|XP_001464956.2| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399300|emb|CAM67197.2| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 2047

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 31/81 (38%), Positives = 55/81 (67%)

Query: 145  VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
            V A+V+++K +++    DI+  Q  G++ + FL ++D +IG  H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117

Query: 205  SRILGAHYGLISTYALEMMVL 225
            + ILG   G I +YA  +M++
Sbjct: 1118 AHILGGQAGYIGSYAATVMLI 1138


>gi|452823525|gb|EME30535.1| nucleotidyltransferase [Galdieria sulphuraria]
          Length = 1412

 Score = 68.9 bits (167), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/274 (21%), Positives = 109/274 (39%), Gaps = 77/274 (28%)

Query: 52  RIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDG--DIDLT 109
           R+   L    P   SE +R+ V   +  +I    G + F +GS   KTY  D   +I   
Sbjct: 700 RVNRFLDVCVPTSFSELRREAVFRVVASIIKRSIGAQAFCYGSFATKTYHADSILEIGAF 759

Query: 110 VLGHQNVEED----LARFVCK--ILENEDQDSVFE----------------IKDVQYV-- 145
           ++G  +   +    L   +C+   L ++   S  E                ++++ Y   
Sbjct: 760 LVGKNDTAAEWSAKLMAALCEDATLASDHSSSSLEFSYLSLIQQKHPVPLPVRNISYFRP 819

Query: 146 ----------PAQVKIVKCSVQ--------------------NIPVDISFNQMAGLSALC 175
                     PA    V   ++                    N+ V ++ N +AG+   C
Sbjct: 820 KPTPSGCQPPPAVTFTVNWPIEDPRSGLVALDTNSTERDIAPNVRVSVTLNHVAGIHTAC 879

Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV----------- 224
            LE+ D  +G++HLFKRS++L++ W  Y  ++      ++ + A+E++V           
Sbjct: 880 VLEEFDHAMGRNHLFKRSLLLVRTWVDYGVKLT----DILPSRAVEVLVVFVANCFHSSI 935

Query: 225 ------LHRFLDYYNTFDWDNYCISINGPVAISS 252
                 L+RFL Y+  FDW  + +   G + +++
Sbjct: 936 ETPFDLLYRFLTYFVHFDWRKFGLCETGIIDLAT 969


>gi|224064842|ref|XP_002301578.1| predicted protein [Populus trichocarpa]
 gi|222843304|gb|EEE80851.1| predicted protein [Populus trichocarpa]
          Length = 60

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 30/47 (63%), Positives = 37/47 (78%)

Query: 137 FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRL 183
           F +K V+Y+ A+VKI+KC V+NI VDISFNQ+ GL  LCFLEKV  L
Sbjct: 13  FRVKKVEYIQAEVKIIKCLVKNIVVDISFNQLGGLFTLCFLEKVSAL 59


>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
 gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
          Length = 604

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 86/320 (26%), Positives = 138/320 (43%), Gaps = 55/320 (17%)

Query: 24  TSSSSSYSSPPN-LYPFSIDAEL--------WLLAEERIQEILCTIQPAIVSEKKRKEVI 74
           T S++  S+ PN LYP+  D +         WL  E  +++ +  I P+      R  VI
Sbjct: 133 TGSTNEPSTEPNSLYPWIKDHDHSTQKEIADWLTME--MKDFVSYISPSRAEIVTRNNVI 190

Query: 75  NYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENE 131
           N ++R ++ ++ G E   FGS     YLP  DID+ V+      E+ +R   +   L  +
Sbjct: 191 NTLKREVSSFWPGTEAHVFGSCATDLYLPGSDIDMVVISSTGDYENRSRLYQLSSFLRAK 250

Query: 132 DQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHL 189
           +     E+       A+V I+K      N+P+DISF +  GL A     ++ R +     
Sbjct: 251 NLAKNVEV----IASAKVPIIKFVDPESNLPIDISFERTNGLDA---ARRIRRWLLATPG 303

Query: 190 FKRSVILIKAWCYYESRIL-GAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPV 248
            +  V+++K   +  SR L   H G +  YA  +M  H F+  +         IS N   
Sbjct: 304 LRELVLVVKQ--FLRSRKLNNVHVGGLGGYATIIMCYH-FMQLHPK-------ISTNTMN 353

Query: 249 AISSLPEIVAETVENDG-----DELLLS-------PEFLKKCREIYSVPITALENTGHEF 296
           A  +L  ++ E  E  G     D L++S       P +L K R        +L    + F
Sbjct: 354 APDNLGVLLIEFFELYGRNFSYDNLIISIDSETQLPRYLHKGRH------PSLNTARNTF 407

Query: 297 AIKHLNIVDPLKDNNNLGRS 316
           +I    I DP   +NN+ RS
Sbjct: 408 SIV---IQDPADPSNNITRS 424


>gi|407407321|gb|EKF31173.1| hypothetical protein MOQ_004991 [Trypanosoma cruzi marinkellei]
          Length = 1349

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 55/81 (67%)

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           V A+V+++K  ++    DI+  Q+ G+  + FL+++D LIG  HL KR+++L+KAWC YE
Sbjct: 594 VFAEVRVLKLVMEGSCFDITVGQLGGVECVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 653

Query: 205 SRILGAHYGLISTYALEMMVL 225
           + IL    G +S+YA  +M++
Sbjct: 654 AHILSGQGGYLSSYAATIMLI 674


>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 876

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 71/140 (50%), Gaps = 17/140 (12%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
           I   +  IQP     + R+ +I  I++ ++  +   +V PFGS   K YLP GDIDL ++
Sbjct: 76  IGSFVAYIQPTHEEHQLRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLPAGDIDLVII 135

Query: 112 GHQNVEEDLARFVCK----ILENE-DQDSVFEIKDVQYVPAQVKIVKCSV--QNIPVDIS 164
             Q + E  +R + K    I EN   QD V   K      A+V I+K      NI VDIS
Sbjct: 136 STQMMNEQKSRILYKLAPLIRENNIGQDVVVIAK------AKVPIIKFKTIFGNINVDIS 189

Query: 165 FNQMAGLSALCFLEKVDRLI 184
            NQ  G+ A   ++KV+ L+
Sbjct: 190 INQTNGIVA---MKKVNELL 206


>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
          Length = 422

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 147/344 (42%), Gaps = 46/344 (13%)

Query: 28  SSYSSPPNLYPFSIDAELWLLA--EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY 85
           S Y    N  P+ +   +W +    E I++ +  +QP    +  R +V+  I++++   +
Sbjct: 65  SYYIRHRNRVPWKLRRYVWGVVGLHEEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKELW 124

Query: 86  -GIEVFPFGSVPLKTYLPDGDIDLTVLGHQN---VEEDLARFVCKILENEDQDSVFEIKD 141
              ++  FGS     YLPDGDID+ + G      + +   + V + +  E+  +V E   
Sbjct: 125 PSAKLETFGSYNTGLYLPDGDIDMVIQGQWEQLPMWQLRNKLVERRIAREENITVIE--- 181

Query: 142 VQYVPAQVKIVKCSVQN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA 199
                A V I+K    N  + VDISFN   G  A   ++K    + +    K+ V+L+K 
Sbjct: 182 ----KAVVPIIKLIESNTLVHVDISFNTSNGREAAALVKK---YMAEYPNLKQLVVLLKY 234

Query: 200 WCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
                 R L   + G + +YAL ++V++ FL  ++           N      +L  ++ 
Sbjct: 235 --ILNHRGLNEVWKGGLGSYALTLLVVN-FLQQHS---------RKNAKEDGENLGVLLL 282

Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKH--LNIVDPLKDNNNLGRS 316
           E  E  G +       ++   E   +PI  L    +    K+  L I DPL   N++GRS
Sbjct: 283 EFFELYGRQFNYETCGIRIRDEAGYIPIDTLRKQMNAHGTKYGPLCIEDPLNTTNDVGRS 342

Query: 317 VSKGNFHRIRCALSYGAQRLGEIL-----------TLPGHCLGM 349
             +  +  ++    +  ++L + L           TL GH LG+
Sbjct: 343 TFQ--WKHVQACFDHCCRKLKKALEEQPDPAMRGGTLIGHILGV 384


>gi|71652853|ref|XP_815075.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70880102|gb|EAN93224.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1276

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 55/81 (67%)

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           V A+V+++K  ++    DI+  Q+ G+  + FL+++D LIG  HL KR+++L+KAWC YE
Sbjct: 565 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 624

Query: 205 SRILGAHYGLISTYALEMMVL 225
           + IL    G +S+YA  +M++
Sbjct: 625 AHILSGQGGYLSSYAATIMLI 645


>gi|71408844|ref|XP_806800.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870651|gb|EAN84949.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 1239

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 55/81 (67%)

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           V A+V+++K  ++    DI+  Q+ G+  + FL+++D LIG  HL KR+++L+KAWC YE
Sbjct: 568 VVAEVRVLKLVMEGGCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 627

Query: 205 SRILGAHYGLISTYALEMMVL 225
           + IL    G +S+YA  +M++
Sbjct: 628 AHILSGQGGYLSSYAATIMLI 648


>gi|407846652|gb|EKG02680.1| hypothetical protein TCSYLVIO_006286 [Trypanosoma cruzi]
          Length = 893

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 55/81 (67%)

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           V A+V+++K  ++    DI+  Q+ G+  + FL+++D LIG  HL KR+++L+KAWC YE
Sbjct: 222 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 281

Query: 205 SRILGAHYGLISTYALEMMVL 225
           + IL    G +S+YA  +M++
Sbjct: 282 AHILSGQGGYLSSYAATIMLI 302


>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
          Length = 360

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/174 (29%), Positives = 84/174 (48%), Gaps = 18/174 (10%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLG---HQN 115
           I P++   K R+  I  I+R I   +   EVF FGS   + YLPDGDIDL V+    +Q 
Sbjct: 86  ISPSLTEHKTREYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGDIDLVVMRKSVNQY 145

Query: 116 VEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAGLSA 173
            ++ +   +  +L   +     ++       A+V I+K   S    P+DIS NQ  G+ A
Sbjct: 146 NKQSMLHTMASMLRQANLAQSIQV----ISKARVPIIKFTSSFGGYPIDISLNQTNGVDA 201

Query: 174 LCFL-EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
              + E +DR      L     +L+K  C+   R +   Y G +S+Y++  +V+
Sbjct: 202 GRMVNEILDRYPAARPL----SMLLK--CFLSQRSMNEVYTGGVSSYSVICLVV 249


>gi|354544020|emb|CCE40742.1| hypothetical protein CPAR2_107770 [Candida parapsilosis]
          Length = 608

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 122/286 (42%), Gaps = 40/286 (13%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  +++ +  I P+      R  VIN +++ I+ ++ G E   FGS     YLP  
Sbjct: 160 WLTME--MKDFVRYISPSKAEIITRNNVINTLKKEISSFWPGTEAHVFGSCATDLYLPGS 217

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID+ V+      E+ +R   +   L  ++     E+     VP  +K V     N+PVD
Sbjct: 218 DIDMVVISSTGDYENRSRLYQLSSFLRVKNLAKNVEVIANAKVPI-IKFVDPD-SNLPVD 275

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           ISF +  GL A   + K   L+    L  R ++L+        ++   H G +  YA  +
Sbjct: 276 ISFERTNGLDAARRIRKW--LLATPGL--RELVLVVKQFLRSRKLNNVHVGGLGGYATII 331

Query: 223 MVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG-----DELLLS------ 271
           M  H F+  +         IS N   A  +L  ++ E  E  G     D L++S      
Sbjct: 332 MCYH-FMQLHPK-------ISTNTMDAPDNLGVLLIEFFELYGRNFSYDNLIISIDPETQ 383

Query: 272 -PEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
            P +L K R     PI  L    + F+I    + DP   +NN+ RS
Sbjct: 384 LPRYLLKGRH----PI--LSTARNTFSIV---VQDPADSSNNITRS 420


>gi|449533401|ref|XP_004173664.1| PREDICTED: uncharacterized LOC101209112 [Cucumis sativus]
          Length = 831

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/51 (49%), Positives = 33/51 (64%)

Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
           GNF RIR A ++GA+RL  +   P   +   L +FF+NT ER+G G RPDV
Sbjct: 5   GNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERHGSGQRPDV 55


>gi|325181595|emb|CCA16045.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
 gi|325191995|emb|CCA26462.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
          Length = 494

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 94/205 (45%), Gaps = 19/205 (9%)

Query: 45  LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
           L+L   E I + +  I P     + R+ +I  ++ L++  +    V  FGS   + +LP 
Sbjct: 122 LYLCLHEEILDFVHFISPHDEELQARENLIAQMKNLVSNLWPRAAVETFGSHETQMFLPQ 181

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPV 161
            DIDL + G    +E L     + LE  D  S  E+ D     A++ IVK   +N  I V
Sbjct: 182 SDIDLVIFGAPTGKESLFVLAAE-LEARDMVSYLEVID----KARIPIVKFVDKNSAIQV 236

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           DISFN  +GL+    +++  R+        R ++L+  +   +  +     G I ++ L+
Sbjct: 237 DISFNISSGLATADLIKQYMRIFPS----FRPLVLVLKYFLAQRELNETFQGGIGSFLLQ 292

Query: 222 MMVLHRFLDY-------YNTFDWDN 239
           +MV+     Y       Y+ F ++N
Sbjct: 293 LMVVSFLQQYRRQLGTLYDDFRYNN 317


>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
 gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
           commune H4-8]
          Length = 671

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 50/184 (27%), Positives = 84/184 (45%), Gaps = 9/184 (4%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           L    I+     I P    ++ R  ++  I R+I   +   EV PFGS   K YLP GDI
Sbjct: 158 LLHTEIKAFDSWISPTPAEDEVRSMIVLLIARIIQDKFPDAEVRPFGSYGTKLYLPHGDI 217

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK----CSVQNIPVD 162
           DL V  +  +E++  + V + L +  + +      VQ + A+V I+K           +D
Sbjct: 218 DLVVQSN-TLEQNNKKTVLQRLADLIRSARLSSGKVQVIGARVPIIKFITAAEYGRFQID 276

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
           IS NQ +GL +   +    R +      +  V+++K   Y   R +   Y G + +Y++ 
Sbjct: 277 ISVNQFSGLVSSDIINGFQRGMQCPIAIRSLVLILK--LYLSQRGMNEVYTGGLGSYSIV 334

Query: 222 MMVL 225
            +VL
Sbjct: 335 CLVL 338


>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Metaseiulus occidentalis]
          Length = 491

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 94/203 (46%), Gaps = 17/203 (8%)

Query: 27  SSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG 86
           +S + SP   Y   ++     L EE I +     QP    + +R++VI  ++  I   + 
Sbjct: 69  ASPWRSPNRSYSLGLEG----LTEE-IHDFFMYAQPNAADQSRREQVIEKVRAAIREKWP 123

Query: 87  ---IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDV 142
              +EVF  GS     YLP GDID+ + G+  +   L     +++E +  + + F++ D 
Sbjct: 124 DCVVEVF--GSYKTGLYLPTGDIDMVIQGNWEIIPPLFDLERQLIEKKVGEKNTFKVLDK 181

Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
             VP  +K      + I VD+SFNQ     A  F+++  R         + + ++K +  
Sbjct: 182 ASVPL-IKFKDADTE-IRVDLSFNQANCTEAAAFVKQCCRTFPP---LAKLIFVLKQYLS 236

Query: 203 YESRILGAHYGLISTYALEMMVL 225
               +    +G IS+Y+L +M+L
Sbjct: 237 LHG-LNEVFHGGISSYSLTLMIL 258


>gi|313226931|emb|CBY22076.1| unnamed protein product [Oikopleura dioica]
          Length = 424

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 80/346 (23%), Positives = 147/346 (42%), Gaps = 48/346 (13%)

Query: 28  SSYSSPPNLYPFSIDAELWLLA--EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY 85
           S Y    N  P+ +   +W +    E I++ +  +QP    +  R +V+  I++++   +
Sbjct: 65  SYYIRHRNRVPWKLRRYVWGVVGLHEEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKELW 124

Query: 86  ---GIEVFPFGSVPLKTYLPDGDIDLTVLGHQN---VEEDLARFVCKILENEDQDSVFEI 139
               + +  F ++    YLPDGDID+ + G      + +   + V + +  E+  +V E 
Sbjct: 125 LARSLALRIFRTLNTGLYLPDGDIDMVIQGQWEQLPMWQLRNKLVERRIAREENITVIE- 183

Query: 140 KDVQYVPAQVKIVKCSVQN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILI 197
                  A V I+K    N  + VDISFN   G  A   ++K    + +    K+ V+L+
Sbjct: 184 ------KAVVPIIKLIESNTLVHVDISFNTSNGREAAALVKK---YMAEYPNLKQLVVLL 234

Query: 198 KAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
           K       R L   + G + +YAL ++V++ FL  ++           N      +L  +
Sbjct: 235 KY--ILNHRGLNEVWKGGLGSYALTLLVVN-FLQQHS---------RKNAKEDGENLGVL 282

Query: 257 VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKH--LNIVDPLKDNNNLG 314
           + E  E  G +       ++   E   +PI  L    +    K+  L I DPL   N++G
Sbjct: 283 LLEFFELYGRQFNYETCGIRIRDEAGYIPIDTLRKQMNAHGTKYGPLCIEDPLNTTNDVG 342

Query: 315 RSVSKGNFHRIRCALSYGAQRLGEIL-----------TLPGHCLGM 349
           RS  +  +  ++    +  ++L + L           TL GH LG+
Sbjct: 343 RSTFQ--WKHVQACFDHCCRKLKKALEEQPDPAMRGGTLIGHILGV 386


>gi|343427054|emb|CBQ70582.1| related to TRF4-topoisomerase I-related protein [Sporisorium
           reilianum SRZ2]
          Length = 697

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 81/199 (40%), Gaps = 33/199 (16%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ--NV 116
           + P +   + R  V+  I R I   +   EV PFGS   K YLP GD+DL V+     N+
Sbjct: 112 MAPTVAEHETRCMVVELISRAIKSQFRDAEVHPFGSQETKLYLPQGDLDLVVVSQSMANL 171

Query: 117 EEDLA-RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLS 172
               A R +   L   +  +     DVQ +  A+V I+K   +   + VDIS N   GL+
Sbjct: 172 RTQSALRTMAACLRRHNLAT-----DVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLT 226

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAH----YGLISTYALEMMVLHRF 228
              ++                   ++ W +     L  H     G I       ++L  F
Sbjct: 227 TASYVNS----------------WLRKWPHIRISFLQLHPKVQRGEIEADRSLGVLLLEF 270

Query: 229 LDYY-NTFDWDNYCISING 246
           L+ Y   F +DN  ISI G
Sbjct: 271 LELYGKNFGYDNCGISIRG 289


>gi|149244754|ref|XP_001526920.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449314|gb|EDK43570.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 664

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 79/282 (28%), Positives = 123/282 (43%), Gaps = 32/282 (11%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  +++ +  I P+      R +V+N ++  I  ++ G E   FGS     YLP  
Sbjct: 222 WLTME--MKDFVNYISPSSEEIVIRNKVVNTLKTQIALFWPGTEAHVFGSSATDLYLPGS 279

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID+ VL      E+ +R   +   L+ +   +  E+     VP  +K V     NI VD
Sbjct: 280 DIDMVVLSDTGDYENRSRLYQLSSFLKAKKLATNVEVIASAKVPI-IKFVDPD-SNIHVD 337

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL-GAHYGLISTYALE 221
           ISF +  GL A     ++ R +      +  V+++K   +  SR L   H G +  YA  
Sbjct: 338 ISFERKNGLDA---ARRIRRWLASTPGLRELVLVVKQ--FLRSRKLNNVHVGGLGGYAT- 391

Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLS-------PEF 274
           +++ + FL  +     ++     N  V +    E+       D   L++S       P +
Sbjct: 392 IIICYHFLRLHPKLSTESMSSLDNLGVLLIEFFELYGRNFSYDN--LIISLDPKTEEPRY 449

Query: 275 LKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
           L K R  YSV  TA  NT   FAI    + DP   NNN+ RS
Sbjct: 450 LPKGR--YSVLNTA-RNT---FAIV---VQDPADPNNNITRS 482


>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
           98AG31]
          Length = 987

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 19/146 (13%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
           L AE  I   +  I+P    ++ R  +I  I++ +   +   +V PFGS   K YLP GD
Sbjct: 234 LTAE--IGSFVAYIRPTREEDELRLMIIEMIRKAVTMQWPDADVVPFGSFGTKLYLPGGD 291

Query: 106 IDLTVLGHQNVEEDLARFVCKIL-----ENEDQDSVFEIKDVQYVPAQVKIVKCSV--QN 158
           IDL +L  + +++  ++ + ++      +N  QD V   K      A+V I+K      N
Sbjct: 292 IDLVILSTRMMKDAKSKILYRLAPLLREQNIGQDVVVIAK------AKVPIIKFKTIFGN 345

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLI 184
             VDIS NQ  GL A   LEKV+ L+
Sbjct: 346 FQVDISINQSNGLVA---LEKVNELL 368


>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
 gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
          Length = 527

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 14/159 (8%)

Query: 73  VINYIQR-LINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
           VI+ +QR L + +    V+ FGS   + YLP GDIDL VL   NV  D+ R +  + E  
Sbjct: 2   VISLLQRALCSKWPDARVYSFGSQDTQLYLPQGDIDLVVLS--NVMNDMPREIT-LSEMA 58

Query: 132 DQDSVFEIK-DVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKD 187
                +++   VQ +  A+V I+K  C      VDIS NQ  GL A  F   V+  + K 
Sbjct: 59  ACLRSYQLAIHVQVLARAKVPIIKFVCPYGQFNVDISINQANGLQASKF---VNGWLKKQ 115

Query: 188 HLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
              +  V++IK   + + R L   Y G + +Y++ +MVL
Sbjct: 116 PAIRPLVMVIKQ--FLQQRALSEVYTGGLGSYSVTLMVL 152


>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
 gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
          Length = 730

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 91/219 (41%), Gaps = 41/219 (18%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVE- 117
           + P     + R  VI  I R I   +   EV+PFGS   K YLP GD+DL V+ +     
Sbjct: 110 MTPTAAEHETRCMVIELISRAIKSQFRDAEVYPFGSQETKLYLPQGDLDLVVVSNSMANL 169

Query: 118 --EDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLS 172
             +   R +   L   +  +     DVQ +  A+V I+K   +   + VDIS N   GL+
Sbjct: 170 RVQSALRTMAACLRRHNLAT-----DVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLT 224

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV-----LH- 226
              ++    R     H+  R +IL+  +   +  +     G + +Y++ +MV     LH 
Sbjct: 225 TASYVNSWLRKW--PHI--RPLILVVKYLLMQRGMSEVFSGGLGSYSVIIMVISFLQLHP 280

Query: 227 ------------------RFLDYY-NTFDWDNYCISING 246
                              FL+ Y   F +DN  ISI G
Sbjct: 281 KVQRGEIDADRSLGVLLLEFLELYGKNFGYDNCGISIRG 319


>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 752

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 69/143 (48%), Gaps = 5/143 (3%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVP 96
           P+S D  L +L +E I + +  ++P       RK  I  +++++   +   EV  FGS  
Sbjct: 169 PYSNDT-LTMLNQE-IDDYVDYVRPTEAEHSLRKLTIARVRKIVKQIWADAEVHVFGSFQ 226

Query: 97  LKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV 156
            K YLP  D+D+ V+G   V     R + K  E  D  S  E+ +   VP    + K + 
Sbjct: 227 TKLYLPSSDVDIVVVGDSCVLPKCLRQLAKAFEKADTLSRMEVIEKTKVPIIKGVDKLT- 285

Query: 157 QNIPVDISFNQMAGLSALCFLEK 179
            +  +DISFN + G+ +   +++
Sbjct: 286 -HFSLDISFNMVNGIKSANIVKR 307


>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Oreochromis niloticus]
          Length = 672

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 116/276 (42%), Gaps = 34/276 (12%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    I P    EK R EV++ I+ +I+  +   EV  FGS     YLP  DIDL 
Sbjct: 182 EEIIDFYNYISPRPEEEKMRLEVVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDIDLV 241

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +   K + +E+   V +        A V I+K   S   + V
Sbjct: 242 VFGKWESLPLWTLEEALRK---KNVADENSIKVLD-------KATVPIIKLTDSYTEVKV 291

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
           DISFN M+G+ A   +++      K  +    V+++K +     R L   + G I +Y+L
Sbjct: 292 DISFNVMSGVKAARLIKEFKE---KYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 346

Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
            +M +     +Y     D + ++IN  V +    E+             L      K   
Sbjct: 347 FLMAVSFLQLHYRE---DVFGLNINIGVLLIEFFELYGRNFN------YLKTGIRIKDGG 397

Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
            Y       +N    +    L I DPL+ +N++GRS
Sbjct: 398 CYVAKDEVQKNMLDGYRPSMLYIEDPLQPDNDVGRS 433


>gi|346972692|gb|EGY16144.1| Poly(A) RNA polymerase cid14 [Verticillium dahliae VdLs.17]
          Length = 726

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 98/226 (43%), Gaps = 26/226 (11%)

Query: 42  DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQ---RLINGYYGIEVFPFGSVPLK 98
           D  +WL  E  I +    ++P    ++ R E+I  I+   R    Y G EV PFGS    
Sbjct: 405 DMSVWLHKE--IVDFYEHVRPRAFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSG 462

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVP-AQVKIV 152
            YLP  D+D+ +   + +   +  F     + K      Q+ + +   V+ +  A+V +V
Sbjct: 463 LYLPTADMDIVICSKEWLSGRMTAFPGGSSLYKFRAFLTQNRLADPSSVEVIAKARVPLV 522

Query: 153 KC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K   +V  + VDISF++M G +A+  FL   ++      L    V +IK +         
Sbjct: 523 KYIDAVTGLRVDISFDRMDGPAAIKTFLNWKEQYPALPIL----VTIIKHFLAMRGLNEP 578

Query: 210 AHYG--------LISTYALEMMVLHRFLDYYNTFDWDNYCISINGP 247
            + G        L+  + L  M++  F  Y N FD+    I IN P
Sbjct: 579 VNGGIGSFSSKNLVPEHHLGEMLMEFFDLYGNRFDYKTTAIRINPP 624


>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
          Length = 1487

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 9/183 (4%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYLPDGDIDLTVL 111
           ++  +  I P     + R  V+  I R +   +   EV PFGS   K YLP GDIDL V+
Sbjct: 160 VEAFVNYISPTPEENEVRSLVVALITRAVTQAFPDAEVHPFGSYDTKLYLPVGDIDL-VV 218

Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIPVDISFNQMA 169
             Q++       V   + N  + +    +      A+V IVK      NIPVDIS NQ  
Sbjct: 219 HSQSMAYSKKEAVLHSIANTMKRAGITDRVRIISKAKVPIVKFVTLHGNIPVDISINQGN 278

Query: 170 GLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFL 229
           G++A   ++     + +    +  V+++K++    S +   + G + +Y++  +V+  FL
Sbjct: 279 GVTAGTMIK---HFLAELPALRSLVLIVKSFLSQRS-MNEVYTGGLGSYSIVCLVIS-FL 333

Query: 230 DYY 232
             +
Sbjct: 334 QMH 336


>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
          Length = 897

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 78/300 (26%), Positives = 136/300 (45%), Gaps = 24/300 (8%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDID 107
           E I ++   I+P  +  + R +V   ++  ++  +    I++  FGS+    +LP  DID
Sbjct: 84  EEIVDLYHWIKPNEIEVRLRTKVYEKVRDSVSQRWQHKPIKISMFGSLRTNLFLPTSDID 143

Query: 108 LTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISF 165
           + V     V    D      + LEN++      +    +VP  VK+V    + + +DISF
Sbjct: 144 VLVECDDWVGTPGDWLGETARGLENDNIAESVTVFGGAFVPI-VKMVDRDTR-LSIDISF 201

Query: 166 NQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMV 224
           N + G+ A  ++ KV        L +  V+L+K + +Y  R L   + G +S+Y L +++
Sbjct: 202 NTVQGVRAASYIAKVKEEFP---LIEPLVLLLKQFLHY--RNLNQTFTGGLSSYGLVLLL 256

Query: 225 LHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYS 283
           ++ F  Y  N      Y   +N    +    E+   ++E + +E+ +SP    +C   Y 
Sbjct: 257 VNFFQLYALNMRHRTIYDSGVNLGHLLLRFLEVY--SMEFNYEEIGISP---GQC--CYI 309

Query: 284 VPITALENTGHEFAI-KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTL 342
               A    GH+ A   +L + DPL   N++GRS    NF  I  A     Q L   +TL
Sbjct: 310 SKSAAGARYGHKRAQPGNLALEDPLLTANDVGRSTY--NFSSIANAFGQAFQILAVAVTL 367


>gi|294654384|ref|XP_456434.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
 gi|199428840|emb|CAG84386.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
          Length = 600

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/232 (23%), Positives = 101/232 (43%), Gaps = 25/232 (10%)

Query: 9   EMEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAEL--------WLLAEERIQEILCTI 60
           + +D  ++ + S   + +  + ++  +LYP+  D +         W   E  I++ +  I
Sbjct: 126 DYDDYSDDGVLSDDESGTIHTQTTTKSLYPWIKDHDHSKQKEIADWFTME--IKDFVNYI 183

Query: 61  QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
            P+      R  V+N +++ I  ++   E+  FGS     YLP  DID+ V+      E 
Sbjct: 184 SPSEAEIMTRNRVVNQLKQQIGQFWPATELHVFGSCATDLYLPGSDIDMVVVSETGDYEH 243

Query: 120 LARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALC 175
            +R   +   L N+      E+       A+V I+K      NI +DISF +  G+ A  
Sbjct: 244 RSRLYQLSSFLRNKKLAKNIEV----IAKAKVPIIKFVDPTSNIHIDISFERTNGIDA-- 297

Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL-GAHYGLISTYALEMMVLH 226
             +K+ R +      +  V+++K   +  SR L   H G +  Y+  ++  H
Sbjct: 298 -AKKIRRWLSSTPGLRELVLIVKQ--FLRSRKLNNVHVGGLGGYSTIILCYH 346


>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
 gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 781

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 11/138 (7%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  I +    I+P    ++ R+EV++ I R +   +    V+PFGS P   YLP G
Sbjct: 448 WLHKE--IIDFYEYIKPRAFEKRIRQEVLDEINRFVRSTFPDAGVYPFGSFPSGLYLPTG 505

Query: 105 DIDLTVLGHQ-----NVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKC--SV 156
           D+D+ +   Q       + D  R + ++ +   Q  +    +V+ +  A+V +VK   S 
Sbjct: 506 DMDMVLCSDQYKRNYRAKYDTRRTMYRLSDALKQQKLAFQNEVEIIAFAKVPLVKWVDSR 565

Query: 157 QNIPVDISFNQMAGLSAL 174
             + +D+SF    GL A+
Sbjct: 566 TGLKIDVSFENDTGLQAI 583


>gi|116201727|ref|XP_001226675.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
 gi|88177266|gb|EAQ84734.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
          Length = 780

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/241 (22%), Positives = 103/241 (42%), Gaps = 32/241 (13%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLK 98
           S +  +WL  E  + +    I+P    E+ R E++  +++     +   E++PFGS P  
Sbjct: 405 SANMAVWLHKE--VVDFYEYIKPRDFEERLRGELVRDLKQFCRKVFRDAEMYPFGSFPSG 462

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIK-----DVQYVP-AQVKIV 152
            YLP GD+D+  +    ++  + ++  K      +  +   K     +++ +P A+V +V
Sbjct: 463 LYLPTGDMDMAFMSDGYMKGGVPKYSTKNTLYRLRGQLMNHKVAWEDEIEVIPSAKVPLV 522

Query: 153 KC--SVQNIPVDISFNQMAGLSALC-FLEKVDRLIGKDHLFK--RSVILIKAWCYYESRI 207
           K       + VD+SF   +G++A+  F    DR  G   L    +  +L++      +  
Sbjct: 523 KFIEHKTGLKVDVSFENNSGVTAIATFKAWRDRYPGMPALVTLVKHFLLMRGLNEPVNGG 582

Query: 208 LGAHY------------------GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVA 249
           +G                      L + + L  ++LH F  Y N F++    IS+N P  
Sbjct: 583 IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISLNPPHW 642

Query: 250 I 250
           I
Sbjct: 643 I 643


>gi|348687890|gb|EGZ27704.1| hypothetical protein PHYSODRAFT_343641 [Phytophthora sojae]
          Length = 501

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 74/334 (22%), Positives = 127/334 (38%), Gaps = 60/334 (17%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKT 99
            ++ ++    E I + +  I P       R E+I  ++ ++ G +    V  FGS   + 
Sbjct: 124 FNSNVYACLHEEIMDFVSFISPTEQELSSRAELIEEMREIVKGLWPEATVETFGSHYTQM 183

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNI 159
           +LP  DID+ + G    +  L + + + LE ++  S  E+ D   +P    + K S  +I
Sbjct: 184 FLPQSDIDMVLFGVPEGKAPLFK-LAQCLEEKELVSYLEVIDKARIPIVKMVHKAS--DI 240

Query: 160 PVDISFNQMAGLSALCFLEKVDR---------LIGKDHLFKRSV--------------IL 196
            VD+SFN   GL+    ++   R         L+ K  + +R +              ++
Sbjct: 241 HVDVSFNVAGGLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETYTGGVGSFLLQMM 300

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
           + ++  +  R LGA +       L  +++  F  Y   F++ +  IS             
Sbjct: 301 VVSFLQHHGRALGAEHDDPKFNNLGQLLMGFFTLYGRDFNYTDLAIS------------- 347

Query: 257 VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKD--NNNLG 314
               V N G        F K  R  Y        + G  F I   N  +P  D   N+  
Sbjct: 348 ----VRNGGS------YFYKDDRRWY--------DDGRPFLISMENPNEPSLDIGKNSYE 389

Query: 315 RSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLG 348
               K +F   R  L +   R G+   LPG  LG
Sbjct: 390 MRTVKRSFDYARLVLQHEINRHGQFNALPGSILG 423


>gi|348512463|ref|XP_003443762.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
          Length = 789

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 113/278 (40%), Gaps = 36/278 (12%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P    E  R++V+N I+ +I   +  ++V  FGS     YLP  DIDL
Sbjct: 232 HEEILDFFSFMSPKPEEESMRRDVVNRIEGIIKDLWPTVQVEIFGSFSTGLYLPTSDIDL 291

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
            V G+      Q +E+ L +       N       ++ D   VP  +K+  C  + + VD
Sbjct: 292 VVFGNWDHPPLQELEQALKK------HNVSGSHPIKLLDKATVPI-IKLTDCETR-VKVD 343

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV----ILIKAWCYYESRILGAHYGLISTY 218
           ISFN    + A  F+        K +L K  V    I +      +  +     G IS+Y
Sbjct: 344 ISFNIETAVKAAQFI--------KSYLKKYPVLPPLIFVLKQFLLQRELNEVFTGGISSY 395

Query: 219 ALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKC 278
           +L +M +  FL  +   D      +IN  + +    E+     +     + +       C
Sbjct: 396 SLILMAI-SFLQLHPRIDTSR--PNINLGILLIEFFELYGRDFDYIKTAIRVKNGGAYLC 452

Query: 279 REIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
           +E        L+  G+      L I DP++  N++GRS
Sbjct: 453 KE------EMLKEMGNGNRPSMLCIEDPVQPGNDVGRS 484


>gi|302405651|ref|XP_003000662.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
 gi|261360619|gb|EEY23047.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
          Length = 723

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 26/226 (11%)

Query: 42  DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQ---RLINGYYGIEVFPFGSVPLK 98
           D  +WL  E  I +    ++P    ++ R E+I  I+   R    Y G EV PFGS    
Sbjct: 402 DMSVWLHKE--IVDFYEHVRPRDFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSG 459

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVP-AQVKIV 152
            YLP  D+D+ +   + +   +  F     + K      Q+ + +   V+ +  A+V +V
Sbjct: 460 LYLPTADMDIVICSKEWLSGRMTAFPGGSSLYKFRGFLTQNRLADPSSVEVIAKARVPLV 519

Query: 153 KC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K   +V  + VDISF++M G +A+  FL+  ++      L    V +IK +         
Sbjct: 520 KYIDAVTGLRVDISFDRMDGPAAIKTFLDWKEQYPALPIL----VTIIKHFLAMRGLNEP 575

Query: 210 AHYG--------LISTYALEMMVLHRFLDYYNTFDWDNYCISINGP 247
            + G        L+  + L  M++  F  Y N FD+    I IN P
Sbjct: 576 VNGGIGSFSSKNLVPEHHLGEMLMEFFDLYGNRFDYKTTAIRINPP 621


>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
          Length = 689

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 90/219 (41%), Gaps = 41/219 (18%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ--NV 116
           + P     + R  VI  I R I   +   EV PFGS   K YLP GD+DL V+     N+
Sbjct: 109 MAPTAAEHETRCMVIELISRAIKSQFRDAEVHPFGSQETKLYLPQGDLDLVVVSRSMANL 168

Query: 117 EEDLA-RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLS 172
               A R +   L   +  +     DVQ +  A+V I+K   +   + VDIS N   GL+
Sbjct: 169 RTQSALRTMAACLRRHNLAT-----DVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLT 223

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV-----LH- 226
              F+    R     H+  R +I++      +  +     G + +Y++ +MV     LH 
Sbjct: 224 TASFVNSWLRKW--PHI--RPLIIVVKHLLMQRGMSEVFSGGLGSYSIIIMVISFLQLHP 279

Query: 227 ------------------RFLDYY-NTFDWDNYCISING 246
                              FL+ Y   F +DN  ISI G
Sbjct: 280 KVQRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 318


>gi|357491469|ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
 gi|355517357|gb|AES98980.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
          Length = 490

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 31/45 (68%)

Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
            GNF+RIR A  YGA++LG IL LP   +   L +FF NTL+R+G
Sbjct: 10  NGNFYRIRSAFKYGARKLGWILMLPEDRIADELNRFFANTLDRHG 54



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 10/131 (7%)

Query: 379 SGKSEAA----DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
           SG +EA+    DL+GDYDS+   L YG   + Y +        P    +  +R++W+ + 
Sbjct: 214 SGGTEASKSLLDLAGDYDSHIANLHYGHMCNGYPV-SPVVVPSPPRSPKFHNRNSWETVR 272

Query: 435 QYVQGKGNLVYQWGTE-VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPP 493
           Q +Q   ++  Q  +  V  P     HP    +  ++F  +E  K RGTG Y P  T   
Sbjct: 273 QCLQMNHSIHPQTNSNGVVGPLYLVNHP---TIPMASFGAEEKRKPRGTGAYFPNMT-SR 328

Query: 494 PIEINRPVRAR 504
           P   NRP+  R
Sbjct: 329 PFRDNRPMPGR 339


>gi|345313578|ref|XP_001519035.2| PREDICTED: DNA polymerase sigma-like, partial [Ornithorhynchus
           anatinus]
          Length = 641

 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 83/176 (47%), Gaps = 24/176 (13%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ +I   +   +V  FGS     YLP  DIDL V G         R   ++L
Sbjct: 111 RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG------KWERPPLQLL 164

Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
           E    + SV E K ++ +  A V I+K + Q  ++ VDISFN   G+ A  F+       
Sbjct: 165 EQALRKHSVAEPKTIKVLDKATVPIIKLTDQETDVKVDISFNMETGVKAARFI------- 217

Query: 185 GKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
            KD++ K S    +IL+      +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 218 -KDYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 271


>gi|2642156|gb|AAB87123.1| hypothetical protein [Arabidopsis thaliana]
          Length = 474

 Score = 52.4 bits (124), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 73/306 (23%), Positives = 127/306 (41%), Gaps = 44/306 (14%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
           +QEIL  I+P       R  VI+ ++ ++       G  V PFGS     +   GD+D++
Sbjct: 11  LQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDIS 70

Query: 110 V----------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQ 157
           V           G +  +  L   +  +     + S    K    + A+V I+K     Q
Sbjct: 71  VDLFSGSSILFTGKKQKQTLLGHLLRAL-----RASGLWYKLQFVIHARVPILKVVSGHQ 125

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
            I  DIS + + GL    FL  +  + G+   F+  V+L+K W    + I  +  G  ++
Sbjct: 126 RISCDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKTGTFNS 181

Query: 218 YALEMMVLHRFLDYYNTFDWDNYCI-SINGPVAI----SSLPEI--VAETVENDGDELLL 270
           Y+L ++V+  F            C+ +I  P+ +    S++ ++  V +T E    ++  
Sbjct: 182 YSLSLLVIFHF----------QTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTA 231

Query: 271 SPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALS 330
           +     K     SV  ++L      F  K   + DP +   N  RSVS+ N  RI     
Sbjct: 232 ANIARFKSERAKSVNRSSLSELLVSFFAK---VEDPFEQPVNAARSVSRRNLDRIAQVFQ 288

Query: 331 YGAQRL 336
             ++RL
Sbjct: 289 ITSRRL 294


>gi|440291374|gb|ELP84643.1| PAP-associated domain containing protein, putative [Entamoeba
           invadens IP1]
          Length = 475

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 97/212 (45%), Gaps = 35/212 (16%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI--NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVE 117
           ++P  +  + R+ V+    R+I  +    I+V PFGS   K +LP  DID TV+      
Sbjct: 27  VEPNPIEYEIRRYVLEKYTRVIENDKKSEIKVVPFGSTQSKLFLPSSDIDFTVVTKGGKT 86

Query: 118 EDLARFVCKILENEDQDSVFEIKDVQY-VPAQVKIVKCSVQN--IPVDISFNQMAGLSAL 174
             +   V +IL      S++ ++D +  + A V ++K + +   I +DIS N  +G+  +
Sbjct: 87  NMVLNSVARIL------SLYTMEDEKRALRATVPVIKLTDRETGIVLDISHNNESGVDTV 140

Query: 175 CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV---------- 224
            ++EK    +  + L +  + +IK         L A  GL  TY+L MMV          
Sbjct: 141 RWMEKE---MKSNALIRPLLFIIKTVLSSYELNLPALGGL-GTYSLFMMVFCFFREKGSD 196

Query: 225 ---------LHRFLDYYNT-FDWDNYCISING 246
                    L RFL YY T FD   + +S+ G
Sbjct: 197 LKDKRGGAILLRFLKYYATEFDSRKFGLSVTG 228


>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
 gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
          Length = 759

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 98/245 (40%), Gaps = 40/245 (16%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLK 98
           S +  +WL  E  + +    I+P    E+ R E++ +++      +   EV+PFGS P  
Sbjct: 386 SANMAVWLHKE--VVDFYEYIKPRDFEERLRGELVEHLKTFCRKTFKDAEVYPFGSFPSG 443

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKIL------ENEDQDSVFEIKDVQYVPAQVKIV 152
            YLP  D+DL  +     +  + R+  K        + ++    +E +    V A+V +V
Sbjct: 444 LYLPTADMDLAFISDSYAKGGVPRYGTKSFLYRFRSQLKNHRIAWEDEIELIVGAKVPLV 503

Query: 153 KC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           K       + VDISF    GL+A+  F    ++  G   L    V LIK +         
Sbjct: 504 KFIEHRTGLKVDISFENRTGLTAIETFKAWREQYPGMPAL----VTLIKHFLLMRGLNEP 559

Query: 210 AHYG------------------------LISTYALEMMVLHRFLDYYNTFDWDNYCISIN 245
            + G                        L + + L  ++LH F  Y N F++    IS+N
Sbjct: 560 VNGGIGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISMN 619

Query: 246 GPVAI 250
            P  I
Sbjct: 620 PPRYI 624


>gi|449707156|gb|EMD46861.1| PAPassociated domain containing protein [Entamoeba histolytica
           KU27]
          Length = 400

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 22/191 (11%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           S + EL L  E  +Q  +  I+P     K R+E++    +++    G  V  FGS   K 
Sbjct: 9   SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNVMAFGSTQSKL 65

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILEN---EDQDSVFEIKDVQYVPAQVKIVKCSV 156
           +LP  DID +VL ++     +   V  IL +   EDQ   F+        A + ++K + 
Sbjct: 66  FLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRNFK--------ASIPVLKLTD 117

Query: 157 QN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYG 213
           +   I +DIS N  +G   + F+E+V   I KD   ++ V+LIK+  C Y+        G
Sbjct: 118 KKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDFH--QPANG 172

Query: 214 LISTYALEMMV 224
            + TY++ +MV
Sbjct: 173 GLGTYSVFVMV 183


>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 419

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 70/144 (48%), Gaps = 14/144 (9%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLAR----FVCKILENEDQDSVFEIKDV 142
            +V  FGS   K +LPD DID+ VL   ++     R     + +  + E+  S  EI   
Sbjct: 106 FDVHVFGSEATKVFLPDSDIDMVVLPPTDLPLHQIRKNLFTLAEAFKQEESVSGMEIISQ 165

Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGL-SALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
             VP    IVK   QN+ VDISF+  +GL SA   LEK++ +        R +IL+  + 
Sbjct: 166 ARVP----IVKLRFQNLQVDISFSSDSGLKSARYMLEKMEAMPP-----LRPLILVLKYF 216

Query: 202 YYESRILGAHYGLISTYALEMMVL 225
             +  +   + G   ++ L++MV+
Sbjct: 217 LAQRELNQTYMGGCGSFLLQLMVI 240


>gi|67465021|ref|XP_648697.1| topoisomerase [Entamoeba histolytica HM-1:IMSS]
 gi|56464936|gb|EAL43308.1| topoisomerase, putative [Entamoeba histolytica HM-1:IMSS]
          Length = 400

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 22/191 (11%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           S + EL L  E  +Q  +  I+P     K R+E++    +++    G  V  FGS   K 
Sbjct: 9   SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNVMAFGSTQSKL 65

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILEN---EDQDSVFEIKDVQYVPAQVKIVKCSV 156
           +LP  DID +VL ++     +   V  IL +   EDQ   F+        A + ++K + 
Sbjct: 66  FLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRNFK--------ASIPVLKLTD 117

Query: 157 QN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYG 213
           +   I +DIS N  +G   + F+E+V   I KD   ++ V+LIK+  C Y+        G
Sbjct: 118 KKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDFH--QPANG 172

Query: 214 LISTYALEMMV 224
            + TY++ +MV
Sbjct: 173 GLGTYSVFVMV 183


>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
          Length = 653

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 66/135 (48%), Gaps = 15/135 (11%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I++    I P    E+ R EV+  IQR+I   +   EV  FGS     YLP  DIDL 
Sbjct: 170 EEIKDFYEYISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLV 229

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           V G+        +EE L +         D++S+ ++ D   VP  +K++    + + VDI
Sbjct: 230 VFGNWETLPLWTLEEALRKRKVA-----DENSI-KVLDKATVPI-IKLMDSHTE-VKVDI 281

Query: 164 SFNQMAGLSALCFLE 178
           SFN  +G+ A   ++
Sbjct: 282 SFNVQSGVKAANLIK 296


>gi|238609344|ref|XP_002397464.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
 gi|215471952|gb|EEB98394.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
          Length = 174

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 50/113 (44%), Gaps = 14/113 (12%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           I P+ V ++ R  ++  I   I   Y   EV PFGS   K YLP GDID+ VL   +   
Sbjct: 27  ISPSPVEDEIRSLLVQLISSAIKTRYPDAEVHPFGSYATKLYLPTGDIDIVVLSRTHT-- 84

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGL 171
               F C +     +           VP    + +  +  IPVDISFNQ  G+
Sbjct: 85  --IAFRCFVTAKLAK---------ARVPIVKFVTRVELGGIPVDISFNQPGGV 126


>gi|301093772|ref|XP_002997731.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
 gi|262109980|gb|EEY68032.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
          Length = 489

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 127/324 (39%), Gaps = 60/324 (18%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           + I + +  I P     + R +++  ++ ++ G +    V  FGS   + +LP  DID+ 
Sbjct: 123 DEIMDFVTFISPTEDELRSRAQLVEEMRGVVKGLWPEATVETFGSHYTQMFLPQSDIDMV 182

Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMA 169
           + G    +E L + + + LE +D+ S  E+ D   +P    + K S  +I VD+SFN   
Sbjct: 183 LFGVPEGKEPLYK-LAQCLEEKDRVSYLEVIDKARIPIVKMVHKGS--DIHVDVSFNVAG 239

Query: 170 GLSALCFLEKVDR---------LIGKDHLFKRSV--------------ILIKAWCYYESR 206
           GL+    ++   R         L+ K  + +R +              +++ ++  +  R
Sbjct: 240 GLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETYSGGVGSFLLQMMVVSFLQHHGR 299

Query: 207 ILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGD 266
            LGA +       L  +++  F  Y   F++ +  +S                 V N G 
Sbjct: 300 ALGAEHDDPKFNNLGQLLMGFFTLYGRDFNYTDLAVS-----------------VRNGGS 342

Query: 267 ELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKD--NNNLGRSVSKGNFHR 324
                  F K+ R  Y        + G  F I   N  +P  D   N+      K +F  
Sbjct: 343 ------YFPKEKRRWY--------DDGRPFLIAMENPNEPSLDIGKNSYEMRTVKRSFDY 388

Query: 325 IRCALSYGAQRLGEILTLPGHCLG 348
            R  L     R G+  TLPG  LG
Sbjct: 389 ARQVLQNEIYRHGQFNTLPGSILG 412


>gi|194208578|ref|XP_001491462.2| PREDICTED: PAP-associated domain-containing protein 5 [Equus
           caballus]
          Length = 528

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 61/130 (46%), Gaps = 15/130 (11%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I G +   +V  FGS     YLP  DIDL 
Sbjct: 31  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKGLWPSADVQIFGSFKTGLYLPTSDIDLV 90

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           V G         +EE L +         D+DSV ++ D   VP  +K+   S   + VDI
Sbjct: 91  VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVPI-IKLTD-SFTEVKVDI 142

Query: 164 SFNQMAGLSA 173
           SFN   G+ A
Sbjct: 143 SFNVQNGVRA 152


>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
 gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
          Length = 567

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/230 (26%), Positives = 94/230 (40%), Gaps = 21/230 (9%)

Query: 9   EMEDLQEELLTSPSHTSSSSSYSSPPNLYPF------SIDAEL--WLLAEERIQEILCTI 60
           E EDL ++ + S   + S     S  + YP+      S   E+  WL  E  I++ +  I
Sbjct: 112 EAEDLSDDGVLSDDESGSRHHSQSATSPYPWVKGHDHSKQREIADWLTME--IKDFVNYI 169

Query: 61  QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
            P+      R  VI  ++R I  ++   +   FGS     YLP  DID+ V+      E 
Sbjct: 170 SPSKEEIVVRNTVIRRLKRRIAEFWPQTQAHVFGSCATDLYLPGSDIDMVVISTTGDYEQ 229

Query: 120 LARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
             +   +   L         E+     VP  +K V     NI VDISF +  GL A   +
Sbjct: 230 RGKLYQLSSFLRTNKLAKNIEVIATAKVPI-IKFVDPQY-NIHVDISFERTNGLDAARRI 287

Query: 178 EK-VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            K +D + G      R ++LI        R+   H G +  YA  +++ H
Sbjct: 288 RKWLDSMPG-----LRELVLIVKQFLRSRRLNNVHVGGLGGYATIILMYH 332


>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
          Length = 535

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 66/135 (48%), Gaps = 15/135 (11%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I++    I P    E+ R EV+  IQR+I   +   EV  FGS     YLP  DIDL 
Sbjct: 52  EEIKDFYEYISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLV 111

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           V G+        +EE L +         D++S+ ++ D   VP  +K++    + + VDI
Sbjct: 112 VFGNWETLPLWTLEEALRKRKVA-----DENSI-KVLDKATVPI-IKLMDSHTE-VKVDI 163

Query: 164 SFNQMAGLSALCFLE 178
           SFN  +G+ A   ++
Sbjct: 164 SFNVQSGVKAANLIK 178


>gi|448097882|ref|XP_004198786.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
 gi|359380208|emb|CCE82449.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
          Length = 650

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 126/309 (40%), Gaps = 44/309 (14%)

Query: 30  YSSP--PNLYPF------SIDAEL--WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQR 79
           YS+P   N YP+      S   E+  WL  E  +++ +  I P+      R  V+  ++R
Sbjct: 162 YSAPNETNDYPWVKNHDHSTQKEIADWLTLE--MKDFVNYISPSKEEILTRNRVVKDLKR 219

Query: 80  LINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQDSV 136
            IN  +   E   FGS     YLP  DID+ V  +    E+ ++   +   L N      
Sbjct: 220 EINNLWPDTEAHVFGSSATDLYLPGSDIDMVVTSNTGDYENRSKLYQLSSYLRNRKL--- 276

Query: 137 FEIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKR 192
              KD++ +  A+V IVK      NI +DISF +  G+ A   + + +DR  G      R
Sbjct: 277 --AKDIEVIAKAKVPIVKFVDPSSNIHIDISFERRNGIEAAKRIRRWLDRTPG-----LR 329

Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
            ++LI        R+   H G +  Y+  ++  H FL  +     +N  I  N    +  
Sbjct: 330 ELVLIVKQFLRSRRLNNVHVGGLGGYSTIILCYH-FLRLHPRISTNNISILDNLGSLLIE 388

Query: 253 LPEIVAETVENDGDELLLSPE-----FLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
             E+       D   + + PE     +L K    Y  P      + + F+I    I DP 
Sbjct: 389 FFELYGRNFSYDNLIIAIDPETDEVKYLPKKDHAYLNP------SKNPFSIV---IQDPA 439

Query: 308 KDNNNLGRS 316
              NN+ RS
Sbjct: 440 DSTNNISRS 448


>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
           latipes]
          Length = 679

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 77/284 (27%), Positives = 117/284 (41%), Gaps = 50/284 (17%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I++    I P    EK R EV++ I+ +I+  +   EV  FGS     YLP  DIDL 
Sbjct: 188 EEIKDFYDYISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLV 247

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +       N    S  ++ D   VP    I+K   SV  + V
Sbjct: 248 VFGKWETLPLWTLEEALRK------RNVADKSAIKVLDKATVP----IIKLTDSVTEVKV 297

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
           DISFN  +G+ A   +++      K  +    V+++K +     R L   + G I +Y+L
Sbjct: 298 DISFNVESGVKAARLIKEFKE---KYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 352

Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSP 272
            +M +  FL  +  F  D    +IN  V +    E+               DG   +   
Sbjct: 353 FLMAVS-FLQLH--FREDVCSPNINIGVLLIEFFELYGRHFNYLKTGIRIKDGGSYVAKD 409

Query: 273 EFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
           E  K   + Y   +              L I DPL+ +N++GRS
Sbjct: 410 EVQKNMMDGYRPSM--------------LYIEDPLQPDNDVGRS 439


>gi|410911160|ref|XP_003969058.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
          Length = 803

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/193 (27%), Positives = 86/193 (44%), Gaps = 12/193 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E + +    + P       RKEV+N I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 255 HEEVIDFYNFMSPRPEEAAMRKEVVNRIETIIKELWPTADVQIFGSFSTGLYLPTSDIDL 314

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +    F IK +    A V I+K + Q   + VDISFN
Sbjct: 315 VVFG--KWERPPLQELEQALRKHNVAEPFSIKVLDK--ATVPIIKLTDQETEVKVDISFN 370

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
              G+ A  F++   ++     +    + ++K +   +  +     G IS+Y+L +MV+ 
Sbjct: 371 VETGVKAASFIKDYVKMYP---VLPYLIFVLKQFL-LQRDLNEVFTGGISSYSLILMVI- 425

Query: 227 RFLDYYNTFDWDN 239
            FL  +   D  N
Sbjct: 426 SFLQLHPRIDARN 438


>gi|348512677|ref|XP_003443869.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
          Length = 805

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 92/199 (46%), Gaps = 24/199 (12%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E + +    I P       RKEV+N I+ +I   +  ++V  +GS     YLP  DIDL
Sbjct: 254 HEEVMDFYKYISPRPEEAAMRKEVVNRIEVVIKELWPTVDVQIYGSFSTGLYLPTSDIDL 313

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIP 160
            V G       Q +E+ L +      E  D +S+ ++ D   VP    I+K + Q  ++ 
Sbjct: 314 MVTGKWERPPLQELEQALRKH-----EVADPNSI-KVLDKATVP----IIKLTDQRTDVK 363

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           VDISFN   G+ A  F++     + K  +    + ++K +   +  +     G IS+Y+L
Sbjct: 364 VDISFNVETGVKAASFIKG---YMKKYPVLPYLIFVLKQFL-LQRDLNEVFTGGISSYSL 419

Query: 221 EMMVLHRFLDYYNTFDWDN 239
            +MV+  FL  +   D  N
Sbjct: 420 ILMVI-SFLQLHPRIDPTN 437


>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 603

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 59/231 (25%), Positives = 98/231 (42%), Gaps = 25/231 (10%)

Query: 11  EDLQEELLTSPSHTSSSS-SYSSPPNL-YPFSIDAEL--------WLLAEERIQEILCTI 60
           ED  EE+  S +  S+   ++   PN  YP+  D +         WL  E  +++ +  I
Sbjct: 123 EDDSEEISNSDNDLSNGDYAHGGTPNSSYPWVKDHDHSKQKEIADWLTLE--MKDFVNYI 180

Query: 61  QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
            P+      R  VI+ +++ I  ++ G E   FGS     YLP  DID+ V+      E+
Sbjct: 181 SPSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVSETGDYEN 240

Query: 120 LARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALC 175
            +R   +   L  +      E+       A+V I+K    V  + +D+SF +  GL A  
Sbjct: 241 RSRLYQLSTFLRTKKLAKNVEV----IASAKVPIIKFVDPVSELHIDVSFERTNGLDAAK 296

Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            + +   LI    L  R ++L+        R+   H G +  YA  +M  H
Sbjct: 297 RIRRW--LISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYH 343


>gi|448101749|ref|XP_004199636.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
 gi|359381058|emb|CCE81517.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
          Length = 646

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 126/309 (40%), Gaps = 44/309 (14%)

Query: 30  YSSP--PNLYPF------SIDAEL--WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQR 79
           YS+P   N YP+      S   E+  WL  E  +++ +  I P+      R  V+  ++R
Sbjct: 161 YSAPNETNDYPWVKNHDHSTQKEIADWLTLE--MKDFVNYISPSKEEILTRNRVVKDLKR 218

Query: 80  LINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQDSV 136
            IN  +   E   FGS     YLP  DID+ V       E+ ++   +   L N      
Sbjct: 219 EINSLWPDTETHVFGSSATDLYLPGSDIDMVVTSKTGDYENRSKLYQLSSYLRNRKL--- 275

Query: 137 FEIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKR 192
              KD++ +  A+V I+K      NI +DISF +  G+ A   + K +D+  G      R
Sbjct: 276 --AKDIEVIAKAKVPIIKFVDPSSNIHIDISFERRNGIEAAKRIRKWLDKTPG-----LR 328

Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
            ++LI        R+   H G +  Y+  ++  H FL  +     +N  I  N    +  
Sbjct: 329 ELVLIIKQFLRSRRLNNVHVGGLGGYSTIILCYH-FLRLHPRISTNNMSILDNLGSLLIE 387

Query: 253 LPEIVAETVENDGDELLLSPE-----FLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
             E+       D   + + PE     +L K    Y      L ++ + F+I    I DP 
Sbjct: 388 FFELYGRNFSYDNLIIAIDPETDEPKYLPKKDHAY------LNSSKNPFSIV---IQDPA 438

Query: 308 KDNNNLGRS 316
              NN+ RS
Sbjct: 439 DSTNNISRS 447


>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
 gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
 gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
 gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
          Length = 603

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 101/235 (42%), Gaps = 33/235 (14%)

Query: 11  EDLQEELLTSPSHTSSSS-SYSSPPNL-YPFSIDAEL--------WLLAEERIQEILCTI 60
           ED  EE+  S +  S+   ++   PN  YP+  D +         WL  E  +++ +  I
Sbjct: 123 EDDSEEISNSDNDLSNGDYAHGETPNSSYPWVKDHDHSKQKEIADWLTLE--MKDFVNYI 180

Query: 61  QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
            P+      R  VI+ +++ I  ++ G E   FGS     YLP  DID+ V+      E+
Sbjct: 181 SPSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVSETGDYEN 240

Query: 120 LARFVCKILENEDQDSVF-----EIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGL 171
            +R          Q S F       K+V+ +  A+V I+K    V  + +D+SF +  GL
Sbjct: 241 RSRLY--------QLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHIDVSFERTNGL 292

Query: 172 SALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            A   + +   LI    L  R ++L+        R+   H G +  YA  +M  H
Sbjct: 293 DAAKRIRRW--LISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYH 343


>gi|255730627|ref|XP_002550238.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240132195|gb|EER31753.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 603

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 73/304 (24%), Positives = 125/304 (41%), Gaps = 34/304 (11%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  I P+      R +VI  +++ I+ ++ G  V  FGS     YLP  
Sbjct: 166 WLTME--IRDFVNYISPSSDEIITRNKVIAALKKSISDFWPGTTVHVFGSCATDLYLPGS 223

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIP 160
           DID+ V+      E+ +R   +   L      +  E+       A+V I+K       + 
Sbjct: 224 DIDMVVVSDTGSYENASRLYQLSTFLRTNKLATEVEV----IAHAKVPIIKFVDPKSRLH 279

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           +D+SF +  G+ A    +++ R +      +  V+++K +     R+   H G +  YA 
Sbjct: 280 IDVSFERTNGIDA---AKRIRRWLVSTPGLRELVLVVKQFLRTR-RLNNVHVGGLGGYAT 335

Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPE-----FL 275
            +M  H FL  +     D      N  V +    E+       D   L L PE     ++
Sbjct: 336 IIMCYH-FLRLHPKITTDAMSALDNLGVLLIEFFELYGRNFSYDDLVLALDPENEEPKYV 394

Query: 276 KKCREIYSVPITALENTGH-EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQ 334
           +K +       + L+++    FAI    I DP   +NN+ RS    N   ++ A     Q
Sbjct: 395 RKSK-------SPLQHSARSSFAII---IQDPADPSNNITRSSY--NLRDLKKAFGGAYQ 442

Query: 335 RLGE 338
            L E
Sbjct: 443 LLSE 446


>gi|389642869|ref|XP_003719067.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
 gi|351641620|gb|EHA49483.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
 gi|440474598|gb|ELQ43333.1| DNA polymerase sigma [Magnaporthe oryzae Y34]
 gi|440486580|gb|ELQ66430.1| DNA polymerase sigma [Magnaporthe oryzae P131]
          Length = 703

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 62/246 (25%), Positives = 97/246 (39%), Gaps = 45/246 (18%)

Query: 42  DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING--YYGIEVFPFGSVPLKT 99
           D  +WL  E  I +     +P    EK R+ +I+ + +LI    +    V+PFGS     
Sbjct: 369 DGGVWLHKE--IVDFYNYAKPRDFEEKLRQGLIDELAKLIRNSQFRDATVYPFGSFKSNL 426

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCK--------ILENEDQDSVFEIKDVQYVPAQVKI 151
           YLP GD+DL       +    AR+  K         +E +       ++ +    A+V +
Sbjct: 427 YLPTGDMDLVFCSDSYMSGRAARYSSKNHVFKFGAFIERKQLAVDNHVEKIS--KARVPL 484

Query: 152 VKC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           VK   S   + VD+SF  + G+ A+  FL   ++      L    V  IK +        
Sbjct: 485 VKYVDSRTGLKVDVSFENITGIRAIETFLAWREQFPDMPVL----VTCIKHFLAMRGLNE 540

Query: 209 GAHYGLISTYALEMMV-----------------------LHRFLDYY-NTFDWDNYCISI 244
            A+ G+  T  + ++V                       L RF D Y N F +D   IS+
Sbjct: 541 PANGGIGGTTVICLVVSMLQLSPDVQSRSMTPESHLGQLLLRFFDLYGNRFSYDRVAISM 600

Query: 245 NGPVAI 250
           N P  I
Sbjct: 601 NPPRYI 606


>gi|58260578|ref|XP_567699.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|57229780|gb|AAW46182.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 779

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 87/200 (43%), Gaps = 28/200 (14%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
           LL EE I      + P     + R  +I  I R IN  +   EV PFGS   + YLP GD
Sbjct: 142 LLTEE-INAFYKYVSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200

Query: 106 IDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC----------- 154
           IDL V      + +  R + ++ +   Q ++ ++  +    A+V I+K            
Sbjct: 201 IDLVVAHKYLSDANKQRLLAELGKAMRQANITDVVAI-IARARVPIIKFVTLEGKSHVSS 259

Query: 155 --------SVQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYES 205
                    V  I VDIS NQ  G++A   + + +D L G      R +ILI  +   + 
Sbjct: 260 LEYFSKQEGVGKINVDISLNQANGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQR 314

Query: 206 RILGAHYGLISTYALEMMVL 225
            +   + G + +Y++  MV+
Sbjct: 315 SMNEVYTGGLGSYSVICMVI 334


>gi|407039791|gb|EKE39813.1| topoisomerase, putative [Entamoeba nuttalli P19]
          Length = 400

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 55/191 (28%), Positives = 91/191 (47%), Gaps = 22/191 (11%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           S + EL L  E  +Q  +  I+P     K R+E++    +++    G  V  FGS   K 
Sbjct: 9   SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNVMAFGSTQSKL 65

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILEN---EDQDSVFEIKDVQYVPAQVKIVKCSV 156
           +LP  DID +V+ ++     +   V  IL +   EDQ   F+        A + ++K + 
Sbjct: 66  FLPTSDIDFSVITNEYNTRKVLNSVSSILSSYVLEDQKRNFK--------ASIPVLKLTD 117

Query: 157 QN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYG 213
           +   I +DIS N   G   + F+E+V   I KD   ++ V+LIK+  C Y+        G
Sbjct: 118 KKTLIVLDISHNNTNGTKTVNFIEEV---IKKDDRIRKLVLLIKSLLCCYDFH--QPANG 172

Query: 214 LISTYALEMMV 224
            + TY++ +MV
Sbjct: 173 GLGTYSVFVMV 183


>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
          Length = 701

 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 102/248 (41%), Gaps = 49/248 (19%)

Query: 38  PFSIDAELWLL----AEERIQEILCTIQ---PAIVSEKKRKEVINYIQRLING-YYGIEV 89
           P+S D E W      AE   +E++   Q   P     + R  VI  I R I   +   EV
Sbjct: 82  PWSADVE-WSKCQNGAEALHRELIAFDQWMAPTGAEHETRCMVIELIARAIKSQFRDAEV 140

Query: 90  FPFGSVPLKTYLPDGDIDLTVLGHQ--NVEEDLA-RFVCKILENEDQDSVFEIKDVQYVP 146
            PFGS   K YLP GD+DL V+     N+    A R +   L   +  +     DVQ + 
Sbjct: 141 RPFGSQETKLYLPQGDLDLVVVSRSMANLRTQSALRTMAACLRRHNLAT-----DVQVIA 195

Query: 147 -AQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYY 203
            A+V I+K   +   + VDIS N   GL+   ++     L    H+  R +IL+      
Sbjct: 196 KAKVPIIKFVTTYARLKVDISLNHTNGLTTASYVNGW--LRKWPHI--RPLILVIKHLLM 251

Query: 204 ESRILGAHYGLISTYALEMMV-----LH-------------------RFLDYY-NTFDWD 238
           +  +     G + +Y++ +MV     LH                    FL+ Y   F +D
Sbjct: 252 QRGMSEVFSGGLGSYSVIIMVISFLQLHPKLQRGEIEPGRSLGVLLLEFLELYGKNFGYD 311

Query: 239 NYCISING 246
           N  ISI G
Sbjct: 312 NCGISIRG 319


>gi|334311788|ref|XP_003339660.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Monodelphis domestica]
          Length = 809

 Score = 49.7 bits (117), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 62/137 (45%), Gaps = 19/137 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 359 EEISDFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 418

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 419 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 468

Query: 162 DISFNQMAGLSALCFLE 178
           DISFN   G+ A   ++
Sbjct: 469 DISFNVQNGVKAAQLIK 485


>gi|384485719|gb|EIE77899.1| hypothetical protein RO3G_02603 [Rhizopus delemar RA 99-880]
          Length = 494

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 72/144 (50%), Gaps = 24/144 (16%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------------GI 87
           +I  E WL   ER   I+ T +  I    KRK VI+ +Q ++  +              I
Sbjct: 7   NISNEDWLSYLERY-NIIETEEEKI----KRKNVIDLLQHILVNFQRAVTKDLDWKRGDI 61

Query: 88  EVF--PFGSVPLKTYLPDGDIDLTVLG-HQNVEEDLARFVCKILENEDQDSVFEIKDVQY 144
           E F  PFGS  L  Y+ D DIDL ++   Q + +   +F  ++L+   Q  V  ++ +Q 
Sbjct: 62  ECFLSPFGSYALGGYIRDADIDLVLVCPIQVLRKYFFKFFPQLLKQ--QTLVSNVESIQ- 118

Query: 145 VPAQVKIVKCSVQNIPVDISFNQM 168
             A V I+KC++ NI +DISF ++
Sbjct: 119 -KANVPIIKCTIDNISIDISFVRL 141


>gi|241948905|ref|XP_002417175.1| topoisomerase 1-related protein TRF4, putative [Candida
           dubliniensis CD36]
 gi|223640513|emb|CAX44767.1| topoisomerase 1-related protein TRF4, putative [Candida
           dubliniensis CD36]
          Length = 606

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 36/238 (15%)

Query: 11  EDLQEELLTSPSHTS--SSSSYS---SPPNLYPFSIDAEL--------WLLAEERIQEIL 57
           ED  EE L S   +S  +   Y+   +P + YP+  D +         WL  E  +++ +
Sbjct: 123 EDESEEFLDSEDDSSYKNEGDYANGGTPNSHYPWVKDHDHSKQKEIADWLTLE--MKDFV 180

Query: 58  CTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNV 116
             I P+      R  VI+ ++  I  ++ G E   FGS     YLP  DID+ V+     
Sbjct: 181 NYISPSSEEIVTRNNVISTLKTEIGMFWPGTETHVFGSCATDLYLPGSDIDMVVVSETGD 240

Query: 117 EEDLARFVCKILENEDQDSVF-----EIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQM 168
            E+ +R          Q S F       K+V+ +  A+V I+K    +  + +D+SF + 
Sbjct: 241 YENRSRLY--------QLSTFLRTKKLAKNVEVIASAKVPIIKFVDPISELHIDVSFERT 292

Query: 169 AGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            GL A   + +   LI    L  R ++L+        R+   H G +  YA  +M  H
Sbjct: 293 NGLDAAKRIRRW--LISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYH 346


>gi|119910013|ref|XP_001256516.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
 gi|297485254|ref|XP_002694925.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
 gi|296478153|tpg|DAA20268.1| TPA: DNA polymerase sigma-like [Bos taurus]
          Length = 467

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 61/135 (45%), Gaps = 19/135 (14%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           L  E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DI
Sbjct: 14  LLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 73

Query: 107 DLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQN 158
           DL V G         +EE L +         D+DSV ++ D   VP    I+K   S   
Sbjct: 74  DLVVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTE 123

Query: 159 IPVDISFNQMAGLSA 173
           + VDISFN   G+ A
Sbjct: 124 VKVDISFNVQNGVRA 138


>gi|224135265|ref|XP_002322024.1| predicted protein [Populus trichocarpa]
 gi|222869020|gb|EEF06151.1| predicted protein [Populus trichocarpa]
          Length = 122

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 55/112 (49%), Gaps = 31/112 (27%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           +++ E W  AEER  E++  IQP   SE++R  V+ Y+QRLI     +  FP        
Sbjct: 29  ALEPERWATAEERTAELIACIQPNQPSEERRNAVLCYVQRLI-----MNCFP-------- 75

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
                            +E  A  V  ILE+E+  +++ F +K+VQY+ A+V
Sbjct: 76  ----------------CQETWANEVRDILEHEEKNENAEFHVKEVQYIQAEV 111


>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
           [Sarcophilus harrisii]
          Length = 615

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 62/137 (45%), Gaps = 19/137 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 118 EEISDFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 177

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 178 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 227

Query: 162 DISFNQMAGLSALCFLE 178
           DISFN   G+ A   ++
Sbjct: 228 DISFNVQNGVKAAQLIK 244


>gi|340939540|gb|EGS20162.1| hypothetical protein CTHT_0046690 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 836

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 101/240 (42%), Gaps = 42/240 (17%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGD 105
           +L  + + +    ++P    ++ R E++  ++R     +G  EV+PFGS     YLP+GD
Sbjct: 467 VLLHKEVADFYAYVKPRDFEDRLRGELVQDLKRFSRRLFGDAEVYPFGSFCSGLYLPNGD 526

Query: 106 IDLTVL-------GHQNVEEDLARFVCK---ILENEDQDSVFEIKDVQYVPAQVKIVKCS 155
           +DL ++       G     ++ A F  +   + E    ++  E+     VP  VK V+  
Sbjct: 527 MDLVLMSDLYKNGGPPRYHDNHALFKFRNGLVKEGMIHNNNIELILHARVPL-VKFVERK 585

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLI-GKDHLFKRSVILIKAWCYYESRILGAHYGL 214
              + +DISF   +G+ A+    K   L  G   L    V +IK +          + G+
Sbjct: 586 T-GLRIDISFENRSGIVAINTFHKWRNLYPGMPAL----VSVIKHFLLMRGLNEPVNGGI 640

Query: 215 IS-------TYALEMM----------------VLHRFLDYY-NTFDWDNYCISINGPVAI 250
            S          L+MM                +L RF DYY N F+++   IS++ P  I
Sbjct: 641 GSFSVTCLVVSMLQMMPEVQSGNMDTRHHLGELLLRFFDYYGNRFNYEGLAISLDPPALI 700


>gi|297698707|ref|XP_002826459.1| PREDICTED: PAP-associated domain-containing protein 5 [Pongo
           abelii]
          Length = 588

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 61/135 (45%), Gaps = 19/135 (14%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           L  E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DI
Sbjct: 88  LLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 147

Query: 107 DLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQN 158
           DL V G         +EE L +         D+DSV ++ D   VP    I+K   S   
Sbjct: 148 DLVVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTE 197

Query: 159 IPVDISFNQMAGLSA 173
           + VDISFN   G+ A
Sbjct: 198 VKVDISFNVQNGVRA 212


>gi|134117055|ref|XP_772754.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50255372|gb|EAL18107.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 779

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 28/200 (14%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
           LL EE I      + P     + R  +I  I R IN  +   EV PFGS   + YLP GD
Sbjct: 142 LLTEE-INAFYKYVSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200

Query: 106 IDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC----------- 154
           IDL V      + +  R + ++ +   Q ++ ++  +    A+V I+K            
Sbjct: 201 IDLVVAHKYLSDANKQRLLAELGKAMRQANITDVVAI-IARARVPIIKFVTLEGKSHVSS 259

Query: 155 --------SVQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYES 205
                    +  I VDIS NQ  G++A   + + +D L G      R +ILI  +   + 
Sbjct: 260 LEYFSKQEGIGKINVDISLNQANGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQR 314

Query: 206 RILGAHYGLISTYALEMMVL 225
            +   + G + +Y++  MV+
Sbjct: 315 SMNEVYTGGLGSYSVICMVI 334


>gi|363738258|ref|XP_414098.3| PREDICTED: PAP-associated domain-containing protein 5 [Gallus
           gallus]
          Length = 534

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 89/197 (45%), Gaps = 25/197 (12%)

Query: 39  FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPL 97
           F+I A +  +  E I +    + P    E+ R EV+N I+ +I   +   +V  FGS   
Sbjct: 24  FAIAAVVKAILHEEIIDFYKYMSPRHEEERMRMEVVNRIENVIKELWPNADVQIFGSFKT 83

Query: 98  KTYLPDGDIDLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKI 151
             YLP  DIDL V G         +EE L +         D++SV ++ D   VP    I
Sbjct: 84  GLYLPTSDIDLVVFGKWETLPLWTLEEALRKHNVA-----DENSV-KVLDKATVP----I 133

Query: 152 VKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
           +K   S   + VDISFN   G+ A   ++     I K  +    V+++K +     R L 
Sbjct: 134 IKLTDSFTEVKVDISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRDLN 188

Query: 210 AHY-GLISTYALEMMVL 225
             + G I +Y+L +M +
Sbjct: 189 EVFTGGIGSYSLFLMAV 205


>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
 gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
          Length = 667

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 74/310 (23%), Positives = 129/310 (41%), Gaps = 49/310 (15%)

Query: 38  PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVP 96
           P+  + E+  L  + I   +  ++P       RK +I  + R I   +  + V+ FGS  
Sbjct: 221 PYPEEPEVARLLHQDILNFINYLEPTPQEHAVRKSLITKLDRAIRAKWPEVTVYVFGSFE 280

Query: 97  LKTYLPDGDIDLTVLGHQNVEEDLARFV---CKILENEDQDSVFEIKDVQYVPAQVKIVK 153
            + YLP  DID+ V+    V     + +    + L+N    +  ++     VP  +K V 
Sbjct: 281 TRLYLPTSDIDMVVMSSDTVHRGTKKHMYSLARHLKNCKLATEIQVITTANVPI-IKFVD 339

Query: 154 CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY- 212
                I VD+SFNQ  GL   C +  V+  + K    +   +L+K   +   R L   + 
Sbjct: 340 -PFTRIHVDVSFNQPGGLKT-CLV--VNGFLKKFPAVRPLTMLVKH--FLNMRALNEVFL 393

Query: 213 GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSP 272
           G +S+YA+  +V+  FL  +                     P +   +V  + +  +L  
Sbjct: 394 GGLSSYAIVCLVVS-FLQMH---------------------PRLSTSSVRQEDNLGVLFL 431

Query: 273 EFLK--KCREIYSVPITALENTGHEFAIKH-----------LNIVDPLKDNNNLGRSVSK 319
           EFL+    R  Y     A+ N G  F+ K            L+I DP+  +N++ +S S+
Sbjct: 432 EFLELYGKRYNYDAVGIAVHNGGFYFSKKKMGWVKPSQPYLLSIQDPVDYDNDISKS-SR 490

Query: 320 GNFHRIRCAL 329
           G   R++  L
Sbjct: 491 G-ILRVKATL 499


>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
          Length = 833

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 64/273 (23%), Positives = 111/273 (40%), Gaps = 54/273 (19%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLA------- 121
           R +V+  ++  I   + G +V  FGS     YLP  DID+ VLG + +E           
Sbjct: 111 RHDVVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPTSDIDMVVLG-EKIEPRYGNPQNGPH 169

Query: 122 -RFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKV 180
            R   ++L+    +  + IK +      +  ++  + +I VDISFN   G++A+  ++  
Sbjct: 170 YRLQDRLLKQGIAER-YSIKVIDSAAVPIIKMRDMITDIKVDISFNMKTGVTAIGLVKGY 228

Query: 181 DR---------LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
            R         L+ K  L +R   + + W            G IS+Y L +MV+  FL +
Sbjct: 229 IRQFPALRYLVLVLKQFLLQRD--MNEVWT-----------GGISSYGLILMVVS-FLQH 274

Query: 232 YNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPEFLKKCREIYS 283
                 DN    +N  V +           E         +G + +   E   + +E   
Sbjct: 275 QGA---DNTADDVNLGVLLIKFLRFYGMEFEYSKCCIRVKNGGQFIKKEEMATQMKE--- 328

Query: 284 VPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
                   TG ++    L+I DPL  +N++GR+
Sbjct: 329 ------SPTGPKYVPNFLSIEDPLTPSNDVGRA 355


>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
          Length = 887

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 64/271 (23%), Positives = 109/271 (40%), Gaps = 50/271 (18%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQ------NVEEDLAR 122
           R +V+  ++  I   + G +V  FGS     YLP  DID+ VLG +      N +     
Sbjct: 165 RHDVVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPTSDIDMVVLGEKIEPRYGNPQNGPHY 224

Query: 123 FVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDR 182
            +   L  +     + IK +      +  ++  + +I VDISFN   G++A+  ++   R
Sbjct: 225 RLQDRLLKQGIAERYSIKVIDSAAVPIIKMRDMITDIKVDISFNMKTGVTAIGLVKGYIR 284

Query: 183 ---------LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYN 233
                    L+ K  L +R   + + W            G IS+Y L +MV+  FL +  
Sbjct: 285 QFPALRYLVLVLKQFLLQRD--MNEVWT-----------GGISSYGLILMVVS-FLQHQG 330

Query: 234 TFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPEFLKKCREIYSVP 285
               DN    +N  V +           E         +G + +   E   + +E    P
Sbjct: 331 A---DNTGDDVNLGVLLIKFLRFYGMEFEYSKCCIRVKNGGQFIKKEEMATQMKE---AP 384

Query: 286 ITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
                 TG ++    L+I DPL  +N++GR+
Sbjct: 385 ------TGPKYVPNFLSIEDPLTPSNDVGRA 409


>gi|367030285|ref|XP_003664426.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
           42464]
 gi|347011696|gb|AEO59181.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
           42464]
          Length = 772

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 60/248 (24%), Positives = 103/248 (41%), Gaps = 46/248 (18%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLK 98
           S +  +WL  E  + +    I+P    E+ R E++  +++     +   EV+PFGS P  
Sbjct: 397 SANMAVWLHKE--VVDFYDYIKPRDFEERLRNELVQDLKQFCRRVFKDAEVYPFGSFPSG 454

Query: 99  TYLPDGDIDLTVL-------GHQ--NVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQV 149
            YLP  D+DL  +       GH   + +  L RF  +++ ++     +E +    + A+V
Sbjct: 455 LYLPTADMDLVFMSDAYASGGHAKYHTKSFLHRFKGQLVNHK---VAWENEIELILHAKV 511

Query: 150 KIVKC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYES- 205
            +VK       + VD+SF    G++A+  F    ++  G   L    V LIK +      
Sbjct: 512 PLVKFIEHKTGLKVDVSFENNTGVNAIKTFKAWREQYPGMPAL----VTLIKHFLLMRGL 567

Query: 206 --RILGAHYG---------------------LISTYALEMMVLHRFLDYYNTFDWDNYCI 242
              + G   G                     L + + L  ++LH F  Y N F++    I
Sbjct: 568 NEPVNGGIGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAI 627

Query: 243 SINGPVAI 250
           S+N P  I
Sbjct: 628 SLNPPRWI 635


>gi|336367333|gb|EGN95678.1| hypothetical protein SERLA73DRAFT_60289 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 538

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 4/129 (3%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           +    ++  +  + P+ V ++ R  VI+ + + ++  +   +V PFGS   K YLPDGDI
Sbjct: 184 MLHREVEAFVNYMSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDI 243

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDIS 164
           DL V+  +++       V   L N  + +    K      A+V IVK    +  + VDIS
Sbjct: 244 DL-VIQSESMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDIS 302

Query: 165 FNQMAGLSA 173
            NQ  G+ A
Sbjct: 303 INQGNGVIA 311


>gi|426382139|ref|XP_004057678.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Gorilla gorilla gorilla]
          Length = 664

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 213 HEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
            V G         +EE L +         D+DSV ++ D   VP    I+K   S   + 
Sbjct: 273 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 322

Query: 161 VDISFNQMAGLSA 173
           VDISFN   G+ A
Sbjct: 323 VDISFNVQNGVRA 335


>gi|353241543|emb|CCA73351.1| related to TRF4-topoisomerase I-related protein [Piriformospora
           indica DSM 11827]
          Length = 628

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 17/110 (15%)

Query: 84  YYGIEVFP------FGSVPLKTYLPDGDIDLTV------LGHQNVEEDLARFVCKILENE 131
           YY  +++P      FGS     YLP GDID+ +         +N  +     +  IL + 
Sbjct: 121 YYARKIWPDATATAFGSYATGLYLPTGDIDVVIETKYATASTKNAAQRALSQLATILRSA 180

Query: 132 DQDSVFEIKDVQYVPAQVKIVKC-SVQN-IPVDISFNQMAGLSALCFLEK 179
               + E + +Q + A+V I+K  SV   IPVDIS NQ  G+SA+  + +
Sbjct: 181 ---GLAERRKIQVISARVSIIKFDSVHGGIPVDISLNQTTGVSAIPVINR 227


>gi|167384281|ref|XP_001736885.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
 gi|165900593|gb|EDR26889.1| PAP-associated domain-containing protein, putative [Entamoeba
           dispar SAW760]
          Length = 400

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 91/188 (48%), Gaps = 16/188 (8%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           S + EL L  E  +Q  +  I+P     K R+E++    +++    G  + PFGS   K 
Sbjct: 9   SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNIMPFGSTQSKL 65

Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN- 158
           +LP  DID +V+ +    E   R V   + +     V E +   +  A V ++K + +  
Sbjct: 66  FLPTSDIDFSVITN----EYNTRKVLNSISSILSSYVLEDQKRNF-KASVPVLKLTDKQT 120

Query: 159 -IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYGLIS 216
            I +DIS N  +G   + F+E++   I KD   +R V+LIK+  C Y+        G + 
Sbjct: 121 LIVLDISHNNTSGTKTVDFIEEI---IKKDDRIRRLVLLIKSILCCYDFH--QPANGGLG 175

Query: 217 TYALEMMV 224
           TY++ +MV
Sbjct: 176 TYSVFVMV 183


>gi|441597299|ref|XP_003263084.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Nomascus leucogenys]
          Length = 666

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 215 HEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 274

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
            V G         +EE L +         D+DSV ++ D   VP    I+K   S   + 
Sbjct: 275 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 324

Query: 161 VDISFNQMAGLSA 173
           VDISFN   G+ A
Sbjct: 325 VDISFNVQNGVRA 337


>gi|440637467|gb|ELR07386.1| hypothetical protein GMDG_08401 [Geomyces destructans 20631-21]
          Length = 753

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 66/155 (42%), Gaps = 26/155 (16%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFG 93
           N YP S +   WL  E  I +     +PA V E  R  +I+ ++R +   ++  ++ PFG
Sbjct: 410 NHYP-SSNMGSWLHME--IMDFFHRFKPAEVEENMRGALISDLRRAVQKVWHDADILPFG 466

Query: 94  SVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDV---QYVPAQVK 150
           S P   YLP  D+DL  +    ++    ++  K       +++F  +D    + + A   
Sbjct: 467 SYPAGLYLPTADMDLVFVSRGYMDGGYGKYTNK-------NALFRFRDFLDREKIAAPYS 519

Query: 151 IVKCSVQNIP------------VDISFNQMAGLSA 173
           I   S   +P            VD+SF    GL A
Sbjct: 520 IEVISKAKVPLVKYIDYYTGLRVDVSFENDTGLIA 554


>gi|334325414|ref|XP_001371537.2| PREDICTED: DNA polymerase sigma-like [Monodelphis domestica]
          Length = 860

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 84/194 (43%), Gaps = 20/194 (10%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 309 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDL 368

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 369 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 424

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
              G+       K  RLI KD++ K S    +IL+      +  +     G IS+Y+L +
Sbjct: 425 METGV-------KAARLI-KDYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 476

Query: 223 MVLHRFLDYYNTFD 236
           M +  FL  +   D
Sbjct: 477 MAI-SFLQLHPRID 489


>gi|344289184|ref|XP_003416325.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Loxodonta africana]
          Length = 595

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 61/133 (45%), Gaps = 15/133 (11%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           L  E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DI
Sbjct: 95  LLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 154

Query: 107 DLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
           DL V G         +EE L +         D+DSV ++ D   VP  +K+   S   + 
Sbjct: 155 DLVVFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVPI-IKLTD-SFTEVK 206

Query: 161 VDISFNQMAGLSA 173
           VDISFN   G+ A
Sbjct: 207 VDISFNVQNGVRA 219


>gi|296231051|ref|XP_002760982.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Callithrix jacchus]
          Length = 664

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 213 HEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
            V G         +EE L +         D+DSV ++ D   VP    I+K   S   + 
Sbjct: 273 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 322

Query: 161 VDISFNQMAGLSA 173
           VDISFN   G+ A
Sbjct: 323 VDISFNVQNGVRA 335


>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 588

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 96/229 (41%), Gaps = 23/229 (10%)

Query: 11  EDLQEELLTSPSHTSSSSSYSSPPNLYPF------SIDAEL--WLLAEERIQEILCTIQP 62
           +D  ++ + S   + +      P +LYP+      S   E+  WL  E  I++ +  I P
Sbjct: 124 DDYSDDGIISDDQSGNRHIAKEPSSLYPWIKNQDHSKQQEIADWLTLE--IKDFVNYISP 181

Query: 63  AIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLA 121
           + +    R  VI  ++  I  ++   EV  FGS     YLP  DID+ V+      E  +
Sbjct: 182 SKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVVISRDGDREQRS 241

Query: 122 RF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALCFL 177
           R   +   L ++      E+       A+V IVK      NI +D+SF +  G+ A    
Sbjct: 242 RLYQLSTHLRSKKLAKNIEV----IAKAKVPIVKFVDPDSNIHIDVSFERSNGIDAAI-- 295

Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            K+   +      +  V+++K +     R+   H G +  Y+  ++  H
Sbjct: 296 -KIREWLASTPGLRELVLVVKQFL-RSRRLNNVHVGGLGGYSTIILCYH 342


>gi|444720754|gb|ELW61529.1| HEAT repeat-containing protein 3 [Tupaia chinensis]
          Length = 1047

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 65/140 (46%), Gaps = 15/140 (10%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 648 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 707

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           V G         +EE L +      +  D+DSV ++ D   VP  +K+   S   + VDI
Sbjct: 708 VFGKWENLPLWTLEEALRKH-----KVADEDSV-KVLDKATVPI-IKLTD-SFTEVKVDI 759

Query: 164 SFNQMAGLSALCFLEKVDRL 183
           SFN   G+ A   ++   +L
Sbjct: 760 SFNVQNGVRAADLIKDFTKL 779


>gi|336380050|gb|EGO21204.1| hypothetical protein SERLADRAFT_476100 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 592

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 4/124 (3%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
           ++  +  + P+ V ++ R  VI+ + + ++  +   +V PFGS   K YLPDGDIDL V+
Sbjct: 189 VEAFVNYMSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDL-VI 247

Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDISFNQMA 169
             +++       V   L N  + +    K      A+V IVK    +  + VDIS NQ  
Sbjct: 248 QSESMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQGN 307

Query: 170 GLSA 173
           G+ A
Sbjct: 308 GVIA 311


>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 588

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 96/229 (41%), Gaps = 23/229 (10%)

Query: 11  EDLQEELLTSPSHTSSSSSYSSPPNLYPF------SIDAEL--WLLAEERIQEILCTIQP 62
           +D  ++ + S   + +      P +LYP+      S   E+  WL  E  I++ +  I P
Sbjct: 124 DDYSDDGIISDDQSGNRHIAKEPSSLYPWIKNQDHSKQQEIADWLTLE--IKDFVNYISP 181

Query: 63  AIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLA 121
           + +    R  VI  ++  I  ++   EV  FGS     YLP  DID+ V+      E  +
Sbjct: 182 SKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVVISRDGDREQRS 241

Query: 122 RF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALCFL 177
           R   +   L ++      E+       A+V IVK      NI +D+SF +  G+ A    
Sbjct: 242 RLYQLSTHLRSKKLAKNIEV----IAKAKVPIVKFVDPDSNIHIDVSFERSNGIDAAI-- 295

Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            K+   +      +  V+++K +     R+   H G +  Y+  ++  H
Sbjct: 296 -KIREWLASTPGLRELVLVVKQFL-RSRRLNNVHVGGLGGYSTIILCYH 342


>gi|256818780|ref|NP_001035374.2| PAP-associated domain-containing protein 5 isoform a [Homo sapiens]
 gi|194374871|dbj|BAG62550.1| unnamed protein product [Homo sapiens]
          Length = 698

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 201 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 260

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 261 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 310

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 311 DISFNVQNGVRA 322


>gi|374724400|gb|EHR76480.1| terminal uridylyltransferase [uncultured marine group II
           euryarchaeote]
          Length = 730

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 49/191 (25%), Positives = 87/191 (45%), Gaps = 14/191 (7%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQR-LINGYYGIEVFPFGSVPLK 98
           S+  E+ L   E I+ +L +  P     K R+ +  +++R L + +  +E+  FGS    
Sbjct: 36  SVSEEVILNIGEGIEAMLKSEGPTPKQLKAREGLFRHLKRTLEHRFKNVELQQFGSSQSG 95

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ 157
             L  GD+DL +    ++     R V ++L+  D      ++D+  +P A+V I+K   +
Sbjct: 96  LTLQAGDLDLCLQFKGDIPAKALRQVNRLLKQHD------MEDIVMLPRAKVPIIKFKDE 149

Query: 158 --NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
              IPVDIS N      AL   E + R    D   +  ++ +K W      +  A  G  
Sbjct: 150 RTKIPVDISINNTL---ALHNTELLKRYSSCDERIRSVILAVKHWANRRD-VCDASTGTF 205

Query: 216 STYALEMMVLH 226
           S+YA  ++ + 
Sbjct: 206 SSYAWTLLAVQ 216


>gi|256818782|ref|NP_001035375.2| PAP-associated domain-containing protein 5 isoform b [Homo sapiens]
          Length = 651

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 201 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 260

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 261 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 310

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 311 DISFNVQNGVRA 322


>gi|297283968|ref|XP_001083145.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Macaca mulatta]
          Length = 700

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 203 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 262

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 263 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 312

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 313 DISFNVQNGVRA 324


>gi|410983511|ref|XP_003998082.1| PREDICTED: PAP-associated domain-containing protein 5 [Felis catus]
          Length = 514

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 64  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 123

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 124 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 173

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 174 DISFNVQNGVRA 185


>gi|426243516|ref|XP_004015600.1| PREDICTED: PAP-associated domain-containing protein 5 [Ovis aries]
          Length = 588

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 137 HEEISDFYEYMSPRPEEEKMRMEVVNRIEGVIKELWPSADVQIFGSFKTGLYLPTSDIDL 196

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
            V G         +EE L +         D+DSV ++ D   VP    I+K   S   + 
Sbjct: 197 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 246

Query: 161 VDISFNQMAGLSA 173
           VDISFN   G+ A
Sbjct: 247 VDISFNVQNGVRA 259


>gi|297283970|ref|XP_002802516.1| PREDICTED: PAP-associated domain-containing protein 5 [Macaca
           mulatta]
          Length = 653

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 62/137 (45%), Gaps = 19/137 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 203 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 262

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 263 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 312

Query: 162 DISFNQMAGLSALCFLE 178
           DISFN   G+ A   ++
Sbjct: 313 DISFNVQNGVRAADLIK 329


>gi|426382137|ref|XP_004057677.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Gorilla gorilla gorilla]
          Length = 631

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 244 DISFNVQNGVRA 255


>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
          Length = 361

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 47/171 (27%), Positives = 85/171 (49%), Gaps = 15/171 (8%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           I P     ++R++ I  ++  I  +   +++  FGS   K YLP+ DID+ ++      +
Sbjct: 75  IIPTSEEHRRREQAIMRVETFIKEFASEVDIQAFGSFKTKLYLPNADIDVVMIDKSMSAK 134

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK-CSVQN-IPVDISFNQMAGLSALC 175
           +L + V + L   D+      ++V  +  A+V I+K   V++    DISFNQM GL  + 
Sbjct: 135 ELYKKVAQSLMKSDK-----FENVNLIANAKVPIIKFFEVESQYQFDISFNQMDGLKQID 189

Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
            + K   +  +   FK  ++++K  C  + R L   Y G I ++ L  M+L
Sbjct: 190 EIRKAFTIYPE---FKYLIMILK--CMLKQRELNETYSGGIGSFLLFQMIL 235


>gi|441597295|ref|XP_003263083.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Nomascus leucogenys]
 gi|348031139|emb|CCB84642.1| PAP associated domain containing 5 [Homo sapiens]
          Length = 631

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 244 DISFNVQNGVRA 255


>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC
           30864]
          Length = 558

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 13/133 (9%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDI 106
           E+ + + +  I+P  +  + R+E++  I+ +I G +    +EVF  GS     YLP  DI
Sbjct: 213 EQEMYDFVEFIKPTPLEHQMREEIVQRIREVITGAWKHARVEVF--GSFATGLYLPMSDI 270

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDIS 164
           D+ V G  N ++     + K+LE        ++ D   VP    I+K   ++  + VDIS
Sbjct: 271 DIVVFG--NWDQIPLFTLGKLLEESRIAKNVKVIDKTSVP----IIKLADALSGVFVDIS 324

Query: 165 FNQMAGLSALCFL 177
           FN  +GL  + F+
Sbjct: 325 FNLESGLRTVEFI 337


>gi|321263807|ref|XP_003196621.1| hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
 gi|317463098|gb|ADV24834.1| Hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
          Length = 784

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 28/200 (14%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
           LL EE I      + P     + R  +I  I R IN  +   EV PFGS   + YLP GD
Sbjct: 142 LLTEE-INAFYKYVSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200

Query: 106 IDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC----------- 154
           IDL V      + +  R + ++ +   Q ++ ++  +    A+V I+K            
Sbjct: 201 IDLVVAHKYLSDANKQRLLAELGKAMRQANITDVVAI-IARARVPIIKFVTLEGKSHVFS 259

Query: 155 --------SVQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYES 205
                    +  I VDIS NQ  G++A   + + +D L G      R +ILI  +   + 
Sbjct: 260 LAYLTKQEGIGKINVDISLNQGNGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQR 314

Query: 206 RILGAHYGLISTYALEMMVL 225
            +   + G + +Y++  MV+
Sbjct: 315 SMNEVYTGGLGSYSVICMVI 334


>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 615

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 49/184 (26%), Positives = 79/184 (42%), Gaps = 11/184 (5%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  + P+      R  V+N ++  I  ++ G E   FGS     YLP  
Sbjct: 183 WLTME--IKDFVDYVSPSSDEIVTRNTVVNRLKTQIAKFWPGTEAHVFGSCATDLYLPGS 240

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID+ V+      E+ +R   +   L ++      E+     VP  +K V    + I +D
Sbjct: 241 DIDMVVISETGDYENRSRLYQLSSFLRSKKLAKNVEVIANAKVPI-IKFVDPESE-IHID 298

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  G+ A   + K   LI    L  R ++LI        R+   H G +  YA  +
Sbjct: 299 VSFERTNGIDAAKRIRKW--LITTPGL--RELVLIVKQFLRSRRLNNVHVGGLGGYATII 354

Query: 223 MVLH 226
           M  H
Sbjct: 355 MCYH 358


>gi|268637610|ref|XP_002649102.1| PAP/25A-associated domain-containing protein [Dictyostelium
           discoideum AX4]
 gi|256012839|gb|EEU04050.1| PAP/25A-associated domain-containing protein [Dictyostelium
           discoideum AX4]
          Length = 938

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 40/175 (22%), Positives = 85/175 (48%), Gaps = 15/175 (8%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVE- 117
           ++P+ +  + R+++I  +  ++   +    V  FGS     ++P  DID+ + G  N E 
Sbjct: 457 LEPSELESRIRQKIIRDLDAIVKSNWPKANVVVFGSFSSNLFIPSSDIDIQISGINNAES 516

Query: 118 -----EDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAG 170
                ++  R +  I+    QDS   ++++    A+V I+K   S  +  +DI F+   G
Sbjct: 517 VNKYNQNPIRDLFDIIIRNHQDSFINVRNI--FGAKVPIIKMTSSHSHYNIDICFDTPNG 574

Query: 171 LSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
           +        V + + K +   R+++LI  +  +++ +   + G I +YAL +MV+
Sbjct: 575 IENTA----VVKGLLKQYKSMRTLLLIIKFFLHQNNLNETYTGGIGSYALALMVV 625


>gi|397498213|ref|XP_003819879.1| PREDICTED: PAP-associated domain-containing protein 5, partial [Pan
           paniscus]
          Length = 593

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 96  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 155

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 156 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 205

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 206 DISFNVQNGVRA 217


>gi|440900205|gb|ELR51393.1| PAP-associated domain-containing protein 5, partial [Bos grunniens
           mutus]
          Length = 563

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 66  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 125

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 126 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 175

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 176 DISFNVQNGVRA 187


>gi|59800139|sp|Q8NDF8.2|PAPD5_HUMAN RecName: Full=PAP-associated domain-containing protein 5; AltName:
           Full=Terminal uridylyltransferase 3; Short=TUTase 3;
           AltName: Full=Topoisomerase-related function protein
           4-2; Short=TRF4-2
          Length = 572

 Score = 48.5 bits (114), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 122 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 181

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 182 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 231

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 232 DISFNVQNGVRA 243


>gi|403292555|ref|XP_003937307.1| PREDICTED: PAP-associated domain-containing protein 5 [Saimiri
           boliviensis boliviensis]
          Length = 631

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 244 DISFNVQNGVRA 255


>gi|402908342|ref|XP_003916909.1| PREDICTED: PAP-associated domain-containing protein 5 [Papio
           anubis]
          Length = 605

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 108 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 167

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 168 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 217

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 218 DISFNVQNGVRA 229


>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
           [Metaseiulus occidentalis]
          Length = 512

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 59/122 (48%), Gaps = 9/122 (7%)

Query: 61  QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
           +P     + R+EV+N ++ ++   +   +   FGS     YLP  DIDL +LG     E 
Sbjct: 106 KPTRTEHQVRQEVVNRVKEVVRQLWPQAQCEVFGSFCTGLYLPTSDIDLVILGDW---ET 162

Query: 120 LARFVC-KILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCF 176
           L  F   K L  E   S   IK +    A V IVK + Q  N+ VDISFNQ  G+ +   
Sbjct: 163 LPMFTLHKALIQEKIASASTIKVLDR--ASVPIVKFTEQSTNVKVDISFNQKNGVKSAKL 220

Query: 177 LE 178
           ++
Sbjct: 221 IK 222


>gi|359319041|ref|XP_535307.4| PREDICTED: PAP-associated domain-containing protein 5 [Canis lupus
           familiaris]
          Length = 641

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 144 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 203

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 204 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 253

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 254 DISFNVQNGVRA 265


>gi|405123317|gb|AFR98082.1| DNA polymerase sigma [Cryptococcus neoformans var. grubii H99]
          Length = 649

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 46/174 (26%), Positives = 79/174 (45%), Gaps = 27/174 (15%)

Query: 73  VINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
           +I  I R IN  +   EV PFGS   + YLP GDIDL V      + +  R + ++ +  
Sbjct: 32  MIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHKYLSDANKQRLLAELGKAM 91

Query: 132 DQDSVFEIKDVQYVPAQVKIVK-------------------CSVQNIPVDISFNQMAGLS 172
            Q ++ ++  +    A+V I+K                    ++  I VDIS NQ  G++
Sbjct: 92  RQANITDVVAI-IARARVPIIKFVTLEGESHVTSLADSSKQGAIGKINVDISLNQGNGVT 150

Query: 173 ALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
           A   + + +D L G      R +ILI  +   +  +   + G + +Y++  MV+
Sbjct: 151 AGKIINQYLDALPG-----ARQLILIVKYFLSQRSMNEVYTGGLGSYSVICMVI 199


>gi|390477686|ref|XP_002760981.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Callithrix jacchus]
          Length = 631

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 244 DISFNVQNGVRA 255


>gi|332845909|ref|XP_003315148.1| PREDICTED: PAP-associated domain-containing protein 5 [Pan
           troglodytes]
          Length = 586

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 89  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 148

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 149 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 198

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 199 DISFNVQNGVRA 210


>gi|395510809|ref|XP_003759662.1| PREDICTED: DNA polymerase sigma [Sarcophilus harrisii]
          Length = 594

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 78/303 (25%), Positives = 120/303 (39%), Gaps = 51/303 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 43  HEEIIDFYNFMSPCPEEAAMRREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDL 102

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 103 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 158

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
              G+       K  RLI KD++ K S    +IL+      +  +     G IS+Y+L +
Sbjct: 159 METGV-------KAARLI-KDYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 210

Query: 223 MVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI- 281
           M +  FL  +   D            A  +L  ++ E  E  G        +LK    I 
Sbjct: 211 MAI-SFLQLHPRIDARR---------ADENLGMLLVEFFELYGRNF----NYLKTGIRIK 256

Query: 282 ----YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLG 337
               Y      ++   + +    L I DPL   N++GRS             SYGA ++ 
Sbjct: 257 NGGAYIAKEEIMKAMTNGYRPSMLCIEDPLLPGNDVGRS-------------SYGAMQVK 303

Query: 338 EIL 340
           ++ 
Sbjct: 304 QVF 306


>gi|380798533|gb|AFE71142.1| PAP-associated domain-containing protein 5 isoform a, partial
           [Macaca mulatta]
          Length = 618

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 121 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 180

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 181 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 230

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 231 DISFNVQNGVRA 242


>gi|291410211|ref|XP_002721395.1| PREDICTED: DNA polymerase sigma-like [Oryctolagus cuniculus]
          Length = 522

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 25  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 84

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 85  VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 134

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 135 DISFNVQNGVRA 146


>gi|150951520|ref|XP_001387852.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
 gi|149388662|gb|EAZ63829.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
          Length = 605

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 83/186 (44%), Gaps = 15/186 (8%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  +++ +  I P+    + R  +IN ++  I+ Y+   E   FGS     YLP  
Sbjct: 177 WLTME--MKDFVNYISPSSEEIRTRNRLINKLKSSISSYWPETETHVFGSSATDLYLPGS 234

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIP 160
           DID+ ++      E+ +R   +   L ++      E+       A+V I+K      N+ 
Sbjct: 235 DIDIVIVSRTGDYENRSRLYQLSSYLRHKGLAKNMEV----IAKAKVPIIKFVDPESNVN 290

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           +D+SF +  G+ A    +K+ R +      +  V++IK +     R+   H G +  YA 
Sbjct: 291 IDVSFERRNGIEA---AKKIRRWMTTTPGLRELVLIIKQFL-SSRRLNNVHSGGLGGYAT 346

Query: 221 EMMVLH 226
            ++  H
Sbjct: 347 IILCYH 352


>gi|327270606|ref|XP_003220080.1| PREDICTED: DNA polymerase sigma-like [Anolis carolinensis]
          Length = 686

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 80/174 (45%), Gaps = 20/174 (11%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           RKEV+  I+ +I   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 212 RKEVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ESPPLQLLEQAL 269

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +    + IK +    A V I+K + Q   + VDISFN   G+       K  RLI K
Sbjct: 270 RKHNVAEPYSIKVLDK--ATVPIIKLTDQETEVKVDISFNVETGV-------KAARLI-K 319

Query: 187 DHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
           D++ K +    +IL+      +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 320 DYMKKYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 372


>gi|335308290|ref|XP_003361170.1| PREDICTED: PAP-associated domain-containing protein 5-like [Sus
           scrofa]
          Length = 511

 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 14  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 73

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 74  VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 123

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 124 DISFNVQNGVRA 135


>gi|301756837|ref|XP_002914273.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
           [Ailuropoda melanoleuca]
          Length = 593

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 143 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 202

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 203 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 252

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 253 DISFNVQNGVRA 264


>gi|410905163|ref|XP_003966061.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
          Length = 778

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 54/197 (27%), Positives = 84/197 (42%), Gaps = 26/197 (13%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    I P    E  R++V+N I+R+I   +    V  FGS     YLP  DIDL
Sbjct: 238 HEEIMDFFNFISPRPEEEAMRRDVVNRIERVIKDLWPTARVEIFGSFSTGLYLPTSDIDL 297

Query: 109 TVLG---HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDI 163
            V G   H  ++E     + + L+  +    + IK +    A V I+K +     + VDI
Sbjct: 298 VVFGKWDHPPLQE-----LEQALKKRNVAGPYPIKVLD--KATVPIIKLTDHETEVKVDI 350

Query: 164 SFNQMAGLSALCFLEKVDRLIGKDHLFKRSV----ILIKAWCYYESRILGAHYGLISTYA 219
           SFN    + A  F+        K +L K +V    I +      +  +     G I +Y+
Sbjct: 351 SFNVETAVKAAQFI--------KSYLKKYTVLPPLIFVLKQFLLQRDLNEVFTGGIGSYS 402

Query: 220 LEMMVLHRFLDYYNTFD 236
           L +M +  FL  +   D
Sbjct: 403 LILMAI-SFLQLHPRID 418


>gi|281338901|gb|EFB14485.1| hypothetical protein PANDA_002140 [Ailuropoda melanoleuca]
          Length = 632

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 135 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 194

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 195 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 244

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 245 DISFNVQNGVRA 256


>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
           MF3/22]
          Length = 732

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 79/181 (43%), Gaps = 18/181 (9%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDI 106
           L  + ++  L  + P  V  + R  V+  I   I   Y   EV PFGS   K YLP GDI
Sbjct: 148 LMHKEVEAYLKYVSPTPVEHEVRWMVVQLISSSIKRVYSDSEVLPFGSFGTKLYLPQGDI 207

Query: 107 DLTVLGHQNVEEDLARF----VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIP 160
           DL V         LA F      K L N  + +    K      A+V I+K +       
Sbjct: 208 DLVVQSRT-----LASFEKVTALKSLANIVKRTGLADKVTIISQARVPIIKFTTLYGRFA 262

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYA 219
           VDIS NQ  G+      + ++R + +    +  V+++K+  + + R L   Y G + +YA
Sbjct: 263 VDISMNQSNGVKT---GDMINRFLNEFPALRAIVLIVKS--FLKQRNLNEVYSGGLGSYA 317

Query: 220 L 220
           +
Sbjct: 318 I 318


>gi|256818788|ref|NP_001157971.1| PAP-associated domain-containing protein 5 isoform b [Mus musculus]
          Length = 637

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 64/142 (45%), Gaps = 19/142 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 183 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 242

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 243 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 292

Query: 162 DISFNQMAGLSALCFLEKVDRL 183
           DISFN   G+ A   ++   +L
Sbjct: 293 DISFNVQNGVRAADLIKDFTKL 314


>gi|354479063|ref|XP_003501733.1| PREDICTED: DNA polymerase sigma-like [Cricetulus griseus]
          Length = 624

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 55/195 (28%), Positives = 87/195 (44%), Gaps = 15/195 (7%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           W L EE I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  
Sbjct: 69  WRLHEE-IIDFYNFMSPCPEEAAMRREVVTRIETVVKDLWPTADVQIFGSFSTGLYLPTS 127

Query: 105 DIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVD 162
           DIDL V G    E    + + + L   +      IK +    A V I+K + Q   + VD
Sbjct: 128 DIDLVVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVD 183

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
           ISFN   G+ A  F++     + K  L    ++++K   +   R L   + G IS+Y+L 
Sbjct: 184 ISFNMETGVRAAEFIKN---YMKKYSLLPYLILVLKQ--FLLQRDLNEVFTGGISSYSLI 238

Query: 222 MMVLHRFLDYYNTFD 236
           +M +  FL  +   D
Sbjct: 239 LMAIS-FLQLHPRID 252


>gi|119603153|gb|EAW82747.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
 gi|119603154|gb|EAW82748.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
          Length = 527

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 30  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 89

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 90  VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 139

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 140 DISFNVQNGVRA 151


>gi|355710188|gb|EHH31652.1| hypothetical protein EGK_12764, partial [Macaca mulatta]
          Length = 564

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 67  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 126

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 127 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 176

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 177 DISFNVQNGVRA 188


>gi|444729764|gb|ELW70170.1| DNA polymerase sigma [Tupaia chinensis]
          Length = 729

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 78  RREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 135

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 136 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 190

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 191 YSLLPYLILVLKQF-LLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 238


>gi|395857085|ref|XP_003800943.1| PREDICTED: DNA polymerase sigma [Otolemur garnettii]
          Length = 837

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 83/194 (42%), Gaps = 20/194 (10%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  DIDL
Sbjct: 286 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 345

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 346 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 401

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
              G+ A  F+        K+++ K S    +IL+      +  +     G IS+Y+L +
Sbjct: 402 METGVRAAEFI--------KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 453

Query: 223 MVLHRFLDYYNTFD 236
           M +  FL  +   D
Sbjct: 454 MAIS-FLQLHPRID 466


>gi|219518398|gb|AAI44798.1| Papd5 protein [Mus musculus]
          Length = 590

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 64/142 (45%), Gaps = 19/142 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 136 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 195

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 196 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 245

Query: 162 DISFNQMAGLSALCFLEKVDRL 183
           DISFN   G+ A   ++   +L
Sbjct: 246 DISFNVQNGVRAADLIKDFTKL 267


>gi|432884542|ref|XP_004074488.1| PREDICTED: uncharacterized protein LOC101158959 [Oryzias latipes]
          Length = 421

 Score = 47.8 bits (112), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 7/137 (5%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E + +    + P       RKEV+  I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 243 HEEVMDFYKYMSPRPEEAAMRKEVVKRIETIIKEQWPSADVQIFGSFSTGLYLPTSDIDL 302

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +    F IK +    A V I+K + Q   + VDISFN
Sbjct: 303 VVFG--KWERPPLQELEQALRKHNVAEPFSIKVLD--KATVPIIKLTDQETEVKVDISFN 358

Query: 167 QMAGLSALCFLEKVDRL 183
              G+ A  F++   +L
Sbjct: 359 VETGVKAASFIKDYVKL 375


>gi|224112707|ref|XP_002316267.1| predicted protein [Populus trichocarpa]
 gi|222865307|gb|EEF02438.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score = 47.8 bits (112), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 129/310 (41%), Gaps = 48/310 (15%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDI 106
           E  +++IL  IQP       R +VI  ++ ++       G  V PFGS     +   GD+
Sbjct: 2   EPTLKDILNGIQPLREDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNLFTRWGDL 61

Query: 107 DLTVL---------GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSV 156
           D++++           +  +++L   V K L           + +Q++P A+V I+K   
Sbjct: 62  DISIVLSNGSYISSAGKRRKQNLLEDVLKALRQRGG-----WQRLQFIPNARVPILKFEN 116

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
            +I  D+S + M GL    FL  ++ +   D  F+  V+L+K W    + I     G ++
Sbjct: 117 ASISCDVSIDNMQGLMKSKFLFWINEI---DRRFRDMVLLVKEWAKTHN-INNPKTGSLN 172

Query: 217 TYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
           +Y+L ++V+  F            C+    P  +  L EI    V +D   +    E  +
Sbjct: 173 SYSLSLLVIFHF----------QTCV----PAILPPLKEIYPRNVIDDLTGVRTDAE--R 216

Query: 277 KCREIYSVPITAL---------ENTGHEFAIKHL-NIVDPLKDNNNLGRSVSKGNFHRIR 326
           +  EI +  I+            N+  E  I  L  I DP +   N  R+VS  N  +I 
Sbjct: 217 RIGEICAANISRYRSNKSRAINRNSLSELFISFLTKIEDPFEQPENTARAVSAANLMKIS 276

Query: 327 CALSYGAQRL 336
            A+     RL
Sbjct: 277 EAIQTTHHRL 286


>gi|396490001|ref|XP_003843230.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
 gi|312219809|emb|CBX99751.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
          Length = 717

 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 100/243 (41%), Gaps = 50/243 (20%)

Query: 42  DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI-----EVFPFGSVP 96
           D E WL  E  I +    + P     ++R  ++  +Q ++ GY+        +  FGS P
Sbjct: 348 DPEKWLHNE--ILDFYDFVAPKPYEHEQRNLLVQRVQSVL-GYHRFPQDNGRILCFGSFP 404

Query: 97  LKTYLPDGDIDLTV--------------LGHQNVEEDLARFVCKILENEDQDSVFEIKDV 142
              YLP  D+DL                +  +N    L + V  +L+   ++  F     
Sbjct: 405 AGLYLPTADMDLVYTSDRHFNGGPPVMDVTARNATAPLLKGVRNVLQR--RNMAFGAISC 462

Query: 143 QYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-LCFLEKVDRLIGKDHLFKRSVILIKA 199
            Y  A+V +VK   SV  + VDISF  ++G+ A   F +  D+    D ++   V L+K 
Sbjct: 463 IYG-AKVPLVKFTDSVTRLQVDISFENLSGMQAQATFAQWKDKY--PDMIYM--VALLKQ 517

Query: 200 WCYYESRILGAHYGLISTYALEMMVLH----------------RFLDYY-NTFDWDNYCI 242
           +      +   H G I  +A+  +++H                 FLDYY N FD   + I
Sbjct: 518 FLVMRG-LNEVHTGGIGGFAIICLIVHYIHQAGKAENLAELFKGFLDYYGNKFDLTKHRI 576

Query: 243 SIN 245
            +N
Sbjct: 577 QMN 579


>gi|326676716|ref|XP_686065.4| PREDICTED: DNA polymerase sigma [Danio rerio]
          Length = 706

 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 89/200 (44%), Gaps = 26/200 (13%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV++ I+ +I   +   +V  FGS     +LP  DIDL
Sbjct: 211 HEEIMDFYNFMSPRPEEATMRQEVVDRIESVIKELWPTADVQIFGSFSTGLFLPTSDIDL 270

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYV-PAQVKIVKCSVQ--NI 159
            V G       Q +E+ L +            SV E   ++ +  A V I+K + Q   +
Sbjct: 271 VVFGKWEKPPLQQLEQALRKH-----------SVAEPYSIKVLDKATVPIIKLTDQETEV 319

Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
            VDISFN   G+ A  F+++    + K  +    + ++K +   +  +     G IS+Y+
Sbjct: 320 KVDISFNVETGIKAASFIKE---YVKKYTVLPYLIFVLKQFL-LQRDLNEVFTGGISSYS 375

Query: 220 LEMMVLHRFLDYYNTFDWDN 239
           L +MV+  FL  +   D  N
Sbjct: 376 LILMVI-SFLQLHPRIDTRN 394


>gi|351706462|gb|EHB09381.1| DNA polymerase sigma [Heterocephalus glaber]
          Length = 542

 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 80/172 (46%), Gaps = 16/172 (9%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G      +  R   ++L
Sbjct: 11  RREVVQRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG------EWERPPLQLL 64

Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
           E    + SV E   ++ +  A V I+K + Q   + VDISFN   G+ A  F++      
Sbjct: 65  EQALRKHSVAEPCSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNY---- 120

Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
            K++     +IL+      +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 121 MKEYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|119603155|gb|EAW82749.1| PAP associated domain containing 5, isoform CRA_b [Homo sapiens]
          Length = 374

 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 30  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 89

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 90  VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 139

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 140 DISFNVQNGVRA 151


>gi|222619531|gb|EEE55663.1| hypothetical protein OsJ_04061 [Oryza sativa Japonica Group]
          Length = 461

 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 85/372 (22%), Positives = 146/372 (39%), Gaps = 92/372 (24%)

Query: 39  FSIDAEL-WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN------GYYGIEVFP 91
           FS+   + + + E+ ++ IL  I+P    E  R++ ++ IQ L N         G    P
Sbjct: 22  FSMTEHINYDVVEQCVKNILSLIKPV---EDDRRKRLSAIQELSNSIPKVAALRGAVFKP 78

Query: 92  FGSVPLKTYLPDGDIDLTV------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYV 145
           FGS     Y   GD+D++V      +  +  ++ + R + ++L+N           VQ+V
Sbjct: 79  FGSFVSNLYSNSGDLDISVHLPNNSIISKKKKQYVLRELMRVLQNRGVAGY-----VQFV 133

Query: 146 P-AQVKIVK--CSVQNIPVDISFNQMAGL---SALCFLEKVDRLIGKDHLFKRSVILIKA 199
           P A+V +++   +   I  DIS N   G       C++  +D   G        V+LIK 
Sbjct: 134 PFARVPVLQYVSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------MVLLIKE 187

Query: 200 WCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
           W   ++ I     G +++Y+L ++VL  F            C     P  +  L EI   
Sbjct: 188 WAKAQN-INDPKTGTLNSYSLCLLVLFHF----------QTC----EPAILPPLKEIYEG 232

Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN-IVDPLKDNNNLGRSVS 318
            +E    E+ +  E                         +HL+ + DP++  +N  R+V 
Sbjct: 233 NIEEGIAEMTVYDE-------------------------EHLDEVKDPIERPDNAARAVD 267

Query: 319 KGNFHRIRCALSYGAQRLG-----------EILTLP--GHCLGMGLEKFFI-----NTLE 360
                RI  A +   ++             E+L  P  G  LG+ + +  +     NT  
Sbjct: 268 LKGLERIAGAFTAANRKFASLQHAKRNDLLEMLCTPAVGSKLGIRVSQEVMANSCTNTPR 327

Query: 361 RNGKGLRPDVQV 372
           RN    RP  QV
Sbjct: 328 RNRHHGRPTEQV 339


>gi|402871130|ref|XP_003899535.1| PREDICTED: DNA polymerase sigma [Papio anubis]
          Length = 775

 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 83/194 (42%), Gaps = 20/194 (10%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  DIDL
Sbjct: 224 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 283

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 284 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 339

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
              G+ A  F+        K+++ K S    +IL+      +  +     G IS+Y+L +
Sbjct: 340 METGVRAAEFI--------KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 391

Query: 223 MVLHRFLDYYNTFD 236
           M +  FL  +   D
Sbjct: 392 MAI-SFLQLHPRID 404


>gi|119603156|gb|EAW82750.1| PAP associated domain containing 5, isoform CRA_c [Homo sapiens]
          Length = 371

 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 45/132 (34%), Positives = 61/132 (46%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 30  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 89

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +      +  D+DSV ++ D   VP    I+K   S   + V
Sbjct: 90  VFGKWENLPLWTLEEALRKH-----KVADEDSV-KVLDKATVP----IIKLTDSFTEVKV 139

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 140 DISFNVQNGVRA 151


>gi|431900711|gb|ELK08155.1| DNA polymerase sigma [Pteropus alecto]
          Length = 539

 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVGRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 647

 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 68/154 (44%), Gaps = 13/154 (8%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
           ++  +  + P  + ++ R   +  I R I+  Y   +V PFGS   K YLP GDIDL + 
Sbjct: 162 VEAFVDYMSPTPIEDEVRSLSVQLIARAISKSYPDAKVLPFGSYETKLYLPSGDIDLVIY 221

Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMA 169
            H  +  D    V   L N  + +    +      A+V I+K   +     VDIS NQ  
Sbjct: 222 SHSMMRMDKVS-VLHSLANIMKRAGITDRVTIIAKAKVPIIKFVTAHGRFSVDISVNQGN 280

Query: 170 GLSALCFLEKVDR---------LIGKDHLFKRSV 194
           G+     +++  R         LI K+ L +RS+
Sbjct: 281 GVDTGKMVKQFLRELPALRSLVLIIKNFLSQRSM 314


>gi|341895116|gb|EGT51051.1| hypothetical protein CAEBREN_16945 [Caenorhabditis brenneri]
          Length = 901

 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 123/265 (46%), Gaps = 29/265 (10%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQY 144
           I+V  FGS+    +LP  DID+ V   + +    D      + LE ++      +    +
Sbjct: 129 IKVSMFGSLRTTLFLPTSDIDVLVECDEWIGTPGDWLTETARGLEIDNIAESVSVYGGAF 188

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           VP  VK+V    + + +DISFN + G+ A  +++KV        L +  V+L+K + +Y 
Sbjct: 189 VPI-VKMVDRDTR-LSIDISFNTVQGVRAASYIDKVKEEFP---LIEPLVLLLKQFLHY- 242

Query: 205 SRILGAHY-GLISTYALEMMVLHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVE 262
            R L   + G +S+Y L +++++ F  Y  N      Y   +N    +    E+ +  +E
Sbjct: 243 -RNLNQTFTGGLSSYGLVLLLVNFFQLYALNMRHRTIYDRGVNLGHLLLRFLEVYS--LE 299

Query: 263 NDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIK-----HLNIVDPLKDNNNLGRSV 317
            + +E+ +SP    +C   Y VP +A    G  + +K     +L + DPL   N++GRS 
Sbjct: 300 FNYEEIGISP---GQC--CY-VPKSA---AGARYGLKRAQPGNLALEDPLLTANDVGRST 350

Query: 318 SKGNFHRIRCALSYGAQRLGEILTL 342
              NF  I  A     Q L   +TL
Sbjct: 351 Y--NFSSIANAFGQAFQILLVAVTL 373


>gi|255566462|ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis]
 gi|223536493|gb|EEF38140.1| zinc finger protein, putative [Ricinus communis]
          Length = 493

 Score = 47.0 bits (110), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 82/170 (48%), Gaps = 25/170 (14%)

Query: 71  KEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVL---------GHQNVEEDLA 121
           K+VI  I+ L     G  V PFGS     +   GD+D++++           +  ++++ 
Sbjct: 36  KDVIASIESL----RGATVEPFGSFVSNLFTRWGDLDISIMLANGSYISSAAKKRKQNVL 91

Query: 122 RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLSALCFLE 178
           R   K L  +        + +Q+VP A+V ++K     QNI  D+S + + G     FL 
Sbjct: 92  REFHKALRQKG-----GWRRLQFVPNARVPLLKFESGRQNISCDVSIDNLQGQIKSNFLF 146

Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRF 228
            ++++ G+   F+  V+L+K W    + I     G +++Y+L ++V+  F
Sbjct: 147 WLNQIDGR---FRDMVLLVKEWAKAHN-INNPKTGTLNSYSLSLLVIFHF 192


>gi|341883718|gb|EGT39653.1| hypothetical protein CAEBREN_22894 [Caenorhabditis brenneri]
          Length = 901

 Score = 47.0 bits (110), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 123/265 (46%), Gaps = 29/265 (10%)

Query: 87  IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQY 144
           I+V  FGS+    +LP  DID+ V   + +    D      + LE ++      +    +
Sbjct: 129 IKVSMFGSLRTTLFLPTSDIDVLVECDEWIGTPGDWLTETARGLEIDNIAESVSVYGGAF 188

Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           VP  VK+V    + + +DISFN + G+ A  +++KV        L +  V+L+K + +Y 
Sbjct: 189 VPI-VKMVDRDTR-LSIDISFNTVQGVRAASYIDKVKEEFP---LIEPLVLLLKQFLHY- 242

Query: 205 SRILGAHY-GLISTYALEMMVLHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVE 262
            R L   + G +S+Y L +++++ F  Y  N      Y   +N    +    E+ +  +E
Sbjct: 243 -RNLNQTFTGGLSSYGLVLLLVNFFQLYALNMRHRTIYDRGVNLGHLLLRFLEVYS--LE 299

Query: 263 NDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIK-----HLNIVDPLKDNNNLGRSV 317
            + +E+ +SP    +C   Y VP +A    G  + +K     +L + DPL   N++GRS 
Sbjct: 300 FNYEEIGISP---GQC--CY-VPKSA---AGARYGLKRAQPGNLALEDPLLTANDVGRST 350

Query: 318 SKGNFHRIRCALSYGAQRLGEILTL 342
              NF  I  A     Q L   +TL
Sbjct: 351 Y--NFSSIANAFGQAFQILLVAVTL 373


>gi|431914108|gb|ELK15367.1| PAP-associated domain-containing protein 5 [Pteropus alecto]
          Length = 530

 Score = 47.0 bits (110), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 65/148 (43%), Gaps = 19/148 (12%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFG 93
            LYP  + +       E I +    + P    EK R EV++ I+ +I   +   +V  FG
Sbjct: 17  QLYPGLVRSAQTGRLHEEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFG 76

Query: 94  SVPLKTYLPDGDIDLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPA 147
           S     YLP  DIDL V G         +EE L +         D+DSV ++ D   VP 
Sbjct: 77  SFKTGLYLPTSDIDLVVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP- 129

Query: 148 QVKIVKC--SVQNIPVDISFNQMAGLSA 173
              I+K   S   + VDISFN   G+ A
Sbjct: 130 ---IIKLTDSFTEVKVDISFNVQNGVRA 154


>gi|293332275|ref|NP_001169645.1| hypothetical protein [Zea mays]
 gi|224030617|gb|ACN34384.1| unknown [Zea mays]
 gi|414879730|tpg|DAA56861.1| TPA: hypothetical protein ZEAMMB73_892019 [Zea mays]
          Length = 574

 Score = 47.0 bits (110), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 21/190 (11%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY---GIEVFPFGSVPLKTYLPDGDIDLT 109
           I++IL  I+P      KR   I  ++  I+      G  V PFGS     Y   GD+DL+
Sbjct: 15  IKDILALIKPVEDDRSKRLSTIQELENCIHSLASLSGAAVKPFGSFVSDLYSKSGDLDLS 74

Query: 110 V-LG-------HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK-CSVQ-N 158
           V  G       ++  +++  R V K L +     V     +Q++P A+V +++  S Q  
Sbjct: 75  VQFGNGSNHPINKKKKQNALRDVRKALLSR---GVTGYMQMQFIPHARVPVLQYVSKQFG 131

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           I  DIS    AG         V+ +   D  F   V+LIK W   ++ I     G +++Y
Sbjct: 132 ISCDISIGNFAGRIKSKIFYWVNTV---DERFGDMVLLIKEWAKAQN-INDPKSGTLNSY 187

Query: 219 ALEMMVLHRF 228
           +L ++VL+ F
Sbjct: 188 SLCLLVLYHF 197


>gi|358055188|dbj|GAA98957.1| hypothetical protein E5Q_05645 [Mixia osmundae IAM 14324]
          Length = 813

 Score = 47.0 bits (110), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 47/188 (25%), Positives = 83/188 (44%), Gaps = 9/188 (4%)

Query: 45  LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPD 103
           + +L +E +Q     + P+      R  VI+ ++  +   +   ++ PFGS   + YLP 
Sbjct: 280 IQMLTKE-LQAFKAYVTPSRAEHAFRGHVIDQVRNALRQIWADTDLQPFGSYLTQLYLPG 338

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPV 161
           GDIDL +L      +  +R + +I +   +D+      V    A+V IVK   +     +
Sbjct: 339 GDIDLVMLSATAASQTPSRVLHRIAQIM-RDANIGYDFVVISRAKVPIVKFISTTGGFNI 397

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
           DIS NQ  G+ A      V R+I  +       ++     Y   R +   + G + +Y+L
Sbjct: 398 DISLNQPGGIRAGTV---VQRMIDHNGGEAARTLIYCIKLYLSIRGMSEVFTGGLGSYSL 454

Query: 221 EMMVLHRF 228
             MV+  F
Sbjct: 455 ICMVISFF 462


>gi|126139251|ref|XP_001386148.1| Poly(A) polymerase PAPalpha (Polynucleotide adenylyltransferase
           alpha) [Scheffersomyces stipitis CBS 6054]
 gi|126093430|gb|ABN68119.1| Poly(A) polymerase PAPalpha (Polynucleotide adenylyltransferase
           alpha) [Scheffersomyces stipitis CBS 6054]
          Length = 556

 Score = 47.0 bits (110), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 48/226 (21%), Positives = 90/226 (39%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
           + KKR EV+   Q+++  +                  G +VF FGS  L  Y P  D+D 
Sbjct: 43  ATKKRVEVLTQFQKMVQEFVFTVSKKKNMSDGMAKDAGGKVFTFGSYRLGVYGPGSDLDT 102

Query: 109 TVLGHQNV-EEDLARFVCKILENEDQ--------DSVFEIKDVQY--VPAQVKIVKCSVQ 157
            V+  Q+V  ED      +I+    +        D+   I  ++Y  VP  +     +V 
Sbjct: 103 LVVVPQHVSREDFFTVFAEIIRKRPELEEIVPVPDAFVPIIKIEYDGVPLDLIFACLNVP 162

Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            IP+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   ++ 
Sbjct: 163 RIPLDMTLDDKNLLRNLDERDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKMWAQQKA- 221

Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           + G  +G     A  M+V               + +F + Y  ++W
Sbjct: 222 VYGNVFGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFNIYTKWNW 267


>gi|449326076|gb|AGE92663.1| PAPD7 [Homo sapiens]
          Length = 772

 Score = 47.0 bits (110), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 52/194 (26%), Positives = 83/194 (42%), Gaps = 20/194 (10%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  DIDL
Sbjct: 221 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 280

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 281 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 336

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
              G+ A  F+        K+++ K S    +IL+      +  +     G IS+Y+L +
Sbjct: 337 METGVRAAEFI--------KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 388

Query: 223 MVLHRFLDYYNTFD 236
           M +  FL  +   D
Sbjct: 389 MAI-SFLQLHPRID 401


>gi|218189365|gb|EEC71792.1| hypothetical protein OsI_04417 [Oryza sativa Indica Group]
          Length = 557

 Score = 47.0 bits (110), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 83/362 (22%), Positives = 141/362 (38%), Gaps = 91/362 (25%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN------GYYGIEVFPFGSVPLKTYL 101
           + E+ ++ IL  I+P    E  R++ ++ IQ L N         G    PFGS     Y 
Sbjct: 9   VVEQCVKNILSLIKPV---EDDRRKRLSAIQELSNSIPKVAALRGAVFKPFGSFVSNLYS 65

Query: 102 PDGDIDLTV------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK- 153
             GD+D++V      +  +  ++ + R + ++L+N           VQ++P A+V +++ 
Sbjct: 66  NSGDLDISVQLPNNSIISKKKKQYVLRELMRVLQNRGVAGY-----VQFIPFARVPVLQY 120

Query: 154 -CSVQNIPVDISFNQMAGL---SALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
             +   I  DIS N   G       C++  +D   G        V+LIK W   ++ I  
Sbjct: 121 VSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------MVLLIKEWAKAQN-IND 173

Query: 210 AHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL 269
              G +++Y+L ++VL  F            C     P  +  L EI    +E    E+ 
Sbjct: 174 PKTGTLNSYSLCLLVLFHF----------QTC----EPAILPPLKEIYEGNIEEGIAEMT 219

Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIV-DPLKDNNNLGRSVSKGNFHRIRCA 328
           +  E                         +HL+ V DP++  +N  R+V      RI  A
Sbjct: 220 VYDE-------------------------EHLDEVEDPIERPDNAARAVGLKGLERIAGA 254

Query: 329 LSYGAQRLG-----------EILTLP--GHCLGMGLEKFFI-----NTLERNGKGLRPDV 370
            +   ++             E+L  P  G  LG+ + +  +     NT  RN    RP  
Sbjct: 255 FTAANRKFASLQHAKRNDLLEMLCTPAVGSKLGIRVSQEVMANSCTNTPRRNRHHGRPTE 314

Query: 371 QV 372
           QV
Sbjct: 315 QV 316


>gi|335304248|ref|XP_003134203.2| PREDICTED: DNA polymerase sigma [Sus scrofa]
          Length = 625

 Score = 46.6 bits (109), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 53/191 (27%), Positives = 83/191 (43%), Gaps = 14/191 (7%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   EV  FGS     YLP  DIDL
Sbjct: 74  HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTAEVQIFGSFSTGLYLPTSDIDL 133

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L          IK +    A V I+K + Q   + VDISFN
Sbjct: 134 VVFG--KWERPPLQLLEQALRKHKVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 189

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
              G+ A  F++     + K  L    ++++K   +   R L   + G IS+Y+L +M +
Sbjct: 190 METGVRAAEFIKN---YMKKYSLLPYLILVLKQ--FLLQRDLNEVFTGGISSYSLILMAI 244

Query: 226 HRFLDYYNTFD 236
             FL  +   D
Sbjct: 245 S-FLQLHPRID 254


>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
 gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
           in germ line development protein 4; AltName:
           Full=Germline development defective-4
 gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
 gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
          Length = 845

 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 76/300 (25%), Positives = 134/300 (44%), Gaps = 24/300 (8%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDID 107
           E I ++   I+P  +  + R +V   ++  +   +    I++  FGS+    +LP  DID
Sbjct: 82  EEIVDMYHWIKPNEIESRLRTKVFEKVRDSVLRRWKQKTIKISMFGSLRTNLFLPTSDID 141

Query: 108 LTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISF 165
           + V     V    D      + LE ++      +    +VP  VK+V    + + +DISF
Sbjct: 142 VLVECDDWVGTPGDWLAETARGLEADNIAESVMVYGGAFVPI-VKMVDRDTR-LSIDISF 199

Query: 166 NQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMV 224
           N + G+ A  ++ KV        L +  V+L+K + +Y  R L   + G +S+Y L +++
Sbjct: 200 NTVQGVRAASYIAKVKEEFP---LIEPLVLLLKQFLHY--RNLNQTFTGGLSSYGLVLLL 254

Query: 225 LHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYS 283
           ++ F  Y  N      Y   +N    +    E+   ++E + +E+ +SP    +C   Y 
Sbjct: 255 VNFFQLYALNMRSRTIYDRGVNLGHLLLRFLELY--SLEFNFEEMGISP---GQC--CYI 307

Query: 284 VPITALENTGHEFAI-KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTL 342
               +    GH+ A   +L + DPL   N++GRS    NF  I  A     Q L   +TL
Sbjct: 308 PKSASGARYGHKQAQPGNLALEDPLLTANDVGRSTY--NFSSIANAFGQAFQILLVAVTL 365


>gi|402083045|gb|EJT78063.1| DNA polymerase sigma [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 732

 Score = 46.6 bits (109), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 35/144 (24%), Positives = 67/144 (46%), Gaps = 21/144 (14%)

Query: 45  LWLLAEERIQEILCTIQPAIVSEKKRK----EVINYIQRLINGYYGIEVFPFGSVPLKTY 100
           +WL  ++ I++    ++P    EK R+    E++ +++R  N      V+PFGS     Y
Sbjct: 400 VWL--DKEIKDFYDYVKPTHFEEKLRQSLVDELVTHVRRTWND---ASVYPFGSFKSGLY 454

Query: 101 LPDGDIDLTVLGHQNVEEDLARFV--------CKILENEDQDSVFEIKDVQYVPAQVKIV 152
           LP GD+DL     + +   +A++          + +E         ++ +    A+V +V
Sbjct: 455 LPTGDMDLVFCSDKYLSRHIAQYTPKKQVFHFARFVEKRGLAHQHRVERIHK--ARVPLV 512

Query: 153 KC--SVQNIPVDISFNQMAGLSAL 174
           K   +   + VDISF    G++A+
Sbjct: 513 KYVDARTGLKVDISFENSTGITAV 536


>gi|74003095|ref|XP_545180.2| PREDICTED: DNA polymerase sigma [Canis lupus familiaris]
          Length = 672

 Score = 46.6 bits (109), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 12/190 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  DIDL
Sbjct: 121 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDL 180

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 181 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 236

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
              G+ A    E +   + K  L    ++++K +   +  +     G IS+Y+L +M + 
Sbjct: 237 METGVRA---AELIKNYMKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS 292

Query: 227 RFLDYYNTFD 236
            FL  +   D
Sbjct: 293 -FLQLHPRID 301


>gi|47209824|emb|CAF91228.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 964

 Score = 46.6 bits (109), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 13/135 (9%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    I P    E  R++V+N I+++I   +   +V  FGS     YLP  DIDL
Sbjct: 451 HEEIMDFFNFISPRPEEEAMRRDVVNRIEKVIKDLWPTAQVEIFGSFSTGLYLPTSDIDL 510

Query: 109 TVLG---HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDI 163
            V G   H  ++E     + + L+  +    + IK +    A V I+K +     + VDI
Sbjct: 511 VVFGKWDHPPLQE-----LEQALKKRNVAGPYPIKVLD--KATVPIIKLTDHETEVKVDI 563

Query: 164 SFNQMAGLSALCFLE 178
           SFN    + A  F++
Sbjct: 564 SFNVETAVKAAQFIK 578


>gi|157819111|ref|NP_001100803.1| DNA polymerase sigma [Rattus norvegicus]
 gi|149032753|gb|EDL87608.1| polymerase (DNA directed) sigma (predicted) [Rattus norvegicus]
          Length = 541

 Score = 46.6 bits (109), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPSADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|440905114|gb|ELR55540.1| DNA polymerase sigma [Bos grunniens mutus]
          Length = 542

 Score = 46.6 bits (109), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|256818784|ref|NP_001157969.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
 gi|256818786|ref|NP_001157970.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
          Length = 680

 Score = 46.6 bits (109), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 183 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 242

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 243 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 292

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 293 DISFNVQNGVRA 304


>gi|237512970|ref|NP_001153629.1| DNA polymerase sigma [Bos taurus]
          Length = 541

 Score = 46.6 bits (109), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor
           FP-101664 SS1]
          Length = 660

 Score = 46.6 bits (109), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 47/162 (29%), Positives = 72/162 (44%), Gaps = 18/162 (11%)

Query: 43  AELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYL 101
           AE++   E  ++  +  I P  + ++ R  V+  + R +   Y   +V PFGS   K YL
Sbjct: 164 AEMYARIE--VEAFVKYISPTPIEDEVRSLVVALVSRAVTRTYTDAQVLPFGSYETKLYL 221

Query: 102 PDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNI 159
           P GDIDL +        D    V   L N  + +    +      A+V I+K   +    
Sbjct: 222 PLGDIDLVIYSQSMARMDRVS-VLHSLANIVKRAGITDRVTIIAKAKVPIIKFVTTHGRF 280

Query: 160 PVDISFNQMAGLSA----LCFLEKVDRLIGKDHLFKRSVILI 197
            VDIS NQ  G++A      FLE++  L        RS++LI
Sbjct: 281 SVDISINQGNGVTAGKMVKQFLEELPAL--------RSLVLI 314


>gi|281354437|gb|EFB30021.1| hypothetical protein PANDA_012597 [Ailuropoda melanoleuca]
          Length = 542

 Score = 46.6 bits (109), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 77/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A    E +   + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRA---AELIKNYMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|449465848|ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
 gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
          Length = 464

 Score = 46.2 bits (108), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 49/193 (25%), Positives = 88/193 (45%), Gaps = 16/193 (8%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRL---INGYYGIEVFPFGSVPLKTYLPD 103
           L  +  I++IL  ++P       R +VIN ++ +   I    G  + PFGS     +   
Sbjct: 4   LTLDRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRW 63

Query: 104 GDIDLTVLGHQNVEEDLA-----RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCS-- 155
           GD+DL+V  +       A     + + + ++N  + +    K +Q +P A+V I+K    
Sbjct: 64  GDLDLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYK-LQLIPHARVPILKIEHI 122

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
             NI  DIS + + G      L  V+ + G+   F   V+L+K W      I  +  G  
Sbjct: 123 QHNISCDISIDNLVGQIKSKILLWVNEIDGR---FHDMVLLVKEWAKAHD-INNSKQGTF 178

Query: 216 STYALEMMVLHRF 228
           ++Y+L ++V+  F
Sbjct: 179 NSYSLSLLVIFHF 191


>gi|395510432|ref|XP_003759479.1| PREDICTED: poly(A) RNA polymerase GLD2 [Sarcophilus harrisii]
          Length = 729

 Score = 46.2 bits (108), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 6/126 (4%)

Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
           DGD+ L V      ++  AR++  +++      +  I+  Q +PA+V IVK    V  + 
Sbjct: 464 DGDLCLVVKEEPVNQKTEARYILSLVQKHFCTRLCYIERPQLIPAKVPIVKFRDKVSCVE 523

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            D++ N + G+     L     L  +    +  V+++K W  +   I  A  G +++Y+L
Sbjct: 524 FDLNVNNIVGIRNTFLLRTYAYLESR---VRPLVLVVKKWASHHE-INDASRGTLNSYSL 579

Query: 221 EMMVLH 226
            +MVLH
Sbjct: 580 VLMVLH 585


>gi|60392891|sp|Q68ED3.2|PAPD5_MOUSE RecName: Full=PAP-associated domain-containing protein 5; AltName:
           Full=Topoisomerase-related function protein 4-2;
           Short=TRF4-2
 gi|148878177|gb|AAI45738.1| Papd5 protein [Mus musculus]
 gi|219519562|gb|AAI44797.1| Papd5 protein [Mus musculus]
          Length = 633

 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 136 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 195

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 196 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 245

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 246 DISFNVQNGVRA 257


>gi|198469802|ref|XP_001355128.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
 gi|198147042|gb|EAL32185.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
          Length = 1202

 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 131/331 (39%), Gaps = 61/331 (18%)

Query: 22  SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
           +H +  + Y   P   P     E  +   E I+     + P       R EV+  I+ ++
Sbjct: 284 NHMAMIAKYKGEPWRKPDYPYGEGVVGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIENVV 343

Query: 82  NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD-SVFEI 139
           +  +   V   FGS     +LP  DIDL VLG   + E L     + LE E  +  + E 
Sbjct: 344 HTIWPQAVVEIFGSFRTGLFLPTSDIDLVVLG---LWEKLP---LRTLEYELVNRGIAEA 397

Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
             V+ +  A V I+K + +   + VDISFN  +G+ +   ++K  R   ++GK  L  + 
Sbjct: 398 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 457

Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSL 253
            +L++        +     G IS+Y+L +M                 CIS      +   
Sbjct: 458 FLLLRD-------LNEVFTGGISSYSLILM-----------------CISF-----LQLH 488

Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN----------- 302
           P  +     N G  LL   E   +      + I +++N G   A   L            
Sbjct: 489 PRGIYHDTTNLGVLLLEFFELYGRRFNYMRIGI-SIKNGGRYMAKDELQRDMVDGHRPSL 547

Query: 303 --IVDPLKDNNNLGRSVSKGNFH---RIRCA 328
             I DPL   N++GRS S G FH     +CA
Sbjct: 548 LCIEDPLTPGNDIGRS-SYGVFHVQQAFKCA 577


>gi|49899785|gb|AAH76872.1| LOC445836 protein, partial [Xenopus laevis]
          Length = 563

 Score = 46.2 bits (108), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 83/185 (44%), Gaps = 25/185 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 71  EEIIDFYKYMSPRPEEEKMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 130

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D++SV ++ D   VP    I+K   S   + V
Sbjct: 131 VFGKWENLPLWTLEEALRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKV 180

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
           DISFN   G+ A   ++     I K  +    V+++K +     R L   + G I +Y+L
Sbjct: 181 DISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 235

Query: 221 EMMVL 225
            +M +
Sbjct: 236 FLMAV 240


>gi|330792667|ref|XP_003284409.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
 gi|325085656|gb|EGC39059.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
          Length = 1460

 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 85/166 (51%), Gaps = 17/166 (10%)

Query: 72   EVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKIL 128
            +VI +I+ ++   +   ++  FGS     +LP  DID+ V    N  ++   A+F+ K+L
Sbjct: 1091 QVIGWIRAVVKKLWSHADLDLFGSFMTGLWLPSSDIDIVVNYGNNMSIKPKNAQFLLKVL 1150

Query: 129  ENEDQDSV--FEIKDVQYVPAQVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFLEK- 179
            E + ++ +  F +  V    A++ ++K  + +NI VDISF +       G++A   +   
Sbjct: 1151 EKQIRNDLDGFILSMVCIPSAKIPVIKLVTTENISVDISFRESPTSIHTGIAARDLIADC 1210

Query: 180  VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
            V  ++G   L+  +++L   W   E  +   + G +S+Y L +M++
Sbjct: 1211 VKDVVG---LYPLAIVL--KWFLRERGLNNTYTGGLSSYCLVLMLI 1251


>gi|426246851|ref|XP_004017201.1| PREDICTED: LOW QUALITY PROTEIN: DNA polymerase sigma [Ovis aries]
          Length = 602

 Score = 46.2 bits (108), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 80/180 (44%), Gaps = 32/180 (17%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEEDLAR 122
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G       Q +E+ L +
Sbjct: 11  RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALRK 70

Query: 123 FVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKV 180
                  N  +    ++ D   VP    I+K + Q   + VDISFN   G+ A  F+   
Sbjct: 71  ------HNVAEPGSIKVLDKATVP----IIKLTDQETEVKVDISFNMETGVRAAEFI--- 117

Query: 181 DRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
                K+++ K S    +IL+      +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 118 -----KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|351712688|gb|EHB15607.1| PAP-associated domain-containing protein 5, partial [Heterocephalus
           glaber]
          Length = 599

 Score = 46.2 bits (108), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 102 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 161

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 162 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 211

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 212 DISFNVQNGVRA 223


>gi|301776096|ref|XP_002923468.1| PREDICTED: DNA polymerase sigma-like [Ailuropoda melanoleuca]
          Length = 656

 Score = 46.2 bits (108), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 12/190 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  DIDL
Sbjct: 105 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDL 164

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 165 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 220

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
              G+ A    E +   + K  L    ++++K +   +  +     G IS+Y+L +M + 
Sbjct: 221 METGVRA---AELIKNYMKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS 276

Query: 227 RFLDYYNTFD 236
            FL  +   D
Sbjct: 277 -FLQLHPRID 285


>gi|338718783|ref|XP_001501238.3| PREDICTED: DNA polymerase sigma [Equus caballus]
          Length = 574

 Score = 46.2 bits (108), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 83/190 (43%), Gaps = 12/190 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P       R+EV+  I+ ++   +   +V  FGS     YLP  DIDL
Sbjct: 23  HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 82

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
            V G    E    + + + L   +      IK +    A V I+K + Q   + VDISFN
Sbjct: 83  VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 138

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
              G+ A  F++     + K  L    ++++K +   +  +     G IS+Y+L +M + 
Sbjct: 139 METGVRAAEFIKN---YMKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAI- 193

Query: 227 RFLDYYNTFD 236
            FL  +   D
Sbjct: 194 SFLQLHPRID 203


>gi|281306711|ref|NP_941002.2| DNA polymerase sigma isoform 1 [Mus musculus]
 gi|148705067|gb|EDL37014.1| polymerase (DNA directed) sigma [Mus musculus]
          Length = 542

 Score = 46.2 bits (108), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|162317662|gb|AAI56330.1| PAP associated domain containing 5 [synthetic construct]
 gi|162318878|gb|AAI57080.1| PAP associated domain containing 5 [synthetic construct]
          Length = 442

 Score = 46.2 bits (108), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 57/123 (46%), Gaps = 19/123 (15%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH----- 113
           + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL V G      
Sbjct: 1   MSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP 60

Query: 114 -QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAG 170
              +EE L +         D+DSV ++ D   VP    I+K   S   + VDISFN   G
Sbjct: 61  LWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNG 110

Query: 171 LSA 173
           + A
Sbjct: 111 VRA 113


>gi|384246771|gb|EIE20260.1| Nucleotidyltransferase [Coccomyxa subellipsoidea C-169]
          Length = 454

 Score = 46.2 bits (108), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 59/260 (22%), Positives = 109/260 (41%), Gaps = 54/260 (20%)

Query: 40  SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
           S++ E     ++ I + +  + P       R   +  +Q      +  +V+PFGS     
Sbjct: 186 SVNPECTARLQQEIVDFVANVAPTWEESNLRDAALGRVQGACAMMHLYDVYPFGSKASGL 245

Query: 100 YLPDGDIDLTVLGHQNVEED---------------LARFVCKILENEDQDSVFEIKDVQY 144
            L + DID+ VLG     +D               L + V ++  +      F I+    
Sbjct: 246 ELWNSDIDVVVLGIVEPSKDNLGYTTEEKVPVNDVLGKIVQQLRRSNSVRKTFHIRQ--- 302

Query: 145 VPAQVKIVKC-SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYY 203
             ++V I+KC +V+ + VD+S N   G+ A  FL  +D+   +  L +  ++LIKA    
Sbjct: 303 --SRVPIIKCTTVEGVDVDVSVNGDRGICAAQFL--IDQQARRPAL-RPLILLIKAVL-- 355

Query: 204 ESRILG---AHYGLISTYALEMMVLHR--------------------FLDYYNT-FDWDN 239
             ++LG      G + +++L  MV+                      FL  Y T F++D 
Sbjct: 356 --KVLGLGDVSQGGLGSFSLANMVIAHLQEEEKVGRGQENLGVSLLAFLLRYGTYFNYDQ 413

Query: 240 YCISI--NGPVAISSLPEIV 257
           + ++I   G V+ +++P  V
Sbjct: 414 HVVAIGRGGIVSRTAVPGAV 433


>gi|432104646|gb|ELK31258.1| DNA polymerase sigma [Myotis davidii]
          Length = 513

 Score = 46.2 bits (108), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 81/172 (47%), Gaps = 16/172 (9%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G         R   ++L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG------KWERPPLQLL 64

Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
           E    + +V E   ++ +  A V I+K + Q   + VDISFN   G+ A  F++     +
Sbjct: 65  EQALRKHNVAEPGSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YM 121

Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
            K  L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 122 KKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAI-SFLQLHPRID 171


>gi|281306714|ref|NP_001162602.1| DNA polymerase sigma isoform 2 [Mus musculus]
 gi|74220251|dbj|BAE31304.1| unnamed protein product [Mus musculus]
          Length = 541

 Score = 46.2 bits (108), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|417402533|gb|JAA48112.1| Putative dna polymerase sigma [Desmodus rotundus]
          Length = 542

 Score = 46.2 bits (108), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|344255499|gb|EGW11603.1| DNA polymerase sigma [Cricetulus griseus]
          Length = 534

 Score = 46.2 bits (108), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 2   RREVVTRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 59

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 60  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 114

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 115 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 162


>gi|426199822|gb|EKV49746.1| hypothetical protein AGABI2DRAFT_63272 [Agaricus bisporus var.
           bisporus H97]
          Length = 481

 Score = 46.2 bits (108), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 44/185 (23%), Positives = 77/185 (41%), Gaps = 12/185 (6%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYLPDGDI 106
           +  + ++     + P  + ++ R+  +  I R I   + G +VFPFGS   K YLP GDI
Sbjct: 142 MLHKEVEAFTKWMAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDI 201

Query: 107 DLTV----LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIP 160
           DL +    + + N    L      +       +V  I       A+V IVK         
Sbjct: 202 DLVIVSDSMAYSNKSSVLHSLASVLRRAGIASNVTVI-----AKAKVPIVKFVTIHGRFN 256

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           VDIS NQ  G+     ++   + +    L  RS++LI      +  +     G + +Y++
Sbjct: 257 VDISINQTNGIVGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSI 316

Query: 221 EMMVL 225
             + +
Sbjct: 317 VCLAI 321


>gi|60392923|sp|Q6PB75.1|PAPD7_MOUSE RecName: Full=DNA polymerase sigma; AltName: Full=PAP-associated
           domain-containing protein 7
 gi|37590586|gb|AAH59846.1| Polymerase (DNA directed) sigma [Mus musculus]
          Length = 542

 Score = 46.2 bits (108), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|255550972|ref|XP_002516534.1| Poly(A) polymerase beta, putative [Ricinus communis]
 gi|223544354|gb|EEF45875.1| Poly(A) polymerase beta, putative [Ricinus communis]
          Length = 754

 Score = 46.2 bits (108), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 56/251 (22%), Positives = 102/251 (40%), Gaps = 53/251 (21%)

Query: 35  NLYPFSIDAELWLLAEERI--QEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVF 90
           +L  F +D+ L+   EE +  +E+LC I   IV +  K+      Y  ++++    + +F
Sbjct: 48  DLVKFLVDSGLYESKEENLKREEVLCRID-QIVKDWIKQLTRQRGYSDQMVDEANAV-IF 105

Query: 91  PFGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-A 147
            FGS  L  + P  DID   +G  + N EED    +  IL   D     EI ++Q VP A
Sbjct: 106 TFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFILLHDILAEMD-----EITELQPVPDA 160

Query: 148 QVKIVKCSVQNIPVDISFNQMA-------------------------GLSALCFLEKVDR 182
            V ++K   Q I +D+ +  ++                          L+     +++ +
Sbjct: 161 HVPVMKFKFQGISIDLLYASISLPVVPEDLDISHGSVLYDVDEQTVRSLNGCRVADQILK 220

Query: 183 LIGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTY-------ALEMMVLHRF 228
           L+     F+ ++  +K W             + G ++ L+          A+  M++ RF
Sbjct: 221 LVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARLCQLYPNAIPSMLVSRF 280

Query: 229 LDYYNTFDWDN 239
              Y  + W N
Sbjct: 281 FRVYTQWRWPN 291


>gi|51328369|gb|AAH80314.1| Papd5 protein [Mus musculus]
          Length = 583

 Score = 46.2 bits (108), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 86  EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 145

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 146 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 195

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 196 DISFNVQNGVRA 207


>gi|395839409|ref|XP_003792582.1| PREDICTED: PAP-associated domain-containing protein 5 [Otolemur
           garnettii]
          Length = 629

 Score = 46.2 bits (108), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 132 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 191

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 192 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 241

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 242 DISFNVQNGVRA 253


>gi|326927263|ref|XP_003209812.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Meleagris gallopavo]
          Length = 571

 Score = 46.2 bits (108), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 90/199 (45%), Gaps = 26/199 (13%)

Query: 37  YPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSV 95
           Y F +    ++L EE I +    + P    E+ R EV+N I+ +I   +   +V  FGS 
Sbjct: 60  YVFPVWVPEFVLHEE-IIDFYKYMSPRHEEERMRMEVVNRIENVIKELWPNADVQIFGSF 118

Query: 96  PLKTYLPDGDIDLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQV 149
               YLP  DIDL V G         +EE L +         D++SV ++ D   VP   
Sbjct: 119 KTGLYLPTSDIDLVVFGKWETLPLWTLEEALRKHNVA-----DENSV-KVLDKATVP--- 169

Query: 150 KIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRI 207
            I+K   S   + VDISFN   G+ A   ++     I K  +    V+++K +     R 
Sbjct: 170 -IIKLTDSFTEVKVDISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRD 223

Query: 208 LGAHY-GLISTYALEMMVL 225
           L   + G I +Y+L +M +
Sbjct: 224 LNEVFTGGIGSYSLFLMAV 242


>gi|403282221|ref|XP_003932554.1| PREDICTED: DNA polymerase sigma [Saimiri boliviensis boliviensis]
          Length = 542

 Score = 46.2 bits (108), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|284507293|ref|NP_001165276.1| DNA polymerase sigma isoform 2 [Homo sapiens]
 gi|219518042|gb|AAI43883.1| POLS protein [Homo sapiens]
          Length = 541

 Score = 46.2 bits (108), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|5902142|ref|NP_008930.1| DNA polymerase sigma isoform 1 [Homo sapiens]
 gi|397472993|ref|XP_003808011.1| PREDICTED: DNA polymerase sigma [Pan paniscus]
 gi|60392922|sp|Q5XG87.2|PAPD7_HUMAN RecName: Full=DNA polymerase sigma; AltName: Full=DNA polymerase
           kappa; AltName: Full=LAK-1; AltName: Full=PAP-associated
           domain-containing protein 7; AltName: Full=Terminal
           uridylyltransferase 5; Short=TUTase 5; AltName:
           Full=Topoisomerase-related function protein 4-1;
           Short=TRF4-1
 gi|5139669|dbj|BAA24434.2| LAK-1 [Homo sapiens]
 gi|47777665|gb|AAT38110.1| polymerase (DNA directed) sigma [Homo sapiens]
 gi|109658946|gb|AAI17138.1| Polymerase (DNA directed) sigma [Homo sapiens]
 gi|116496943|gb|AAI26107.1| Polymerase (DNA directed) sigma [Homo sapiens]
 gi|119628506|gb|EAX08101.1| polymerase (DNA directed) sigma [Homo sapiens]
 gi|127799472|gb|AAH84567.2| Polymerase (DNA directed) sigma [Homo sapiens]
 gi|158260737|dbj|BAF82546.1| unnamed protein product [Homo sapiens]
 gi|313883146|gb|ADR83059.1| PAP associated domain containing 7 [synthetic construct]
 gi|410210372|gb|JAA02405.1| PAP associated domain containing 7 [Pan troglodytes]
 gi|410259248|gb|JAA17590.1| PAP associated domain containing 7 [Pan troglodytes]
 gi|410303968|gb|JAA30584.1| PAP associated domain containing 7 [Pan troglodytes]
 gi|410337733|gb|JAA37813.1| PAP associated domain containing 7 [Pan troglodytes]
          Length = 542

 Score = 46.2 bits (108), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|386782099|ref|NP_001248228.1| DNA polymerase sigma [Macaca mulatta]
 gi|355691204|gb|EHH26389.1| DNA polymerase sigma [Macaca mulatta]
 gi|355749808|gb|EHH54146.1| DNA polymerase sigma [Macaca fascicularis]
 gi|380809116|gb|AFE76433.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
 gi|383415413|gb|AFH30920.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
 gi|384945014|gb|AFI36112.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
          Length = 542

 Score = 46.2 bits (108), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|326917206|ref|XP_003204892.1| PREDICTED: DNA polymerase sigma-like [Meleagris gallopavo]
          Length = 429

 Score = 46.2 bits (108), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 81/171 (47%), Gaps = 14/171 (8%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ +I   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +    + IK +    A V I+K + Q   + VDISFN   G+ A  F+++    + K
Sbjct: 69  RKHNVAEPYSIKVLD--KATVPIIKLTDQETEVKVDISFNVETGVKAARFIKE---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +     R L   + G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFLL--QRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|387273371|gb|AFJ70180.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
          Length = 542

 Score = 45.8 bits (107), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|410949807|ref|XP_003981609.1| PREDICTED: DNA polymerase sigma [Felis catus]
          Length = 542

 Score = 45.8 bits (107), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 77/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A    E +   + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRA---AELIKNYMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|118794510|ref|XP_321547.3| AGAP001564-PA [Anopheles gambiae str. PEST]
 gi|19572384|emb|CAD27930.1| putative DNA topoisomerase [Anopheles gambiae]
 gi|116116329|gb|EAA01236.4| AGAP001564-PA [Anopheles gambiae str. PEST]
          Length = 1039

 Score = 45.8 bits (107), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 70/271 (25%), Positives = 115/271 (42%), Gaps = 37/271 (13%)

Query: 70  RKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R  V++ I++++ N +    V  FGS     YLP  DIDL V+G   +         + L
Sbjct: 292 RVMVVSRIEKIVQNLWPSARVEMFGSFRTGLYLPTSDIDLVVIGQWTM------LPLRTL 345

Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
           E E     + E   V+ +  A V IVK + +   + VDISFN  +G+ +   ++   R  
Sbjct: 346 EMELISQGIAEPNSVRVLDKASVPIVKLTDRQTQVKVDISFNMESGVQSAKLIKGFKR-- 403

Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISI 244
             ++     ++L+      +  +     G IS+Y+L +M +  FL  ++           
Sbjct: 404 --EYPVLEKLVLVLKQFLLQRDLNEVFTGGISSYSLILMCI-SFLQQHH----------- 449

Query: 245 NGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKH---- 300
             P A S+L  ++ E  E  G +      ++K    + S      E    E    H    
Sbjct: 450 QKPNACSNLGVLLIEFFELYGRKF----NYMKIGISVKSGRYIPKEELQREMIDGHRPSL 505

Query: 301 LNIVDPLKDNNNLGRSVSKGNFHRIRCALSY 331
           L I DPL   N++GRS S G  H ++ A  Y
Sbjct: 506 LCIEDPLTPGNDIGRS-SYGALH-VKQAFEY 534


>gi|409081996|gb|EKM82354.1| hypothetical protein AGABI1DRAFT_52475, partial [Agaricus bisporus
           var. burnettii JB137-S8]
          Length = 559

 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 44/185 (23%), Positives = 77/185 (41%), Gaps = 12/185 (6%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYLPDGDI 106
           +  + ++     + P  + ++ R+  +  I R I   + G +VFPFGS   K YLP GDI
Sbjct: 141 MLHKEVEAFTKWMAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDI 200

Query: 107 DLTV----LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIP 160
           DL +    + + N    L      +       +V  I       A+V IVK         
Sbjct: 201 DLVIVSDSMAYSNKSSVLHSLASVLRRAGIASNVTVI-----AKAKVPIVKFVTIHGRFN 255

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           VDIS NQ  G+     ++   + +    L  RS++LI      +  +     G + +Y++
Sbjct: 256 VDISINQTNGIVGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSI 315

Query: 221 EMMVL 225
             + +
Sbjct: 316 VCLAI 320


>gi|357131279|ref|XP_003567266.1| PREDICTED: poly(A) RNA polymerase protein cid1-like [Brachypodium
           distachyon]
          Length = 595

 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 51/190 (26%), Positives = 80/190 (42%), Gaps = 17/190 (8%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQ---RLINGYYGIEVFPFGSVPLKTYLPDGDI 106
           E+ I+EIL  I+PA V   KR   I  +    + +    G    PFGS     Y   GD+
Sbjct: 9   EKCIKEILSQIKPAEVDRNKRLSAIKELDISIQSVAALKGAAAKPFGSFLSNLYSKSGDL 68

Query: 107 DLTV--LGHQNV------EEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
           DL+V  +   N+      ++ + R + K L+        E      VP  ++ V  S   
Sbjct: 69  DLSVQLMNSSNLPVSKKKKQSILRVLRKALQRNGVAGYMEFIPHARVPV-LQYVSNSF-G 126

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           I  D+S +   G         +  L   D  F   V+LIK W   ++ I     G +++Y
Sbjct: 127 ISCDLSIDNYPGRIKSRIFYWISTL---DERFGDMVLLIKEWAKCQN-INDPKTGTLNSY 182

Query: 219 ALEMMVLHRF 228
           +L ++VL  F
Sbjct: 183 SLCLLVLFHF 192


>gi|345325980|ref|XP_001507597.2| PREDICTED: PAP-associated domain-containing protein 5-like
           [Ornithorhynchus anatinus]
          Length = 578

 Score = 45.8 bits (107), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 81  EEINDFYEYMSPRPEEEKMRMEVVNRIENVIKELWPTADVQIFGSFKTGLYLPTSDIDLV 140

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           V G         +EE L +      +  D+ SV ++ D   VP  +K+   S   + VDI
Sbjct: 141 VFGKWENLPLWTLEEALRKH-----KVADEHSV-KVLDKATVPI-IKLTD-SFTEVKVDI 192

Query: 164 SFNQMAGLSA 173
           SFN   G+ A
Sbjct: 193 SFNVQNGVKA 202


>gi|291411166|ref|XP_002721862.1| PREDICTED: DNA polymerase sigma [Oryctolagus cuniculus]
          Length = 542

 Score = 45.8 bits (107), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKS---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|432908932|ref|XP_004078069.1| PREDICTED: uncharacterized protein LOC101163355 [Oryzias latipes]
          Length = 779

 Score = 45.8 bits (107), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 84/200 (42%), Gaps = 32/200 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I +    + P    E  R++V+N I+ +I   +    V  FGS     YLP  DIDL
Sbjct: 218 HEEILDFFHFMSPKPEEEAMRRDVVNKIEGVIKDLWPTARVEIFGSFSTGLYLPTSDIDL 277

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIP 160
            V G       Q +E+ L +  C +         + +K +    A V I+K + Q   + 
Sbjct: 278 VVFGKWEHPPLQELEQALKK--CNVA------GPYPVKVLDK--ASVPIIKLTDQESKVK 327

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV----ILIKAWCYYESRILGAHYGLIS 216
           VDISFN    + A  F+        K +L K +V    I +      +  +     G IS
Sbjct: 328 VDISFNVETAVKAAQFI--------KSYLKKYAVLPPLIFVLKQFLLQRDLNEVFTGGIS 379

Query: 217 TYALEMMVLHRFLDYYNTFD 236
           +Y+L +M +  FL  +   D
Sbjct: 380 SYSLILMAI-SFLQLHPRID 398


>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 478

 Score = 45.8 bits (107), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 60/139 (43%), Gaps = 12/139 (8%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           +  + ++  +  I P    ++ R  ++  I R +   +    V PFGS   K YLP GDI
Sbjct: 152 MLHKEVEAFVKYISPTQEEDEIRSLIVESISRAVTKAFPDARVLPFGSYETKLYLPLGDI 211

Query: 107 DLTV----LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IP 160
           DL +    + + N    L      +      D V  I       A+V I+K   ++    
Sbjct: 212 DLVIESDSMAYNNKVNVLQALATTMKRAGITDKVTII-----AKAKVPIIKFVTRHGRFS 266

Query: 161 VDISFNQMAGLSALCFLEK 179
           VDIS NQM G+ A   +++
Sbjct: 267 VDISLNQMNGVKAGTMIKR 285


>gi|395735628|ref|XP_002815457.2| PREDICTED: DNA polymerase sigma [Pongo abelii]
          Length = 542

 Score = 45.8 bits (107), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 69  RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|354474676|ref|XP_003499556.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Cricetulus griseus]
          Length = 464

 Score = 45.8 bits (107), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV++ I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 14  EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 73

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+DSV ++ D   VP    I+K   S   + V
Sbjct: 74  VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 123

Query: 162 DISFNQMAGLSA 173
           DISFN   G+ A
Sbjct: 124 DISFNVQNGVRA 135


>gi|61098442|ref|NP_001012968.1| DNA polymerase sigma [Gallus gallus]
 gi|53130894|emb|CAG31774.1| hypothetical protein RCJMB04_10o8 [Gallus gallus]
          Length = 429

 Score = 45.8 bits (107), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 77/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ +I   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +    + IK +    A V I+K + Q   + VDISFN   G+ A  F+++      K
Sbjct: 69  RKHNVAEPYSIKVLD--KATVPIIKLTDQETEVKVDISFNVETGVKAARFIKEY----MK 122

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
            +     +IL+      +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 123 KYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|296475642|tpg|DAA17757.1| TPA: DNA polymerase sigma [Bos taurus]
          Length = 541

 Score = 45.8 bits (107), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 81/172 (47%), Gaps = 16/172 (9%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G         R   ++L
Sbjct: 11  RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWE------RPPLQLL 64

Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
           E    + +V E   ++ +  A V I+K + Q   + VDISFN   G+ A  F++     +
Sbjct: 65  EQALRKHNVAEPGSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YM 121

Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
            K  L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 122 KKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|449493801|ref|XP_002190556.2| PREDICTED: DNA polymerase sigma [Taeniopygia guttata]
          Length = 429

 Score = 45.8 bits (107), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 81/171 (47%), Gaps = 14/171 (8%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ +I   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 11  RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +    + IK +    A V I+K + Q   + VDISFN   G+ A  F+++    + K
Sbjct: 69  RKHNVAEPYSIKVLD--KATVPIIKLTDQETEVKVDISFNVETGVKAARFIKE---YMKK 123

Query: 187 DHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +     R L   + G IS+Y+L +M +  FL  +   D
Sbjct: 124 YSLLPYLILVLKQFLL--QRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171


>gi|344230457|gb|EGV62342.1| hypothetical protein CANTEDRAFT_126141 [Candida tenuis ATCC 10573]
          Length = 615

 Score = 45.8 bits (107), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 76/305 (24%), Positives = 126/305 (41%), Gaps = 42/305 (13%)

Query: 32  SPPNLYPF------SIDAEL--WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING 83
           +P +LYP+      S   E+  WL  E  I++ +  I P+      R  V+  +++ I  
Sbjct: 157 TPTSLYPWIKAHDHSKQKEIADWLTME--IKDFVSYISPSKEEIMARNSVVKTLKQQIKV 214

Query: 84  YY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIK 140
            +   E   FGS     YLP  DID+ V+      E+  +   +   L ++        K
Sbjct: 215 CWPDAEAHVFGSFATDLYLPGSDIDMVVVSKNGDCENRHKLYQLSSFLRSKKL-----AK 269

Query: 141 DVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVIL 196
           D++ +  A+V I+K      NI +DISF +  GL A   + K ++   G      R ++L
Sbjct: 270 DIEVIAGAKVPIIKFVDPKTNIHLDISFERTNGLDAARRIRKWLETTAG-----LRELVL 324

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
           +        ++   H G +  YA  ++  H F+  +     +N   S N    +    E+
Sbjct: 325 VVKQFLRSRKLNNVHVGGLGGYATIILCYH-FIKMHPRVSTENMTASENLGTLLIEFFEL 383

Query: 257 VAETVENDG-----DELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
                  D      D    SP++LKK    Y  P  +L  + + FA   L + DP    N
Sbjct: 384 YGRNFSFDNLVIAFDGKTDSPKYLKK----YKYP--SLIASRNPFA---LVVQDPDDPGN 434

Query: 312 NLGRS 316
           N+ RS
Sbjct: 435 NITRS 439


>gi|195163618|ref|XP_002022646.1| GL14678 [Drosophila persimilis]
 gi|194104669|gb|EDW26712.1| GL14678 [Drosophila persimilis]
          Length = 1197

 Score = 45.8 bits (107), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 83/331 (25%), Positives = 131/331 (39%), Gaps = 61/331 (18%)

Query: 22  SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
           +H +  + Y   P   P     E  +   E I+     + P       R EV+  I+ ++
Sbjct: 285 NHMAMIAKYKGEPWRKPDYPYGEGVVGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIENVV 344

Query: 82  NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD-SVFEI 139
           +  +   V   FGS     +LP  DIDL VLG   + E L     + LE E  +  + E 
Sbjct: 345 HTIWPQAVVEIFGSFRTGLFLPTSDIDLVVLG---LWEKLP---LRTLEYELVNRGIAEA 398

Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
             V+ +  A V I+K + +   + VDISFN  +G+ +   ++K  R   ++GK  L  + 
Sbjct: 399 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 458

Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSL 253
            +L++        +     G IS+Y+L +M                 CIS      +   
Sbjct: 459 FLLLRD-------LNEVFTGGISSYSLILM-----------------CISF-----LQLH 489

Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN----------- 302
           P  +     N G  LL   E   +      + I +++N G   A   L            
Sbjct: 490 PRGIYHDTTNLGVLLLEFFELYGRRFNYMRIGI-SIKNGGRYMAKDELQRDMVDGHRPSL 548

Query: 303 --IVDPLKDNNNLGRSVSKGNFH---RIRCA 328
             I DPL   N++GRS S G FH     +CA
Sbjct: 549 LCIEDPLTPGNDIGRS-SYGVFHVQQAFKCA 578


>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora
           B]
          Length = 547

 Score = 45.8 bits (107), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 55/122 (45%), Gaps = 14/122 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNV-- 116
           I P    ++ R  V+  I+R I   +   +V PFGS   K YLP GDIDL +  H N   
Sbjct: 183 ISPTPQEDEVRSLVVELIRRAITRQFPDAQVLPFGSYETKLYLPLGDIDLVI--HSNTMA 240

Query: 117 ---EEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIPVDISFNQMAGL 171
              +E++ R +   L         +I       A+V IVK         VDIS NQ  G+
Sbjct: 241 YSDKENVLRALANTLRRAGITDNVKI----IAKAKVPIVKFVTIHGRFSVDISINQGNGV 296

Query: 172 SA 173
           +A
Sbjct: 297 AA 298


>gi|359486610|ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera]
 gi|296086183|emb|CBI31624.3| unnamed protein product [Vitis vinifera]
          Length = 453

 Score = 45.8 bits (107), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 45/188 (23%), Positives = 83/188 (44%), Gaps = 24/188 (12%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
           +++IL  I P+      R ++I   +  ++      G  V PFGS     Y   GD+D++
Sbjct: 11  LKDILLVINPSREDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNLYTQWGDLDIS 70

Query: 110 V---------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ-- 157
           +            +  ++ L   V   L ++        + +Q++P A+V I+K      
Sbjct: 71  IELPNGAYISSAGKRHKQTLLGHVLNALRSKG-----GWRKLQFIPNARVPIIKFESYHP 125

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           NI  D+S N + G     FL  +    G D  F+  V+L+K W      I  +  G +++
Sbjct: 126 NISCDVSINNLKGQMKSKFLFWIS---GIDGRFRDLVLLVKEWARAHD-INNSKTGTLNS 181

Query: 218 YALEMMVL 225
           Y+L ++V+
Sbjct: 182 YSLSLLVV 189


>gi|5565687|gb|AAD45198.1|AF089896_1 topoisomerase-related function protein [Homo sapiens]
          Length = 517

 Score = 45.8 bits (107), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + + + L
Sbjct: 51  RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 108

Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
              +      IK +    A V I+K + Q   + VDISFN   G+ A  F++     + K
Sbjct: 109 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 163

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 164 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 211


>gi|448531596|ref|XP_003870285.1| Pap1 poly(A) polymerase [Candida orthopsilosis Co 90-125]
 gi|380354639|emb|CCG24155.1| Pap1 poly(A) polymerase [Candida orthopsilosis]
          Length = 557

 Score = 45.4 bits (106), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 57/224 (25%)

Query: 69  KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
           KR EV+  +Q+L   +                  G ++F FGS  L  Y P  DID L V
Sbjct: 51  KRVEVLTTLQKLTEDFVYEVSLRKNMTQGMAKDAGGKIFTFGSYRLGVYGPSSDIDALVV 110

Query: 111 LGHQNVEEDLARFVCKIL----ENEDQDSVFE----IKDVQYVPAQVKIV--KCSVQNIP 160
           +      ED      KIL    E E+ +SV E    I  +++    + ++  K  +  +P
Sbjct: 111 VPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVPIIKLEFAGISIDLLFAKLDIPRVP 170

Query: 161 VDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
            D++ +            M  L+     +++ RL+ K  +FK ++  IK W   + R + 
Sbjct: 171 HDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKPTVFKNALRFIKMWA--QQRAVY 228

Query: 210 AH-YGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           A+ YG     A  M+V               L +F   Y+ ++W
Sbjct: 229 ANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWNW 272


>gi|406701338|gb|EKD04487.1| hypothetical protein A1Q2_01263 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 624

 Score = 45.4 bits (106), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 78/170 (45%), Gaps = 23/170 (13%)

Query: 70  RKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTV----LGHQNVEEDLARFV 124
           RK +I+ I  +I   +    V PFGS   + YLP GDIDL V    L  +N    L +  
Sbjct: 126 RKTMIDLITHIIRKEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLSEKNKVTMLHQLA 185

Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEK-VD 181
             +  N   ++V  I       A+V I+K   +   I VDIS NQ  G+SA+  +   + 
Sbjct: 186 RMMRGNHITETVAVI-----TRAKVPIIKFVTAEGGINVDISLNQTNGVSAVKIVNHYLK 240

Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY-----ALEMMVLH 226
            L G   L    +++IKA+    S +   + G + +Y     AL  + +H
Sbjct: 241 ALPGAREL----ILVIKAFLSQRS-MNEVYTGGLGSYSVICLALSFLQMH 285


>gi|324975490|gb|ADY62673.1| PAPa [Candida orthopsilosis]
          Length = 372

 Score = 45.4 bits (106), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 57/224 (25%)

Query: 69  KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
           KR EV+  +Q+L   +                  G ++F FGS  L  Y P  DID L V
Sbjct: 46  KRVEVLTTLQKLTEDFVYEVSLRKNMTQGMAKDAGGKIFTFGSYRLGVYGPSSDIDALVV 105

Query: 111 LGHQNVEEDLARFVCKIL----ENEDQDSVFE----IKDVQYVPAQVKIV--KCSVQNIP 160
           +      ED      KIL    E E+ +SV E    I  +++    + ++  K  +  +P
Sbjct: 106 VPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVPIIKLEFAGISIDLLFAKLDIPRVP 165

Query: 161 VDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
            D++ +            M  L+     +++ RL+ K  +FK ++  IK W   + R + 
Sbjct: 166 HDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKPTVFKNALRFIKMWA--QQRAVY 223

Query: 210 AH-YGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           A+ YG     A  M+V               L +F   Y+ ++W
Sbjct: 224 ANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWNW 267


>gi|324975506|gb|ADY62687.1| PAPa [Candida orthopsilosis]
          Length = 552

 Score = 45.4 bits (106), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 57/224 (25%)

Query: 69  KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
           KR EV+  +Q+L   +                  G ++F FGS  L  Y P  DID L V
Sbjct: 46  KRVEVLTTLQKLTEDFVYEVSLRKNMTQGMAKDAGGKIFTFGSYRLGVYGPSSDIDALVV 105

Query: 111 LGHQNVEEDLARFVCKIL----ENEDQDSVFE----IKDVQYVPAQVKIV--KCSVQNIP 160
           +      ED      KIL    E E+ +SV E    I  +++    + ++  K  +  +P
Sbjct: 106 VPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVPIIKLEFAGISIDLLFAKLDIPRVP 165

Query: 161 VDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
            D++ +            M  L+     +++ RL+ K  +FK ++  IK W   + R + 
Sbjct: 166 HDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKPTVFKNALRFIKMWA--QQRAVY 223

Query: 210 AH-YGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           A+ YG     A  M+V               L +F   Y+ ++W
Sbjct: 224 ANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWNW 267


>gi|449017212|dbj|BAM80614.1| hypothetical protein CYME_CMK272C [Cyanidioschyzon merolae strain
            10D]
          Length = 1647

 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 32/127 (25%), Positives = 46/127 (36%), Gaps = 26/127 (20%)

Query: 151  IVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
            +V+C    +      N    L   CFL + D LIG+ HL  R +IL+K W +  S     
Sbjct: 1160 VVRCRTNGLTTQFLLNPAVALCRSCFLVECDELIGRRHLLIRCLILLKVW-WRHSLATAQ 1218

Query: 211  HYGLISTYALEMM------------------------VLHRFLDYYN-TFDWDNYCISIN 245
               L+S  +  ++                        VL     +Y    DW    +SI 
Sbjct: 1219 ARALLSPLSGSLVSPFLALLLLSYLNCRGLPGDEPAHVLQGLFSFYGFDMDWSRCGMSIY 1278

Query: 246  GPVAISS 252
            GP  I S
Sbjct: 1279 GPFDIQS 1285


>gi|401882466|gb|EJT46724.1| hypothetical protein A1Q1_04689 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 631

 Score = 45.4 bits (106), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 78/170 (45%), Gaps = 23/170 (13%)

Query: 70  RKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTV----LGHQNVEEDLARFV 124
           RK +I+ I  +I   +    V PFGS   + YLP GDIDL V    L  +N    L +  
Sbjct: 126 RKTMIDLITHIIRKEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLSEKNKVTMLHQLA 185

Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEK-VD 181
             +  N   ++V  I       A+V I+K   +   I VDIS NQ  G+SA+  +   + 
Sbjct: 186 RMMRGNHITETVAVI-----TRAKVPIIKFVTAEGGINVDISLNQTNGVSAVKIVNHYLK 240

Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY-----ALEMMVLH 226
            L G   L    +++IKA+    S +   + G + +Y     AL  + +H
Sbjct: 241 ALPGAREL----ILVIKAFLSQRS-MNEVYTGGLGSYSVICLALSFLQMH 285


>gi|427795543|gb|JAA63223.1| Putative pap-associated domain-containing protein 5, partial
           [Rhipicephalus pulchellus]
          Length = 627

 Score = 45.4 bits (106), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 47/177 (26%), Positives = 76/177 (42%), Gaps = 9/177 (5%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           + I++    +QP     + R  VI  I+ +I G +   EV  FGS     YLP  DID+ 
Sbjct: 144 DEIEDFYRYMQPTPAEHQMRLGVIQRIKDVILGLWPQAEVEIFGSFRTGLYLPTSDIDVV 203

Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMA 169
           VLG              +     +    ++ D   VP  VK+       + VDISFN   
Sbjct: 204 VLGKWETLPMWTLEKALLSHGIAEPQSIKVLDKASVPI-VKLTDAKT-TVKVDISFNMNN 261

Query: 170 GLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
           G+ + C ++       K     + V+++K   +   R L   + G IS+Y+L +M +
Sbjct: 262 GVKSACLIQSFKE---KFPALPKLVLVLKQ--FLLQRDLNEVFTGGISSYSLILMTV 313


>gi|42565594|ref|NP_190161.2| Nucleotidyltransferase family protein [Arabidopsis thaliana]
 gi|30793987|gb|AAP40443.1| unknown protein [Arabidopsis thaliana]
 gi|110739217|dbj|BAF01523.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644545|gb|AEE78066.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
          Length = 682

 Score = 45.4 bits (106), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 92/210 (43%), Gaps = 27/210 (12%)

Query: 36  LYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI--EVFP-- 91
           L  + ID+ + L  ++ + ++ C+ +P       RKE++  +  +    YG   E  P  
Sbjct: 30  LKRYKIDSYILLDLDKVLNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEESSPVL 89

Query: 92  --FGSVPLKTYLPDGDIDLTV-LGHQNVE-------EDLARFVCKI--LENEDQDSVFEI 139
             +GS  +  Y    D+D+++  G+   E       E L RF  K+  L+ E Q     +
Sbjct: 90  EAYGSFVMDMYSSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQ-----V 144

Query: 140 KDVQYV-PAQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           K+V+ +  A+V IVK S Q   V  D+S     G+     +  + ++ G+   F++  +L
Sbjct: 145 KNVESIFSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIISQIDGR---FQKLCLL 201

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +K W          H  L S     ++ LH
Sbjct: 202 VKHWAKAHEVNSALHRTLNSVSITLLVALH 231


>gi|452988727|gb|EME88482.1| hypothetical protein MYCFIDRAFT_159726 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 658

 Score = 45.4 bits (106), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 14/158 (8%)

Query: 32  SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGY---YGIE 88
           +P N Y F+ +  L L  E  I +    + P    +  R++++  +   I+G+     I 
Sbjct: 196 APANGYDFAANVGLHLHRE--ILDFFDYVSPRDYEKNCREDLVKRVDYFISGFGRGRSIS 253

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL-----ARFVCKILENE-DQDSVFEIKDV 142
           V PFGS     YL D DIDL     Q +         +R     L NE  +  + + +DV
Sbjct: 254 VQPFGSYASGLYLTDADIDLVATSSQYMRNGTKVFCQSRTQMNKLSNELRKTGMAKNRDV 313

Query: 143 -QYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFL 177
                 +V ++K   S  NI VDISF   +GL+ +  L
Sbjct: 314 ITLTRTKVPLIKFVDSRTNIKVDISFENDSGLNTIPTL 351


>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
          Length = 316

 Score = 45.4 bits (106), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 47/188 (25%), Positives = 87/188 (46%), Gaps = 10/188 (5%)

Query: 39  FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPL 97
             ID +L  + E    + +  + P+   ++ R +V   I+ +I G +    VF FGS   
Sbjct: 6   LKIDWQL-SMTETSTHDFVNFVTPSKEDKEIRNKVATSIEEVIKGVFPDCHVFVFGSCAT 64

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ 157
              LP+ DIDL V      E  +   V   +  + +    ++     VP  +KI      
Sbjct: 65  GLNLPNSDIDLIVYQPDVSESRMITKVADAIVRQKKCKTIDVLKNTKVPL-IKITDSEF- 122

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLIS 216
            + VDISFN+  G+  +  ++++ ++  +    K  ++++K  C+ +SR L   Y G + 
Sbjct: 123 GVNVDISFNRTNGVYCVKLVKQLLQMFPE---LKPLMMVLK--CFLKSRQLNEPYSGGVG 177

Query: 217 TYALEMMV 224
           ++ L MMV
Sbjct: 178 SFLLTMMV 185


>gi|195115910|ref|XP_002002499.1| GI12386 [Drosophila mojavensis]
 gi|193913074|gb|EDW11941.1| GI12386 [Drosophila mojavensis]
          Length = 348

 Score = 45.4 bits (106), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 88/186 (47%), Gaps = 25/186 (13%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDI 106
            E I+     + P       R E+++ ++R+I G +    +E+F  GS  L   LP+ DI
Sbjct: 27  HEEIEHFYRYMLPTPTEHAARIELLSRVERVIQGLWPEALVEIF--GSFRLGINLPNSDI 84

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDS------VFEIKDVQYVPAQVKIVKCSVQNIP 160
           DL VLG     E L     + LE+E + S        ++ D   VP  ++   C   ++ 
Sbjct: 85  DLVVLG---CWEHLP---LRSLESELRSSGIVLPGTLQVVDTAAVPI-IRFTDCET-HLK 136

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYA 219
           VDISFN   G+ +    E + + + +  +  + V+++K   + E R L +   G IS+Y 
Sbjct: 137 VDISFNMPNGIDS---SELIKKFLHEHPVLGKLVLVLKQ--FLEQRNLNSTLNGGISSYN 191

Query: 220 LEMMVL 225
           L +M +
Sbjct: 192 LIIMCI 197


>gi|310799736|gb|EFQ34629.1| hypothetical protein GLRG_09773 [Glomerella graminicola M1.001]
          Length = 756

 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 103/245 (42%), Gaps = 48/245 (19%)

Query: 42  DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLK 98
           D  +WL  E  I +    ++P     + R +++  ++R +     Y   +V PFGS    
Sbjct: 423 DMSVWLHKE--IIDFYEVVRPRDFEHEMRTQLVERLRRSLKTSHFYKDCDVRPFGSYMSG 480

Query: 99  TYLPDGDIDLTVLGHQNVEEDLARFV-CKILENED----QDSVFEIKDVQYVP-AQVKIV 152
            YLP  D+DL V     ++   + F   K L N      Q+ V     ++++  A+V +V
Sbjct: 481 LYLPTADMDLVVCARSWLDGAHSNFFGMKALRNFGKFLAQNKVTHYNTMEFIASAKVPLV 540

Query: 153 KC--SVQNIPVDISFNQMAGLSAL---------------------CFL------EKVDRL 183
           K   ++  + VDISF+++ G  A+                      FL      E V+  
Sbjct: 541 KYIDNITGLRVDISFDRLDGPQAVKTFAEWKEQYPAMPILVTMIKHFLAMRGLNEPVNGG 600

Query: 184 IGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYY-NTFDWDNYCI 242
           IG   +    V +++     +SR       LI  + L  M++  FLD Y N FD+ N  I
Sbjct: 601 IGSFTVTCMVVSMLQLMPQVQSR------NLIPEHHLGEMMME-FLDLYGNRFDYVNTAI 653

Query: 243 SINGP 247
            +N P
Sbjct: 654 RMNPP 658


>gi|7019641|emb|CAB75788.1| putative protein [Arabidopsis thaliana]
          Length = 690

 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 92/210 (43%), Gaps = 27/210 (12%)

Query: 36  LYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI--EVFP-- 91
           L  + ID+ + L  ++ + ++ C+ +P       RKE++  +  +    YG   E  P  
Sbjct: 30  LKRYKIDSYILLDLDKVLNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEESSPVL 89

Query: 92  --FGSVPLKTYLPDGDIDLTV-LGHQNVE-------EDLARFVCKI--LENEDQDSVFEI 139
             +GS  +  Y    D+D+++  G+   E       E L RF  K+  L+ E Q     +
Sbjct: 90  EAYGSFVMDMYSSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQ-----V 144

Query: 140 KDVQYV-PAQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           K+V+ +  A+V IVK S Q   V  D+S     G+     +  + ++ G+   F++  +L
Sbjct: 145 KNVESIFSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIISQIDGR---FQKLCLL 201

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +K W          H  L S     ++ LH
Sbjct: 202 VKHWAKAHEVNSALHRTLNSVSITLLVALH 231


>gi|297823863|ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297325653|gb|EFH56073.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 500

 Score = 45.4 bits (106), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 49/191 (25%), Positives = 81/191 (42%), Gaps = 30/191 (15%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
           +QEIL  I+P       R  VI+ ++ ++       G  V PFGS     +   GD+DL+
Sbjct: 11  LQEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNLFTRWGDLDLS 70

Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVFEIK------------DVQYV-PAQVKIVKC-- 154
           V        DL      +   + Q                    +Q+V  A+V I+K   
Sbjct: 71  V--------DLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKVVS 122

Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
             Q I  DIS + + GL    FL  +  + G+   F+  V+L+K W    + I  +  G 
Sbjct: 123 GHQRIACDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKNGT 178

Query: 215 ISTYALEMMVL 225
            ++Y+L ++V+
Sbjct: 179 FNSYSLSLLVI 189


>gi|449282422|gb|EMC89255.1| PAP-associated domain-containing protein 5, partial [Columba livia]
          Length = 501

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 83/185 (44%), Gaps = 25/185 (13%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    E+ R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 3   EEIIDFYKYMSPRPEEERMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 62

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D++SV ++ D   VP    I+K   S   + V
Sbjct: 63  VFGKWETLPLWTLEEALRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKV 112

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
           DISFN   G+ A   ++     I K  +    V+++K +     R L   + G I +Y+L
Sbjct: 113 DISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 167

Query: 221 EMMVL 225
            +M +
Sbjct: 168 FLMAV 172


>gi|194769888|ref|XP_001967033.1| GF21834 [Drosophila ananassae]
 gi|190622828|gb|EDV38352.1| GF21834 [Drosophila ananassae]
          Length = 1092

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 82/331 (24%), Positives = 131/331 (39%), Gaps = 61/331 (18%)

Query: 22  SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
           +H    + Y   P   P     E  +   E I+     + P       R EV+  I+ ++
Sbjct: 327 NHREMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 386

Query: 82  NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
           +  +   V   FGS     +LP  DIDL VLG   + E L     + LE E     + E 
Sbjct: 387 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLG---LWEKLP---LRTLEFELVSRGIAEA 440

Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
             V+ +  A V I+K + +  ++ VDISFN  +G+ +   ++K  R   ++GK  L  + 
Sbjct: 441 CTVRVLDKASVPIIKLTDRETHVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 500

Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSL 253
            +L++        +     G IS+Y+L +M                 CIS      +   
Sbjct: 501 FLLLRD-------LNEVFTGGISSYSLILM-----------------CISF-----LQLH 531

Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN----------- 302
           P  + +   N G  LL   E   +      + I+ ++N G     + L            
Sbjct: 532 PRGIYQDTANLGVLLLEFFELYGRRFNYMKIGIS-IKNGGRYMPKEDLQRDMVDGHRPSL 590

Query: 303 --IVDPLKDNNNLGRSVSKGNFH---RIRCA 328
             I DPL   N++GRS S G FH     +CA
Sbjct: 591 LCIEDPLTPGNDIGRS-SYGVFHVQQAFKCA 620


>gi|194891045|ref|XP_001977427.1| GG18260 [Drosophila erecta]
 gi|190649076|gb|EDV46354.1| GG18260 [Drosophila erecta]
          Length = 1008

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 88/197 (44%), Gaps = 24/197 (12%)

Query: 37  YPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSV 95
           YP+  D  + L   E I+     + P       R EV+  I+ +++  +   V   FGS 
Sbjct: 250 YPYG-DGVIGL--HEEIEHFYQYVLPTACEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSF 306

Query: 96  PLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVK 153
               +LP  DIDL VLG         +   + LE E     + E   V+ +  A V I+K
Sbjct: 307 RTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEACTVRVLDKASVPIIK 360

Query: 154 CSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRIL 208
            + +   + VDISFN  +G+ +   ++K  R   ++GK  L  +  +L++        + 
Sbjct: 361 LTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQFLLLRD-------LN 413

Query: 209 GAHYGLISTYALEMMVL 225
               G IS+Y+L +M +
Sbjct: 414 EVFTGGISSYSLILMCI 430


>gi|312085976|ref|XP_003144894.1| hypothetical protein LOAG_09318 [Loa loa]
          Length = 554

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)

Query: 139 IKDVQYVPAQVKIVKCSV----QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV 194
            K  QY+PA+V I+K  +    + + +DI+ N +AG+     L    R+   D  F    
Sbjct: 322 FKTCQYIPAKVPIIKLEMAAPYEELEIDINCNNVAGIYNSHLLHYYSRV---DDRFPALC 378

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +L+K W    + I  A  G +++Y+L +MVLH
Sbjct: 379 LLVKHWA-INAGINNAMMGTLNSYSLILMVLH 409


>gi|119482616|ref|XP_001261336.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
           NRRL 181]
 gi|119409491|gb|EAW19439.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
           NRRL 181]
          Length = 701

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 59/246 (23%), Positives = 101/246 (41%), Gaps = 43/246 (17%)

Query: 42  DAELWLLAEE-------RIQ-EILCT---IQPAIVSEKKRKEVINYIQRLING-YYGIEV 89
           DA  W+L  E       R+  EIL     ++P    +  R ++I  +Q      YYG+++
Sbjct: 242 DATPWMLLMEPSLHLGSRLHNEILSFYHWVKPMQYEQIVRADLITRLQVAFQSRYYGVQL 301

Query: 90  FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF---------VCKILENEDQDSVFEIK 140
            PFGS     YLP  DIDL +L    +   +  F             L+N +      I+
Sbjct: 302 RPFGSFASGLYLPTADIDLVLLSSNFMRNGIKTFGERKGQIYAFAAFLKNLEIAVPNSIE 361

Query: 141 DVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-----------------LCFLEKVD 181
            + +  A+V I+K    +  + VD+SF+  +GL A                 +  +++  
Sbjct: 362 TIAH--ARVPILKFVDKMTGLRVDLSFDNDSGLIANNTFQNWKSEYPAMPVIVAVVKQFL 419

Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYC 241
            L G + +    +      C   S +    +G  +   L  +++  F  Y NTFD++N  
Sbjct: 420 LLRGLNEVPTGGLGGFSITCLVTSLLQHLPHGHAAP-NLGSILMDFFEFYGNTFDFENVG 478

Query: 242 ISINGP 247
           I +N P
Sbjct: 479 IRLNPP 484


>gi|327278603|ref|XP_003224050.1| PREDICTED: PAP-associated domain-containing protein 5-like [Anolis
           carolinensis]
          Length = 665

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 51/184 (27%), Positives = 81/184 (44%), Gaps = 23/184 (12%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    ++ R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 171 EEINDFYKYMSPRPEEQRMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 230

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
           V G         +EE L +         D+ SV ++ D   VP    I+K   S   + V
Sbjct: 231 VFGKWETLPLWTLEEALRKHNV-----ADKGSV-KVLDKATVP----IIKLTDSFTEVKV 280

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           DISFN   G+ A   ++     I K  +    V+++K +   +  +     G I +Y+L 
Sbjct: 281 DISFNVQNGVKAADLIKD---FIKKYPVLPYLVLVLKQFL-LQRDLNEVFTGGIGSYSLF 336

Query: 222 MMVL 225
           +M +
Sbjct: 337 LMAV 340


>gi|239835858|gb|ACS29269.1| pap1 [Meyerozyma guilliermondii]
          Length = 564

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 43/226 (19%), Positives = 90/226 (39%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + +KR E++  +Q+++  +                  G ++F FGS  L  Y P  DID 
Sbjct: 43  ATRKRVEILALLQKMVQEFVYTVSKKKNMSDGMAKDAGGKIFTFGSYRLGVYGPGSDIDT 102

Query: 108 LTVLGHQNVEEDLARFVCKILENEDQ--------DSVFEIKDVQYVPAQVKIV--KCSVQ 157
           L V+    V ED      +++    +        D+   I  +++    + ++  + +V 
Sbjct: 103 LIVVPKHVVREDFFTIFDQMIRQRPELEEITAVPDAFVPIIMIEFSGISIDLIFARLNVS 162

Query: 158 NIPVDISF-----------NQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            +P+D++            N M  L+     +++ +L+ K  +FK ++  IK W   +  
Sbjct: 163 RVPLDMTLEDNNLLKNIDENDMRALNGTRVTDQILQLVPKVTVFKHALRCIKLWA-QQRA 221

Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           + G  +G     A  M+V               + +F + Y  ++W
Sbjct: 222 VYGNMFGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFNIYTKWNW 267


>gi|345563688|gb|EGX46674.1| hypothetical protein AOL_s00097g578 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1706

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 15/146 (10%)

Query: 40   SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLK 98
            S+   +WL  E  I +    I+P       R +++  ++ LI   +   ++  FGS   +
Sbjct: 1266 SLYTSVWLHKE--IVDFHDYIKPQQYEHIVRHDLVRRLRELIQSRFDDADIQAFGSFAAE 1323

Query: 99   TYLPDGDIDLTVLGHQNVEEDLARF--------VCKILENEDQDSVFEIKDVQYVPAQVK 150
             YLP  D+D+ +L  Q +E    ++        + +IL N D      +  V  + A+V 
Sbjct: 1324 IYLPTSDMDIVLLSRQYLETSRPKYDSVNEIRKLARILRNSDLPQSGTL--VPILSAKVP 1381

Query: 151  IVKCS--VQNIPVDISFNQMAGLSAL 174
            I+K    +  + VDISF   +GL A+
Sbjct: 1382 IIKYKDRLTGLSVDISFENPSGLIAI 1407


>gi|312077329|ref|XP_003141256.1| PAP/25A associated domain-containing protein [Loa loa]
 gi|307763579|gb|EFO22813.1| PAP/25A associated domain-containing protein [Loa loa]
          Length = 419

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 63/295 (21%), Positives = 132/295 (44%), Gaps = 20/295 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E + ++   ++P+ + +  R  V   ++ ++   +   ++  FGS+    +LP  DID+
Sbjct: 66  HEELLDLYAWLKPSPLEKALRLRVFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDIDV 125

Query: 109 TVLGHQNVEE-DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
            V      EE  L +    + E+   +S+  + D  +VP  VK+V    + I +DISFN 
Sbjct: 126 VVESDLVSEEPPLWKTAVALKESGITESI-NVLDKAFVPI-VKMVDKDTK-IYLDISFNT 182

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
           + G+ +  F+E +     +  + +  V+++K +   + ++     G +S+Y L +M++  
Sbjct: 183 VQGVRSARFIEDMKM---RYPVLEPLVLVLKQFL-MQRQLNQVFTGGLSSYGLILMLIS- 237

Query: 228 FLDYYNTFDWDNYCIS-INGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPI 286
           FL  + ++D+    I+ +N  V + S  ++  +        L +       C++   V +
Sbjct: 238 FLQLHPSYDYSYKGITEVNMGVLLLSFLQLYGQEFNYMKTALRIHSGGAYVCKDEILVQM 297

Query: 287 TALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
               N+        L I DPL+  N++GR     N   +R A  +    L  +  
Sbjct: 298 NRPSNS-------MLCIEDPLQPGNDIGR--CSHNIQLVRQAFEHAFATLCAVFV 343


>gi|344231415|gb|EGV63297.1| Poly(A) polymerase PAPalpha [Candida tenuis ATCC 10573]
          Length = 560

 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 48/228 (21%), Positives = 89/228 (39%), Gaps = 59/228 (25%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
           + +KR E++N  Q+L+  +                  G ++F FGS  L  Y P  DID 
Sbjct: 43  ATRKRVEILNLFQKLVQEFVHKVSKSKNMSEGMAKDAGGKIFTFGSYRLGVYGPGSDIDT 102

Query: 109 TVL--GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKI--------VKCSVQN 158
            V+   H   E+  + F   + E  + + +  + D  +VP  +KI        + C+  N
Sbjct: 103 LVVVPKHVTREDFFSVFESIVRERPELEEITSVPDA-FVPI-IKIEFDGISIDLICATLN 160

Query: 159 IPV--------------DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           +P               ++    M  L+     +++ +L+ K  +FK ++  IK W    
Sbjct: 161 VPQVPLNMTLDDKNLLRNLDERDMRALNGTRVTDEILQLVPKPTVFKHALRCIKMWAQAR 220

Query: 205 SRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           + + G  +G     A  MMV               + +F + Y  ++W
Sbjct: 221 A-VYGNVFGFPGGVAWAMMVARICQLYPNAVGAVIVDKFFNIYTKWNW 267


>gi|156096867|ref|XP_001614467.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803341|gb|EDL44740.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 567

 Score = 44.7 bits (104), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 56/225 (24%), Positives = 100/225 (44%), Gaps = 32/225 (14%)

Query: 21  PSHTSSSSS--YSSPPNLYPFS-IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYI 77
           PS T  S +    +PP+ YP S +DAEL         ++   ++P+       K  + ++
Sbjct: 212 PSDTPQSDTPPSDTPPSEYPPSALDAEL--------SKLEVALRPSQNDVNSMKTFLAFL 263

Query: 78  QRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT----VLGHQNVEEDLARFVCKILENED 132
           Q+ IN +Y    V PFGS+    +  + DID+     +L  +  +    + +C IL N  
Sbjct: 264 QKEINKHYKNCHVTPFGSIINGFWTRNSDIDICIQIPILLSRKDQITFLKKICLIL-NSF 322

Query: 133 QDSVFEIKDVQYVPAQVKIVKCSVQN------IPVDISFNQMAGLSALCFLEKVDRLIGK 186
            D + E    Q   A+V I+    ++      +  DIS N +    A+   + + + +  
Sbjct: 323 NDGIIE----QRFSAKVPIIHFYCKSLRHSFELSCDISVNNIL---AVVNSKLIQKYVSI 375

Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
           D   +   I +K W      I     G +S+++L +M++H FL Y
Sbjct: 376 DKRLQLMGIALKYWS-KNRNINDRSKGFLSSFSLILMIIH-FLQY 418


>gi|449448062|ref|XP_004141785.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) polymerase-like [Cucumis
           sativus]
          Length = 772

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 17/153 (11%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVFP 91
           L  F +DA L+   EE  + +E+L  I   IV +  K+   +  Y  +++     + +F 
Sbjct: 48  LEKFLVDAGLYESKEESAKREEVLSRIG-QIVKDWVKQLTRIKGYTDQMVEDANAV-IFT 105

Query: 92  FGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQ 148
           FGS  L  + P  DID   +G  + N EED    +  ILE  +     E+ ++Q VP A 
Sbjct: 106 FGSYRLGVHGPGADIDTLCVGPSYVNREEDFFYMLHNILEEME-----EVSELQPVPDAH 160

Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVD 181
           V ++K     I +D+ +   A +S L   E +D
Sbjct: 161 VPVMKFKFDGISIDLLY---ASISCLVVPEDLD 190


>gi|403338429|gb|EJY68454.1| hypothetical protein OXYTRI_10932 [Oxytricha trifallax]
          Length = 1545

 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 114/280 (40%), Gaps = 62/280 (22%)

Query: 88   EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDS-----VFEIKDV 142
            EV  +GS   K  LP  DIDL +  +        + V  I+  E Q       + E+K V
Sbjct: 1195 EVKVYGSHATKLCLPWSDIDLVIKTNSTDHYSTPKHVLSIITRELQSDHTTKWIQEVKFV 1254

Query: 143  QYVPAQVKIVKCSV-----------QNI---------PVDISFNQMA-GLSALCFLEKVD 181
            +     V  VKC +           QNI         P  I   Q+    + L  ++ V 
Sbjct: 1255 ENASVPVVKVKCQIDHIMQTSGLASQNISKYQTFLEQPFSIDITQLTDNHNGLECVKLVQ 1314

Query: 182  RLIGKDHLFKRSVILIKAW---CYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWD 238
              + ++ + +  ++++K +   C Y       ++G IS+YAL +M++             
Sbjct: 1315 EFLSENEVIEPLILVLKQYLKVCQYND----PYFGGISSYALFLMIV------------- 1357

Query: 239  NYCISINGPVAIS--SLPEIVAETVENDGDELLLSPEFLKKCREIYS-VPITALENTGH- 294
            +Y  SI  P  IS  +L  I+    +  GD       F  +   IY+ +P    E T H 
Sbjct: 1358 SYLQSIQAPKLISQVNLGHILISFFQFYGD-------FQYQSYGIYTHLPGKISEKTNHY 1410

Query: 295  ---EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSY 331
                F  + + I DPL  +NN+G+S  K  F+ I+ +  +
Sbjct: 1411 AIVNFLTQTVQIDDPLHVHNNVGKSSFK--FYEIKDSFKF 1448


>gi|384249905|gb|EIE23385.1| hypothetical protein COCSUDRAFT_41642 [Coccomyxa subellipsoidea
           C-169]
          Length = 758

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 46/185 (24%), Positives = 83/185 (44%), Gaps = 29/185 (15%)

Query: 62  PAIVSEKKRKEVINYIQRLI----NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLG----- 112
           P      +R++++  +  ++    +G+  + V P+GS     Y P GD+D+++ G     
Sbjct: 39  PGPQDAARRRQILEKMGGIVGLGLDGHTELRVEPYGSFVSGLYAPTGDLDISIEGFCGKE 98

Query: 113 ---------HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
                     ++ +  L R + K LE       + I+ + +  A+V I+K   +   IP 
Sbjct: 99  GRGRDVRDMGKSAKAALLRALSKKLERSRLHRGY-IQRILH--ARVPILKLVWAESGIPC 155

Query: 162 DISFNQMAGLSALCF-LEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           D+S     G S   F  E V  L+  D  F++ + +IK W      +  A  G  +T+AL
Sbjct: 156 DVSV----GSSNSRFKAEVVKALVRLDGRFEQMLRVIKVWSGAHG-LNDASNGTFNTFAL 210

Query: 221 EMMVL 225
            +MVL
Sbjct: 211 SLMVL 215


>gi|115395044|ref|XP_001213471.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193040|gb|EAU34740.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 680

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R+++I+ +Q      YYG+++ PFGS     YLP  DIDL +L       
Sbjct: 268 VKPVDYEQIVREDLISRLQSAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFTRN 327

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L N D      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 328 GIKTFGERKGQIYAFAAFLRNLDIAVPGSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 385

Query: 168 MAGLSA 173
            +GL A
Sbjct: 386 DSGLIA 391


>gi|347830146|emb|CCD45843.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 788

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 65/137 (47%), Gaps = 10/137 (7%)

Query: 45  LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
           LWL  E  I +    ++P    E  R++++  ++     YY   ++ PFGS P   YLP 
Sbjct: 450 LWLHKE--IMDFYYHVKPRKFEEVIRQKLLVDLRNNFRRYYPDADILPFGSFPAGLYLPT 507

Query: 104 GDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
            D+D+ ++  + ++    ++     + ++  N ++D +   +      A+V + K   ++
Sbjct: 508 ADMDVVMVSDEFMDGGWPKYSSKSHIWRVKSNIERDRLNTGRIDVITSAKVPLAKWVDAI 567

Query: 157 QNIPVDISFNQMAGLSA 173
             + VD+SF    GL A
Sbjct: 568 TGLKVDMSFENDTGLIA 584


>gi|392595411|gb|EIW84734.1| hypothetical protein CONPUDRAFT_47123 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 663

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 60/129 (46%), Gaps = 14/129 (10%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
           ++  +  + P  + ++ R  V+  + + +   +   +V PFGS   K YLP GDIDL   
Sbjct: 155 VEAFVDYMSPTSIEDEIRGLVVKLVGKAVTSAFPDAKVLPFGSYGTKLYLPSGDIDLV-- 212

Query: 112 GHQNVEEDLARFVCK-----ILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDIS 164
               +E D  ++V K      L N  + +    K      A+V IVK   ++  + VDIS
Sbjct: 213 ----IESDSMQYVPKNSVLHSLANVLKRAGIADKVTIIAKAKVPIVKFITRHGRLNVDIS 268

Query: 165 FNQMAGLSA 173
            NQ  GL A
Sbjct: 269 INQSNGLVA 277


>gi|296194961|ref|XP_002745186.1| PREDICTED: DNA polymerase sigma [Callithrix jacchus]
          Length = 866

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 80/175 (45%), Gaps = 12/175 (6%)

Query: 65  VSEKKRKEVINYIQRLINGYYGI-EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF 123
           V + + +EV+  I+ ++   +   +V  FGS     YLP  DIDL V G    E    + 
Sbjct: 331 VFKGQLREVVKRIETVVKDLWPFFQVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQL 388

Query: 124 VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVD 181
           + + L   +      IK +    A V I+K + Q   + VDISFN   G+ A  F++   
Sbjct: 389 LEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNY- 445

Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
             + K  L    ++++K +   +  +     G IS+Y+L +M +  FL  +   D
Sbjct: 446 --MKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 496


>gi|242081815|ref|XP_002445676.1| hypothetical protein SORBIDRAFT_07g024040 [Sorghum bicolor]
 gi|241942026|gb|EES15171.1| hypothetical protein SORBIDRAFT_07g024040 [Sorghum bicolor]
          Length = 690

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 43/177 (24%), Positives = 75/177 (42%), Gaps = 21/177 (11%)

Query: 41  IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE-----VFPFGSV 95
           ID  L  + EE +Q+   ++QP  V  + R  +IN   ++  G +G       V PFGS 
Sbjct: 55  IDPSLLPVLEELLQDTYASLQPQPVDYEHRYHMINIFNKIAEGIFGKNNGLPIVEPFGSF 114

Query: 96  PLKTYLPDGDIDLTVLGHQNVEEDLAR----FVCKILEN-----EDQDSVFEIKDVQYVP 146
            +  + P  D+DL++  + +  +   R    +  + L N     + Q     +  +  V 
Sbjct: 115 IMDLFTPKSDLDLSINFNTDTNDQYPRRNKIYAIRKLANVLFSHQRQGLCHGVSPI--VT 172

Query: 147 AQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
           A+V ++K   Q   V  DIS     G+S     + +  +   D  F+    L+K W 
Sbjct: 173 ARVPVLKVIDQKTGVECDISVENKDGMSRSVIFKFISSI---DKRFRILCYLMKFWA 226


>gi|195480074|ref|XP_002101128.1| GE15792 [Drosophila yakuba]
 gi|194188652|gb|EDX02236.1| GE15792 [Drosophila yakuba]
          Length = 1014

 Score = 44.3 bits (103), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 53/212 (25%), Positives = 90/212 (42%), Gaps = 21/212 (9%)

Query: 22  SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
           +H    + Y   P   P     E  +   E I+     + P       R EV+  I+ ++
Sbjct: 233 NHMEMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 292

Query: 82  NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
           +  +   V   FGS     +LP  DIDL VLG         +   + LE E     + E 
Sbjct: 293 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEA 346

Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
             V+ +  A V I+K + +   + VDISFN  +G+ +   ++K  R   ++GK  L  + 
Sbjct: 347 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 406

Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVL 225
            +L++        +     G IS+Y+L +M +
Sbjct: 407 FLLLRD-------LNEVFTGGISSYSLILMCI 431


>gi|170109615|ref|XP_001886014.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164638944|gb|EDR03218.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 397

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 121/298 (40%), Gaps = 43/298 (14%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
           ++  +  I P+ V ++ R  ++  I   +   +    V PFGS   K YLP GDIDL +L
Sbjct: 106 VKAFVHWISPSPVEDEVRGLIVTQISNTVKASFPDARVLPFGSYETKLYLPLGDIDLVIL 165

Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDISFNQMA 169
                  +    V   L N  + S           A+V IVK    +    VDIS NQ  
Sbjct: 166 SDSMAYSNKVN-VLHALANTLKRSGVTSHVTIIAKAKVPIVKFVTTHGRFHVDISLNQSN 224

Query: 170 GLSAL----CFLEKV--DRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM 223
           GL +      FL+ +  +   GK  +  RS++++      +  +   + G + +Y++   
Sbjct: 225 GLLSGKIINGFLKDMHGNGAEGKGSMALRSLVMVTKAFLTQRSMNEVYTGGLGSYSI--- 281

Query: 224 VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEF--LKKCREI 281
                            C++++    +   P+I    ++ + +  +L+ EF  L  C   
Sbjct: 282 ----------------VCLAVS---FLQMHPKIRNGEIDPEKNLGVLAMEFFELYGCYFN 322

Query: 282 YSVPITALENTGHEFAIKHLNIVD-------PLKDNNNLGRSVSKGN--FHRIRCALS 330
           Y     +L + G  F+ +     D        L+D  +    +SKG+  FH++R A +
Sbjct: 323 YDEVGISLRDGGMYFSKRKRGWYDYDRRGILSLEDPADPSNDISKGSYGFHKVRTAFA 380


>gi|315054687|ref|XP_003176718.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
 gi|311338564|gb|EFQ97766.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
          Length = 627

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 32/140 (22%), Positives = 71/140 (50%), Gaps = 16/140 (11%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 272 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 331

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVF----------EIKDVQYVPAQVKIVKC-- 154
           DL +L  Q +  +  +++C+ + +    + +           I+ + +  A+V I+K   
Sbjct: 332 DLVLLSRQFMSSN-RKYICQKVRDIYSFAEYIRGLGIAAPGSIETIAH--ARVPIIKFVD 388

Query: 155 SVQNIPVDISFNQMAGLSAL 174
           S+  + VD+SF+  +GL+A+
Sbjct: 389 SLTGLKVDLSFDNSSGLAAI 408


>gi|393912435|gb|EJD76738.1| hypothetical protein LOAG_16408 [Loa loa]
          Length = 430

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)

Query: 139 IKDVQYVPAQVKIVKCSV----QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV 194
            K  QY+PA+V I+K  +    + + +DI+ N +AG+     L    R+   D  F    
Sbjct: 198 FKTCQYIPAKVPIIKLEMAAPYEELEIDINCNNVAGIYNSHLLHYYSRV---DDRFPALC 254

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +L+K W    + I  A  G +++Y+L +MVLH
Sbjct: 255 LLVKHWA-INAGINNAMMGTLNSYSLILMVLH 285


>gi|365758850|gb|EHN00675.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 642

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 58/246 (23%), Positives = 108/246 (43%), Gaps = 35/246 (14%)

Query: 4   MSAFLEMEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAEL--------WLLAEERIQE 55
           +S FL M D  E  +T  S  S  +++S+    YP+  +           WL +E  I++
Sbjct: 128 VSGFL-MPDQYE--ITKNSTQSHENTHSTSNTEYPWIRNHNHSRQRKIADWLTSE--IKD 182

Query: 56  ILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
            +  I P+    K R   I+ +++ +   +   ++  FGS     YLP  DID  +    
Sbjct: 183 FVHYISPSKSEIKCRNRTIDKLRQAVKKLWSDADLHVFGSFATDLYLPGSDIDCVINSRH 242

Query: 115 NVEED------LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFN 166
           + +ED      LAR+    L+NE      E+     V  +V I+K    +  + +D+SF 
Sbjct: 243 HDKEDRNYIYELARY----LKNEGLAIRMEV----IVRTRVPIIKFIEPLSQLHIDVSFE 294

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +  GL A   +    R   +D    R ++L+     +  R+   H G +  + + + +++
Sbjct: 295 RTNGLEAARLI----REWLRDSPGLRELVLVIKQFLHSRRLNNVHTGGLGGFTV-ICLVY 349

Query: 227 RFLDYY 232
            FL+ +
Sbjct: 350 SFLNMH 355


>gi|154323524|ref|XP_001561076.1| hypothetical protein BC1G_00161 [Botryotinia fuckeliana B05.10]
          Length = 692

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 65/137 (47%), Gaps = 10/137 (7%)

Query: 45  LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
           LWL  E  I +    ++P    E  R++++  ++     YY   ++ PFGS P   YLP 
Sbjct: 410 LWLHKE--IMDFYYHVKPRKFEEVIRQKLLVDLRNNFRRYYPDADILPFGSFPAGLYLPT 467

Query: 104 GDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
            D+D+ ++  + ++    ++     + ++  N ++D +   +      A+V + K   ++
Sbjct: 468 ADMDVVMVSDEFMDGGWPKYSSKSHIWRVKSNIERDRLNTGRIDVITSAKVPLAKWVDAI 527

Query: 157 QNIPVDISFNQMAGLSA 173
             + VD+SF    GL A
Sbjct: 528 TGLKVDMSFENDTGLIA 544


>gi|297819094|ref|XP_002877430.1| hypothetical protein ARALYDRAFT_484955 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323268|gb|EFH53689.1| hypothetical protein ARALYDRAFT_484955 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 680

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 46/207 (22%), Positives = 87/207 (42%), Gaps = 21/207 (10%)

Query: 36  LYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI--EVFP-- 91
           L  + ID+ + L  ++ + ++ C+ +P       RKE++  +  +    YG   E  P  
Sbjct: 30  LKRYKIDSYILLDLDKVLNDVYCSFRPVSADYDTRKELVKNLNAMAIDIYGNSEESSPVL 89

Query: 92  --FGSVPLKTYLPDGDIDLTV--------LGHQNVEEDLARFVCKILENEDQDSVFEIKD 141
             +GS  +  Y    D+D+++        L  +   E L RF  K+   + +  V  ++ 
Sbjct: 90  EAYGSFVMDMYSSQSDLDVSINFGNGTPELPREKKLEILKRFAKKLRSLQGEGHVKNVES 149

Query: 142 VQYVPAQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA 199
           +    A+V IVK S Q   V  D+S     G+     +  + ++ G+   F++  +L+K 
Sbjct: 150 I--FSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIISQIDGR---FQKLCMLVKH 204

Query: 200 WCYYESRILGAHYGLISTYALEMMVLH 226
           W          H  L S     ++ LH
Sbjct: 205 WAKAHEVNSALHRTLNSVSITLLVALH 231


>gi|326470670|gb|EGD94679.1| topoisomerase TRF4 [Trichophyton tonsurans CBS 112818]
          Length = 631

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 276 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 335

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
           DL +L  Q +  +  +++C+ +      + +       VP        A+V I+K   ++
Sbjct: 336 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPGSIETIAHARVPIIKFVDAL 394

Query: 157 QNIPVDISFNQMAGLSAL 174
             + VD+SF+  +GL+A+
Sbjct: 395 TGLKVDLSFDNSSGLAAI 412


>gi|327307976|ref|XP_003238679.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
 gi|326458935|gb|EGD84388.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
          Length = 631

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 31/138 (22%), Positives = 69/138 (50%), Gaps = 12/138 (8%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 276 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 335

Query: 107 DLTVLGHQNVEEDLARFVCK-------ILENEDQDSVFEIKDVQYVP-AQVKIVKC--SV 156
           DL +L  Q +  +  +++C+         E      +   + ++ +  A+V I+K   ++
Sbjct: 336 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPRSIETIAHARVPIIKFVDAL 394

Query: 157 QNIPVDISFNQMAGLSAL 174
             + VD+SF+  +GL+A+
Sbjct: 395 TGLKVDLSFDNSSGLTAI 412


>gi|326479586|gb|EGE03596.1| topoisomerase TRF4 [Trichophyton equinum CBS 127.97]
          Length = 631

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 276 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 335

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
           DL +L  Q +  +  +++C+ +      + +       VP        A+V I+K   ++
Sbjct: 336 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPGSIETIAHARVPIIKFVDAL 394

Query: 157 QNIPVDISFNQMAGLSAL 174
             + VD+SF+  +GL+A+
Sbjct: 395 TGLKVDLSFDNSSGLAAI 412


>gi|63101121|gb|AAY33178.1| PAPa [Candida parapsilosis]
 gi|354544642|emb|CCE41367.1| hypothetical protein CPAR2_303560 [Candida parapsilosis]
          Length = 552

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 51/228 (22%), Positives = 81/228 (35%), Gaps = 65/228 (28%)

Query: 69  KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
           KR EV+  +Q+L   +                  G ++F FGS  L  Y P  DID L V
Sbjct: 46  KRVEVLTTLQKLTEQFVYEVSLQRNMSQGMAKDAGGKIFTFGSYKLGVYGPSSDIDALVV 105

Query: 111 LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFNQ-- 167
           +      +D      KIL         E++++  V  A V I+K     I +D+ F +  
Sbjct: 106 VPRHVTRDDFFTVFEKILRGRQ-----ELEEINCVKEAFVPIIKLEFAGISIDLLFAKLD 160

Query: 168 -----------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
                                  M  L+     +++ RL+ K  +FK ++  +K W   +
Sbjct: 161 IPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKSTVFKNALRFVKMWA-QQ 219

Query: 205 SRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
             I    YG     A  M+V               L +F   Y+ + W
Sbjct: 220 RAIYANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWSW 267


>gi|302657626|ref|XP_003020531.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
 gi|291184373|gb|EFE39913.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
          Length = 577

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 222 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 281

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
           DL +L  Q +  +  +++C+ +      + +       VP        A+V I+K   ++
Sbjct: 282 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGLAVPGSIETIAHARVPIIKFVDAL 340

Query: 157 QNIPVDISFNQMAGLSAL 174
             + VD+SF+  +GL+A+
Sbjct: 341 TGLKVDLSFDNSSGLAAI 358


>gi|170059968|ref|XP_001865594.1| sigma DNA polymerase [Culex quinquefasciatus]
 gi|167878539|gb|EDS41922.1| sigma DNA polymerase [Culex quinquefasciatus]
          Length = 618

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 86/185 (46%), Gaps = 17/185 (9%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGD 105
           ++  E I++    + P       R +V+  I+ ++ N +    V  FGS     YLP  D
Sbjct: 1   MILHEEIEQFYAHMIPTGTEHTLRVQVVARIEAIVLNLWPMARVEMFGSFRTGLYLPTSD 60

Query: 106 IDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPV 161
           IDL V+G         +   + LENE     + E   V+ +  A V IVK + +   + V
Sbjct: 61  IDLVVIGR------WEKLPLRTLENELISRGIAEPMSVRVLDKASVPIVKLTDRETQVKV 114

Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
           DISFN  +G+ +   ++   R   +  +  + V+++K +     R L   + G IS+Y+L
Sbjct: 115 DISFNMQSGVQSAELIKDFKR---QYPVLAKLVLVLKQFLL--QRDLNEVFTGGISSYSL 169

Query: 221 EMMVL 225
            +M +
Sbjct: 170 ILMCI 174


>gi|195447518|ref|XP_002071250.1| GK25231 [Drosophila willistoni]
 gi|194167335|gb|EDW82236.1| GK25231 [Drosophila willistoni]
          Length = 1048

 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 53/211 (25%), Positives = 89/211 (42%), Gaps = 21/211 (9%)

Query: 23  HTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN 82
           H    + Y   P   P     E  +   E I+     + P       R EV+  I+ +++
Sbjct: 260 HMEMIAKYKGTPWRKPDYHYGEGAIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEGVVH 319

Query: 83  GYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIK 140
             +   V   FGS     +LP  DIDL VLG         +   + LE E     + E  
Sbjct: 320 SIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEAC 373

Query: 141 DVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRSV 194
            V+ +  A V I+K + +   + VDISFN  +G+ +   ++K  R   ++GK  L  +  
Sbjct: 374 TVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQF 433

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVL 225
           +L++        +     G IS+Y+L +M +
Sbjct: 434 LLLRD-------LNEVFTGGISSYSLILMCI 457


>gi|170584484|ref|XP_001897029.1| PAP/25A associated domain containing protein [Brugia malayi]
 gi|158595564|gb|EDP34107.1| PAP/25A associated domain containing protein [Brugia malayi]
          Length = 747

 Score = 43.5 bits (101), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)

Query: 139 IKDVQYVPAQVKIVKCSV----QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV 194
            K  QY+PA+V I+K  +    + + +DI+ N +AG+     L    R+   D  F    
Sbjct: 515 FKTCQYIPAKVPIIKLEMAAPYEELEIDINCNNVAGIYNSHLLHYYSRV---DDRFPALC 571

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +L+K W    + I  A  G +++Y+L +MVLH
Sbjct: 572 LLVKHWA-INAGINNAMMGTLNSYSLILMVLH 602


>gi|302508901|ref|XP_003016411.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
 gi|291179980|gb|EFE35766.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
          Length = 577

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 222 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 281

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
           DL +L  Q +  +  +++C+ +      + +       VP        A+V I+K   ++
Sbjct: 282 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPGSIETIAHARVPIIKFVDAL 340

Query: 157 QNIPVDISFNQMAGLSAL 174
             + VD+SF+  +GL+A+
Sbjct: 341 TGLKVDLSFDNSSGLAAI 358


>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
          Length = 464

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 4/129 (3%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           + +  ++  +  I P    ++ R  ++  I R +   +   +V PFGS   K YLP GDI
Sbjct: 140 MLQRDVEAFIDYISPTPAEDEIRGLIVQLISRAVTQAFPDAQVLPFGSYETKLYLPLGDI 199

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDIS 164
           DL +        D    V   L N  + +    +      A+V I+K   +     VDIS
Sbjct: 200 DLVIQSPSMAYSDKVT-VLHALANTMRRAGITDRVTIVAKAKVPIIKFITTHGRFAVDIS 258

Query: 165 FNQMAGLSA 173
            NQ  G++A
Sbjct: 259 LNQTNGVAA 267


>gi|449480779|ref|XP_004155993.1| PREDICTED: poly(A) polymerase-like [Cucumis sativus]
          Length = 758

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 58/250 (23%), Positives = 100/250 (40%), Gaps = 53/250 (21%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVFP 91
           L  F +DA L+   EE  + +E+L  I   IV +  K+   +  Y  +++       +F 
Sbjct: 48  LEKFLVDAGLYESKEESAKREEVLSRIG-QIVKDWVKQLTRIKGYTDQMVEDANAA-IFT 105

Query: 92  FGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQ 148
           FGS  L  + P  DID   +G  + N EED    +  ILE  +     E+ ++Q VP A 
Sbjct: 106 FGSYRLGVHGPGADIDTLCVGPSYVNREEDFFYMLHNILEEME-----EVSELQPVPDAH 160

Query: 149 VKIVKCSVQNIPVDISFNQMAGL--------SALCFLEKVD-----------------RL 183
           V ++K     I +D+ +  ++ L        S +  L  VD                 +L
Sbjct: 161 VPVMKFKFDGISIDLLYASISCLVVPEDLDISDVSVLYNVDEPTVRSLNGCRVADQILKL 220

Query: 184 IGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTYALEM-------MVLHRFL 229
           +     F+ ++  +K W             + G ++ L+     ++       M+L RF 
Sbjct: 221 VPNVESFRTALRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVPSMLLSRFF 280

Query: 230 DYYNTFDWDN 239
             Y  + W N
Sbjct: 281 RVYTLWRWPN 290


>gi|340371638|ref|XP_003384352.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Amphimedon queenslandica]
          Length = 462

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 74/146 (50%), Gaps = 16/146 (10%)

Query: 88  EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYV-- 145
           +V+PFGS     YLP  DID+ VLG     E LA  +   LE+    +   I+D   V  
Sbjct: 133 QVYPFGSFCTNLYLPTSDIDIVVLG-----EWLALPLFS-LEDAFLKAQIAIEDSIMVLD 186

Query: 146 PAQVKIVKCSVQ--NIPVDISFNQMAGL-SALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
              V I+K + +   + VDISFNQ  G+ SA    + V +     +L     +++K +  
Sbjct: 187 KTTVPIIKFTDRETEVKVDISFNQETGIYSANLICQYVQQFPYLPYL----ALIVKQFL- 241

Query: 203 YESRILGAHYGLISTYALEMMVLHRF 228
            + ++   +YG I++Y+L +M++  F
Sbjct: 242 AQRQLNEVYYGGINSYSLILMLVSFF 267


>gi|324975502|gb|ADY62684.1| PAPa [Candida orthopsilosis]
          Length = 547

 Score = 43.5 bits (101), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 45/226 (19%), Positives = 88/226 (38%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + KKR EV++  Q L+  +                  G ++F FGS  L  Y P  DID 
Sbjct: 42  ATKKRAEVLSLFQTLVQQFVYEVSISKNMSDGMAKDAGGKIFTFGSYRLGVYGPSSDIDA 101

Query: 108 LTVLGHQNVEEDLARFVCKILENEDQ--------DSVFEIKDVQY--VPAQVKIVKCSVQ 157
           L V+      ED      KI+    +        D+   I  +++  +   + + + +V 
Sbjct: 102 LVVVPRHVTREDFFTTFDKIIRQRSELQEINGVSDAFVPIIKLEFDGISLDLIMARLNVP 161

Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   E  
Sbjct: 162 RVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVPKPGVFKHALRCIKLWA-QERA 220

Query: 207 ILGAHYGLISTYALEMM---------------VLHRFLDYYNTFDW 237
           + G  +G     A  M+               ++ +F + Y  ++W
Sbjct: 221 VYGNVFGFPGGVAWAMLTARICQLYPNAVSAVIVEKFFNIYTKWNW 266


>gi|363751202|ref|XP_003645818.1| hypothetical protein Ecym_3523 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889452|gb|AET39001.1| Hypothetical protein Ecym_3523 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 683

 Score = 43.5 bits (101), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 47/190 (24%), Positives = 83/190 (43%), Gaps = 18/190 (9%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  I P     +KR + I  I++ +  ++   ++  FGS     YLP  
Sbjct: 187 WLTLE--IKDFVSYISPNREEIRKRNDAITKIRKAVKSFWPDSDLHCFGSYATDLYLPGS 244

Query: 105 DIDLTV---LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNI 159
           DID  V    G ++ +  L  F   + +N     V  I       A+V I+K    V  I
Sbjct: 245 DIDCVVNSKSGDKDNKNALYSFASYLRKNGLASQVSVI-----AKARVPIIKFVEPVSQI 299

Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
            +D+SF +  G+ A   +      +G     +  V+ IK + Y   R+   H G +  ++
Sbjct: 300 HIDVSFERTNGVDAAKIIRG---WLGDTPGLRELVLTIKQFLYAR-RLNDVHIGGLGGFS 355

Query: 220 LEMMVLHRFL 229
           + + + + FL
Sbjct: 356 I-ICLTYSFL 364


>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
 gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
           Full=Caffeine-induced death protein 14; AltName:
           Full=Polynucleotide adenylyltransferase cid14
 gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
          Length = 684

 Score = 43.5 bits (101), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 33/222 (14%)

Query: 39  FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYI-QRLINGYYGIEVFPFGSVPL 97
           + ++ E+  +  + I   +  I P       RK +++ I Q ++  +  + ++ FGS   
Sbjct: 231 YKVEREVSRIFHQDILHFIDYITPTPEEHAVRKTLVSRINQAVLQKWPDVSLYVFGSFET 290

Query: 98  KTYLPDGDIDLTVLG----HQNVEED---LARFVCKI-LENEDQDSVFEIKDVQYV-PAQ 148
           K YLP  D+DL ++     ++  ++D   LA  + K+ L +E          VQ +  A 
Sbjct: 291 KLYLPTSDLDLVIISPEHHYRGTKKDMFVLAHHLKKLKLASE----------VQVITTAN 340

Query: 149 VKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
           V I+K    +  + VDISFNQ  GL   C +  V+  + K    +  VI+IK   +   R
Sbjct: 341 VPIIKFVDPLTKVHVDISFNQPGGLKT-CLV--VNGFMKKYPALRPLVIIIKH--FLNMR 395

Query: 207 ILGAHY-GLISTYALEMMV-----LHRFLDYYNTFDWDNYCI 242
            L   + G +S+YA+  +V     LH  L   +  + DN+ +
Sbjct: 396 ALNEVFLGGLSSYAIVCLVVSFLQLHPRLSTGSMREEDNFGV 437


>gi|449296989|gb|EMC93008.1| hypothetical protein BAUCODRAFT_49110, partial [Baudoinia
           compniacensis UAMH 10762]
          Length = 611

 Score = 43.5 bits (101), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 63/140 (45%), Gaps = 18/140 (12%)

Query: 52  RIQEILCT----IQPAIVSEKKRKEVINYIQRLI-----NGYYGIEVFPFGSVPLKTYLP 102
           R+ + +C     +QP    E+ R+++I+ ++R I      G +G ++  FGS     YLP
Sbjct: 288 RLHKEICDFFNYVQPKNFEEEARRDLISRVERAILTSNIAGSHGTKIHCFGSFAAGLYLP 347

Query: 103 DGDIDLTVLGHQNVEEDLARF------VCKILENEDQDSVFEIKDVQYV-PAQVKIVKC- 154
             D+DL  +     ++ L         + ++  +  +  V     VQ V  A+V IVK  
Sbjct: 348 TADMDLVAISSTFQQQGLRTVGQSGTQMHRLSAHLTRSGVAAAGSVQVVLHAKVPIVKFI 407

Query: 155 -SVQNIPVDISFNQMAGLSA 173
                I VDISF    G+ A
Sbjct: 408 DKKTGIKVDISFENRTGIVA 427


>gi|392867853|gb|EAS33553.2| topoisomerase TRF4 [Coccidioides immitis RS]
          Length = 680

 Score = 43.5 bits (101), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 40/221 (18%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I   +RL+ N + G ++  FGS     YLP  D+DL +L    + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
              +F+C+ +++    + + I+D +  VP        A+V I+K    +  + VD+SF+ 
Sbjct: 318 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 375

Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGL-ISTY 218
            +GL+A    ++      K+H           +  +L++      +  LG    + + T 
Sbjct: 376 NSGLAANRTFQQ-----WKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGLGGFSIICLVTS 430

Query: 219 ALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
            L+ +           VL  F D+Y N FD+    I +N P
Sbjct: 431 LLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 471


>gi|389748468|gb|EIM89645.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
          Length = 479

 Score = 43.5 bits (101), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 56/130 (43%), Gaps = 4/130 (3%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           L    +   +  I P  V ++ R  V+  IQR I+  +   +V  FGS   K YLP GDI
Sbjct: 102 LLHREVDAFVRYISPTPVEDEIRSLVVLQIQRCISSKFPDAKVRSFGSYETKLYLPLGDI 161

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDIS 164
           DL ++       D    V   + N  + +    +      A+V IVK   +     VDIS
Sbjct: 162 DLVIISKSMAYSDRVT-VLHAVANTLRTAGITDRVSVIAKAKVPIVKFVTTFGRFAVDIS 220

Query: 165 FNQMAGLSAL 174
            N   G+ A+
Sbjct: 221 INMSNGVEAI 230


>gi|392580130|gb|EIW73257.1| hypothetical protein TREMEDRAFT_22292, partial [Tremella
           mesenterica DSM 1558]
          Length = 303

 Score = 43.5 bits (101), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 17/94 (18%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTVL-------GHQNVEEDLARFVCKILENEDQDSVFEIKD 141
           V PFGS   + YLP GDIDL V          +N+  DLAR +   +     D+V  I  
Sbjct: 62  VTPFGSWQTQLYLPQGDIDLVVTHPTLTEHNKKNLLNDLARTMRYAMIT---DNVVVIS- 117

Query: 142 VQYVPAQVKIVKCSVQN--IPVDISFNQMAGLSA 173
                A+V I+K   ++  + VDIS NQ+ G+SA
Sbjct: 118 ----KARVPIIKFVTKHGKLNVDISLNQVNGISA 147


>gi|324975520|gb|ADY62700.1| PAPa [Candida orthopsilosis]
          Length = 547

 Score = 43.5 bits (101), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 46/226 (20%), Positives = 88/226 (38%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + KKR EV++  Q L+  +                  G ++F FGS  L  Y P  DID 
Sbjct: 42  ATKKRAEVLSLFQTLVQQFVYEVSISKNMSDGMAKDAGGKIFTFGSYRLGVYGPSSDIDA 101

Query: 108 LTVLGHQNVEEDLARFVCKILE--------NEDQDSVFEIKDVQY--VPAQVKIVKCSVQ 157
           L V+      ED      KI+         N   D+   I  +++  +   + + + +V 
Sbjct: 102 LVVVPRHVTREDFFTTFDKIIRQRPELQEINGVSDAFVPIIKLEFDGISLDLIMARLNVP 161

Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   E  
Sbjct: 162 RVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVPKPGVFKHALRCIKLWA-QERA 220

Query: 207 ILGAHYGLISTYALEMM---------------VLHRFLDYYNTFDW 237
           + G  +G     A  M+               ++ +F + Y  ++W
Sbjct: 221 VYGNVFGFPGGVAWAMLTARICQLYPNAVSAVIVEKFFNIYTKWNW 266


>gi|303323645|ref|XP_003071814.1| PAP/25A associated domain containing protein [Coccidioides
           posadasii C735 delta SOWgp]
 gi|240111516|gb|EER29669.1| PAP/25A associated domain containing protein [Coccidioides
           posadasii C735 delta SOWgp]
          Length = 680

 Score = 43.5 bits (101), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 40/221 (18%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I   +RL+ N + G ++  FGS     YLP  D+DL +L    + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
              +F+C+ +++    + + I+D +  VP        A+V I+K    +  + VD+SF+ 
Sbjct: 318 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 375

Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGL-ISTY 218
            +GL+A    ++      K+H           +  +L++      +  LG    + + T 
Sbjct: 376 NSGLAANRTFQQ-----WKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTS 430

Query: 219 ALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
            L+ +           VL  F D+Y N FD+    I +N P
Sbjct: 431 LLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 471


>gi|70987233|ref|XP_749095.1| topoisomerase family protein TRF4 [Aspergillus fumigatus Af293]
 gi|66846725|gb|EAL87057.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
           Af293]
          Length = 702

 Score = 43.1 bits (100), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 58/246 (23%), Positives = 100/246 (40%), Gaps = 43/246 (17%)

Query: 42  DAELWLLAEE-------RIQ-EILCT---IQPAIVSEKKRKEVINYIQRLING-YYGIEV 89
           DA  W+L  E       R+  EIL     ++P    +  R ++I  +Q      YYG+++
Sbjct: 243 DATPWMLLMEPSLHLGSRLHNEILSFYHWVKPMQYEQIVRADLITRLQVAFQSRYYGVQL 302

Query: 90  FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF---------VCKILENEDQDSVFEIK 140
            PFGS     YLP  DIDL +L    +   +  F             L+N +      I+
Sbjct: 303 RPFGSFASGLYLPTADIDLVLLSSNFMRNGIKTFGERKGQIYAFAAFLKNLEIAVPNSIE 362

Query: 141 DVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-----------------LCFLEKVD 181
            + +  A+V I+K    +  + VD+SF+  +GL A                 +  +++  
Sbjct: 363 TIAH--ARVPILKFVDKMTGLRVDLSFDNDSGLIANNTFQNWKSEYPAMPVIVAVVKQFL 420

Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYC 241
            L G + +    +      C   S +    +G  +   L  +++  F  Y N FD++N  
Sbjct: 421 LLRGLNEVPTGGLGGFSITCLVTSLLQHLPHGHTAP-NLGSILMDFFEFYGNNFDFENVG 479

Query: 242 ISINGP 247
           I +N P
Sbjct: 480 IRLNPP 485


>gi|224064673|ref|XP_002197521.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
           [Taeniopygia guttata]
          Length = 443

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 19/121 (15%)

Query: 67  EKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEED 119
           E  R EV+N I+ +I   +   +V  FGS     YLP  DIDL V G         +EE 
Sbjct: 8   ETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLPLWTLEEA 67

Query: 120 LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFL 177
           L +         D++SV ++ D   VP    I+K   S   + VDISFN   G+ A   +
Sbjct: 68  LRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNGVKAAQLI 117

Query: 178 E 178
           +
Sbjct: 118 K 118


>gi|344301689|gb|EGW31994.1| Poly(A) polymerase PAPalpha [Spathaspora passalidarum NRRL Y-27907]
          Length = 556

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 44/226 (19%), Positives = 88/226 (38%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + KKR EV+   Q+++  +                  G +VF FGS  L  Y P  DID 
Sbjct: 43  ATKKRAEVLALFQKMVQEFVYTVSKSKNMSDGMAKDAGGKVFTFGSYRLGVYGPGSDIDT 102

Query: 108 LTVLGHQNVEEDLARFVCKILENEDQ--------DSVFEIKDVQY--VPAQVKIVKCSVQ 157
           L V+      ED      +I+    +        D+   I  +++  +   + + + +V 
Sbjct: 103 LVVVPKHVTREDFFTVFEQIIRKRPELQEIASVPDAFVPIIKIEFDGISIDLILARLNVP 162

Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W    + 
Sbjct: 163 RVPLDMTLDDKNLLKNIDERDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKLWAQQRA- 221

Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           + G  +G     A  M+V               + +F + Y  ++W
Sbjct: 222 VYGNVFGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFNIYTKWNW 267


>gi|320035002|gb|EFW16944.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
          Length = 680

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 40/221 (18%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I   +RL+ N + G ++  FGS     YLP  D+DL +L    + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
              +F+C+ +++    + + I+D +  VP        A+V I+K    +  + VD+SF+ 
Sbjct: 318 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 375

Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGL-ISTY 218
            +GL+A    ++      K+H           +  +L++      +  LG    + + T 
Sbjct: 376 NSGLAANRTFQQ-----WKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTS 430

Query: 219 ALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
            L+ +           VL  F D+Y N FD+    I +N P
Sbjct: 431 LLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 471


>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 642

 Score = 43.1 bits (100), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 46/196 (23%), Positives = 88/196 (44%), Gaps = 24/196 (12%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ +++ +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKSEIKCRNRTIDKLRQAVKQLWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEED------LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
           DID  +    + +ED      LAR+    L+NE      E+     V  +V I+K    +
Sbjct: 233 DIDCVINSRHHDKEDRNYIYELARY----LKNEGLAIRMEV----IVRTRVPIIKFIEPL 284

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
             + +D+SF +  GL A   +    R   +D    R ++L+     +  R+   H G + 
Sbjct: 285 SQLHIDVSFERTNGLEAARLI----REWLRDSPGLRELVLVIKQFLHSRRLNNVHTGGLG 340

Query: 217 TYALEMMVLHRFLDYY 232
            + + + +++ FL+ +
Sbjct: 341 GFTV-ICLVYSFLNMH 355


>gi|221055315|ref|XP_002258796.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193808866|emb|CAQ39569.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 548

 Score = 43.1 bits (100), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 98/214 (45%), Gaps = 32/214 (14%)

Query: 31  SSPPNLYPFS-IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIE 88
           S PP  YP S +DAEL         ++   ++P+       K  + ++Q+ IN +Y    
Sbjct: 205 SPPPGEYPPSALDAEL--------NKLEIALRPSQNDVNSIKTFLAFLQKEINKHYKNCH 256

Query: 89  VFPFGSVPLKTYLPDGDIDLT----VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQY 144
           V PFGS+    +  + DID+     +L  +  +    + +C IL N + D + E    Q 
Sbjct: 257 VTPFGSIINGFWTRNSDIDICIQIPILLSRKDQITFLKKICLILNNFN-DGIIE----QR 311

Query: 145 VPAQVKIVKCSVQN------IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIK 198
             A+V I+    ++      +  DIS N +    A+   + + + +  D   +   I +K
Sbjct: 312 FSAKVPIIHFYCKSLRHSFELSCDISVNNIL---AVINSKLIQKYVSIDRRLQLMGIALK 368

Query: 199 AWCYYESR-ILGAHYGLISTYALEMMVLHRFLDY 231
            W   +SR I     G +S+++L +M++H FL Y
Sbjct: 369 YWS--KSRNINDRSKGFLSSFSLILMIIH-FLQY 399


>gi|68482706|ref|XP_714750.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
 gi|3334283|sp|O42617.1|PAP_CANAL RecName: Full=Poly(A) polymerase PAPalpha; AltName:
           Full=Polynucleotide adenylyltransferase alpha
 gi|2696030|dbj|BAA23802.1| poly A polymerase [Candida albicans]
 gi|5771514|gb|AAD51412.1| unknown [Candida albicans]
 gi|46436342|gb|EAK95706.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
          Length = 558

 Score = 43.1 bits (100), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 47/227 (20%), Positives = 90/227 (39%), Gaps = 57/227 (25%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
           + KKR EV+   QRL+  +                  G +VF FGS  L  Y P  DID 
Sbjct: 43  ATKKRVEVLTLFQRLVQEFVYTVSKSKNMSDSMAQDAGGKVFTFGSYRLGVYGPGSDIDT 102

Query: 109 TVLGHQNVEED--LARFVCKILENEDQDSVFEIKDVQYVP----------AQVKIVKCSV 156
            V+  ++V  D   + F   I +  + + +  + D  YVP            + + + ++
Sbjct: 103 LVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDA-YVPIIKLEFDGISIDLIMARLNI 161

Query: 157 QNIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
             +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   + 
Sbjct: 162 PRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKLWA-QQR 220

Query: 206 RILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
            + G  +G     A  M+V               + +F + Y  ++W
Sbjct: 221 AVYGNIFGFPGGVAWAMLVARICQLYPNAVSSAIVEKFFNIYTKWNW 267


>gi|449472874|ref|XP_004176276.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
           [Taeniopygia guttata]
          Length = 490

 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 19/121 (15%)

Query: 67  EKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEED 119
           E  R EV+N I+ +I   +   +V  FGS     YLP  DIDL V G         +EE 
Sbjct: 8   ETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLPLWTLEEA 67

Query: 120 LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFL 177
           L +         D++SV ++ D   VP    I+K   S   + VDISFN   G+ A   +
Sbjct: 68  LRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNGVKAAQLI 117

Query: 178 E 178
           +
Sbjct: 118 K 118


>gi|195131647|ref|XP_002010257.1| GI15833 [Drosophila mojavensis]
 gi|193908707|gb|EDW07574.1| GI15833 [Drosophila mojavensis]
          Length = 807

 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 75/305 (24%), Positives = 118/305 (38%), Gaps = 61/305 (20%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDL 108
            E I      + P       R EV+  I+ +++  +   V   FGS     +LP  DIDL
Sbjct: 301 HEEIDHFYQYVLPTACEHAIRNEVVKRIESVVHSIWPQAVVEIFGSFRTGLFLPTSDIDL 360

Query: 109 TVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
            VLG         +   + LE E     + E   V+ +  A V I+K + +   + VDIS
Sbjct: 361 VVLGLWE------KLPLRTLEFELVARGIAEACTVRVLDKASVPIIKLTDRETQVKVDIS 414

Query: 165 FNQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
           FN  +G+ +   +++  R   ++GK  L  +  +L++        +     G IS+Y+L 
Sbjct: 415 FNMQSGVQSAELIKQFKRDYPVLGKLVLVLKQFLLLRD-------LNEVFTGGISSYSLI 467

Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
           +M                 CIS      +   P  +     N G  LL   E   +    
Sbjct: 468 LM-----------------CISF-----LQLHPRGIYHDTTNLGVLLLEFFELYGQSFNY 505

Query: 282 YSVPITALENTGHEFAIKHLN-------------IVDPLKDNNNLGRSVSKGNFH---RI 325
            ++ I +++N G       L              I DPL   N++GRS S G  H     
Sbjct: 506 INIGI-SIKNGGRYMPKDELQRDMVDGHRPSLLCIEDPLTPGNDIGRS-SYGALHVRQAF 563

Query: 326 RCALS 330
           RCA S
Sbjct: 564 RCAFS 568


>gi|119188673|ref|XP_001244943.1| hypothetical protein CIMG_04384 [Coccidioides immitis RS]
          Length = 785

 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 100/222 (45%), Gaps = 42/222 (18%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I   +RL+ N + G ++  FGS     YLP  D+DL +L    + +
Sbjct: 363 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 422

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
              +F+C+ +++    + + I+D +  VP        A+V I+K    +  + VD+SF+ 
Sbjct: 423 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 480

Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGLIS--T 217
            +GL+A    ++      K+H           +  +L++      +  LG  + +I   T
Sbjct: 481 NSGLAANRTFQQ-----WKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGLGG-FSIICLVT 534

Query: 218 YALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
             L+ +           VL  F D+Y N FD+    I +N P
Sbjct: 535 SLLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 576


>gi|238882575|gb|EEQ46213.1| Poly(A) polymerase [Candida albicans WO-1]
          Length = 558

 Score = 43.1 bits (100), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 47/227 (20%), Positives = 90/227 (39%), Gaps = 57/227 (25%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
           + KKR EV+   QRL+  +                  G +VF FGS  L  Y P  DID 
Sbjct: 43  ATKKRVEVLTLFQRLVQEFVYTVSKSKNMSDSMAQDAGGKVFTFGSYRLGVYGPGSDIDT 102

Query: 109 TVLGHQNVEED--LARFVCKILENEDQDSVFEIKDVQYVP----------AQVKIVKCSV 156
            V+  ++V  D   + F   I +  + + +  + D  YVP            + + + ++
Sbjct: 103 LVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDA-YVPIIKLEFDGISIDLIMARLNI 161

Query: 157 QNIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
             +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   + 
Sbjct: 162 PRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKLWA-QQR 220

Query: 206 RILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
            + G  +G     A  M+V               + +F + Y  ++W
Sbjct: 221 AVYGNIFGFPGGVAWAMLVARICQLYPNAVSSAIVEKFFNIYTKWNW 267


>gi|195393956|ref|XP_002055618.1| GJ18687 [Drosophila virilis]
 gi|194150128|gb|EDW65819.1| GJ18687 [Drosophila virilis]
          Length = 1111

 Score = 43.1 bits (100), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 46/183 (25%), Positives = 79/183 (43%), Gaps = 19/183 (10%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDL 108
            E I      + P       R EV+  I+ +++  +   V   FGS     +LP  DIDL
Sbjct: 270 HEEIDHFYQYVLPTPCEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPTSDIDL 329

Query: 109 TVLGHQNVEEDLARFVCK---ILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISF 165
            VLG   + E L     +   +  N  +     + D   VP  +K+     Q + VDISF
Sbjct: 330 VVLG---LWEKLPLRTLEFELVSRNIAEACTVRVLDKASVPI-IKLTDRETQ-VKVDISF 384

Query: 166 NQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           N  +G+ +   +++  R   ++GK  L  +  +L++        +     G IS+Y+L +
Sbjct: 385 NMQSGVQSAELIKQFKRDYPVLGKLVLVLKQFLLLR-------DLNEVFTGGISSYSLIL 437

Query: 223 MVL 225
           M +
Sbjct: 438 MCI 440


>gi|452839453|gb|EME41392.1| hypothetical protein DOTSEDRAFT_46399 [Dothistroma septosporum
           NZE10]
          Length = 754

 Score = 43.1 bits (100), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 82/186 (44%), Gaps = 35/186 (18%)

Query: 10  MEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKK 69
           + D+ +E L  P+HT+ +      P     +  + + L   + I +    ++P    E+ 
Sbjct: 290 LGDIVDEWL--PNHTNPTPWIQPDP-----AFTSSVGLKLHKEILDFYDYVRPHRHEEEL 342

Query: 70  RKEVINYIQRLINGYY-------GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLAR 122
           R  +I+ +QR +  +         IEV  FGS P   YLP  D+DL  L    ++  L R
Sbjct: 343 RAGIIDRLQRDLQYFRQIGPNVNKIEVRSFGSFPAGLYLPTADMDLVALSSDYLDHGLKR 402

Query: 123 FVCKILENEDQDSVFEIKD-------------VQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +C+I ++     ++++ D                + A+V +VK       I VD+SF  
Sbjct: 403 -LCQIRKH-----MWKMSDHFNRSRLPAPGTVAPVIGAKVPLVKFVDGHTGIKVDLSFEN 456

Query: 168 MAGLSA 173
            +GL+A
Sbjct: 457 DSGLTA 462


>gi|410730487|ref|XP_003671423.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
 gi|401780241|emb|CCD26180.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
          Length = 582

 Score = 42.7 bits (99), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 43/185 (23%), Positives = 79/185 (42%), Gaps = 15/185 (8%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  I P+    K+R + I  I+  +N ++    +  FGS     YLP  
Sbjct: 168 WLTME--IRDFVSYISPSKEEIKERNDTIGRIRDAVNHFWNDANLHVFGSYATDLYLPGS 225

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
           DID  ++  +  ++  +    +   L+     +  E+       A+V I+K       I 
Sbjct: 226 DIDCVIISEKGDKDSRSSLYALANFLKKRGLATDIEV----IAKARVPIIKFIDPRSKIH 281

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           +D+SF ++ GL A     K+ R    D    R + LI     +  R+   H G +  +++
Sbjct: 282 IDVSFERINGLEAA----KLIREWLNDTPGLREITLIVKQFLHARRLNNVHTGGLGGFSI 337

Query: 221 EMMVL 225
             +V 
Sbjct: 338 ICLVF 342


>gi|45554544|ref|NP_996381.1| Trf4-1, isoform E [Drosophila melanogaster]
 gi|161077643|ref|NP_996382.2| Trf4-1, isoform C [Drosophila melanogaster]
 gi|442615523|ref|NP_996383.2| Trf4-1, isoform H [Drosophila melanogaster]
 gi|442615525|ref|NP_572490.2| Trf4-1, isoform I [Drosophila melanogaster]
 gi|442615527|ref|NP_001259340.1| Trf4-1, isoform J [Drosophila melanogaster]
 gi|33636605|gb|AAQ23600.1| RE04457p [Drosophila melanogaster]
 gi|45446864|gb|AAS65290.1| Trf4-1, isoform E [Drosophila melanogaster]
 gi|158031751|gb|AAS65289.2| Trf4-1, isoform C [Drosophila melanogaster]
 gi|440216540|gb|AAS65288.2| Trf4-1, isoform H [Drosophila melanogaster]
 gi|440216541|gb|AAF46390.3| Trf4-1, isoform I [Drosophila melanogaster]
 gi|440216542|gb|AGB95183.1| Trf4-1, isoform J [Drosophila melanogaster]
          Length = 1001

 Score = 42.7 bits (99), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 86/209 (41%), Gaps = 15/209 (7%)

Query: 22  SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
           +H    + Y   P   P     E  +   E I+     + P       R EV+  I+ ++
Sbjct: 244 NHMEMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 303

Query: 82  NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
           +  +   V   FGS     +LP  DIDL VLG         +   + LE E     + E 
Sbjct: 304 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEA 357

Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
             V+ +  A V I+K + +   + VDISFN  +G+ +   ++K  R    D+     ++L
Sbjct: 358 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKR----DYPVLEKLVL 413

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL 225
           +         +     G IS+Y+L +M +
Sbjct: 414 VLKQFLLLRDLNEVFTGGISSYSLILMCI 442


>gi|147787660|emb|CAN69576.1| hypothetical protein VITISV_028613 [Vitis vinifera]
          Length = 192

 Score = 42.7 bits (99), Expect = 0.58,   Method: Composition-based stats.
 Identities = 18/35 (51%), Positives = 23/35 (65%)

Query: 236 DWDNYCISINGPVAISSLPEIVAETVENDGDELLL 270
           DWD++C+S+ GPV ISSLP+   E       ELLL
Sbjct: 148 DWDSFCVSLWGPVPISSLPDATTEPPRQGSRELLL 182


>gi|195355122|ref|XP_002044042.1| GM21691 [Drosophila sechellia]
 gi|194129295|gb|EDW51338.1| GM21691 [Drosophila sechellia]
          Length = 1000

 Score = 42.7 bits (99), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 78/181 (43%), Gaps = 15/181 (8%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDL 108
            E I+     + P       R EV+  I+ +++  +   V   FGS     +LP  DIDL
Sbjct: 273 HEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPTSDIDL 332

Query: 109 TVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
            VLG         +   + LE E     + E   V+ +  A V I+K + +   + VDIS
Sbjct: 333 VVLGL------WEKLPLRTLEFELVSRGIAEACTVRVLDKASVPIIKLTDRETQVKVDIS 386

Query: 165 FNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV 224
           FN  +G+ +   ++K  R    D+     ++L+         +     G IS+Y+L +M 
Sbjct: 387 FNMQSGVQSAELIKKFKR----DYPVLEKLVLVLKQFLLLRDLNEVFTGGISSYSLILMC 442

Query: 225 L 225
           +
Sbjct: 443 I 443


>gi|50557292|ref|XP_506054.1| YALI0F30525p [Yarrowia lipolytica]
 gi|49651924|emb|CAG78867.1| YALI0F30525p [Yarrowia lipolytica CLIB122]
          Length = 716

 Score = 42.7 bits (99), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 64/141 (45%), Gaps = 15/141 (10%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL  +E + + +  I P     K R++++  ++  +NG +G  +V   GS     +LP  
Sbjct: 248 WL--QEEVMDFVKYISPTQSEIKARQDLVERVRGAVNGLWGDAKVHVLGSFTTNMHLPQS 305

Query: 105 DIDLTVL---GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYV-PAQVKIVKC--SVQN 158
           DIDL V    GH      + +    I       S  ++ ++Q +  A+V I+K   S   
Sbjct: 306 DIDLVVCSPHGHYGERACIYQLSSVIR------SRMKVAELQTITKARVPIIKFIDSRTG 359

Query: 159 IPVDISFNQMAGLSALCFLEK 179
           + VDISF +  G+     + K
Sbjct: 360 VHVDISFEKDGGIKTASTITK 380


>gi|323307579|gb|EGA60848.1| Trf5p [Saccharomyces cerevisiae FostersO]
          Length = 575

 Score = 42.7 bits (99), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 108 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 165

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 166 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 223

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 224 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 278

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 279 CLVYSFLNMH 288


>gi|256271295|gb|EEU06367.1| Trf5p [Saccharomyces cerevisiae JAY291]
 gi|349580652|dbj|GAA25811.1| K7_Trf5p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 642

 Score = 42.7 bits (99), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|299752783|ref|XP_002911796.1| Trf5 [Coprinopsis cinerea okayama7#130]
 gi|298409998|gb|EFI28302.1| Trf5 [Coprinopsis cinerea okayama7#130]
          Length = 816

 Score = 42.4 bits (98), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 54/127 (42%), Gaps = 4/127 (3%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
           +  + ++  +  I P  V ++ R  ++  I   +   +    V PFGS   K YLP GDI
Sbjct: 277 MMHKEVEAFVKWISPTPVEDEIRGLIVKQIAVTVQSKFPDASVLPFGSYETKLYLPMGDI 336

Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDIS 164
           DL +L       +    V   L N  + +    +      A+V IVK   +     VDIS
Sbjct: 337 DLVILSESMAYSNKVS-VLHTLANTLKRAGITSRVTVIAKARVPIVKFVTTHGRFNVDIS 395

Query: 165 FNQMAGL 171
            NQ  GL
Sbjct: 396 INQENGL 402


>gi|190409265|gb|EDV12530.1| hypothetical protein SCRG_03424 [Saccharomyces cerevisiae RM11-1a]
          Length = 642

 Score = 42.4 bits (98), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|348690009|gb|EGZ29823.1| hypothetical protein PHYSODRAFT_466551 [Phytophthora sojae]
          Length = 666

 Score = 42.4 bits (98), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 35/145 (24%), Positives = 68/145 (46%), Gaps = 10/145 (6%)

Query: 88  EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL-----ARFVCKILENEDQDSVFEIKDV 142
           +V  FGS     +LP  D+DL +L   +V +        + + K+LE +       + D 
Sbjct: 202 KVETFGSYSTGIWLPSSDVDLVILDVVDVNDSQLTAKHLKELAKVLEKKKWVESLLVLDT 261

Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWC 201
             VP  +K+V     ++P+DI+F   A  S L   + + R       L+  +++  +   
Sbjct: 262 AKVPV-LKLVSAET-SVPIDITFESAATHSGLLARDLIKRYADNMPELYPLAIVFKQ--L 317

Query: 202 YYESRILGAHYGLISTYALEMMVLH 226
             E  +  A+ G +S+Y++ +M++H
Sbjct: 318 LRERDLNDAYTGGLSSYSVVLMIIH 342


>gi|81360384|ref|NP_014100.2| non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
           S288c]
 gi|148887014|sp|P48561.2|TRF5_YEAST RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
           Full=Topoisomerase 1-related protein TRF5
 gi|151944249|gb|EDN62528.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
 gi|285814367|tpg|DAA10261.1| TPA: non-canonical poly(A) polymerase TRF5 [Saccharomyces
           cerevisiae S288c]
          Length = 642

 Score = 42.4 bits (98), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|207341968|gb|EDZ69877.1| YNL299Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 642

 Score = 42.4 bits (98), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|241955483|ref|XP_002420462.1| poly(A) polymerase, putative; polynucleotide adenylyltransferase,
           putative [Candida dubliniensis CD36]
 gi|223643804|emb|CAX41541.1| poly(A) polymerase, putative [Candida dubliniensis CD36]
          Length = 558

 Score = 42.4 bits (98), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 47/226 (20%), Positives = 91/226 (40%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
           + KKR EV+   Q+L+  +                  G +VF FGS  L  Y P  DID 
Sbjct: 43  ATKKRVEVLTLFQQLVQEFVYTVSKSKNMSDAMAQDAGGKVFTFGSYRLGVYGPGSDIDT 102

Query: 109 TVLGHQNVEED-----LARFVCKILENED----QDSVFEIKDVQY--VPAQVKIVKCSVQ 157
            V+  ++V  D      A  + K  E E+     D+   I  +++  +   + + K ++ 
Sbjct: 103 LVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVPIIKIEFDGISIDLIMAKLNIP 162

Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   +  
Sbjct: 163 RVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKMWA-QQRA 221

Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
           + G  +G     A  M+V               + +F + Y  ++W
Sbjct: 222 VYGNIFGFPGGVAWAMLVARICQLYPNAVSSVIVEKFFNIYTKWNW 267


>gi|392296994|gb|EIW08095.1| Trf5p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 608

 Score = 42.4 bits (98), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 141 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 198

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 199 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 256

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 257 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 311

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 312 CLVYSFLNMH 321


>gi|242014489|ref|XP_002427922.1| PAP-associated domain-containing protein, putative [Pediculus
           humanus corporis]
 gi|212512406|gb|EEB15184.1| PAP-associated domain-containing protein, putative [Pediculus
           humanus corporis]
          Length = 531

 Score = 42.4 bits (98), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 85/179 (47%), Gaps = 11/179 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            + I E    + P     + R +V++ I+  I+  +   +V  FGS     YLP  DIDL
Sbjct: 116 HDEIIEFYNYMSPRKCEHELRLKVVHKIKSAIHELWPQAKVEVFGSFRTGLYLPTSDIDL 175

Query: 109 TVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
            V+G       L +   K+LENE  +    ++ D   VP  +  ++    ++ VDISFN 
Sbjct: 176 VVIGCWPTLP-LWKLEQKLLENEIAEQHTIKVLDKASVP--IIKLRDKESDVKVDISFNM 232

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
            +G+ +   ++   +   K  +  + V+++K   +   R L   + G IS+Y+L +M +
Sbjct: 233 SSGVQSAQLIKTFKK---KFPVLAKLVLVLKQ--FLLQRDLNEVFTGGISSYSLILMTI 286


>gi|301119917|ref|XP_002907686.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106198|gb|EEY64250.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 647

 Score = 42.4 bits (98), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 74/173 (42%), Gaps = 14/173 (8%)

Query: 88  EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL-----ARFVCKILENEDQDSVFEIKDV 142
           +V  FGS     +LP  D+DL +L    V +        R + K+LE +         D 
Sbjct: 189 KVETFGSYSTGIWLPSSDVDLVILDVVEVNDSKLTAKHLRQLAKVLEKKKWVESLLCLDA 248

Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWC 201
             VP  +K+V     ++P+DI+F   A  S L   + + R       L+  +++  +   
Sbjct: 249 AKVPV-LKLVSAET-SVPIDITFESAATHSGLLARDLIKRYADNMPELYPLAIVFKQ--L 304

Query: 202 YYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLP 254
             E  +  A+ G +S+Y++ +M++H F   +   D    C       A  SLP
Sbjct: 305 LRERDLNDAYTGGLSSYSVVLMIIH-FSQLWRNGD---LCFEAASIYASGSLP 353


>gi|159123133|gb|EDP48253.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
           A1163]
          Length = 703

 Score = 42.4 bits (98), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 49/217 (22%), Positives = 89/217 (41%), Gaps = 32/217 (14%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I  +Q      YYG+++ PFGS     YLP  DIDL +L    +  
Sbjct: 273 VKPMQYEQIVRADLITRLQVAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSSNFMRN 332

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L+N +      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 333 GIKTFGERKGQIYAFAAFLKNLEIAVPNSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 390

Query: 168 MAGLSA-----------------LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
            +GL A                 +  +++   L G + +    +      C   S +   
Sbjct: 391 DSGLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGGLGGFSITCLVTSLLQHL 450

Query: 211 HYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGP 247
            +G  +   L  +++  F  Y N FD++N  I +N P
Sbjct: 451 PHGHTAP-NLGSILMDFFEFYGNNFDFENVGIRLNPP 486


>gi|259149072|emb|CAY82314.1| Trf5p [Saccharomyces cerevisiae EC1118]
          Length = 642

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|170592851|ref|XP_001901178.1| PAP/25A associated domain containing protein [Brugia malayi]
 gi|158591245|gb|EDP29858.1| PAP/25A associated domain containing protein [Brugia malayi]
          Length = 421

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 131/295 (44%), Gaps = 20/295 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E + ++   ++P+ +    R  V   ++ ++   +   ++  FGS+    +LP  DID+
Sbjct: 68  HEELLDLYAWLKPSPLERALRLRVFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDIDV 127

Query: 109 TVLGHQNVEE-DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
            V      EE  L +    + E+   +S+  + D  +VP  VK+V    + I +DISFN 
Sbjct: 128 VVESDLVSEEPPLWKTAIALKESGITESI-NVLDKAFVPI-VKMVDKDTK-IYLDISFNT 184

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
           + G+ +  F+E +     +  + +  V+++K +   + ++     G +S+Y L +M++  
Sbjct: 185 VQGVRSAKFIEDMKM---RYPVLEPLVLVLKQFL-MQRQLNQVFTGGLSSYGLILMLIS- 239

Query: 228 FLDYYNTFDWDNYCIS-INGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPI 286
           FL  + ++D+    I+ +N  V + +  ++  +        L +       C++   V +
Sbjct: 240 FLQLHPSYDYSYKRITEVNMGVLLLNFLQLYGQEFNYMKTALRIHSGGAYVCKDEILVQM 299

Query: 287 TALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
               N+        L I DPL+  N++GR     N   +R A  +    L  +  
Sbjct: 300 NRPSNS-------MLCIEDPLQPGNDIGR--CSHNIQLVRQAFEHAFATLCAVFV 345


>gi|323303298|gb|EGA57094.1| Trf5p [Saccharomyces cerevisiae FostersB]
          Length = 419

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 68/300 (22%), Positives = 126/300 (42%), Gaps = 28/300 (9%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 141 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 198

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 199 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 256

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 257 VSFERTXGLEA----AKLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 311

Query: 223 MVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFL----KKC 278
            +++ FL+ +     ++  +  N  V +    E+  +    D   + +S  +     K C
Sbjct: 312 CLVYSFLNMHPRIKSNDIDVLDNLGVLLIDFFELYGKNFGYDDVAISISDGYASYIPKSC 371

Query: 279 REIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGE 338
                     LE +  +F+   L I DP   NNN+ R     N   I+  L +  + +G+
Sbjct: 372 -------WRTLEPSRSKFS---LAIQDPGDXNNNISR--GSFNMKDIKKXLCWCLRTVGQ 419


>gi|225681443|gb|EEH19727.1| polymerase sigma [Paracoccidioides brasiliensis Pb03]
          Length = 684

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 12/125 (9%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R ++I  +QRL    +YG ++  FGS     YLP  D+DL +L  Q + +
Sbjct: 268 VKPKSFEHLIRNDLIVRLQRLFERRHYGSQLHAFGSFASGLYLPTADMDLVLLSRQFLRQ 327

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SVQNIPVDISFNQM 168
           +      +  E     +  +  D+  VP        A+V I+K    +  + VD+SF+  
Sbjct: 328 NRKTLCQRTREIYSFTAYLKDLDIA-VPGSIETIAHAKVPIIKFVDRLTGLKVDLSFDNS 386

Query: 169 AGLSA 173
            GL+A
Sbjct: 387 TGLAA 391


>gi|161077646|ref|NP_001096913.1| Trf4-1, isoform F [Drosophila melanogaster]
 gi|161077648|ref|NP_001096914.1| Trf4-1, isoform G [Drosophila melanogaster]
 gi|158031752|gb|ABW09362.1| Trf4-1, isoform F [Drosophila melanogaster]
 gi|158031753|gb|ABW09363.1| Trf4-1, isoform G [Drosophila melanogaster]
          Length = 802

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 86/209 (41%), Gaps = 15/209 (7%)

Query: 22  SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
           +H    + Y   P   P     E  +   E I+     + P       R EV+  I+ ++
Sbjct: 45  NHMEMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 104

Query: 82  NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
           +  +   V   FGS     +LP  DIDL VLG         +   + LE E     + E 
Sbjct: 105 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEA 158

Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
             V+ +  A V I+K + +   + VDISFN  +G+ +   ++K  R    D+     ++L
Sbjct: 159 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKR----DYPVLEKLVL 214

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL 225
           +         +     G IS+Y+L +M +
Sbjct: 215 VLKQFLLLRDLNEVFTGGISSYSLILMCI 243


>gi|365763604|gb|EHN05131.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 642

 Score = 42.4 bits (98), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|440796505|gb|ELR17614.1| nucleotidyltransferase domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 911

 Score = 42.4 bits (98), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 41/169 (24%), Positives = 79/169 (46%), Gaps = 12/169 (7%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           + P+   ++ R++VI  I +++   +  +++  FGS     YLP  DIDL ++G      
Sbjct: 272 VSPSAEEKQMREDVIARISKVVETLWPSVQLRVFGSCATDIYLPTSDIDLCIMGANACSP 331

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLE 178
                +   L       V  I   + VP  +K+V  +   + VDISF+   G + +  ++
Sbjct: 332 SPIDELASALRRRSMGRVQAIATAR-VPI-IKLVDAATGCL-VDISFDVPTGPAHINLIK 388

Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESR--ILGAHYGLISTYALEMMVL 225
              R + ++   K   +LIK   YY  +  +   + G + +YAL +M++
Sbjct: 389 ---RYLDEEPSVKPLALLIK---YYLKQFGMNEPYTGGLGSYALIIMII 431


>gi|124806252|ref|XP_001350671.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
 gi|23496797|gb|AAN36351.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
          Length = 1172

 Score = 42.4 bits (98), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 77/144 (53%), Gaps = 15/144 (10%)

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFE---IKDVQYVPAQ 148
           FGS      + + DID+ +  +  ++ DL   + K+ +N     +FE   IK +  V A+
Sbjct: 454 FGSCNNDIDIYNSDIDICI--YNTIQSDLIN-IRKLYKNMLHHPLFENVSIKKI--VHAK 508

Query: 149 VKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
           V I+KC  +   + VDISFNQ++ L++     +    I K+ L K  +I +K +  Y+  
Sbjct: 509 VPIIKCFFNYTKLSVDISFNQLSALTSTI---QTQTYIQKNPLIKYILIFLKMFL-YQHD 564

Query: 207 ILGAHYGLISTYALEMMVLHRFLD 230
           +  A  G IS++ + +++L +FL+
Sbjct: 565 LNDASKGGISSFNI-LLLLAQFLN 587


>gi|323346953|gb|EGA81231.1| Trf5p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 616

 Score = 42.4 bits (98), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|295671679|ref|XP_002796386.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226283366|gb|EEH38932.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 725

 Score = 42.4 bits (98), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 12/125 (9%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R ++I  +QRL    +YG ++  FGS     YLP  D+DL +L  Q + +
Sbjct: 306 VKPKSFEHLIRNDLIVRLQRLFERRHYGSQLHAFGSFASGLYLPTADMDLVLLSRQFLRQ 365

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SVQNIPVDISFNQM 168
           +      +  E     +  +  D+  VP        A+V I+K    +  + VD+SF+  
Sbjct: 366 NRKTLCQRTREIYSFTAYLKDLDIA-VPGSIETIAHAKVPIIKFVDRLTGLKVDLSFDNS 424

Query: 169 AGLSA 173
            GL+A
Sbjct: 425 TGLAA 429


>gi|115441021|ref|NP_001044790.1| Os01g0846500 [Oryza sativa Japonica Group]
 gi|56784029|dbj|BAD82657.1| unknown protein [Oryza sativa Japonica Group]
 gi|56784702|dbj|BAD81828.1| unknown protein [Oryza sativa Japonica Group]
 gi|113534321|dbj|BAF06704.1| Os01g0846500 [Oryza sativa Japonica Group]
 gi|222619532|gb|EEE55664.1| hypothetical protein OsJ_04062 [Oryza sativa Japonica Group]
          Length = 381

 Score = 42.4 bits (98), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 33/198 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPD 103
           E+  ++IL  I+P    E  R + I  IQ L +  Y      G  V PFGS   + Y   
Sbjct: 13  EKCTEDILSLIKPV---EGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKS 69

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI----------KDVQYVP-AQVKIV 152
           GD+D++V       E        I + + QD++ E+          + ++++P A+V ++
Sbjct: 70  GDLDVSV-------ELFNALNLPISKRKKQDTLREVRRALQKRGIARHMEFIPNARVPVL 122

Query: 153 K--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
           +   +   I  DIS +   G         ++ L   D  F   V+L+K W   ++ I   
Sbjct: 123 QYVSNQYGISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDP 178

Query: 211 HYGLISTYALEMMVLHRF 228
             G +++Y+L ++VL  F
Sbjct: 179 KNGTLNSYSLCLLVLFHF 196


>gi|226288571|gb|EEH44083.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 684

 Score = 42.0 bits (97), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 12/125 (9%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R ++I  +QRL    +YG ++  FGS     YLP  D+DL +L  Q + +
Sbjct: 268 VKPKSFEHLIRNDLIVRLQRLFERRHYGSQLHAFGSFASGLYLPTADMDLVLLSRQFLRQ 327

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SVQNIPVDISFNQM 168
           +      +  E     +  +  D+  VP        A+V I+K    +  + VD+SF+  
Sbjct: 328 NRKTLCQRTREIYSFTAYLKDLDIA-VPGSIETIAHAKVPIIKFVDRLTGLKVDLSFDNS 386

Query: 169 AGLSA 173
            GL+A
Sbjct: 387 TGLAA 391


>gi|323331833|gb|EGA73245.1| Trf5p [Saccharomyces cerevisiae AWRI796]
          Length = 453

 Score = 42.0 bits (97), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|344255571|gb|EGW11675.1| PAP-associated domain-containing protein 5 [Cricetulus griseus]
          Length = 382

 Score = 42.0 bits (97), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 19/123 (15%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEEDLAR 122
           R EV++ I+ +I   +   +V  FGS     YLP  DIDL V G         +EE L +
Sbjct: 2   RMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLPLWTLEEALRK 61

Query: 123 FVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKV 180
                    D+DSV ++ D   VP    I+K   S   + VDISFN   G+ A   ++  
Sbjct: 62  HKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNGVRAADLIKDF 111

Query: 181 DRL 183
            +L
Sbjct: 112 TKL 114


>gi|323335976|gb|EGA77253.1| Trf5p [Saccharomyces cerevisiae Vin13]
          Length = 576

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|56784701|dbj|BAD81827.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 408

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 75/320 (23%), Positives = 124/320 (38%), Gaps = 72/320 (22%)

Query: 85  YGIEVFPFGSVPLKTYLPDGDIDLTV------LGHQNVEEDLARFVCKILENEDQDSVFE 138
           YG    PFGS     Y   GD+D++V      +  +  ++ + R + ++L+N        
Sbjct: 7   YGAVFKPFGSFVSNLYSNSGDLDISVHLPNNSIISKKKKQYVLRELMRVLQNRGVAGY-- 64

Query: 139 IKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGL---SALCFLEKVDRLIGKDHLFKR 192
              VQ+VP A+V +++   +   I  DIS N   G       C++  +D   G       
Sbjct: 65  ---VQFVPFARVPVLQYVSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------ 115

Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
            V+LIK W   ++ I     G +++Y+L ++VL  F            C     P  +  
Sbjct: 116 MVLLIKEWAKAQN-INDPKTGTLNSYSLCLLVLFHF----------QTC----EPAILPP 160

Query: 253 LPEIVAETVEND-GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN-IVDPLKDN 310
           L EI    +E      L L+   L     +Y                +HL+ + DP++  
Sbjct: 161 LKEIYEGNIEEGIAAYLYLNSLHLHTEMTVYDE--------------EHLDEVKDPIERP 206

Query: 311 NNLGRSVSKGNFHRIRCALSYGAQRLG-----------EILTLP--GHCLGMGLEKFFI- 356
           +N  R+V      RI  A +   ++             E+L  P  G  LG+ + +  + 
Sbjct: 207 DNAARAVDLKGLERIAGAFTAANRKFASLQHAKRNDLLEMLCTPAVGSKLGIRVSQEVMA 266

Query: 357 ----NTLERNGKGLRPDVQV 372
               NT  RN    RP  QV
Sbjct: 267 NSCTNTPRRNRHHGRPTEQV 286


>gi|1050861|gb|AAC49099.1| Ynl0440p [Saccharomyces cerevisiae]
 gi|1302392|emb|CAA96217.1| TRF5 [Saccharomyces cerevisiae]
          Length = 625

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 346 CLVYSFLNMH 355


>gi|67539048|ref|XP_663298.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
 gi|40743597|gb|EAA62787.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
 gi|259484832|tpe|CBF81391.1| TPA: topoisomerase family protein TRF4, putative (AFU_orthologue;
           AFUA_7G04130) [Aspergillus nidulans FGSC A4]
          Length = 694

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 71/171 (41%), Gaps = 21/171 (12%)

Query: 15  EELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVI 74
           +E    PS T +      PP L+       L     + I      ++P       R++++
Sbjct: 228 DEWKLRPSETGTPWLSLMPPTLH-------LGTRLHDEILSFYHWVKPVRYEHIVRQDLV 280

Query: 75  NYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF---------V 124
             +Q      YYG+E+  FGS     YLP+ DIDL +L        +  F          
Sbjct: 281 ARLQAAFQSRYYGVEIHAFGSFASGLYLPNADIDLVLLSTSFRRTGVKTFGERKGQIYAF 340

Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA 173
              L+N++      ++ + +  A+V I+K    +  + VD+SF+  +GL A
Sbjct: 341 SAFLKNQNIAVPGSVETIAH--ARVPILKFVDKLTGLKVDLSFDNDSGLIA 389


>gi|345326611|ref|XP_003431063.1| PREDICTED: hypothetical protein LOC100681530 [Ornithorhynchus
           anatinus]
          Length = 411

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 6/69 (8%)

Query: 460 HPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRR 519
           H Y  QV  S FS       RG G + P+      IE++ P   R P+S NH+ V   RR
Sbjct: 209 HSY--QVFESPFSVVNFSSQRGQGVFSPQTI----IEVDSPGVCRVPKSDNHIAVEDKRR 262

Query: 520 KIDKAERLP 528
           +++++ R P
Sbjct: 263 RVEESRRAP 271


>gi|118376886|ref|XP_001021625.1| Poly polymerase central domain containing protein [Tetrahymena
           thermophila]
 gi|89303391|gb|EAS01379.1| Poly polymerase central domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 707

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 21/143 (14%)

Query: 52  RIQEILCTIQPAI-VSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV 110
           +IQEI+      I ++E+K  E + ++Q          V PFGS  L     DGD+DL  
Sbjct: 176 KIQEIVNQWMVNININERKIPEQVAHMQLA-------RVLPFGSYFLGVSSVDGDVDLVC 228

Query: 111 LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQY----VPAQVKIVKCSVQNIPVDISFN 166
           +    V+    RF  K    +  D +  +  V+Y    + A+V I++  +Q +P+DISF 
Sbjct: 229 IAPNFVD----RF--KHFNGQLYDMISNMDGVEYHNNVIDAKVPIMQFEIQGVPIDISF- 281

Query: 167 QMAGLSALCFLEKVDRLIGKDHL 189
             A L      + +D+ I   HL
Sbjct: 282 --AQLDVETLPDDIDKQIPDTHL 302


>gi|66821005|ref|XP_644037.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
 gi|60472363|gb|EAL70316.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
          Length = 1880

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 16/145 (11%)

Query: 92   FGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKILENE-DQDSVFEIKDVQYVP-A 147
            +GS     +LP  DID+ V   +N  ++   A+F+ K+LE +   D    I  +  +P A
Sbjct: 1518 YGSFMTGLWLPSSDIDIVVNYGKNMAIKPKNAQFLLKVLEKQFRTDLNGMILTMLCIPSA 1577

Query: 148  QVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAW 200
            ++ ++K  + +NI VDISF +       G++A   +   V  +IG   L+  +++L   W
Sbjct: 1578 KIPVIKLVTTENISVDISFRESPTSVHTGIAARDLVADCVREIIG---LYPLAIVL--KW 1632

Query: 201  CYYESRILGAHYGLISTYALEMMVL 225
               E  +   + G +S+Y L +M++
Sbjct: 1633 FLRERGLNNTYTGGLSSYCLVLMLI 1657


>gi|157133264|ref|XP_001656207.1| sigma DNA polymerase, putative [Aedes aegypti]
 gi|108870912|gb|EAT35137.1| AAEL012692-PA [Aedes aegypti]
          Length = 491

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 81/168 (48%), Gaps = 17/168 (10%)

Query: 62  PAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL 120
           P       R +V+  I++++ N +    V  FGS     YLP  DIDL V+G   + E L
Sbjct: 3   PTQTEHALRVQVVARIEQIVLNLWPAARVEMFGSFRTGLYLPTSDIDLVVIG---LWEKL 59

Query: 121 ARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCF 176
                + LENE     + E   V+ +  A V IVK + +   + VDISFN  +G+ +   
Sbjct: 60  P---LRTLENELINRGIAEPMSVRVLDKASVPIVKLTDRETQVKVDISFNMQSGVQSAEL 116

Query: 177 LEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMM 223
           ++   R   K  +  + V+++K +     R L   + G IS+Y+L +M
Sbjct: 117 IKDFKR---KYPVLAKLVLVLKQFLL--QRDLNEVFTGGISSYSLILM 159


>gi|189204442|ref|XP_001938556.1| PAP-associated domain containing protein 5 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187985655|gb|EDU51143.1| PAP-associated domain containing protein 5 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 599

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 60/269 (22%), Positives = 108/269 (40%), Gaps = 47/269 (17%)

Query: 15  EELLTSPSHTSSSSSYSSPPNLY-PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEV 73
           ++ L  P+ TS+   + S  ++Y   S + + WL  E  I +    + P      +R  +
Sbjct: 224 DDWLPLPNQTSTP--WLSAEDVYVHLSGEPDKWLHNE--ILDFYDFVAPKAYEHDQRNRL 279

Query: 74  INYIQRLIN----GYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILE 129
           +N I   +      +    +  FGS P+  YLP  D+DL  +  ++        V  + +
Sbjct: 280 VNRINSALGQRRFAHDNGRLLCFGSFPVGLYLPTADLDLVYVSDRHYRG--GEPVIDVSD 337

Query: 130 NEDQDSVF-----EIKD-------VQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-L 174
            +    +       +KD        Q + A+V I+K    +  + VDISF  ++G+ A  
Sbjct: 338 RKASKGILFKAARRLKDTDIPQGYAQVINAKVPIIKFQDKLTKLQVDISFENLSGVQAQA 397

Query: 175 CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-------- 226
            F+E   +    D ++   V L+K +      +   H G I  Y++  +++         
Sbjct: 398 TFVEWKAKY--PDMIYM--VALMKQFLVMRG-LNEVHTGGIGGYSIICLIVSYLQLEKKP 452

Query: 227 --------RFLDYYNTFDWDNYCISINGP 247
                    FL YY  FD   Y I +N P
Sbjct: 453 ENLGECFLGFLKYYGNFDLARYRIQMNPP 481


>gi|156057775|ref|XP_001594811.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980]
 gi|154702404|gb|EDO02143.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 794

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 10/137 (7%)

Query: 45  LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
            WL  E  I +    ++P    E  R+ ++  ++     YY   ++  FGS P   YLP 
Sbjct: 459 FWLHKE--IMDFYHYVKPRKFEEVIRQRLLVNLRENFRKYYRDADILAFGSFPAGLYLPT 516

Query: 104 GDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
            D+D+ ++ +Q ++    ++     + ++  N ++D +   +    V A+V + K     
Sbjct: 517 ADMDVVMVSNQYMDGGYPKYNSKSHIWRVKGNVEKDRLNTGRIEVIVGAKVPLAKWVDGF 576

Query: 157 QNIPVDISFNQMAGLSA 173
             + VD+SF    GL A
Sbjct: 577 TGLKVDMSFENDTGLIA 593


>gi|115673160|ref|XP_796681.2| PREDICTED: uncharacterized protein LOC592046 [Strongylocentrotus
           purpuratus]
          Length = 830

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 54/115 (46%), Gaps = 11/115 (9%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
           R+EV+  IQ ++   +   +V  +GS     YLP  DIDL + G      D+       L
Sbjct: 156 RREVVQRIQGIVRSIWPKAKVEIYGSTRTMLYLPTSDIDLVLFG------DIGESPFFRL 209

Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK 179
            NE ++  + E   ++ +  A V IVK +  V  + VDISFN   G      +E+
Sbjct: 210 GNELEKSGIAEQGSIKVLDKASVPIVKLTDNVTKVRVDISFNMQTGTDCAKLIEE 264


>gi|356569346|ref|XP_003552863.1| PREDICTED: poly(A) RNA polymerase cid11-like [Glycine max]
          Length = 415

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 86/192 (44%), Gaps = 26/192 (13%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
           + +IL  + P     + R  +IN ++ ++       G  V PFGS     +   GD+D++
Sbjct: 11  VNDILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNLFTRWGDLDIS 70

Query: 110 V----------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSV-- 156
           +           G +  +  L   V K L  +   S     ++Q++  A+V I+K     
Sbjct: 71  IELSNGLHISSAGKKQKQTFLGD-VLKALRMKGGGS-----NLQFISNARVPILKFKSYR 124

Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
           Q +  DIS N + G      L  ++++ G+   F+  V+L+K W     +I  +  G  +
Sbjct: 125 QGVSCDISINNLPGQMKSKILLWINKIDGR---FRHMVLLVKEWAKAH-KINNSKAGTFN 180

Query: 217 TYALEMMVLHRF 228
           +Y+L ++V+  F
Sbjct: 181 SYSLSLLVIFYF 192


>gi|21240658|gb|AAM44369.1| putative protein contains similarity to topoisomerase-related
            function protein [Dictyostelium discoideum]
          Length = 1880

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 16/145 (11%)

Query: 92   FGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKILENE-DQDSVFEIKDVQYVP-A 147
            +GS     +LP  DID+ V   +N  ++   A+F+ K+LE +   D    I  +  +P A
Sbjct: 1518 YGSFMTGLWLPSSDIDIVVNYGKNMAIKPKNAQFLLKVLEKQFRTDLNGMILTMLCIPSA 1577

Query: 148  QVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAW 200
            ++ ++K  + +NI VDISF +       G++A   +   V  +IG   L+  +++L   W
Sbjct: 1578 KIPVIKLVTTENISVDISFRESPTSVHTGIAARDLVADCVREIIG---LYPLAIVL--KW 1632

Query: 201  CYYESRILGAHYGLISTYALEMMVL 225
               E  +   + G +S+Y L +M++
Sbjct: 1633 FLRERGLNNTYTGGLSSYCLVLMLI 1657


>gi|402592089|gb|EJW86018.1| hypothetical protein WUBG_03069 [Wuchereria bancrofti]
          Length = 360

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 37/203 (18%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDI 106
           L+ EE IQ     +Q      + R  ++  ++R++    G  + P GS          DI
Sbjct: 71  LIREEAIQRHNSYLQTL----QYRNSIVVGLRRVLARNRGQYLVPVGSTASGLASGISDI 126

Query: 107 DLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIK-----------------DVQYVPA 147
           DL  L    +   E L R + K  EN  +  + E+K                 D+ +  A
Sbjct: 127 DLVYLSTTDEKQRETLLRLLSK--ENYRRTFMTEVKTRIEESSLCKEFDWSQTDIIH-TA 183

Query: 148 QVKIVKCSVQ-NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC--YYE 204
           +V I+    Q ++ VDI F + A +   CF+    +       + + V L+  W   + E
Sbjct: 184 RVPILHLQTQKHMQVDIQFEKYASIRNTCFVRHCAQ-------YDKRVALLNMWAQKWLE 236

Query: 205 S-RILGAHYGLISTYALEMMVLH 226
           S R+  + +GL STY + M+VLH
Sbjct: 237 SQRLKDSKHGLFSTYHVLMLVLH 259


>gi|402583139|gb|EJW77083.1| PAP/25A associated domain-containing protein, partial [Wuchereria
           bancrofti]
          Length = 416

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 131/295 (44%), Gaps = 20/295 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E + ++   ++P+ +    R  V   ++ ++   +   ++  FGS+    +LP  DID+
Sbjct: 2   HEELLDLYAWLKPSPLERALRLRVFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDIDV 61

Query: 109 TVLGHQNVEE-DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
            V      EE  L +    + E+   +S+  + D  +VP  VK+V    + I +DISFN 
Sbjct: 62  VVESDLVSEEPPLWKTAIALKESGITESI-NVLDKAFVPI-VKMVDKDTK-IYLDISFNT 118

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
           + G+ +  F+E +     +  + +  V+++K +   + ++     G +S+Y L +M++  
Sbjct: 119 VQGVRSAKFIEDMKM---RYPVLEPLVLVLKQFL-MQRQLNQVFTGGLSSYGLILMLIS- 173

Query: 228 FLDYYNTFDWDNYCIS-INGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPI 286
           FL  + ++D+    I+ +N  V + +  ++  +        L +       C++   V +
Sbjct: 174 FLQLHPSYDYSYKRITEVNMGVLLLNFLQLYGQEFNYMKTALRIHSGGAYVCKDEILVQM 233

Query: 287 TALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
               N+        L I DPL+  N++GR     N   +R A  +    L  +  
Sbjct: 234 NRPSNS-------MLCIEDPLQPGNDIGR--CSHNIQLVRQAFEHAFATLCAVFV 279


>gi|1493831|gb|AAC49397.1| Trf5p [Saccharomyces cerevisiae]
          Length = 625

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 85/188 (45%), Gaps = 12/188 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345

Query: 223 MVLHRFLD 230
            +++ FL+
Sbjct: 346 CLVYSFLN 353


>gi|357620388|gb|EHJ72600.1| hypothetical protein KGM_01329 [Danaus plexippus]
          Length = 564

 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 76/309 (24%), Positives = 123/309 (39%), Gaps = 68/309 (22%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I+     + P+      R  V+  I+  I   +    V  FGS     YLP  DIDL
Sbjct: 115 HEEIEHFYMYMSPSETEHLVRTTVVTRIRSAILSLWPQARVEVFGSFRTGLYLPTSDIDL 174

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIP 160
            V+G         +E +L   V + +  +D   V E        A V IVK +    ++ 
Sbjct: 175 VVIGQWEKLPLWTLEREL---VAQDIAEQDSIKVLE-------KATVPIVKMTDKYSDVK 224

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
           VDISFN  +G+ +   +++      +     R V+++K +   +  +     G IS+Y+L
Sbjct: 225 VDISFNMSSGVKSAELIKQFKEQYPE---LSRLVMVLKQFL-LQRDLNEVFTGGISSYSL 280

Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
            +M                 CIS    + +   PE + +   N G  L+   EF +    
Sbjct: 281 ILM-----------------CISF---LQLHPRPERLRQR-HNLGVLLI---EFFELYGR 316

Query: 281 IYSVPITALE-----------------NTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
            ++   TA+                  N GH  ++  L I DPL   N++GRS S G   
Sbjct: 317 KFNYVKTAIRVKNGGSYVSKDEISKEMNDGHRPSL--LCIEDPLTPGNDIGRS-SYGAI- 372

Query: 324 RIRCALSYG 332
           +++ A  YG
Sbjct: 373 QVKQAFDYG 381


>gi|392598007|gb|EIW87329.1| Nucleotidyltransferase [Coniophora puteana RWD-64-598 SS2]
          Length = 383

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 74/167 (44%), Gaps = 15/167 (8%)

Query: 67  EKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTV-LGHQNVEEDLARFV 124
           E  R++V   + R++ N +   EV  FGS      LP GD+D+ + L H +   +L   +
Sbjct: 91  EYMREQVRALVHRILRNRFRRSEVNVFGSAAQGLSLPGGDVDIVLSLPHADSRSELKTSL 150

Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP------VDISFNQMAGLSALCFLE 178
            ++     +  V    +V++   Q +I   S Q +P       DI  N   GL AL  +E
Sbjct: 151 FQLAAMLKRTGVASDVEVRH---QARIPIISFQTVPELCTLRFDIGVNNEDGLKALPIVE 207

Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
           K      +DH   + ++L+         +  A  G I+++ L  MV+
Sbjct: 208 KY----MQDHPVLKPLMLVLKSFLSRRGLSSASNGGINSFVLLCMVV 250


>gi|350638915|gb|EHA27270.1| hypothetical protein ASPNIDRAFT_44704 [Aspergillus niger ATCC 1015]
          Length = 682

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R++++  +Q      YYG+++ PFGS     YLP  DIDL +L    +  
Sbjct: 272 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 331

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L+N +      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 332 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 389

Query: 168 MAGLSA 173
            +GL A
Sbjct: 390 DSGLVA 395


>gi|317034217|ref|XP_001396205.2| topoisomerase family protein TRF4 [Aspergillus niger CBS 513.88]
          Length = 682

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R++++  +Q      YYG+++ PFGS     YLP  DIDL +L    +  
Sbjct: 272 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 331

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L+N +      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 332 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 389

Query: 168 MAGLSA 173
            +GL A
Sbjct: 390 DSGLVA 395


>gi|328713984|ref|XP_001948412.2| PREDICTED: PAP-associated domain-containing protein 5-like
           [Acyrthosiphon pisum]
          Length = 595

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 43/179 (24%), Positives = 81/179 (45%), Gaps = 12/179 (6%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            + I++    +         R +V+  +  +I+  +   +V  FGS     YLP  D+DL
Sbjct: 119 HKEIEDFFAYMSATPEEHSMRLDVLERVTNVIHAEWPKAKVEVFGSFRTGLYLPTSDMDL 178

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFN 166
            V+G  +    L      +L+N   +   ++ D   VP    IVK +    ++ VDISFN
Sbjct: 179 VVIGEWDTLP-LHSLAQALLKNSVCNENIQVLDKASVP----IVKMTDKATDVRVDISFN 233

Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
              G+ +    E +   +    +  + V+++K +   +  +    YG IS+Y+L +MV+
Sbjct: 234 MNNGVKS---AEMIKHYMDVFPMLPKLVLVLKQFL-AQRDMNEVFYGGISSYSLILMVV 288


>gi|312384741|gb|EFR29395.1| hypothetical protein AND_01692 [Anopheles darlingi]
          Length = 1101

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 45/181 (24%), Positives = 81/181 (44%), Gaps = 15/181 (8%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDL 108
            + I +    + P       R  V+N I++++ N +    V  FGS     YLP  DIDL
Sbjct: 272 HQEIDQFYNHMIPTPTEHALRVMVVNRIEQIVLNLWPSARVEMFGSFRTGLYLPTSDIDL 331

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDS-VFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
            V+G         +   + LE E  +  + E   V+ +  A V IVK + +   + VDIS
Sbjct: 332 VVIGQ------WEKLPLRTLEMELINRYIAEPNSVRVLDKASVPIVKLTDRETQVKVDIS 385

Query: 165 FNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV 224
           FN  +G+ +   +++      +D+     ++L+      +  +     G IS+Y+L +M 
Sbjct: 386 FNMESGVQSAKLIKEYK----QDYPVLEKLVLVLKQFLLQRDLNEVFTGGISSYSLILMC 441

Query: 225 L 225
           +
Sbjct: 442 I 442


>gi|83765069|dbj|BAE55212.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391870459|gb|EIT79642.1| DNA polymerase sigma [Aspergillus oryzae 3.042]
          Length = 390

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I  +Q      YYG+++  FGS     YLP  DIDL +L    +  
Sbjct: 74  VKPVEYEQIVRADLIERLQTAFQSRYYGVQLRAFGSFASGLYLPTADIDLVLLSSNFMRH 133

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L N D      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 134 GIKTFGERKGQIYAFSAFLRNLDIAVPGSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 191

Query: 168 MAGLSA 173
            +GL A
Sbjct: 192 DSGLVA 197


>gi|328872412|gb|EGG20779.1| hypothetical protein DFA_00644 [Dictyostelium fasciculatum]
          Length = 1767

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 37/149 (24%), Positives = 78/149 (52%), Gaps = 16/149 (10%)

Query: 88   EVFPFGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKILENEDQDSV--FEIKDVQ 143
            E+  +GS     +LP  DID+ V  ++N  +    A+F+ ++LE + +  +  F +  + 
Sbjct: 1420 EIDIYGSFMTGLWLPSSDIDIVVDYNKNMAIRPKNAQFLLEVLEKQIRHDLNSFILTMLC 1479

Query: 144  YVPAQVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFL-EKVDRLIGKDHLFKRSVIL 196
               A++ ++K  + +NI VDISF +       G++A   + + V ++IG   L+  +++L
Sbjct: 1480 IPSAKIPVIKLVTKENIAVDISFRESPTSPHTGIAARDLVADNVKKVIG---LYPLAIVL 1536

Query: 197  IKAWCYYESRILGAHYGLISTYALEMMVL 225
               W   +  +   + G +S+Y L +M++
Sbjct: 1537 --KWFLRDRGLNNTYSGGLSSYCLVLMLI 1563


>gi|449461393|ref|XP_004148426.1| PREDICTED: poly(A) polymerase-like [Cucumis sativus]
          Length = 863

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 59/267 (22%), Positives = 105/267 (39%), Gaps = 63/267 (23%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVI-NYIQRLINGYYGIEVFPF 92
           L  F +D+EL+   EE  + +E+L  I   + S  K+  ++  Y ++++     + +F F
Sbjct: 42  LEKFLVDSELYESKEEAAKREEVLGRIDQIVKSWVKQLTLLRGYTEQMVEDANAV-IFTF 100

Query: 93  GSVPLKTYLPDGDIDLTVLGHQNV-EEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
           GS  L  + P  DID   +G   V  ED    +  IL   +     E+ D+Q VP A V 
Sbjct: 101 GSYRLGVHGPGSDIDTLCVGPSYVNREDFFIILHNILAEME-----EVTDLQPVPDAHVP 155

Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVD----------------------------R 182
           +++     I +D+ +   A +S L   E +D                            R
Sbjct: 156 VMRFKFLGISIDLLY---ASISLLVVPEDLDISHGSVLYNVDEQTVRSLNGCRVADQILR 212

Query: 183 LIGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTYALEM-------MVLHRF 228
           L+     F+ ++  +K W             + G ++ L+     ++       M++ RF
Sbjct: 213 LVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVAQVCQLYPNAIPSMLVSRF 272

Query: 229 LDYYNTFDWDNYCISINGPVAISSLPE 255
              Y  + W N       PV + S+ E
Sbjct: 273 FRVYTQWRWPN-------PVMLCSIEE 292


>gi|218189366|gb|EEC71793.1| hypothetical protein OsI_04418 [Oryza sativa Indica Group]
          Length = 381

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 33/198 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPD 103
           E+  ++IL  I+P    E  R + I  IQ L +  Y      G  V PFGS   + Y   
Sbjct: 13  EKCTEDILSLIKPV---EGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKS 69

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI----------KDVQYVP-AQVKIV 152
           GD+D++V       E        I + + QD++ E+          + ++++P A+V ++
Sbjct: 70  GDLDVSV-------ELFNALNLPISKRKKQDTLREVRRALQKRGIARHMEFIPNARVPVL 122

Query: 153 K--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
           +   +   I  DIS +   G         ++ L   D  F   V+L+K W   ++ I   
Sbjct: 123 QYVSNQYGISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDP 178

Query: 211 HYGLISTYALEMMVLHRF 228
             G +++Y+L ++VL  F
Sbjct: 179 KNGTLNSYSLCLLVLCHF 196


>gi|405977000|gb|EKC41473.1| Poly(A) polymerase gamma [Crassostrea gigas]
          Length = 264

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/190 (22%), Positives = 84/190 (44%), Gaps = 38/190 (20%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV 110
           E +  I   ++  I    ++K  INY++  IN + G +V  FGS  L  +    DID   
Sbjct: 51  EILNRINTLVRKWIKDVSRQKSEINYLEYKINAFGG-KVCIFGSYRLGAHTKGADIDTVC 109

Query: 111 L--GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFNQ 167
           +  GH +  +  + FV ++L++       E+KD++ +  A V ++K +   I +D+ F +
Sbjct: 110 IAPGHVDRSDFFSSFV-EVLKHTP-----EVKDLRAIEEAYVPLIKMTFDGIELDMVFAR 163

Query: 168 MA-------------------------GLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
           +                           L+     +++  L+   H F+R+++ IK W  
Sbjct: 164 LTLPQIPDEIDLRDESLLRHLNTKCVRSLNGCRVTDEILHLVPNQHNFRRTLLAIKIWAK 223

Query: 203 YE---SRILG 209
            +   S++LG
Sbjct: 224 KKGIYSKVLG 233


>gi|242051340|ref|XP_002463414.1| hypothetical protein SORBIDRAFT_02g043400 [Sorghum bicolor]
 gi|241926791|gb|EER99935.1| hypothetical protein SORBIDRAFT_02g043400 [Sorghum bicolor]
          Length = 460

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/171 (23%), Positives = 73/171 (42%), Gaps = 22/171 (12%)

Query: 62  PAIVSEKKRKEVINYIQRLI--------------NGYYGIEVFPFGSVPLKTYLPDGDID 107
           P+   EKKR++VI  +++++               G     V  +GS  L  + P+ DID
Sbjct: 102 PSPEDEKKREKVIRELKKIVMHWANAVAYEQSVPQGLATATVLTYGSYTLGAHGPESDID 161

Query: 108 LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
           +  +G          FV      E +  V E++ ++   A+V +++     I VD ++ Q
Sbjct: 162 VLCVGPCIATLQYHFFVVLRQLLEGRPEVSELQTIE--KAKVPLMRFRFTGIAVDFTYAQ 219

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           +  + AL   + + R I    L+ R   L    C+Y     G H  +++ Y
Sbjct: 220 LPVIDALKKFQALLRCI---KLWARKRGLH---CHYLGFFAGIHLAILAAY 264


>gi|170593787|ref|XP_001901645.1| hypothetical protein Bm1_50885 [Brugia malayi]
 gi|158590589|gb|EDP29204.1| hypothetical protein Bm1_50885 [Brugia malayi]
          Length = 380

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 37/203 (18%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDI 106
           L+ EE IQ     +Q      + R  ++  ++R++    G  + P GS          DI
Sbjct: 71  LIREEAIQRHNSYLQTL----QYRNSIVVGLRRVLARNRGQYLVPVGSTASGLASGISDI 126

Query: 107 DLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIK-----------------DVQYVPA 147
           DL  L    +   E L R + K  EN  +  + E+K                 D+ +  A
Sbjct: 127 DLVYLSTTDEKQRETLLRLLSK--ENYRRTFMTEVKTRIEESSLCKEFDWSQTDIIHT-A 183

Query: 148 QVKIVKCSVQ-NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC--YYE 204
           +V I+    Q ++ VDI F + A +   CF+    +       + + V L+  W   + E
Sbjct: 184 RVPILHLQTQKHMQVDIQFEKYASIRNTCFVRHCAQ-------YDKRVALLNMWAQKWLE 236

Query: 205 S-RILGAHYGLISTYALEMMVLH 226
           S R+  + +GL STY + M+VLH
Sbjct: 237 SQRLKDSKHGLFSTYHVLMLVLH 259


>gi|124802317|ref|XP_001347437.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
 gi|23495017|gb|AAN35350.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
          Length = 615

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/179 (24%), Positives = 83/179 (46%), Gaps = 20/179 (11%)

Query: 59  TIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT----VLGH 113
           +++P+I      K  +N++Q  IN +Y    V PFGSV    ++ + DID+     +L +
Sbjct: 289 SLKPSINDINNMKIFLNFLQNEINKHYKNSYVTPFGSVINGFWMRNSDIDICIQIPILLN 348

Query: 114 QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN------IPVDISFNQ 167
           +  +    + +C +L N + + V E    Q   A+V I+     N      +  DIS N 
Sbjct: 349 RKDQITFLKKICLLLNNFN-NGVIE----QRFSAKVPIIHFYCNNREKSFELSCDISVNN 403

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +  +     ++K    +  D   +   I++K W    + I     G +S+++L +M++H
Sbjct: 404 ILAVINSKLIQKY---VAIDKRLQTMGIVLKYWSKIRN-INDRSKGFLSSFSLILMIIH 458


>gi|358373036|dbj|GAA89636.1| topoisomerase family protein Trf4 [Aspergillus kawachii IFO 4308]
          Length = 682

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R++++  +Q      YYG+++ PFGS     YLP  DIDL +L    +  
Sbjct: 272 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 331

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L+N +      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 332 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 389

Query: 168 MAGLSA 173
            +GL A
Sbjct: 390 DSGLVA 395


>gi|323352825|gb|EGA85127.1| Trf5p [Saccharomyces cerevisiae VL3]
          Length = 510

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL +E  I++ +  I P+    K R   I+ ++R +   +   ++  FGS     YLP  
Sbjct: 141 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 198

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  V      +ED      + + L+N+      E+     VP  +K ++   Q + +D
Sbjct: 199 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 256

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           +SF +  GL A     K+ R   +D    R ++LI     +  R+   H G +  + + +
Sbjct: 257 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 311

Query: 223 MVLHRFLDYY 232
            +++ FL+ +
Sbjct: 312 CLVYSFLNMH 321


>gi|395783917|ref|ZP_10463765.1| ribonuclease P protein component [Bartonella melophagi K-2C]
 gi|395425185|gb|EJF91355.1| ribonuclease P protein component [Bartonella melophagi K-2C]
          Length = 123

 Score = 41.2 bits (95), Expect = 1.9,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 460 HPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRAR-----NPESLNHVPV 514
           HP   + RA   +   GEK RG    +  + R   +EI  P+ AR       ++ N V  
Sbjct: 5   HPRRIRKRADFLAVRTGEKRRGPLFLLEIKPRKQTVEIKHPLVARVGFTVTRKNGNAVKR 64

Query: 515 SKLRRKIDKAERLPSRDNLESG 536
           ++++R++ +A R+  +DN+E+G
Sbjct: 65  NRIKRRLREAVRVGIKDNIEAG 86


>gi|296821440|ref|XP_002850129.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
 gi|238837683|gb|EEQ27345.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
          Length = 620

 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 14/139 (10%)

Query: 48  LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
           +  + I +    ++P    +  R+++I+ +Q      +YG +++ FGS     YLP  D+
Sbjct: 265 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQSCFGQRFYGSKLYAFGSFASGLYLPTADM 324

Query: 107 DLTVLGHQNVEEDLARFVCKI---------LENEDQDSVFEIKDVQYVPAQVKIVKC--S 155
           DL +L  Q +  +      K+         L      +   I+ + +  A+V I+K    
Sbjct: 325 DLVLLSRQFITSNRKTICQKVRDIYSFAGYLRGLGIAAPGSIETIAH--ARVPIIKFVDG 382

Query: 156 VQNIPVDISFNQMAGLSAL 174
           +  + VD+SF+  +GL+A+
Sbjct: 383 LTGLKVDLSFDNNSGLAAI 401


>gi|158296263|ref|XP_316693.3| AGAP006659-PA [Anopheles gambiae str. PEST]
 gi|157016427|gb|EAA11489.4| AGAP006659-PA [Anopheles gambiae str. PEST]
          Length = 703

 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 36/142 (25%), Positives = 68/142 (47%), Gaps = 12/142 (8%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTV--LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP 146
           ++ FGS+     L D D+D  +     +   E+  + +  +    D+D+ F  K V+ + 
Sbjct: 117 IYEFGSIKSGLLLKDSDLDFYIHYAREKTEREEQIKLIHVVCSRMDRDTSFTGK-VKILG 175

Query: 147 AQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWCYY 203
           A+V +++      N+  DI+F+   G    C+  K    I K D    +  +++K W   
Sbjct: 176 AKVPLLRAVHVRTNLQCDINFSNARG----CYNSKFIHAIMKFDERIHQLTVMVKFWAQC 231

Query: 204 ESRILGAHYGLISTYALEMMVL 225
            + IL AH+ + ++Y L MMV+
Sbjct: 232 -AHILTAHHQM-NSYCLIMMVI 251


>gi|345491496|ref|XP_001605928.2| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 1
           [Nasonia vitripennis]
 gi|345491498|ref|XP_003426625.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 2
           [Nasonia vitripennis]
          Length = 683

 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 52/90 (57%), Gaps = 6/90 (6%)

Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           +++++ + A+V I+K   SV N+ VD+++N + G+     L    R+   D   +  V++
Sbjct: 428 VRELELIQAKVPILKIHDSVYNLDVDLNYNNVVGIRNTHLLYCYSRI---DWRVRPLVLV 484

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +K W   ++ I  A +  +S+Y+L +MV+H
Sbjct: 485 VKMWAQCQN-INNARHMTMSSYSLVLMVIH 513


>gi|134080951|emb|CAK41466.1| unnamed protein product [Aspergillus niger]
          Length = 614

 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R++++  +Q      YYG+++ PFGS     YLP  DIDL +L    +  
Sbjct: 204 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 263

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L+N +      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 264 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 321

Query: 168 MAGLSA 173
            +GL A
Sbjct: 322 DSGLVA 327


>gi|324975487|gb|ADY62671.1| PAPa [Candida metapsilosis]
          Length = 547

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 45/226 (19%), Positives = 87/226 (38%), Gaps = 55/226 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + KKR EV++  Q L+  +                  G ++F FGS  L  Y P  DID 
Sbjct: 42  ATKKRAEVLSLFQTLVQEFVYEVSKSKNMSDGMSRDAGGKIFTFGSYRLGVYGPSSDIDA 101

Query: 108 LTVLGHQNVEEDLARFVCKILE--------NEDQDSVFEIKDVQY--VPAQVKIVKCSVQ 157
           L V+      ED      +I+         N   D+   I  +++  +   + + + +V 
Sbjct: 102 LVVVPRHVTREDFFTTFDQIIRKRPELQEINGVSDAFVPIIKLEFDGISLDLIMARLNVP 161

Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
            +P+D++ +            +  L+     +++ +L+ K  +FK ++  IK W   E  
Sbjct: 162 RVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVPKPGVFKHALRCIKLWA-QERA 220

Query: 207 ILGAHYGLISTYALEMM---------------VLHRFLDYYNTFDW 237
           + G  +G     A  M+               ++ +F   Y  ++W
Sbjct: 221 VYGNVFGFPGGVAWAMLTARICQLYPNAVSSVIVEKFFSIYTKWNW 266


>gi|125538688|gb|EAY85083.1| hypothetical protein OsI_06439 [Oryza sativa Indica Group]
          Length = 683

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVIN--YIQRLINGYYGIEVFP 91
           L  F ++A L+   EE  + +E+L  I   IV E  +K  I   Y ++++     + +F 
Sbjct: 44  LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 101

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
           FGS  L  + P  DID   +G   V+ +   FV       + +   E+ ++Q VP A V 
Sbjct: 102 FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVTLYGALSEME---EVTELQPVPDAHVP 158

Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
           ++K   + +P+D+ +             ++  S LC                +++ RL+ 
Sbjct: 159 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 218

Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
              +F++++  +K W             + G  + L+     ++       M++ RF   
Sbjct: 219 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 278

Query: 232 YNTFDWDN 239
           +  + W N
Sbjct: 279 FTQWQWPN 286


>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
 gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
          Length = 531

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 45/194 (23%), Positives = 87/194 (44%), Gaps = 20/194 (10%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  I P+    + R + I  ++  +   + G ++  FGS     YLP  
Sbjct: 100 WLTLE--IRDFVAYISPSRQEIELRNKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGS 157

Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
           DID  +      +E+ +    +   L+N    +  E+     VP  +K V+ + Q I VD
Sbjct: 158 DIDCVINSKTGDKENRSSLYELAHFLKNRKLATQVEVIAKARVPI-IKFVEPTSQ-IHVD 215

Query: 163 ISFNQMAGLSAL----CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           +SF +  GL A      +L++   L        R ++LI     +  R+   H G +  +
Sbjct: 216 VSFERTNGLEAAKLIRSWLQQTPGL--------RELVLIVKQFLHARRLNNVHTGGLGGF 267

Query: 219 ALEMMVLHRFLDYY 232
           ++ + +++ FL+ +
Sbjct: 268 SI-ICLVYAFLNLH 280


>gi|146086153|ref|XP_001465471.1| RNA polymerase II [Leishmania infantum JPCM5]
 gi|134069569|emb|CAM67892.1| RNA polymerase II [Leishmania infantum JPCM5]
          Length = 382

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 72/191 (37%), Gaps = 20/191 (10%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI--------K 140
           V P+GS+   T   DGD D  V      E       C I     +  + +I         
Sbjct: 76  VLPYGSIVSGTSFTDGDADYIVSFPIVSESTRQSGACVIARERQEKCLSDIFSHIRKCNS 135

Query: 141 DVQYVP-----AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
           DV+  P     A+V IV+   ++      F+    L  L     +   +  D   +  V+
Sbjct: 136 DVELHPQRIFRARVPIVQYVRKSAQESTKFDLSLSLDGLKNSLLLRHYMAGDPRLRLGVL 195

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPE 255
             K W   E +IL A  G IS YAL +M +H   D   T       +S +       +  
Sbjct: 196 GAKQWG-REQQILNARRGWISPYALSIMYIHFMKDTGRT------ALSFDEEAVSQRVNA 248

Query: 256 IVAETVENDGD 266
           IV+   E++GD
Sbjct: 249 IVSTAAESEGD 259


>gi|325183687|emb|CCA18146.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 737

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 77/163 (47%), Gaps = 19/163 (11%)

Query: 73  VINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE--DLARFVCK---- 126
           V   +Q L   ++   +  FGS     +LP  D+DL +L   NV E  DL   V      
Sbjct: 220 VTKSVQEL---WHDSTIETFGSYSTGLWLPTSDVDLVIL---NVLENCDLRTTVANLGKL 273

Query: 127 --ILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLI 184
             +L  +D  +   + +   VP  +K+V   V +IP+DISF   +  S L   + V   +
Sbjct: 274 ADVLSQQDWVASIVVLETAKVPV-IKLVSKDV-SIPIDISFETSSTHSGLLARDLVSLYV 331

Query: 185 GK-DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
            +   L+  ++I+ +     E  +  A+ G +S+Y++ +M++H
Sbjct: 332 KELPELYPAAIIMKQ--LLRERDLNDAYTGGLSSYSIVLMLIH 372


>gi|350415058|ref|XP_003490519.1| PREDICTED: PAP-associated domain-containing protein 5-like [Bombus
           impatiens]
          Length = 572

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 70/283 (24%), Positives = 119/283 (42%), Gaps = 46/283 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDL 108
            E I++    + P+      R  V+  I+++I + +   +V  FGS     YLP  DIDL
Sbjct: 122 HEEIEDFFAYMCPSNEEHSLRMRVVKRIEQVIYDLWQDSKVEVFGSFRTGLYLPTSDIDL 181

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
            V+G       + +E  L      + +N  + S  ++ D   VP  VK+     + I VD
Sbjct: 182 VVIGMWTNLPLRTLERAL------LDQNIAEPSSIKVLDKASVPI-VKLTDKETE-IKVD 233

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
           ISFN   G+ +    E ++    +  + ++ V+++K   +   R L   + G IS+Y+L 
Sbjct: 234 ISFNMNNGVKS---AELINSFKKRFPVLEKLVMVLKQ--FLLQRDLNEVFTGGISSYSLI 288

Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPE 273
           +M +  FL  +     + YC S N  V +    E+               DG   +   E
Sbjct: 289 LMTI-SFLQLHPR--QNAYCSSANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEE 345

Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
             +   +            GH  ++  L I DPL   N++GRS
Sbjct: 346 VQRDMID------------GHRPSL--LCIEDPLTPGNDIGRS 374


>gi|169598326|ref|XP_001792586.1| hypothetical protein SNOG_01964 [Phaeosphaeria nodorum SN15]
 gi|160704377|gb|EAT90176.2| hypothetical protein SNOG_01964 [Phaeosphaeria nodorum SN15]
          Length = 575

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 65/141 (46%), Gaps = 5/141 (3%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDI 106
           L   +R +E+L  +Q  +    +R   +  I +      G ++F FGS  L  + P  DI
Sbjct: 44  LEGNKRREEVLAHVQKVVEEFVRRAGKLKGIAQSTLDVAGGKIFFFGSYALGVHGPTSDI 103

Query: 107 D--LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDIS 164
           D  + V  H  +E+    F     E  + + + E+  V+   A V I+K   + + +D+ 
Sbjct: 104 DTLIVVPKHVFIEDFFKIFPSIFRELSNAEDISELVPVE--DAYVPIIKMEYRGVSLDLL 161

Query: 165 FNQMAGLSAL-CFLEKVDRLI 184
           F  M  +S++   LE VD+ +
Sbjct: 162 FASMPTMSSVPKELETVDKSV 182


>gi|320583303|gb|EFW97518.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Ogataea
           parapolymorpha DL-1]
          Length = 538

 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 5/142 (3%)

Query: 35  NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFG 93
           N +   I+   WL  E  I++ +  I P+    K R   +  ++  I G +   EV  FG
Sbjct: 100 NDHSRQIEISDWLTME--IKDFIKYISPSAEEIKARNNTVGKLRDCITGMWPDAEVHCFG 157

Query: 94  SVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK 153
           S     YLP  DID+ V+      +   R     L +  ++    +       A+V I+K
Sbjct: 158 SFATDLYLPGSDIDMVVVSKARNNKYDNRSSLYQLSSYIRNHRLGVNVEAIAKAKVPIIK 217

Query: 154 C--SVQNIPVDISFNQMAGLSA 173
                  I +DISF +  G+ A
Sbjct: 218 FVDPATKIHIDISFERTNGIKA 239


>gi|198426610|ref|XP_002126682.1| PREDICTED: similar to PAP associated domain containing 4 [Ciona
           intestinalis]
          Length = 713

 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 43/168 (25%), Positives = 75/168 (44%), Gaps = 22/168 (13%)

Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCS--VQN 158
           + D DL ++     +    + V K+L    Q  ++   +K++Q + A V I+K    +  
Sbjct: 440 NSDADLCLVFDPRNKTVNRKTVLKMLNRMKQLLNNAHFVKNLQLIYATVPILKFEDRISG 499

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           +  D++ N + G+     L    R    D   +  V+ IK W +  + I  A  G +S+Y
Sbjct: 500 MECDLNVNNLTGIRNSFLLLAYARC---DPRVRPMVLCIKEWAHVNN-INSAQLGTLSSY 555

Query: 219 ALEMMVLH-------RFLDYYNTFDWDNYCISINGPV-----AISSLP 254
           AL +MVLH       R +  +     DN+  S N P+      ++SLP
Sbjct: 556 ALVLMVLHYLQIVKPRVIPSFQALHKDNF--SSNLPIHCLGEKVASLP 601


>gi|242054959|ref|XP_002456625.1| hypothetical protein SORBIDRAFT_03g039650 [Sorghum bicolor]
 gi|241928600|gb|EES01745.1| hypothetical protein SORBIDRAFT_03g039650 [Sorghum bicolor]
          Length = 257

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 86/194 (44%), Gaps = 31/194 (15%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPDGDI 106
           I++IL TI P  V + KRK  ++ IQ L +  Y      G  V PFGS     Y   GD+
Sbjct: 15  IEDILSTINP--VEDDKRKR-LSAIQELADSIYSVGPLRGAAVKPFGSFLSNLYAKSGDL 71

Query: 107 DLTVLGH--------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ 157
           D++V           +  +++  R + K L+        E     ++P A+V I+K    
Sbjct: 72  DVSVDLRNGSRLPISKKKKQNALRELMKALQMRGVARCME-----FIPTARVPILKYMSN 126

Query: 158 N--IPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWCYYESRILGAHYGL 214
           +  I  D+S N   G        ++   IG  D  F   V+L+K W    + I     G 
Sbjct: 127 HFGISCDVSVNNYPGQ----IKSRILYWIGTIDERFGDMVLLVKEWAKARN-INDPKNGT 181

Query: 215 ISTYALEMMVLHRF 228
           +++Y+L ++V+  F
Sbjct: 182 LNSYSLCLLVIFHF 195


>gi|49388677|dbj|BAD25861.1| putative poly(A) polymerase [Oryza sativa Japonica Group]
 gi|125581374|gb|EAZ22305.1| hypothetical protein OsJ_05959 [Oryza sativa Japonica Group]
          Length = 610

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVI--NYIQRLINGYYGIEVFP 91
           L  F ++A L+   EE  + +E+L  I   IV E  +K  I   Y ++++     + +F 
Sbjct: 45  LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 102

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
           FGS  L  + P  DID   +G   V+ +   FV       + +   E+ ++Q VP A V 
Sbjct: 103 FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVMLYGALSEME---EVTELQPVPDAHVP 159

Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
           ++K   + +P+D+ +             ++  S LC                +++ RL+ 
Sbjct: 160 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 219

Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
              +F++++  +K W             + G  + L+     ++       M++ RF   
Sbjct: 220 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 279

Query: 232 YNTFDWDN 239
           +  + W N
Sbjct: 280 FTQWQWPN 287


>gi|255081724|ref|XP_002508084.1| predicted protein [Micromonas sp. RCC299]
 gi|226523360|gb|ACO69342.1| predicted protein [Micromonas sp. RCC299]
          Length = 1070

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 76/355 (21%), Positives = 129/355 (36%), Gaps = 74/355 (20%)

Query: 3   EMSAFLEMEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQP 62
           E + FL+  D + + L+  SH                  + EL+     ++     T  P
Sbjct: 602 EDAEFLDASDYERDSLSGDSHE-----------------NCELYGTLHSQVSSFASTAGP 644

Query: 63  AIVSEKKRKEVINYIQRLINGYY---GIEVFPFGSVPLKTYLPDGDIDLTVLG------- 112
                + R  V   +   I   Y    +EVF  G+  L   L   DIDL VLG       
Sbjct: 645 DDSDLQDRLVVQKKVSEAITALYPEARLEVFGSGATGL--ALKCADIDLVVLGVGPEPSA 702

Query: 113 -----HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
                 +N  EDL R + KI ++  ++ +    +V    A+V I+K +     VD++   
Sbjct: 703 GGGGFSRNDREDLVRMLRKIEKSLRREKIVWKANV-ISTAKVPIIKMNAGRYAVDLTVGA 761

Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
             GLSA+ +++  DR+   D +  R +IL+         +  A  G    Y L  +V+  
Sbjct: 762 SNGLSAVQWIK--DRV--NDFVAMRPLILVLKKLLKTHHLDDASTGGCGGYLLVSLVVSH 817

Query: 228 ---------------------FLDYY-NTFDWDNYCIS-------------INGPVAISS 252
                                FL  + N FD+    ++             + GP A   
Sbjct: 818 LRQSGEPGRSANPNLGALLLGFLRRFGNEFDYSRMAVAAGRPSGVIRAKDLVVGPGAFGM 877

Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
            P I+AE  +  G  +  +    K+ R ++   +  +   G    ++ L    PL
Sbjct: 878 RPMILAEDPQETGRNITGAAYRFKEVRALFRSALEYMSAVGELAFLQELASPAPL 932


>gi|327349744|gb|EGE78601.1| topoisomerase TRF4 [Ajellomyces dermatitidis ATCC 18188]
          Length = 681

 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R ++I  +QR     +YG ++  FGS     YLP  D+DL +L  Q + +
Sbjct: 263 VKPKPFENSIRNDLIARLQRHFERRHYGSQLRAFGSFASGLYLPTADMDLVLLSRQFIRQ 322

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
           D  + +C+    E       +KD+   VP        A+V I+K    +  + VD+SF+ 
Sbjct: 323 D-RKVLCQ-RPREIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380

Query: 168 MAGLSA 173
             G++A
Sbjct: 381 STGIAA 386


>gi|146184040|ref|XP_001027646.2| Chitinase class I family protein [Tetrahymena thermophila]
 gi|146143378|gb|EAS07404.2| Chitinase class I family protein [Tetrahymena thermophila SB210]
          Length = 463

 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 77/169 (45%), Gaps = 13/169 (7%)

Query: 80  LINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI 139
           L++   G EV  FGS   + YLP+ DID+ ++      + L + V   + N   D ++E 
Sbjct: 192 LLDAVPGCEVKTFGSFSTELYLPNSDIDMVIVKDDIQNKSLYKKVADKIMN--CDDIYE- 248

Query: 140 KDVQYVP-AQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
            ++  V  A+V I+K   +   I  DISFN+  G+  L  ++K     G +   +   ++
Sbjct: 249 -NINLVTNAKVPIIKFVEKETQINFDISFNKEDGVKQLSEVKK-----GLELYPEMKYLI 302

Query: 197 IKAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFDWDNYCISI 244
           +   C    R L   Y G I ++ L  M+L    D    ++ +N    I
Sbjct: 303 MVMKCILRQRDLHETYSGGIGSFLLFCMILAFLRDLRRQYEKENRVQEI 351


>gi|156096154|ref|XP_001614111.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802985|gb|EDL44384.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 808

 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 75/159 (47%), Gaps = 17/159 (10%)

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVK 150
           PFGS      +P  D+D+  LG     EDL   +         D V +I+ ++   A+VK
Sbjct: 426 PFGSFVTGLSIPGSDLDVCFLGIP--LEDLDALLIISYALVKLDIVADIRLIK--DARVK 481

Query: 151 IVKC--SVQNIPVDISFNQMAGLSALCFLE-KVDRLIGKDHLFKRSVILIKAWCYYESRI 207
           I+K       + VD+  NQ++      F++ K+ + I      +  VIL+K   +  +R 
Sbjct: 482 ILKYIDRETGVQVDVCTNQLSSRQTTDFIKSKLQKFI----YMRPLVILLK--FFLNTRN 535

Query: 208 LGAHY-GLISTYALEMMVLHRFLDYYNTFDWD---NYCI 242
           L   Y G I ++ L  MVLH    + +TFDW+   N C+
Sbjct: 536 LNETYIGGIGSFLLCCMVLHFLQLHPSTFDWNVFSNSCL 574


>gi|255723652|ref|XP_002546758.1| Poly(A) polymerase PAPa [Candida tropicalis MYA-3404]
 gi|240130534|gb|EER30100.1| Poly(A) polymerase PAPa [Candida tropicalis MYA-3404]
          Length = 557

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 49/231 (21%), Positives = 85/231 (36%), Gaps = 65/231 (28%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + KKR EV+N +Q++   +                  G ++F FGS  L  Y P  DID 
Sbjct: 43  ATKKRVEVLNILQKMTEDFVFKVSIKKNMSEGMAKDAGGKIFTFGSYRLGVYGPGSDIDT 102

Query: 108 LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFN 166
           L V+       D      ++L+  D     E++++  VP A V I+K     I +D+ F 
Sbjct: 103 LIVVPKHVTRNDFFETFYELLKTRD-----ELEEIAPVPDAFVPIIKIEFAGISIDLIFA 157

Query: 167 Q-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
           +                         +  L+     +++  L+ K  +FK ++  IK W 
Sbjct: 158 RLDIPRVSRDLTLDDKNLLKNIDEKDLRALNGTRVTDEILTLVPKQTVFKHALRCIKMWA 217

Query: 202 YYESRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
              + I    +G     A  M+V               + +F   Y+ + W
Sbjct: 218 QQRA-IYANVFGFPGGVAWAMLVARICQLYPNAVSAVIIEKFFHIYSQWAW 267


>gi|239615134|gb|EEQ92121.1| topoisomerase TRF4 [Ajellomyces dermatitidis ER-3]
          Length = 681

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R ++I  +QR     +YG ++  FGS     YLP  D+DL +L  Q + +
Sbjct: 263 VKPKPFENSIRNDLIARLQRHFERRHYGSQLRAFGSFASGLYLPTADMDLVLLSRQFIRQ 322

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
           D  + +C+    E       +KD+   VP        A+V I+K    +  + VD+SF+ 
Sbjct: 323 D-RKVLCQ-RPREIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380

Query: 168 MAGLSA 173
             G++A
Sbjct: 381 STGIAA 386


>gi|168031583|ref|XP_001768300.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680478|gb|EDQ66914.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 787

 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 91/205 (44%), Gaps = 21/205 (10%)

Query: 32  SPPNLYPFSIDAELWLLA-----EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY- 85
           +P    P++   + W+ +      + I +    + P    ++ R+  +  +  ++   + 
Sbjct: 181 APKEAPPWARGRDSWIQSPLLQLHQEIVDFCEFVAPTEEEQQMRETAVERVSGVVQSIWP 240

Query: 86  GIEVFPFGSVPLKTYLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQDSVFEIKDVQY 144
             +V  FGS     YLP  D+D+ VL       +D  + + K L           K++Q 
Sbjct: 241 HSQVKVFGSFATGLYLPTSDVDVVVLDSGCTALQDGLKALAKALTRGHVG-----KNIQV 295

Query: 145 V-PAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
           +  A+V I+K   +V NIP DISF+   G  A  F++     +G     +   +++K   
Sbjct: 296 IGKARVPIIKFVETVSNIPFDISFDVANGPEAADFIKAA---MGAIPPLRPLCLVLK--I 350

Query: 202 YYESRILGAHY-GLISTYALEMMVL 225
           + + R L   Y G I +YAL +M+L
Sbjct: 351 FLQQRELNEVYQGGIGSYALLVMLL 375


>gi|240276761|gb|EER40272.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus H143]
          Length = 680

 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R +++  +QR     +YG ++  FGS     YLP  D+DL +   Q V +
Sbjct: 263 VKPKSFEHSIRNDLVERLQRHFERRHYGSQLCAFGSFASGLYLPTADMDLVLCSRQFVRQ 322

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
           D  + +C+    E       +KD+   VP        A+V I+K    +  + VD+SF+ 
Sbjct: 323 D-RKVLCQ-RPREIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380

Query: 168 MAGLSA 173
             G++A
Sbjct: 381 STGIAA 386


>gi|325095205|gb|EGC48515.1| topoisomerase TRF4 [Ajellomyces capsulatus H88]
          Length = 443

 Score = 40.4 bits (93), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R +++  +QR     +YG ++  FGS     YLP  D+DL +   Q V +
Sbjct: 26  VKPKSFEHSIRNDLVERLQRHFERRHYGSQLCAFGSFASGLYLPTADMDLVLCSRQFVRQ 85

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
           D  + +C+    E       +KD+   VP        A+V I+K    +  + VD+SF+ 
Sbjct: 86  D-RKVLCQ-RPREIHSFAAYLKDLDIPVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 143

Query: 168 MAGLSA 173
             G++A
Sbjct: 144 STGIAA 149


>gi|58220773|gb|AAW68018.1| putative polynucleotide adenylyltransferase [Oryza sativa Indica
           Group]
          Length = 652

 Score = 40.4 bits (93), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVIN--YIQRLINGYYGIEVFP 91
           L  F ++A L+   EE  + +E+L  I   IV E  +K  I   Y ++++     + +F 
Sbjct: 13  LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 70

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
           FGS  L  + P  DID   +G   V+ +   FV       + +   E+ ++Q VP A V 
Sbjct: 71  FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVTLYGALSEME---EVTELQPVPDAHVP 127

Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
           ++K   + +P+D+ +             ++  S LC                +++ RL+ 
Sbjct: 128 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 187

Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
              +F++++  +K W             + G  + L+     ++       M++ RF   
Sbjct: 188 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 247

Query: 232 YNTFDWDN 239
           +  + W N
Sbjct: 248 FTQWQWPN 255


>gi|19075773|ref|NP_588273.1| poly(A) polymerase Cid12 [Schizosaccharomyces pombe 972h-]
 gi|74582471|sp|O74518.1|CID12_SCHPO RecName: Full=Poly(A) RNA polymerase cid12; Short=PAP; AltName:
           Full=Caffeine-induced death protein 12; AltName:
           Full=Polynucleotide adenylyltransferase cid12
 gi|3426138|emb|CAA20372.1| poly(A) polymerase Cid12 [Schizosaccharomyces pombe]
          Length = 336

 Score = 40.4 bits (93), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 44/177 (24%), Positives = 83/177 (46%), Gaps = 14/177 (7%)

Query: 52  RIQEILCTIQPAIVSEKKRKEVINYIQRLINGY-YGIEVFPFGSVPLKTYLPDGDIDLTV 110
           R+   L  + P I   K RK ++  +Q  I       E+  +GS+ + T L   D+D+++
Sbjct: 23  RLYSFLEFVSPKIEELKYRKLLLEKLQTHIREVVLDAELQVYGSMYIGTTLSISDVDVSL 82

Query: 111 LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQV-KIVKCSVQNIPVDISFNQMA 169
              +  E +  R    + +  D D+ F      +  A+V +I    V  I VD++F    
Sbjct: 83  KSPRVGELEKRRVTMVLRKYLDADADF------HSSARVPRINLVDVSGIGVDLTF---- 132

Query: 170 GLSALCFLEKVDRLIGKDH-LFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
           G    C   ++ +   ++H +F R ++L+K W + E  +   H+G I++ AL  M++
Sbjct: 133 GNDKACRTAELQKAYNEEHPIFGRLLMLLKHWLF-ERDLENVHHGGIASCALSYMLI 188


>gi|225556159|gb|EEH04448.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus G186AR]
          Length = 680

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P       R +++  +QR     +YG ++  FGS     YLP  D+DL +   Q V +
Sbjct: 263 VKPKSFEHSIRNDLVERLQRHFERRHYGSQLCAFGSFASGLYLPTADMDLVLCSRQFVRQ 322

Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
           D  + +C+    E       +KD+   VP        A+V I+K    +  + VD+SF+ 
Sbjct: 323 D-RKVLCQ-RPREIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380

Query: 168 MAGLSA 173
             G++A
Sbjct: 381 STGIAA 386


>gi|398015076|ref|XP_003860728.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322498950|emb|CBZ34023.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 382

 Score = 40.0 bits (92), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 71/191 (37%), Gaps = 20/191 (10%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI--------K 140
           V P+GS+   T   DGD D  V      E       C I     +  + +I         
Sbjct: 76  VLPYGSIVSGTSFTDGDADYIVSFPIVSESTRQSGACVIARERQEKCLSDIFSHIRKCNS 135

Query: 141 DVQYVP-----AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
           DV+  P     A+V IV+   ++      F+    L  L     +   +  D   +  V+
Sbjct: 136 DVELHPQRIFRARVPIVQYVRKSAQESTKFDLSLSLDGLKNSLLLRHYMAGDPRLRLGVL 195

Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPE 255
             K W   E +IL A  G IS YAL +M +H   D   T       +S +       +  
Sbjct: 196 GAKQWG-REQQILNARRGWISPYALSIMYIHFMKDTGRT------ALSFDEEAVSQRVNA 248

Query: 256 IVAETVENDGD 266
           IV+   E +GD
Sbjct: 249 IVSTAAEAEGD 259


>gi|401421907|ref|XP_003875442.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322491679|emb|CBZ26952.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 382

 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 76/194 (39%), Gaps = 26/194 (13%)

Query: 89  VFPFGSVPLKTYLPDGDIDLT----------------VLGHQNVEEDLARFVCKILENED 132
           V P+GS+   T   DGD D                  V+G +  E+ L+     I +   
Sbjct: 76  VLPYGSIVSGTSFTDGDADYIVSFPTLSENTRQSGACVIGRERQEKFLSDIFSHIRKC-- 133

Query: 133 QDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKR 192
            +S  E+   +   A+V IV+   ++      F+    L  L     + + +  D   + 
Sbjct: 134 -NSDVELHPQRIFRARVPIVQYVRKSPKESTKFDLSLSLDGLKNSLLLRQYMAGDPRLRL 192

Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
            V+  K W   E +IL A  G IS YAL +M +H   D   T       +S +       
Sbjct: 193 GVLGAKQWG-REQQILNARRGWISPYALSIMYIHFMKDTGRT------GLSFDEEAVSQR 245

Query: 253 LPEIVAETVENDGD 266
           +  IV+   E++GD
Sbjct: 246 VNAIVSTAAESEGD 259


>gi|242061050|ref|XP_002451814.1| hypothetical protein SORBIDRAFT_04g008100 [Sorghum bicolor]
 gi|241931645|gb|EES04790.1| hypothetical protein SORBIDRAFT_04g008100 [Sorghum bicolor]
          Length = 672

 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 55/250 (22%), Positives = 103/250 (41%), Gaps = 53/250 (21%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVFP 91
           L  F ++A L+   EE  + +++LC I   IV E  K+      Y  +L+     + +F 
Sbjct: 42  LEKFLVEAGLYEGKEESAKREDVLCEIG-QIVKEWVKQLTSKKGYADQLVEQANAV-LFT 99

Query: 92  FGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQ 148
           FGS  L  + P+ DID   +G  + N EED    +  IL  ++     E+ ++Q VP A 
Sbjct: 100 FGSYRLGVHGPEADIDTLCVGPSYVNREEDFFVTLHGILAEKE-----EVTELQPVPDAH 154

Query: 149 VKIVKCSVQNIPVDISFNQM-----------AGLSALC--------------FLEKVDRL 183
           V ++K   + I +D+ +  +           +  S LC                +++ RL
Sbjct: 155 VPVLKFKFRGISIDLLYASLSLSVIPADFDISQGSVLCDVDEATVRSLNGCRVADQILRL 214

Query: 184 IGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTYALEM-------MVLHRFL 229
           +     F+ ++  +K W             + G ++ L+     ++       M++ RF 
Sbjct: 215 VPNVENFRTTLRCLKYWAKRRGVYSNITGFLGGVNWALLVARVCQLYPNAVPSMLVSRFF 274

Query: 230 DYYNTFDWDN 239
             +  + W N
Sbjct: 275 RVFTQWQWPN 284


>gi|317139015|ref|XP_001817214.2| topoisomerase family protein TRF4 [Aspergillus oryzae RIB40]
          Length = 659

 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I  +Q      YYG+++  FGS     YLP  DIDL +L    +  
Sbjct: 247 VKPVEYEQIVRADLIERLQTAFQSRYYGVQLRAFGSFASGLYLPTADIDLVLLSSNFMRH 306

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L N D      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 307 GIKTFGERKGQIYAFSAFLRNLDIAVPGSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 364

Query: 168 MAGLSA 173
            +GL A
Sbjct: 365 DSGLVA 370


>gi|303389764|ref|XP_003073114.1| DNA polymerase sigma [Encephalitozoon intestinalis ATCC 50506]
 gi|303302258|gb|ADM11754.1| DNA polymerase sigma [Encephalitozoon intestinalis ATCC 50506]
          Length = 378

 Score = 40.0 bits (92), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 15/135 (11%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTV- 110
           + ++   + P       RK +   I+RLI       EV PFGS      +P  DID+ + 
Sbjct: 70  LHQLYQKLAPTTTEINSRKYIFEKIKRLIVREIPNAEVEPFGSYTTGLIIPSSDIDINIQ 129

Query: 111 LGHQNVEEDLARFVCKI---LENED---QDSVFEIKDVQYVPAQVKIVKCS--VQNIPVD 162
           LG+ + +E   R++ KI   +   D   ++++F I+       ++ I+K S  V    +D
Sbjct: 130 LGNNHDKEYSNRYLSKIKSLMLKADFIRKETLFHIRK-----CRIPILKFSDKVFGFKID 184

Query: 163 ISFNQMAGLSALCFL 177
           IS NQ  G+ A  F+
Sbjct: 185 ISVNQTNGIEAAKFV 199


>gi|238482039|ref|XP_002372258.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
           NRRL3357]
 gi|220700308|gb|EED56646.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
           NRRL3357]
          Length = 608

 Score = 40.0 bits (92), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 14/126 (11%)

Query: 60  IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           ++P    +  R ++I  +Q      YYG+++  FGS     YLP  DIDL +L    +  
Sbjct: 196 VKPVEYEQIVRADLIERLQTAFQSRYYGVQLRAFGSFASGLYLPTADIDLVLLSSNFMRH 255

Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
            +  F             L N D      I+ + +  A+V I+K    +  + VD+SF+ 
Sbjct: 256 GIKTFGERKGQIYAFSAFLRNLDIAVPGSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 313

Query: 168 MAGLSA 173
            +GL A
Sbjct: 314 DSGLVA 319


>gi|297598868|ref|NP_001046352.2| Os02g0227400 [Oryza sativa Japonica Group]
 gi|255670737|dbj|BAF08266.2| Os02g0227400, partial [Oryza sativa Japonica Group]
          Length = 602

 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)

Query: 36  LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVIN--YIQRLINGYYGIEVFP 91
           L  F ++A L+   EE  + +E+L  I   IV E  +K  I   Y ++++     + +F 
Sbjct: 37  LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 94

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
           FGS  L  + P  DID   +G   V+ +   FV       + +   E+ ++Q VP A V 
Sbjct: 95  FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVMLYGALSEME---EVTELQPVPDAHVP 151

Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
           ++K   + +P+D+ +             ++  S LC                +++ RL+ 
Sbjct: 152 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 211

Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
              +F++++  +K W             + G  + L+     ++       M++ RF   
Sbjct: 212 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 271

Query: 232 YNTFDWDN 239
           +  + W N
Sbjct: 272 FTQWQWPN 279


>gi|260818300|ref|XP_002604321.1| hypothetical protein BRAFLDRAFT_88611 [Branchiostoma floridae]
 gi|229289647|gb|EEN60332.1| hypothetical protein BRAFLDRAFT_88611 [Branchiostoma floridae]
          Length = 482

 Score = 40.0 bits (92), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 66/157 (42%), Gaps = 27/157 (17%)

Query: 411 VTAQF-SPPSS------PSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYA 463
           VT QF SPPS          +P  S+W VL Q ++   + V Q     F+P   FCHP  
Sbjct: 30  VTNQFVSPPSHWLYGHLMQLLPDESSWPVLKQMMRDYSSHVLQLWVGPFLPLFIFCHPET 89

Query: 464 SQVRASTFSTDEGEKSRGTGTYIPERTRPPP--IEINRPVRARNPESLNHVPVSKLRRKI 521
           S+   ST S D    S G+  +   R   P    EI +P      +S N +        I
Sbjct: 90  SRAILST-SGDGLLLSHGSKWHRNRRLLTPAFHFEILKPYVTLFSQSTNVL--------I 140

Query: 522 DKAERLPSRDNLESGCSLNVENGCSLNMENS---CSL 555
           DK  +      L SGCS+ +    SL   +S   CSL
Sbjct: 141 DKWTK------LGSGCSVEMFEHVSLMTLDSILKCSL 171


>gi|221058739|ref|XP_002260015.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193810088|emb|CAQ41282.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 821

 Score = 40.0 bits (92), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 75/153 (49%), Gaps = 16/153 (10%)

Query: 91  PFGSVPLKTYLPDGDIDLTVLGHQNVE-EDLARFVCKILENEDQDSVFEIKDVQYVPAQV 149
           PFGS      +P  D+D+  L   N+  EDL   +         D V +I+ ++   A+V
Sbjct: 435 PFGSFVTGLSIPGSDLDVCFL---NIPLEDLDALLLIAYALVKLDMVTDIRLIK--DARV 489

Query: 150 KIVKCSVQ--NIPVDISFNQMAGLSALCFLE-KVDRLIGKDHLFKRSVILIKAWCYYESR 206
           KI+K + +   + VD+  NQ++      F++ K+++ I      +  VIL+K   +  +R
Sbjct: 490 KILKYTDKETGVQVDVCTNQLSSRQTTDFIKSKMEKYI----YLRPLVILLK--FFLNTR 543

Query: 207 ILGAHY-GLISTYALEMMVLHRFLDYYNTFDWD 238
            L   Y G I ++ L  MVLH    +  TFDW+
Sbjct: 544 NLNETYIGGIGSFMLCCMVLHFLQLHPTTFDWN 576


>gi|215707095|dbj|BAG93555.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score = 40.0 bits (92), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 33/198 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPD 103
           E+  ++IL  I+P    E  R + I  IQ L +  Y      G  V PFGS   + Y   
Sbjct: 13  EKCTEDILSLIKPV---EGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKS 69

Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI----------KDVQYVP-AQVKIV 152
           GD+D++V       E        I + + QD++ E+          + ++++P A+V ++
Sbjct: 70  GDLDVSV-------ELFNALNLPISKRKKQDTLREVRRALQKRGIARHMEFIPNARVPVL 122

Query: 153 K--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
           +   +   I  DIS +   G         ++ L   D  F   V+L+K W   ++ I   
Sbjct: 123 QYVSNQYGISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDP 178

Query: 211 HYGLISTYALEMMVLHRF 228
             G +++Y+L ++VL  F
Sbjct: 179 KNGTLNSYSLCLLVLFHF 196


>gi|147900520|ref|NP_001087078.1| PAP associated domain containing 4 a [Xenopus laevis]
 gi|51234260|gb|AAT98005.1| cytoplasmic poly(A) polymerase [Xenopus laevis]
          Length = 509

 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 6/126 (4%)

Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
           D D+ L +      +   AR +  +L       +  I+  Q++ A+V IVK    V    
Sbjct: 240 DADLCLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERPQFIRAKVPIVKFRDKVSGAE 299

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            D++ N + G+     L     L   D   +  V++IK W  +   I  A  G +S+Y +
Sbjct: 300 FDLNVNNVVGIRNTFLLRTYAYL---DKRVRPLVLVIKKWANHHG-INDASRGTLSSYTI 355

Query: 221 EMMVLH 226
            +MVLH
Sbjct: 356 VLMVLH 361


>gi|148230683|ref|NP_001086580.1| poly(A) RNA polymerase GLD2-B [Xenopus laevis]
 gi|82182837|sp|Q6DFA8.1|GLD2B_XENLA RecName: Full=Poly(A) RNA polymerase GLD2-B; Short=xGLD-2; AltName:
           Full=PAP-associated domain-containing protein 4-B
 gi|49903424|gb|AAH76832.1| MGC83852 protein [Xenopus laevis]
          Length = 509

 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 6/126 (4%)

Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
           D D+ L +      +   AR +  +L       +  I+  Q++ A+V IVK    V    
Sbjct: 240 DADLCLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERPQFIRAKVPIVKFRDKVSGAE 299

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            D++ N + G+     L     L   D   +  V++IK W  +   I  A  G +S+Y +
Sbjct: 300 FDLNVNNVVGIRNTFLLRTYAYL---DKRVRPLVLVIKKWANHHG-INDASRGTLSSYTI 355

Query: 221 EMMVLH 226
            +MVLH
Sbjct: 356 VLMVLH 361


>gi|82541397|ref|XP_724941.1| caffeine-induced death protein 1 [Plasmodium yoelii yoelii 17XNL]
 gi|23479769|gb|EAA16506.1| caffeine-induced death protein 1 [Plasmodium yoelii yoelii]
          Length = 534

 Score = 40.0 bits (92), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 88/193 (45%), Gaps = 23/193 (11%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E + ++   ++P+    K  K  +N++Q  IN  +    V PFGSV    ++ + DID+ 
Sbjct: 190 EELNKLEIALRPSQNDIKNIKMFLNFLQNEINKQFKNCHVTPFGSVINGFWMKNSDIDIC 249

Query: 110 ----VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIV-------KCSVQN 158
               +L ++  + +  + +C IL N   + + E    Q   A+V I+       K S Q 
Sbjct: 250 IQIPILLNRKDQINFLKKICLILNNY-HNGIIE----QRFSAKVPIIHFYCDDHKNSFQ- 303

Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
           +  DIS N +  +     ++K    +  D   +   I +K W   +  I     G +S++
Sbjct: 304 LSCDISVNNILAVINSKLIQK---YVSIDKRLQLMGIALKYWS-KKRNINDRSKGFLSSF 359

Query: 219 ALEMMVLHRFLDY 231
           +L +M +H FL Y
Sbjct: 360 SLILMAIH-FLQY 371


>gi|301615931|ref|XP_002937419.1| PREDICTED: PAP-associated domain-containing protein 5-like [Xenopus
           (Silurana) tropicalis]
          Length = 572

 Score = 40.0 bits (92), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 31/63 (49%), Gaps = 1/63 (1%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 158 EEIMDFYKYMSPRPEEEKMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 217

Query: 110 VLG 112
           V G
Sbjct: 218 VFG 220


>gi|134026254|gb|AAI36216.1| papd4 protein [Xenopus (Silurana) tropicalis]
          Length = 523

 Score = 40.0 bits (92), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 43/186 (23%), Positives = 77/186 (41%), Gaps = 19/186 (10%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL--------KTYLP 102
           ++I E+   +Q  +   KK+      +QR I      ++FP   + L         T   
Sbjct: 199 KQILELFQALQQQVCDLKKKDICRAELQREIQ-----QIFPQSRLYLVGSSLNGFGTRSS 253

Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
           D D+ L +      +   AR +  +L       +  I+  Q++ A+V IVK    V    
Sbjct: 254 DADLCLVLKDEPMNQHTEARHILSLLHKHFYTRLSYIERPQFIKAKVPIVKFRDKVSGAE 313

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            D++ N + G+     L     +   ++  +  V++IK W  Y   +  A  G +S+Y L
Sbjct: 314 FDLNVNNVVGIRNTFLLRTYAYI---ENRVRPLVLVIKMWANYHG-LNDASRGTLSSYTL 369

Query: 221 EMMVLH 226
            +M LH
Sbjct: 370 VLMALH 375


>gi|332030078|gb|EGI69903.1| PAP-associated domain-containing protein 5 [Acromyrmex echinatior]
          Length = 662

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 75/297 (25%), Positives = 124/297 (41%), Gaps = 33/297 (11%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I++    + P+      R  V+  I+ +I   +   +V  FGS     YLP  DIDL
Sbjct: 208 HEEIEDFFSYMCPSHEEHVLRLRVVKRIENVIYDLWPDSKVEVFGSFRTGLYLPTSDIDL 267

Query: 109 TVLGHQNVEEDLARFVCKILENEDQD-SVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
            V+G             + LE    D ++ E   ++ +  A V IVK + +   I VDIS
Sbjct: 268 VVIGM------WTNLPLRTLERALLDRNIAEPSSIKVLDKASVPIVKLTDKESEIKVDIS 321

Query: 165 FNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV 224
           FN   G+ +   +    R   +  + ++ V+++K +   +  +     G IS+Y+L +M 
Sbjct: 322 FNMNNGVKSAELINSYKR---QYPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSLILMT 377

Query: 225 LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--NDGDELLLSPEFLKKCREIY 282
           +  FL  +   D   +C + N  V +    E+          G  +    +++ K  EI 
Sbjct: 378 I-SFLQLHPRKDI--HCPNTNLGVLLIEFLELYGRKFNYVKTGIRIKDGGQYISK-EEIQ 433

Query: 283 SVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS-----VSKGNFHRIRCALSYGAQ 334
              I      GH  ++  L I DPL   N++GRS       K  F+   C LS  A 
Sbjct: 434 RDMI-----DGHRPSL--LCIEDPLTPGNDIGRSSYGALYVKSAFNWAYCILSQAAN 483


>gi|410075647|ref|XP_003955406.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
 gi|372461988|emb|CCF56271.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
          Length = 630

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 86/192 (44%), Gaps = 22/192 (11%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  I P+    K R + ++ + R I   +   E+  FGS     YLP  
Sbjct: 168 WLTLE--IKDFVAYISPSREEIKLRNKAVSKLGRAIKELWSDSELLVFGSYATDLYLPGS 225

Query: 105 DIDLTVLG------HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
           DID  V        H++   +LARF+ K    ++  +  E+     VP  +K ++     
Sbjct: 226 DIDCVVNSASGNKEHRSYLYELARFLKK----KNLATSIEVIARARVPI-IKFIEPE-SG 279

Query: 159 IPVDISFNQMAGLSALCFL-EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
           + +DISF +  G+ A   + E +D   G      R ++LI        R+   H G +  
Sbjct: 280 VHIDISFERTNGVEAAKLIREWLDMTPG-----LRELVLIVKQFLTARRLNDVHTGGLGG 334

Query: 218 YALEMMVLHRFL 229
           +++ + +++ FL
Sbjct: 335 FSI-ICLVYSFL 345


>gi|146180895|ref|XP_001021673.2| Poly polymerase central domain containing protein [Tetrahymena
            thermophila]
 gi|146144391|gb|EAS01427.2| Poly polymerase central domain containing protein [Tetrahymena
            thermophila SB210]
          Length = 1765

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 69/264 (26%), Positives = 102/264 (38%), Gaps = 58/264 (21%)

Query: 85   YGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQ-DSVFEIKD 141
            +G E+  FGS  LK    D DID   L    V+ D   F  +  IL+N  +   +F IK+
Sbjct: 1184 FGGELLFFGSYKLKVNSHDSDIDTVCLVPNFVDRDQHFFGDLVNILQNRKEVTELFPIKE 1243

Query: 142  VQYVPAQVKIVKCSVQNIPVDISFN----QMAG----LSALCFLEKVDR----------- 182
                 A V I+K     I  D+SF     Q  G    L++   L  +D            
Sbjct: 1244 -----ASVPIIKMKFSGILFDLSFARAPPQYNGENIDLTSNQMLLSMDERSYRSFNGCRV 1298

Query: 183  ---LIGKDHL-----FKRSVILIKAWCYYE---SRILGAHYGLISTYALEMM-------- 223
               LI KD++     FK ++  IK W       S I+G   G+  T  +  +        
Sbjct: 1299 AEWLIRKDNIQNYENFKITLKCIKLWATNRGIYSNIIGYLSGIGWTILVAKICQLYPNYP 1358

Query: 224  ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
               ++ RF   Y+ + W +       PV +  + E VA T  N  D+    PEF  +   
Sbjct: 1359 PNQLIERFFFLYSHWKWHDL------PVYVEHIQEDVANTKLN-ADQ--WRPEFRDRNSM 1409

Query: 281  IYSVPITALENTGHEFAIKHLNIV 304
            +   P     NT H  +   L I+
Sbjct: 1410 MIITPCFPCMNTAHNVSPTTLRII 1433


>gi|118404514|ref|NP_001072915.1| poly(A) RNA polymerase GLD2 [Xenopus (Silurana) tropicalis]
 gi|123906238|sp|Q0VFA3.1|GLD2_XENTR RecName: Full=Poly(A) RNA polymerase GLD2; AltName:
           Full=PAP-associated domain-containing protein 4
 gi|110645459|gb|AAI18910.1| PAP associated domain containing 4 [Xenopus (Silurana) tropicalis]
          Length = 528

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 43/186 (23%), Positives = 77/186 (41%), Gaps = 19/186 (10%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL--------KTYLP 102
           ++I E+   +Q  +   KK+      +QR I      ++FP   + L         T   
Sbjct: 204 KQILELFQALQQQVCDLKKKDICRAELQREIQ-----QIFPQSRLYLVGSSLNGFGTRSS 258

Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
           D D+ L +      +   AR +  +L       +  I+  Q++ A+V IVK    V    
Sbjct: 259 DADLCLVLKDEPMNQHTEARHILSLLHKHFYTRLSYIERPQFIKAKVPIVKFRDKVSGAE 318

Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
            D++ N + G+     L     +   ++  +  V++IK W  Y   +  A  G +S+Y L
Sbjct: 319 FDLNVNNVVGIRNTFLLRTYAYI---ENRVRPLVLVIKMWANYHG-LNDASRGTLSSYTL 374

Query: 221 EMMVLH 226
            +M LH
Sbjct: 375 VLMALH 380


>gi|154337346|ref|XP_001564906.1| RNA polymerase II [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|134061944|emb|CAM38985.1| RNA polymerase II [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 374

 Score = 39.7 bits (91), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 80/200 (40%), Gaps = 38/200 (19%)

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVF----------- 137
           V P+GS+   T L DGD D  V      E       C ++E E Q+ +            
Sbjct: 68  VLPYGSIVSGTSLRDGDADYIVSFPLASESTCQSIAC-VIERERQEKLLSDIFVHIRKNN 126

Query: 138 ---EIKDVQYVPAQVKIVK------CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDH 188
              E+   +   A+V IV+      C +    + +S   + GL     L +    +  D 
Sbjct: 127 RDDELYPQRIFRARVPIVQYVRKSACEISKFDICLS---LGGLKNSLLLRQ---YMAGDP 180

Query: 189 LFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY--YNTFDWDNYCISING 246
             +  V+  K W   + +IL    G IS YAL +M +H F+      TF +D   +S   
Sbjct: 181 RLRLGVLGAKQWG-RDHQILNTRRGWISPYALSIMYIH-FMKSTGRTTFAFDEEAVS--- 235

Query: 247 PVAISSLPEIVAETVENDGD 266
                 + EI++   E++GD
Sbjct: 236 ----QRVNEIISTAAESEGD 251


>gi|449514383|ref|XP_004177159.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase GLD2
           [Taeniopygia guttata]
          Length = 509

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 95/204 (46%), Gaps = 19/204 (9%)

Query: 35  NLYPFSIDAELWLLAEERIQEIL-----CTIQPAIVSEKK--RKEVINYIQRLINGYYGI 87
            L+P   +A L +  ++  +++L     C  QP+ +  K+  R E+   IQ +   +   
Sbjct: 167 TLFPDPTEARLPVAKDQLSKQVLEWFQVCRQQPSDLDRKELCRTELQREIQLI---FPQS 223

Query: 88  EVFPFGSV--PLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFE-IKDVQY 144
            +F  GS      T   DGD+ L V      ++  AR +  +++      +   I+  Q 
Sbjct: 224 RLFLVGSSLNGFGTRTSDGDLCLVVKEEPVNQKTEARRILSLVQKLFTTKLSSYIERPQL 283

Query: 145 VPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
           + A+V IVK    V N+  D++ N + G+    FL +    I  ++  +  V+++K W  
Sbjct: 284 IRAKVPIVKFRDKVSNVDFDLNVNNVIGIRN-TFLLRSYAFI--ENRVRPLVLVVKKWAS 340

Query: 203 YESRILGAHYGLISTYALEMMVLH 226
           +   I  A  G +++Y+L +MVLH
Sbjct: 341 FHE-INDASRGTLNSYSLVLMVLH 363


>gi|195045442|ref|XP_001991977.1| GH24509 [Drosophila grimshawi]
 gi|193892818|gb|EDV91684.1| GH24509 [Drosophila grimshawi]
          Length = 489

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 47/183 (25%), Positives = 81/183 (44%), Gaps = 21/183 (11%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDLT 109
           E I+     + P       R EV+  I+ ++   +   V   FGS     +LP  DIDL 
Sbjct: 270 EEIEHFYQYVLPTPCEHAIRNEVVKRIEAVVQSIWPQAVVEIFGSFRTGLFLPTSDIDLV 329

Query: 110 VLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISF 165
           VLG         +   + LE E     + E   V+ +  A V I+K + +   + VDISF
Sbjct: 330 VLGLWE------KLPLRTLEFELVSRGIAEACTVRVLDKASVPIIKLTDRETQVKVDISF 383

Query: 166 NQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
           N  +G+ +   +++  R   ++GK  L  +  +L++        +     G IS+Y+L +
Sbjct: 384 NMQSGVQSAELIKQFKREFPVLGKLVLVLKQFLLLRD-------LNEVFTGGISSYSLIL 436

Query: 223 MVL 225
           M +
Sbjct: 437 MCI 439


>gi|145550572|ref|XP_001460964.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428796|emb|CAK93567.1| unnamed protein product [Paramecium tetraurelia]
          Length = 463

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 44/196 (22%), Positives = 95/196 (48%), Gaps = 28/196 (14%)

Query: 47  LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP---------FGSVPL 97
           ++++++ + +L  ++   +++ K+K+ +N ++ LI+    ++  P         FGS   
Sbjct: 131 MISKDQDKVLLEILKQNEMTDDKKKKFLNSLKSLIHFLKQLDYDPFNAEIKYCIFGSQLN 190

Query: 98  KTYLPDGDIDLTVLGHQNVEEDLA-RFVCKILENEDQDSVFEIKD-VQYVPAQVKIVKCS 155
                + D+D T L +  V+E +A +++ + +E  DQ+  F+I + V++    V  ++  
Sbjct: 191 GFGTKESDVDFTFLTNSYVDERIALKYLRQEIEAVDQNK-FKINELVEFARIAVMKIQDQ 249

Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
              I +D+ FN + G+     L   +   G +   ++  IL+K W        G   G+I
Sbjct: 250 TNKIEIDMCFNNLLGVINTKLL---NAYAGLNEKVQQGGILLKLW--------GKKQGII 298

Query: 216 -----STYALEMMVLH 226
                S+YA+ +M LH
Sbjct: 299 NKNCFSSYAILIMWLH 314


>gi|350406748|ref|XP_003487869.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Bombus
           impatiens]
          Length = 655

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 67/238 (28%), Positives = 102/238 (42%), Gaps = 51/238 (21%)

Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           I+ ++ + A+V I+K   S+QN+ VD++ N   G+     L    R+   D   +  V++
Sbjct: 411 IEQLELIQAKVPILKFYDSIQNLEVDLNCNNAVGIRNTHLLYCYSRI---DWRVRPLVLV 467

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLD--------------YYNTF--DWDNY 240
           +K W   ++ I  A    IS+Y+L +MVLH FL               Y   F    D +
Sbjct: 468 VKLWAQSQN-INDAKNMTISSYSLVLMVLH-FLQCGVNPPVLPCLHSLYKGKFAPHTDIH 525

Query: 241 CISI----NGPVAI------SSLPEIVAE----TVENDGDELLLSPEFLKK-----CREI 281
           CI I    N PV+I       +L E+  E     V  D ++  +S     K     CR  
Sbjct: 526 CIDIQEELNIPVSILRPKNRQTLGELFVEFFRYYVMFDFNQYAISVRLANKIAIEECRRA 585

Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN-FHRIRCALSYGAQRLGE 338
            S      +N  H++  K+L I +P  D  N  RSV   + F RI+       Q L E
Sbjct: 586 RS-----YKNDPHQW--KYLCIEEPF-DLTNTARSVYDPDVFARIKHVFDCTYQNLKE 635


>gi|320038983|gb|EFW20918.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
          Length = 590

 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 56/122 (45%), Gaps = 25/122 (20%)

Query: 68  KKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDLTV 110
           +KR ++++ IQR+   +                  G ++FP+GS  L  Y P  DID  V
Sbjct: 48  EKRNQMLHLIQRVTVEFVKTVGRRKGLSQAAIDVSGGKIFPYGSYRLGVYGPGSDIDTLV 107

Query: 111 LG--HQNVEEDLARFVCKILENEDQDSVFEIKDV--QYVPAQVKIVKCSVQNIPVDISFN 166
           +G  H  +E+    F   +     +DS+ ++  V   YVP    I+K  +  I +D+ F 
Sbjct: 108 VGPKHVAIEDFFLDFPPTLERLAPKDSIEKLTPVPDAYVP----IIKLELSGISIDLIFA 163

Query: 167 QM 168
           ++
Sbjct: 164 RL 165


>gi|145525609|ref|XP_001448621.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416176|emb|CAK81224.1| unnamed protein product [Paramecium tetraurelia]
          Length = 364

 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 13/148 (8%)

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
           FGS   + YLP+ DID+ ++     ++ L + V   +    Q + FE  +V  +  A+V 
Sbjct: 108 FGSFKTRMYLPNADIDIVMIETSCTQKQLFKKVAARMMK--QTNKFE--NVNLIANAKVP 163

Query: 151 IVK-CSVQN-IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
           I+K   V++    D+SFNQ+ GL  +  LEK   L       +   +L+   C    R L
Sbjct: 164 IIKFVEVESQYHFDLSFNQLDGLKQIEELEKAFELYP-----ELKFLLMTLKCVLRQRDL 218

Query: 209 GAHY-GLISTYALEMMVLHRFLDYYNTF 235
              Y G + ++ L  M+L    ++   F
Sbjct: 219 NETYSGGVGSFLLFQMILAFLREFRKDF 246


>gi|145546801|ref|XP_001459083.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426906|emb|CAK91686.1| unnamed protein product [Paramecium tetraurelia]
          Length = 364

 Score = 39.7 bits (91), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 66/146 (45%), Gaps = 9/146 (6%)

Query: 92  FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFV-CKILENEDQDSVFEIKDVQYVPAQVK 150
           FGS   + YLP+ DID+ ++     ++ L + V  K+++  ++     +     VP  +K
Sbjct: 108 FGSFKTRMYLPNADIDMVMIETSCTQKQLFKKVAAKMMKQTNKFENVNLIANAKVPI-IK 166

Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
            ++   Q    D+SFNQ+ GL  +  LEK   +       +   +L+   C    R L  
Sbjct: 167 FIEVESQ-YHFDLSFNQLDGLKQIEELEKAFEIYP-----ELKFLLMTLKCVLRQRDLNE 220

Query: 211 HY-GLISTYALEMMVLHRFLDYYNTF 235
            Y G + ++ L  M+L    +Y   F
Sbjct: 221 TYSGGVGSFLLFQMILAFLREYRKDF 246


>gi|303317852|ref|XP_003068928.1| poly(A) polymerase pla1, putative [Coccidioides posadasii C735
           delta SOWgp]
 gi|240108609|gb|EER26783.1| poly(A) polymerase pla1, putative [Coccidioides posadasii C735
           delta SOWgp]
          Length = 599

 Score = 39.7 bits (91), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 56/122 (45%), Gaps = 25/122 (20%)

Query: 68  KKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDLTV 110
           +KR ++++ IQR+   +                  G ++FP+GS  L  Y P  DID  V
Sbjct: 48  EKRNQMLHLIQRVTVEFVKTVGRRKGLSQAAIDVSGGKIFPYGSYRLGVYGPGSDIDTLV 107

Query: 111 LG--HQNVEEDLARFVCKILENEDQDSVFEIKDV--QYVPAQVKIVKCSVQNIPVDISFN 166
           +G  H  +E+    F   +     +DS+ ++  V   YVP    I+K  +  I +D+ F 
Sbjct: 108 VGPKHVAIEDFFLDFPPTLERLAPKDSIEKLTPVPDAYVP----IIKLELSGISIDLIFA 163

Query: 167 QM 168
           ++
Sbjct: 164 RL 165


>gi|198451695|ref|XP_001358484.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
 gi|198131603|gb|EAL27623.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
          Length = 400

 Score = 39.7 bits (91), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 88/218 (40%), Gaps = 39/218 (17%)

Query: 60  IQPAIVSEKKRKEVINYIQ-RLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
           IQ        R E +  I+  L++ + G  V  FGS      LP  DIDL V   ++   
Sbjct: 39  IQSTATEFYLRAEAVRCIEDMLVSIWPGASVEVFGSFRTGLNLPLSDIDLVVENRKSY-- 96

Query: 119 DLARFVCKILENEDQDSVFEIKDVQYV----PAQVKIVKCS--VQNIPVDISFNQMAGLS 172
               +   + E E +     + D Q V     A V +VK +  V  I  DISFN  +G+ 
Sbjct: 97  ---WYTPPLYELESELVARGVADPQTVNVVDTAAVPVVKFTERVSQIKFDISFNVGSGVK 153

Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
           A    E +  LI +     + V+++K +            G IS+Y + +M         
Sbjct: 154 A---AELIKDLIQEFPDLPKLVMVLKQFLAMHGLNDVYRSGGISSYGITLMCIGFLQHQS 210

Query: 224 --------------VLHRFLDYYN-TFDWDNYCISING 246
                         +L +FL+YY   FD++ Y IS+ G
Sbjct: 211 QSNKKYNNNNRLGILLLKFLEYYGRKFDFNKYAISVTG 248


>gi|448086239|ref|XP_004196053.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
 gi|359377475|emb|CCE85858.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
          Length = 558

 Score = 39.3 bits (90), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 46/203 (22%), Positives = 77/203 (37%), Gaps = 50/203 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + +KR EV+N +Q++   +                  G ++F FGS  L  Y P  DID 
Sbjct: 43  ATRKRVEVLNELQKMAEKFVYEVSLQKNMSDGMARDAGGKIFTFGSYRLGVYGPGSDIDA 102

Query: 108 LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFN 166
           L V+       D      KIL         E++++  VP A V I+K  +  I +D+ F 
Sbjct: 103 LVVVPKHVTRNDFFEVFEKILRERK-----ELEEIAAVPDAYVPIIKMVLNGISIDLIFA 157

Query: 167 Q-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
           +                         +  L+     +++  L+ K  +FK ++  IK W 
Sbjct: 158 RLDVPKVPKDLKLDDKNLLRNIDEKDLRSLNGTRVTDEILTLVPKPTVFKHALRCIKMWA 217

Query: 202 YYESRILGAHYGLISTYALEMMV 224
              + I    +G     A  M+V
Sbjct: 218 QQRA-IYANVFGFPGGVAWAMLV 239


>gi|255732153|ref|XP_002551000.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
 gi|240131286|gb|EER30846.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
          Length = 558

 Score = 39.3 bits (90), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 52/238 (21%), Positives = 86/238 (36%), Gaps = 79/238 (33%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
           + KKR EV+   Q+L+  +                  G +VF FGS  L  Y P  DID 
Sbjct: 44  ATKKRVEVLTLFQKLVQEFVYTVSKSKNMSDGMAKDSGGKVFTFGSYRLGVYGPGSDIDT 103

Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVF--------EIKDVQYVP-AQVKIVKCSVQNI 159
            V+  ++V  D            D  SVF        E++++  VP A V I+K     I
Sbjct: 104 LVVVPKHVTRD------------DFFSVFPDIIRKRPELEEIACVPDAFVPIIKLEFDGI 151

Query: 160 PVDISFNQ-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSV 194
            +D+   +                         +  L+     +++ +L+ K  +FK ++
Sbjct: 152 SIDLIMARLNVPRVPLEMTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHAL 211

Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
             IK W   +  + G  YG     A  M+V               + +F   Y  ++W
Sbjct: 212 RCIKLWA-QQRAVYGNVYGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFSIYTKWNW 268


>gi|196004468|ref|XP_002112101.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
 gi|190586000|gb|EDV26068.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
          Length = 289

 Score = 39.3 bits (90), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 92/192 (47%), Gaps = 20/192 (10%)

Query: 42  DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTY 100
           D+ L  L EE I +    I P    +  R+ V+  ++ +I   +  ++V  FGS     Y
Sbjct: 4   DSSLARLHEE-IIDFYQYISPRPEEKNMRETVVEGVKEVILTLWPHVQVEVFGSFRTGLY 62

Query: 101 LPDGDIDLTVLG--HQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSV 156
           LP  DIDL + G   +   EDL +    ++++E  D+D++  I +     A V I+K + 
Sbjct: 63  LPTSDIDLVIFGIDGKGAFEDLEK---ALMQHEVCDRDNIKCIHN-----AMVPIIKLTE 114

Query: 157 Q--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
           +  N  +DI FN    + +    + +   I K    K  V+++K +  ++  +     G 
Sbjct: 115 KTCNYKMDIEFNIENSVKS---ADIIQTYIRKYEPLKYLVLVLKQFL-FQRELNEVFSGG 170

Query: 215 ISTYALEMMVLH 226
           +S+Y L MMV++
Sbjct: 171 VSSYTLVMMVVN 182


>gi|19115813|ref|NP_594901.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe 972h-]
 gi|15213942|sp|O13833.2|CID1_SCHPO RecName: Full=Poly(A) RNA polymerase protein cid1; AltName:
           Full=Caffeine-induced death protein 1
 gi|393715400|pdb|4E7X|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715401|pdb|4E7X|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715402|pdb|4E7X|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715403|pdb|4E7X|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715405|pdb|4E80|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715406|pdb|4E80|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715407|pdb|4E80|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715408|pdb|4E80|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715409|pdb|4E8F|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|393715410|pdb|4E8F|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
           Terminal U- Transferase
 gi|4324457|gb|AAD16889.1| caffeine-induced death protein 1 [Schizosaccharomyces pombe]
 gi|5524947|emb|CAB50789.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe]
          Length = 405

 Score = 39.3 bits (90), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 62/146 (42%), Gaps = 16/146 (10%)

Query: 88  EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPA 147
           E+  FGS+     L + D+DL VL    V+ D    +      E     FE K +Q   A
Sbjct: 84  ELVAFGSLESGLALKNSDMDLCVLMDSRVQSDT---IALQFYEELIAEGFEGKFLQ--RA 138

Query: 148 QVKIVKCSVQ-------NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAW 200
           ++ I+K +         +   DI FN    +     L    +L   D   K  V+L+K W
Sbjct: 139 RIPIIKLTSDTKNGFGASFQCDIGFNNRLAIHNTLLLSSYTKL---DARLKPMVLLVKHW 195

Query: 201 CYYESRILGAHYGLISTYALEMMVLH 226
                +I   ++G +S+Y   +MVL+
Sbjct: 196 A-KRKQINSPYFGTLSSYGYVLMVLY 220


>gi|418617333|ref|ZP_13180231.1| hypothetical protein SEVCU120_1888 [Staphylococcus epidermidis
           VCU120]
 gi|374818626|gb|EHR82776.1| hypothetical protein SEVCU120_1888 [Staphylococcus epidermidis
           VCU120]
          Length = 671

 Score = 39.3 bits (90), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 55  EILCTIQPAIVSEKKRKEVINYIQRLIN-----GYYGIEVFPFGSVPLKTYLPDGDIDLT 109
           EIL +I+   +S KKR ++  Y +R +N       Y I +F   S+P  T LP  ++DL 
Sbjct: 258 EILRSIKKKNISWKKRLKIDKYGKRFLNFNNTLSKYAISIFSEKSLPF-TLLPKENLDLF 316

Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVF 137
           +  ++ V+      +  + EN ++D VF
Sbjct: 317 INDYKTVKNYSKGLIQYLNENSEKDIVF 344


>gi|270009939|gb|EFA06387.1| hypothetical protein TcasGA2_TC009265 [Tribolium castaneum]
          Length = 513

 Score = 39.3 bits (90), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 85/201 (42%), Gaps = 20/201 (9%)

Query: 30  YSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIE 88
           +   P++Y   I     L   + I      + P       R +V+  I+++I + +   +
Sbjct: 97  WRKDPHVYARGI-----LGLHQEIDHFYQYMSPTRTEHLVRDDVVCRIKQIILSKWPEAQ 151

Query: 89  VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVP 146
           V  FGS     YLP  DIDL V+G        +    + LE E  D D   E        
Sbjct: 152 VEVFGSYRTGLYLPTSDIDLVVIG------KWSNLPLRTLEQEFLDNDVAQENSIKVLDK 205

Query: 147 AQVKIVKCSVQN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
           A V IVK + +   I VDISFN   G+ +    E +   I +  +  + V ++K +   E
Sbjct: 206 ASVPIVKLTDKQTEIKVDISFNMSNGVKS---AELIKTYIHQFPVLPKLVYVLKQFL-LE 261

Query: 205 SRILGAHYGLISTYALEMMVL 225
             +     G IS+Y+L +M +
Sbjct: 262 RDLNEVFTGGISSYSLILMCI 282


>gi|47208265|emb|CAF92498.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 297

 Score = 39.3 bits (90), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 55/121 (45%), Gaps = 21/121 (17%)

Query: 60  IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH----- 113
           I P    EK R EV++ I+ +I+  +   EV  FGS     YLP  DIDL V G      
Sbjct: 1   ISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVVFGKWETLP 60

Query: 114 -QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
              +EE L +   + + +E+   V +        A V +       I VDISFN  +G+ 
Sbjct: 61  LWTLEEALRK---RKVADENSIKVLD-------KATVSLFSL----IFVDISFNMKSGVK 106

Query: 173 A 173
           A
Sbjct: 107 A 107


>gi|448081751|ref|XP_004194965.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
 gi|359376387|emb|CCE86969.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
          Length = 558

 Score = 39.3 bits (90), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 47/203 (23%), Positives = 78/203 (38%), Gaps = 50/203 (24%)

Query: 66  SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
           + +KR EV+N +Q++   +                  G ++F FGS  L  Y P  DID 
Sbjct: 43  ATRKRVEVLNELQKMAERFVYEVSLQKNMSDGMARDAGGKIFTFGSYRLGVYGPGSDIDA 102

Query: 108 LTVLGHQNVEEDLARFVCKIL-ENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFN 166
           L V+       D      KIL E +D + +  + D  YVP    I+K  +  I +D+ F 
Sbjct: 103 LVVVPKHVTRNDFFEVFEKILRERKDLEEIAAVPDA-YVP----IIKMILNGISIDLIFA 157

Query: 167 Q-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
           +                         +  L+     +++  L+ K  +FK ++  IK W 
Sbjct: 158 RLDVPKVPKDLKLDDKNLLRNIDEKDLRSLNGTRVTDEILTLVPKPTVFKHALRCIKMWA 217

Query: 202 YYESRILGAHYGLISTYALEMMV 224
              + I    +G     A  M+V
Sbjct: 218 QQRA-IYANVFGFPGGVAWAMLV 239


>gi|355756767|gb|EHH60375.1| hypothetical protein EGM_11721, partial [Macaca fascicularis]
          Length = 483

 Score = 39.3 bits (90), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 12/112 (10%)

Query: 51  ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
           E I +    + P    EK R EV+N I+ +I   +   +V  FGS     YLP  DIDL 
Sbjct: 21  EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 80

Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQY--VPAQVKIVK 153
           V G         +EE L +     + +ED   V +   V+Y  +P  V ++K
Sbjct: 81  VFGKWENLPLWTLEEALRKHK---VADEDSVKVLDKATVKYPVLPYLVLVLK 129


>gi|219130012|ref|XP_002185169.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403348|gb|EEC43301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 425

 Score = 39.3 bits (90), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 58/256 (22%), Positives = 111/256 (43%), Gaps = 50/256 (19%)

Query: 11  EDLQEELLTSPSHTSSSSSYSSPPNLYPFSI---DAELWLLAEERIQEILCTIQPAIVSE 67
           ED+ E++ TS    +SS     PP +    I    A         I + +  ++P     
Sbjct: 37  EDVDEDMDTS----TSSEDDPLPPWIEHHDIGLQKAHAATALHNEIVQFVRLMEPMPEEI 92

Query: 68  KKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED------- 119
           ++R++++  ++ L++  +   +V  FGS     +LP  D+DL V+ ++   ++       
Sbjct: 93  RQREDLVKRVEELVHRTFDNAQVHVFGSQATGLFLPSSDVDLLVITNEKANDETSQPDRQ 152

Query: 120 ----------LARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ--NIPVDISFN 166
                     L RF   + E    D + E+  ++ +   +V +VK +    NI VD+ F+
Sbjct: 153 EDWQKPSGSPLDRFESVLRE----DWLMELSYLEVIGNTKVPLVKFTHAPTNISVDVCFD 208

Query: 167 QMAGLSAL----CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
           Q +G  A      +LE +  L        R +  +  + +  +R L   Y G + +Y L+
Sbjct: 209 QESGPGAAQLMKTYLEALPPL--------RPLTFVLKY-FLSARGLNEPYSGGVGSYLLQ 259

Query: 222 MMVL----HRFLDYYN 233
           +M++    HR  D YN
Sbjct: 260 LMIVSFLQHRERDAYN 275


>gi|380015769|ref|XP_003691868.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Apis
           florea]
          Length = 652

 Score = 39.3 bits (90), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 7/95 (7%)

Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           I+ ++ + A+V I+K   S+QN+ VD++ N   G+     L    R+   D   +  V++
Sbjct: 408 IEQLELIQAKVPILKFHDSIQNLEVDLNCNNAVGIRNTHLLYCYSRI---DWRVRPLVLV 464

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
           +K W   +  I  A    IS+Y+L +MV+H FL Y
Sbjct: 465 VKLWAQSQD-INDAKNMTISSYSLVLMVIH-FLQY 497


>gi|430813412|emb|CCJ29233.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 398

 Score = 38.9 bits (89), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 87/190 (45%), Gaps = 23/190 (12%)

Query: 70  RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH----QNVEEDLARFV 124
           R+ V+  I  L+  ++  +++  FGS     YLP  DIDL +L          +DL + +
Sbjct: 107 RELVVQRINALVQKHWKNVQLCAFGSFDTMLYLPTSDIDLVILSLGPRIYETRKDLHK-L 165

Query: 125 CKILENEDQDSVFEIKDVQYVP-AQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVD 181
            + L   +       KD+Q +  A V I+K   ++  I VDISFN+  GL +   +++  
Sbjct: 166 SRYLRCSNV-----AKDIQVITGASVPIIKFIDTLTQIHVDISFNKPGGLVSANIIKQ-- 218

Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM----MVLHRFLDYYNT-FD 236
             + + +  K  V+ IK +      +   H  L+S    E     ++L  F + Y   F+
Sbjct: 219 -YMKEHYALKPLVMFIKHFLNMRG-LNERHPKLLSKEIKEQDNLGVLLMEFFELYGKLFN 276

Query: 237 WDNYCISING 246
           ++   ISIN 
Sbjct: 277 YNEVGISINN 286


>gi|133919900|emb|CAL91353.1| cytoplasmic poly(A) polymerase [Xenopus laevis]
          Length = 466

 Score = 38.9 bits (89), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 9/116 (7%)

Query: 113 HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAG 170
           +QN E   AR +  +L       +  I+  Q++ A+V IVK    V     D++ N + G
Sbjct: 210 NQNTE---ARHILSLLHKHFYTRLSYIERPQFIRAKVPIVKFRDKVSGAEFDLNVNNVVG 266

Query: 171 LSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
           +     L     L   D   +  V++IK W  +   I  A  G +S+Y + +MVLH
Sbjct: 267 IRNTFLLRTYAYL---DKRVRPLVLVIKKWANHHG-INDASRGTLSSYTIVLMVLH 318


>gi|145530898|ref|XP_001451221.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124418865|emb|CAK83824.1| unnamed protein product [Paramecium tetraurelia]
          Length = 583

 Score = 38.9 bits (89), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 84/350 (24%), Positives = 138/350 (39%), Gaps = 69/350 (19%)

Query: 75  NYIQRLIN--------GYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCK 126
           N  Q+LI+         Y G ++  +GS   +  LP  DIDL V+   N  E     + +
Sbjct: 275 NAFQKLIDEISAVVSATYAGAQIDVYGSYATELCLPHSDIDL-VIKISNQHEKFVTDILQ 333

Query: 127 ILENEDQDSVF--EIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR 182
            +E E +   F  E K V      V   KC+ Q  N  +DIS  +       C   ++ R
Sbjct: 334 RIEVELKKCKFIEETKCVTQSTTPVLRAKCNKQYMNKRLDISIQETKHNGLQCV--QLIR 391

Query: 183 LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD--WDNY 240
              K++   + + LI     ++S +   + G +S+Y L +M++  FL  Y   D  W   
Sbjct: 392 KYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGLSSYGLILMIV-SFLQSYQNQDKSWPTI 450

Query: 241 CISINGPVAISSLP------EIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGH 294
                G + I  L       +   +T+  D       PE  ++   I   P        H
Sbjct: 451 -----GTLLIEFLNVYGCELDYAGKTICPD------QPEVFEQETTIIFDP--------H 491

Query: 295 EFAI---KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGL 351
            FA    + L I+DPL   NN+GR     N  +++ A +    +L   LT         L
Sbjct: 492 NFAYCQQQSLVIIDPLNPQNNVGR--PSYNVAKLKLAFTVAFSKL---LTFDSSTQQYPL 546

Query: 352 EKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYG 401
            KFF N+ +        ++Q  +H+          L   Y +Y + +++G
Sbjct: 547 -KFFFNSAQ--------NIQCSLHSV---------LVNQYKAYADQIMFG 578


>gi|258563094|ref|XP_002582292.1| Poly(A) polymerase PAPalpha [Uncinocarpus reesii 1704]
 gi|237907799|gb|EEP82200.1| Poly(A) polymerase PAPalpha [Uncinocarpus reesii 1704]
          Length = 616

 Score = 38.9 bits (89), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 60/125 (48%), Gaps = 21/125 (16%)

Query: 63  AIVSEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGD 105
           A+++   RK++++ +QR+   +                  G ++F FGS  L  Y P  D
Sbjct: 60  ALLNINPRKQILHLLQRVTIEFIKSVGRKKGLSQGAIDAAGGKIFTFGSYRLGVYGPGSD 119

Query: 106 IDLTVLGHQNV--EEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
           ID  V+G ++V  E+  A F   + +   +DS+ ++  V    A V I+K  +  I +D+
Sbjct: 120 IDTLVVGPKHVTREDFFADFPSTLEQLAPKDSIEKLTSVP--DAFVPIMKLELSGISIDL 177

Query: 164 SFNQM 168
            F ++
Sbjct: 178 IFARL 182


>gi|145478719|ref|XP_001425382.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392452|emb|CAK57984.1| unnamed protein product [Paramecium tetraurelia]
          Length = 592

 Score = 38.9 bits (89), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 84/350 (24%), Positives = 140/350 (40%), Gaps = 69/350 (19%)

Query: 75  NYIQRLIN--------GYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCK 126
           N  Q+LI+         Y G ++  +GS   +  LP  DIDL V+   N  E     + +
Sbjct: 284 NAFQKLIDEISGVVSATYAGAQIDVYGSYATELCLPHSDIDL-VIKISNQHEKFVTDILQ 342

Query: 127 ILENEDQDSVF--EIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR 182
            +E E +   F  E K V      V   KC+ Q  N  +DIS  +       C   ++ R
Sbjct: 343 RIEVELKKCKFIEETKCVTQSTTPVLRAKCNKQYMNKRLDISIQETKHNGLQCV--QLIR 400

Query: 183 LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCI 242
              K++   + + LI     ++S +   + G +S+Y L +M++  FL  Y   D      
Sbjct: 401 KYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGLSSYGLILMIV-SFLQSYQNQDK----- 454

Query: 243 SINGPVAISSLPEIVAETVENDGDELLLS--------PEFLKKCREIYSVPITALENTGH 294
             N P    ++  ++ E +   G EL  +        PE  ++   I   P        H
Sbjct: 455 --NWP----TIGTLLIEFLNVYGCELDYAGKTICPDQPEVFEQETTIIFDP--------H 500

Query: 295 EFAI---KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGL 351
            FA    + L I+DPL   NN+GR     N  +++ A +    +L   LT         L
Sbjct: 501 NFAYCQQQSLVIIDPLNPQNNVGR--PSYNVAKLKLAFTVAFSKL---LTFDSSNQQYPL 555

Query: 352 EKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYG 401
            KFF N+ +        ++Q  +H+          L   Y +Y + +++G
Sbjct: 556 -KFFFNSAQ--------NIQCSLHSV---------LVNQYKAYADQIMFG 587


>gi|383852647|ref|XP_003701838.1| PREDICTED: PAP-associated domain-containing protein 5-like
           [Megachile rotundata]
          Length = 573

 Score = 38.9 bits (89), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 69/283 (24%), Positives = 118/283 (41%), Gaps = 46/283 (16%)

Query: 50  EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
            E I++    + P+      R  V+  I+++I   +   +V  FGS     YLP  DIDL
Sbjct: 122 HEEIEDFFAYMCPSNEEHSLRIRVVKRIEQVIYDLWPDSKVEVFGSFRTGLYLPTSDIDL 181

Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
            V+G       + +E  L      + +N  + S  ++ D   VP  VK+     + I VD
Sbjct: 182 VVIGMWTNLPLRTLERAL------LDQNIAEPSSIKVLDKASVPI-VKLTDKETE-IKVD 233

Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
           ISFN   G+ +    E ++    +  + ++ V+++K   +   R L   + G IS+Y+L 
Sbjct: 234 ISFNMSNGVKS---AELINSFKKRYPVLEKLVMVLKQ--FLLQRDLNEVFTGGISSYSLI 288

Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPE 273
           +M +  FL  +     + YC + N  V +    E+               DG   +   E
Sbjct: 289 LMTI-SFLQLHPR--QNAYCSNANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEE 345

Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
             +   +            GH  ++  L I DPL   N++GRS
Sbjct: 346 VQRDMID------------GHRPSL--LCIEDPLTPGNDIGRS 374


>gi|449669254|ref|XP_002166747.2| PREDICTED: poly(A) RNA polymerase GLD2-A-like [Hydra
           magnipapillata]
          Length = 437

 Score = 38.9 bits (89), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 6/95 (6%)

Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
           I+++Q++ A+V I+K   +V     DI+ N   G+     L    ++   D   +  ++ 
Sbjct: 213 IRNIQFIRAKVPILKFKDTVSGCDCDINTNNSIGIRNTHLLRTYSKI---DDRVRPLIMA 269

Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
           +K W    S I  A  G +S+Y+L MMV+H    Y
Sbjct: 270 VKHWAKSRS-INDASQGTLSSYSLVMMVIHYLQSY 303


>gi|145535359|ref|XP_001453418.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421129|emb|CAK86021.1| unnamed protein product [Paramecium tetraurelia]
          Length = 634

 Score = 38.9 bits (89), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 106/248 (42%), Gaps = 33/248 (13%)

Query: 53  IQEILCTIQPAIVSEKKRKEV-INYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVL 111
           I E+   IQ      + RK++ +N +  LI   Y + ++P+GS      L   D+D+ + 
Sbjct: 308 IDELSNYIQQKSYQNRIRKQICLNRLNYLIYTNYHLNMYPYGSFETGLDLEISDVDVGIW 367

Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGL 171
           G QN+         ++L N  + + F IK  + + +Q+ I+K  +   P    +++   +
Sbjct: 368 GSQNLSYSQIVSFLQLLNNTLKQTPFLIKS-KLIQSQMPILKLELN--PKSSFYDEDQHI 424

Query: 172 S---ALCFLEKVDRLIGKDHLFKRSVILIK-AWCYYESRILGA-HYGLISTYALE----- 221
               +   L++ D+ I +   F   +I +   W Y  S +    H G  ST  ++     
Sbjct: 425 QQNWSYFHLDQNDQGIIQSSQFLGKIIQVDITWIYQWSNVYNNPHLGFASTTIVKDWVSR 484

Query: 222 -------MMVLHRFLDYYNTFDW-----DNYCISINGPVAISSLPEIVAETVENDGDELL 269
                  M++L   +   N  D       ++C+SI     + +   +     +ND  ++L
Sbjct: 485 FVWYRDIMLILKHLVKSKNLNDAHTGGISSFCLSI-----MLAAIYMCKHYTQNDKKQIL 539

Query: 270 LSPEFLKK 277
           L  +FLKK
Sbjct: 540 L--DFLKK 545


>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
          Length = 573

 Score = 38.9 bits (89), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 46/186 (24%), Positives = 81/186 (43%), Gaps = 17/186 (9%)

Query: 46  WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
           WL  E  I++ +  I P+    + R + I+ I+  +   +   ++  FGS     YLP  
Sbjct: 170 WLTFE--IKDFVAYISPSREEIEVRNQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGS 227

Query: 105 DIDLTV---LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--I 159
           DID  +   LG +    +L      +   + ++   EI+ V    A+V I+K    N  I
Sbjct: 228 DIDCVITSELGGKESRNNLFSLASHL---KKKNLATEIEVV--AKARVPIIKFVEPNSGI 282

Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
            +D+SF +  GL A     K+ R    D    R ++LI     +  R+   H G +  ++
Sbjct: 283 HIDVSFERTNGLEAA----KLIREWLNDTPGLRELVLIVKQFLHSRRLNNVHTGGLGGFS 338

Query: 220 LEMMVL 225
           +  +V 
Sbjct: 339 IICLVF 344


>gi|356537950|ref|XP_003537469.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Glycine max]
          Length = 328

 Score = 38.9 bits (89), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 43/191 (22%), Positives = 85/191 (44%), Gaps = 24/191 (12%)

Query: 53  IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
           + +IL  + P     + R  +IN  + ++       G  V P+GS     +   GD+D++
Sbjct: 11  VNDILRVVTPLQEDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNLFTRWGDLDIS 70

Query: 110 V---------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSV--Q 157
           +            +  ++ L   V K L  +   S     ++Q++  A+V I+K     Q
Sbjct: 71  IELSNGLHISSAGKKQKQTLLGEVLKALRMKGGGS-----NLQFISNARVPILKFKSYRQ 125

Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
            +  DIS N + G      L  ++++ G+   F+  V+L+K W     +I  +  G  ++
Sbjct: 126 GVSCDISINNLPGQMKSKILLWINKIDGR---FRHMVLLVKEWAKAH-KINNSKAGTFNS 181

Query: 218 YALEMMVLHRF 228
           Y+L ++V+  F
Sbjct: 182 YSLSLLVIFYF 192


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.135    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,021,094,170
Number of Sequences: 23463169
Number of extensions: 496789835
Number of successful extensions: 1165967
Number of sequences better than 100.0: 611
Number of HSP's better than 100.0 without gapping: 159
Number of HSP's successfully gapped in prelim test: 452
Number of HSP's that attempted gapping in prelim test: 1164856
Number of HSP's gapped (non-prelim): 985
length of query: 659
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 510
effective length of database: 8,863,183,186
effective search space: 4520223424860
effective search space used: 4520223424860
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)