BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006144
(659 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255564100|ref|XP_002523048.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223537731|gb|EEF39352.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 644
Score = 719 bits (1856), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/633 (57%), Positives = 453/633 (71%), Gaps = 33/633 (5%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
+LYP SID+ELWL+AE+R QEIL +QP+ SE+KRKEVI+YIQRLI +Y EVFPFGS
Sbjct: 22 DLYPLSIDSELWLMAEKRTQEILWVLQPSSSSEQKRKEVIDYIQRLIKHHYATEVFPFGS 81
Query: 95 VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC 154
VPLKTYLPDGDIDLT L HQN+EEDLAR VC IL +Q+ E+KDV+Y+ AQVK+VKC
Sbjct: 82 VPLKTYLPDGDIDLTALSHQNMEEDLAREVCDILTYAEQNLESEVKDVRYIQAQVKVVKC 141
Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
SV+NI VDISFNQMAGL ALCFLE+VD+LIGKDHL K S+ILIKAWC+YESRILGAH+GL
Sbjct: 142 SVKNISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKHSIILIKAWCFYESRILGAHHGL 201
Query: 215 ISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIV 257
+STYALE++VL+ RFL+YY+TFDWDNYC++INGPVAISSLPEI+
Sbjct: 202 LSTYALEILVLYIVNVFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAISSLPEIM 261
Query: 258 AETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
E ++ +ELLL+PEFLK+C+E +SVPI A+EN GHEF+IKHLNI+DPLKD+NNLGRSV
Sbjct: 262 TEAPYSNRNELLLTPEFLKRCKERFSVPIKAVENGGHEFSIKHLNILDPLKDSNNLGRSV 321
Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAF 377
SKGNFHRI+CALSYGAQRLGEIL LPG +G GLE FFINTL+RNG+G RPD VP+ AF
Sbjct: 322 SKGNFHRIKCALSYGAQRLGEILMLPGENMGAGLENFFINTLDRNGRGERPDALVPVPAF 381
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYV 437
G+G+SEA+DLSGDYD+YYNG+L GQ YH Y+LPV+ Q +PPSSPSQI WD L Q +
Sbjct: 382 GTGRSEASDLSGDYDNYYNGILQGQWYHSYSLPVSPQLTPPSSPSQIQQGCTWDTLSQLL 441
Query: 438 QGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEI 497
K N + Q GT VFVPR+P+ HPYA++V A+ STDE KSRGTGTYIP + P +
Sbjct: 442 WWKQNFLSQRGTNVFVPRVPYHHPYAAKVYATASSTDEMGKSRGTGTYIPHVSHHPCRDK 501
Query: 498 NRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGC---SLNVENGCSLNMENSCS 554
R +NP+SL + ++ RRK +K E P + C SL+ N EN
Sbjct: 502 MSCFRVKNPDSLTYSSINS-RRKAEKVEGGPEAKSDNECCLNLSLDQNKAREENSENGSC 560
Query: 555 LILSLEEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSPVVTTIMFGTYEHSLKPSV 614
LSL++FPLL + Q +S Q + S+ + +I FGT+ H P
Sbjct: 561 PDLSLDQFPLL-----------SYQASHRILQSPQAKECSNS-IRSIPFGTFRHIPSPLA 608
Query: 615 LPSPSARMKVSGASTTLDTEPGLTIMERQKQQE 647
LPS ++ TT + P + KQQ+
Sbjct: 609 LPSSVTGVQTESVPTTSEIIPAIRETGMLKQQK 641
>gi|255559667|ref|XP_002520853.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223539984|gb|EEF41562.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 655
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/634 (57%), Positives = 457/634 (72%), Gaps = 24/634 (3%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
+L+P SID+ELWL+AE+R QEIL +QP++ SE+KRK VI+YIQRLI ++ EV PFGS
Sbjct: 22 DLFPLSIDSELWLMAEKRAQEILWILQPSLASEQKRKVVIDYIQRLIKHHFATEVLPFGS 81
Query: 95 VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC 154
VPLKTYLPDGDIDLT L HQN+EEDL R +C IL E+Q+S E+KDV+Y+ AQVKIVKC
Sbjct: 82 VPLKTYLPDGDIDLTALSHQNMEEDLVREICNILTYEEQNSESEVKDVRYIQAQVKIVKC 141
Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
SV+NI VDISFNQMAGL ALCFLE+VD+LIGKDHL K S+ILIKAWC+YESRILGAH+GL
Sbjct: 142 SVKNISVDISFNQMAGLCALCFLEQVDQLIGKDHLLKCSIILIKAWCFYESRILGAHHGL 201
Query: 215 ISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIV 257
+STYALE++VL+ RFL+YY+TFDWDNYC++INGPVA+SSLPEI+
Sbjct: 202 LSTYALEILVLYIINAFHSSLPGPLAVLYRFLEYYSTFDWDNYCVTINGPVAVSSLPEIM 261
Query: 258 AETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
E+ N+G+ELLL PEFLK+C+E +SVPI A+EN GHEF+IKHLNI+DPLKDNNNLGRSV
Sbjct: 262 TESPYNNGNELLLCPEFLKRCKEKFSVPIKAVENGGHEFSIKHLNILDPLKDNNNLGRSV 321
Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAF 377
SKGNFHRI+CALSYGAQRLGEIL LPG +G GLE FFINTL+RNG+G RPD VP+ F
Sbjct: 322 SKGNFHRIKCALSYGAQRLGEILALPGENMGAGLEIFFINTLDRNGRGERPDTLVPVPTF 381
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYV 437
G+G+SEA DLSGDYD++Y+GLL GQ YH Y+LPV+ + +PPSSPSQI WD L Q +
Sbjct: 382 GTGRSEAFDLSGDYDNHYSGLLQGQWYHKYSLPVSPEMTPPSSPSQIQQSFTWDRLSQLL 441
Query: 438 QGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEI 497
+ K N + Q GT VFVPR+P HPYA +V A+ ++DE KS+GTGTYIP + P ++
Sbjct: 442 RCKQNFLSQRGTNVFVPRVPHRHPYAPKVYATASTSDEKGKSQGTGTYIPNVSHHPRKDM 501
Query: 498 NRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGC---SLNVENGCSLNMENSCS 554
+R++NP+SL H ++ RK + E P + C S + N EN
Sbjct: 502 MSWMRSKNPDSLTHSSLNS-PRKAEIVEGGPGAKSDNYCCLDLSFDQYKAGQENRENGSC 560
Query: 555 LILSLEEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSPVVTTIMFGTYEHSLKPSV 614
+LSL++FPLLP + S NQ +ST + S+ I GT+ H P
Sbjct: 561 TVLSLDQFPLLPSLEKSVTSEINQS-SHPILQSTHAKEYSTS-SRNIPTGTFRHIPFPVA 618
Query: 615 LPSP-SARMKVSGASTTLDTEPGLTIMERQKQQE 647
LPS + R SG STTL+ P + M KQQ+
Sbjct: 619 LPSSVTGRQTESGVSTTLEIMPASSEMGMLKQQK 652
>gi|224146203|ref|XP_002325920.1| predicted protein [Populus trichocarpa]
gi|222862795|gb|EEF00302.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 304/467 (65%), Positives = 363/467 (77%), Gaps = 20/467 (4%)
Query: 39 FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLK 98
ID ELWL+AE+R QEIL TIQP SE KR EVINYIQ LI Y+ +EVF FGSVPLK
Sbjct: 26 LQIDPELWLMAEKRTQEILYTIQPTFASEHKRMEVINYIQSLIKYYFTVEVFAFGSVPLK 85
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
TYLPDGDIDL VL HQN+EE+LAR VC +L+ E+ D F++ DVQY+ AQVK+VKCSV+N
Sbjct: 86 TYLPDGDIDLMVLSHQNMEEELARGVCTLLQREELDPEFQVNDVQYIHAQVKLVKCSVKN 145
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
I VDISFNQMAG SALCFLE+VD+LIG+DHLFKRS+ILIKAWC+YESRILGAH+GLISTY
Sbjct: 146 ISVDISFNQMAGPSALCFLEQVDQLIGQDHLFKRSIILIKAWCFYESRILGAHHGLISTY 205
Query: 219 ALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
AL+++ VL++FLDYY+ FDWDNYC+SINGP+ ISS P+ ++
Sbjct: 206 ALQILVLNIINVFHSSLPDPLAVLYKFLDYYSAFDWDNYCVSINGPIPISSFPQ--TDST 263
Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
N+G+E L+S EFL+ RE ++ P+ LEN HEF IKHLNIVDPLK +NNLGRSV+KGN
Sbjct: 264 HNNGNESLISQEFLRNFREKFAFPMKELENGAHEFPIKHLNIVDPLKSSNNLGRSVNKGN 323
Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGK 381
FHRIR ALSYGAQRLGEI+ LPG +G LEKFF+NTL+RNG+G RPD VP+ AFG+G+
Sbjct: 324 FHRIRGALSYGAQRLGEIIALPGEAMGGRLEKFFMNTLDRNGRGQRPDADVPVPAFGTGR 383
Query: 382 SEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKG 441
SEA+DL+GDYD YY+GLL+GQ YH Y LPV Q S PSSPSQI +SA DVL Q +Q K
Sbjct: 384 SEASDLNGDYDKYYSGLLHGQWYHSYALPVPPQPSSPSSPSQIKQKSARDVLPQLLQSKQ 443
Query: 442 NLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE 488
+++ Q GTEVF PR CHPYASQV + D KSRGTGTYIP+
Sbjct: 444 DILSQRGTEVFFPRQK-CHPYASQVHVAISGIDTMRKSRGTGTYIPD 489
>gi|42565972|ref|NP_191191.2| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
thaliana]
gi|30725328|gb|AAP37686.1| At3g56320 [Arabidopsis thaliana]
gi|110736147|dbj|BAF00045.1| hypothetical protein [Arabidopsis thaliana]
gi|332645988|gb|AEE79509.1| PAP/OAS1 substrate-binding domain-containing protein [Arabidopsis
thaliana]
Length = 603
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 270/532 (50%), Positives = 353/532 (66%), Gaps = 39/532 (7%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
IDA+ W++AEER EILCTIQPA+VS++ R E+I+Y++ LI + GIEVF FGSVPLKTY
Sbjct: 35 IDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKTY 94
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
LPDGDIDLTVL QN+++D +C L+NE+++S F DVQ++PAQVK++KC+++NI
Sbjct: 95 LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEERESEFHATDVQFIPAQVKVIKCNIRNIA 154
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
VDISFNQ AGL ALCFLE+VD+L G+DHLFKRS+IL+KAWCYYESRILGA+ GLISTYAL
Sbjct: 155 VDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGANTGLISTYAL 214
Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
++ VL++FLDYY +FDW+NYCIS+NGPV ISSLPE+ A + EN
Sbjct: 215 AVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSLPELTAASPEN 274
Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
G ELLL +FL+ C E+YS P A+++ G EF IKHLNIVDPLK +NNLG+SV++GN
Sbjct: 275 -GHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNNLGKSVTQGNVQ 333
Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
RIR A + GA++L ++L+LPG +G LEKFF N+LERNGKG R DV P+ AFG+G+SE
Sbjct: 334 RIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVNDPVTAFGTGRSE 393
Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNL 443
++LSGD+DS + L+YGQ YH ++LP T Q S + S WD++ V + N
Sbjct: 394 LSELSGDFDSSFGRLVYGQMYHGHSLPGTFQHGYIPVSSHL---SGWDIVRHLVTYRKNE 450
Query: 444 VYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRA 503
+ V PF P S K+RGTGTYIP+ ++ + R
Sbjct: 451 FHLRSLNVSTSMQPF--PLHSLPNG----CQNMRKTRGTGTYIPDMSQQVYSDRFRE-SG 503
Query: 504 RNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLN--VENGCSLNMENSC 553
S NH+ S ID D S C+L+ V C+ N C
Sbjct: 504 TGTSSTNHLEASS--EAIDN-------DGASSCCNLSGEVSTSCTSNKGEEC 546
>gi|356570171|ref|XP_003553264.1| PREDICTED: uncharacterized protein LOC100797780 [Glycine max]
Length = 644
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 288/582 (49%), Positives = 376/582 (64%), Gaps = 60/582 (10%)
Query: 12 DLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRK 71
D QE LL S S P L SID ELW +AEER QEIL TIQP ++SE RK
Sbjct: 2 DRQENLLPS----------SMPSQL--LSIDKELWQMAEERAQEILWTIQPNVLSEVNRK 49
Query: 72 EVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
+VI+Y+QRLI GYYG EV PFGSVPLKTYLPDGDIDLT L H++ EEDLA+ VC +L++
Sbjct: 50 DVIDYVQRLIRGYYGAEVLPFGSVPLKTYLPDGDIDLTALSHEDAEEDLAQAVCYVLQSG 109
Query: 132 DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFK 191
D D +++KD++Y+ AQV++VKC+V+NI VDISFNQMAG+ L FLE+VD+L+GK+H+FK
Sbjct: 110 D-DPEYQVKDIKYIRAQVRLVKCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFK 168
Query: 192 RSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR-----------------FLDYYNT 234
RS+ILIKAWCYYESR+LG H+GL+STYA+E++VL+ FLDYY +
Sbjct: 169 RSIILIKAWCYYESRLLGGHHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGS 228
Query: 235 FDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGH 294
FDWD+ +SI GP +SS PEI AET+E D E LL EFL+ R + S P A + H
Sbjct: 229 FDWDHNYVSIWGPKPLSSFPEI-AETLECDHGEFLLQKEFLRNYRNMCSFPSRATKTMTH 287
Query: 295 EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKF 354
EF +K +NI+DPL+++NNLGRSV+ + HR R ALSYGA+RL +ILTLPG +G LEKF
Sbjct: 288 EFPVKFMNILDPLRNDNNLGRSVNIASLHRFRFALSYGARRLKQILTLPGETMGAALEKF 347
Query: 355 FINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQ 414
F +TL+RNGKG R DV VP+ FG G+S + L GD +SY+ GL Y Q Y +Y LPVT
Sbjct: 348 FFSTLDRNGKGERADVDVPVSPFGIGRSIESVLCGDCESYFGGLQYVQLYRNYALPVTVH 407
Query: 415 FSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTD 474
S PSSPSQ D+L ++ YQ GT+V++P HP T+S +
Sbjct: 408 SSSPSSPSQD------DILASSTHQNWSMFYQGGTDVYIPAQTLYHP--------TYSLE 453
Query: 475 EGEKSRGTGTYIPERTRPPPIEI-NRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNL 533
KSRGTGTYIP+ ++ +P R R S + + K K +AE + S ++
Sbjct: 454 GRGKSRGTGTYIPDLNYNSYWDMRTKPSRPRRFASSKYNALPKSPPKKLQAEEVHSETDI 513
Query: 534 ESGCSLNVENGCSLNMENSCSLILSLEEFPLLPVT-KNLPPS 574
+ + +S LS E+FPLLP K++PP+
Sbjct: 514 NA-------------ISDSRLFELSSEDFPLLPCNCKDIPPT 542
>gi|356561857|ref|XP_003549193.1| PREDICTED: uncharacterized protein LOC100787145 [Glycine max]
Length = 684
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 276/536 (51%), Positives = 357/536 (66%), Gaps = 46/536 (8%)
Query: 12 DLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRK 71
D QE LL ++S P L SID ELW +AEER QEIL TI+P ++SE RK
Sbjct: 2 DRQENLL----------AFSMPSQL--LSIDEELWRMAEERAQEILWTIEPIVLSEVNRK 49
Query: 72 EVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
+VI+Y+QRLI GYYG EV PFGSVPLKTYLPDGDIDLT L H++ EEDLA+ VC IL++
Sbjct: 50 DVIDYVQRLIRGYYGAEVLPFGSVPLKTYLPDGDIDLTALSHEDAEEDLAQAVCNILQSG 109
Query: 132 DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFK 191
D D +++KD+QY+ AQV++VKC+V+NI VDISFNQMAG+ L FLE+VD+L+GK+H+FK
Sbjct: 110 D-DPEYQVKDIQYIRAQVRLVKCTVKNIAVDISFNQMAGICTLRFLEQVDQLVGKNHIFK 168
Query: 192 RSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR-----------------FLDYYNT 234
S+ILIKAWCYYESR+LG H+GL+STYA+E++VL+ FLDYY +
Sbjct: 169 HSIILIKAWCYYESRLLGGHHGLLSTYAVEILVLYIINRFHSSVRGPLEVLYIFLDYYGS 228
Query: 235 FDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGH 294
FDWD+ +SI GP +SSLPEI AET E D E LL EFL+ R + S P A E H
Sbjct: 229 FDWDHNYVSIWGPKPLSSLPEI-AETPECDQGEFLLQKEFLRNYRNMCSFPSRASETMTH 287
Query: 295 EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKF 354
EF +K +NI+DPL+++NNLGRSV+ N HR+R ALSYGA+RL +ILTLPG +G LEKF
Sbjct: 288 EFPVKFMNILDPLRNDNNLGRSVNIANLHRVRFALSYGARRLKQILTLPGENMGAALEKF 347
Query: 355 FINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQ 414
F +TL+RNGKG R DV VP+ FG G+SE + L GD +SY GL Y Q Y +Y +P+
Sbjct: 348 FFSTLDRNGKGERADVAVPVSPFGIGRSEESVLRGDCESYCGGLQYVQLYRNYAMPIAVH 407
Query: 415 FSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTD 474
S PSSPSQ D+L Q ++ YQ GT+V++P HP T++ +
Sbjct: 408 SSSPSSPSQD------DILAPSTQQNWSMFYQGGTDVYIPGQTLYHP--------TYNLE 453
Query: 475 EGEKSRGTGTYIPERTRPPPIEIN-RPVRARNPESLNHVPVSKLRRKIDKAERLPS 529
KSRGTGTYIP+ +++ +P R R S + + K K +AE + S
Sbjct: 454 GRGKSRGTGTYIPDLNYNSYWDMHTKPSRPRRFASSKYNALPKSPSKKQQAEEVHS 509
>gi|7572930|emb|CAB87431.1| putative protein [Arabidopsis thaliana]
Length = 614
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 270/543 (49%), Positives = 353/543 (65%), Gaps = 50/543 (9%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
IDA+ W++AEER EILCTIQPA+VS++ R E+I+Y++ LI + GIEVF FGSVPLKTY
Sbjct: 35 IDADSWMIAEERAHEILCTIQPALVSDRSRNEIIDYVRTLIMSHEGIEVFSFGSVPLKTY 94
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
LPDGDIDLTVL QN+++D +C L+NE+++S F DVQ++PAQVK++KC+++NI
Sbjct: 95 LPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEERESEFHATDVQFIPAQVKVIKCNIRNIA 154
Query: 161 VDISFNQMAGLSALCFLEK-----------VDRLIGKDHLFKRSVILIKAWCYYESRILG 209
VDISFNQ AGL ALCFLE+ VD+L G+DHLFKRS+IL+KAWCYYESRILG
Sbjct: 155 VDISFNQTAGLCALCFLEQVLSAIQNQAPEVDQLFGRDHLFKRSIILVKAWCYYESRILG 214
Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
A+ GLISTYAL ++ VL++FLDYY +FDW+NYCIS+NGPV ISS
Sbjct: 215 ANTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISS 274
Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
LPE+ A + EN G ELLL +FL+ C E+YS P A+++ G EF IKHLNIVDPLK +NN
Sbjct: 275 LPELTAASPEN-GHELLLDEKFLRNCVELYSAPTKAVDSNGLEFPIKHLNIVDPLKYSNN 333
Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
LG+SV++GN RIR A + GA++L ++L+LPG +G LEKFF N+LERNGKG R DV
Sbjct: 334 LGKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVND 393
Query: 373 PIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDV 432
P+ AFG+G+SE ++LSGD+DS + L+YGQ YH ++LP T Q S + S WD+
Sbjct: 394 PVTAFGTGRSELSELSGDFDSSFGRLVYGQMYHGHSLPGTFQHGYIPVSSHL---SGWDI 450
Query: 433 LCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRP 492
+ V + N + V PF P S K+RGTGTYIP+ ++
Sbjct: 451 VRHLVTYRKNEFHLRSLNVSTSMQPF--PLHSLPNG----CQNMRKTRGTGTYIPDMSQQ 504
Query: 493 PPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLN--VENGCSLNME 550
+ R S NH+ S ID D S C+L+ V C+ N
Sbjct: 505 VYSDRFRE-SGTGTSSTNHLEASS--EAIDN-------DGASSCCNLSGEVSTSCTSNKG 554
Query: 551 NSC 553
C
Sbjct: 555 EEC 557
>gi|356518940|ref|XP_003528133.1| PREDICTED: uncharacterized protein LOC100815787 [Glycine max]
Length = 680
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 282/604 (46%), Positives = 379/604 (62%), Gaps = 77/604 (12%)
Query: 10 MEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKK 69
M+D QE LLT S P L SID ELW +AE+R+QEIL TI+P ++SE
Sbjct: 1 MDDKQENLLT----------LSLPSQL--VSIDEELWRMAEDRVQEILWTIEPNVLSEVN 48
Query: 70 RKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILE 129
RK+VI+Y+QRLI GYYG +V PFGSVPLKTYLPDGD+DLT L H++ EEDLA+ +C IL+
Sbjct: 49 RKDVIDYVQRLIKGYYGAKVLPFGSVPLKTYLPDGDVDLTTLIHEDAEEDLAQAICNILK 108
Query: 130 NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHL 189
+ D DS +++KD+QY+ AQV++VKC+V+NI VDISFNQMAG+ L FLE+VD+L+GK+H+
Sbjct: 109 SGD-DSEYQVKDIQYIRAQVRLVKCTVKNIAVDISFNQMAGIYTLRFLEQVDQLVGKNHI 167
Query: 190 FKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR-----------------FLDYY 232
FKRS+ILIKAWCYY+SR+LG HYGL+STYA+E++VL+ FLDYY
Sbjct: 168 FKRSIILIKAWCYYDSRLLGGHYGLLSTYAVEILVLYIINRFHSVVRGPLEVLYIFLDYY 227
Query: 233 NTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENT 292
++FDWD+ +SI GP ++SSLPEI T E D E LL EFL + + S P A E
Sbjct: 228 SSFDWDHNYVSIWGPKSLSSLPEI---TPECDQGEFLLQKEFLTNYKNMCSYPTRASETL 284
Query: 293 GHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLE 352
HEF +K +NI+DPL+++NNLGRSVS + HR+R A +Y AQ+L +I TLPG +G LE
Sbjct: 285 THEFPVKFMNILDPLRNDNNLGRSVSIASLHRLRFAFAYSAQKLKQIFTLPGENMGAALE 344
Query: 353 KFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVT 412
KFF +TLERNGKG R DV VP+ FG+G+ E L+GD ++Y G Y Q YH+Y +PV
Sbjct: 345 KFFFSTLERNGKGERADVGVPVAPFGTGRFEEPVLNGDCENYCGGSQYVQLYHNYAMPVI 404
Query: 413 AQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFS 472
S + P D+L Q +L Y GT+V++PR HP T++
Sbjct: 405 VDSSSLALPD--------DILASSTQQNWSLFYHGGTDVYIPRHTLYHP--------TYN 448
Query: 473 TDEGEKSRGTGTYIPERTRPP--PIEINRPVRARNPESL-NHVPVSKLRRKIDKAERLPS 529
+ G KSRGTGTYIP+ + P R P S N P S+L++K + E + S
Sbjct: 449 LEGGGKSRGTGTYIPDLNYNSYWDMSTKAPRPRRFPSSKHNAFPRSRLKKK--QVEEVHS 506
Query: 530 RDNLESGCSLNVENGCSLNMENSCSLILSL--EEFPLLPV-TKNLPPSATN-----QQFD 581
++ + NS S + E+FP+LP +K +PP+ ++F
Sbjct: 507 EIDINA---------------NSDSRLFEFLNEDFPVLPCNSKAIPPTQAEESTPLEKFH 551
Query: 582 QSTD 585
TD
Sbjct: 552 SETD 555
>gi|297820390|ref|XP_002878078.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323916|gb|EFH54337.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 602
Score = 503 bits (1295), Expect = e-139, Method: Compositional matrix adjust.
Identities = 251/478 (52%), Positives = 330/478 (69%), Gaps = 35/478 (7%)
Query: 31 SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
++ P I+ E W++AEER EILCTIQPA+VS+K R E+I+Y++ LI + GIEVF
Sbjct: 25 TAQPKADSLLIEEESWMIAEERAHEILCTIQPALVSDKSRNEIIDYVRTLIKSHDGIEVF 84
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVK 150
FGSVPLKTYLPDGDIDLTVL QN+++D +C L+NE+++S F DVQ++PAQVK
Sbjct: 85 SFGSVPLKTYLPDGDIDLTVLTKQNMDDDFYGQLCSRLQNEERESEFHATDVQFIPAQVK 144
Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
++KC+++NI VDISFNQ AGL ALCFLE+VD+L G+DHLFKRS+IL+KAWCYYESRILGA
Sbjct: 145 VIKCNIRNIAVDISFNQTAGLCALCFLEQVDQLFGRDHLFKRSIILVKAWCYYESRILGA 204
Query: 211 HYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSL 253
+ GLISTYAL ++ VL++FLDYY +FDW+NYCIS+NGPV ISSL
Sbjct: 205 NTGLISTYALAVLVLYIINLFHSSLSGPLAVLYKFLDYYGSFDWNNYCISVNGPVPISSL 264
Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNL 313
PE+ A + EN G ELLL +FL+ C E++S P A+++ G +F IKHLNIVDPLK +NNL
Sbjct: 265 PELTAASPEN-GHELLLDEKFLRNCVELFSAPTKAVDSNGLDFPIKHLNIVDPLKYSNNL 323
Query: 314 GRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
G+SV++GN RIR A + GA++L ++L+LPG +G LEKFF N+LERNGKG R DV P
Sbjct: 324 GKSVTQGNVQRIRHAFTLGARKLRDVLSLPGDTMGWRLEKFFRNSLERNGKGQRQDVNDP 383
Query: 374 IHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVL 433
+ AFG+G+SE ++LSGD++ Y+ L+YGQ H Y+LP T Q SQ+ S WD++
Sbjct: 384 VTAFGTGRSELSELSGDFEGYFGRLVYGQI-HGYSLPGTFQHGYIPVSSQVTDPSGWDIV 442
Query: 434 CQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTR 491
V + N Y V C RGTGTYIP+ ++
Sbjct: 443 RHLVTYRKNEFYLKSLNVSTSLPNGCQ----------------NMRRGTGTYIPDMSQ 484
>gi|356570173|ref|XP_003553265.1| PREDICTED: uncharacterized protein LOC100798838 [Glycine max]
Length = 626
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 288/597 (48%), Positives = 369/597 (61%), Gaps = 71/597 (11%)
Query: 10 MEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKK 69
M+D QE LL S S P L SID ELW + EER QEIL TIQP ++SE
Sbjct: 1 MDDKQENLLPS----------SLPSQL--LSIDEELWRMIEERAQEILWTIQPNVLSEVN 48
Query: 70 RKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILE 129
RK +I+Y+QRLI Y G +VFPFGS PLKTYLPDGDIDLT L H++ EEDL R VC IL+
Sbjct: 49 RKNIIDYVQRLIGEYCGAQVFPFGSFPLKTYLPDGDIDLTALSHEDEEEDLVRAVCNILK 108
Query: 130 NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHL 189
+ED DS +++KD++++ AQV++VKC+V+NIPVDISFNQMAGL L FLE+VD+L+GK+H+
Sbjct: 109 SED-DSEYQVKDIEHIRAQVQVVKCTVKNIPVDISFNQMAGLYTLFFLEQVDQLVGKNHI 167
Query: 190 FKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFLDYY 232
FKRSVILIK+WCYYESRILGAH GL+STYA E++VL+ FLDYY
Sbjct: 168 FKRSVILIKSWCYYESRILGAHCGLLSTYATEILVLYIINRFHSSVRGPLAVLYVFLDYY 227
Query: 233 NTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENT 292
++FDW++ ISI GP +SSLPEIV +T E D E LL EFLK R++ S A E
Sbjct: 228 SSFDWEHNYISIWGPKVLSSLPEIV-DTPEYDQGEFLLQKEFLKNYRDMCSSKAKASETM 286
Query: 293 GHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLE 352
+ F +KH+NI+DPL++NNNLGRSV+ GN RIR A S G+QRL +ILTL G +G LE
Sbjct: 287 TNAFPVKHMNILDPLRNNNNLGRSVNIGNLSRIRLAFSLGSQRLKQILTLAGENMGAALE 346
Query: 353 KFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQCYHDYTLPVT 412
KFF NTLE NGKG R DV VP+ FG+G+ E + LSGDYDSYY Y Q Y +Y +PVT
Sbjct: 347 KFFFNTLENNGKGERADVGVPVSPFGTGRFEESVLSGDYDSYYGWSQYVQQYPNYAMPVT 406
Query: 413 AQFSP-PSSPSQ-----IPSRSAW---DVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYA 463
S PS P Q + ++ W D Q ++ YQ GT V++P HP
Sbjct: 407 TVPSHFPSPPHQDNILALSTQLNWSEGDFPALSAQQNWSMFYQSGTNVYIPGQTLFHP-- 464
Query: 464 SQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRRKIDK 523
T+S +G +SRGTGTYIP+ L++ + K ++
Sbjct: 465 ------TYSLGKGGRSRGTGTYIPD--------------------LSYNCYWDIHAKANR 498
Query: 524 AERLP-SRDNLESGCSLNVENGCSLNMENSCSLILSLEEFPLLP-VTKNLPPSATNQ 578
R P ++ N S + + E LS E+FPL+P V K PP+ +
Sbjct: 499 PRRFPYAKHNALLKSSPKSQQEKEFHTETK-PFELSDEDFPLIPNVYKATPPTQAQE 554
>gi|356507300|ref|XP_003522406.1| PREDICTED: uncharacterized protein LOC100813790 [Glycine max]
Length = 692
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 268/569 (47%), Positives = 357/569 (62%), Gaps = 70/569 (12%)
Query: 39 FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLK 98
SID ELW +AE+R+QEIL TIQP ++SE RK+VI+Y+QRLI YYG EV PFGSVPLK
Sbjct: 18 LSIDEELWQMAEDRVQEILWTIQPNVLSEVNRKDVIDYVQRLIRDYYGAEVLPFGSVPLK 77
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
TYLPDGD+DLT L H++ E+DLA+ +C +L++ D DS +++KD+QY+ AQV++VKC+V+N
Sbjct: 78 TYLPDGDVDLTTLIHEDAEDDLAQAICNVLKSGD-DSEYQVKDIQYIRAQVRLVKCTVKN 136
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
I VDISFNQMAG+ L FLE+VD+L+GK+H+FKRS+ILIK WCYY+SR+LG H+GL+STY
Sbjct: 137 IAVDISFNQMAGIYTLRFLEQVDQLVGKNHIFKRSIILIKGWCYYDSRLLGGHHGLLSTY 196
Query: 219 ALEMMVLHR-----------------FLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
A+E++VL+ FLDYY +FDWD+ ISI GP ++SSLPEI AE
Sbjct: 197 AVEILVLYIINRFHSSVRGPLEVLYIFLDYYGSFDWDHNYISIWGPKSLSSLPEI-AEAP 255
Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
E D E LL EFL + + S P A E HEF +K +NI+DPL+++NNLGRSVS +
Sbjct: 256 ECDQGEFLLQKEFLGNYKNMCSYPAGASETLTHEFPVKFMNILDPLRNDNNLGRSVSIAS 315
Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGK 381
HR+R A SYG Q+L +I TLPG +G LEKFF +TL RNGKG R DV VP+ FG+G+
Sbjct: 316 LHRLRFAFSYGVQKLKQIFTLPGENMGAALEKFFSSTLNRNGKGERADVSVPVAPFGTGR 375
Query: 382 SEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQI--PSRSAWDVLCQYVQG 439
E L GD ++Y GL Y Q YH+Y +PV + P+SP I P++ W
Sbjct: 376 FEEPVLYGDCENYCGGLQYVQLYHNYAMPVIVDSNSPASPDDILAPTQQNW--------- 426
Query: 440 KGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINR 499
+L YQ T+V++P HP T+S + G KSRGTGTYIP+
Sbjct: 427 --SLFYQGATDVYIPAQKLYHP--------TYSLEGGGKSRGTGTYIPD----------- 465
Query: 500 PVRARNPESLNHVPVSKLRRKIDKAERLPS-------RDNLESGCSLNVENGCSLNMENS 552
LN+ +R K + R S R L+ V + + NS
Sbjct: 466 ---------LNYNSYWDMRTKASRPRRFASSKYNAFPRSPLKKQQVEEVHSEIDIINANS 516
Query: 553 CSLI--LSLEEFPLLPVTKN-LPPSATNQ 578
S + S E+FP+ P N +PP+ +
Sbjct: 517 DSRLFEFSSEDFPVRPCNSNAIPPTQAQE 545
>gi|356518706|ref|XP_003528019.1| PREDICTED: uncharacterized protein LOC100788864 [Glycine max]
Length = 721
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 265/559 (47%), Positives = 358/559 (64%), Gaps = 62/559 (11%)
Query: 39 FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLK 98
SID ELW + EERIQEIL TIQP ++SE RK V+NY+Q+LI YY +VFPFGS PLK
Sbjct: 18 LSIDEELWRMTEERIQEILWTIQPNVLSEMNRKNVLNYVQKLIGDYYDTKVFPFGSFPLK 77
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
TYLPDGDIDLTV+ H++ EE+LA+ +C ILE + D ++++KD++++ AQV++VKC+V+N
Sbjct: 78 TYLPDGDIDLTVINHEDEEENLAKEICTILECAN-DLIYQVKDIEHIRAQVQVVKCTVKN 136
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
IP+DI+FNQM GL LCFLE+VD+L GK+H+FKRS+ILIKAWC Y+SR+LG+ +GL+STY
Sbjct: 137 IPIDITFNQMTGLCTLCFLEQVDQLAGKNHIFKRSIILIKAWCCYDSRLLGSQHGLLSTY 196
Query: 219 ALEMMVLHR-----------------FLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
A E++VL+ F DYY TFDW++ +SI GP A+SSLPEIV +
Sbjct: 197 ATEVLVLYIINRFHASVRDPLEVLYIFFDYYGTFDWEHNYMSIWGPKALSSLPEIV-DRP 255
Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
E D DE LL EFL R+I+S + E T + F +KH+NI+DPL+++NNLGRSV++ +
Sbjct: 256 ECDQDEFLLHKEFLINYRDIFSSKAKSSETTTNTFPVKHINILDPLRNDNNLGRSVNEAS 315
Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGK 381
FHRIR ALSYGA++ +I TL G +G LEKFF +TL+RNGKG R DV VP+ FG+G+
Sbjct: 316 FHRIRFALSYGAKKFKQIFTLAGENMGEALEKFFFDTLQRNGKGERADVDVPVSPFGTGR 375
Query: 382 SEAADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFS-PPSSPSQ-----IPSRSAW---DV 432
E + L GD DSYY G Y Q Y +Y +P T S PSSPSQ + ++ W D+
Sbjct: 376 YEKSVLDGDCDSYYGGSQYAQQYPNYAMPTTTIHSNSPSSPSQDDILALSTQQNWSEGDL 435
Query: 433 LCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRP 492
L Q ++ YQ ++P HP T++ +EG +SRGTGTYIP+
Sbjct: 436 LALSTQQNWSMYYQSVYNGYIPGQTLFHP--------TYNFEEGGRSRGTGTYIPD---- 483
Query: 493 PPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPS--RDNLESGCSLNVEN---GCSL 547
LN+ +R K+++ R PS + L C E
Sbjct: 484 ----------------LNYNCYWDIRAKVNRPRRFPSAKHNALLKSCPKRKEAEEVHFET 527
Query: 548 NM-ENSCSLILSLEEFPLL 565
NM +NS LS E+FPL+
Sbjct: 528 NMNDNSKPFELSNEDFPLI 546
>gi|449449962|ref|XP_004142733.1| PREDICTED: uncharacterized protein LOC101207419 [Cucumis sativus]
Length = 898
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 228/369 (61%), Positives = 269/369 (72%), Gaps = 19/369 (5%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P I + W AEE Q I+ +QP +VSE++RK VI+Y+QRLI G EVFPFGSVPL
Sbjct: 36 PTPIGVDYWRRAEEATQAIISQVQPTVVSERRRKAVIDYVQRLIRGRLRCEVFPFGSVPL 95
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDLT LG NVEE LA VC +L +EDQ+ F +KDVQ + A+VK+VKC
Sbjct: 96 KTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQNGAAEFVVKDVQLIRAEVKLVKCL 155
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
VQNI VDISFNQ+ GL LCFLEK+DR IGKDHLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 156 VQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 215
Query: 216 STYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE + VL++FLDY++ FDWDNYCIS+NGPV ISSLPE+VA
Sbjct: 216 STYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGPVRISSLPELVA 275
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
ET +N G +LLLS +FL+ C E +SVP E F IKHLNIVDPLK+NNNLGRSVS
Sbjct: 276 ETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPLKENNNLGRSVS 335
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFG 378
KGNF+RIR A SYGA++LG IL+ P + + KFF NTL+R+G G RPDVQ P G
Sbjct: 336 KGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRHGGGQRPDVQDPAPVSG 395
Query: 379 SGKSEAADL 387
+S AA L
Sbjct: 396 GYESCAALL 404
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 86/173 (49%), Gaps = 15/173 (8%)
Query: 343 PGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQ 402
P H + + E F+ ++ R + L +V P AF + +DL+GDY+S+ N L G+
Sbjct: 600 PLHSVSLSSEDFYPSS--RGYRFLTSNVGPP-EAFNA----LSDLNGDYESHCNSLQIGR 652
Query: 403 CYHDYTLPVTA-QFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHP 461
Y++Y L A PP PSQ P+++ WD++ + VQ K N Q + + R P +P
Sbjct: 653 WYYEYALSAAALSPIPPPLPSQYPNKNPWDIIRRSVQVKQNAFAQINSNGLLAR-PAFYP 711
Query: 462 YASQV--RASTFSTDEGEKSRGTGTYIPE----RTRPPPIEINRPVRARNPES 508
S + +T + +E K RGTGTY P R RP V R+P +
Sbjct: 712 MPSPILPGGATLAMEEMPKPRGTGTYFPNMNHYRDRPASARGRNQVSVRSPRN 764
>gi|359486339|ref|XP_002274554.2| PREDICTED: uncharacterized protein LOC100253615 [Vitis vinifera]
Length = 755
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 270/606 (44%), Positives = 361/606 (59%), Gaps = 60/606 (9%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
N + S+ AE W + + IQEILC IQP IVSE++RKE+I+Y+QRLI +G EV PFGS
Sbjct: 21 NPHLMSVTAECWSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEVLPFGS 80
Query: 95 VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ-DSVFEIKDVQYVPAQVKIVK 153
+PLKTYLPDGDIDLT L +N EED AR VC +LE E Q S F ++D+ Y+ A+VKIVK
Sbjct: 81 MPLKTYLPDGDIDLTALCPENDEEDFARDVCTLLEGERQMGSEFRVEDISYIRAKVKIVK 140
Query: 154 CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYG 213
C VQ+I VDISFNQ GLS LCFLE++D LIGKDHLFKRSVILIKAWCYYE RILG+H G
Sbjct: 141 CMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCG 200
Query: 214 LISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
L+STYALE++ VL+RFLDYY+TFDW+ + +S+ GPV+ISSL
Sbjct: 201 LLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSLLTG 260
Query: 257 VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
ET + LL++ EFL C+E ++V I A E T F +KH+NI DPL+D NNLGRS
Sbjct: 261 APETADK---PLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRS 317
Query: 317 VSKGNFHRIRCALSYGAQRLGEI-LTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIH 375
+S GN +R R A+S GAQRL EI L LP + GL++FF NTL+RNG G D + +
Sbjct: 318 ISLGNSYRFRYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGAD-EGDLV 376
Query: 376 AFGSGKSEAADLSGDYDSYYNGLLYGQCYHDY---------TLPVTAQ---FSPPSSP-- 421
FG G S+ +L GDY + L YGQ YH +L V Q F+ + P
Sbjct: 377 PFGPGSSKFCNLQGDYYGCLSNLHYGQFYHANLSSFSLHPGSLDVPFQWMYFNQSNVPWG 436
Query: 422 ------SQIPSRSAWDVLC---QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFS 472
S +PS++ + Q+ + + N+ +VP LPFC+ ASQ+ +
Sbjct: 437 YTNHFASMLPSQNTFQFYSPSFQFEEQESNM------NAYVPSLPFCYETASQLSGLAYH 490
Query: 473 TDEGEKSRGTGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDN 532
+E + +GTGT+IP + ++NR ++PE H P S + + +R P +
Sbjct: 491 FEE-KVPQGTGTFIPNVAQIFHNDLNRWEHGQDPE---HEPRSGMPQTRWAVKRNPV--S 544
Query: 533 LESGCSLNVENGCSLNMENSCSLILSL--EEFPLLPVTKNLPPSATNQQFDQSTDKSTQV 590
+ S L E N E + ++ +E ++ +KN PS +Q QS +S
Sbjct: 545 MASSVLLEPEQEGGKNSEPTLQVLSPKPEDEGGMVSKSKNQAPSLKPEQERQSNQESKAQ 604
Query: 591 DDVSSP 596
D V P
Sbjct: 605 DLVLKP 610
>gi|224145449|ref|XP_002325647.1| predicted protein [Populus trichocarpa]
gi|222862522|gb|EEF00029.1| predicted protein [Populus trichocarpa]
Length = 533
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 243/490 (49%), Positives = 312/490 (63%), Gaps = 33/490 (6%)
Query: 27 SSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG 86
S S+SS P+ P+SI E W AEE +EI+ I P + S KRK++I Y+QRLI G
Sbjct: 41 SPSFSSNPD--PWSIVEENWERAEEFTREIVYRIHPTVESNFKRKQIIGYVQRLIKSSLG 98
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQY 144
EVFP+GSVPLKTYLPDGDIDLT + +EE L + +L E+ +DS FE+KDV
Sbjct: 99 FEVFPYGSVPLKTYLPDGDIDLTSISSPAIEEALVSDIHAVLRREELNEDSTFEVKDVHC 158
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
+ A+VK++KC VQN VDISFNQ+ GL LCFLE+VDRL+GK+HLFKRS+ILIKAWCYYE
Sbjct: 159 IDAEVKLIKCIVQNTVVDISFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYE 218
Query: 205 SRILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGP 247
SRILGAH+GLISTYALE + VL+RFL+Y++ FDW+NYCIS+NGP
Sbjct: 219 SRILGAHHGLISTYALETLILYIFHLFHCSLNGPLAVLYRFLEYFSKFDWENYCISLNGP 278
Query: 248 VAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
V SSLP IVAE +EN ELLLS EFLK C + +SVP E F KHLNIVDPL
Sbjct: 279 VCKSSLPNIVAEPLENGQGELLLSDEFLKDCADRFSVPSRKPEMNSRPFPQKHLNIVDPL 338
Query: 308 KDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLR 367
K+NNNLGRSV++GNF RIR A YGA++LG+IL LP + L+ FF NTL+R+G
Sbjct: 339 KENNNLGRSVNRGNFFRIRSAFKYGARKLGQILLLPKERIADELKIFFANTLDRHGSDYW 398
Query: 368 PDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQ-CY-HDYTLPVTAQFSPPSSPSQIP 425
+V A G+ S+ + GD++ + L Y Q C+ H + P+ P PS +P
Sbjct: 399 TEVGNSELASGARSSDNSVSRGDHNGHLQSLAYSQYCHMHAVSAPI------PPCPSMLP 452
Query: 426 ---SRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGT 482
+++ W+ + Q +Q K N Q T +C RA+T ++E + RGT
Sbjct: 453 LSENKNRWETVQQSLQLKQNGHSQMNTNHIFGTQLYCVNPGGPFRAAT-DSEEKKIRRGT 511
Query: 483 GTYIPERTRP 492
GTYIP P
Sbjct: 512 GTYIPNMVFP 521
>gi|297735556|emb|CBI18050.3| unnamed protein product [Vitis vinifera]
Length = 824
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/359 (60%), Positives = 262/359 (72%), Gaps = 20/359 (5%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P +I A W AE +QEI+C +QP VSE++RKEV++Y+Q LI G EVFPFGSVPL
Sbjct: 31 PPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPL 90
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDLT G VE+ LA V +LE EDQ+ F +KDVQ + A+VK+VKC
Sbjct: 91 KTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCL 150
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
VQNI VDISFNQ+ GL LCFLE++DRLIGKDHLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210
Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE +VL+ +FLDY++ FDWDNYC+S+NGPV ISSLPE++A
Sbjct: 211 STYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIA 270
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
ET EN G + LL+ + L+ C + +SVP LE F KH NIVDPLK+NNNLGRSVS
Sbjct: 271 ETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVS 330
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ-VPIHA 376
KGNF+RIR A +YGA++LG IL P + L KFF NTLER+G+G RPDV +P+ A
Sbjct: 331 KGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDVDLIPLDA 389
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 57/113 (50%), Gaps = 4/113 (3%)
Query: 379 SGKSEA----ADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
SG EA ADLSGDYDS++N L YG +DY A P + PSQ S ++WD +
Sbjct: 554 SGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ 613
Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
Q + N+ Q +PR PF + + F +E K RGTGTY P
Sbjct: 614 QSAHIRRNIFPQITANGIIPRPPFYPLNPPMISGTGFGVEEMPKPRGTGTYFP 666
>gi|359481238|ref|XP_002266958.2| PREDICTED: uncharacterized protein LOC100258499 [Vitis vinifera]
Length = 884
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/354 (60%), Positives = 259/354 (73%), Gaps = 19/354 (5%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P +I A W AE +QEI+C +QP VSE++RKEV++Y+Q LI G EVFPFGSVPL
Sbjct: 31 PPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCEVFPFGSVPL 90
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDLT G VE+ LA V +LE EDQ+ F +KDVQ + A+VK+VKC
Sbjct: 91 KTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIHAEVKLVKCL 150
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
VQNI VDISFNQ+ GL LCFLE++DRLIGKDHLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLI 210
Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE +VL+ +FLDY++ FDWDNYC+S+NGPV ISSLPE++A
Sbjct: 211 STYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIA 270
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
ET EN G + LL+ + L+ C + +SVP LE F KH NIVDPLK+NNNLGRSVS
Sbjct: 271 ETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVS 330
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
KGNF+RIR A +YGA++LG IL P + L KFF NTLER+G+G RPDV +
Sbjct: 331 KGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPDVDL 384
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 57/113 (50%), Gaps = 4/113 (3%)
Query: 379 SGKSEA----ADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
SG EA ADLSGDYDS++N L YG +DY A P + PSQ S ++WD +
Sbjct: 614 SGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ 673
Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
Q + N+ Q +PR PF + + F +E K RGTGTY P
Sbjct: 674 QSAHIRRNIFPQITANGIIPRPPFYPLNPPMISGTGFGVEEMPKPRGTGTYFP 726
>gi|297823987|ref|XP_002879876.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
lyrata]
gi|297325715|gb|EFH56135.1| hypothetical protein ARALYDRAFT_903345 [Arabidopsis lyrata subsp.
lyrata]
Length = 516
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 225/468 (48%), Positives = 303/468 (64%), Gaps = 50/468 (10%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
I ELWL+AEER QEIL IQP +SE+ R E+IN++Q L+ GIEVF FGSVPLKTY
Sbjct: 28 IGEELWLIAEERAQEILFAIQPMYLSERSRNEIINHLQTLMRERLGIEVFLFGSVPLKTY 87
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
LPDGDIDLTVL +EE+ A+ + ILE E +S F++ DVQY+ AQVK++KC+++N+
Sbjct: 88 LPDGDIDLTVLTPYGMEENCAKALRNILEAERGESDFQVTDVQYIHAQVKVIKCTIRNVA 147
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
+DISFNQMAGLSALCFLE+VDR G+DHLFKRS+ILIKAWC+YESRILGA+ GLISTYAL
Sbjct: 148 LDISFNQMAGLSALCFLEQVDRAFGRDHLFKRSIILIKAWCFYESRILGANNGLISTYAL 207
Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
++ VL++F+D+Y +FDW+NYCI++ G V ISS P+I E
Sbjct: 208 AILVLNIVNMSYSSVSGPLAVLYKFMDFYGSFDWENYCITVTGLVPISSFPDI----TET 263
Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
E+ L +F ++C E YS P +E F +KH NI+DPLK +NNLGRSVS+GN
Sbjct: 264 RNHEVFLDEKFFRECIESYSGPANVVEANRKYFPVKHYNILDPLKHSNNLGRSVSEGNAI 323
Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
R+R GAQ+L ++LT PG +G LE FF N+L+RNGKG R DV+ P+ FG+GK++
Sbjct: 324 RLRHCFRRGAQKLRDVLTFPGETVGWKLEDFFGNSLDRNGKGQRQDVEEPVIPFGTGKAD 383
Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSR---SAWDVLCQYVQGK 440
++L GD+ Y+ +YG+ +H +L + IP S+W+ + ++ +
Sbjct: 384 YSNLRGDFYGYFLCFIYGKWFHRESL-----------HNWIPQGLDISSWETVRWFMTRR 432
Query: 441 GNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE 488
N VF R +++ S KS+GTGTYIPE
Sbjct: 433 RN--------VFNGR-------NLNGGSTSTSMPNKRKSKGTGTYIPE 465
>gi|30688308|ref|NP_850331.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|145330711|ref|NP_001078031.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|186506897|ref|NP_001118485.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|186506900|ref|NP_001118486.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|60547743|gb|AAX23835.1| hypothetical protein At2g40520 [Arabidopsis thaliana]
gi|330254746|gb|AEC09840.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254747|gb|AEC09841.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254748|gb|AEC09842.1| nucleotidyltransferase protein [Arabidopsis thaliana]
gi|330254749|gb|AEC09843.1| nucleotidyltransferase protein [Arabidopsis thaliana]
Length = 502
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 219/468 (46%), Positives = 302/468 (64%), Gaps = 41/468 (8%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
I+AE+WL+AE R QEILC IQP ++E+ R ++I+ +Q L+ GIEV+ FGS+PLKTY
Sbjct: 28 IEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKTY 87
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
LPDGDIDLTVL H EED AR VC +LE E +S ++ VQYV A+VK++KCS++++
Sbjct: 88 LPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGNSDLQVTGVQYVQAKVKVIKCSIRDVA 147
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
DISFNQ+AGL ALCFLE+VD+ G+DHLFK+S+IL+KAWC+YESRILGA+ GLISTYAL
Sbjct: 148 FDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLISTYAL 207
Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
++ VL++F++YY +FDW NYC+++ GPV ISSLP+I E
Sbjct: 208 AILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDI----TET 263
Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
E+ L +F ++C E+YS +E + F +K+ NI+DPLK +NNLGRSV+KGN
Sbjct: 264 GNHEVFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTKGNMV 323
Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
R+R G Q+L ++LTLPG +G LEKFF +LERNGKG R DV+ P+ AFG+G ++
Sbjct: 324 RLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPVVAFGTGAAD 383
Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNL 443
+ L GD+D Y N L+YG+ +H +L + PPS + ++W + Y+ N
Sbjct: 384 YSQLKGDFDRYRNNLIYGKWFHGESL---HNWLPPSLDT-----TSWANISFYMSRVRN- 434
Query: 444 VYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTR 491
G P P Q + +S GTGTYIP ++
Sbjct: 435 ----GFNGRNQEGPTSMPNIKQ-------SPNNRRSNGTGTYIPGMSK 471
>gi|21805733|gb|AAM76764.1| hypothetical protein [Arabidopsis thaliana]
Length = 502
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 218/468 (46%), Positives = 302/468 (64%), Gaps = 41/468 (8%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
I+AE+WL+AE R QEILC +QP ++E+ R ++I+ +Q L+ GIEV+ FGS+PLKTY
Sbjct: 28 IEAEVWLIAEARAQEILCAVQPNYLAERSRNKIISNLQTLLWERLGIEVYLFGSMPLKTY 87
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
LPDGDIDLTVL H EED AR VC +LE E +S ++ VQYV A+VK++KCS++++
Sbjct: 88 LPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGNSDLQVTGVQYVQAKVKVIKCSIRDVA 147
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
DISFNQ+AGL ALCFLE+VD+ G+DHLFK+S+IL+KAWC+YESRILGA+ GLISTYAL
Sbjct: 148 FDISFNQLAGLGALCFLEQVDKAFGRDHLFKKSIILVKAWCFYESRILGANSGLISTYAL 207
Query: 221 EMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEN 263
++ VL++F++YY +FDW NYC+++ GPV ISSLP+I E
Sbjct: 208 AILVLNIVNMSYSSLSGPLAVLYKFINYYGSFDWKNYCVTVTGPVPISSLPDI----TET 263
Query: 264 DGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
E+ L +F ++C E+YS +E + F +K+ NI+DPLK +NNLGRSV+KGN
Sbjct: 264 GNHEVFLDEKFFRECMELYSGETGVVEASRKYFPVKYYNILDPLKHSNNLGRSVTKGNMV 323
Query: 324 RIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSE 383
R+R G Q+L ++LTLPG +G LEKFF +LERNGKG R DV+ P+ AFG+G ++
Sbjct: 324 RLRNCFMLGVQKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPVVAFGTGVAD 383
Query: 384 AADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNL 443
+ L GD+D Y N L+YG+ +H +L + PPS + ++W + Y+ N
Sbjct: 384 YSQLKGDFDRYRNNLIYGKWFHGESL---HNWLPPSLDT-----TSWANISFYMSRVRN- 434
Query: 444 VYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTR 491
G P P Q + +S GTGTYIP ++
Sbjct: 435 ----GFNGRNQEGPTSMPNIKQ-------SPNNRRSNGTGTYIPGMSK 471
>gi|224124740|ref|XP_002319410.1| predicted protein [Populus trichocarpa]
gi|222857786|gb|EEE95333.1| predicted protein [Populus trichocarpa]
Length = 681
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 234/462 (50%), Positives = 295/462 (63%), Gaps = 23/462 (4%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGD 105
W AEE EI+ I P + S KRK+VI+Y+QRLI G EVFP+GSVPLKTYLPDGD
Sbjct: 58 WERAEEVATEIVYRIHPTVESSFKRKQVIDYVQRLIRYSLGFEVFPYGSVPLKTYLPDGD 117
Query: 106 IDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
IDLT + +EE L V +L E+ +D+++E+KDV + A+VK++KC VQN VDI
Sbjct: 118 IDLTAISSPAIEEALVSDVYTVLRGEELNEDALYEVKDVHCIDAEVKLIKCIVQNTVVDI 177
Query: 164 SFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM 223
SFNQ+ GL LCFLE+VDRL+GK+HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +
Sbjct: 178 SFNQLGGLCTLCFLEEVDRLVGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETL 237
Query: 224 -----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGD 266
VL++FLDY++ FDW+NYCIS+NGPV SSLP IVA+ EN
Sbjct: 238 ILYIFHLFHSSLNGPLAVLYKFLDYFSKFDWENYCISLNGPVCKSSLPNIVAKPPENVSG 297
Query: 267 ELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIR 326
ELLLS EFLK C + + VP E F KHLNIVDPLK+NNNLGRSV++GNF RIR
Sbjct: 298 ELLLSDEFLKDCVDRFYVPSRKPEMNSRPFPQKHLNIVDPLKENNNLGRSVNRGNFFRIR 357
Query: 327 CALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAAD 386
A YG ++LG IL LP + L+ FF NTL+R+G DVQ A + S+ +
Sbjct: 358 SAFKYGGRKLGRILLLPREKIADELKTFFANTLDRHGSDYWSDVQNSELASVARSSDNSV 417
Query: 387 LSGDYDSYYNGLLYGQCYHDYTLPVTAQFSP-PSSPSQIPSRSAWDVLCQYVQGKGNLVY 445
GD+D + L Y Q H Y V+A P PS Q + + W+ + Q +Q K N
Sbjct: 418 SLGDHDDHLQSLAYSQYCHMYA--VSAPIPPCPSMSPQSENNNRWETVRQSLQLKRNGHS 475
Query: 446 QWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
Q T +C + RA+T S ++ E+ RGTGTYIP
Sbjct: 476 QMNTNHVYGTQFYCVNPVAPFRAATNSEEKKER-RGTGTYIP 516
>gi|297736507|emb|CBI25378.3| unnamed protein product [Vitis vinifera]
Length = 893
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 270/657 (41%), Positives = 360/657 (54%), Gaps = 108/657 (16%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE------ 88
N + S+ AE W + + IQEILC IQP IVSE++RKE+I+Y+QRLI +G E
Sbjct: 105 NPHLMSVTAECWSITKLTIQEILCAIQPTIVSEQRRKEIIDYVQRLIRDSFGNEREIEES 164
Query: 89 ---------------------------------------------VFPFGSVPLKTYLPD 103
V PFGS+PLKTYLPD
Sbjct: 165 SRASTFQSGDERPLFKQAMLERRNSRSIEVLLQVYFEEPKNIKLYVLPFGSMPLKTYLPD 224
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQ-DSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
GDIDLT L +N EED AR VC +LE E Q S F ++D+ Y+ A+VKIVKC VQ+I VD
Sbjct: 225 GDIDLTALCPENDEEDFARDVCTLLEGERQMGSEFRVEDISYIRAKVKIVKCMVQDISVD 284
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
ISFNQ GLS LCFLE++D LIGKDHLFKRSVILIKAWCYYE RILG+H GL+STYALE+
Sbjct: 285 ISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEGRILGSHCGLLSTYALEI 344
Query: 223 M-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG 265
+ VL+RFLDYY+TFDW+ + +S+ GPV+ISSL E E
Sbjct: 345 LVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPVSISSLLTGAPEAAETAD 404
Query: 266 DELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRI 325
LL++ EFL C+E ++V I A E T F +KH+NI DPL+D NNLGRS+S GN +R
Sbjct: 405 KPLLINEEFLWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRF 464
Query: 326 RCALSYGAQRLGEI-LTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEA 384
R A+S GAQRL EI L LP + GL++FF NTL+RNG G D + + FG G S+
Sbjct: 465 RYAISVGAQRLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGAD-EGDLVPFGPGSSKF 523
Query: 385 ADLSGDYDSYYNGLLYGQCYHDY---------TLPVTAQ---FSPPSSP--------SQI 424
+L GDY + L YGQ YH +L V Q F+ + P S +
Sbjct: 524 CNLQGDYYGCLSNLHYGQFYHANLSSFSLHPGSLDVPFQWMYFNQSNVPWGYTNHFASML 583
Query: 425 PSRSAWDVLC---QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRG 481
PS++ + Q+ + + N+ +VP LPFC+ ASQ+ + +E + +G
Sbjct: 584 PSQNTFQFYSPSFQFEEQESNM------NAYVPSLPFCYETASQLSGLAYHFEE-KVPQG 636
Query: 482 TGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNV 541
TGT+IP + ++NR ++PE H P S + + +R P ++ S L
Sbjct: 637 TGTFIPNVAQIFHNDLNRWEHGQDPE---HEPRSGMPQTRWAVKRNPV--SMASSVLLEP 691
Query: 542 ENGCSLNMENSCSLILSL--EEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSP 596
E N E + ++ +E ++ +KN PS +Q QS +S D V P
Sbjct: 692 EQEGGKNSEPTLQVLSPKPEDEGGMVSKSKNQAPSLKPEQERQSNQESKAQDLVLKP 748
>gi|359478494|ref|XP_002276607.2| PREDICTED: uncharacterized protein LOC100253523 [Vitis vinifera]
Length = 854
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/350 (57%), Positives = 259/350 (74%), Gaps = 19/350 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
I + W AE QEI+ +QP + S ++R+EVI+Y+QRLI G EVFP+GSVPLKTY
Sbjct: 35 IAGDSWAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTY 94
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQN 158
L DGDIDLT L NVEE LA V +L+ E+Q+ + FE+KD+Q++ A+VK+VKC V++
Sbjct: 95 LLDGDIDLTALCSSNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKD 154
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
I +DISFNQ+ GLS LCFLE+VDRLIGKDHLFKRS+ILIK+WCYYESRILGAH+GLISTY
Sbjct: 155 IVIDISFNQLGGLSTLCFLEQVDRLIGKDHLFKRSIILIKSWCYYESRILGAHHGLISTY 214
Query: 219 ALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETV 261
ALE++VL+ RFLDY++ FDWDNYCIS+NGPV SSLP+IVAE
Sbjct: 215 ALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPDIVAELP 274
Query: 262 ENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN 321
EN D+LLLS EFL+ C +++SVP LE F +KHLNI+DPL++NNNLGRSV+KGN
Sbjct: 275 ENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLRENNNLGRSVNKGN 334
Query: 322 FHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
F+RIR A YG+ +LG+IL+LP + L+ FF +TLER+ ++Q
Sbjct: 335 FYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERHRSKYMAEIQ 384
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 71/137 (51%), Gaps = 9/137 (6%)
Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVY 445
DLSGDYDS+ L YGQC + + LP SPP SPSQ+ + WD + Q++Q NL
Sbjct: 597 DLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPLSPSQLQINTPWDKVRQHLQFTQNLHS 656
Query: 446 QWGTE--VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRA 503
Q + + P HP S + F ++ +K RGTGTY P + P +RPV
Sbjct: 657 QMDSNGVILGNHFPVKHPARS---ITAFGLEDKQKPRGTGTYFPNMSHLP--NRDRPVGQ 711
Query: 504 RNPESLNHVPVSKLRRK 520
R ++L S+L R+
Sbjct: 712 RRNQALE--SHSQLHRR 726
>gi|297816424|ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
lyrata]
gi|297321933|gb|EFH52354.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp.
lyrata]
Length = 829
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/346 (58%), Positives = 247/346 (71%), Gaps = 19/346 (5%)
Query: 44 ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
E W+ EE +EI+ + P +VSE +R++VI Y+Q+LI G EV FGSVPLKTYLPD
Sbjct: 32 EFWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLPD 91
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
GDIDLT G EE+LA V +LE E+ + S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
DISFNQ+ G+ LCFLEK+D LIGKDHLFKRS+ILIKAWCYYESRILGA +GLISTYALE
Sbjct: 152 DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211
Query: 222 MMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
+VL+ +FLDY++ FDWDNYCIS+NGPV +SSLPEIV ET EN
Sbjct: 212 TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPENG 271
Query: 265 GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHR 324
G++ LL+ EFLK+C E+YSVP E F KHLNIVDPLK+ NNLGRSVSKGNF+R
Sbjct: 272 GEDFLLTSEFLKECMEMYSVPSRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331
Query: 325 IRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
IR A +YGA++LG+I + L KFF N L R+G G RPDV
Sbjct: 332 IRSAFTYGARKLGQIFLQSDEAIKSELRKFFSNMLLRHGSGQRPDV 377
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 64/130 (49%), Gaps = 8/130 (6%)
Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
+DLSGDY+S N L +G+ + DY P++ SPP P Q+P+ ++W+V+ + + N
Sbjct: 578 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVIRHALPFRRN 635
Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
VPR F H + F+ +E K RGTGTY P R RP
Sbjct: 636 APTPVNANGVVPRQVFFHVNPQMIPGPGFAIEELPKPRGTGTYFPNANHYRDRPFSPRGR 695
Query: 499 RPVRARNPES 508
+AR+P +
Sbjct: 696 SSHQARSPRN 705
>gi|79597803|ref|NP_850678.2| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
gi|332645293|gb|AEE78814.1| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
Length = 829
Score = 407 bits (1046), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/350 (58%), Positives = 250/350 (71%), Gaps = 19/350 (5%)
Query: 44 ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
ELW+ EE +EI+ + P +VSE +R++VI Y+Q+LI G EV FGSVPLKTYLPD
Sbjct: 32 ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
GDIDLT G EE+LA V +LE E+ + S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
DISFNQ+ G+ LCFLEK+D LIGKDHLFKRS+ILIKAWCYYESRILGA +GLISTYALE
Sbjct: 152 DISFNQIGGICTLCFLEKIDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALE 211
Query: 222 MMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
+VL+ +FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN
Sbjct: 212 TLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENG 271
Query: 265 GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHR 324
G++LLL+ EFLK+C E+YSVP E F KHLNIVDPLK+ NNLGRSVSKGNF+R
Sbjct: 272 GEDLLLTSEFLKECLEMYSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYR 331
Query: 325 IRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
IR A +YGA++LG++ + L KFF N L R+G G RPDV I
Sbjct: 332 IRSAFTYGARKLGQLFLQSDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 381
Score = 62.8 bits (151), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)
Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
+DLSGDY+S N L +G+ + DY P++ SPP P Q+P+ ++W+V+ + + N
Sbjct: 578 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 635
Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
VPR F H + F +E K RGTGTY P R RP
Sbjct: 636 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 695
Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
+AR+P + N +S+ +++ P R+ E + S +M ++ S
Sbjct: 696 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 746
Query: 559 LEEFPLLPVTKNLP 572
L+ FP + N P
Sbjct: 747 LDSFPDTNGSTNHP 760
>gi|449526634|ref|XP_004170318.1| PREDICTED: uncharacterized LOC101207419 [Cucumis sativus]
Length = 816
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/320 (63%), Positives = 239/320 (74%), Gaps = 19/320 (5%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQY 144
++VFPFGSVPLKTYLPDGDIDLT LG NVEE LA VC +L +EDQ+ F +KDVQ
Sbjct: 3 VQVFPFGSVPLKTYLPDGDIDLTALGGSNVEEALASDVCSVLNSEDQNGAAEFVVKDVQL 62
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
+ A+VK+VKC VQNI VDISFNQ+ GL LCFLEK+DR IGKDHLFKRS+ILIKAWCYYE
Sbjct: 63 IRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKIDRRIGKDHLFKRSIILIKAWCYYE 122
Query: 205 SRILGAHYGLISTYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGP 247
SRILGAH+GLISTYALE + VL++FLDY++ FDWDNYCIS+NGP
Sbjct: 123 SRILGAHHGLISTYALETLVLYIFHLFHSALNGPLQVLYKFLDYFSKFDWDNYCISLNGP 182
Query: 248 VAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
V ISSLPE+VAET +N G +LLLS +FL+ C E +SVP E F IKHLNIVDPL
Sbjct: 183 VRISSLPELVAETPDNGGGDLLLSTDFLQSCLETFSVPARGYEANSRAFPIKHLNIVDPL 242
Query: 308 KDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLR 367
K+NNNLGRSVSKGNF+RIR A SYGA++LG IL+ P + + KFF NTL+R+G G R
Sbjct: 243 KENNNLGRSVSKGNFYRIRSAFSYGARKLGFILSHPEDNVVDEVRKFFSNTLDRHGGGQR 302
Query: 368 PDVQVPIHAFGSGKSEAADL 387
PDVQ P G +S AA L
Sbjct: 303 PDVQDPAPVSGGYESCAALL 322
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 86/173 (49%), Gaps = 15/173 (8%)
Query: 343 PGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYGQ 402
P H + + E F+ ++ R + L +V P AF + +DL+GDY+S+ N L G+
Sbjct: 518 PLHSVSLSSEDFYPSS--RGYRFLTSNVGPP-EAFNA----LSDLNGDYESHCNSLQIGR 570
Query: 403 CYHDYTLPVTA-QFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHP 461
Y++Y L A PP PSQ P+++ WD++ + VQ K N Q + + R P +P
Sbjct: 571 WYYEYALSAAALSPIPPPLPSQYPNKNPWDIIRRSVQVKQNAFAQINSNGLLAR-PAFYP 629
Query: 462 YASQV--RASTFSTDEGEKSRGTGTYIPE----RTRPPPIEINRPVRARNPES 508
S + +T + +E K RGTGTY P R RP V R+P +
Sbjct: 630 MPSPILPGGATLAMEEMPKPRGTGTYFPNMNHYRDRPASARGRNQVSVRSPRN 682
>gi|449443945|ref|XP_004139736.1| PREDICTED: uncharacterized protein LOC101209112 [Cucumis sativus]
Length = 1341
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/350 (54%), Positives = 247/350 (70%), Gaps = 20/350 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D+E W AEER E++ IQP SE++R V +Y+QRLI + +VF FGSVPLKTY
Sbjct: 31 LDSERWSKAEERTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVFTFGSVPLKTY 90
Query: 101 LPDGDIDLTVLG-HQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDIDLT +QN++E A V +LE+E ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 91 LPDGDIDLTAFSKNQNLKETWAHQVRDMLESEEKNENAEFRVKEVQYIKAEVKIIKCLVE 150
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISF+Q+ GL LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 151 NIVVDISFDQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHHGLIST 210
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDWDN+C+S+ GPV ISSLP++ AE
Sbjct: 211 YALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPDVTAEP 270
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
DG ELLLS FL+ C +Y+V EN G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 271 PRKDGGELLLSKLFLEACSAVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVSKG 330
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
NF RIR A ++GA+RL + P + L +FF+NT ER+G G RPDV
Sbjct: 331 NFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERHGSGQRPDV 380
>gi|356500940|ref|XP_003519288.1| PREDICTED: uncharacterized protein LOC100814626 [Glycine max]
Length = 780
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 189/333 (56%), Positives = 243/333 (72%), Gaps = 20/333 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLT 109
E EIL I+P + ++++R+EV++Y+QRLI EVFP+GSVPLKTYLPDGDIDLT
Sbjct: 45 ERNTAEILRRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104
Query: 110 VLGHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
L +N+E+ L V +L E ++ + +E+KDV+++ A+VK+VKC VQ+I VDISFNQ
Sbjct: 105 ALSCENIEDGLVSDVRAVLHGEEINEAAEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQ 164
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
+ GLS LCFLEKVDRL+ KDHLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE +
Sbjct: 165 LGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYI 224
Query: 224 -------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLL 270
VL+RFLDY++ FDWDNYC+S+ GPV+ +SLP IVAE EN G+ LL
Sbjct: 225 FHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVSKTSLPNIVAEVPENGGN-TLL 283
Query: 271 SPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALS 330
+ EF++ C E +SVP + F KHLNI+DPLK+NNNLGRSV+KGNF+RIR A
Sbjct: 284 TEEFIRSCVESFSVPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFK 343
Query: 331 YGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
YGA++LG IL LP + L +FF NTLER+G
Sbjct: 344 YGARKLGWILRLPEDRIAEELIRFFANTLERHG 376
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 62/130 (47%), Gaps = 26/130 (20%)
Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYV-------- 437
DL+GDYDS+ L YG + Y + + SPP SP + P+R+ W+ + Q V
Sbjct: 515 DLTGDYDSHIGNLQYGHMCNGYPVSLVVP-SPPRSP-KFPNRNPWETVHQCVPINHSIRS 572
Query: 438 QGKGNLVYQWGTEVFV---PRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPP 494
Q N V G +V+V P LP ++F ++E K RGTG Y P T P
Sbjct: 573 QANSNCV--MGQQVYVINHPTLPM----------TSFGSEEKRKVRGTGAYFPNMT-SRP 619
Query: 495 IEINRPVRAR 504
NRP+ R
Sbjct: 620 YRDNRPIPGR 629
>gi|356553166|ref|XP_003544929.1| PREDICTED: uncharacterized protein LOC100816328 [Glycine max]
Length = 779
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/333 (57%), Positives = 242/333 (72%), Gaps = 20/333 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLT 109
E+ EIL I+P + ++++R+EV++Y+QRLI EVFP+GSVPLKTYLPDGDIDLT
Sbjct: 45 EKTTAEILSRIRPTLAADRRRREVVDYVQRLIRYGARCEVFPYGSVPLKTYLPDGDIDLT 104
Query: 110 VLGHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
L QN+E+ L V +L E ++ S +E+KDV+++ A+VK+VKC VQ+I VDISFNQ
Sbjct: 105 ALSCQNIEDGLVSDVRAVLHGEEINEASEYEVKDVRFIDAEVKLVKCIVQDIVVDISFNQ 164
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
+ GLS LCFLEKVDRL+ KDHLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE +
Sbjct: 165 LGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYESRVLGAHHGLISTYALETLVLYI 224
Query: 224 -------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLL 270
VL+RFLDY++ FDWDNYC+S+ GPV SS P IVAE EN G+ LL
Sbjct: 225 FHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPVGKSSPPNIVAEVPENGGN-TLL 283
Query: 271 SPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALS 330
+ EF++ C E +S+P + F KHLNI+DPLK+NNNLGRSV+KGNF+RIR A
Sbjct: 284 TEEFIRSCVESFSLPSRGADLNLRAFPQKHLNIIDPLKENNNLGRSVNKGNFYRIRSAFK 343
Query: 331 YGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
YGA++LG IL LP + L +FF NTLER+G
Sbjct: 344 YGARKLGWILMLPEDRITEELIRFFTNTLERHG 376
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 56/128 (43%), Gaps = 22/128 (17%)
Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQ------G 439
DL+GDYDS+ L YG + Y + + P+R+ W+ + Q VQ
Sbjct: 515 DLTGDYDSHIGNLQYGHMCNGYPVSPVVPSP--PRSPKFPNRNPWETVRQCVQINHSIRS 572
Query: 440 KGNLVYQWGTEVFV---PRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIE 496
+ N G +V+V P LP ++F ++E K RGTG Y P T P
Sbjct: 573 QANSNSVMGQQVYVINHPSLPM----------TSFGSEEKRKVRGTGAYFPNMT-SRPYR 621
Query: 497 INRPVRAR 504
NRP+ R
Sbjct: 622 DNRPMPGR 629
>gi|293332253|ref|NP_001168029.1| uncharacterized protein LOC100381756 [Zea mays]
gi|223945595|gb|ACN26881.1| unknown [Zea mays]
gi|413924674|gb|AFW64606.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
Length = 833
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 201/362 (55%), Positives = 250/362 (69%), Gaps = 21/362 (5%)
Query: 31 SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
S P + P S+ + W E ++ IQP SE R V++Y+QRL +VF
Sbjct: 13 SVPAHPDPASVSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVF 72
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
PFGSVPLKTYLPDGDIDLT+ G +E+LA VC IL++E+ +DS FE+KDVQYVPA+
Sbjct: 73 PFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAE 132
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VK+VKC VQNI VDIS NQ+ GL LCFLEKVD+ GKDHLFK+S+ILIK WCYYESRIL
Sbjct: 133 VKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRIL 192
Query: 209 GAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAIS 251
GAH+GLISTYALE + VL+RFLDYY+ FDWDN IS+ GPV++S
Sbjct: 193 GAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLS 252
Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
SLPE+V + + D+ L EFLK+C E +SV E F+ + LNIVDPLK +N
Sbjct: 253 SLPELVTDPPDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSN 312
Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMG-LEKFFINTLERNGKGLRPDV 370
NLGRSVSKGNF+RIR A +GA++LG+IL +P CL +G + +FF NTL+RN GLRPDV
Sbjct: 313 NLGRSVSKGNFYRIRSAFDFGARKLGKILQVPS-CLTVGEVNQFFRNTLKRNRTGLRPDV 371
Query: 371 QV 372
V
Sbjct: 372 LV 373
>gi|413924673|gb|AFW64605.1| hypothetical protein ZEAMMB73_425366 [Zea mays]
Length = 815
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 201/362 (55%), Positives = 250/362 (69%), Gaps = 21/362 (5%)
Query: 31 SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
S P + P S+ + W E ++ IQP SE R V++Y+QRL +VF
Sbjct: 13 SVPAHPDPASVSPDAWRRFETAALAVVNKIQPTAASEHLRAAVVDYVQRLFWFQARYQVF 72
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
PFGSVPLKTYLPDGDIDLT+ G +E+LA VC IL++E+ +DS FE+KDVQYVPA+
Sbjct: 73 PFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCTILKSEERRKDSEFEVKDVQYVPAE 132
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VK+VKC VQNI VDIS NQ+ GL LCFLEKVD+ GKDHLFK+S+ILIK WCYYESRIL
Sbjct: 133 VKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQHFGKDHLFKKSIILIKDWCYYESRIL 192
Query: 209 GAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAIS 251
GAH+GLISTYALE + VL+RFLDYY+ FDWDN IS+ GPV++S
Sbjct: 193 GAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLS 252
Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
SLPE+V + + D+ L EFLK+C E +SV E F+ + LNIVDPLK +N
Sbjct: 253 SLPELVTDPPDIQDDDFLQREEFLKECIESFSVLPRNSETNPRLFSRRFLNIVDPLKQSN 312
Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMG-LEKFFINTLERNGKGLRPDV 370
NLGRSVSKGNF+RIR A +GA++LG+IL +P CL +G + +FF NTL+RN GLRPDV
Sbjct: 313 NLGRSVSKGNFYRIRSAFDFGARKLGKILQVPS-CLTVGEVNQFFRNTLKRNRTGLRPDV 371
Query: 371 QV 372
V
Sbjct: 372 LV 373
>gi|357153090|ref|XP_003576335.1| PREDICTED: uncharacterized protein LOC100826374, partial
[Brachypodium distachyon]
Length = 769
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 205/391 (52%), Positives = 260/391 (66%), Gaps = 31/391 (7%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P SI AE W EE ++ IQP+ SE +R V++Y+QRL+ G EVFPFGSVPL
Sbjct: 3 PASISAERWRAFEEAALGVVGRIQPSAPSEGRRAAVVHYVQRLVRHAVGCEVFPFGSVPL 62
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDLT G + +E+LA V +LE+E+ +D+ FE+KDVQY+ A+VK+VKC
Sbjct: 63 KTYLPDGDIDLTAFGSISSDENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCL 122
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
VQNI VDISFNQ+ GL LCFLE+VD+ GK+HLFK+S++LIKAWCYYESRILGAH+GLI
Sbjct: 123 VQNIVVDISFNQIGGLCTLCFLEQVDQRFGKEHLFKKSIMLIKAWCYYESRILGAHHGLI 182
Query: 216 STYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE++ VL+RFLDYY+ FDWDN IS+ GPV +SSLPE+V+
Sbjct: 183 STYALEILVLCIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVLLSSLPELVS 242
Query: 259 ET-VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
+ V +DGD L EFL++C + ++VP E F+ K LNIVDPLK NNNLGRSV
Sbjct: 243 DAPVTHDGD-FLKREEFLRECAQTFTVPPRNSEKNTRLFSRKFLNIVDPLKQNNNLGRSV 301
Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAF 377
SKGNF RIR A GA++LG+IL + +FF NTL+RN +RPDVQ
Sbjct: 302 SKGNFFRIRSAFDLGARKLGKILKEASSSAVPEVNQFFRNTLKRNRTMVRPDVQ------ 355
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYHDYT 408
A D S + D+ + LY Y D +
Sbjct: 356 ----DIALDFSVERDNEDHTPLYSNSYGDLS 382
>gi|255564741|ref|XP_002523365.1| hypothetical protein RCOM_0719270 [Ricinus communis]
gi|223537453|gb|EEF39081.1| hypothetical protein RCOM_0719270 [Ricinus communis]
Length = 1334
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/350 (53%), Positives = 245/350 (70%), Gaps = 20/350 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D+E W AEER E++ I+P SE++R V +Y++RLI + VF FGSVPLKTY
Sbjct: 24 LDSERWAKAEERTAELIDCIKPNEPSERRRNAVADYVERLITKCFPCRVFTFGSVPLKTY 83
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCSVQ 157
LPDGDIDLT Q+++E A V +LENE+++ F +K+VQY+ A+VKI+KC V+
Sbjct: 84 LPDGDIDLTAFSEGQSMKETWAHQVRDVLENEEKNENAEFRVKEVQYIQAEVKIIKCLVE 143
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISF+Q+ GL LCFLE+VD LI +DHLFK+S+ILIKAWCYYESRILGAH+GLIST
Sbjct: 144 NIVVDISFDQLGGLCTLCFLEEVDHLINQDHLFKKSIILIKAWCYYESRILGAHHGLIST 203
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDWDN+C+S+ GPV ISSLP++ AE
Sbjct: 204 YALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPISSLPDVTAEP 263
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
DG ELLLS FLK C +Y+V E+ G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 264 PRKDGGELLLSKLFLKACGAVYAVSPGGPESQGQTFTSKHFNVIDPLRVNNNLGRSVSKG 323
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
NF RIR A ++GA+RL +L P + + +FF+NT +R+G GLRPD
Sbjct: 324 NFFRIRSAFAFGAKRLARLLDCPKEDIHFEVNQFFMNTWDRHGSGLRPDA 373
>gi|302143676|emb|CBI22537.3| unnamed protein product [Vitis vinifera]
Length = 1359
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/354 (52%), Positives = 244/354 (68%), Gaps = 20/354 (5%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
++D E LAEER ++++ IQP SE++R+ V +Y++ LI + +VFPFGSVPLKT
Sbjct: 29 ALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSVPLKT 88
Query: 100 YLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
YLPDGDIDLT N+++ A V ILE E++ D+ F +K+VQY+ A+VKI+KC V
Sbjct: 89 YLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKIIKCLV 148
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
+NI VDISFNQ+ GL LCFLE+VD LI + HLFKRS+ILIKAWCYYESRILGAH+GLIS
Sbjct: 149 ENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHHGLIS 208
Query: 217 TYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
TYALE +VL+ RFL++++ FDW+NYC+S+ GPV ISSLP++ A+
Sbjct: 209 TYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPDVTAD 268
Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
D ELLLS FL C +Y+V EN F K+ N++DPL+ NNNLGRSVSK
Sbjct: 269 PPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGRSVSK 328
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
GNF RIR A ++GAQRL +L P + + +FF+NT ER+GKG RPD P
Sbjct: 329 GNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERHGKGDRPDAPSP 382
>gi|225462743|ref|XP_002268106.1| PREDICTED: uncharacterized protein LOC100248390 [Vitis vinifera]
Length = 1353
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 187/354 (52%), Positives = 244/354 (68%), Gaps = 20/354 (5%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
++D E LAEER ++++ IQP SE++R+ V +Y++ LI + +VFPFGSVPLKT
Sbjct: 29 ALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSVPLKT 88
Query: 100 YLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
YLPDGDIDLT N+++ A V ILE E++ D+ F +K+VQY+ A+VKI+KC V
Sbjct: 89 YLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEVKIIKCLV 148
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
+NI VDISFNQ+ GL LCFLE+VD LI + HLFKRS+ILIKAWCYYESRILGAH+GLIS
Sbjct: 149 ENIVVDISFNQLGGLCTLCFLEEVDHLISQKHLFKRSIILIKAWCYYESRILGAHHGLIS 208
Query: 217 TYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
TYALE +VL+ RFL++++ FDW+NYC+S+ GPV ISSLP++ A+
Sbjct: 209 TYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPDVTAD 268
Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
D ELLLS FL C +Y+V EN F K+ N++DPL+ NNNLGRSVSK
Sbjct: 269 PPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGRSVSK 328
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
GNF RIR A ++GAQRL +L P + + +FF+NT ER+GKG RPD P
Sbjct: 329 GNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERHGKGDRPDAPSP 382
>gi|297745424|emb|CBI40504.3| unnamed protein product [Vitis vinifera]
Length = 1229
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 184/350 (52%), Positives = 244/350 (69%), Gaps = 21/350 (6%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E WL+AEER E++ IQP SE+ R V +Y+QR++ + +VF FGSVPLKTY
Sbjct: 30 LDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSVPLKTY 89
Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDIDLT +QN+++ A V +L++E ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 90 LPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKIIKCLVE 149
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 150 NIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHHGLIST 209
Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE + VL+RFL+++++FDWDN+C+S+ GPV ISSLP++ AE
Sbjct: 210 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPDVTAEP 269
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLLS FL C +Y+V E G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 270 PRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGRSVSKG 329
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
NF RIR A ++GA+RL +L P + + + F+NT ER+G G RPD
Sbjct: 330 NFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERHGSGHRPDT 378
>gi|225454502|ref|XP_002277075.1| PREDICTED: uncharacterized protein LOC100241322 [Vitis vinifera]
Length = 1295
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 184/350 (52%), Positives = 244/350 (69%), Gaps = 21/350 (6%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E WL+AEER E++ IQP SE+ R V +Y+QR++ + +VF FGSVPLKTY
Sbjct: 30 LDTERWLIAEERTAELIACIQPNQPSEELRNAVADYVQRIVVQCFPCQVFTFGSVPLKTY 89
Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDIDLT +QN+++ A V +L++E ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 90 LPDGDIDLTAFSNNQNLKDTWANQVRDMLQSEEKNENAEFRVKEVQYIQAEVKIIKCLVE 149
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 150 NIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESRILGAHHGLIST 209
Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE + VL+RFL+++++FDWDN+C+S+ GPV ISSLP++ AE
Sbjct: 210 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVPISSLPDVTAEP 269
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLLS FL C +Y+V E G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 270 PRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRVNNNLGRSVSKG 329
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
NF RIR A ++GA+RL +L P + + + F+NT ER+G G RPD
Sbjct: 330 NFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERHGSGHRPDT 378
>gi|413956606|gb|AFW89255.1| hypothetical protein ZEAMMB73_893455 [Zea mays]
Length = 1316
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 185/353 (52%), Positives = 240/353 (67%), Gaps = 20/353 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W +AE+R E++ IQP + SE +R V +Y+QRLI +VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEDRTAELIACIQPNVYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + + ++E A V LE E D+++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKDENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDW+ +C+S+ GPV ISSLP++ A
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAIP 259
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLL+ FL C Y V EN G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRMDSGELLLNKSFLDTCSSAYGVVPHTQENQGQPFVSKHFNVIDPLRTNNNLGRSVSKG 319
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
NF RIR A +YGA+RLG++L P L L +FF NT R+G G RPDV +P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEALIPELNQFFTNTWIRHGSGSRPDVPIP 372
>gi|356520288|ref|XP_003528795.1| PREDICTED: uncharacterized protein LOC100809742 [Glycine max]
Length = 1331
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/352 (53%), Positives = 246/352 (69%), Gaps = 22/352 (6%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY--GIEVFPFGSVPLK 98
+D+E WL AE+R E++ IQP SE++R V +Y+QRLI + + VF FGSVPLK
Sbjct: 31 LDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVGVFTFGSVPLK 90
Query: 99 TYLPDGDIDLTVLG-HQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
TYLPDGDIDLT +QN+++ A V +LENE+++ F +K+VQY+ A+VKI+KC
Sbjct: 91 TYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVKIIKCL 150
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
V+NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGAHHGLI 210
Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE +VL+ RFL++++ FDW+N+C+S+ GPV ISSLP++ A
Sbjct: 211 STYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSLPDVTA 270
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
E DG +LLLS FL C +Y+V EN G F KH N++DPL+ NNNLGRSVS
Sbjct: 271 EPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 330
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
KGNF RIR A ++GA++L +L P L + +FF NT ER+G G RPDV
Sbjct: 331 KGNFFRIRSAFAFGAKKLARLLDCPEEELFSEVNQFFFNTWERHGSGERPDV 382
>gi|242036527|ref|XP_002465658.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
gi|241919512|gb|EER92656.1| hypothetical protein SORBIDRAFT_01g043240 [Sorghum bicolor]
Length = 1333
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 185/353 (52%), Positives = 239/353 (67%), Gaps = 20/353 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W +AE+R E++ IQP SE +R V +Y+QRLI +VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEDRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + + ++E A V LE E ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISRNHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDW+ +C+S+ GPV ISSLP++ AE
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLL+ FL C Y V EN G F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRMDSGELLLNKSFLDTCSSAYGVVPRTQENQGQPFVSKHFNVIDPLRANNNLGRSVSKG 319
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
NF RIR A +YGA+RLG++L P L L +FF NT R+G G RPDV P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIAELNQFFTNTWIRHGSGSRPDVPTP 372
>gi|356560284|ref|XP_003548423.1| PREDICTED: uncharacterized protein LOC100800527 [Glycine max]
Length = 1337
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/352 (53%), Positives = 245/352 (69%), Gaps = 22/352 (6%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY--GIEVFPFGSVPLK 98
+D+E WL AE+R E++ IQP SE++R V +Y+QRLI + + VF FGSVPLK
Sbjct: 31 LDSERWLKAEQRTAELIACIQPNPPSEERRNAVADYVQRLIMKCFPCQVRVFTFGSVPLK 90
Query: 99 TYLPDGDIDLTVLG-HQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCS 155
TYLPDGDIDLT +QN+++ A V +LENE+++ F +K+VQY+ A+VKI+KC
Sbjct: 91 TYLPDGDIDLTAFSKNQNLKDSWAHQVRDMLENEEKNENAEFHVKEVQYIQAEVKIIKCL 150
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
V+NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 151 VENIVVDISFNQLGGLCTLCFLEEVDNLINQNHLFKRSIILIKAWCYYESRILGAHHGLI 210
Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE +VL+ RFL++++ FDW+N+C+S+ GPV ISSLP++ A
Sbjct: 211 STYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWENFCVSLWGPVPISSLPDVTA 270
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
E DG +LLLS FL C +Y+V EN G F KH N++DPL+ NNNLGRSVS
Sbjct: 271 EPPRKDGGDLLLSKLFLDACSSVYAVFPGGQENQGQPFVSKHFNVIDPLRVNNNLGRSVS 330
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
KGNF RIR A ++GA+RL +L L + +FF NT ER+G G RPDV
Sbjct: 331 KGNFFRIRSAFAFGAKRLARLLDCSEDELFSEVNQFFFNTWERHGSGERPDV 382
>gi|357155485|ref|XP_003577136.1| PREDICTED: uncharacterized protein LOC100840351 [Brachypodium
distachyon]
Length = 739
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 202/387 (52%), Positives = 254/387 (65%), Gaps = 29/387 (7%)
Query: 10 MEDLQE-ELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEK 68
M D++E L P H +P N P S+ E+W E ++ IQP I SE
Sbjct: 1 MVDIREVSLAPEPKH--------APANPDPSSVSPEVWEPLEAAALAVIGRIQPTIPSEG 52
Query: 69 KRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R V++YIQRL+ G +VFPFGSVPLKTYLPDGDIDLT G +E LA V IL
Sbjct: 53 LRASVVDYIQRLVRCSVGCQVFPFGSVPLKTYLPDGDIDLTAFGSTYSDESLANEVRAIL 112
Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
E E++ FE+KDVQY+ A+VK+VKC VQNI VDISFNQM GL LCFLE+VD+ K
Sbjct: 113 EAEERREDAEFEVKDVQYIHAEVKLVKCFVQNIVVDISFNQMGGLCTLCFLEQVDQRFEK 172
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
+HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+ RFL
Sbjct: 173 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFL 232
Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
DYY+ FDWDN IS+ GPV++SSLPE+V E D L EFLK+C ++++VP
Sbjct: 233 DYYSKFDWDNKGISLYGPVSLSSLPELVTEPTGTHDDSFLQREEFLKECAKMFTVPPRLN 292
Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
E F K+ NIVDPLK +NNLGRSVSKGNF+RIR A GA++LG+IL +P +
Sbjct: 293 EKNTRPFYQKYFNIVDPLKQSNNLGRSVSKGNFYRIRSAFDLGARKLGKILQMPANSTVD 352
Query: 350 GLEKFFINTLERNGKGLRPDVQ-VPIH 375
+ +FF +TL+RN +RPD+Q +P++
Sbjct: 353 EVNQFFKSTLKRNHSMVRPDIQDIPVN 379
>gi|147780178|emb|CAN75522.1| hypothetical protein VITISV_043595 [Vitis vinifera]
Length = 733
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 246/588 (41%), Positives = 327/588 (55%), Gaps = 91/588 (15%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ-DSVFEIKDVQYV 145
IEV PFGS+PLKTYLPDGDIDLT L +N EED AR VC +LE E Q S F ++D+ Y+
Sbjct: 10 IEVLPFGSMPLKTYLPDGDIDLTALCPENDEEDFARDVCTLLEGERQMGSEFRVEDISYI 69
Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
A+VKIVKC VQ+I VDISFNQ GLS LCFLE++D LIGKDHLFKRSVILIKAWCYYE
Sbjct: 70 RAKVKIVKCMVQDISVDISFNQTGGLSTLCFLEQIDILIGKDHLFKRSVILIKAWCYYEG 129
Query: 206 RILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPV 248
RILG+H GL+STYALE++ VL+RFLDYY+TFDW+ + +S+ GPV
Sbjct: 130 RILGSHCGLLSTYALEILVLYVINLFYSSLYCPLAVLYRFLDYYSTFDWEKFGVSVLGPV 189
Query: 249 AISSL-------------------------------PE---IVAETVENDGDELLLSPEF 274
+ISSL P+ VAE E LL++ EF
Sbjct: 190 SISSLLTGARESCLIMWLCLMVCFFRLIGLPFYLIFPDFVLFVAEAAETADKPLLINEEF 249
Query: 275 LKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQ 334
L C+E ++V I A E T F +KH+NI DPL+D NNLGRS+S GN +R R A+S GAQ
Sbjct: 250 LWSCKEAFAVSIRASECTKQPFLVKHINIQDPLRDYNNLGRSISLGNSYRFRYAISVGAQ 309
Query: 335 RLGEI-LTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDS 393
RL EI L LP + GL++FF NTL+RNG G D + + FG G S+ +L GDY
Sbjct: 310 RLKEILLMLPEGRMNEGLKEFFNNTLDRNGGGQGAD-EGDLVPFGPGSSKFCNLQGDYYG 368
Query: 394 YYNGLLYGQCYHDY---------TLPVTAQ---FSPPSSP--------SQIPSRSAWDVL 433
+ L YGQ YH +L V Q F+ + P S +PS++ +
Sbjct: 369 CLSNLHYGQFYHANLSSFSLHPGSLDVPFQWMYFNQSNVPWGYTNHFASMLPSQNTFQFY 428
Query: 434 C---QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERT 490
Q+ + + N+ +VP LPFC+ ASQ+ + +E + +GTGT+IP
Sbjct: 429 SPSFQFEEQESNM------NAYVPSLPFCYETASQLSGLAYHFEE-KVPQGTGTFIPNVA 481
Query: 491 RPPPIEINRPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNME 550
+ ++NR ++PE H P S + + +R P ++ S L E N E
Sbjct: 482 QIFHNDLNRWEHGQDPE---HEPRSGMPQTRWAVKRNPV--SMASSVLLEPEQEGGKNSE 536
Query: 551 NSCSLILSL--EEFPLLPVTKNLPPSATNQQFDQSTDKSTQVDDVSSP 596
+ ++ +E ++ +KN PS +Q QS +S D V P
Sbjct: 537 PTLQVLSPKPEDEGGMVSKSKNQAPSLKPEQERQSNQESKAQDLVLKP 584
>gi|242069725|ref|XP_002450139.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
gi|241935982|gb|EES09127.1| hypothetical protein SORBIDRAFT_05g001080 [Sorghum bicolor]
Length = 835
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 198/362 (54%), Positives = 248/362 (68%), Gaps = 21/362 (5%)
Query: 31 SSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVF 90
S P + P S+ + W E ++ IQP SE+ R VI Y+QRL +VF
Sbjct: 13 SVPAHPDPASVSPDAWRRFETAALAVVNKIQPTAASEQLRAAVIEYVQRLFWFQARYQVF 72
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
PFGSVPLKTYLPDGDIDLT+ G +E+LA VC IL++E+ +DS FE+KDV YVPA+
Sbjct: 73 PFGSVPLKTYLPDGDIDLTLFGPAISDENLANEVCAILKSEERRKDSEFEVKDVHYVPAE 132
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VK+VKC VQNI VDIS NQ+ GL LCFLEKVD+ GK+HLFKRS++L+K WCYYESRIL
Sbjct: 133 VKLVKCLVQNIVVDISVNQIGGLCTLCFLEKVDQNFGKNHLFKRSIMLVKDWCYYESRIL 192
Query: 209 GAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAIS 251
GAH+GLISTYALE +VL+ RFLDYY+ FDWDN IS+ GPV++S
Sbjct: 193 GAHHGLISTYALETLVLYIFHIFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPVSLS 252
Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
SLPE+V + + D+ L EFLK+C E +SV E F+ + LNIVDPLK +N
Sbjct: 253 SLPELVTDPPDTQDDDFLQREEFLKECTESFSVLPRNSETNPRVFSRRFLNIVDPLKQSN 312
Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMG-LEKFFINTLERNGKGLRPDV 370
NLGRSVSKGNF+RIR A +GA++LG+IL +P CL + + +FF NTL+RN GLRPDV
Sbjct: 313 NLGRSVSKGNFYRIRSAFDFGARKLGKILQVPS-CLTVSEVNQFFRNTLKRNRTGLRPDV 371
Query: 371 QV 372
V
Sbjct: 372 WV 373
>gi|255554485|ref|XP_002518281.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223542501|gb|EEF44041.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 821
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/345 (54%), Positives = 239/345 (69%), Gaps = 19/345 (5%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P I E W AE+ +I+ I P + ++ RK V+ Y+Q LI G +VFP+GSVPL
Sbjct: 43 PALISEENWERAEQATLQIVYRIHPTVEADCNRKHVVEYVQSLIQSSLGFQVFPYGSVPL 102
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDLT + + + V +L E+Q D+ +++KDV ++ A+VK++KC
Sbjct: 103 KTYLPDGDIDLTAIINPAGVDASVSDVHAVLRREEQNRDAPYKVKDVHFIDAEVKLIKCI 162
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
V +I VDISFNQ+ GLS LCFLE+VD+LIGK HLFKRS+ILIKAWCYYESRILGAH+GLI
Sbjct: 163 VHDIVVDISFNQLGGLSTLCFLEQVDQLIGKSHLFKRSIILIKAWCYYESRILGAHHGLI 222
Query: 216 STYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE +MVL+RFLDY++ FDWDNYCIS+NGPV SSLP+IVA
Sbjct: 223 STYALETLILYIFHLFHSSLNGPLMVLYRFLDYFSKFDWDNYCISLNGPVCKSSLPKIVA 282
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
E E LLL EFL+ ++ SVP + E F KHLNIVDPL++NNNLGRSV+
Sbjct: 283 EPPETGRGNLLLDDEFLRNSVKMLSVPSRSPEMNSRPFTQKHLNIVDPLRENNNLGRSVN 342
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
+GNF+RIR A YGA++LG IL+L + L+KFF NTL+R+G
Sbjct: 343 RGNFYRIRSAFKYGARKLGHILSLQSDRMINELDKFFANTLDRHG 387
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 12/160 (7%)
Query: 386 DLSGDYDSYYNGLLYGQ--CYHDYTLPVTAQFSPPSSPSQIPSRSA--WDVLCQYVQGKG 441
DL+GDYDS+ + +GQ C+ + PV P SP+ S++ W+ + Q +Q K
Sbjct: 551 DLTGDYDSHLKSVKFGQGCCFFAVSAPVL-----PCSPTAPHSKNKNPWETVRQSLQLKR 605
Query: 442 NLVYQWGTE-VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRP 500
N+ Q T +F + F + A FS++E K RGTGTYIP + E
Sbjct: 606 NVHSQINTNGIFGHQQHFLNHLVPFTTA--FSSEEKRKQRGTGTYIPNMSYHSNRERPSS 663
Query: 501 VRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLN 540
R +N + N+ + + R A P ++ + G L+
Sbjct: 664 ERRKNHVTANNGDLHRRTRDNGLAATRPGINSYQHGHELS 703
>gi|414882102|tpg|DAA59233.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
Length = 875
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 193/360 (53%), Positives = 244/360 (67%), Gaps = 19/360 (5%)
Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
+P + P SI + W E IL TIQP+ SE R +I+Y+QRL+ + G++VFP
Sbjct: 15 APAHPDPSSIPRDAWRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFP 74
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
FGSVPLKTYLPDGDIDLT G +E LA VC IL++E+ +DS F++KDVQY+ A+V
Sbjct: 75 FGSVPLKTYLPDGDIDLTTFGPAISDEKLANEVCAILKSEEHRKDSEFDVKDVQYIHAEV 134
Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K+VKC VQNI VDIS NQ+ GL LCFLEKVD GK HLFKRSV+LIK WCYYE+RILG
Sbjct: 135 KLVKCVVQNIIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILG 194
Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
AH+GLISTYALE++ VL+RFLDYY+ FDWD IS+ GPV++SS
Sbjct: 195 AHHGLISTYALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSS 254
Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
LP++V + D LL +FL++C + +SVP E F+ K LNIVDPLK +NN
Sbjct: 255 LPDLVTDPPVIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNN 314
Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
LGRSVS+GNF+RIR A +GA++LG+IL P + +FF NTL+RN G R DV V
Sbjct: 315 LGRSVSRGNFYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRNRIGFRQDVLV 374
>gi|414882101|tpg|DAA59232.1| TPA: hypothetical protein ZEAMMB73_861907 [Zea mays]
Length = 906
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 193/360 (53%), Positives = 244/360 (67%), Gaps = 19/360 (5%)
Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
+P + P SI + W E IL TIQP+ SE R +I+Y+QRL+ + G++VFP
Sbjct: 15 APAHPDPSSIPRDAWRRFESAALGILYTIQPSATSEHLRAAIIDYVQRLLASHSGVQVFP 74
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
FGSVPLKTYLPDGDIDLT G +E LA VC IL++E+ +DS F++KDVQY+ A+V
Sbjct: 75 FGSVPLKTYLPDGDIDLTTFGPAISDEKLANEVCAILKSEEHRKDSEFDVKDVQYIHAEV 134
Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K+VKC VQNI VDIS NQ+ GL LCFLEKVD GK HLFKRSV+LIK WCYYE+RILG
Sbjct: 135 KLVKCVVQNIIVDISVNQIGGLCTLCFLEKVDENFGKKHLFKRSVMLIKDWCYYETRILG 194
Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
AH+GLISTYALE++ VL+RFLDYY+ FDWD IS+ GPV++SS
Sbjct: 195 AHHGLISTYALEILVLYIFHIFHKSLNGPLAVLYRFLDYYSQFDWDAKGISLFGPVSLSS 254
Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
LP++V + D LL +FL++C + +SVP E F+ K LNIVDPLK +NN
Sbjct: 255 LPDLVTDPPVIHDDGFLLREKFLRECADAFSVPPRNSEKDAQLFSRKFLNIVDPLKQSNN 314
Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
LGRSVS+GNF+RIR A +GA++LG+IL P + +FF NTL+RN G R DV V
Sbjct: 315 LGRSVSRGNFYRIRSAFDFGARKLGKILQRPVCYTVDEVNQFFGNTLKRNRIGFRQDVLV 374
>gi|357112328|ref|XP_003557961.1| PREDICTED: uncharacterized protein LOC100823912 [Brachypodium
distachyon]
Length = 1051
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 182/330 (55%), Positives = 236/330 (71%), Gaps = 19/330 (5%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
+QP SE++R EVI+Y +R++ G EVF FGSVPLKTYLPDGDIDLTVLG+ + +
Sbjct: 42 VQPTEASERRRAEVIDYARRIVGTALGCEVFAFGSVPLKTYLPDGDIDLTVLGNASCDST 101
Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
L V IL + +Q D+ FE+KD++++ A+VK++KC+++NI VDISFNQ G+ ALCFL
Sbjct: 102 LIDDVYCILGSGEQNSDAEFEVKDLEHIDAEVKLIKCTIENIIVDISFNQTGGICALCFL 161
Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM-------------- 223
E VDR IGK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE +
Sbjct: 162 ELVDRKIGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALETLILYIFNLFHKSLHG 221
Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP ++ E D+LL EFL E
Sbjct: 222 PLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEGTNIPVDDLLFDKEFLHSSVE 281
Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
SVP + +F +KHLNI+DPLK+ NNLGRSV+K NF RIR A SYGA++LG+ L
Sbjct: 282 KASVPPRDSDARCTKFRVKHLNIIDPLKECNNLGRSVNKANFSRIRTAFSYGARKLGQYL 341
Query: 341 TLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
LP + + FF NTL+RNG+G+R D+
Sbjct: 342 MLPSERISGEIFGFFKNTLKRNGRGVRADI 371
>gi|414865289|tpg|DAA43846.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
Length = 1348
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 184/353 (52%), Positives = 237/353 (67%), Gaps = 20/353 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W +AE R E++ IQP SE +R V +Y+QRLI +VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + + ++E A V LE E ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDW+ +C+S+ GPV ISSLP++ AE
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLL+ FL C Y V EN F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRSVSKG 319
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
NF RIR A +YGA+RLG++L P L L +FF NT R+G G RPDV P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRHGSGSRPDVPTP 372
>gi|414865287|tpg|DAA43844.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
gi|414865288|tpg|DAA43845.1| TPA: hypothetical protein ZEAMMB73_609786 [Zea mays]
Length = 1332
Score = 368 bits (944), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 184/353 (52%), Positives = 237/353 (67%), Gaps = 20/353 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W +AE R E++ IQP SE +R V +Y+QRLI +VF FGSVPLKTY
Sbjct: 20 LDPERWAVAEGRTAELIARIQPNAYSEGRRLAVYHYVQRLIMNCLSCQVFTFGSVPLKTY 79
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + + ++E A V LE E ++++ F +K+VQY+ A+VKI+KC V+
Sbjct: 80 LPDGDIDVTAFSNSEELKEIWANLVRDALEREEKNENAEFHVKEVQYIQAEVKIIKCLVE 139
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE++D LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 140 NIVVDISFNQVGGLCTLCFLEEIDNLISENHLFKRSIILIKAWCFYESRILGAHHGLIST 199
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDW+ +C+S+ GPV ISSLP++ AE
Sbjct: 200 YALETLVLYIFHIFNNSFTGPLEVLYRFLEFFSNFDWEKFCLSLWGPVPISSLPDMTAEP 259
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLL+ FL C Y V EN F KH N++DPL+ NNNLGRSVSKG
Sbjct: 260 PRIDSGELLLNKSFLDTCSSAYGVVPHTQENHSQPFISKHFNVIDPLRTNNNLGRSVSKG 319
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
NF RIR A +YGA+RLG++L P L L +FF NT R+G G RPDV P
Sbjct: 320 NFFRIRSAFAYGAKRLGKLLECPKEDLIGELNQFFTNTWIRHGSGSRPDVPTP 372
>gi|224118186|ref|XP_002317752.1| predicted protein [Populus trichocarpa]
gi|222858425|gb|EEE95972.1| predicted protein [Populus trichocarpa]
Length = 353
Score = 367 bits (943), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 183/353 (51%), Positives = 248/353 (70%), Gaps = 20/353 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
++ E W +AEER E++ IQP SE++R V+ Y+QRLI + +VF FGSVPLKTY
Sbjct: 1 LELERWAIAEERTAELIACIQPNQPSEERRTAVLGYVQRLIMKCFPCQVFTFGSVPLKTY 60
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+TV Q++++ A V IL++E+ +++ F +K+VQY+ A+VKI+KC V+
Sbjct: 61 LPDGDIDITVFTESQDLKKTWADEVKDILQHEEKSENAEFHVKEVQYIQAEVKIIKCLVE 120
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE+VD+LI ++HLFKRS+ILIKAWCYYESRILGAH+GLIST
Sbjct: 121 NIVVDISFNQLGGLCTLCFLEEVDQLISQNHLFKRSIILIKAWCYYESRILGAHHGLIST 180
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL++++ FDW+++CIS+ GPV ISSLP + A +
Sbjct: 181 YALETLVLYIFHVFNNRFAGPLEVLYRFLEFFSKFDWEHFCISLWGPVPISSLPNVTALS 240
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
DG ++LLS FL+ C +Y+V + EN F K+ N++DPL+ NNNLGRSVSKG
Sbjct: 241 PREDGGQILLSQLFLEVCSSVYAVFPSQQENQEQSFVSKYFNVIDPLRTNNNLGRSVSKG 300
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
NF+RIR A ++GAQRL +L P L +FF+NT +R+ KG RPD P
Sbjct: 301 NFYRIRSAFAFGAQRLARLLDCPKENLLAEFNQFFMNTWDRHCKGHRPDAPSP 353
>gi|222616508|gb|EEE52640.1| hypothetical protein OsJ_34991 [Oryza sativa Japonica Group]
Length = 801
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 190/359 (52%), Positives = 245/359 (68%), Gaps = 20/359 (5%)
Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
+P N P SI E W E ++ IQP SE +R VI Y+Q L+ G +VFP
Sbjct: 16 TPSNPDPSSISQEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQGLLRFNVGCQVFP 75
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
FGSVPLKTYLPDGDIDLT GH + +E LA+ V +LE+E+ +D+ FE+KDVQY+ A+V
Sbjct: 76 FGSVPLKTYLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEV 134
Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K+VKC VQNI VDISFNQ GL LCFLEKVD+ K+HLFKRS++LIKAWCYYESRILG
Sbjct: 135 KLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYESRILG 194
Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
AH+GLISTYALE++ VL+RFLDYY+ FDWDN IS+ GP+++SS
Sbjct: 195 AHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSS 254
Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
LPE+V ++ + D+ + +FLK+C + ++V E F K NIVDPLK +NN
Sbjct: 255 LPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNN 314
Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
LGRSVSKGNF RIR A +GA++LG+IL +P + + +FF NTL+R+ +RPDVQ
Sbjct: 315 LGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRNTLKRHCSRVRPDVQ 373
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 9/114 (7%)
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPP--SSPSQIPSRSAWDVLC 434
G+ S +DL+GDY + +N LLY Q +H DY + F P PS P+R +
Sbjct: 531 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYQMPSPPPARYQ-NRRS 589
Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFST-DEGEKSRGTGTYIP 487
+ N+ GT P C P +R T+S D+ ++RGTGTY P
Sbjct: 590 SNGHSRNNVYGYAGTNGIGPAP--CPPGYLIMR--TYSQIDDSHRARGTGTYFP 639
>gi|326531888|dbj|BAK01320.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 702
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 187/353 (52%), Positives = 242/353 (68%), Gaps = 19/353 (5%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P SI + W ++ IQP + SE +R V++Y+QRL+ G VFPFGSVPL
Sbjct: 24 PSSISPDAWAPFGAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVFPFGSVPL 83
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDL G +E +A V ILE+E+ +D+ FEIKDVQY+ A+VK+VKC
Sbjct: 84 KTYLPDGDIDLAAFGSTCSDESIANEVRAILESEERRKDAEFEIKDVQYINAEVKLVKCF 143
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
VQNI VDISFNQ+ GL LCFLE+VD+ K+HLFKRS++LIKAWCYYESRILGAH+GLI
Sbjct: 144 VQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRILGAHHGLI 203
Query: 216 STYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE +VL+ RFLDYY+ FDWDN IS++GP+++SSLP++V
Sbjct: 204 STYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLSSLPDLVT 263
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
+ D L EFL++C ++++VP E T F K LNIVDPLK +NNLGRSVS
Sbjct: 264 DPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSNNLGRSVS 323
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
KGNF+RIR A GA++LG+IL +P + + + +FF +TL+RN +RPDVQ
Sbjct: 324 KGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRNRSRVRPDVQ 376
>gi|218192316|gb|EEC74743.1| hypothetical protein OsI_10487 [Oryza sativa Indica Group]
Length = 1316
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 181/351 (51%), Positives = 239/351 (68%), Gaps = 21/351 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W AE R E++ IQP SE++R+ V +Y++RLI +VF FGSVPLKTY
Sbjct: 18 LDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVPLKTY 77
Query: 101 LPDGDIDLTVLG-HQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + +++ A V LE+E+ +++ F +K+VQY+ A+VKI+KC V
Sbjct: 78 LPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIKCLVD 137
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 197
Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE + VL+RFL++++ FDW+ +C+S++GPV ISSLP++ AE
Sbjct: 198 YALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDMTAEP 257
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLGRSVSK 319
D ELLLS FL KC Y+V E+ G + F KH N++DPL+ NNNLGRSVSK
Sbjct: 258 PRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRSVSK 317
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
GNF RIR A S+GA+RL ++L P L + +FF NT R+G G RPD
Sbjct: 318 GNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 368
>gi|108706800|gb|ABF94595.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
sativa Japonica Group]
Length = 1316
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 181/351 (51%), Positives = 239/351 (68%), Gaps = 21/351 (5%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W AE R E++ IQP SE++R+ V +Y++RLI +VF FGSVPLKTY
Sbjct: 18 LDGERWAAAEVRTAELIARIQPNADSERRRRAVYDYVRRLITNCLSCQVFTFGSVPLKTY 77
Query: 101 LPDGDIDLTVLG-HQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + +++ A V LE+E+ +++ F +K+VQY+ A+VKI+KC V
Sbjct: 78 LPDGDIDVTAFSDSEELKDTWANLVRDALEHEEKSENAEFRVKEVQYIQAEVKIIKCLVD 137
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWC+YESRILGAH+GLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDALISQNHLFKRSIILIKAWCFYESRILGAHHGLIST 197
Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE + VL+RFL++++ FDW+ +C+S++GPV ISSLP++ AE
Sbjct: 198 YALETLVLYIFHVFNNCFTGPLEVLYRFLEFFSNFDWEKFCLSLSGPVPISSLPDMTAEP 257
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLGRSVSK 319
D ELLLS FL KC Y+V E+ G + F KH N++DPL+ NNNLGRSVSK
Sbjct: 258 PRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLGRSVSK 317
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
GNF RIR A S+GA+RL ++L P L + +FF NT R+G G RPD
Sbjct: 318 GNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 368
>gi|242041009|ref|XP_002467899.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
gi|241921753|gb|EER94897.1| hypothetical protein SORBIDRAFT_01g036080 [Sorghum bicolor]
Length = 1046
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 177/335 (52%), Positives = 238/335 (71%), Gaps = 19/335 (5%)
Query: 55 EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
E++ ++P SE++R +V++Y +RL+ G EVF FGSVPLKTYLPDGDIDLTVLG+
Sbjct: 35 EVVRRVRPTEASERRRADVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNT 94
Query: 115 NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
+ + L V ILE+E+Q D+ F +K+++ + A+V+++KC++ NI +DISFNQ G+
Sbjct: 95 SYDSTLVNDVYCILESEEQNSDAEFIVKNLERIDAEVRLIKCTIGNIIIDISFNQTGGIC 154
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
ALCFLE VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE++
Sbjct: 155 ALCFLELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYIFNLFH 214
Query: 224 --------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFL 275
VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP + E +LL EFL
Sbjct: 215 KSLHSPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLTVEATITHTSDLLFDKEFL 274
Query: 276 KKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQR 335
K + +VP ++ F KHLNIVDPLK++NNLGRSV++ +F+RIR A YGA++
Sbjct: 275 KSSMDKATVPPKNSDSCYTRFRPKHLNIVDPLKEHNNLGRSVNRASFNRIRTAFLYGARK 334
Query: 336 LGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
LG IL LP + + FF NTLERNG G+RPD+
Sbjct: 335 LGHILMLPSEVIPDEIYGFFKNTLERNGIGVRPDI 369
>gi|326492351|dbj|BAK01959.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 724
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 188/360 (52%), Positives = 243/360 (67%), Gaps = 26/360 (7%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-------IEVF 90
P SI + W E ++ IQP + SE +R V++Y+QRL+ G VF
Sbjct: 24 PSSISPDAWAPFEAAALGVVGRIQPTVASEGRRAAVVDYVQRLVKCSVGCSVPVTPFPVF 83
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQ 148
PFGSVPLKTYLPDGDIDL G +E +A V ILE+E+ +D+ FEIKDVQY+ A+
Sbjct: 84 PFGSVPLKTYLPDGDIDLAAFGSTCSDESIANEVRAILESEERRKDAEFEIKDVQYINAE 143
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VK+VKC VQNI VDISFNQ+ GL LCFLE+VD+ K+HLFKRS++LIKAWCYYESRIL
Sbjct: 144 VKLVKCFVQNIVVDISFNQIGGLYTLCFLEQVDQRFEKNHLFKRSIVLIKAWCYYESRIL 203
Query: 209 GAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAIS 251
GAH+GLISTYALE +VL+ RFLDYY+ FDWDN IS++GP+++S
Sbjct: 204 GAHHGLISTYALETLVLYIFHLFHESLDGPLAVLYRFLDYYSKFDWDNRGISLHGPISLS 263
Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
SLP++V + D L EFL++C ++++VP E T F K LNIVDPLK +N
Sbjct: 264 SLPDLVTDPPGIHDDCFLEREEFLRECAQMFTVPPRHYERTTRPFPRKFLNIVDPLKPSN 323
Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
NLGRSVSKGNF+RIR A GA++LG+IL +P + + + +FF +TL+RN +RPDVQ
Sbjct: 324 NLGRSVSKGNFYRIRSAFDLGARKLGKILQVPANSIVDEVNQFFRSTLKRNRSRVRPDVQ 383
>gi|414866687|tpg|DAA45244.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
Length = 1050
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 244/351 (69%), Gaps = 22/351 (6%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
++P SE++R EV++Y +RL+ G EVF FGSVPLKTYLPDGDIDLTVLG+ + +
Sbjct: 38 VRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDST 97
Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
L V ILE+E+Q D+ F +KD++ + A+V+++KC++ NI VDISFNQ G+ ALCFL
Sbjct: 98 LVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCFL 157
Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM-------------- 223
E VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE++
Sbjct: 158 ELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSLHS 217
Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP ++ E +LL EFLK +
Sbjct: 218 PVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSSMD 277
Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
+VP ++ F KHLNIVDPLK+ NNLGRSV++ +F+RIR A YGA++LG I+
Sbjct: 278 KATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGHIV 337
Query: 341 TLPGHCLGMGLEKFFINTLERNGKGLRPDV--QVPIH-AFGSGKSEAADLS 388
TLP + + +FF NTL RN G RPD+ H +FG+ ++ D+S
Sbjct: 338 TLPSEVIPDEIYEFFKNTLGRNELGARPDIDSNYAFHSSFGTAETILEDIS 388
>gi|414866686|tpg|DAA45243.1| TPA: hypothetical protein ZEAMMB73_273182 [Zea mays]
Length = 1056
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 244/351 (69%), Gaps = 22/351 (6%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
++P SE++R EV++Y +RL+ G EVF FGSVPLKTYLPDGDIDLTVLG+ + +
Sbjct: 38 VRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDST 97
Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
L V ILE+E+Q D+ F +KD++ + A+V+++KC++ NI VDISFNQ G+ ALCFL
Sbjct: 98 LVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCFL 157
Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM-------------- 223
E VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE++
Sbjct: 158 ELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVFNLFHKSLHS 217
Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP ++ E +LL EFLK +
Sbjct: 218 PVEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNLIVEATVTHTSDLLFDKEFLKSSMD 277
Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
+VP ++ F KHLNIVDPLK+ NNLGRSV++ +F+RIR A YGA++LG I+
Sbjct: 278 KATVPPKNSDSCYPRFRPKHLNIVDPLKEYNNLGRSVNRASFNRIRTAFLYGARKLGHIV 337
Query: 341 TLPGHCLGMGLEKFFINTLERNGKGLRPDV--QVPIH-AFGSGKSEAADLS 388
TLP + + +FF NTL RN G RPD+ H +FG+ ++ D+S
Sbjct: 338 TLPSEVIPDEIYEFFKNTLGRNELGARPDIDSNYAFHSSFGTAETILEDIS 388
>gi|42566126|ref|NP_191728.2| nucleotidyltransferase [Arabidopsis thaliana]
gi|332646720|gb|AEE80241.1| nucleotidyltransferase [Arabidopsis thaliana]
Length = 1303
Score = 362 bits (929), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 186/381 (48%), Positives = 250/381 (65%), Gaps = 33/381 (8%)
Query: 23 HTSSSSSYSSPPNLYPFSI------------DAELWLLAEERIQEILCTIQPAIVSEKKR 70
H S ++S SP L+P + DAE W AE+R +++ IQP SE +R
Sbjct: 4 HESWAASPPSPSGLHPNGLLPGKAASVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRR 63
Query: 71 KEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKIL 128
V +Y++RLI + +++F FGSVPLKTYLPDGDIDLT +QN+++ A V +L
Sbjct: 64 NAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDML 123
Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
E E+++ F +K+VQY+ A+VKI+KC V+NI VDISFNQ+ GL LCFLE+VD I +
Sbjct: 124 EKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQ 183
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
+HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+ RFL
Sbjct: 184 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFL 243
Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
++++ FDW N+C+S+ GPV +SSLP++ AE D EL +S F + C +Y+V I
Sbjct: 244 EFFSKFDWQNFCLSLWGPVPVSSLPDVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQ 303
Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
E G F KH N++DPL++NNNLGRSVSKGNF RIR A + GA++L +L P L
Sbjct: 304 EIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIH 363
Query: 350 GLEKFFINTLERNGKGLRPDV 370
+ +FF+NT ER+G G RPD
Sbjct: 364 EVNQFFMNTWERHGSGRRPDA 384
>gi|77548394|gb|ABA91191.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
Length = 783
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 186/351 (52%), Positives = 240/351 (68%), Gaps = 20/351 (5%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
SI E W E ++ IQP SE +R VI Y+Q L+ G +VFPFGSVPLKT
Sbjct: 24 SISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKT 83
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
YLPDGDIDLT GH + +E LA+ V +LE+E+ +D+ FE+KDVQY+ A+VK+VKC VQ
Sbjct: 84 YLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIVQ 142
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ GL LCFLEKVD+ K HLFKRS++LIKAWCYYESRILGAH+GLIST
Sbjct: 143 NIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLIST 202
Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE++ VL+RFLDYY+ FDWDN IS+ GP+++SSLPE+V ++
Sbjct: 203 YALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTDS 262
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
+ D+ + +FLK+C + ++V E F K NIVDPLK +NNLGRSVSKG
Sbjct: 263 PDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSKG 322
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
NF RIR A +GA++LG+I+ +P + + +FF NTL+R+ +RPDVQ
Sbjct: 323 NFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRHCSRVRPDVQ 373
Score = 43.1 bits (100), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 3/111 (2%)
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPPSSPSQIPSRSAWDVLCQY 436
G+ S +DL+GDY + +N LLY Q +H DY + F P P P + +
Sbjct: 531 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYPIPSPPPARYQNRRSS 590
Query: 437 VQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
N VY + + P C P +R + D+ ++RGTGTY P
Sbjct: 591 NGHSRNNVYGYAGTNGIGPAP-CPPGYLIMRPYS-QIDDSNRARGTGTYFP 639
>gi|6850860|emb|CAB71099.1| putative protein [Arabidopsis thaliana]
Length = 1388
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 186/381 (48%), Positives = 250/381 (65%), Gaps = 33/381 (8%)
Query: 23 HTSSSSSYSSPPNLYPFSI------------DAELWLLAEERIQEILCTIQPAIVSEKKR 70
H S ++S SP L+P + DAE W AE+R +++ IQP SE +R
Sbjct: 4 HESWAASPPSPSGLHPNGLLPGKAASVTRPLDAERWAKAEDRTAKLIACIQPNPPSEDRR 63
Query: 71 KEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKIL 128
V +Y++RLI + +++F FGSVPLKTYLPDGDIDLT +QN+++ A V +L
Sbjct: 64 NAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDML 123
Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
E E+++ F +K+VQY+ A+VKI+KC V+NI VDISFNQ+ GL LCFLE+VD I +
Sbjct: 124 EKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQ 183
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
+HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+ RFL
Sbjct: 184 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFL 243
Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
++++ FDW N+C+S+ GPV +SSLP++ AE D EL +S F + C +Y+V I
Sbjct: 244 EFFSKFDWQNFCLSLWGPVPVSSLPDVTAEPPRRDVGELRVSEAFYRACSRVYAVNIAPQ 303
Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
E G F KH N++DPL++NNNLGRSVSKGNF RIR A + GA++L +L P L
Sbjct: 304 EIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLTRLLECPKENLIH 363
Query: 350 GLEKFFINTLERNGKGLRPDV 370
+ +FF+NT ER+G G RPD
Sbjct: 364 EVNQFFMNTWERHGSGRRPDA 384
>gi|115483835|ref|NP_001065579.1| Os11g0114700 [Oryza sativa Japonica Group]
gi|77548393|gb|ABA91190.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113644283|dbj|BAF27424.1| Os11g0114700 [Oryza sativa Japonica Group]
gi|215694848|dbj|BAG90039.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218185112|gb|EEC67539.1| hypothetical protein OsI_34858 [Oryza sativa Indica Group]
gi|222615390|gb|EEE51522.1| hypothetical protein OsJ_32709 [Oryza sativa Japonica Group]
Length = 801
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 186/351 (52%), Positives = 240/351 (68%), Gaps = 20/351 (5%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
SI E W E ++ IQP SE +R VI Y+Q L+ G +VFPFGSVPLKT
Sbjct: 24 SISPEAWDPLEAAAGAVVARIQPNPPSEDRRAAVIAYVQHLLRCTVGCQVFPFGSVPLKT 83
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
YLPDGDIDLT GH + +E LA+ V +LE+E+ +D+ FE+KDVQY+ A+VK+VKC VQ
Sbjct: 84 YLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYIHAEVKLVKCIVQ 142
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ GL LCFLEKVD+ K HLFKRS++LIKAWCYYESRILGAH+GLIST
Sbjct: 143 NIIVDISFNQFGGLCTLCFLEKVDQKFEKYHLFKRSIMLIKAWCYYESRILGAHHGLIST 202
Query: 218 YALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE++ VL+RFLDYY+ FDWDN IS+ GP+++SSLPE+V ++
Sbjct: 203 YALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDWDNKGISLYGPISLSSLPELVTDS 262
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
+ D+ + +FLK+C + ++V E F K NIVDPLK +NNLGRSVSKG
Sbjct: 263 PDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFPRKFFNIVDPLKQSNNLGRSVSKG 322
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ 371
NF RIR A +GA++LG+I+ +P + + +FF NTL+R+ +RPDVQ
Sbjct: 323 NFLRIRSAFDFGARKLGKIIQVPDNFTMDEVNQFFRNTLKRHCSRVRPDVQ 373
Score = 43.1 bits (100), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 50/111 (45%), Gaps = 3/111 (2%)
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPPSSPSQIPSRSAWDVLCQY 436
G+ S +DL+GDY + +N LLY Q +H DY + F P P P + +
Sbjct: 531 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYPIPSPPPARYQNRRSS 590
Query: 437 VQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
N VY + + P C P +R + D+ ++RGTGTY P
Sbjct: 591 NGHSRNNVYGYAGTNGIGPAP-CPPGYLIMRPYS-QIDDSNRARGTGTYFP 639
>gi|297817502|ref|XP_002876634.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
gi|297322472|gb|EFH52893.1| nucleotidyltransferase [Arabidopsis lyrata subsp. lyrata]
Length = 1302
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 186/381 (48%), Positives = 250/381 (65%), Gaps = 33/381 (8%)
Query: 23 HTSSSSSYSSPPNLYPFSI------------DAELWLLAEERIQEILCTIQPAIVSEKKR 70
H S ++S SP LYP + DAE W AE+R +++ IQP SE +R
Sbjct: 4 HESWAASPPSPTLLYPNGLLPGKAASVTRQLDAERWAKAEDRTAKLIACIQPNPPSEDRR 63
Query: 71 KEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKIL 128
V +Y++RLI + +++F FGSVPLKTYLPDGDIDLT +QN+++ A V +L
Sbjct: 64 NAVASYVRRLIMECFPQVQIFMFGSVPLKTYLPDGDIDLTAFSANQNLKDSWANLVRDML 123
Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
E E+++ F +K+VQY+ A+VKI+KC V+NI VDISFNQ+ GL LCFLE+VD I +
Sbjct: 124 EKEEKNENAEFHVKEVQYIQAEVKIIKCLVENIVVDISFNQIGGLCTLCFLEEVDHYINQ 183
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFL 229
+HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+ RFL
Sbjct: 184 NHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFYLFNNSFSGPLEVLYRFL 243
Query: 230 DYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITAL 289
++++ FDW N+C+S+ GPV +SSLP++ A D EL +S F + C ++Y+V I
Sbjct: 244 EFFSKFDWQNFCLSLWGPVPVSSLPDVTAAPPRKDVGELRVSEAFYRACSKVYAVNIAPQ 303
Query: 290 ENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGM 349
E G F KH N++DPL++NNNLGRSVSKGNF RIR A + GA++L +L P L
Sbjct: 304 EIQGQPFVSKHFNVIDPLRENNNLGRSVSKGNFFRIRSAFTLGAKKLARLLECPKENLIH 363
Query: 350 GLEKFFINTLERNGKGLRPDV 370
+ +FF+NT ER+G G RPD
Sbjct: 364 EVNQFFMNTWERHGSGRRPDA 384
>gi|357113459|ref|XP_003558520.1| PREDICTED: uncharacterized protein LOC100841269 [Brachypodium
distachyon]
Length = 1305
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 181/369 (49%), Positives = 237/369 (64%), Gaps = 24/369 (6%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W +AE R E++ IQP SE +R V NY++RLI EVF FGSVPLKTY
Sbjct: 18 LDPERWAVAESRTAELIARIQPNAHSEGRRLAVYNYVRRLIMNCLSCEVFTFGSVPLKTY 77
Query: 101 LPDGDIDLTVLGH-QNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQ 157
LPDGDID+T + + +++ A V LE+E+ +++ F +K+VQY+ A+VKI+KC V
Sbjct: 78 LPDGDIDVTAFSNSEELKDTWANLVRDALEHEEKSENAEFCVKEVQYIQAEVKIIKCLVD 137
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI VDISFNQ+ GL LCFLE+VD LI HLFKRS+IL+KAWC+YESRILGAH+GLIST
Sbjct: 138 NIVVDISFNQVGGLCTLCFLEEVDNLINHSHLFKRSIILVKAWCFYESRILGAHHGLIST 197
Query: 218 YALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAET 260
YALE +VL+ RFL+++ FDW+ +C+S+ GPV ISSLP++ AE
Sbjct: 198 YALETLVLYIFHVFNNSFTGPLEVLYRFLEFFGNFDWEKFCLSLWGPVPISSLPDMTAEP 257
Query: 261 VENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
D ELLL FL C + Y V E G F KH N++DPL+ NNNLGRSV KG
Sbjct: 258 PRMDTGELLLGKPFLDNCNQAYGVMPRTQETQGQPFVSKHFNVIDPLRTNNNLGRSVGKG 317
Query: 321 NFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQ----VPIHA 376
N+ RIR A +GA++L ++L P + + +FF NTL R+G G RPD VP H
Sbjct: 318 NYFRIRSAFCFGAKKLAKLLECPKEDIITEVNQFFTNTLTRHGSGNRPDAPTPNLVPEHT 377
Query: 377 FGSGKSEAA 385
+EA+
Sbjct: 378 LKVAPAEAS 386
>gi|302802985|ref|XP_002983246.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
gi|300148931|gb|EFJ15588.1| hypothetical protein SELMODRAFT_43579 [Selaginella moellendorffii]
Length = 351
Score = 358 bits (918), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 184/354 (51%), Positives = 241/354 (68%), Gaps = 31/354 (8%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E WL AE R E++ IQP SE +R+ V +Y++RLI + EVF FGSVPL+TY
Sbjct: 1 LDDERWLQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60
Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV-KIVKCSV 156
LPDGDIDLT GHQ+++E A V +LE E+ +D+ F +K+VQY+ A+V KI+KC V
Sbjct: 61 LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
+NI VDISFNQ+ GL LCFLE+VDRLIG+DHLFKRS+IL+KAWCYYESRILGAH+GLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180
Query: 217 TYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
TYALE + VL+RFL++++ FDWD YC+S+ GP+ +S+LP++
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237
Query: 260 TVENDGDELLLSPEFLKKCREIYSV----PITALENTGHEFAIKHLNIVDPLKDNNNLGR 315
DG LLL+ FL C Y+V I F K+LN+VDPLK NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294
Query: 316 SVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPD 369
SV+KGNF+RIR A +GA++L IL P + ++KFF+NT +R+G G RPD
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRHGSGRRPD 348
>gi|115452887|ref|NP_001050044.1| Os03g0336700 [Oryza sativa Japonica Group]
gi|108708028|gb|ABF95823.1| expressed protein [Oryza sativa Japonica Group]
gi|113548515|dbj|BAF11958.1| Os03g0336700 [Oryza sativa Japonica Group]
Length = 1035
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 234/341 (68%), Gaps = 24/341 (7%)
Query: 49 AEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDL 108
AEE E++ ++P SE++R V+ Y +RL+ G EVF +GSVPLKTYLPDGD+DL
Sbjct: 36 AEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVDL 95
Query: 109 TVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFN 166
TVLG+ + L + IL++E+Q D+ FE+KD+Q + A+V+++KC+++NI VDISFN
Sbjct: 96 TVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVVDISFN 155
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--- 223
Q G+ ALCFLE VDR +GK+HL K S+ILIKAWCYYESR+LGAH+GLISTYALE +
Sbjct: 156 QTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALETLILY 215
Query: 224 --------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL 269
VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP + E G +LL
Sbjct: 216 IFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTPGSDLL 275
Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCAL 329
EFL SV T EF K+LNI+DPLK++NNLGRSV+K +F+RIR A
Sbjct: 276 FDKEFLNN-----SVQKTDSNACNTEFRSKYLNIIDPLKEHNNLGRSVNKASFNRIRTAF 330
Query: 330 SYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
SYGAQ+LG++L L + + FF NTL R G G+RPD+
Sbjct: 331 SYGAQKLGQVLLLQPELIPDEIYGFFKNTLNRIGSGVRPDI 371
>gi|108708029|gb|ABF95824.1| expressed protein [Oryza sativa Japonica Group]
Length = 1004
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 177/341 (51%), Positives = 234/341 (68%), Gaps = 24/341 (7%)
Query: 49 AEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDL 108
AEE E++ ++P SE++R V+ Y +RL+ G EVF +GSVPLKTYLPDGD+DL
Sbjct: 36 AEEAAGEVVRRVRPTEASERRRAAVVGYARRLVGTALGCEVFAYGSVPLKTYLPDGDVDL 95
Query: 109 TVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFN 166
TVLG+ + L + IL++E+Q D+ FE+KD+Q + A+V+++KC+++NI VDISFN
Sbjct: 96 TVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLINAEVRLIKCTIENIVVDISFN 155
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--- 223
Q G+ ALCFLE VDR +GK+HL K S+ILIKAWCYYESR+LGAH+GLISTYALE +
Sbjct: 156 QTGGICALCFLELVDRKVGKNHLVKNSIILIKAWCYYESRLLGAHHGLISTYALETLILY 215
Query: 224 --------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL 269
VL+RFL+Y++ FDWDNYCIS+NGPVA+SSLP + E G +LL
Sbjct: 216 IFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVALSSLPNQIVEATNTPGSDLL 275
Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCAL 329
EFL SV T EF K+LNI+DPLK++NNLGRSV+K +F+RIR A
Sbjct: 276 FDKEFLNN-----SVQKTDSNACNTEFRSKYLNIIDPLKEHNNLGRSVNKASFNRIRTAF 330
Query: 330 SYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
SYGAQ+LG++L L + + FF NTL R G G+RPD+
Sbjct: 331 SYGAQKLGQVLLLQPELIPDEIYGFFKNTLNRIGSGVRPDI 371
>gi|302755776|ref|XP_002961312.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
gi|300172251|gb|EFJ38851.1| hypothetical protein SELMODRAFT_70578 [Selaginella moellendorffii]
Length = 351
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 183/354 (51%), Positives = 241/354 (68%), Gaps = 31/354 (8%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D E W+ AE R E++ IQP SE +R+ V +Y++RLI + EVF FGSVPL+TY
Sbjct: 1 LDDERWVQAENRTGELITRIQPTKFSEDRRRAVADYVERLIRKCFDCEVFTFGSVPLRTY 60
Query: 101 LPDGDIDLTVL-GHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV-KIVKCSV 156
LPDGDIDLT GHQ+++E A V +LE E+ +D+ F +K+VQY+ A+V KI+KC V
Sbjct: 61 LPDGDIDLTAFSGHQHLQESWANDVRAVLEAEERSKDAEFRVKEVQYIQAEVVKIIKCLV 120
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
+NI VDISFNQ+ GL LCFLE+VDRLIG+DHLFKRS+IL+KAWCYYESRILGAH+GLIS
Sbjct: 121 ENIVVDISFNQLGGLCTLCFLEEVDRLIGRDHLFKRSIILVKAWCYYESRILGAHHGLIS 180
Query: 217 TYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
TYALE + VL+RFL++++ FDWD YC+S+ GP+ +S+LP++
Sbjct: 181 TYALETLVLYIFHVFHASLRGPLGVLYRFLEFFSNFDWDKYCLSLWGPIPLSALPDM--- 237
Query: 260 TVENDGDELLLSPEFLKKCREIYSV----PITALENTGHEFAIKHLNIVDPLKDNNNLGR 315
DG LLL+ FL C Y+V I F K+LN+VDPLK NNLGR
Sbjct: 238 ---QDGGPLLLTKHFLDSCSRAYAVMPNGNINGSIVQSRVFGSKYLNVVDPLKTTNNLGR 294
Query: 316 SVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPD 369
SV+KGNF+RIR A +GA++L IL P + ++KFF+NT +R+G G RPD
Sbjct: 295 SVNKGNFYRIRNAFGFGARKLARILECPLEDVADEVDKFFLNTWDRHGSGRRPD 348
>gi|326490774|dbj|BAJ90054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1030
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 172/329 (52%), Positives = 231/329 (70%), Gaps = 19/329 (5%)
Query: 61 QPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL 120
QP S+++R EV+++ +R++ G EVF FGSVPLKTYLPDGDIDLTV+G+ + L
Sbjct: 43 QPTQASDRRRAEVVDHARRIVGTALGCEVFVFGSVPLKTYLPDGDIDLTVIGNTSCGSTL 102
Query: 121 ARFVCKILEN--EDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLE 178
V ILE+ E+ D+ FE+KD++++ A+V+++KC++ NI VDISFNQ G+ A+ FLE
Sbjct: 103 IDDVYHILESGEENGDAEFEVKDLEHIDAEVRLIKCTIGNIIVDISFNQTGGICAVSFLE 162
Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------------- 223
VDR +GK+HLFKRS+ILIK WCYYESR+LGAH+GLISTYALE +
Sbjct: 163 LVDRKVGKNHLFKRSIILIKGWCYYESRLLGAHHGLISTYALETLILYVFNLFHKSLHGP 222
Query: 224 --VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
VL+RFL+Y++ FDWD YCIS+NGPVA+SSLP ++ E + GD+LL EFL E
Sbjct: 223 LEVLYRFLEYFSKFDWDKYCISLNGPVALSSLPNLIVEGLNVPGDDLLFDREFLDNSVEK 282
Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
S P + +F +K LNI+DPLK+ NNLGRSV++ NFHRIR A S+GA++LG+IL
Sbjct: 283 ASAPPRNSDARCSKFRVKCLNIIDPLKECNNLGRSVNRANFHRIRTAFSFGARKLGQILM 342
Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDV 370
LP + + FF NTLERN G+R D+
Sbjct: 343 LPPELIPDDIFAFFKNTLERNENGVRSDI 371
>gi|414591190|tpg|DAA41761.1| TPA: hypothetical protein ZEAMMB73_453733 [Zea mays]
Length = 918
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 173/358 (48%), Positives = 237/358 (66%), Gaps = 23/358 (6%)
Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP 91
+PP P +I + +AE E+L + P +E++R++VI Y+ RLI G EVF
Sbjct: 61 APP---PSAIRRDAVRVAEAAAGEVLLRVHPTREAERRRQDVIAYLTRLIGSSLGCEVFA 117
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQV 149
FGSVPL+TYLPDGD+D+TVLG+ + L V +L++E + D+ ++ + ++ A+V
Sbjct: 118 FGSVPLRTYLPDGDVDITVLGNTWLNSTLIDDVRSMLQSEQENCDAELKLTGLHFIDAEV 177
Query: 150 KIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K++KC ++NI VD+SFNQ+ G+S CFLE VDR +GK+HLFKRS++L KAWCY+ESRILG
Sbjct: 178 KLIKCVIENIIVDVSFNQIGGVSTFCFLELVDRQVGKNHLFKRSIMLTKAWCYHESRILG 237
Query: 210 AHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISS 252
AH+GLISTYALE + VL++FL+Y++ FDWD Y IS+NGPV +SS
Sbjct: 238 AHHGLISTYALETLVLYIFNMFHKSLHGPLEVLYKFLEYFSKFDWDRYGISLNGPVDLSS 297
Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNN 312
LP + E E G ELLL +F + + V + +F K LNIVDPLK NNN
Sbjct: 298 LPSLTVEPTEVQG-ELLLGKDFHQGSLDRLVVIPNEFDGCDTQFRQKFLNIVDPLKANNN 356
Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
LGRSVSK NF+RIR A S+GAQ+LG+IL LP + + FF NTL+R+GKG R D+
Sbjct: 357 LGRSVSKANFYRIRSAFSFGAQKLGQILLLPSEYICDEIYGFFSNTLKRHGKGERLDI 414
>gi|326517667|dbj|BAK03752.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 334
Score = 331 bits (848), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 170/304 (55%), Positives = 214/304 (70%), Gaps = 19/304 (6%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P SI A W E+ ++ IQP++ SE +R V++Y+QRLI G EVFPFGSVPL
Sbjct: 27 PSSISAGAWRPFEDAAAAVVGRIQPSVSSEDRRAAVVHYVQRLIRCSVGCEVFPFGSVPL 86
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCS 155
KTYLPDGDIDLT G + +E+LA V +LE+E+ +D+ FE+KDVQY+ A+VK+VKC
Sbjct: 87 KTYLPDGDIDLTAFGSASSDENLANEVRAVLESEELRKDAEFEVKDVQYIHAEVKLVKCL 146
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
VQNI VDISFNQ+ GL LCFLE+VD GK HLFK+S++LIKAWCYYESRILGAH+GLI
Sbjct: 147 VQNIVVDISFNQIGGLCTLCFLEQVDERFGKKHLFKKSIMLIKAWCYYESRILGAHHGLI 206
Query: 216 STYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
STYALE++ VL+RFLDYY+ FDWDN IS+ GPV +SSLPE+V+
Sbjct: 207 STYALEILVLYIFHLFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLYGPVPLSSLPELVS 266
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
+T + + L EFLK+ ++++VP + E F K LNIVDPLK NNNLGRSVS
Sbjct: 267 DTPDTHDVDFLKREEFLKEFAQMFTVPPRSFERNNRLFLRKFLNIVDPLKQNNNLGRSVS 326
Query: 319 KGNF 322
KG F
Sbjct: 327 KGFF 330
>gi|115488182|ref|NP_001066578.1| Os12g0283100 [Oryza sativa Japonica Group]
gi|77554657|gb|ABA97453.1| Nucleotidyltransferase domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|113649085|dbj|BAF29597.1| Os12g0283100 [Oryza sativa Japonica Group]
gi|222616913|gb|EEE53045.1| hypothetical protein OsJ_35772 [Oryza sativa Japonica Group]
Length = 989
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 166/334 (49%), Positives = 230/334 (68%), Gaps = 30/334 (8%)
Query: 56 ILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQN 115
+L + P +E++R++V+ Y++RL+ G EV FGSVPLK+YLPDGD+D+TVLG+
Sbjct: 56 VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGNTA 115
Query: 116 VEEDLARFVCKILENEDQDS--VFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSA 173
++ V ILE+E+QDS EIK + ++ A+VK++KC ++NI VDISFNQ+ G+S
Sbjct: 116 LDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVST 175
Query: 174 LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV--------- 224
LCFLE DR +GK+HLFKRS++LIKAWCY+ESRILGAH+GL+STYALE +V
Sbjct: 176 LCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFHK 235
Query: 225 --------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
L++FL+Y++ FDWD YCIS+NGPV +SSLP E + DELL + L
Sbjct: 236 SLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-SSIQDELLFGKKTLP 294
Query: 277 KCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
+ + ++ F +KHLNI+DPLK +NNLGRSVS+G+F+RIR ALS+GAQ+L
Sbjct: 295 EVSDGSNI----------NFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQKL 344
Query: 337 GEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
G+IL L + + FF NTL+R+G+G R DV
Sbjct: 345 GQILMLHSDLIPTEIFGFFANTLKRHGRGERSDV 378
>gi|218186672|gb|EEC69099.1| hypothetical protein OsI_37998 [Oryza sativa Indica Group]
Length = 989
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 166/334 (49%), Positives = 230/334 (68%), Gaps = 30/334 (8%)
Query: 56 ILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQN 115
+L + P +E++R++V+ Y++RL+ G EV FGSVPLK+YLPDGD+D+TVLG+
Sbjct: 56 VLLRVAPTEEAERRRQDVVGYLRRLLGTALGCEVIAFGSVPLKSYLPDGDVDITVLGNTA 115
Query: 116 VEEDLARFVCKILENEDQDS--VFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSA 173
++ V ILE+E+QDS EIK + ++ A+VK++KC ++NI VDISFNQ+ G+S
Sbjct: 116 LDGACISDVHSILESEEQDSGAELEIKGLHFIDAEVKLIKCVIENIVVDISFNQIGGVST 175
Query: 174 LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV--------- 224
LCFLE DR +GK+HLFKRS++LIKAWCY+ESRILGAH+GL+STYALE +V
Sbjct: 176 LCFLELADRKVGKNHLFKRSIMLIKAWCYHESRILGAHHGLLSTYALETLVLYIFNIFHK 235
Query: 225 --------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
L++FL+Y++ FDWD YCIS+NGPV +SSLP E + DELL + L
Sbjct: 236 SLHGPLEALYKFLEYFSKFDWDKYCISLNGPVLLSSLPSPAVEP-SSIQDELLFGKKTLP 294
Query: 277 KCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
+ + ++ F +KHLNI+DPLK +NNLGRSVS+G+F+RIR ALS+GAQ+L
Sbjct: 295 EVSDGSNI----------NFCLKHLNIIDPLKWSNNLGRSVSRGSFYRIRGALSFGAQKL 344
Query: 337 GEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
G+IL L + + FF NTL+R+G+G R DV
Sbjct: 345 GQILMLHSDLIPTEIFGFFANTLKRHGRGERSDV 378
>gi|414888115|tpg|DAA64129.1| TPA: hypothetical protein ZEAMMB73_121752 [Zea mays]
Length = 942
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 166/335 (49%), Positives = 226/335 (67%), Gaps = 23/335 (6%)
Query: 55 EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
E++ + P +E++R++VI Y++RLI G EVF FGSVPL+TYLPDGD+D+TVLG+
Sbjct: 69 EVVLRVHPTREAERRRQDVIAYLRRLIGSCLGCEVFAFGSVPLRTYLPDGDVDITVLGNT 128
Query: 115 NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
+ V +L++E + D+ F++ +Q++ A+VK++KC ++NI VD+SFNQ+ G+S
Sbjct: 129 WLNSTFIDDVRSMLQSEQENCDAEFKLTGLQFINAEVKLIKCVIENIIVDVSFNQIGGVS 188
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV-------- 224
CFLE VDR IG++HLFKRS++LIKAWCY+ESRILGAH+GLISTYALE +V
Sbjct: 189 TFCFLELVDRQIGQNHLFKRSIMLIKAWCYHESRILGAHHGLISTYALETLVLYIFNMFH 248
Query: 225 ---------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFL 275
L+RFL+Y++ FDWD Y IS+NG V +SSL E + G E LL E
Sbjct: 249 KSLHGPLEALYRFLEYFSKFDWDRYGISLNGQVDLSSL---TVEPTDVQG-ESLLGKELQ 304
Query: 276 KKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQR 335
+ + V + G +F K LNI+DPLK NNNLGRSVSK NF+RIR A S+GAQ+
Sbjct: 305 QGYLDRLVVIPNEFDGCGTQFRQKFLNIIDPLKANNNLGRSVSKANFYRIRSAFSFGAQK 364
Query: 336 LGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
LG+IL LP + + FF NTL+R+G G RPDV
Sbjct: 365 LGQILLLPSEYIRDEIYGFFANTLKRHGNGERPDV 399
>gi|242082774|ref|XP_002441812.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
gi|241942505|gb|EES15650.1| hypothetical protein SORBIDRAFT_08g002707 [Sorghum bicolor]
Length = 546
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 212/532 (39%), Positives = 281/532 (52%), Gaps = 93/532 (17%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE--------- 88
P SI + W E ++ IQP + SE R VI+Y++RL+ G++
Sbjct: 26 PSSIPPDAWRRFESAALGVVNKIQPTVASENFRSAVIDYLKRLLGSRAGVQSWLLPFLPF 85
Query: 89 ------------------------------------VFPFGSVPLKTYLPDGDIDLTVLG 112
VFPFGSVPLKTYLPDGDIDLT
Sbjct: 86 HFYVFFGAKPVRDYEYKCVTVWIYFVGCALESLCDLVFPFGSVPLKTYLPDGDIDLTAFS 145
Query: 113 HQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAG 170
+E+LA V IL +E +DS F++KDVQY+ A+VK+VKC VQNI VDIS NQ+ G
Sbjct: 146 PAISDENLANQVYAILSSEQHRKDSEFDVKDVQYIHAEVKLVKCLVQNIVVDISVNQIGG 205
Query: 171 LSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM------- 223
LS LCFLEKVD GK HL KRS++LIK WCYYESRILGA GL+STYALE++
Sbjct: 206 LSTLCFLEKVDENFGKKHLLKRSIVLIKDWCYYESRILGAQNGLLSTYALEVLVLYVFLI 265
Query: 224 ----------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE--TVENDGDELLLS 271
VL+RFLD+Y+ FDWD+ IS+ GPV++SSLP +V + D D +
Sbjct: 266 FHRSLGGPLAVLYRFLDFYSKFDWDSKGISLFGPVSLSSLPNLVTDPHLPAIDDDFFVPR 325
Query: 272 PEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSY 331
+ L+K E +S P E F+ K LNIVDPLK +NNLGRSV+KGNF+RIR A +
Sbjct: 326 EKILRKYAEDFSAPPRNSERDAQVFSRKFLNIVDPLKQSNNLGRSVNKGNFYRIRSAFDF 385
Query: 332 GAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRP-DVQVPIHAFGSG------KSEA 384
GA++LG+IL +P + +FF NTL+RN G RP D P+++ G +
Sbjct: 386 GARKLGKILQMPVCYTVNEVNQFFSNTLKRNHTGFRPDDGSSPLYSNPYGDLSYQFNNIL 445
Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTLPVTA----QFSPPSSPSQIPSRSAWDVLCQYVQGK 440
DL+GD++ + L Y Q + P + + +PP P Q + D QG+
Sbjct: 446 NDLTGDFEINFRNLQYAQGFQQVN-PANSFYYGRLAPP--PLQYQNTHPSD-----SQGR 497
Query: 441 GNLVYQWGTEVFVPRLPFCHPYASQVRA--STFSTDE--GEKSRGTGTYIPE 488
N +P C PY S A D+ ++RG GTY P+
Sbjct: 498 KNPHAYASMNGAIP----CPPYPSGCLAWRPFIQPDDPMAMRARGIGTYFPD 545
>gi|413924678|gb|AFW64610.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 474
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 172/342 (50%), Positives = 222/342 (64%), Gaps = 20/342 (5%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLK 98
SI + W E ++C I P + S+ R VI+Y+QRL ++ G +V FGSVPLK
Sbjct: 66 SISPDDWRRLEGATFSVMCKIHPTVSSQHLRARVIDYVQRLFRLHHDGYQVISFGSVPLK 125
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
TYLPDGDIDLT+L +E+L VC IL++E+Q DS FE+KDV+YVPA+VK+VKC V
Sbjct: 126 TYLPDGDIDLTLLCAAISDENLENEVCAILKSEEQRKDSEFEVKDVKYVPAEVKLVKCKV 185
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
QNI VDIS NQ+ G + + FLEKVD+ +GK++L +RS++LIK WCYYES ILGA GL+S
Sbjct: 186 QNIAVDISVNQIGGPNKVYFLEKVDQNLGKNNLLRRSIMLIKHWCYYESCILGAQRGLVS 245
Query: 217 TYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
TYALE + VL+RFLDYY+ FDWDN IS+ GP+++SSLPE+V E
Sbjct: 246 TYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDWDNKGISLFGPISLSSLPELVTE 305
Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
D L FLK C + +SVP E F+ K +NIVDPLK +NNLGRS+SK
Sbjct: 306 PPYTRDDGFLSREAFLKDCAKAFSVPPINSEENPQVFSKKFVNIVDPLKQSNNLGRSISK 365
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLER 361
GN RIR +GA +LG+IL P + +FF NTL R
Sbjct: 366 GNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRNTLSR 407
>gi|30693508|ref|NP_190730.2| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
gi|332645292|gb|AEE78813.1| NT domain of poly(A) polymerase and terminal uridylyl
transferase-containing protein [Arabidopsis thaliana]
Length = 755
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 167/333 (50%), Positives = 209/333 (62%), Gaps = 46/333 (13%)
Query: 44 ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
ELW+ EE +EI+ + P +VSE +R++VI Y+Q+LI G EV FGSVPLKTYLPD
Sbjct: 32 ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
GDIDLT G EE+LA V +LE E+ + S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
DISFNQ+ G+ LCFLEK
Sbjct: 152 DISFNQIGGICTLCFLEK------------------------------------------ 169
Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
VL++FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN G++LLL+ EFLK+C E+
Sbjct: 170 --VLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEM 227
Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
YSVP E F KHLNIVDPLK+ NNLGRSVSKGNF+RIR A +YGA++LG++
Sbjct: 228 YSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFL 287
Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
+ L KFF N L R+G G RPDV I
Sbjct: 288 QSDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 320
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)
Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
+DLSGDY+S N L +G+ + DY P++ SPP P Q+P+ ++W+V+ + + N
Sbjct: 517 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 574
Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
VPR F H + F +E K RGTGTY P R RP
Sbjct: 575 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 634
Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
+AR+P + N +S+ +++ P R+ E + S +M ++ S
Sbjct: 635 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 685
Query: 559 LEEFPLLPVTKNLP 572
L+ FP + N P
Sbjct: 686 LDSFPDTNGSTNHP 699
>gi|147867191|emb|CAN79954.1| hypothetical protein VITISV_027426 [Vitis vinifera]
Length = 1388
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 165/354 (46%), Positives = 218/354 (61%), Gaps = 51/354 (14%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
++D E LAEER ++++ IQP SE++R+ V +Y++ LI + +VFPFGSVPLKT
Sbjct: 29 ALDQERLSLAEERTKQLIACIQPNQPSEERREAVASYVKSLIMKCFSCKVFPFGSVPLKT 88
Query: 100 YLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSV 156
YLPDGDIDLT N+++ A V ILE E++ D+ F +K+VQY+ A+V
Sbjct: 89 YLPDGDIDLTAFSKSPNLKDTWANEVRDILEREEKSGDAEFRVKEVQYIQAEV------- 141
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
D LI + HLFKRS+ILIKAWCYYESRILGAH+GLIS
Sbjct: 142 ------------------------DHLISQKHLFKRSIILIKAWCYYESRILGAHHGLIS 177
Query: 217 TYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
TYALE +VL+ RFL++++ FDW+NYC+S+ GPV ISSLP++ A+
Sbjct: 178 TYALETLVLYIFRVFNNSFAGPLEVLYRFLEFFSKFDWENYCVSLWGPVPISSLPDVTAD 237
Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSK 319
D ELLLS FL C +Y+V EN F K+ N++DPL+ NNNLGRSVSK
Sbjct: 238 PPRKDSGELLLSKLFLDACSSVYAVLPVGQENPEQPFISKYFNVIDPLRTNNNLGRSVSK 297
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVP 373
GNF RIR A ++GAQRL +L P + + +FF+NT ER+GKG RPD P
Sbjct: 298 GNFFRIRSAFAFGAQRLARLLDCPKDNVIAEVNQFFMNTWERHGKGDRPDAPSP 351
>gi|6572083|emb|CAB63026.1| putative protein [Arabidopsis thaliana]
Length = 764
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 167/333 (50%), Positives = 209/333 (62%), Gaps = 46/333 (13%)
Query: 44 ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPD 103
ELW+ EE +EI+ + P +VSE +R++VI Y+Q+LI G EV FGSVPLKTYLPD
Sbjct: 32 ELWMRVEEATREIIEQVHPTLVSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPD 91
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPV 161
GDIDLT G EE+LA V +LE E+ + S F +KDVQ + A+VK+VKC VQNI V
Sbjct: 92 GDIDLTAFGGLYHEEELAAKVFAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVV 151
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
DISFNQ+ G+ LCFLEK
Sbjct: 152 DISFNQIGGICTLCFLEK------------------------------------------ 169
Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
VL++FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN G++LLL+ EFLK+C E+
Sbjct: 170 --VLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEM 227
Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
YSVP E F KHLNIVDPLK+ NNLGRSVSKGNF+RIR A +YGA++LG++
Sbjct: 228 YSVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFL 287
Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
+ L KFF N L R+G G RPDV I
Sbjct: 288 QSDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 320
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)
Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
+DLSGDY+S N L +G+ + DY P++ SPP P Q+P+ ++W+V+ + + N
Sbjct: 517 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 574
Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
VPR F H + F +E K RGTGTY P R RP
Sbjct: 575 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 634
Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
+AR+P + N +S+ +++ P R+ E + S +M ++ S
Sbjct: 635 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 685
Query: 559 LEEFPLLPVTKNLP 572
L+ FP + N P
Sbjct: 686 LDSFPDTNGSTNHP 699
>gi|218200261|gb|EEC82688.1| hypothetical protein OsI_27344 [Oryza sativa Indica Group]
Length = 1001
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 164/337 (48%), Positives = 223/337 (66%), Gaps = 27/337 (8%)
Query: 55 EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
E++ +QP +E+ R+ +I Y++ L G EVF FGSVPLKTYLPDGDID+T+LG+
Sbjct: 54 EVVLRVQPTEEAERTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113
Query: 115 NVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
+ V ILE E+Q+ + I +Q++ A+VK++KC + NI VDISFNQ+ G++
Sbjct: 114 APDSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGVT 173
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
LC LE VD +G DHLFKRS++LIKAWCY+ES ILGAH GLISTYALE++
Sbjct: 174 TLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIFH 233
Query: 224 --------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG--DELLLSPE 273
VL++FL+Y++ FDWD YCIS+NGPV +SSLP + TVE G DELL P
Sbjct: 234 KSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNL---TVEPSGIHDELLFGPN 290
Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGA 333
C + + + + + F K+LNI+DP+K +NNLGRSVSKG+F+RIR A S+GA
Sbjct: 291 --GSCDRLIVLKKDS-DGSNMNFRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGA 347
Query: 334 QRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
Q L +IL LP + + FF+NTL+ +G+G R DV
Sbjct: 348 QNLSQILMLPTDLIPTEIFGFFVNTLKSHGRGKRSDV 384
>gi|222637691|gb|EEE67823.1| hypothetical protein OsJ_25591 [Oryza sativa Japonica Group]
Length = 1001
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 163/337 (48%), Positives = 223/337 (66%), Gaps = 27/337 (8%)
Query: 55 EILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
E++ +QP +++ R+ +I Y++ L G EVF FGSVPLKTYLPDGDID+T+LG+
Sbjct: 54 EVVLRVQPTEEADRTRQGIIGYLKLLFGTALGCEVFAFGSVPLKTYLPDGDIDITILGNT 113
Query: 115 NVEEDLARFVCKILENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
+ V ILE E+Q+ + I +Q++ A+VK++KC + NI VDISFNQ+ G++
Sbjct: 114 APDSTFISEVRGILELEEQEDGADVAITGLQFIDAEVKLIKCVIDNIVVDISFNQIGGVT 173
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
LC LE VD +G DHLFKRS++LIKAWCY+ES ILGAH GLISTYALE++
Sbjct: 174 TLCLLELVDHEVGNDHLFKRSIMLIKAWCYHESHILGAHRGLISTYALEVLVLYIFNIFH 233
Query: 224 --------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG--DELLLSPE 273
VL++FL+Y++ FDWD YCIS+NGPV +SSLP + TVE G DELL P
Sbjct: 234 KSLHSPLEVLYKFLEYFSKFDWDKYCISLNGPVPLSSLPNL---TVEPSGIHDELLFGPN 290
Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGA 333
C + + + + + F K+LNI+DP+K +NNLGRSVSKG+F+RIR A S+GA
Sbjct: 291 --GSCDRLIVLKKDS-DGSNMNFRPKYLNIIDPIKSSNNLGRSVSKGSFYRIRGAFSFGA 347
Query: 334 QRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
Q L +IL LP + + FF+NTL+ +G+G R DV
Sbjct: 348 QNLSQILMLPTDLIPTEIFGFFVNTLKSHGRGKRSDV 384
>gi|147820621|emb|CAN67650.1| hypothetical protein VITISV_005081 [Vitis vinifera]
Length = 1572
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 157/301 (52%), Positives = 195/301 (64%), Gaps = 44/301 (14%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP 146
VF FGSVPLKTYLPDGDIDLT +QN+++ A
Sbjct: 227 RVFTFGSVPLKTYLPDGDIDLTAFSNNQNLKDTWAN------------------------ 262
Query: 147 AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
QVKI+KC V+NI VDISFNQ+ GL LCFLE+VD LI ++HLFKRS+ILIKAWCYYESR
Sbjct: 263 -QVKIIKCLVENIVVDISFNQLGGLCTLCFLEEVDHLINQNHLFKRSIILIKAWCYYESR 321
Query: 207 ILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVA 249
ILGAH+GLISTYALE +VL+ RFL+++++FDWDN+C+S+ GPV
Sbjct: 322 ILGAHHGLISTYALETLVLYIFHVFNNSFTGPLEVLYRFLEFFSSFDWDNFCVSLWGPVP 381
Query: 250 ISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKD 309
ISSLP++ AE D ELLLS FL C +Y+V E G F KH N++DPL+
Sbjct: 382 ISSLPDVTAEPPRQDSGELLLSKLFLDACSSVYAVFPHGQEKQGQSFISKHFNVIDPLRV 441
Query: 310 NNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPD 369
NNNLGRSVSKGNF RIR A ++GA+RL +L P + + + F+NT ER+G G RPD
Sbjct: 442 NNNLGRSVSKGNFFRIRSAFAFGAKRLARLLD-PKENIIFEVNQLFMNTWERHGSGHRPD 500
Query: 370 V 370
Sbjct: 501 T 501
>gi|326521958|dbj|BAK04107.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1031
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 150/335 (44%), Positives = 227/335 (67%), Gaps = 22/335 (6%)
Query: 56 ILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQN 115
+L + P +E++R ++I+Y + LI +G EVF FGSVPLKTYLPDGD+D+T+L + N
Sbjct: 157 VLLRLHPTEEAERRRHKIIDYAKNLIGTTFGCEVFAFGSVPLKTYLPDGDVDITILTNVN 216
Query: 116 VEEDLARFVCKIL--ENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSA 173
++ + + VC +L E ++ + F +K++Q + A+VKI+KC + N+ +DISFNQ+ G+S
Sbjct: 217 LDNNFVQDVCCLLAAEQSNEAAEFALKEIQVINAKVKIIKCVIDNLVMDISFNQVGGVST 276
Query: 174 LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV--------- 224
LCFLE ++ IGKDHLFKRS+ILIKAWCY+E I G+++ L+STYALE+++
Sbjct: 277 LCFLEMANKEIGKDHLFKRSIILIKAWCYHEGSIHGSNHWLMSTYALEVLILYIFNLFHT 336
Query: 225 --------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
L++FL+YY+ FDWDN C+++NGPV +SSL A ++ +ELLLS E L+
Sbjct: 337 VLHGPLQALYKFLEYYSKFDWDNQCLTLNGPVPLSSLRNYTAGPTGSN-EELLLSKEPLE 395
Query: 277 -KCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQR 335
R ++ +P + + G EF +K+LNI+DPLK NNLG S+S+ N IR A + GA++
Sbjct: 396 PSLRRLFDLPAGS-DGRGPEFRLKYLNIIDPLKGGNNLGTSISEANSRVIRDAFAAGAEK 454
Query: 336 LGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
LG+IL LP + + FF +TL ++G+G R D+
Sbjct: 455 LGQILKLPCELIAEQVYVFFTHTLGKHGRGERQDL 489
>gi|357463851|ref|XP_003602207.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
gi|355491255|gb|AES72458.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
Length = 768
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 157/348 (45%), Positives = 225/348 (64%), Gaps = 22/348 (6%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL 97
P ++ W E+R E+L ++P SE R +++YI+ LI + ++VF FGSVPL
Sbjct: 19 PKVLERSKWSQVEDRTIELLQFLEPNPKSETLRNNIVSYIKGLIISHVPVKVFEFGSVPL 78
Query: 98 KTYLPDGDIDLTVLGHQNV-EEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKC 154
KTYL DGDIDLT+ G+ + E + +ILE+E ++ S F +K+VQ V A+VKI+KC
Sbjct: 79 KTYLRDGDIDLTIFGNNELFPEIFIPHIQQILESEMNNEFSKFRVKEVQLVNAEVKIIKC 138
Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
V+ +DISFNQ++GL +LCFL++VD LI ++H+FKRSVILIKAWCY+ESR+LG+ GL
Sbjct: 139 LVEKFVIDISFNQLSGLCSLCFLDEVDYLISRNHIFKRSVILIKAWCYHESRLLGSKSGL 198
Query: 215 ISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIV 257
STYALE++VL+ RFL++++ FDW NYCIS++GPV + SLP +
Sbjct: 199 FSTYALEILVLYLFNLYNNEFVGPLEVLFRFLEFFSKFDWGNYCISLSGPVPLDSLPNMT 258
Query: 258 AETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSV 317
A+ D +LLL+ FL + Y +N F KH+NI+DPL++NNNLG S+
Sbjct: 259 ADCPRKDRQDLLLTESFLIASKFCYG--WRNQKNREKHFVSKHINIIDPLQENNNLGHSI 316
Query: 318 SKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKG 365
S+GNF RI+ A++YGA+++ IL L + FF NT R+G G
Sbjct: 317 SRGNFFRIKSAIAYGAEQMMRILDCTDEYLISEFDHFFENTWNRHGNG 364
>gi|357491471|ref|XP_003616023.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
gi|355517358|gb|AES98981.1| Poly(A) RNA polymerase cid14 [Medicago truncatula]
Length = 387
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 168/354 (47%), Positives = 215/354 (60%), Gaps = 83/354 (23%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI---------- 87
P S+ E AE+ EIL IQP + ++++R+EV++Y+QRLI YG
Sbjct: 33 PSSVTDEASSAAEQTTAEILRRIQPTLAADRRRREVVDYVQRLIR--YGARCEKLLPNVW 90
Query: 88 -------------EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD 134
+VFP+GSVPLKTYLPDGDIDLT L QN+E+ L V +L E+ +
Sbjct: 91 RKLDFEVRIFRIGKVFPYGSVPLKTYLPDGDIDLTALSPQNIEDGLVSDVHAVLRGEENN 150
Query: 135 SV--FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKR 192
+E+KDV+++ A+ NI VDISFNQ+ GLS LCFLEKVDRL+ KDH+FKR
Sbjct: 151 DAAEYEVKDVRFIDAE---------NIVVDISFNQLGGLSTLCFLEKVDRLVAKDHIFKR 201
Query: 193 SVILIKAWCYYESRILGAHYGLISTYALE------------------------------M 222
S+ILIKAWCYYESRILGAH+GLISTYALE M
Sbjct: 202 SIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHRFHVSLDGPLAEKERKRNLNHIM 261
Query: 223 MVLH----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGD 266
+V+H RFLDY++ FDWDNYC+S+ GPVA SS P++VAE +EN G+
Sbjct: 262 LVMHPFNKHFMHPALFQVLYRFLDYFSKFDWDNYCVSLKGPVAKSSPPDVVAEALENGGN 321
Query: 267 ELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG 320
LL+ EF++ C E +SVP L+ F KHLNI+DPLK+NNNLGRSV+KG
Sbjct: 322 T-LLTDEFIRSCVESFSVPPRGLDLNLRAFPHKHLNIIDPLKENNNLGRSVNKG 374
>gi|168037604|ref|XP_001771293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677382|gb|EDQ63853.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 2035
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 162/392 (41%), Positives = 227/392 (57%), Gaps = 62/392 (15%)
Query: 44 ELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEV-------------- 89
+ W AE + E++ +++P +SE++R V +++RLI + EV
Sbjct: 585 DWWTRAEGQTAELIDSLKPTRLSEERRTAVTGFVERLIRDRFECEVSALPHELNGFIVRS 644
Query: 90 ----------FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILEN--EDQDSVF 137
FGSVPLKTYLPDGDIDL + +++E A+ V K L+ +D D+ F
Sbjct: 645 SAGAVRYSAVIRFGSVPLKTYLPDGDIDLYIFARNDLKETWAQDVLKALKQAEDDADAEF 704
Query: 138 EIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILI 197
+K+VQY+ A+VK++KC V+NI VDISFNQ+ GLS LCFLE+VD +G +HLFKRSVIL+
Sbjct: 705 RVKEVQYIQAEVKLIKCLVENIVVDISFNQIGGLSTLCFLERVDEEVGLNHLFKRSVILV 764
Query: 198 KAWCYYESRILGAHYGLISTYALEMM-------------------VLHRFLDYYNTFDWD 238
KAWCYYESRILGAH+GLIST+ALE + VL+ FL Y+ FDWD
Sbjct: 765 KAWCYYESRILGAHHGLISTFALETLVLYIFHVFHSMRSLHGPLEVLYLFLTYFCNFDWD 824
Query: 239 NYCISINGPVAISSLPEIVAETVEND-------------GDELLLSPEFLKKCREIYS-V 284
YC+SI GPV + +P+ +E + D G +L S EF+++C YS V
Sbjct: 825 QYCLSIWGPVPLDHIPKNSSELSQKDGGWRTVARSPWEVGGKLYFSEEFIEECINRYSDV 884
Query: 285 PITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPG 344
+ + G F K+LN++DP++ NNLGRSV+ G+F RIR A GA+ LGE+ P
Sbjct: 885 RAGSESSQGRIFNPKYLNVLDPIRHTNNLGRSVNVGSFKRIRSAFGLGARTLGEVFECPK 944
Query: 345 HCLGMGLEKFFINTLERNGK---GLRPDVQVP 373
+ + FF T + G+ RPD +P
Sbjct: 945 DQITEKFKSFFSCTFKSLGRYRSAGRPDSGIP 976
>gi|168035287|ref|XP_001770142.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678668|gb|EDQ65124.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1504
Score = 281 bits (720), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 173/444 (38%), Positives = 248/444 (55%), Gaps = 63/444 (14%)
Query: 92 FGSVPLKTYLPDGDIDLTVLG-HQNVEEDLARFVCKILEN--EDQDSVFEIKDVQYVPAQ 148
FGSVPLKTYLPDGDIDL+ +V+ + L+ ++ +S F +K+VQ + A+
Sbjct: 144 FGSVPLKTYLPDGDIDLSAFTPSPDVKRTWIQDTYNALQKAKDNPNSEFRVKEVQLIHAE 203
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VKIVKC V+NI VD+SF+Q+ GL LCFL +VD+LIG+DHLFKRS+IL+KAWCYYESRIL
Sbjct: 204 VKIVKCFVENILVDVSFDQLGGLGTLCFLVEVDKLIGEDHLFKRSIILVKAWCYYESRIL 263
Query: 209 GAHYGLISTYALE-----------------MMVLHRFLDYYNTFDWDNYCISINGPVAIS 251
GAH GL+STYA+E + VL+ FL+++++FDWDNYC+S++ P+ +
Sbjct: 264 GAHCGLMSTYAVEALVLYIFDKFHASLRGPLQVLYLFLEFFSSFDWDNYCVSLSSPIPLK 323
Query: 252 SLP---------EIVAETVENDGDELLLSPEFLKKCREIYS-VPITALENTGHEFAIKHL 301
SL + +A + DG EL + EFL C Y VP++ + + ++F +K L
Sbjct: 324 SLSKDSEKLEDLQKLALSTRRDGGELFFTKEFLVACETEYGVVPVSQITKS-NKFTVKCL 382
Query: 302 NIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFIN-TLE 360
NI DPL+ +NNLGRSV++GNF RIR A +GA+ L +L+ + LE+FF N L
Sbjct: 383 NISDPLRSSNNLGRSVNQGNFARIRRAFDFGARTLRRVLSCTEEDVPAELEQFFKNCNLR 442
Query: 361 RNGKGLRPDVQVP----------IHAFGSGKSEA-----ADLSGDYDSYYNGLLYGQCYH 405
+G RPDV P G K EA +DL+ + NG + +
Sbjct: 443 LHGGYQRPDVASPRPCRTKASNAAQKIGCAKREAMGNERSDLTSHSTPFQNGNQFLRTEP 502
Query: 406 DYTLPVTA-QFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYAS 464
++ LPV F + P S+ ++LC +Q G+ V QW HP S
Sbjct: 503 NHALPVEQIHFDSNALPG---GESSQEILCNDMQCTGDSV-QW-----------THPRCS 547
Query: 465 QVRASTFSTDEGEKSRGTGTYIPE 488
++ + G + GT + E
Sbjct: 548 SLKTTPCREKNGSQFVGTPKSVAE 571
>gi|358347363|ref|XP_003637727.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
gi|355503662|gb|AES84865.1| hypothetical protein MTR_100s0017, partial [Medicago truncatula]
Length = 827
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 133/239 (55%), Positives = 167/239 (69%), Gaps = 17/239 (7%)
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VK+VKC V+NI VDISFNQ+ GL LCFLE+VD LI +HLFKRS+ILIKAWCYYESRIL
Sbjct: 109 VKLVKCLVENIVVDISFNQLGGLCTLCFLEEVDGLINHNHLFKRSIILIKAWCYYESRIL 168
Query: 209 GAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAIS 251
GAH+GLISTYALE +VL+ RFL++++ FDWDN+C+S+ GPV IS
Sbjct: 169 GAHHGLISTYALETLVLYIFHVFNNSFAGPLEVLYRFLEFFSKFDWDNFCVSLWGPVPIS 228
Query: 252 SLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
SLP++ AE D ELLL FL C +Y+V EN G F KH N++DPL+ NN
Sbjct: 229 SLPDVTAEPPRKDAGELLLHKSFLDACSTVYAVFPGGPENQGQPFVSKHFNVIDPLRVNN 288
Query: 312 NLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
NLGRSVSKGNF RIR A ++GA++L +L P L + + +FF+NT +R+G G RPD
Sbjct: 289 NLGRSVSKGNFFRIRSAFAFGAKKLARLLDCPKDELFLEVNQFFLNTWDRHGSGQRPDA 347
>gi|147817122|emb|CAN62161.1| hypothetical protein VITISV_017634 [Vitis vinifera]
Length = 1147
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 129/211 (61%), Positives = 155/211 (73%), Gaps = 17/211 (8%)
Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH------------ 226
K+DRLIGKDHLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+
Sbjct: 405 KIDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSLLNGP 464
Query: 227 -----RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
+FLDY++ FDWDNYC+S+NGPV ISSLPE++AET EN G + LL + L+ C +
Sbjct: 465 LAVLYKFLDYFSKFDWDNYCVSLNGPVRISSLPEMIAETPENVGADPLLGNDXLRDCLDR 524
Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
+SVP LE F KH NIVDPLK+NNNLGRSVSKGNF+RIR A +YGA++LG IL
Sbjct: 525 FSVPSRGLETNSRTFVQKHFNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGRILL 584
Query: 342 LPGHCLGMGLEKFFINTLERNGKGLRPDVQV 372
P + L KFF NTLER+G+G RPDV +
Sbjct: 585 QPEDKISEELCKFFTNTLERHGRGQRPDVDL 615
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/117 (56%), Positives = 79/117 (67%), Gaps = 10/117 (8%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSV--FEIKDVQYVP 146
VFPFGSVPLKTYLPDGDIDLT G VE+ LA V +LE EDQ+ F +KDVQ +
Sbjct: 186 VFPFGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIH 245
Query: 147 AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKV--------DRLIGKDHLFKRSVI 195
A+VK+VKC VQNI VDISFNQ+ GL LCFLE+ +R + + L+KR I
Sbjct: 246 AEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQQKAIWDGVEERFLKRLSLWKRQYI 302
Score = 76.6 bits (187), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 57/113 (50%), Gaps = 4/113 (3%)
Query: 379 SGKSEA----ADLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
SG EA ADLSGDYDS++N L YG +DY A P + PSQ S ++WD +
Sbjct: 845 SGNPEAPNSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQ 904
Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIP 487
Q + N+ Q +PR PF + + F +E K RGTGTY P
Sbjct: 905 QSAHIRRNIFPQITANGIIPRPPFYPMNPPMISGTGFGVEEMPKPRGTGTYFP 957
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 37/57 (64%), Gaps = 2/57 (3%)
Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE 88
S PN P +I A W AE +QEI+C +QP VSE++RKEV++Y+Q LI G E
Sbjct: 27 SHPN--PPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCE 81
>gi|297745772|emb|CBI15828.3| unnamed protein product [Vitis vinifera]
Length = 929
Score = 261 bits (668), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 132/241 (54%), Positives = 166/241 (68%), Gaps = 24/241 (9%)
Query: 155 SVQNIPVDISFNQMAGLSALCFLE-------KVDRLIGKDHLFKRSVILIKAWCYYESRI 207
SVQN + + M G+ LC VDRLIGKDHLFKRS+ILIK+WCYYESRI
Sbjct: 219 SVQNHKRENPWFHMYGVQFLCNYSFKPLLSVIVDRLIGKDHLFKRSIILIKSWCYYESRI 278
Query: 208 LGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAI 250
LGAH+GLISTYALE++VL+ RFLDY++ FDWDNYCIS+NGPV
Sbjct: 279 LGAHHGLISTYALEILVLYIFHLFHLSLDGPLAVLYRFLDYFSKFDWDNYCISLNGPVCK 338
Query: 251 SSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDN 310
SSLP+IVAE EN D+LLLS EFL+ C +++SVP LE F +KHLNI+DPL++N
Sbjct: 339 SSLPDIVAELPENGQDDLLLSEEFLRNCVDMFSVPFRGLETNSRTFPLKHLNIIDPLREN 398
Query: 311 NNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
NNLGRSV+KGNF+RIR A YG+ +LG+IL+LP + L+ FF +TLER+ ++
Sbjct: 399 NNLGRSVNKGNFYRIRSAFKYGSHKLGQILSLPREVIQDELKNFFASTLERHRSKYMAEI 458
Query: 371 Q 371
Q
Sbjct: 459 Q 459
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 78/141 (55%), Positives = 104/141 (73%), Gaps = 2/141 (1%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
I + W AE QEI+ +QP + S ++R+EVI+Y+QRLI G EVFP+GSVPLKTY
Sbjct: 35 IAGDSWAAAERATQEIVAKMQPTLGSMRERQEVIDYVQRLIGCCLGCEVFPYGSVPLKTY 94
Query: 101 LPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQN 158
L DGDIDLT L NVEE LA V +L+ E+Q ++ FE+KD+Q++ A+VK+VKC V++
Sbjct: 95 LLDGDIDLTALCSSNVEEALASDVHAVLKGEEQNENAEFEVKDIQFITAEVKLVKCLVKD 154
Query: 159 IPVDISFNQMAGLSALCFLEK 179
I +DISFNQ+ GLS LCFLE+
Sbjct: 155 IVIDISFNQLGGLSTLCFLEQ 175
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 71/137 (51%), Gaps = 9/137 (6%)
Query: 386 DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVY 445
DLSGDYDS+ L YGQC + + LP SPP SPSQ+ + WD + Q++Q NL
Sbjct: 672 DLSGDYDSHIRSLQYGQCCYGHALPPPLLPSPPLSPSQLQINTPWDKVRQHLQFTQNLHS 731
Query: 446 QWGTE--VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRA 503
Q + + P HP S + F ++ +K RGTGTY P + P +RPV
Sbjct: 732 QMDSNGVILGNHFPVKHPARS---ITAFGLEDKQKPRGTGTYFPNMSHLP--NRDRPVGQ 786
Query: 504 RNPESLNHVPVSKLRRK 520
R ++L S+L R+
Sbjct: 787 RRNQALE--SHSQLHRR 801
>gi|110738268|dbj|BAF01063.1| hypothetical protein [Arabidopsis thaliana]
Length = 660
Score = 258 bits (658), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/212 (59%), Positives = 152/212 (71%), Gaps = 17/212 (8%)
Query: 180 VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH------------- 226
+D LIGKDHLFKRS+ILIKAWCYYESRILGA +GLISTYALE +VL+
Sbjct: 1 IDHLIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPL 60
Query: 227 ----RFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIY 282
+FLDY++ FDWD+YCIS+NGPV +SSLP+IV ET EN G++LLL+ EFLK+C E+Y
Sbjct: 61 AVLYKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEMY 120
Query: 283 SVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTL 342
SVP E F KHLNIVDPLK+ NNLGRSVSKGNF+RIR A +YGA++LG++
Sbjct: 121 SVPSRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFLQ 180
Query: 343 PGHCLGMGLEKFFINTLERNGKGLRPDVQVPI 374
+ L KFF N L R+G G RPDV I
Sbjct: 181 SDEAISSELRKFFSNMLLRHGSGQRPDVHDAI 212
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 88/194 (45%), Gaps = 17/194 (8%)
Query: 385 ADLSGDYDSYYNGLLYGQCYHDYTL--PVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGN 442
+DLSGDY+S N L +G+ + DY P++ SPP P Q+P+ ++W+V+ + + N
Sbjct: 409 SDLSGDYESQLNSLRFGRWWFDYVQNGPMSP-LSPPGLP-QLPNNNSWEVMRHALPFRRN 466
Query: 443 LVYQWGTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPE----RTRPPPIEIN 498
VPR F H + F +E K RGTGTY P R RP
Sbjct: 467 APTPVNANGVVPRQVFFHVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRDRPFSPRGR 526
Query: 499 RPVRARNPESLNHVPVSKLRRKIDKAERLPSRDNLESGCSLNVENGCSLNMENSCSLILS 558
+AR+P + N +S+ +++ P R+ E + S +M ++ S
Sbjct: 527 NSHQARSPRN-NGRSMSQAHSEMN----FPDRNTRERQLHYPNQTNGSCDMSHTD----S 577
Query: 559 LEEFPLLPVTKNLP 572
L+ FP + N P
Sbjct: 578 LDSFPDTNGSTNHP 591
>gi|357116041|ref|XP_003559793.1| PREDICTED: uncharacterized protein LOC100830879 [Brachypodium
distachyon]
Length = 899
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 194/313 (61%), Gaps = 37/313 (11%)
Query: 41 IDAELWLLAEE----------RIQE-----ILCTIQPAIVSEKKRKEVINYIQRLINGYY 85
+D +LW A E R+ E +L + P +E++R++V ++ +RLI +
Sbjct: 16 VDGKLWEPATEKTAIQVPEQMRVAEAAAAGVLRCLLPTEEAERRRRQVTDHARRLIGTNF 75
Query: 86 GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQ 143
G +V +GSVPLKTYLPDGDID+T+L H+ ++ + V +L E++ D+ F ++ +
Sbjct: 76 GCQVLTYGSVPLKTYLPDGDIDVTILTHKPLDSTIIDDVRNLLNAEEKNTDAEFVLESRR 135
Query: 144 YVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYY 203
YV AQVK+ KC++ NI VDISFNQ+ G+S LCFLE VD +GKDHLFKRS+ILIKAWCY
Sbjct: 136 YVDAQVKVFKCNIANIDVDISFNQIGGVSTLCFLELVDTEVGKDHLFKRSIILIKAWCYN 195
Query: 204 ESRILGAHYGLISTYALEMMV-----------------LHRFLDYYNTFDWDNYCISING 246
E+RI G+ L+STYALE+++ L+ FL+YY+ FDW YC++++G
Sbjct: 196 EARIQGSDQWLLSTYALEILILYIFNMFHNSLHGPFEALYMFLEYYSKFDWGKYCVTLDG 255
Query: 247 PVAISSLPEIVAE-TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVD 305
PV +SSL AE V N DELLL E L + V + EF K LNI+D
Sbjct: 256 PVPLSSLANFTAEPAVAN--DELLLGKESLSASSDRLLVLPKGSDRHDPEFRPKILNIID 313
Query: 306 PLKDNNNLGRSVS 318
PLK +NNLGRS+S
Sbjct: 314 PLKGDNNLGRSIS 326
>gi|384253068|gb|EIE26543.1| hypothetical protein COCSUDRAFT_39611 [Coccomyxa subellipsoidea
C-169]
Length = 1155
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/293 (46%), Positives = 182/293 (62%), Gaps = 24/293 (8%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARF-VCKILENEDQDSV--FEIKDV 142
+E + FGSVPLKTYLPDGDIDL V G D+ + + +LE E ++++ +KDV
Sbjct: 4 VEAYMFGSVPLKTYLPDGDIDLAVFQGKGPRLRDVWTYELSALLEAEGRNALNPHRVKDV 63
Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
Q + A+VK++KC V NI VDISF+ + GL + FLE +DR IGK HLFKRSVIL+KAWCY
Sbjct: 64 QIINAEVKLLKCLVDNIVVDISFDTLGGLCTVAFLESIDRHIGKQHLFKRSVILVKAWCY 123
Query: 203 YESRILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDWDNYCISIN 245
YESR+LGAH+GL+STYALE MVL+ +FL ++ FDWD + +S+
Sbjct: 124 YESRLLGAHHGLLSTYALETMVLYIFNMYHHELQSPLKVLRKFLVVFSKFDWDGHALSLQ 183
Query: 246 GPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVD 305
GP+ +SS P+ E V LL + LK E+YS P+ + G F IK++NI+D
Sbjct: 184 GPIPLSSFPDPQVEPVAGAEGGALLRGDVLKTMLEMYS-PVQ--QGPGKAFTIKNMNIMD 240
Query: 306 PLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINT 358
PL NNLGRSV+K + RIR AL++G L I G ++ FF NT
Sbjct: 241 PLLPTNNLGRSVNKASKARIRKALAHGCHMLDSIFDKVGQEATEAVDGFFRNT 293
>gi|159471748|ref|XP_001694018.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277185|gb|EDP02954.1| predicted protein [Chlamydomonas reinhardtii]
Length = 633
Score = 244 bits (624), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 150/395 (37%), Positives = 218/395 (55%), Gaps = 48/395 (12%)
Query: 27 SSSYSSPPNLYPFSIDAELWLLAEE-------RIQEILCTIQPAIVSEKKRKEVINYIQR 79
S S+S P YP S A L L+ E R ++ I+P +S ++R + ++ +
Sbjct: 2 SQSFSQP---YP-SAPAALRKLSPEFGNDLLSRTDTLISRIRPTTLSLQRRFVITEHVTQ 57
Query: 80 LINGYYG---IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL--------ARFVCKIL 128
L+ + + PFGSVPLKTYLPDGDIDL++ + + + L A + L
Sbjct: 58 LVKRCFAPHDVTAVPFGSVPLKTYLPDGDIDLSIYSYSSRAQSLKDQLRDTWATTLQLCL 117
Query: 129 ENEDQD--SVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
E+E + + F++ +VQ + A+VK++KC V NI VDISF Q+ GL+ FLE VD + K
Sbjct: 118 EDEANNPHAAFKVANVQVIHAEVKLLKCLVDNIVVDISFFQIGGLNTYNFLEDVDAFVDK 177
Query: 187 -----DHLFKRSVILIKAWCYYESRILGAHYGLISTYALE-----------------MMV 224
HLFK S+IL+K WCYYESR+LGAH+GLISTYALE + V
Sbjct: 178 AITARKHLFKDSIILVKGWCYYESRVLGAHHGLISTYALETLVLYVINLYHRELSNPLQV 237
Query: 225 LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSV 284
L++FL + FDW+ YC+++ GP+ ++S P V ET E E LL+ F+ + Y+
Sbjct: 238 LYKFLVECSGFDWERYCLTLQGPIPLASFPNPVVETPEPLQREPLLTEHFMTRAYNKYTA 297
Query: 285 P-ITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLP 343
P + A+ FAIK LN++DP+ NNNLGRSVSK ++ RIR A +GA+ L I
Sbjct: 298 PQVAAMGGEVKPFAIKQLNVMDPILPNNNLGRSVSKASYLRIRRAFEHGARMLAAIAEQT 357
Query: 344 GHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFG 378
+ + F N + RP+ + P+ A G
Sbjct: 358 KELGAVVASRNFDNFFGKVWNAQRPNRK-PLAATG 391
>gi|255083767|ref|XP_002508458.1| predicted protein [Micromonas sp. RCC299]
gi|226523735|gb|ACO69716.1| predicted protein [Micromonas sp. RCC299]
Length = 1269
Score = 239 bits (611), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 137/354 (38%), Positives = 206/354 (58%), Gaps = 47/354 (13%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE---VFPFGSVPLKTYLPDGDIDLT 109
+ E++ ++P S+++R+ V +I L++G + E V FGSVPL+TYLPDGDID+
Sbjct: 28 VNELIDVLRPTEQSDRRRRGVFRHIASLVDGCFAGENVLVTAFGSVPLRTYLPDGDIDVC 87
Query: 110 VLG-HQNVEED-----LARFVCKILENEDQDSV--------FEIKDVQYVPAQVKIVKCS 155
+LG H+ + D L V + + S+ F + ++ + A+VK++K
Sbjct: 88 LLGPHELLSRDDWTVRLRAHVERAEAAAAEASIELGSPVAEFAVSEIHIIHAEVKLMKLI 147
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
+ VD+S NQ GL+AL FLE+V+ IGK +FKRS++LIKAW +YE R+LGAH+ LI
Sbjct: 148 CDGVVVDVSANQFGGLAALGFLEEVNAFIGKGEIFKRSIVLIKAWGFYEGRLLGAHHALI 207
Query: 216 STYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEI-- 256
STYALE + VLH+FL ++ FDWD + +S++GPV + L ++
Sbjct: 208 STYALETLVLYILNRFHKELSTPLEVLHKFLVFFADFDWDKFAVSVHGPVPLEDLHKVTG 267
Query: 257 -VAETVENDGDELLLSPEFLKKCREIY---SVPIT---ALENTGHEFAIKHLNIVDPLKD 309
+ + E + LL+P+F+ + + Y SV ++T A K+LN+VDPL
Sbjct: 268 PIGKRPEVHAEGALLTPDFMWRMMDKYGNESVSAKLGGGADSTPRPMARKYLNVVDPLLS 327
Query: 310 NNNLGRSVSKGNFHRIRCALSYGAQRLGEIL--TLPGHCLGM--GLEKFFINTL 359
+NNLGRSVS+GN RIR AL+ GAQRL + + G C G LE+FF NT+
Sbjct: 328 SNNLGRSVSQGNAKRIRKALALGAQRLTALRESSTGGECFGAVRMLEQFFGNTM 381
>gi|302125450|emb|CBI35537.3| unnamed protein product [Vitis vinifera]
Length = 398
Score = 238 bits (606), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 126/206 (61%), Positives = 153/206 (74%), Gaps = 20/206 (9%)
Query: 80 LINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD--SVF 137
LI G EVFP+GSVPLK YL DGDIDLTVL NVEE LA V +L+ E Q+ + F
Sbjct: 55 LIRCCLGCEVFPYGSVPLKIYLLDGDIDLTVLCSSNVEEALASDVHAVLKGERQNENAEF 114
Query: 138 EIKDVQY-VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
E+K+VQ+ + +VK VKC V++I +DISFNQ+ GLS LCFL++VDRLIGKDHLFKRS+IL
Sbjct: 115 EVKNVQFNIIVEVKPVKCLVKDIVIDISFNQLGGLSTLCFLKQVDRLIGKDHLFKRSIIL 174
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL-----------------HRFLDYYNTFDWDN 239
IK+ CYYESRILGA++GLISTYALE++VL +RFLDY++ FDWDN
Sbjct: 175 IKSRCYYESRILGAYHGLISTYALEILVLYIFHLFHSSLDGPLAVGYRFLDYFSKFDWDN 234
Query: 240 YCISINGPVAISSLPEIVAETVENDG 265
YCIS+NG V SSLP+IVAE EN G
Sbjct: 235 YCISLNGSVCKSSLPDIVAELPENGG 260
>gi|145341816|ref|XP_001415999.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576222|gb|ABO94291.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 904
Score = 236 bits (602), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 126/334 (37%), Positives = 191/334 (57%), Gaps = 32/334 (9%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE---VFPFGSVPLKTYLPDGDI 106
E E++ +++P +SE +R+ V +I++L +G V +GSVPL+ YLPDGDI
Sbjct: 36 ETLTNELVASLRPTEMSEIRRRAVFEHIKQLAQECFGTAHTLVSAYGSVPLRAYLPDGDI 95
Query: 107 DLTVLGHQNVEEDLARFVCKILEN-------EDQDSVFEIKDVQYVPAQVKIVKCSVQNI 159
D+ +LG V D A++ K ++ D F + +V + A+V+++KC V +
Sbjct: 96 DVCLLGDHRVI-DKAQWTTKFRKHIEKAEAEADPPHEFAVSEVSVINAEVRLMKCIVDGM 154
Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
VD+S NQ GL++L FLE+++ IG+D LF RS+IL+KAW +YE RILGAH+ LISTYA
Sbjct: 155 MVDVSANQFGGLASLGFLEEMNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALISTYA 214
Query: 220 LEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE 262
LE + VLH+ L + FDW+ Y ++I+GPVAI + E +E
Sbjct: 215 LETLVLYIINKYHADLTCPLSVLHKLLSVFAEFDWEGYALTIHGPVAIEGIATPPDECLE 274
Query: 263 NDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNF 322
L++ EF++ YS + +K++NI+DPL NNNLGRSVS GN+
Sbjct: 275 GG----LITEEFMRTMLSTYSCEFMRAAASSAPVTVKYMNIIDPLLPNNNLGRSVSCGNY 330
Query: 323 HRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFI 356
R+R AL GAQRL ++ +G+ + F+
Sbjct: 331 RRVRAALKLGAQRLDALMASSAARDDVGVLRGFV 364
>gi|302835555|ref|XP_002949339.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
nagariensis]
gi|300265641|gb|EFJ49832.1| hypothetical protein VOLCADRAFT_117152 [Volvox carteri f.
nagariensis]
Length = 3433
Score = 234 bits (598), Expect = 8e-59, Method: Composition-based stats.
Identities = 146/355 (41%), Positives = 206/355 (58%), Gaps = 45/355 (12%)
Query: 25 SSSSSYSSPP----NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRL 80
S + YSS P NL P +D LLA R ++ I+P +S ++R + ++ +
Sbjct: 7 SPNQPYSSNPRSLKNLQPKFVDE---LLA--RTDTLISRIRPTGLSLQRRWVITEHVTSI 61
Query: 81 INGYYG---IEVFPFGSVPLKTYLPDGDIDLTVLGH----QNVEEDLARF------VCKI 127
+ + + PFGSVPLKTYLPDGDIDL++ Q ++E L VC
Sbjct: 62 VKRCFAPHDVTAIPFGSVPLKTYLPDGDIDLSIYSESPRAQALKEALRDTWATQLQVCLE 121
Query: 128 LENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIG-- 185
E + +VF + +VQ + A+VK++KC V NI VDISF Q+ GL+ FLE VDR +
Sbjct: 122 EEANNPTAVFRVANVQVIHAEVKLLKCLVDNIVVDISFFQVGGLNTYNFLEDVDRFVDQC 181
Query: 186 ---KDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE-----------------MMVL 225
+ HLFK S+IL+K WCYYESR+LGAH+GLISTYALE + VL
Sbjct: 182 IPVRKHLFKDSIILVKGWCYYESRVLGAHHGLISTYALETLVLYVINLYHRELTNPLQVL 241
Query: 226 HRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVP 285
++FL + FDW+NYC+S+ GP+ +SS P+ V ET E + LL+ +F+ + Y+ P
Sbjct: 242 YKFLVECSCFDWENYCLSLEGPIPLSSFPKPVVETPEALQRDALLTKDFMARAYFKYTEP 301
Query: 286 -ITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEI 339
+ A FAIK LN++DP+ NNLGRSVSK ++ RIR A +GA+ L +I
Sbjct: 302 QLRAQGGEPKPFAIKQLNVMDPILPGNNLGRSVSKASYLRIRRAFEHGARMLADI 356
>gi|290976573|ref|XP_002671014.1| predicted protein [Naegleria gruberi]
gi|284084579|gb|EFC38270.1| predicted protein [Naegleria gruberi]
Length = 763
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 141/378 (37%), Positives = 210/378 (55%), Gaps = 77/378 (20%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV-- 110
IQ++L IQP+ SEK RKEV + I ++ ++ + +GSV KTYLPDGDIDL+V
Sbjct: 169 IQQLLYRIQPSSESEKHRKEVFDIIAAVLE-LANLKTYLYGSVAFKTYLPDGDIDLSVFV 227
Query: 111 -------LGHQNVEEDLAR----------FVCKILENE---------------------- 131
L QNV L+ +V +L
Sbjct: 228 SNEEYLELSSQNVNNLLSHQPQVNDSTISYVHNVLLKNMHIGLKQQLADPSIPWYNKARS 287
Query: 132 -----DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
++++ I+D+ ++ A+VK++KC+V NIP+D+S Q+ GLS LCFL +VD I
Sbjct: 288 LFSEIQRNNLAYIEDMTFINAEVKLIKCTVNNIPIDMSSGQIGGLSTLCFLHEVDDKIAD 347
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV---------------LHRFLDY 231
+HLFKRS+IL+K+W YYESRILG+H+GL+STY L +++ L+RFL+Y
Sbjct: 348 NHLFKRSIILMKSWSYYESRILGSHHGLVSTYGLTVLLMYMFRLYKIETPLQALYRFLNY 407
Query: 232 YNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL-------LSPEFLKKCREIYSV 284
Y+TFDW N+ ISI GP+ + ++ + +++E+ E L L+ FL+ C+ Y
Sbjct: 408 YSTFDWTNFGISIYGPIPLGAIND--HKSIEDFYYENLPPERHDSLTSSFLQSCKSKYGT 465
Query: 285 PITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPG 344
++ F IK+LNIVDPL+D NNLGRSV+ NF RIR A+ G++ + +IL
Sbjct: 466 V-----DSSKTFTIKNLNIVDPLRDFNNLGRSVNYNNFLRIRRAIKKGSKTITDILISND 520
Query: 345 HCLGMGLEK-FFINTLER 361
+ K FF N +E+
Sbjct: 521 LMESEKILKLFFKNVVEK 538
>gi|308799699|ref|XP_003074630.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
gi|116000801|emb|CAL50481.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
Length = 875
Score = 221 bits (563), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 127/331 (38%), Positives = 189/331 (57%), Gaps = 33/331 (9%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE---VFPFGSVPLKTYLPDGDI 106
E E++ +++P SE +R+ V +I+ L G +G V +GSVPL+ YLPDGDI
Sbjct: 32 ETLTNELVESLRPTAKSEMRRRAVFEHIKELAQGCFGTAHTLVSVYGSVPLRAYLPDGDI 91
Query: 107 DLTVLGHQNVEEDLARFVCKI---LENEDQDSVFE--IKDVQYVPAQVKIVKCSVQNIPV 161
D+ +LG V D A + K +E + +S FE + +V + A+V+++KC V + V
Sbjct: 92 DVCLLGDHRVI-DKASWTTKFQKHIEKVEAESDFEFAVSEVSVINAEVRLMKCIVDGMMV 150
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
D+S NQ GL++L FLE+ + IG+D LF RS+IL+KAW +YE RILGAH+ LI+TYALE
Sbjct: 151 DVSANQFGGLASLGFLEETNAFIGRDDLFVRSIILVKAWGFYEGRILGAHHALIATYALE 210
Query: 222 MM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
+ VLH+ L + FDW+ Y ++I+GPVA+ I +E
Sbjct: 211 TLVLYIINKYYAELTCPLSVLHKLLRVFGDFDWEGYVLTIHGPVALEDANNIPPGCLEGG 270
Query: 265 GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHR 324
LL+ EF++ Y T+ +K++NI+DPL NNNLGRSVS GN+ R
Sbjct: 271 ----LLTEEFMQSMLCQYGQIETS---NSAPVVVKYMNIIDPLVPNNNLGRSVSCGNYRR 323
Query: 325 IRCALSYGAQRLGEILTLPGHCLGMGLEKFF 355
+R AL GA+ L +++ +G+ K F
Sbjct: 324 VRAALRLGARHLDKLMERSEARDDLGVLKAF 354
>gi|414866688|tpg|DAA45245.1| TPA: hypothetical protein ZEAMMB73_273182, partial [Zea mays]
Length = 260
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 103/171 (60%), Positives = 137/171 (80%), Gaps = 2/171 (1%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
++P SE++R EV++Y +RL+ G EVF FGSVPLKTYLPDGDIDLTVLG+ + +
Sbjct: 38 VRPTEASERRRAEVVDYARRLVGSALGCEVFAFGSVPLKTYLPDGDIDLTVLGNTSYDST 97
Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
L V ILE+E+Q D+ F +KD++ + A+V+++KC++ NI VDISFNQ G+ ALCFL
Sbjct: 98 LVNDVFCILESEEQNSDAEFVVKDLERIDAEVRLIKCTIGNIIVDISFNQTGGICALCFL 157
Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRF 228
E VDR +GK+HLFKRS+ILIKAWCYYESR+LGAH+GLISTYALE+++L+ F
Sbjct: 158 ELVDRKVGKNHLFKRSIILIKAWCYYESRLLGAHHGLISTYALEVLILYVF 208
>gi|297612542|ref|NP_001065982.2| Os12g0114200 [Oryza sativa Japonica Group]
gi|255669984|dbj|BAF29001.2| Os12g0114200, partial [Oryza sativa Japonica Group]
Length = 178
Score = 212 bits (540), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 107/167 (64%), Positives = 132/167 (79%), Gaps = 12/167 (7%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYV 145
+VFPFGSVPLKTYLPDGDIDLT GH + +E LA+ V +LE+E+ +D+ FE+KDVQY+
Sbjct: 1 QVFPFGSVPLKTYLPDGDIDLTAFGHSS-DEILAKQVQAVLESEEARKDAEFEVKDVQYI 59
Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
A+VK+VKC VQNI VDISFNQ GL LCFLEKVD+ K+HLFKRS++LIKAWCYYES
Sbjct: 60 HAEVKLVKCIVQNIIVDISFNQFGGLCTLCFLEKVDQKFEKNHLFKRSIMLIKAWCYYES 119
Query: 206 RILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
RILGAH+GLISTYALE++VL+ F ++ T D GP+A+SS
Sbjct: 120 RILGAHHGLISTYALEILVLYIFHLFHGTLD---------GPLAVSS 157
>gi|428171015|gb|EKX39935.1| hypothetical protein GUITHDRAFT_113927 [Guillardia theta CCMP2712]
Length = 632
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/369 (35%), Positives = 193/369 (52%), Gaps = 52/369 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE------VFPFGSVPLKTYLPDG 104
E+ EI+ +QP +E+ R V Y+++LI E V FGSVPLKTYLP G
Sbjct: 17 EQADEIVRQLQPHRRAERHRLTVFEYVKKLIKHVADEENKTEIYVHRFGSVPLKTYLPHG 76
Query: 105 DIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP----AQ------------ 148
D+D+T ++ L R K+ + ++ ++ + V VP AQ
Sbjct: 77 DLDVTAFAANDLW--LERLKAKLEDEAKKNDMYVVSGVHSVPRDLRAQSREELGKKDQGP 134
Query: 149 ---VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
VK+VKC V I VDI+ N + G+ LCFLEKVD ++ +DHLFKR+ IL+K+WCY+ES
Sbjct: 135 VEIVKVVKCQVNGISVDITANALGGMCNLCFLEKVDTMLKRDHLFKRATILVKSWCYFES 194
Query: 206 RILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPV 248
IL + GL+STYALE + VL RFL+YY FDW N+C+++ GPV
Sbjct: 195 HILSSQNGLLSTYALETLVLCIVNIFHEELQTPLDVLKRFLEYYANFDWRNHCLTMRGPV 254
Query: 249 AISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLK 308
S++P ++ LL+ L++ + + ++ L++ F K++NI DPL
Sbjct: 255 NRSNIPPGGEVPHLDNEPSYLLNDAILQEDSHLQFL-MSGLQDDNRGFQWKYMNICDPLS 313
Query: 309 DNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLP-GHCLGMGLEK------FFINTLER 361
NN+GRSVS+ + +RI A +G Q L +L H + L K F + +
Sbjct: 314 TRNNIGRSVSRSSAYRIASAFRHGWQSLSGLLYCSLHHNSSISLTKSEKSARAFFHFTGK 373
Query: 362 NGKGLRPDV 370
G RPDV
Sbjct: 374 TLTGHRPDV 382
>gi|307104056|gb|EFN52312.1| hypothetical protein CHLNCDRAFT_58914 [Chlorella variabilis]
Length = 740
Score = 205 bits (522), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 189/328 (57%), Gaps = 59/328 (17%)
Query: 68 KKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVL---------------- 111
++R+ + +Q + +E F FGSVPL+ LPDGDID++
Sbjct: 376 EQRQAAVALVQECLQ----VEAFMFGSVPLRAVLPDGDIDISFFATAATTPSSPSGNGGE 431
Query: 112 --GHQ-------NVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIP 160
GH+ ++ + A + + LE E D+ F+I+DVQ + A+VK+VKC V ++
Sbjct: 432 QPGHRAGASPPGDLRDTWASQLLRALEREAVRPDAPFKIRDVQIIQAEVKLVKCVVHDVV 491
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
VD+SF+ + GL + FLE DR IG+ HLFKRS++L+KAWCYYESR+LGAH+GLIS+YAL
Sbjct: 492 VDVSFDTVGGLCTVAFLEAADRRIGRQHLFKRSILLLKAWCYYESRLLGAHHGLISSYAL 551
Query: 221 EMMVLH-----------------RFLDYYNTFDWDNYCISINGPVAISSLPEIVAE--TV 261
E++VL+ RFL +FDW+ YC+++ GP+ I+ L ++ + +
Sbjct: 552 EVLVLYIFNLHHAELHTPLDVLRRFLAVLGSFDWERYCLALQGPLPIADLHKLHVDRTAL 611
Query: 262 ENDGDELLLSPEFLKKCREIYSV---------PITALENTGHEFAIKHLNIVDPLKDNNN 312
+ G E LL +F++ + YSV ++ F +KHLNIVDPL +NN
Sbjct: 612 VSSGTEPLLDADFMRGVLQHYSVQHLSQQQQQEAAGMQLVAPRFPLKHLNIVDPLLPSNN 671
Query: 313 LGRSVSKGNFHRIRCALSYGAQRLGEIL 340
LGRSVSK ++ R++ AL+ G + L E L
Sbjct: 672 LGRSVSKASYARVKKALALGNRMLEEAL 699
>gi|77553482|gb|ABA96278.1| nucleotidyltransferase family protein, putative, expressed [Oryza
sativa Japonica Group]
gi|215769169|dbj|BAH01398.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 622
Score = 195 bits (495), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 98/194 (50%), Positives = 131/194 (67%), Gaps = 17/194 (8%)
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDW 237
+LIKAWCYYESRILGAH+GLISTYALE++VL+ RFLDYY+ FDW
Sbjct: 1 MLIKAWCYYESRILGAHHGLISTYALEILVLYIFHLFHGTLDGPLAVLYRFLDYYSKFDW 60
Query: 238 DNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFA 297
DN IS+ GP+++SSLPE+V ++ + D+ + +FLK+C + ++V E F
Sbjct: 61 DNKGISLYGPISLSSLPELVTDSPDTVNDDFTMREDFLKECAQWFTVLPRNSEKNTQVFP 120
Query: 298 IKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFIN 357
K NIVDPLK +NNLGRSVSKGNF RIR A +GA++LG+IL +P + + +FF N
Sbjct: 121 RKFFNIVDPLKQSNNLGRSVSKGNFLRIRSAFDFGARKLGKILQVPDNFTVDEVNQFFRN 180
Query: 358 TLERNGKGLRPDVQ 371
TL+R+ +RPDVQ
Sbjct: 181 TLKRHCSRVRPDVQ 194
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 54/114 (47%), Gaps = 9/114 (7%)
Query: 378 GSGKSEAADLSGDYDSYYNGLLYGQCYH-DYTLPVTAQFSPP--SSPSQIPSRSAWDVLC 434
G+ S +DL+GDY + +N LLY Q +H DY + F P PS P+R +
Sbjct: 352 GTIDSNWSDLTGDYTTNFNNLLYAQGFHQDYPMNQYYPFGPVYYQMPSPPPARYQ-NRRS 410
Query: 435 QYVQGKGNLVYQWGTEVFVPRLPFCHPYASQVRASTFST-DEGEKSRGTGTYIP 487
+ N+ GT P C P +R T+S D+ ++RGTGTY P
Sbjct: 411 SNGHSRNNVYGYAGTNGIGPAP--CPPGYLIMR--TYSQIDDSHRARGTGTYFP 460
>gi|168035607|ref|XP_001770301.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678518|gb|EDQ64976.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1631
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 107/247 (43%), Positives = 138/247 (55%), Gaps = 43/247 (17%)
Query: 171 LSALCFLEKVDRLIGKD-------HLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM 223
LS L E VDR I ++ HLFKRSVIL+KAWCYYESRILGAH+GLISTYALE +
Sbjct: 242 LSNLATRELVDREIDRNDFELKQNHLFKRSVILVKAWCYYESRILGAHHGLISTYALETL 301
Query: 224 -------------------VLHRFLDYYNTFDWDNYCISINGPVAISSLPEI-------- 256
VL+ FL Y+ FDWD YC+++ GPV ++ + EI
Sbjct: 302 VLYIFHVFHPKRRLRGPLEVLYLFLVYFCNFDWDKYCVTMWGPVPLARITEISSGSARKT 361
Query: 257 -----VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDN 310
AE D +LLLS EFL++C + YS E++ F K LN++DP++D
Sbjct: 362 FRISDFAEAPRKDRGKLLLSKEFLERCIDSYSDAKGGQESSQRRNFITKFLNVLDPIRDT 421
Query: 311 NNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFF---INTLERNGKGLR 367
NNLGRSV+ G+F RIR A GA+ LGE+L P + + FF +LER G R
Sbjct: 422 NNLGRSVNVGSFKRIRSAFGLGARTLGEVLECPTDQINEKFKSFFSCTFKSLERYRIGGR 481
Query: 368 PDVQVPI 374
PD P+
Sbjct: 482 PDTGNPL 488
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 97/147 (65%), Gaps = 6/147 (4%)
Query: 39 FSIDAEL----WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGS 94
FS A+L W E E++ +I+P SE++R V ++QRLI + +V FGS
Sbjct: 25 FSSAAKLEDGWWSRVEGHTAELIDSIKPTRSSEERRTAVTAFVQRLIRDRFDCKVVKFGS 84
Query: 95 VPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILEN--EDQDSVFEIKDVQYVPAQVKIV 152
VPLKTYLPDGDIDLT+ +++E A+ V K L+ ED ++ F +K+VQY+ A+VK++
Sbjct: 85 VPLKTYLPDGDIDLTIFARNDLKETWAQDVVKALKQAEEDTNAEFRVKEVQYIQAEVKLI 144
Query: 153 KCSVQNIPVDISFNQMAGLSALCFLEK 179
KC V+NI VDISFNQ GLS CFLE+
Sbjct: 145 KCLVENIVVDISFNQTGGLSTFCFLEE 171
>gi|218192781|gb|EEC75208.1| hypothetical protein OsI_11468 [Oryza sativa Indica Group]
Length = 860
Score = 179 bits (454), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 103/234 (44%), Positives = 131/234 (55%), Gaps = 62/234 (26%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVP 146
VF +GSVPLKTYLPDGD+DLTVLG+ + L + IL++E+Q D+ FE+KD+Q +
Sbjct: 20 VFAYGSVPLKTYLPDGDVDLTVLGNTSYGSTLIDDIYHILQSEEQNCDAEFEVKDLQLIN 79
Query: 147 AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
A+V DR +GK+HL K S+ILIKAWCYYESR
Sbjct: 80 AEV-------------------------------DRKVGKNHLVKNSIILIKAWCYYESR 108
Query: 207 ILGAHYGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVA 249
+LGAH+GLISTYALE + VL+RFL+Y++ FDWDNYCIS+NGPVA
Sbjct: 109 LLGAHHGLISTYALETLILYIFNLFHKSLHGPLEVLYRFLEYFSKFDWDNYCISLNGPVA 168
Query: 250 ISSLPEIVAE----------TVENDGDELLLSPEFLKKCREIYSVPITALENTG 293
+SSLP +A + G LLL PE + EIY L G
Sbjct: 169 LSSLPNQIASFNRIRTAFSYGAQKLGQVLLLQPELIPD--EIYGFFKNTLNRIG 220
>gi|2651305|gb|AAB87585.1| hypothetical protein [Arabidopsis thaliana]
Length = 384
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 122/324 (37%), Positives = 166/324 (51%), Gaps = 82/324 (25%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEV----------- 89
I+AE+WL+AE R QEILC IQP ++E+ R ++I+ +Q L+ GIEV
Sbjct: 28 IEAEVWLIAEARAQEILCAIQPNYLAERSRNKIISNLQTLLWERLGIEVRTFLLLLDELS 87
Query: 90 -----------FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFE 138
+ FGS+PLKTYLPDGDIDLTVL H EED AR VC +LE E +S +
Sbjct: 88 FSLQRIRNAKVYLFGSMPLKTYLPDGDIDLTVLTHHASEEDCARAVCCVLEAEMGNSDLQ 147
Query: 139 IKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIK 198
+ VQYV A KVD+ G+DHLFK+S+IL+K
Sbjct: 148 VTGVQYVQA-------------------------------KVDKAFGRDHLFKKSIILVK 176
Query: 199 AWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
AWC+YESRILGA+ GLISTYAL ++VL+ Y+ S++GP+A L +++
Sbjct: 177 AWCFYESRILGANSGLISTYALAILVLNIVNMSYS---------SLSGPLA--KLRDVLT 225
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
EN G K + ++V +LE G + ++ +P+ S
Sbjct: 226 LPGENVG----------WKLEKFFNV---SLERNGKG---QRQDVEEPVVAFGTGAADYS 269
Query: 319 --KGNFHRIRCALSYGAQRLGEIL 340
KG+F R R L YG GE L
Sbjct: 270 QLKGDFDRYRNNLIYGKWFHGESL 293
Score = 90.1 bits (222), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 84/166 (50%), Gaps = 20/166 (12%)
Query: 328 ALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADL 387
+LS +L ++LTLPG +G LEKFF +LERNGKG R DV+ P+ AFG+G ++ + L
Sbjct: 212 SLSGPLAKLRDVLTLPGENVGWKLEKFFNVSLERNGKGQRQDVEEPVVAFGTGAADYSQL 271
Query: 388 SGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLCQYVQGKGNLVYQW 447
GD+D Y N L+YG+ +H +L + PPS + ++W + Y+ N
Sbjct: 272 KGDFDRYRNNLIYGKWFHGESL---HNWLPPSLDT-----TSWANISFYMSRVRN----- 318
Query: 448 GTEVFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPP 493
G P P Q + +S GTGTYIP P
Sbjct: 319 GFNGRNQEGPTSMPNIKQ-------SPNNRRSNGTGTYIPGMVHSP 357
>gi|226506494|ref|NP_001141604.1| uncharacterized protein LOC100273722 [Zea mays]
gi|194705246|gb|ACF86707.1| unknown [Zea mays]
gi|413924676|gb|AFW64608.1| hypothetical protein ZEAMMB73_859338 [Zea mays]
Length = 251
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/184 (48%), Positives = 111/184 (60%), Gaps = 17/184 (9%)
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH-----------------RFLDYYNTFDW 237
+LIK WCYYES ILGA GL+STYALE +VL+ RFLDYY+ FDW
Sbjct: 1 MLIKHWCYYESCILGAQRGLVSTYALETLVLYIFHVFHKSLDGPLAVLYRFLDYYSKFDW 60
Query: 238 DNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFA 297
DN IS+ GP+++SSLPE+V E D L FLK C + +SVP E F+
Sbjct: 61 DNKGISLFGPISLSSLPELVTEPPYTRDDGFLSREAFLKDCAKAFSVPPINSEENPQVFS 120
Query: 298 IKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFIN 357
K +NIVDPLK +NNLGRS+SKGN RIR +GA +LG+IL P + +FF N
Sbjct: 121 KKFVNIVDPLKQSNNLGRSISKGNLGRIRKEFYFGACKLGKILQAPACFSANEINRFFRN 180
Query: 358 TLER 361
TL R
Sbjct: 181 TLSR 184
>gi|303287038|ref|XP_003062808.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455444|gb|EEH52747.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 781
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 123/212 (58%), Gaps = 28/212 (13%)
Query: 152 VKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAH 211
+KC + VDIS NQ GL+ L FLE+VD I +D +FKRS+ILIKAW +YE R+LGAH
Sbjct: 1 MKCIADGVVVDISANQFGGLATLGFLEEVDAFIARDGIFKRSIILIKAWGFYEGRVLGAH 60
Query: 212 YGLISTYALEMM-----------------VLHRFLDYYNTFDWDNYCISINGPVAISSLP 254
+ LISTYALE + VLH+FL Y+ F+WD Y +SI+GPV + +L
Sbjct: 61 HALISTYALETLVLYVLNAYHEELSTPLEVLHKFLTYFADFEWDAYAVSIHGPVRLDALE 120
Query: 255 EIVAETVENDGDELLLSPEFLKKCREIYS--VPITALENTG--------HEFAIKHLNIV 304
+ V + + LL+P F K+ + Y I A + KHLN++
Sbjct: 121 KGVRD-ADAPARGPLLTPAFTKRVLDKYGNDAIINAEKGQAGPGGGGNRRAMQPKHLNVI 179
Query: 305 DPLKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
DPL +NNLGRSVS+GN RI+ AL GA +L
Sbjct: 180 DPLLPSNNLGRSVSQGNAKRIQKALRLGAAKL 211
>gi|412992209|emb|CCO19922.1| predicted protein [Bathycoccus prasinos]
Length = 1318
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 142/299 (47%), Gaps = 69/299 (23%)
Query: 136 VFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
+ E+KD+ + A V+++KC V I VD+S NQ GL+ L FL++V+ IGK+ LFKRSVI
Sbjct: 288 LLEVKDIVVIHADVRLLKCVVDGIVVDVSANQFGGLATLAFLKEVNSKIGKNDLFKRSVI 347
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMM------------------------------VL 225
L+KAW +YESRILGA Y L+STYAL+ + VL
Sbjct: 348 LVKAWAFYESRILGAPYALLSTYALKTLIICALRRFNKKESKSDATKTKKREIATPLDVL 407
Query: 226 HRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDEL----------------- 268
F +Y + F W+ + ++I G V + L ++ +
Sbjct: 408 RIFFEYVSDFPWETHAVTIFGDVPVEKLDKVSVREFSSSSKSEKNKNKNNDDEREEKDDE 467
Query: 269 ------LLSPEFLKKCREIY---SVP-ITALENTGH---------EFAIKHLNIVDPLKD 309
LL F+ + Y S P L N G+ KHL+I+DPL +
Sbjct: 468 EAEEDPLLDDTFVDTILKSYGPDSRPDANVLLNIGNGKKAPFRRRAIGAKHLHILDPLSE 527
Query: 310 NNNLGRSVSKGNFHRIRCALSYGAQRLG--EILTLPGHCLGMGLEKFFINTLERNGKGL 366
NNLGRSVS GNF R+R A GA+RL E+ + P + + G E FF L G L
Sbjct: 528 TNNLGRSVSLGNFARVRAAFRLGAERLKRLEMESEPEN-ITRGFEYFFKVALANRGGKL 585
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 14/83 (16%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE--------------VFPFG 93
+ E +E++ +++P+ SEK+R+ V ++ LI + E V FG
Sbjct: 127 MLEALTEELIASLRPSKQSEKRRRMVFRKMESLIRECFEKEFEGEGVNEKKNTIVVSAFG 186
Query: 94 SVPLKTYLPDGDIDLTVLGHQNV 116
SVP TYLPDGDID+ +LG V
Sbjct: 187 SVPFGTYLPDGDIDVCILGDHEV 209
>gi|301093296|ref|XP_002997496.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110638|gb|EEY68690.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 782
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 168/369 (45%), Gaps = 89/369 (24%)
Query: 62 PAIVSEKKRKEVINYIQRLINGYYGIEVFPF----GSVPLKTYLPDGDIDLTVLGHQNVE 117
P+ V+++ R++V++++QR+I ++ + P GS P+KTYLP D+D+ +L Q +E
Sbjct: 257 PSDVADRVRQQVLSFVQRVITAHFPLAAAPLFFATGSYPMKTYLPGSDLDICLLVPQELE 316
Query: 118 ED----LARFVCKILENEDQDSVFE--------------------------------IKD 141
+ + +C + +V + +++
Sbjct: 317 SSWYYIVTQALCVAGGSGGAGTVLDLGNSASSDVSGSSSPSGPAAASGGGPLLLTNTVRN 376
Query: 142 VQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
V ++ A V++VKC+V NIPVD + N++ L A+ L+ + +G+ HLFK+S+ILIKAWC
Sbjct: 377 VTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAARVGRQHLFKKSLILIKAWC 436
Query: 202 YYESR---------------------ILGAHYGLISTYALEMMV---------------- 224
+ESR ++GA +G +STYA+ +V
Sbjct: 437 THESRPFMQRASNEAGGSVPGSTPASVMGASHGALSTYAVNTIVMALFNQHGDALTHPLQ 496
Query: 225 -LHRFLDYYNTFDWDNYCISINGPVAISSLPEI----------VAETVENDGDELLLSPE 273
L+ FLD F W ++++GPV +S L +T + D ++ +
Sbjct: 497 ALYLFLDRLAEFPWHECALTLHGPVPLSRLASTPLNGTTSYRSKLKTAKLDASDVEAIRD 556
Query: 274 FLKKCREIYSVPITALENT-GHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYG 332
L + + + + T F I+ NIVDPL D NNL RSVS F ++ A
Sbjct: 557 TLADQFGAFDAALKSSKGTPTGLFPIRACNIVDPLDDKNNLARSVSAEGFPVMKRAFRLA 616
Query: 333 AQRLGEILT 341
+L +L
Sbjct: 617 RDQLAAMLA 625
>gi|325189429|emb|CCA23919.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1193
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/419 (27%), Positives = 191/419 (45%), Gaps = 78/419 (18%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY----GIEVFPFGSVPLKTYLPDGD 105
E +++++ + P +++ R ++ Y++ L+ + I FP GS P KTYLPD D
Sbjct: 723 ETSVKKLIHALSPTHEADQARCNILAYLRHLLELQFPRSSSILFFPTGSFPCKTYLPDAD 782
Query: 106 IDLTVLGHQNVEEDLARFVCKIL------------------------ENEDQDSVFEIKD 141
+D+ +L +++E V ++L N + +++
Sbjct: 783 LDVCLLVPRSMEPTWFFSVVQMLCFAATNDVHAEPKHSLESVQAPSWMNSTSSTGNTVRN 842
Query: 142 VQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
V ++ A V++VKC++ N+ VDI+ N++ L AL L+ D +G+ HLFK+S++LIKAWC
Sbjct: 843 VTFINADVRVVKCTIDNVAVDITVNRVGALGALVLLDTFDLRVGRHHLFKQSLVLIKAWC 902
Query: 202 YYESR-------ILGAHYGLISTYALEMMV-----------------LHRFLDYYNTFDW 237
+ +LG+ G STYA+ MV LH FLD F W
Sbjct: 903 ALDCLEGGQGCGVLGSKNGAFSTYAVNTMVMTLFNRWGYRIQHPLEALHLFLDIMTQFPW 962
Query: 238 DNYCISINGPVAISSL-----PEIVAETVENDGDELLLSPEFLKKCR----EIY-SVPIT 287
+I GPV + L IV E L++ E +++ R E + S ++
Sbjct: 963 QECAWTIFGPVLFTQLYQNLSSRIVPPGWETASANCLITREDIEQIRVCLNEYFGSFDVS 1022
Query: 288 ALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCL 347
T F ++ N++DPL+ NNL RSV F ++ G RL +L+ +
Sbjct: 1023 LGTETNAVFPLRSFNMIDPLQLGNNLARSVLPEIFPSLQVMFRDGRDRLDRVLSEEKTVM 1082
Query: 348 GMGLEKFFINT--LERNGKGLRPDV-----QVPIHAFGSGKSEAADLSGDYDSYYNGLL 399
+FF ++ L G G RPD+ Q+ + A G + D SG D+ + L+
Sbjct: 1083 -----EFFKHSWKLYGRGDGWRPDLLLHPRQLWVQANGRDEK---DESGS-DTKWKSLI 1132
>gi|403357215|gb|EJY78230.1| hypothetical protein OXYTRI_24618 [Oxytricha trifallax]
Length = 831
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/339 (30%), Positives = 164/339 (48%), Gaps = 55/339 (16%)
Query: 56 ILCTIQPAIVSEKKRKEVINYIQRLINGYYG----IEVFPFGSVPLKTYLPDGDIDLTVL 111
L I P SE+KR ++ ++ LI G + V +GS PLKTYLPD DID+TV+
Sbjct: 28 FLNKIGPTQESERKRVKIFEQVKFLIEKALGGKSQVMVIRYGSDPLKTYLPDSDIDITVI 87
Query: 112 GHQNVEEDLA---------RFVCKILE--NEDQDSVFEIKDVQYV-PAQVKIVKCSVQNI 159
++ + + + K +E E Q+ +K + + A V+I+K + QN
Sbjct: 88 RRDYLQGNQTNQLTALTQLKLIKKEIEIFGETQNGKNFVKSMVLIDQADVEIIKLNFQNT 147
Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
VDIS Q+ G+ L F+ + + IGK L K+S+IL+KAW Y++ ILG+ ++TYA
Sbjct: 148 FVDISIKQVGGICTLYFMNYMAKRIGKQQLLKKSIILLKAWFTYDASILGSQAACMATYA 207
Query: 220 LEMMV---LHRFLDYYNT--------------FDWDNYCISINGPVAISSLPEIVAETVE 262
L +MV L+ F D N+ FDW+N ++I GP+ S E + E +
Sbjct: 208 LYVMVLFILNNFYDELNSPMDVIMMFFKVWSHFDWENNIVTIFGPIKSSGFYERLKE-CQ 266
Query: 263 NDGDELLLSPEFLKKCREIYSVPIT-------ALENTG--------------HEFAIKHL 301
D D L + L + + + +T L+ +G F K+
Sbjct: 267 FDIDRLTMLDRSLHQEYQYRKLLVTPDELSFLNLQFSGVRLSDVSSYNLANKKSFNTKYF 326
Query: 302 NIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
NI+DP NNLG+S+SK N RI+ L ++ +I
Sbjct: 327 NIIDPTFSKNNLGKSISKLNSSRIKQVLRLQNMKMRQIF 365
>gi|401410712|ref|XP_003884804.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
gi|325119222|emb|CBZ54776.1| hypothetical protein NCLIV_052020 [Neospora caninum Liverpool]
Length = 3449
Score = 136 bits (342), Expect = 4e-29, Method: Composition-based stats.
Identities = 72/194 (37%), Positives = 113/194 (58%), Gaps = 29/194 (14%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNV------EEDLARFVCKILENEDQDSV---- 136
I V+ +GS PL+T+LPDGD+D+ ++ + EE+ + +L+ ++ V
Sbjct: 356 ITVYRYGSFPLRTFLPDGDLDIGIISYNRRTGVVEGEEESDALLAVLLDKFQREDVKTHK 415
Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
F +++ V A+V+I+KC V I VD+S N++ G +L FLE DR IG+ HLFKRSV+
Sbjct: 416 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRHHLFKRSVL 475
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
LIK+W YES +LG+ GL++TY +E +VLH F YY++F W
Sbjct: 476 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVLPASLLPTPLHLLYHFFSYYSSFHW 535
Query: 238 DNYCISINGPVAIS 251
D Y ++ GP+ ++
Sbjct: 536 DRYAVTACGPLPLT 549
Score = 44.3 bits (103), Expect = 0.21, Method: Composition-based stats.
Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFF 355
F + +N+VDPL + NNL RSVS+ F+R+ A+ G Q L IL L F
Sbjct: 890 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTHILATGDAARFRTL--FL 947
Query: 356 INTLERNGKGLRPDVQVPI 374
N+ + + PDV P+
Sbjct: 948 PNSYQLLDRIKSPDVAYPV 966
>gi|221502484|gb|EEE28211.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 3297
Score = 134 bits (337), Expect = 2e-28, Method: Composition-based stats.
Identities = 74/194 (38%), Positives = 113/194 (58%), Gaps = 29/194 (14%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVL------GHQNVEEDLARFVCKILENEDQDSV---- 136
I V+ +GS PL+T+LPDGD+D+ V+ G EE+ + +LE + V
Sbjct: 222 IAVYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHK 281
Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
F +++ V A+V+I+KC V I VD+S N++ G +L FLE DR IG++HLFKRSV+
Sbjct: 282 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVL 341
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
LIK+W YES +LG+ GL++TY +E +VLH +F YY++F W
Sbjct: 342 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHW 401
Query: 238 DNYCISINGPVAIS 251
D Y ++ G + ++
Sbjct: 402 DRYAVTACGALPLT 415
Score = 44.3 bits (103), Expect = 0.24, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 30/45 (66%)
Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
F + +N+VDPL + NNL RSVS+ F+R+ A+ G Q L ++L
Sbjct: 739 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 783
>gi|221482136|gb|EEE20497.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 3441
Score = 134 bits (337), Expect = 2e-28, Method: Composition-based stats.
Identities = 74/194 (38%), Positives = 113/194 (58%), Gaps = 29/194 (14%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVL------GHQNVEEDLARFVCKILENEDQDSV---- 136
I V+ +GS PL+T+LPDGD+D+ V+ G EE+ + +LE + V
Sbjct: 367 IAVYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHK 426
Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
F +++ V A+V+I+KC V I VD+S N++ G +L FLE DR IG++HLFKRSV+
Sbjct: 427 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVL 486
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
LIK+W YES +LG+ GL++TY +E +VLH +F YY++F W
Sbjct: 487 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHW 546
Query: 238 DNYCISINGPVAIS 251
D Y ++ G + ++
Sbjct: 547 DRYAVTACGALPLT 560
Score = 44.3 bits (103), Expect = 0.24, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 30/45 (66%)
Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
F + +N+VDPL + NNL RSVS+ F+R+ A+ G Q L ++L
Sbjct: 884 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 928
>gi|237843045|ref|XP_002370820.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
gi|211968484|gb|EEB03680.1| hypothetical protein TGME49_014990 [Toxoplasma gondii ME49]
Length = 3436
Score = 134 bits (337), Expect = 2e-28, Method: Composition-based stats.
Identities = 74/194 (38%), Positives = 113/194 (58%), Gaps = 29/194 (14%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVL------GHQNVEEDLARFVCKILENEDQDSV---- 136
I V+ +GS PL+T+LPDGD+D+ V+ G EE+ + +LE + V
Sbjct: 361 IAVYRYGSFPLRTFLPDGDLDIGVISFNRRTGVLEGEEESDALLAVLLEKFQRAEVKSHK 420
Query: 137 -FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
F +++ V A+V+I+KC V I VD+S N++ G +L FLE DR IG++HLFKRSV+
Sbjct: 421 TFPLREASLVDAEVRILKCIVSGIAVDVSVNKVGGCCSLVFLELADRRIGRNHLFKRSVL 480
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLH------------------RFLDYYNTFDW 237
LIK+W YES +LG+ GL++TY +E +VLH +F YY++F W
Sbjct: 481 LIKSWFAYESHLLGSRSGLLATYCVEALVLHLFHVFPAALLPTPLHLLYQFFSYYSSFHW 540
Query: 238 DNYCISINGPVAIS 251
D Y ++ G + ++
Sbjct: 541 DRYAVTACGALPLT 554
Score = 44.3 bits (103), Expect = 0.24, Method: Composition-based stats.
Identities = 20/45 (44%), Positives = 30/45 (66%)
Query: 296 FAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEIL 340
F + +N+VDPL + NNL RSVS+ F+R+ A+ G Q L ++L
Sbjct: 878 FLFRSMNVVDPLHNGNNLARSVSETAFYRLLHAMKKGLQALTQVL 922
>gi|348683529|gb|EGZ23344.1| hypothetical protein PHYSODRAFT_485178 [Phytophthora sojae]
Length = 793
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/446 (24%), Positives = 186/446 (41%), Gaps = 115/446 (25%)
Query: 49 AEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPF----GSVPLKTYLPDG 104
A + ++ + P+ +++ R++V++++Q++I ++ + P GS P+KTYLP
Sbjct: 248 AARQADALIEWMGPSDAADRVRQQVLSFVQQVITAHFPLAAAPLFFATGSYPMKTYLPGS 307
Query: 105 DIDLTVLGHQNVEED----LARFVCKILENEDQDSVFEI--------------------- 139
D+D+ +L Q +E + + +C + +V ++
Sbjct: 308 DLDICLLVPQELESSWYFIVTQALCIAGGSGGAGTVLDVGNPGGSVDGSGSSSPSGPAVG 367
Query: 140 -------------KDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK 186
++V ++ A V++VKC+V NIPVD + N++ L A+ L+ + +G+
Sbjct: 368 SGSSGALLLTNTVRNVTFINADVRVVKCTVDNIPVDFTANRVGALGAVRLLDAMAVRVGR 427
Query: 187 DHLFKRSVILIKAWCYYES-------------------------RILGAHYGLISTYALE 221
HLFK+S+ILIKAWC +ES ++GA +G +STYA+
Sbjct: 428 QHLFKKSLILIKAWCTHESSPFMQAASVECGGLGPSVVPGSTPTSVMGASHGALSTYAVN 487
Query: 222 MMV-----------------LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVEND 264
+V L+ FLD F W ++++G V +S L
Sbjct: 488 TIVMALFNQHGDALTHPLQALYLFLDRLAEFPWHEAALTLHGAVPLSRLATTPLNGTTPS 547
Query: 265 GDEL---LLSPEFLKKCREIYSVPITALE---NTGHE-----FAIKHLNIVDPLKDNNNL 313
+L L ++ R+ S A + +G F I+ NIVDPL D NNL
Sbjct: 548 KSKLKAAKLDAGDVEAIRDTLSDQFGAFDAGLRSGKSAPTGLFPIRACNIVDPLDDKNNL 607
Query: 314 GRSVSKGNFHRIRCALSYGAQRLGEILT---------------LPGHCLGMGLEKFFIN- 357
RSVS F ++ A +L +L G +GM FF
Sbjct: 608 ARSVSAEGFPVMKRAFRLARDQLAAMLAPRTSHRDDDAELLSEETGSDVGMA---FFSRC 664
Query: 358 -TLERNGKGLRPDVQVPIHAFGSGKS 382
L G G RPD+ + GK+
Sbjct: 665 WQLYGRGDGWRPDLLIHPRQIWHGKA 690
>gi|242051292|ref|XP_002463390.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
gi|241926767|gb|EER99911.1| hypothetical protein SORBIDRAFT_02g042970 [Sorghum bicolor]
Length = 208
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 56/124 (45%), Positives = 89/124 (71%), Gaps = 2/124 (1%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
+ P +E++R++VI+Y++RLI G EVF FGSVPL+TYLPDGD+D+TVLG+ +
Sbjct: 84 VHPTQEAERRRQDVISYLRRLIGSSLGCEVFAFGSVPLRTYLPDGDVDITVLGNTWLNST 143
Query: 120 LARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
V +LE+E + D+ F++ + ++ A+VK++KC ++NI VD+SFNQ+ G+S CFL
Sbjct: 144 FIDDVRSMLESEQENCDAEFKLTGLHFINAEVKLIKCIIENIIVDVSFNQIGGVSTFCFL 203
Query: 178 EKVD 181
E ++
Sbjct: 204 ELIN 207
>gi|308163112|gb|EFO65472.1| Hypothetical protein GLP15_5146 [Giardia lamblia P15]
Length = 719
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 134/273 (49%), Gaps = 44/273 (16%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
+R I+ + P SE+ R ++ +I R+I + P+GS K YLP D+D+
Sbjct: 48 QRTDYIVSLVSPDKASEEFRLKIFTFISRVIEAVLPNTLIVPYGSFISKIYLPSSDLDIC 107
Query: 110 VLGH--------QNVEEDLARF---------------VCKILENE-DQDSVFEIKDVQYV 145
H Q + E L F V +++ + + E+++++++
Sbjct: 108 CYNHGLDEIPLLQKILEALTIFSDPSLRPTGVRVSPAVSQLINSRISAEERLELENIEFI 167
Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
A+V ++KC+V + VDIS Q L +EK+ + IG+++L KRS +LI++WC YE+
Sbjct: 168 MAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEA 227
Query: 206 RILGAHYGLISTYALEMMV----------------LHRFLDYYNTFDWDNYCISINGPVA 249
RI+G H ++S+YAL +MV L+ FL YY+TFD+D + +GP
Sbjct: 228 RIVGGHSQMLSSYALRVMVINILLNCKDIYTPFQALYVFLAYYSTFDYDKNIVHPSGPFP 287
Query: 250 ISSLPEIVAETVENDGDELL---LSPEFLKKCR 279
E + T + E+L +SP +L R
Sbjct: 288 KPPAHESLTMTQLSRSMEVLESHISPSYLNHLR 320
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 18/114 (15%)
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
E V ND D LLL LK+ + + P + + F +++IVDPL+ NNLGRSVS
Sbjct: 556 EPVINDHDILLL----LKRYFSMGTFP--DVLSDSRVFLPSYISIVDPLQVINNLGRSVS 609
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGL---------EKFFINTLERNG 363
+ NF RI + L +I+ + C+ + + FFI+TL G
Sbjct: 610 EPNFMRITRSFQTAHIVLSDIVQM---CITGSMTITEALAEYDCFFISTLSIFG 660
>gi|159108047|ref|XP_001704297.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
gi|157432356|gb|EDO76623.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
Length = 512
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 116/199 (58%), Gaps = 20/199 (10%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL-GHQNVEEDLARFVCKI 127
R ++I YI+ ++ + +++ P+GS + +LPDGDIDL ++ G + + LA+F +
Sbjct: 55 RYQIIKYIRDKLHSLFPELQLIPYGSFVTRIFLPDGDIDLAIIVGEDDAADVLAQFYIYL 114
Query: 128 LE--NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIG 185
E +D+ F++ ++ + A+V I++ + + +DIS + GL +L+ ++ IG
Sbjct: 115 KEVAASHEDTPFKLTNLSKIQAEVPIIRLVINGVFIDISSARPVGLVTSLYLQLLNDAIG 174
Query: 186 KDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM----------------VLHRFL 229
+++L KRSVILI+AWC YE+ ILG+H ++++YAL +M VL +F
Sbjct: 175 RNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTTFILTNSPELVHPLQVLFKFF 234
Query: 230 DYYNTFDWDNYCISINGPV 248
+Y+ FD+ N I+ G V
Sbjct: 235 AFYSAFDFTNNTITAFGVV 253
>gi|253742434|gb|EES99267.1| Hypothetical protein GL50581_3482 [Giardia intestinalis ATCC 50581]
Length = 711
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 77/276 (27%), Positives = 134/276 (48%), Gaps = 41/276 (14%)
Query: 14 QEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEV 73
+ EL S H +S + + + A+ +R I+ + P SE+ R ++
Sbjct: 11 RNELHKSRDHKASEEGEDTEEAIVEAARVAQALEALSQRTDYIISLVSPDRASEEFRLKI 70
Query: 74 INYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH--------QNVEEDLARF- 123
+I ++I+ + P+GS K YLP D+D+ H Q + E L F
Sbjct: 71 FTFISKVIDVVLPNTLIVPYGSFISKIYLPSSDLDICCYNHSIDEIPLLQKILEALMVFS 130
Query: 124 --------------VCKILENE-DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQM 168
V +++ + D E+++++++ A+V ++KC+V + VDIS Q
Sbjct: 131 DPNLQSTGTRVSPVVSQLINSHISADERLELENIEFIMAKVSLIKCTVCGLGVDISAAQP 190
Query: 169 AGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM----- 223
L +EK+ + IG+++L KRS +LI++WC YE+RI+G H ++S+YAL +M
Sbjct: 191 GSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEARIVGGHSQMLSSYALRVMIINIL 250
Query: 224 -----------VLHRFLDYYNTFDWDNYCISINGPV 248
VL+ FL YY+ FD+D I +GP+
Sbjct: 251 INCKDIYTPFQVLYVFLAYYSNFDYDRNIIHPSGPL 286
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 62/135 (45%), Gaps = 19/135 (14%)
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVS 318
E V ND D L L LK R + T + + F +++IVDPL+ NNLGRSVS
Sbjct: 546 EPVLNDHDVLFL----LK--RYFSTSTFTDVLSDSQVFLPSYISIVDPLQVTNNLGRSVS 599
Query: 319 KGNFHRIRCALSYGAQRLGEILTL---PGHCLGMGLEK---FFINTLERNG-------KG 365
+ NF RI + L +I+ + G + L + FFI+TL G G
Sbjct: 600 EPNFMRITRSFQTAHTVLSDIVQMCITGGTTITEALAEYDCFFISTLSIFGDAHAEKLHG 659
Query: 366 LRPDVQVPIHAFGSG 380
P++ + I +G
Sbjct: 660 DVPNLSINISVLEAG 674
>gi|308159127|gb|EFO61675.1| Topoisomerase I-related protein [Giardia lamblia P15]
Length = 512
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 121/217 (55%), Gaps = 20/217 (9%)
Query: 52 RIQEILCTIQPAIVSEKKRKEVINYIQ-RLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV 110
R+ + + P S R ++I YI+ +L + + +++ P+GS + +LPDGDIDL +
Sbjct: 37 RLNVLYHILAPTEDSITYRYQIIKYIRDKLHDLFPELQLIPYGSFVTRIFLPDGDIDLAI 96
Query: 111 L-GHQNVEEDLARFVCKI--LENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
+ G + + L +F + + +D+ F + ++ + A+V I++ + I +DIS +
Sbjct: 97 IVGEDDAADVLTQFYIHLKDIVASQEDTPFRVTNLSKIQAEVPIIRLVINGIFIDISSAR 156
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
GL +L+ ++ IG+++L KRSVILI+AWC YE+ ILG+H ++++YAL +M
Sbjct: 157 PVGLVTSLYLQLLNDAIGRNNLLKRSVILIQAWCLYEAHILGSHSQMLNSYALRVMTIFI 216
Query: 224 ------------VLHRFLDYYNTFDWDNYCISINGPV 248
VL +F +Y+ FD+ N I+ G +
Sbjct: 217 LTNSPELVHPLQVLFKFFAFYSAFDFTNNTITAFGVI 253
>gi|159115240|ref|XP_001707843.1| Hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
gi|157435951|gb|EDO80169.1| hypothetical protein GL50803_17166 [Giardia lamblia ATCC 50803]
Length = 731
Score = 118 bits (296), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 134/273 (49%), Gaps = 44/273 (16%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
+R I+ + P SE+ R ++ +I ++I + P+GS K YLP D+D+
Sbjct: 63 QRTDYIVSLVSPDKASEEFRLKIFTFISKVIEAVLPNTLIVPYGSFISKIYLPSSDLDIC 122
Query: 110 VLGH--------QNVEEDLARF---------------VCKILENE-DQDSVFEIKDVQYV 145
H Q + E L F V +++ + + E+++++++
Sbjct: 123 CFNHGLDEIPLLQKILEALTVFSDPSLRPTGVRVPPAVSQLINSRIPTEERLELENIEFI 182
Query: 146 PAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
A+V ++KC+V + VDIS Q L +EK+ + IG+++L KRS +LI++WC YE+
Sbjct: 183 MAKVSLIKCTVCGLGVDISAAQPGSLVTSLLIEKLSQSIGRNNLLKRSFLLIQSWCLYEA 242
Query: 206 RILGAHYGLISTYALEMMV----------------LHRFLDYYNTFDWDNYCISINGPVA 249
RI+G H ++S+YAL +MV L+ FL YY++FD+D + +GP+
Sbjct: 243 RIVGGHSQMLSSYALRVMVINILLNCRDIYTPFQALYVFLAYYSSFDYDRDIVHPSGPLP 302
Query: 250 ISSLPEIVAETVENDGDELL---LSPEFLKKCR 279
E + + E+L +SP +L R
Sbjct: 303 KPPAHESLTAAQLSGSMEVLESHISPSYLNHLR 335
>gi|253744327|gb|EET00549.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
Length = 511
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 124/217 (57%), Gaps = 20/217 (9%)
Query: 52 RIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTV 110
R+ + + P+ S R ++I YI+ ++ + +++ P+GS + +LPDGD+DL++
Sbjct: 37 RLNVLYHILAPSEDSISCRYQIIKYIRDELHSIFPELQLIPYGSFVTRIFLPDGDVDLSI 96
Query: 111 LGHQNVEEDL-ARFVCKILE--NEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
+ ++ D+ ++F + E + + + F+I ++ + A++ I++ + I +DIS +
Sbjct: 97 IVAEDDANDVFSQFYTHLKEIASSQEHATFKITNLSKIQAEMSIIRLVINGIFIDISAAR 156
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM---- 223
GL +++ ++ IG+++L KRSVIL++AW YE+ ILG+H ++++YAL +M
Sbjct: 157 PTGLVTSLYIQLLNDSIGRNNLLKRSVILVQAWSLYEAHILGSHSQMLNSYALRVMTAFI 216
Query: 224 ------------VLHRFLDYYNTFDWDNYCISINGPV 248
VL +F +Y+TFD+ N I+ G +
Sbjct: 217 LTNSPELVHPLQVLFKFFAFYSTFDFTNNTITAFGVI 253
>gi|297600524|ref|NP_001049344.2| Os03g0210800 [Oryza sativa Japonica Group]
gi|255674303|dbj|BAF11258.2| Os03g0210800 [Oryza sativa Japonica Group]
Length = 871
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 72/116 (62%), Gaps = 1/116 (0%)
Query: 256 IVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLG 314
+ AE D ELLLS FL KC Y+V E+ G + F KH N++DPL+ NNNLG
Sbjct: 1 MTAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60
Query: 315 RSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
RSVSKGNF RIR A S+GA+RL ++L P L + +FF NT R+G G RPD
Sbjct: 61 RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 116
>gi|222624434|gb|EEE58566.1| hypothetical protein OsJ_09878 [Oryza sativa Japonica Group]
Length = 1064
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/116 (50%), Positives = 72/116 (62%), Gaps = 1/116 (0%)
Query: 256 IVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHE-FAIKHLNIVDPLKDNNNLG 314
+ AE D ELLLS FL KC Y+V E+ G + F KH N++DPL+ NNNLG
Sbjct: 1 MGAEPPRMDAAELLLSKSFLDKCSYAYAVTPRIQESQGQQPFVSKHFNVIDPLRTNNNLG 60
Query: 315 RSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
RSVSKGNF RIR A S+GA+RL ++L P L + +FF NT R+G G RPD
Sbjct: 61 RSVSKGNFFRIRSAFSFGAKRLAKLLECPKEDLIAEVNQFFTNTWIRHGSGNRPDA 116
>gi|298707565|emb|CBJ30149.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
Length = 1301
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 122/292 (41%), Gaps = 79/292 (27%)
Query: 33 PPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPF 92
P N P +ID R+ ++L ++PA +E R+ V ++ R + G + FP
Sbjct: 607 PSNKQPLAID--------RRVDDLLRLLRPAPRAEGYRRSVFRFVTRQVKRALGAQCFPV 658
Query: 93 GSVPLKTYLPDGDIDLTV-LGHQNVEEDLARF---VCK---------------------- 126
G ++ YLPD ++ ++ L H + R +CK
Sbjct: 659 GGYAIQAYLPDEEVGISAFLCHGQEKSWFVRVNETLCKVSSEASEEAAEEEGSGTSVGGC 718
Query: 127 ------------------------ILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQN-IP 160
I + E + +V ++ +V+ +KC V N +
Sbjct: 719 PEKITAVGEGATPKQEGGDGGGAAIRKEEGSSYRHRLSNVNFINMGRVQKIKCVVDNQVA 778
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL--ISTY 218
VDI NQ+ ++ + LE+ D+L+GKDHLFKRS++LIK+W YESR L I+
Sbjct: 779 VDIGANQVGDIATVALLEETDQLLGKDHLFKRSLLLIKSWWVYESRAYTGSNMLSRITES 838
Query: 219 ALEMMVL-----HR------------FLDYYNTFDWDNYCISINGPVAISSL 253
AL MVL H F ++ FDW YC I GP + +L
Sbjct: 839 ALATMVLAVVNQHHARLHTPLQVMALFFQMHSHFDWSRYCWCIEGPRRLDTL 890
>gi|298710234|emb|CBJ26309.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1317
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 118/270 (43%), Gaps = 71/270 (26%)
Query: 138 EIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILI 197
EI++V + A+ IV V N+ VD++ NQ ++A LE+ D LI ++HLFKRS++L+
Sbjct: 116 EIRNVSLINARTPIVTMVVGNVVVDLTENQGGSVAASALLEEADNLIQRNHLFKRSLLLL 175
Query: 198 KAWCYYES------RILGAHYGLISTYALEMMVLH-------------------RFLDYY 232
KAW + E+ R+LGA G +++Y L +MVLH RF + Y
Sbjct: 176 KAWAWCETPRLVGNRVLGARKGGLTSYGLSVMVLHLFAASASADALVHPLDVLIRFFEVY 235
Query: 233 NTFDWDNYCISINGPVAISSL--PEIVAETVENDGDELLLSPEFLKKCREIYSVPIT--- 287
+ FDW YC++++GPV + S+ N G P K E++ VP
Sbjct: 236 SEFDWARYCLTLDGPVPLESVRGSHPYEGWHGNGGTSSRFEPLVKKVLAELFRVPEKDRE 295
Query: 288 ----------------------------------ALENTGH-------EFAIKHLNIVDP 306
AL T F + NI DP
Sbjct: 296 KEREKAKEKDKDKAGRRGRRASRVGRLSDRHAEDALPETPRFAAGAMPHFPRRDCNIQDP 355
Query: 307 LKDNNNLGRSVSKGNFHRIRCALSYGAQRL 336
L NNLG SV+K N ++ AL G ++L
Sbjct: 356 LNALNNLGHSVTKNNLKALKRALQQGQKKL 385
>gi|224064218|ref|XP_002301405.1| predicted protein [Populus trichocarpa]
gi|222843131|gb|EEE80678.1| predicted protein [Populus trichocarpa]
Length = 141
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 73/111 (65%), Gaps = 3/111 (2%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D+E W AEER E++ IQP SE+ R V +Y+QRLI + +VF FGSVPLKTY
Sbjct: 31 LDSERWSKAEERTAELIACIQPNQPSEELRNAVADYVQRLIAKCFPCQVFTFGSVPLKTY 90
Query: 101 LPDGDIDLTVLGHQ-NVEEDLARFVCKIL--ENEDQDSVFEIKDVQYVPAQ 148
LPDGDIDLT N+++ A V +L E +++++ F +K+VQY+ A+
Sbjct: 91 LPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAE 141
>gi|342184813|emb|CCC94295.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 1108
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 109/265 (41%), Gaps = 80/265 (30%)
Query: 72 EVINYIQRLINGYYGIE-VFPFGSVPLKTYLPDGDIDLTV-----LGHQNVE---EDLAR 122
E IN + YG + + FGS+ T LPDGD D+T+ L +E E
Sbjct: 362 EAINSVAWNAGIKYGEKRYYVFGSLATGTVLPDGDNDITIEVDGLLNPAKIEIQGEAQNS 421
Query: 123 FVCKILENEDQDSVF-----------------EIKDV-----------QYVPAQVKIVKC 154
F + DS+ EI D V A+V+++K
Sbjct: 422 FPNGAATSSCNDSISATSSSHATAIAGGELLSEIADYLRENNASVYVDTVVVAEVRVLKL 481
Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
+ D++ Q+ G+S + FL ++D +G +HL KR+++L+KAWC YE+ +L G
Sbjct: 482 VMDGSSYDVTVGQLGGVSCIRFLHEMDMRVGCEHLLKRTLLLMKAWCCYEAHVLSGQGGY 541
Query: 215 ISTYALEMMVL-------------------------------------------HRFLDY 231
+S+YA +M++ RFL Y
Sbjct: 542 MSSYAATVMLITMINTVEFLEDVEAEGSDGKTCSNCPEGHKSEGHVQISPLQLFARFLKY 601
Query: 232 YNTFDWDNYCISINGPVAISSLPEI 256
Y+ FD+D YC+++ GPV + +I
Sbjct: 602 YSYFDFDRYCLTLFGPVPCDRVNQI 626
>gi|340057832|emb|CCC52183.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 1145
Score = 85.5 bits (210), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 109/262 (41%), Gaps = 88/262 (33%)
Query: 72 EVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTV----------LGHQNVEEDL 120
E IN++ R YG + + FGS+ +T LPDGD D+T+ +G Q +
Sbjct: 398 EAINHVARRKGVKYGKVRYYVFGSLATRTVLPDGDNDITIDIDGLLDPVKVGPQGDTQST 457
Query: 121 AR-----FVC------------------KILEN------EDQDSVFEIKDVQYVPAQVKI 151
++ C ++L N E+ DSVF V K+
Sbjct: 458 SQDGGEASGCSSEFSGASPAQAAAIAGGELLSNIADYLRENNDSVFVDAVVVAEVRVAKL 517
Query: 152 VKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAH 211
+ + D++ Q+ G+S + FL +VD IG HL KR+++L+KAWC YE+ +L
Sbjct: 518 I---MDGNSYDVTVGQLGGVSCIRFLHQVDTKIGCGHLLKRTLLLMKAWCCYEAHVLSGQ 574
Query: 212 YGLISTYALEMMVL---------------------------------------------H 226
G +S+YA +M++
Sbjct: 575 GGYMSSYAATVMLIAMINTIEFLEDAESEACTELEEPARTHALEGRLGALNGVSPLQLFA 634
Query: 227 RFLDYYNTFDWDNYCISINGPV 248
RFL Y++ FD++ YC+++ GPV
Sbjct: 635 RFLKYFSCFDFERYCVTLFGPV 656
>gi|261333426|emb|CBH16421.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 1120
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 73/147 (49%), Gaps = 43/147 (29%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K + DI+ Q+ G+S + FL ++D IG +HL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542
Query: 205 SRILGAHYGLISTYALEMMVLH-------------------------------------- 226
+ +L G IS+YA +M++
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLDERQRGEYQHISP 602
Query: 227 -----RFLDYYNTFDWDNYCISINGPV 248
RFL Y++ FD+++YC+++ GPV
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPV 629
>gi|71748824|ref|XP_823467.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70833135|gb|EAN78639.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 1120
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 73/147 (49%), Gaps = 43/147 (29%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K + DI+ Q+ G+S + FL ++D IG +HL KR+++L+KAWC YE
Sbjct: 483 VVAEVRVLKLVMDGSSYDITVGQLGGVSCIRFLHEMDMKIGCNHLLKRTLLLMKAWCCYE 542
Query: 205 SRILGAHYGLISTYALEMMVLH-------------------------------------- 226
+ +L G IS+YA +M++
Sbjct: 543 AHVLSGQGGYISSYAATVMIISMINTVEFLEDVEREERGGEGDGKHLEERQRGEYQHISP 602
Query: 227 -----RFLDYYNTFDWDNYCISINGPV 248
RFL Y++ FD+++YC+++ GPV
Sbjct: 603 LQLFARFLKYFSYFDFESYCLTLFGPV 629
>gi|224135259|ref|XP_002322023.1| predicted protein [Populus trichocarpa]
gi|222869019|gb|EEF06150.1| predicted protein [Populus trichocarpa]
Length = 85
Score = 85.1 bits (209), Expect = 1e-13, Method: Composition-based stats.
Identities = 40/67 (59%), Positives = 50/67 (74%), Gaps = 9/67 (13%)
Query: 186 KDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISIN 245
++HLFKRS+ILIKAWCYYESRILGAH+GLISTYALE +VL+ F + N F
Sbjct: 1 QNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHVFNNKF---------A 51
Query: 246 GPVAISS 252
GP+ +S+
Sbjct: 52 GPLEVST 58
>gi|224127915|ref|XP_002320195.1| predicted protein [Populus trichocarpa]
gi|222860968|gb|EEE98510.1| predicted protein [Populus trichocarpa]
Length = 145
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/112 (44%), Positives = 72/112 (64%), Gaps = 3/112 (2%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+D + W AEER E++ IQP SE+ R V +Y+QRLI + +VF FGSVPLKTY
Sbjct: 31 LDLDRWSKAEERTAELIDCIQPNQPSEELRNAVADYVQRLILKCFPCQVFTFGSVPLKTY 90
Query: 101 LPDGDIDLTVLGHQ-NVEEDLARFVCKIL--ENEDQDSVFEIKDVQYVPAQV 149
LPDGDIDLT N+++ A V +L E +++++ F +K+VQY+ A+
Sbjct: 91 LPDGDIDLTAFSKNPNLKDTWAHQVRDMLENEEKNENAEFRVKEVQYIQAEA 142
>gi|157868001|ref|XP_001682554.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126008|emb|CAJ04245.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 1964
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K +++ DI+ Q G++ + FL ++D +IG H+ KR+++L+KAWC YE
Sbjct: 971 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1030
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ ILG G I +YA +M++
Sbjct: 1031 AHILGGQAGYIGSYAATVMLI 1051
>gi|401419332|ref|XP_003874156.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490390|emb|CBZ25650.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 2020
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K +++ DI+ Q G++ + FL ++D +IG H+ KR+++L+KAWC YE
Sbjct: 1051 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1110
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ ILG G I +YA +M++
Sbjct: 1111 AHILGGQAGYIGSYAATVMLI 1131
>gi|389601018|ref|XP_001564070.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504611|emb|CAM38122.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 2016
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K +++ DI+ Q G++ + FL ++D +IG H+ KR+++L+KAWC YE
Sbjct: 1055 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1114
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ ILG G I +YA +M++
Sbjct: 1115 AHILGGQAGYIGSYAATVMLI 1135
>gi|224114896|ref|XP_002316887.1| predicted protein [Populus trichocarpa]
gi|222859952|gb|EEE97499.1| predicted protein [Populus trichocarpa]
Length = 199
Score = 72.0 bits (175), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/79 (53%), Positives = 56/79 (70%), Gaps = 3/79 (3%)
Query: 104 GDIDLTVLGHQ-NVEEDLARFVCKILENEDQDSV--FEIKDVQYVPAQVKIVKCSVQNIP 160
GDIDLT N+++ A VC +LENE+ + F +K+V+Y+ A+VKI+KC V+NI
Sbjct: 18 GDIDLTAFSENPNLKDTWAPQVCDMLENEENNENAEFGVKEVEYIQAEVKIIKCLVENIV 77
Query: 161 VDISFNQMAGLSALCFLEK 179
VDISFNQ+ GL LCFLEK
Sbjct: 78 VDISFNQLGGLFTLCFLEK 96
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/72 (40%), Positives = 40/72 (55%), Gaps = 8/72 (11%)
Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKG--------N 321
LS FL+ C IY+V +N G F KH N++DPL+ NNNLG SV+K
Sbjct: 106 LSKLFLEACSAIYAVLPAGQDNQGQPFLSKHFNVIDPLRINNNLGHSVNKDIQIELCYVV 165
Query: 322 FHRIRCALSYGA 333
H+++C L+ A
Sbjct: 166 CHQVKCHLNTSA 177
>gi|398013931|ref|XP_003860157.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498376|emb|CBZ33450.1| hypothetical protein, conserved [Leishmania donovani]
Length = 2047
Score = 72.0 bits (175), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/81 (38%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K +++ DI+ Q G++ + FL ++D +IG H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ ILG G I +YA +M++
Sbjct: 1118 AHILGGQAGYIGSYAATVMLI 1138
>gi|339897903|ref|XP_001464956.2| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399300|emb|CAM67197.2| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 2047
Score = 72.0 bits (175), Expect = 1e-09, Method: Composition-based stats.
Identities = 31/81 (38%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K +++ DI+ Q G++ + FL ++D +IG H+ KR+++L+KAWC YE
Sbjct: 1058 VMAEVRVLKLAMEGCNYDITIGQFGGVNCVRFLHEMDAVIGDQHVLKRTLLLLKAWCCYE 1117
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ ILG G I +YA +M++
Sbjct: 1118 AHILGGQAGYIGSYAATVMLI 1138
>gi|452823525|gb|EME30535.1| nucleotidyltransferase [Galdieria sulphuraria]
Length = 1412
Score = 68.9 bits (167), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/274 (21%), Positives = 109/274 (39%), Gaps = 77/274 (28%)
Query: 52 RIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDG--DIDLT 109
R+ L P SE +R+ V + +I G + F +GS KTY D +I
Sbjct: 700 RVNRFLDVCVPTSFSELRREAVFRVVASIIKRSIGAQAFCYGSFATKTYHADSILEIGAF 759
Query: 110 VLGHQNVEED----LARFVCK--ILENEDQDSVFE----------------IKDVQYV-- 145
++G + + L +C+ L ++ S E ++++ Y
Sbjct: 760 LVGKNDTAAEWSAKLMAALCEDATLASDHSSSSLEFSYLSLIQQKHPVPLPVRNISYFRP 819
Query: 146 ----------PAQVKIVKCSVQ--------------------NIPVDISFNQMAGLSALC 175
PA V ++ N+ V ++ N +AG+ C
Sbjct: 820 KPTPSGCQPPPAVTFTVNWPIEDPRSGLVALDTNSTERDIAPNVRVSVTLNHVAGIHTAC 879
Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV----------- 224
LE+ D +G++HLFKRS++L++ W Y ++ ++ + A+E++V
Sbjct: 880 VLEEFDHAMGRNHLFKRSLLLVRTWVDYGVKLT----DILPSRAVEVLVVFVANCFHSSI 935
Query: 225 ------LHRFLDYYNTFDWDNYCISINGPVAISS 252
L+RFL Y+ FDW + + G + +++
Sbjct: 936 ETPFDLLYRFLTYFVHFDWRKFGLCETGIIDLAT 969
>gi|224064842|ref|XP_002301578.1| predicted protein [Populus trichocarpa]
gi|222843304|gb|EEE80851.1| predicted protein [Populus trichocarpa]
Length = 60
Score = 67.8 bits (164), Expect = 2e-08, Method: Composition-based stats.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 137 FEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRL 183
F +K V+Y+ A+VKI+KC V+NI VDISFNQ+ GL LCFLEKV L
Sbjct: 13 FRVKKVEYIQAEVKIIKCLVKNIVVDISFNQLGGLFTLCFLEKVSAL 59
>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
Length = 604
Score = 63.2 bits (152), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 138/320 (43%), Gaps = 55/320 (17%)
Query: 24 TSSSSSYSSPPN-LYPFSIDAEL--------WLLAEERIQEILCTIQPAIVSEKKRKEVI 74
T S++ S+ PN LYP+ D + WL E +++ + I P+ R VI
Sbjct: 133 TGSTNEPSTEPNSLYPWIKDHDHSTQKEIADWLTME--MKDFVSYISPSRAEIVTRNNVI 190
Query: 75 NYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENE 131
N ++R ++ ++ G E FGS YLP DID+ V+ E+ +R + L +
Sbjct: 191 NTLKREVSSFWPGTEAHVFGSCATDLYLPGSDIDMVVISSTGDYENRSRLYQLSSFLRAK 250
Query: 132 DQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHL 189
+ E+ A+V I+K N+P+DISF + GL A ++ R +
Sbjct: 251 NLAKNVEV----IASAKVPIIKFVDPESNLPIDISFERTNGLDA---ARRIRRWLLATPG 303
Query: 190 FKRSVILIKAWCYYESRIL-GAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPV 248
+ V+++K + SR L H G + YA +M H F+ + IS N
Sbjct: 304 LRELVLVVKQ--FLRSRKLNNVHVGGLGGYATIIMCYH-FMQLHPK-------ISTNTMN 353
Query: 249 AISSLPEIVAETVENDG-----DELLLS-------PEFLKKCREIYSVPITALENTGHEF 296
A +L ++ E E G D L++S P +L K R +L + F
Sbjct: 354 APDNLGVLLIEFFELYGRNFSYDNLIISIDSETQLPRYLHKGRH------PSLNTARNTF 407
Query: 297 AIKHLNIVDPLKDNNNLGRS 316
+I I DP +NN+ RS
Sbjct: 408 SIV---IQDPADPSNNITRS 424
>gi|407407321|gb|EKF31173.1| hypothetical protein MOQ_004991 [Trypanosoma cruzi marinkellei]
Length = 1349
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K ++ DI+ Q+ G+ + FL+++D LIG HL KR+++L+KAWC YE
Sbjct: 594 VFAEVRVLKLVMEGSCFDITVGQLGGVECVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 653
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ IL G +S+YA +M++
Sbjct: 654 AHILSGQGGYLSSYAATIMLI 674
>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 876
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 71/140 (50%), Gaps = 17/140 (12%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
I + IQP + R+ +I I++ ++ + +V PFGS K YLP GDIDL ++
Sbjct: 76 IGSFVAYIQPTHEEHQLRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLPAGDIDLVII 135
Query: 112 GHQNVEEDLARFVCK----ILENE-DQDSVFEIKDVQYVPAQVKIVKCSV--QNIPVDIS 164
Q + E +R + K I EN QD V K A+V I+K NI VDIS
Sbjct: 136 STQMMNEQKSRILYKLAPLIRENNIGQDVVVIAK------AKVPIIKFKTIFGNINVDIS 189
Query: 165 FNQMAGLSALCFLEKVDRLI 184
NQ G+ A ++KV+ L+
Sbjct: 190 INQTNGIVA---MKKVNELL 206
>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
Length = 422
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 147/344 (42%), Gaps = 46/344 (13%)
Query: 28 SSYSSPPNLYPFSIDAELWLLA--EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY 85
S Y N P+ + +W + E I++ + +QP + R +V+ I++++ +
Sbjct: 65 SYYIRHRNRVPWKLRRYVWGVVGLHEEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKELW 124
Query: 86 -GIEVFPFGSVPLKTYLPDGDIDLTVLGHQN---VEEDLARFVCKILENEDQDSVFEIKD 141
++ FGS YLPDGDID+ + G + + + V + + E+ +V E
Sbjct: 125 PSAKLETFGSYNTGLYLPDGDIDMVIQGQWEQLPMWQLRNKLVERRIAREENITVIE--- 181
Query: 142 VQYVPAQVKIVKCSVQN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA 199
A V I+K N + VDISFN G A ++K + + K+ V+L+K
Sbjct: 182 ----KAVVPIIKLIESNTLVHVDISFNTSNGREAAALVKK---YMAEYPNLKQLVVLLKY 234
Query: 200 WCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVA 258
R L + G + +YAL ++V++ FL ++ N +L ++
Sbjct: 235 --ILNHRGLNEVWKGGLGSYALTLLVVN-FLQQHS---------RKNAKEDGENLGVLLL 282
Query: 259 ETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKH--LNIVDPLKDNNNLGRS 316
E E G + ++ E +PI L + K+ L I DPL N++GRS
Sbjct: 283 EFFELYGRQFNYETCGIRIRDEAGYIPIDTLRKQMNAHGTKYGPLCIEDPLNTTNDVGRS 342
Query: 317 VSKGNFHRIRCALSYGAQRLGEIL-----------TLPGHCLGM 349
+ + ++ + ++L + L TL GH LG+
Sbjct: 343 TFQ--WKHVQACFDHCCRKLKKALEEQPDPAMRGGTLIGHILGV 384
>gi|71652853|ref|XP_815075.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70880102|gb|EAN93224.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1276
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K ++ DI+ Q+ G+ + FL+++D LIG HL KR+++L+KAWC YE
Sbjct: 565 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 624
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ IL G +S+YA +M++
Sbjct: 625 AHILSGQGGYLSSYAATIMLI 645
>gi|71408844|ref|XP_806800.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870651|gb|EAN84949.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1239
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K ++ DI+ Q+ G+ + FL+++D LIG HL KR+++L+KAWC YE
Sbjct: 568 VVAEVRVLKLVMEGGCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 627
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ IL G +S+YA +M++
Sbjct: 628 AHILSGQGGYLSSYAATIMLI 648
>gi|407846652|gb|EKG02680.1| hypothetical protein TCSYLVIO_006286 [Trypanosoma cruzi]
Length = 893
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 55/81 (67%)
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
V A+V+++K ++ DI+ Q+ G+ + FL+++D LIG HL KR+++L+KAWC YE
Sbjct: 222 VVAEVRVLKLVMEGSCFDITVGQLGGVVCVRFLQEMDMLIGCQHLLKRTLLLLKAWCCYE 281
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ IL G +S+YA +M++
Sbjct: 282 AHILSGQGGYLSSYAATIMLI 302
>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
Length = 360
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 84/174 (48%), Gaps = 18/174 (10%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLG---HQN 115
I P++ K R+ I I+R I + EVF FGS + YLPDGDIDL V+ +Q
Sbjct: 86 ISPSLTEHKTREYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGDIDLVVMRKSVNQY 145
Query: 116 VEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAGLSA 173
++ + + +L + ++ A+V I+K S P+DIS NQ G+ A
Sbjct: 146 NKQSMLHTMASMLRQANLAQSIQV----ISKARVPIIKFTSSFGGYPIDISLNQTNGVDA 201
Query: 174 LCFL-EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
+ E +DR L +L+K C+ R + Y G +S+Y++ +V+
Sbjct: 202 GRMVNEILDRYPAARPL----SMLLK--CFLSQRSMNEVYTGGVSSYSVICLVV 249
>gi|354544020|emb|CCE40742.1| hypothetical protein CPAR2_107770 [Candida parapsilosis]
Length = 608
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 122/286 (42%), Gaps = 40/286 (13%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E +++ + I P+ R VIN +++ I+ ++ G E FGS YLP
Sbjct: 160 WLTME--MKDFVRYISPSKAEIITRNNVINTLKKEISSFWPGTEAHVFGSCATDLYLPGS 217
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID+ V+ E+ +R + L ++ E+ VP +K V N+PVD
Sbjct: 218 DIDMVVISSTGDYENRSRLYQLSSFLRVKNLAKNVEVIANAKVPI-IKFVDPD-SNLPVD 275
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
ISF + GL A + K L+ L R ++L+ ++ H G + YA +
Sbjct: 276 ISFERTNGLDAARRIRKW--LLATPGL--RELVLVVKQFLRSRKLNNVHVGGLGGYATII 331
Query: 223 MVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDG-----DELLLS------ 271
M H F+ + IS N A +L ++ E E G D L++S
Sbjct: 332 MCYH-FMQLHPK-------ISTNTMDAPDNLGVLLIEFFELYGRNFSYDNLIISIDPETQ 383
Query: 272 -PEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
P +L K R PI L + F+I + DP +NN+ RS
Sbjct: 384 LPRYLLKGRH----PI--LSTARNTFSIV---VQDPADSSNNITRS 420
>gi|449533401|ref|XP_004173664.1| PREDICTED: uncharacterized LOC101209112 [Cucumis sativus]
Length = 831
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 33/51 (64%)
Query: 320 GNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNGKGLRPDV 370
GNF RIR A ++GA+RL + P + L +FF+NT ER+G G RPDV
Sbjct: 5 GNFFRIRSAFAFGAKRLARLFECPREDILAELNQFFLNTWERHGSGQRPDV 55
>gi|325181595|emb|CCA16045.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
gi|325191995|emb|CCA26462.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
Length = 494
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 94/205 (45%), Gaps = 19/205 (9%)
Query: 45 LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
L+L E I + + I P + R+ +I ++ L++ + V FGS + +LP
Sbjct: 122 LYLCLHEEILDFVHFISPHDEELQARENLIAQMKNLVSNLWPRAAVETFGSHETQMFLPQ 181
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPV 161
DIDL + G +E L + LE D S E+ D A++ IVK +N I V
Sbjct: 182 SDIDLVIFGAPTGKESLFVLAAE-LEARDMVSYLEVID----KARIPIVKFVDKNSAIQV 236
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
DISFN +GL+ +++ R+ R ++L+ + + + G I ++ L+
Sbjct: 237 DISFNISSGLATADLIKQYMRIFPS----FRPLVLVLKYFLAQRELNETFQGGIGSFLLQ 292
Query: 222 MMVLHRFLDY-------YNTFDWDN 239
+MV+ Y Y+ F ++N
Sbjct: 293 LMVVSFLQQYRRQLGTLYDDFRYNN 317
>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
commune H4-8]
Length = 671
Score = 56.6 bits (135), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 84/184 (45%), Gaps = 9/184 (4%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
L I+ I P ++ R ++ I R+I + EV PFGS K YLP GDI
Sbjct: 158 LLHTEIKAFDSWISPTPAEDEVRSMIVLLIARIIQDKFPDAEVRPFGSYGTKLYLPHGDI 217
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK----CSVQNIPVD 162
DL V + +E++ + V + L + + + VQ + A+V I+K +D
Sbjct: 218 DLVVQSN-TLEQNNKKTVLQRLADLIRSARLSSGKVQVIGARVPIIKFITAAEYGRFQID 276
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
IS NQ +GL + + R + + V+++K Y R + Y G + +Y++
Sbjct: 277 ISVNQFSGLVSSDIINGFQRGMQCPIAIRSLVLILK--LYLSQRGMNEVYTGGLGSYSIV 334
Query: 222 MMVL 225
+VL
Sbjct: 335 CLVL 338
>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Metaseiulus occidentalis]
Length = 491
Score = 55.8 bits (133), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 94/203 (46%), Gaps = 17/203 (8%)
Query: 27 SSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG 86
+S + SP Y ++ L EE I + QP + +R++VI ++ I +
Sbjct: 69 ASPWRSPNRSYSLGLEG----LTEE-IHDFFMYAQPNAADQSRREQVIEKVRAAIREKWP 123
Query: 87 ---IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDV 142
+EVF GS YLP GDID+ + G+ + L +++E + + + F++ D
Sbjct: 124 DCVVEVF--GSYKTGLYLPTGDIDMVIQGNWEIIPPLFDLERQLIEKKVGEKNTFKVLDK 181
Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
VP +K + I VD+SFNQ A F+++ R + + ++K +
Sbjct: 182 ASVPL-IKFKDADTE-IRVDLSFNQANCTEAAAFVKQCCRTFPP---LAKLIFVLKQYLS 236
Query: 203 YESRILGAHYGLISTYALEMMVL 225
+ +G IS+Y+L +M+L
Sbjct: 237 LHG-LNEVFHGGISSYSLTLMIL 258
>gi|313226931|emb|CBY22076.1| unnamed protein product [Oikopleura dioica]
Length = 424
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 80/346 (23%), Positives = 147/346 (42%), Gaps = 48/346 (13%)
Query: 28 SSYSSPPNLYPFSIDAELWLLA--EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY 85
S Y N P+ + +W + E I++ + +QP + R +V+ I++++ +
Sbjct: 65 SYYIRHRNRVPWKLRRYVWGVVGLHEEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKELW 124
Query: 86 ---GIEVFPFGSVPLKTYLPDGDIDLTVLGHQN---VEEDLARFVCKILENEDQDSVFEI 139
+ + F ++ YLPDGDID+ + G + + + V + + E+ +V E
Sbjct: 125 LARSLALRIFRTLNTGLYLPDGDIDMVIQGQWEQLPMWQLRNKLVERRIAREENITVIE- 183
Query: 140 KDVQYVPAQVKIVKCSVQN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILI 197
A V I+K N + VDISFN G A ++K + + K+ V+L+
Sbjct: 184 ------KAVVPIIKLIESNTLVHVDISFNTSNGREAAALVKK---YMAEYPNLKQLVVLL 234
Query: 198 KAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
K R L + G + +YAL ++V++ FL ++ N +L +
Sbjct: 235 KY--ILNHRGLNEVWKGGLGSYALTLLVVN-FLQQHS---------RKNAKEDGENLGVL 282
Query: 257 VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKH--LNIVDPLKDNNNLG 314
+ E E G + ++ E +PI L + K+ L I DPL N++G
Sbjct: 283 LLEFFELYGRQFNYETCGIRIRDEAGYIPIDTLRKQMNAHGTKYGPLCIEDPLNTTNDVG 342
Query: 315 RSVSKGNFHRIRCALSYGAQRLGEIL-----------TLPGHCLGM 349
RS + + ++ + ++L + L TL GH LG+
Sbjct: 343 RSTFQ--WKHVQACFDHCCRKLKKALEEQPDPAMRGGTLIGHILGV 386
>gi|343427054|emb|CBQ70582.1| related to TRF4-topoisomerase I-related protein [Sporisorium
reilianum SRZ2]
Length = 697
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 81/199 (40%), Gaps = 33/199 (16%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ--NV 116
+ P + + R V+ I R I + EV PFGS K YLP GD+DL V+ N+
Sbjct: 112 MAPTVAEHETRCMVVELISRAIKSQFRDAEVHPFGSQETKLYLPQGDLDLVVVSQSMANL 171
Query: 117 EEDLA-RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLS 172
A R + L + + DVQ + A+V I+K + + VDIS N GL+
Sbjct: 172 RTQSALRTMAACLRRHNLAT-----DVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLT 226
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAH----YGLISTYALEMMVLHRF 228
++ ++ W + L H G I ++L F
Sbjct: 227 TASYVNS----------------WLRKWPHIRISFLQLHPKVQRGEIEADRSLGVLLLEF 270
Query: 229 LDYY-NTFDWDNYCISING 246
L+ Y F +DN ISI G
Sbjct: 271 LELYGKNFGYDNCGISIRG 289
>gi|149244754|ref|XP_001526920.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449314|gb|EDK43570.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 664
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 123/282 (43%), Gaps = 32/282 (11%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E +++ + I P+ R +V+N ++ I ++ G E FGS YLP
Sbjct: 222 WLTME--MKDFVNYISPSSEEIVIRNKVVNTLKTQIALFWPGTEAHVFGSSATDLYLPGS 279
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID+ VL E+ +R + L+ + + E+ VP +K V NI VD
Sbjct: 280 DIDMVVLSDTGDYENRSRLYQLSSFLKAKKLATNVEVIASAKVPI-IKFVDPD-SNIHVD 337
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL-GAHYGLISTYALE 221
ISF + GL A ++ R + + V+++K + SR L H G + YA
Sbjct: 338 ISFERKNGLDA---ARRIRRWLASTPGLRELVLVVKQ--FLRSRKLNNVHVGGLGGYAT- 391
Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLS-------PEF 274
+++ + FL + ++ N V + E+ D L++S P +
Sbjct: 392 IIICYHFLRLHPKLSTESMSSLDNLGVLLIEFFELYGRNFSYDN--LIISLDPKTEEPRY 449
Query: 275 LKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
L K R YSV TA NT FAI + DP NNN+ RS
Sbjct: 450 LPKGR--YSVLNTA-RNT---FAIV---VQDPADPNNNITRS 482
>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
98AG31]
Length = 987
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 73/146 (50%), Gaps = 19/146 (13%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
L AE I + I+P ++ R +I I++ + + +V PFGS K YLP GD
Sbjct: 234 LTAE--IGSFVAYIRPTREEDELRLMIIEMIRKAVTMQWPDADVVPFGSFGTKLYLPGGD 291
Query: 106 IDLTVLGHQNVEEDLARFVCKIL-----ENEDQDSVFEIKDVQYVPAQVKIVKCSV--QN 158
IDL +L + +++ ++ + ++ +N QD V K A+V I+K N
Sbjct: 292 IDLVILSTRMMKDAKSKILYRLAPLLREQNIGQDVVVIAK------AKVPIIKFKTIFGN 345
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLI 184
VDIS NQ GL A LEKV+ L+
Sbjct: 346 FQVDISINQSNGLVA---LEKVNELL 368
>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
Length = 527
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 14/159 (8%)
Query: 73 VINYIQR-LINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
VI+ +QR L + + V+ FGS + YLP GDIDL VL NV D+ R + + E
Sbjct: 2 VISLLQRALCSKWPDARVYSFGSQDTQLYLPQGDIDLVVLS--NVMNDMPREIT-LSEMA 58
Query: 132 DQDSVFEIK-DVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKD 187
+++ VQ + A+V I+K C VDIS NQ GL A F V+ + K
Sbjct: 59 ACLRSYQLAIHVQVLARAKVPIIKFVCPYGQFNVDISINQANGLQASKF---VNGWLKKQ 115
Query: 188 HLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
+ V++IK + + R L Y G + +Y++ +MVL
Sbjct: 116 PAIRPLVMVIKQ--FLQQRALSEVYTGGLGSYSVTLMVL 152
>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
Length = 730
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 91/219 (41%), Gaps = 41/219 (18%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVE- 117
+ P + R VI I R I + EV+PFGS K YLP GD+DL V+ +
Sbjct: 110 MTPTAAEHETRCMVIELISRAIKSQFRDAEVYPFGSQETKLYLPQGDLDLVVVSNSMANL 169
Query: 118 --EDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLS 172
+ R + L + + DVQ + A+V I+K + + VDIS N GL+
Sbjct: 170 RVQSALRTMAACLRRHNLAT-----DVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLT 224
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV-----LH- 226
++ R H+ R +IL+ + + + G + +Y++ +MV LH
Sbjct: 225 TASYVNSWLRKW--PHI--RPLILVVKYLLMQRGMSEVFSGGLGSYSVIIMVISFLQLHP 280
Query: 227 ------------------RFLDYY-NTFDWDNYCISING 246
FL+ Y F +DN ISI G
Sbjct: 281 KVQRGEIDADRSLGVLLLEFLELYGKNFGYDNCGISIRG 319
>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium
dendrobatidis JAM81]
Length = 752
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 69/143 (48%), Gaps = 5/143 (3%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVP 96
P+S D L +L +E I + + ++P RK I +++++ + EV FGS
Sbjct: 169 PYSNDT-LTMLNQE-IDDYVDYVRPTEAEHSLRKLTIARVRKIVKQIWADAEVHVFGSFQ 226
Query: 97 LKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV 156
K YLP D+D+ V+G V R + K E D S E+ + VP + K +
Sbjct: 227 TKLYLPSSDVDIVVVGDSCVLPKCLRQLAKAFEKADTLSRMEVIEKTKVPIIKGVDKLT- 285
Query: 157 QNIPVDISFNQMAGLSALCFLEK 179
+ +DISFN + G+ + +++
Sbjct: 286 -HFSLDISFNMVNGIKSANIVKR 307
>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Oreochromis niloticus]
Length = 672
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 116/276 (42%), Gaps = 34/276 (12%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + I P EK R EV++ I+ +I+ + EV FGS YLP DIDL
Sbjct: 182 EEIIDFYNYISPRPEEEKMRLEVVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDIDLV 241
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + K + +E+ V + A V I+K S + V
Sbjct: 242 VFGKWESLPLWTLEEALRK---KNVADENSIKVLD-------KATVPIIKLTDSYTEVKV 291
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
DISFN M+G+ A +++ K + V+++K + R L + G I +Y+L
Sbjct: 292 DISFNVMSGVKAARLIKEFKE---KYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 346
Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
+M + +Y D + ++IN V + E+ L K
Sbjct: 347 FLMAVSFLQLHYRE---DVFGLNINIGVLLIEFFELYGRNFN------YLKTGIRIKDGG 397
Query: 281 IYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
Y +N + L I DPL+ +N++GRS
Sbjct: 398 CYVAKDEVQKNMLDGYRPSMLYIEDPLQPDNDVGRS 433
>gi|346972692|gb|EGY16144.1| Poly(A) RNA polymerase cid14 [Verticillium dahliae VdLs.17]
Length = 726
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 98/226 (43%), Gaps = 26/226 (11%)
Query: 42 DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQ---RLINGYYGIEVFPFGSVPLK 98
D +WL E I + ++P ++ R E+I I+ R Y G EV PFGS
Sbjct: 405 DMSVWLHKE--IVDFYEHVRPRAFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSG 462
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVP-AQVKIV 152
YLP D+D+ + + + + F + K Q+ + + V+ + A+V +V
Sbjct: 463 LYLPTADMDIVICSKEWLSGRMTAFPGGSSLYKFRAFLTQNRLADPSSVEVIAKARVPLV 522
Query: 153 KC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K +V + VDISF++M G +A+ FL ++ L V +IK +
Sbjct: 523 KYIDAVTGLRVDISFDRMDGPAAIKTFLNWKEQYPALPIL----VTIIKHFLAMRGLNEP 578
Query: 210 AHYG--------LISTYALEMMVLHRFLDYYNTFDWDNYCISINGP 247
+ G L+ + L M++ F Y N FD+ I IN P
Sbjct: 579 VNGGIGSFSSKNLVPEHHLGEMLMEFFDLYGNRFDYKTTAIRINPP 624
>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
Length = 1487
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/183 (26%), Positives = 85/183 (46%), Gaps = 9/183 (4%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYLPDGDIDLTVL 111
++ + I P + R V+ I R + + EV PFGS K YLP GDIDL V+
Sbjct: 160 VEAFVNYISPTPEENEVRSLVVALITRAVTQAFPDAEVHPFGSYDTKLYLPVGDIDL-VV 218
Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIPVDISFNQMA 169
Q++ V + N + + + A+V IVK NIPVDIS NQ
Sbjct: 219 HSQSMAYSKKEAVLHSIANTMKRAGITDRVRIISKAKVPIVKFVTLHGNIPVDISINQGN 278
Query: 170 GLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFL 229
G++A ++ + + + V+++K++ S + + G + +Y++ +V+ FL
Sbjct: 279 GVTAGTMIK---HFLAELPALRSLVLIVKSFLSQRS-MNEVYTGGLGSYSIVCLVIS-FL 333
Query: 230 DYY 232
+
Sbjct: 334 QMH 336
>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
Length = 897
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 136/300 (45%), Gaps = 24/300 (8%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDID 107
E I ++ I+P + + R +V ++ ++ + I++ FGS+ +LP DID
Sbjct: 84 EEIVDLYHWIKPNEIEVRLRTKVYEKVRDSVSQRWQHKPIKISMFGSLRTNLFLPTSDID 143
Query: 108 LTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISF 165
+ V V D + LEN++ + +VP VK+V + + +DISF
Sbjct: 144 VLVECDDWVGTPGDWLGETARGLENDNIAESVTVFGGAFVPI-VKMVDRDTR-LSIDISF 201
Query: 166 NQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMV 224
N + G+ A ++ KV L + V+L+K + +Y R L + G +S+Y L +++
Sbjct: 202 NTVQGVRAASYIAKVKEEFP---LIEPLVLLLKQFLHY--RNLNQTFTGGLSSYGLVLLL 256
Query: 225 LHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYS 283
++ F Y N Y +N + E+ ++E + +E+ +SP +C Y
Sbjct: 257 VNFFQLYALNMRHRTIYDSGVNLGHLLLRFLEVY--SMEFNYEEIGISP---GQC--CYI 309
Query: 284 VPITALENTGHEFAI-KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTL 342
A GH+ A +L + DPL N++GRS NF I A Q L +TL
Sbjct: 310 SKSAAGARYGHKRAQPGNLALEDPLLTANDVGRSTY--NFSSIANAFGQAFQILAVAVTL 367
>gi|294654384|ref|XP_456434.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
gi|199428840|emb|CAG84386.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
Length = 600
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/232 (23%), Positives = 101/232 (43%), Gaps = 25/232 (10%)
Query: 9 EMEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAEL--------WLLAEERIQEILCTI 60
+ +D ++ + S + + + ++ +LYP+ D + W E I++ + I
Sbjct: 126 DYDDYSDDGVLSDDESGTIHTQTTTKSLYPWIKDHDHSKQKEIADWFTME--IKDFVNYI 183
Query: 61 QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
P+ R V+N +++ I ++ E+ FGS YLP DID+ V+ E
Sbjct: 184 SPSEAEIMTRNRVVNQLKQQIGQFWPATELHVFGSCATDLYLPGSDIDMVVVSETGDYEH 243
Query: 120 LARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALC 175
+R + L N+ E+ A+V I+K NI +DISF + G+ A
Sbjct: 244 RSRLYQLSSFLRNKKLAKNIEV----IAKAKVPIIKFVDPTSNIHIDISFERTNGIDA-- 297
Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL-GAHYGLISTYALEMMVLH 226
+K+ R + + V+++K + SR L H G + Y+ ++ H
Sbjct: 298 -AKKIRRWLSSTPGLRELVLIVKQ--FLRSRKLNNVHVGGLGGYSTIILCYH 346
>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 781
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 11/138 (7%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E I + I+P ++ R+EV++ I R + + V+PFGS P YLP G
Sbjct: 448 WLHKE--IIDFYEYIKPRAFEKRIRQEVLDEINRFVRSTFPDAGVYPFGSFPSGLYLPTG 505
Query: 105 DIDLTVLGHQ-----NVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKC--SV 156
D+D+ + Q + D R + ++ + Q + +V+ + A+V +VK S
Sbjct: 506 DMDMVLCSDQYKRNYRAKYDTRRTMYRLSDALKQQKLAFQNEVEIIAFAKVPLVKWVDSR 565
Query: 157 QNIPVDISFNQMAGLSAL 174
+ +D+SF GL A+
Sbjct: 566 TGLKIDVSFENDTGLQAI 583
>gi|116201727|ref|XP_001226675.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
gi|88177266|gb|EAQ84734.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
Length = 780
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 103/241 (42%), Gaps = 32/241 (13%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLK 98
S + +WL E + + I+P E+ R E++ +++ + E++PFGS P
Sbjct: 405 SANMAVWLHKE--VVDFYEYIKPRDFEERLRGELVRDLKQFCRKVFRDAEMYPFGSFPSG 462
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIK-----DVQYVP-AQVKIV 152
YLP GD+D+ + ++ + ++ K + + K +++ +P A+V +V
Sbjct: 463 LYLPTGDMDMAFMSDGYMKGGVPKYSTKNTLYRLRGQLMNHKVAWEDEIEVIPSAKVPLV 522
Query: 153 KC--SVQNIPVDISFNQMAGLSALC-FLEKVDRLIGKDHLFK--RSVILIKAWCYYESRI 207
K + VD+SF +G++A+ F DR G L + +L++ +
Sbjct: 523 KFIEHKTGLKVDVSFENNSGVTAIATFKAWRDRYPGMPALVTLVKHFLLMRGLNEPVNGG 582
Query: 208 LGAHY------------------GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVA 249
+G L + + L ++LH F Y N F++ IS+N P
Sbjct: 583 IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISLNPPHW 642
Query: 250 I 250
I
Sbjct: 643 I 643
>gi|348687890|gb|EGZ27704.1| hypothetical protein PHYSODRAFT_343641 [Phytophthora sojae]
Length = 501
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/334 (22%), Positives = 127/334 (38%), Gaps = 60/334 (17%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKT 99
++ ++ E I + + I P R E+I ++ ++ G + V FGS +
Sbjct: 124 FNSNVYACLHEEIMDFVSFISPTEQELSSRAELIEEMREIVKGLWPEATVETFGSHYTQM 183
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNI 159
+LP DID+ + G + L + + + LE ++ S E+ D +P + K S +I
Sbjct: 184 FLPQSDIDMVLFGVPEGKAPLFK-LAQCLEEKELVSYLEVIDKARIPIVKMVHKAS--DI 240
Query: 160 PVDISFNQMAGLSALCFLEKVDR---------LIGKDHLFKRSV--------------IL 196
VD+SFN GL+ ++ R L+ K + +R + ++
Sbjct: 241 HVDVSFNVAGGLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETYTGGVGSFLLQMM 300
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
+ ++ + R LGA + L +++ F Y F++ + IS
Sbjct: 301 VVSFLQHHGRALGAEHDDPKFNNLGQLLMGFFTLYGRDFNYTDLAIS------------- 347
Query: 257 VAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKD--NNNLG 314
V N G F K R Y + G F I N +P D N+
Sbjct: 348 ----VRNGGS------YFYKDDRRWY--------DDGRPFLISMENPNEPSLDIGKNSYE 389
Query: 315 RSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLG 348
K +F R L + R G+ LPG LG
Sbjct: 390 MRTVKRSFDYARLVLQHEINRHGQFNALPGSILG 423
>gi|348512463|ref|XP_003443762.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
Length = 789
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 113/278 (40%), Gaps = 36/278 (12%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P E R++V+N I+ +I + ++V FGS YLP DIDL
Sbjct: 232 HEEILDFFSFMSPKPEEESMRRDVVNRIEGIIKDLWPTVQVEIFGSFSTGLYLPTSDIDL 291
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
V G+ Q +E+ L + N ++ D VP +K+ C + + VD
Sbjct: 292 VVFGNWDHPPLQELEQALKK------HNVSGSHPIKLLDKATVPI-IKLTDCETR-VKVD 343
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV----ILIKAWCYYESRILGAHYGLISTY 218
ISFN + A F+ K +L K V I + + + G IS+Y
Sbjct: 344 ISFNIETAVKAAQFI--------KSYLKKYPVLPPLIFVLKQFLLQRELNEVFTGGISSY 395
Query: 219 ALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKC 278
+L +M + FL + D +IN + + E+ + + + C
Sbjct: 396 SLILMAI-SFLQLHPRIDTSR--PNINLGILLIEFFELYGRDFDYIKTAIRVKNGGAYLC 452
Query: 279 REIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
+E L+ G+ L I DP++ N++GRS
Sbjct: 453 KE------EMLKEMGNGNRPSMLCIEDPVQPGNDVGRS 484
>gi|302405651|ref|XP_003000662.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
gi|261360619|gb|EEY23047.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
Length = 723
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 26/226 (11%)
Query: 42 DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQ---RLINGYYGIEVFPFGSVPLK 98
D +WL E I + ++P ++ R E+I I+ R Y G EV PFGS
Sbjct: 402 DMSVWLHKE--IVDFYEHVRPRDFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSG 459
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVP-AQVKIV 152
YLP D+D+ + + + + F + K Q+ + + V+ + A+V +V
Sbjct: 460 LYLPTADMDIVICSKEWLSGRMTAFPGGSSLYKFRGFLTQNRLADPSSVEVIAKARVPLV 519
Query: 153 KC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K +V + VDISF++M G +A+ FL+ ++ L V +IK +
Sbjct: 520 KYIDAVTGLRVDISFDRMDGPAAIKTFLDWKEQYPALPIL----VTIIKHFLAMRGLNEP 575
Query: 210 AHYG--------LISTYALEMMVLHRFLDYYNTFDWDNYCISINGP 247
+ G L+ + L M++ F Y N FD+ I IN P
Sbjct: 576 VNGGIGSFSSKNLVPEHHLGEMLMEFFDLYGNRFDYKTTAIRINPP 621
>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
Length = 689
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 61/219 (27%), Positives = 90/219 (41%), Gaps = 41/219 (18%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQ--NV 116
+ P + R VI I R I + EV PFGS K YLP GD+DL V+ N+
Sbjct: 109 MAPTAAEHETRCMVIELISRAIKSQFRDAEVHPFGSQETKLYLPQGDLDLVVVSRSMANL 168
Query: 117 EEDLA-RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLS 172
A R + L + + DVQ + A+V I+K + + VDIS N GL+
Sbjct: 169 RTQSALRTMAACLRRHNLAT-----DVQVIAKAKVPIIKFVTTYARLKVDISLNHTNGLT 223
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV-----LH- 226
F+ R H+ R +I++ + + G + +Y++ +MV LH
Sbjct: 224 TASFVNSWLRKW--PHI--RPLIIVVKHLLMQRGMSEVFSGGLGSYSIIIMVISFLQLHP 279
Query: 227 ------------------RFLDYY-NTFDWDNYCISING 246
FL+ Y F +DN ISI G
Sbjct: 280 KVQRGEIEPGRSLGVLLLEFLELYGKNFGYDNCGISIRG 318
>gi|357491469|ref|XP_003616022.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
gi|355517357|gb|AES98980.1| hypothetical protein MTR_5g075260 [Medicago truncatula]
Length = 490
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 31/45 (68%)
Query: 319 KGNFHRIRCALSYGAQRLGEILTLPGHCLGMGLEKFFINTLERNG 363
GNF+RIR A YGA++LG IL LP + L +FF NTL+R+G
Sbjct: 10 NGNFYRIRSAFKYGARKLGWILMLPEDRIADELNRFFANTLDRHG 54
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 10/131 (7%)
Query: 379 SGKSEAA----DLSGDYDSYYNGLLYGQCYHDYTLPVTAQFSPPSSPSQIPSRSAWDVLC 434
SG +EA+ DL+GDYDS+ L YG + Y + P + +R++W+ +
Sbjct: 214 SGGTEASKSLLDLAGDYDSHIANLHYGHMCNGYPV-SPVVVPSPPRSPKFHNRNSWETVR 272
Query: 435 QYVQGKGNLVYQWGTE-VFVPRLPFCHPYASQVRASTFSTDEGEKSRGTGTYIPERTRPP 493
Q +Q ++ Q + V P HP + ++F +E K RGTG Y P T
Sbjct: 273 QCLQMNHSIHPQTNSNGVVGPLYLVNHP---TIPMASFGAEEKRKPRGTGAYFPNMT-SR 328
Query: 494 PIEINRPVRAR 504
P NRP+ R
Sbjct: 329 PFRDNRPMPGR 339
>gi|345313578|ref|XP_001519035.2| PREDICTED: DNA polymerase sigma-like, partial [Ornithorhynchus
anatinus]
Length = 641
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 83/176 (47%), Gaps = 24/176 (13%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ +I + +V FGS YLP DIDL V G R ++L
Sbjct: 111 RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG------KWERPPLQLL 164
Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
E + SV E K ++ + A V I+K + Q ++ VDISFN G+ A F+
Sbjct: 165 EQALRKHSVAEPKTIKVLDKATVPIIKLTDQETDVKVDISFNMETGVKAARFI------- 217
Query: 185 GKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
KD++ K S +IL+ + + G IS+Y+L +M + FL + D
Sbjct: 218 -KDYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 271
>gi|2642156|gb|AAB87123.1| hypothetical protein [Arabidopsis thaliana]
Length = 474
Score = 52.4 bits (124), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 73/306 (23%), Positives = 127/306 (41%), Gaps = 44/306 (14%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
+QEIL I+P R VI+ ++ ++ G V PFGS + GD+D++
Sbjct: 11 LQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGDLDIS 70
Query: 110 V----------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQ 157
V G + + L + + + S K + A+V I+K Q
Sbjct: 71 VDLFSGSSILFTGKKQKQTLLGHLLRAL-----RASGLWYKLQFVIHARVPILKVVSGHQ 125
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
I DIS + + GL FL + + G+ F+ V+L+K W + I + G ++
Sbjct: 126 RISCDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKTGTFNS 181
Query: 218 YALEMMVLHRFLDYYNTFDWDNYCI-SINGPVAI----SSLPEI--VAETVENDGDELLL 270
Y+L ++V+ F C+ +I P+ + S++ ++ V +T E ++
Sbjct: 182 YSLSLLVIFHF----------QTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTA 231
Query: 271 SPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALS 330
+ K SV ++L F K + DP + N RSVS+ N RI
Sbjct: 232 ANIARFKSERAKSVNRSSLSELLVSFFAK---VEDPFEQPVNAARSVSRRNLDRIAQVFQ 288
Query: 331 YGAQRL 336
++RL
Sbjct: 289 ITSRRL 294
>gi|440291374|gb|ELP84643.1| PAP-associated domain containing protein, putative [Entamoeba
invadens IP1]
Length = 475
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 97/212 (45%), Gaps = 35/212 (16%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI--NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVE 117
++P + + R+ V+ R+I + I+V PFGS K +LP DID TV+
Sbjct: 27 VEPNPIEYEIRRYVLEKYTRVIENDKKSEIKVVPFGSTQSKLFLPSSDIDFTVVTKGGKT 86
Query: 118 EDLARFVCKILENEDQDSVFEIKDVQY-VPAQVKIVKCSVQN--IPVDISFNQMAGLSAL 174
+ V +IL S++ ++D + + A V ++K + + I +DIS N +G+ +
Sbjct: 87 NMVLNSVARIL------SLYTMEDEKRALRATVPVIKLTDRETGIVLDISHNNESGVDTV 140
Query: 175 CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV---------- 224
++EK + + L + + +IK L A GL TY+L MMV
Sbjct: 141 RWMEKE---MKSNALIRPLLFIIKTVLSSYELNLPALGGL-GTYSLFMMVFCFFREKGSD 196
Query: 225 ---------LHRFLDYYNT-FDWDNYCISING 246
L RFL YY T FD + +S+ G
Sbjct: 197 LKDKRGGAILLRFLKYYATEFDSRKFGLSVTG 228
>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
Length = 759
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 98/245 (40%), Gaps = 40/245 (16%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLK 98
S + +WL E + + I+P E+ R E++ +++ + EV+PFGS P
Sbjct: 386 SANMAVWLHKE--VVDFYEYIKPRDFEERLRGELVEHLKTFCRKTFKDAEVYPFGSFPSG 443
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKIL------ENEDQDSVFEIKDVQYVPAQVKIV 152
YLP D+DL + + + R+ K + ++ +E + V A+V +V
Sbjct: 444 LYLPTADMDLAFISDSYAKGGVPRYGTKSFLYRFRSQLKNHRIAWEDEIELIVGAKVPLV 503
Query: 153 KC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
K + VDISF GL+A+ F ++ G L V LIK +
Sbjct: 504 KFIEHRTGLKVDISFENRTGLTAIETFKAWREQYPGMPAL----VTLIKHFLLMRGLNEP 559
Query: 210 AHYG------------------------LISTYALEMMVLHRFLDYYNTFDWDNYCISIN 245
+ G L + + L ++LH F Y N F++ IS+N
Sbjct: 560 VNGGIGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAISMN 619
Query: 246 GPVAI 250
P I
Sbjct: 620 PPRYI 624
>gi|449707156|gb|EMD46861.1| PAPassociated domain containing protein [Entamoeba histolytica
KU27]
Length = 400
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 22/191 (11%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
S + EL L E +Q + I+P K R+E++ +++ G V FGS K
Sbjct: 9 SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNVMAFGSTQSKL 65
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILEN---EDQDSVFEIKDVQYVPAQVKIVKCSV 156
+LP DID +VL ++ + V IL + EDQ F+ A + ++K +
Sbjct: 66 FLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRNFK--------ASIPVLKLTD 117
Query: 157 QN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYG 213
+ I +DIS N +G + F+E+V I KD ++ V+LIK+ C Y+ G
Sbjct: 118 KKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDFH--QPANG 172
Query: 214 LISTYALEMMV 224
+ TY++ +MV
Sbjct: 173 GLGTYSVFVMV 183
>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
CCMP526]
Length = 419
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 70/144 (48%), Gaps = 14/144 (9%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLAR----FVCKILENEDQDSVFEIKDV 142
+V FGS K +LPD DID+ VL ++ R + + + E+ S EI
Sbjct: 106 FDVHVFGSEATKVFLPDSDIDMVVLPPTDLPLHQIRKNLFTLAEAFKQEESVSGMEIISQ 165
Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGL-SALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
VP IVK QN+ VDISF+ +GL SA LEK++ + R +IL+ +
Sbjct: 166 ARVP----IVKLRFQNLQVDISFSSDSGLKSARYMLEKMEAMPP-----LRPLILVLKYF 216
Query: 202 YYESRILGAHYGLISTYALEMMVL 225
+ + + G ++ L++MV+
Sbjct: 217 LAQRELNQTYMGGCGSFLLQLMVI 240
>gi|67465021|ref|XP_648697.1| topoisomerase [Entamoeba histolytica HM-1:IMSS]
gi|56464936|gb|EAL43308.1| topoisomerase, putative [Entamoeba histolytica HM-1:IMSS]
Length = 400
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 92/191 (48%), Gaps = 22/191 (11%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
S + EL L E +Q + I+P K R+E++ +++ G V FGS K
Sbjct: 9 SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNVMAFGSTQSKL 65
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILEN---EDQDSVFEIKDVQYVPAQVKIVKCSV 156
+LP DID +VL ++ + V IL + EDQ F+ A + ++K +
Sbjct: 66 FLPTSDIDFSVLTNEYNTRKVLNSVSSILSSYVLEDQKRNFK--------ASIPVLKLTD 117
Query: 157 QN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYG 213
+ I +DIS N +G + F+E+V I KD ++ V+LIK+ C Y+ G
Sbjct: 118 KKTLIVLDISHNNTSGTKTVNFIEEV---IKKDDRIRKLVLLIKSILCCYDFH--QPANG 172
Query: 214 LISTYALEMMV 224
+ TY++ +MV
Sbjct: 173 GLGTYSVFVMV 183
>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
Length = 653
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 66/135 (48%), Gaps = 15/135 (11%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I++ I P E+ R EV+ IQR+I + EV FGS YLP DIDL
Sbjct: 170 EEIKDFYEYISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLV 229
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
V G+ +EE L + D++S+ ++ D VP +K++ + + VDI
Sbjct: 230 VFGNWETLPLWTLEEALRKRKVA-----DENSI-KVLDKATVPI-IKLMDSHTE-VKVDI 281
Query: 164 SFNQMAGLSALCFLE 178
SFN +G+ A ++
Sbjct: 282 SFNVQSGVKAANLIK 296
>gi|238609344|ref|XP_002397464.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
gi|215471952|gb|EEB98394.1| hypothetical protein MPER_02102 [Moniliophthora perniciosa FA553]
Length = 174
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 50/113 (44%), Gaps = 14/113 (12%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
I P+ V ++ R ++ I I Y EV PFGS K YLP GDID+ VL +
Sbjct: 27 ISPSPVEDEIRSLLVQLISSAIKTRYPDAEVHPFGSYATKLYLPTGDIDIVVLSRTHT-- 84
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGL 171
F C + + VP + + + IPVDISFNQ G+
Sbjct: 85 --IAFRCFVTAKLAK---------ARVPIVKFVTRVELGGIPVDISFNQPGGV 126
>gi|301093772|ref|XP_002997731.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
gi|262109980|gb|EEY68032.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
Length = 489
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 127/324 (39%), Gaps = 60/324 (18%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
+ I + + I P + R +++ ++ ++ G + V FGS + +LP DID+
Sbjct: 123 DEIMDFVTFISPTEDELRSRAQLVEEMRGVVKGLWPEATVETFGSHYTQMFLPQSDIDMV 182
Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMA 169
+ G +E L + + + LE +D+ S E+ D +P + K S +I VD+SFN
Sbjct: 183 LFGVPEGKEPLYK-LAQCLEEKDRVSYLEVIDKARIPIVKMVHKGS--DIHVDVSFNVAG 239
Query: 170 GLSALCFLEKVDR---------LIGKDHLFKRSV--------------ILIKAWCYYESR 206
GL+ ++ R L+ K + +R + +++ ++ + R
Sbjct: 240 GLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETYSGGVGSFLLQMMVVSFLQHHGR 299
Query: 207 ILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGD 266
LGA + L +++ F Y F++ + +S V N G
Sbjct: 300 ALGAEHDDPKFNNLGQLLMGFFTLYGRDFNYTDLAVS-----------------VRNGGS 342
Query: 267 ELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKD--NNNLGRSVSKGNFHR 324
F K+ R Y + G F I N +P D N+ K +F
Sbjct: 343 ------YFPKEKRRWY--------DDGRPFLIAMENPNEPSLDIGKNSYEMRTVKRSFDY 388
Query: 325 IRCALSYGAQRLGEILTLPGHCLG 348
R L R G+ TLPG LG
Sbjct: 389 ARQVLQNEIYRHGQFNTLPGSILG 412
>gi|194208578|ref|XP_001491462.2| PREDICTED: PAP-associated domain-containing protein 5 [Equus
caballus]
Length = 528
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 61/130 (46%), Gaps = 15/130 (11%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I G + +V FGS YLP DIDL
Sbjct: 31 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKGLWPSADVQIFGSFKTGLYLPTSDIDLV 90
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
V G +EE L + D+DSV ++ D VP +K+ S + VDI
Sbjct: 91 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVPI-IKLTD-SFTEVKVDI 142
Query: 164 SFNQMAGLSA 173
SFN G+ A
Sbjct: 143 SFNVQNGVRA 152
>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
Length = 567
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 94/230 (40%), Gaps = 21/230 (9%)
Query: 9 EMEDLQEELLTSPSHTSSSSSYSSPPNLYPF------SIDAEL--WLLAEERIQEILCTI 60
E EDL ++ + S + S S + YP+ S E+ WL E I++ + I
Sbjct: 112 EAEDLSDDGVLSDDESGSRHHSQSATSPYPWVKGHDHSKQREIADWLTME--IKDFVNYI 169
Query: 61 QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
P+ R VI ++R I ++ + FGS YLP DID+ V+ E
Sbjct: 170 SPSKEEIVVRNTVIRRLKRRIAEFWPQTQAHVFGSCATDLYLPGSDIDMVVISTTGDYEQ 229
Query: 120 LARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFL 177
+ + L E+ VP +K V NI VDISF + GL A +
Sbjct: 230 RGKLYQLSSFLRTNKLAKNIEVIATAKVPI-IKFVDPQY-NIHVDISFERTNGLDAARRI 287
Query: 178 EK-VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
K +D + G R ++LI R+ H G + YA +++ H
Sbjct: 288 RKWLDSMPG-----LRELVLIVKQFLRSRRLNNVHVGGLGGYATIILMYH 332
>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
Length = 535
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 66/135 (48%), Gaps = 15/135 (11%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I++ I P E+ R EV+ IQR+I + EV FGS YLP DIDL
Sbjct: 52 EEIKDFYEYISPRPEEEQMRHEVVARIQRVIKDLWPNAEVCVFGSFSTGLYLPTSDIDLV 111
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
V G+ +EE L + D++S+ ++ D VP +K++ + + VDI
Sbjct: 112 VFGNWETLPLWTLEEALRKRKVA-----DENSI-KVLDKATVPI-IKLMDSHTE-VKVDI 163
Query: 164 SFNQMAGLSALCFLE 178
SFN +G+ A ++
Sbjct: 164 SFNVQSGVKAANLIK 178
>gi|448097882|ref|XP_004198786.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359380208|emb|CCE82449.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 650
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 126/309 (40%), Gaps = 44/309 (14%)
Query: 30 YSSP--PNLYPF------SIDAEL--WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQR 79
YS+P N YP+ S E+ WL E +++ + I P+ R V+ ++R
Sbjct: 162 YSAPNETNDYPWVKNHDHSTQKEIADWLTLE--MKDFVNYISPSKEEILTRNRVVKDLKR 219
Query: 80 LINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQDSV 136
IN + E FGS YLP DID+ V + E+ ++ + L N
Sbjct: 220 EINNLWPDTEAHVFGSSATDLYLPGSDIDMVVTSNTGDYENRSKLYQLSSYLRNRKL--- 276
Query: 137 FEIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKR 192
KD++ + A+V IVK NI +DISF + G+ A + + +DR G R
Sbjct: 277 --AKDIEVIAKAKVPIVKFVDPSSNIHIDISFERRNGIEAAKRIRRWLDRTPG-----LR 329
Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
++LI R+ H G + Y+ ++ H FL + +N I N +
Sbjct: 330 ELVLIVKQFLRSRRLNNVHVGGLGGYSTIILCYH-FLRLHPRISTNNISILDNLGSLLIE 388
Query: 253 LPEIVAETVENDGDELLLSPE-----FLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
E+ D + + PE +L K Y P + + F+I I DP
Sbjct: 389 FFELYGRNFSYDNLIIAIDPETDEVKYLPKKDHAYLNP------SKNPFSIV---IQDPA 439
Query: 308 KDNNNLGRS 316
NN+ RS
Sbjct: 440 DSTNNISRS 448
>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
latipes]
Length = 679
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 77/284 (27%), Positives = 117/284 (41%), Gaps = 50/284 (17%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I++ I P EK R EV++ I+ +I+ + EV FGS YLP DIDL
Sbjct: 188 EEIKDFYDYISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLV 247
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + N S ++ D VP I+K SV + V
Sbjct: 248 VFGKWETLPLWTLEEALRK------RNVADKSAIKVLDKATVP----IIKLTDSVTEVKV 297
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
DISFN +G+ A +++ K + V+++K + R L + G I +Y+L
Sbjct: 298 DISFNVESGVKAARLIKEFKE---KYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 352
Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSP 272
+M + FL + F D +IN V + E+ DG +
Sbjct: 353 FLMAVS-FLQLH--FREDVCSPNINIGVLLIEFFELYGRHFNYLKTGIRIKDGGSYVAKD 409
Query: 273 EFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
E K + Y + L I DPL+ +N++GRS
Sbjct: 410 EVQKNMMDGYRPSM--------------LYIEDPLQPDNDVGRS 439
>gi|410911160|ref|XP_003969058.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
Length = 803
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/193 (27%), Positives = 86/193 (44%), Gaps = 12/193 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E + + + P RKEV+N I+ +I + +V FGS YLP DIDL
Sbjct: 255 HEEVIDFYNFMSPRPEEAAMRKEVVNRIETIIKELWPTADVQIFGSFSTGLYLPTSDIDL 314
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + F IK + A V I+K + Q + VDISFN
Sbjct: 315 VVFG--KWERPPLQELEQALRKHNVAEPFSIKVLDK--ATVPIIKLTDQETEVKVDISFN 370
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
G+ A F++ ++ + + ++K + + + G IS+Y+L +MV+
Sbjct: 371 VETGVKAASFIKDYVKMYP---VLPYLIFVLKQFL-LQRDLNEVFTGGISSYSLILMVI- 425
Query: 227 RFLDYYNTFDWDN 239
FL + D N
Sbjct: 426 SFLQLHPRIDARN 438
>gi|348512677|ref|XP_003443869.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
Length = 805
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 92/199 (46%), Gaps = 24/199 (12%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E + + I P RKEV+N I+ +I + ++V +GS YLP DIDL
Sbjct: 254 HEEVMDFYKYISPRPEEAAMRKEVVNRIEVVIKELWPTVDVQIYGSFSTGLYLPTSDIDL 313
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIP 160
V G Q +E+ L + E D +S+ ++ D VP I+K + Q ++
Sbjct: 314 MVTGKWERPPLQELEQALRKH-----EVADPNSI-KVLDKATVP----IIKLTDQRTDVK 363
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
VDISFN G+ A F++ + K + + ++K + + + G IS+Y+L
Sbjct: 364 VDISFNVETGVKAASFIKG---YMKKYPVLPYLIFVLKQFL-LQRDLNEVFTGGISSYSL 419
Query: 221 EMMVLHRFLDYYNTFDWDN 239
+MV+ FL + D N
Sbjct: 420 ILMVI-SFLQLHPRIDPTN 437
>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 603
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 59/231 (25%), Positives = 98/231 (42%), Gaps = 25/231 (10%)
Query: 11 EDLQEELLTSPSHTSSSS-SYSSPPNL-YPFSIDAEL--------WLLAEERIQEILCTI 60
ED EE+ S + S+ ++ PN YP+ D + WL E +++ + I
Sbjct: 123 EDDSEEISNSDNDLSNGDYAHGGTPNSSYPWVKDHDHSKQKEIADWLTLE--MKDFVNYI 180
Query: 61 QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
P+ R VI+ +++ I ++ G E FGS YLP DID+ V+ E+
Sbjct: 181 SPSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVSETGDYEN 240
Query: 120 LARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALC 175
+R + L + E+ A+V I+K V + +D+SF + GL A
Sbjct: 241 RSRLYQLSTFLRTKKLAKNVEV----IASAKVPIIKFVDPVSELHIDVSFERTNGLDAAK 296
Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+ + LI L R ++L+ R+ H G + YA +M H
Sbjct: 297 RIRRW--LISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYH 343
>gi|448101749|ref|XP_004199636.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359381058|emb|CCE81517.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 646
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 126/309 (40%), Gaps = 44/309 (14%)
Query: 30 YSSP--PNLYPF------SIDAEL--WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQR 79
YS+P N YP+ S E+ WL E +++ + I P+ R V+ ++R
Sbjct: 161 YSAPNETNDYPWVKNHDHSTQKEIADWLTLE--MKDFVNYISPSKEEILTRNRVVKDLKR 218
Query: 80 LINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQDSV 136
IN + E FGS YLP DID+ V E+ ++ + L N
Sbjct: 219 EINSLWPDTETHVFGSSATDLYLPGSDIDMVVTSKTGDYENRSKLYQLSSYLRNRKL--- 275
Query: 137 FEIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKR 192
KD++ + A+V I+K NI +DISF + G+ A + K +D+ G R
Sbjct: 276 --AKDIEVIAKAKVPIIKFVDPSSNIHIDISFERRNGIEAAKRIRKWLDKTPG-----LR 328
Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
++LI R+ H G + Y+ ++ H FL + +N I N +
Sbjct: 329 ELVLIIKQFLRSRRLNNVHVGGLGGYSTIILCYH-FLRLHPRISTNNMSILDNLGSLLIE 387
Query: 253 LPEIVAETVENDGDELLLSPE-----FLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
E+ D + + PE +L K Y L ++ + F+I I DP
Sbjct: 388 FFELYGRNFSYDNLIIAIDPETDEPKYLPKKDHAY------LNSSKNPFSIV---IQDPA 438
Query: 308 KDNNNLGRS 316
NN+ RS
Sbjct: 439 DSTNNISRS 447
>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
Length = 603
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 101/235 (42%), Gaps = 33/235 (14%)
Query: 11 EDLQEELLTSPSHTSSSS-SYSSPPNL-YPFSIDAEL--------WLLAEERIQEILCTI 60
ED EE+ S + S+ ++ PN YP+ D + WL E +++ + I
Sbjct: 123 EDDSEEISNSDNDLSNGDYAHGETPNSSYPWVKDHDHSKQKEIADWLTLE--MKDFVNYI 180
Query: 61 QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
P+ R VI+ +++ I ++ G E FGS YLP DID+ V+ E+
Sbjct: 181 SPSSEEIVTRNNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVVVSETGDYEN 240
Query: 120 LARFVCKILENEDQDSVF-----EIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGL 171
+R Q S F K+V+ + A+V I+K V + +D+SF + GL
Sbjct: 241 RSRLY--------QLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHIDVSFERTNGL 292
Query: 172 SALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
A + + LI L R ++L+ R+ H G + YA +M H
Sbjct: 293 DAAKRIRRW--LISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYH 343
>gi|255730627|ref|XP_002550238.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240132195|gb|EER31753.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 603
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 73/304 (24%), Positives = 125/304 (41%), Gaps = 34/304 (11%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E I++ + I P+ R +VI +++ I+ ++ G V FGS YLP
Sbjct: 166 WLTME--IRDFVNYISPSSDEIITRNKVIAALKKSISDFWPGTTVHVFGSCATDLYLPGS 223
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIP 160
DID+ V+ E+ +R + L + E+ A+V I+K +
Sbjct: 224 DIDMVVVSDTGSYENASRLYQLSTFLRTNKLATEVEV----IAHAKVPIIKFVDPKSRLH 279
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
+D+SF + G+ A +++ R + + V+++K + R+ H G + YA
Sbjct: 280 IDVSFERTNGIDA---AKRIRRWLVSTPGLRELVLVVKQFLRTR-RLNNVHVGGLGGYAT 335
Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPE-----FL 275
+M H FL + D N V + E+ D L L PE ++
Sbjct: 336 IIMCYH-FLRLHPKITTDAMSALDNLGVLLIEFFELYGRNFSYDDLVLALDPENEEPKYV 394
Query: 276 KKCREIYSVPITALENTGH-EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQ 334
+K + + L+++ FAI I DP +NN+ RS N ++ A Q
Sbjct: 395 RKSK-------SPLQHSARSSFAII---IQDPADPSNNITRSSY--NLRDLKKAFGGAYQ 442
Query: 335 RLGE 338
L E
Sbjct: 443 LLSE 446
>gi|389642869|ref|XP_003719067.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|351641620|gb|EHA49483.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|440474598|gb|ELQ43333.1| DNA polymerase sigma [Magnaporthe oryzae Y34]
gi|440486580|gb|ELQ66430.1| DNA polymerase sigma [Magnaporthe oryzae P131]
Length = 703
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 62/246 (25%), Positives = 97/246 (39%), Gaps = 45/246 (18%)
Query: 42 DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING--YYGIEVFPFGSVPLKT 99
D +WL E I + +P EK R+ +I+ + +LI + V+PFGS
Sbjct: 369 DGGVWLHKE--IVDFYNYAKPRDFEEKLRQGLIDELAKLIRNSQFRDATVYPFGSFKSNL 426
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCK--------ILENEDQDSVFEIKDVQYVPAQVKI 151
YLP GD+DL + AR+ K +E + ++ + A+V +
Sbjct: 427 YLPTGDMDLVFCSDSYMSGRAARYSSKNHVFKFGAFIERKQLAVDNHVEKIS--KARVPL 484
Query: 152 VKC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
VK S + VD+SF + G+ A+ FL ++ L V IK +
Sbjct: 485 VKYVDSRTGLKVDVSFENITGIRAIETFLAWREQFPDMPVL----VTCIKHFLAMRGLNE 540
Query: 209 GAHYGLISTYALEMMV-----------------------LHRFLDYY-NTFDWDNYCISI 244
A+ G+ T + ++V L RF D Y N F +D IS+
Sbjct: 541 PANGGIGGTTVICLVVSMLQLSPDVQSRSMTPESHLGQLLLRFFDLYGNRFSYDRVAISM 600
Query: 245 NGPVAI 250
N P I
Sbjct: 601 NPPRYI 606
>gi|58260578|ref|XP_567699.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57229780|gb|AAW46182.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 779
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 87/200 (43%), Gaps = 28/200 (14%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
LL EE I + P + R +I I R IN + EV PFGS + YLP GD
Sbjct: 142 LLTEE-INAFYKYVSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200
Query: 106 IDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC----------- 154
IDL V + + R + ++ + Q ++ ++ + A+V I+K
Sbjct: 201 IDLVVAHKYLSDANKQRLLAELGKAMRQANITDVVAI-IARARVPIIKFVTLEGKSHVSS 259
Query: 155 --------SVQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYES 205
V I VDIS NQ G++A + + +D L G R +ILI + +
Sbjct: 260 LEYFSKQEGVGKINVDISLNQANGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQR 314
Query: 206 RILGAHYGLISTYALEMMVL 225
+ + G + +Y++ MV+
Sbjct: 315 SMNEVYTGGLGSYSVICMVI 334
>gi|407039791|gb|EKE39813.1| topoisomerase, putative [Entamoeba nuttalli P19]
Length = 400
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 91/191 (47%), Gaps = 22/191 (11%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
S + EL L E +Q + I+P K R+E++ +++ G V FGS K
Sbjct: 9 SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNVMAFGSTQSKL 65
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILEN---EDQDSVFEIKDVQYVPAQVKIVKCSV 156
+LP DID +V+ ++ + V IL + EDQ F+ A + ++K +
Sbjct: 66 FLPTSDIDFSVITNEYNTRKVLNSVSSILSSYVLEDQKRNFK--------ASIPVLKLTD 117
Query: 157 QN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYG 213
+ I +DIS N G + F+E+V I KD ++ V+LIK+ C Y+ G
Sbjct: 118 KKTLIVLDISHNNTNGTKTVNFIEEV---IKKDDRIRKLVLLIKSLLCCYDFH--QPANG 172
Query: 214 LISTYALEMMV 224
+ TY++ +MV
Sbjct: 173 GLGTYSVFVMV 183
>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
Length = 701
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 102/248 (41%), Gaps = 49/248 (19%)
Query: 38 PFSIDAELWLL----AEERIQEILCTIQ---PAIVSEKKRKEVINYIQRLING-YYGIEV 89
P+S D E W AE +E++ Q P + R VI I R I + EV
Sbjct: 82 PWSADVE-WSKCQNGAEALHRELIAFDQWMAPTGAEHETRCMVIELIARAIKSQFRDAEV 140
Query: 90 FPFGSVPLKTYLPDGDIDLTVLGHQ--NVEEDLA-RFVCKILENEDQDSVFEIKDVQYVP 146
PFGS K YLP GD+DL V+ N+ A R + L + + DVQ +
Sbjct: 141 RPFGSQETKLYLPQGDLDLVVVSRSMANLRTQSALRTMAACLRRHNLAT-----DVQVIA 195
Query: 147 -AQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYY 203
A+V I+K + + VDIS N GL+ ++ L H+ R +IL+
Sbjct: 196 KAKVPIIKFVTTYARLKVDISLNHTNGLTTASYVNGW--LRKWPHI--RPLILVIKHLLM 251
Query: 204 ESRILGAHYGLISTYALEMMV-----LH-------------------RFLDYY-NTFDWD 238
+ + G + +Y++ +MV LH FL+ Y F +D
Sbjct: 252 QRGMSEVFSGGLGSYSVIIMVISFLQLHPKLQRGEIEPGRSLGVLLLEFLELYGKNFGYD 311
Query: 239 NYCISING 246
N ISI G
Sbjct: 312 NCGISIRG 319
>gi|334311788|ref|XP_003339660.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Monodelphis domestica]
Length = 809
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 62/137 (45%), Gaps = 19/137 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 359 EEISDFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 418
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 419 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 468
Query: 162 DISFNQMAGLSALCFLE 178
DISFN G+ A ++
Sbjct: 469 DISFNVQNGVKAAQLIK 485
>gi|384485719|gb|EIE77899.1| hypothetical protein RO3G_02603 [Rhizopus delemar RA 99-880]
Length = 494
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 72/144 (50%), Gaps = 24/144 (16%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------------GI 87
+I E WL ER I+ T + I KRK VI+ +Q ++ + I
Sbjct: 7 NISNEDWLSYLERY-NIIETEEEKI----KRKNVIDLLQHILVNFQRAVTKDLDWKRGDI 61
Query: 88 EVF--PFGSVPLKTYLPDGDIDLTVLG-HQNVEEDLARFVCKILENEDQDSVFEIKDVQY 144
E F PFGS L Y+ D DIDL ++ Q + + +F ++L+ Q V ++ +Q
Sbjct: 62 ECFLSPFGSYALGGYIRDADIDLVLVCPIQVLRKYFFKFFPQLLKQ--QTLVSNVESIQ- 118
Query: 145 VPAQVKIVKCSVQNIPVDISFNQM 168
A V I+KC++ NI +DISF ++
Sbjct: 119 -KANVPIIKCTIDNISIDISFVRL 141
>gi|241948905|ref|XP_002417175.1| topoisomerase 1-related protein TRF4, putative [Candida
dubliniensis CD36]
gi|223640513|emb|CAX44767.1| topoisomerase 1-related protein TRF4, putative [Candida
dubliniensis CD36]
Length = 606
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 101/238 (42%), Gaps = 36/238 (15%)
Query: 11 EDLQEELLTSPSHTS--SSSSYS---SPPNLYPFSIDAEL--------WLLAEERIQEIL 57
ED EE L S +S + Y+ +P + YP+ D + WL E +++ +
Sbjct: 123 EDESEEFLDSEDDSSYKNEGDYANGGTPNSHYPWVKDHDHSKQKEIADWLTLE--MKDFV 180
Query: 58 CTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNV 116
I P+ R VI+ ++ I ++ G E FGS YLP DID+ V+
Sbjct: 181 NYISPSSEEIVTRNNVISTLKTEIGMFWPGTETHVFGSCATDLYLPGSDIDMVVVSETGD 240
Query: 117 EEDLARFVCKILENEDQDSVF-----EIKDVQYVP-AQVKIVKCS--VQNIPVDISFNQM 168
E+ +R Q S F K+V+ + A+V I+K + + +D+SF +
Sbjct: 241 YENRSRLY--------QLSTFLRTKKLAKNVEVIASAKVPIIKFVDPISELHIDVSFERT 292
Query: 169 AGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
GL A + + LI L R ++L+ R+ H G + YA +M H
Sbjct: 293 NGLDAAKRIRRW--LISTPGL--RELVLVIKQFLRSRRLNNVHVGGLGGYATIIMCYH 346
>gi|119910013|ref|XP_001256516.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
gi|297485254|ref|XP_002694925.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
gi|296478153|tpg|DAA20268.1| TPA: DNA polymerase sigma-like [Bos taurus]
Length = 467
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 61/135 (45%), Gaps = 19/135 (14%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
L E I + + P EK R EV+N I+ +I + +V FGS YLP DI
Sbjct: 14 LLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 73
Query: 107 DLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQN 158
DL V G +EE L + D+DSV ++ D VP I+K S
Sbjct: 74 DLVVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTE 123
Query: 159 IPVDISFNQMAGLSA 173
+ VDISFN G+ A
Sbjct: 124 VKVDISFNVQNGVRA 138
>gi|224135265|ref|XP_002322024.1| predicted protein [Populus trichocarpa]
gi|222869020|gb|EEF06151.1| predicted protein [Populus trichocarpa]
Length = 122
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 55/112 (49%), Gaps = 31/112 (27%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
+++ E W AEER E++ IQP SE++R V+ Y+QRLI + FP
Sbjct: 29 ALEPERWATAEERTAELIACIQPNQPSEERRNAVLCYVQRLI-----MNCFP-------- 75
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENED--QDSVFEIKDVQYVPAQV 149
+E A V ILE+E+ +++ F +K+VQY+ A+V
Sbjct: 76 ----------------CQETWANEVRDILEHEEKNENAEFHVKEVQYIQAEV 111
>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
[Sarcophilus harrisii]
Length = 615
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 62/137 (45%), Gaps = 19/137 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 118 EEISDFYEYMSPRPEEEKMRMEVVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 177
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 178 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 227
Query: 162 DISFNQMAGLSALCFLE 178
DISFN G+ A ++
Sbjct: 228 DISFNVQNGVKAAQLIK 244
>gi|340939540|gb|EGS20162.1| hypothetical protein CTHT_0046690 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 836
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 101/240 (42%), Gaps = 42/240 (17%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGD 105
+L + + + ++P ++ R E++ ++R +G EV+PFGS YLP+GD
Sbjct: 467 VLLHKEVADFYAYVKPRDFEDRLRGELVQDLKRFSRRLFGDAEVYPFGSFCSGLYLPNGD 526
Query: 106 IDLTVL-------GHQNVEEDLARFVCK---ILENEDQDSVFEIKDVQYVPAQVKIVKCS 155
+DL ++ G ++ A F + + E ++ E+ VP VK V+
Sbjct: 527 MDLVLMSDLYKNGGPPRYHDNHALFKFRNGLVKEGMIHNNNIELILHARVPL-VKFVERK 585
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLI-GKDHLFKRSVILIKAWCYYESRILGAHYGL 214
+ +DISF +G+ A+ K L G L V +IK + + G+
Sbjct: 586 T-GLRIDISFENRSGIVAINTFHKWRNLYPGMPAL----VSVIKHFLLMRGLNEPVNGGI 640
Query: 215 IS-------TYALEMM----------------VLHRFLDYY-NTFDWDNYCISINGPVAI 250
S L+MM +L RF DYY N F+++ IS++ P I
Sbjct: 641 GSFSVTCLVVSMLQMMPEVQSGNMDTRHHLGELLLRFFDYYGNRFNYEGLAISLDPPALI 700
>gi|297698707|ref|XP_002826459.1| PREDICTED: PAP-associated domain-containing protein 5 [Pongo
abelii]
Length = 588
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 61/135 (45%), Gaps = 19/135 (14%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
L E I + + P EK R EV+N I+ +I + +V FGS YLP DI
Sbjct: 88 LLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 147
Query: 107 DLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQN 158
DL V G +EE L + D+DSV ++ D VP I+K S
Sbjct: 148 DLVVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTE 197
Query: 159 IPVDISFNQMAGLSA 173
+ VDISFN G+ A
Sbjct: 198 VKVDISFNVQNGVRA 212
>gi|134117055|ref|XP_772754.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255372|gb|EAL18107.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 779
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 28/200 (14%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
LL EE I + P + R +I I R IN + EV PFGS + YLP GD
Sbjct: 142 LLTEE-INAFYKYVSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200
Query: 106 IDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC----------- 154
IDL V + + R + ++ + Q ++ ++ + A+V I+K
Sbjct: 201 IDLVVAHKYLSDANKQRLLAELGKAMRQANITDVVAI-IARARVPIIKFVTLEGKSHVSS 259
Query: 155 --------SVQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYES 205
+ I VDIS NQ G++A + + +D L G R +ILI + +
Sbjct: 260 LEYFSKQEGIGKINVDISLNQANGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQR 314
Query: 206 RILGAHYGLISTYALEMMVL 225
+ + G + +Y++ MV+
Sbjct: 315 SMNEVYTGGLGSYSVICMVI 334
>gi|363738258|ref|XP_414098.3| PREDICTED: PAP-associated domain-containing protein 5 [Gallus
gallus]
Length = 534
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 89/197 (45%), Gaps = 25/197 (12%)
Query: 39 FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPL 97
F+I A + + E I + + P E+ R EV+N I+ +I + +V FGS
Sbjct: 24 FAIAAVVKAILHEEIIDFYKYMSPRHEEERMRMEVVNRIENVIKELWPNADVQIFGSFKT 83
Query: 98 KTYLPDGDIDLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKI 151
YLP DIDL V G +EE L + D++SV ++ D VP I
Sbjct: 84 GLYLPTSDIDLVVFGKWETLPLWTLEEALRKHNVA-----DENSV-KVLDKATVP----I 133
Query: 152 VKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
+K S + VDISFN G+ A ++ I K + V+++K + R L
Sbjct: 134 IKLTDSFTEVKVDISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRDLN 188
Query: 210 AHY-GLISTYALEMMVL 225
+ G I +Y+L +M +
Sbjct: 189 EVFTGGIGSYSLFLMAV 205
>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
Length = 667
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 129/310 (41%), Gaps = 49/310 (15%)
Query: 38 PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVP 96
P+ + E+ L + I + ++P RK +I + R I + + V+ FGS
Sbjct: 221 PYPEEPEVARLLHQDILNFINYLEPTPQEHAVRKSLITKLDRAIRAKWPEVTVYVFGSFE 280
Query: 97 LKTYLPDGDIDLTVLGHQNVEEDLARFV---CKILENEDQDSVFEIKDVQYVPAQVKIVK 153
+ YLP DID+ V+ V + + + L+N + ++ VP +K V
Sbjct: 281 TRLYLPTSDIDMVVMSSDTVHRGTKKHMYSLARHLKNCKLATEIQVITTANVPI-IKFVD 339
Query: 154 CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY- 212
I VD+SFNQ GL C + V+ + K + +L+K + R L +
Sbjct: 340 -PFTRIHVDVSFNQPGGLKT-CLV--VNGFLKKFPAVRPLTMLVKH--FLNMRALNEVFL 393
Query: 213 GLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSP 272
G +S+YA+ +V+ FL + P + +V + + +L
Sbjct: 394 GGLSSYAIVCLVVS-FLQMH---------------------PRLSTSSVRQEDNLGVLFL 431
Query: 273 EFLK--KCREIYSVPITALENTGHEFAIKH-----------LNIVDPLKDNNNLGRSVSK 319
EFL+ R Y A+ N G F+ K L+I DP+ +N++ +S S+
Sbjct: 432 EFLELYGKRYNYDAVGIAVHNGGFYFSKKKMGWVKPSQPYLLSIQDPVDYDNDISKS-SR 490
Query: 320 GNFHRIRCAL 329
G R++ L
Sbjct: 491 G-ILRVKATL 499
>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
Length = 833
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 64/273 (23%), Positives = 111/273 (40%), Gaps = 54/273 (19%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLA------- 121
R +V+ ++ I + G +V FGS YLP DID+ VLG + +E
Sbjct: 111 RHDVVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPTSDIDMVVLG-EKIEPRYGNPQNGPH 169
Query: 122 -RFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKV 180
R ++L+ + + IK + + ++ + +I VDISFN G++A+ ++
Sbjct: 170 YRLQDRLLKQGIAER-YSIKVIDSAAVPIIKMRDMITDIKVDISFNMKTGVTAIGLVKGY 228
Query: 181 DR---------LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
R L+ K L +R + + W G IS+Y L +MV+ FL +
Sbjct: 229 IRQFPALRYLVLVLKQFLLQRD--MNEVWT-----------GGISSYGLILMVVS-FLQH 274
Query: 232 YNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPEFLKKCREIYS 283
DN +N V + E +G + + E + +E
Sbjct: 275 QGA---DNTADDVNLGVLLIKFLRFYGMEFEYSKCCIRVKNGGQFIKKEEMATQMKE--- 328
Query: 284 VPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
TG ++ L+I DPL +N++GR+
Sbjct: 329 ------SPTGPKYVPNFLSIEDPLTPSNDVGRA 355
>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
Length = 887
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 64/271 (23%), Positives = 109/271 (40%), Gaps = 50/271 (18%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQ------NVEEDLAR 122
R +V+ ++ I + G +V FGS YLP DID+ VLG + N +
Sbjct: 165 RHDVVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPTSDIDMVVLGEKIEPRYGNPQNGPHY 224
Query: 123 FVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDR 182
+ L + + IK + + ++ + +I VDISFN G++A+ ++ R
Sbjct: 225 RLQDRLLKQGIAERYSIKVIDSAAVPIIKMRDMITDIKVDISFNMKTGVTAIGLVKGYIR 284
Query: 183 ---------LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYN 233
L+ K L +R + + W G IS+Y L +MV+ FL +
Sbjct: 285 QFPALRYLVLVLKQFLLQRD--MNEVWT-----------GGISSYGLILMVVS-FLQHQG 330
Query: 234 TFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPEFLKKCREIYSVP 285
DN +N V + E +G + + E + +E P
Sbjct: 331 A---DNTGDDVNLGVLLIKFLRFYGMEFEYSKCCIRVKNGGQFIKKEEMATQMKE---AP 384
Query: 286 ITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
TG ++ L+I DPL +N++GR+
Sbjct: 385 ------TGPKYVPNFLSIEDPLTPSNDVGRA 409
>gi|367030285|ref|XP_003664426.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
42464]
gi|347011696|gb|AEO59181.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
42464]
Length = 772
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 103/248 (41%), Gaps = 46/248 (18%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLK 98
S + +WL E + + I+P E+ R E++ +++ + EV+PFGS P
Sbjct: 397 SANMAVWLHKE--VVDFYDYIKPRDFEERLRNELVQDLKQFCRRVFKDAEVYPFGSFPSG 454
Query: 99 TYLPDGDIDLTVL-------GHQ--NVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQV 149
YLP D+DL + GH + + L RF +++ ++ +E + + A+V
Sbjct: 455 LYLPTADMDLVFMSDAYASGGHAKYHTKSFLHRFKGQLVNHK---VAWENEIELILHAKV 511
Query: 150 KIVKC--SVQNIPVDISFNQMAGLSAL-CFLEKVDRLIGKDHLFKRSVILIKAWCYYES- 205
+VK + VD+SF G++A+ F ++ G L V LIK +
Sbjct: 512 PLVKFIEHKTGLKVDVSFENNTGVNAIKTFKAWREQYPGMPAL----VTLIKHFLLMRGL 567
Query: 206 --RILGAHYG---------------------LISTYALEMMVLHRFLDYYNTFDWDNYCI 242
+ G G L + + L ++LH F Y N F++ I
Sbjct: 568 NEPVNGGIGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQLLLHFFDLYGNKFNYQTVAI 627
Query: 243 SINGPVAI 250
S+N P I
Sbjct: 628 SLNPPRWI 635
>gi|336367333|gb|EGN95678.1| hypothetical protein SERLA73DRAFT_60289 [Serpula lacrymans var.
lacrymans S7.3]
Length = 538
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 4/129 (3%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
+ ++ + + P+ V ++ R VI+ + + ++ + +V PFGS K YLPDGDI
Sbjct: 184 MLHREVEAFVNYMSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDI 243
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDIS 164
DL V+ +++ V L N + + K A+V IVK + + VDIS
Sbjct: 244 DL-VIQSESMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDIS 302
Query: 165 FNQMAGLSA 173
NQ G+ A
Sbjct: 303 INQGNGVIA 311
>gi|426382139|ref|XP_004057678.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 664
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 213 HEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
V G +EE L + D+DSV ++ D VP I+K S +
Sbjct: 273 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 322
Query: 161 VDISFNQMAGLSA 173
VDISFN G+ A
Sbjct: 323 VDISFNVQNGVRA 335
>gi|353241543|emb|CCA73351.1| related to TRF4-topoisomerase I-related protein [Piriformospora
indica DSM 11827]
Length = 628
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 17/110 (15%)
Query: 84 YYGIEVFP------FGSVPLKTYLPDGDIDLTV------LGHQNVEEDLARFVCKILENE 131
YY +++P FGS YLP GDID+ + +N + + IL +
Sbjct: 121 YYARKIWPDATATAFGSYATGLYLPTGDIDVVIETKYATASTKNAAQRALSQLATILRSA 180
Query: 132 DQDSVFEIKDVQYVPAQVKIVKC-SVQN-IPVDISFNQMAGLSALCFLEK 179
+ E + +Q + A+V I+K SV IPVDIS NQ G+SA+ + +
Sbjct: 181 ---GLAERRKIQVISARVSIIKFDSVHGGIPVDISLNQTTGVSAIPVINR 227
>gi|167384281|ref|XP_001736885.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
gi|165900593|gb|EDR26889.1| PAP-associated domain-containing protein, putative [Entamoeba
dispar SAW760]
Length = 400
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 91/188 (48%), Gaps = 16/188 (8%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
S + EL L E +Q + I+P K R+E++ +++ G + PFGS K
Sbjct: 9 SFEGELDLNQE--VQLFIKFIEPNKNEYKIREELLTKYSKILEKE-GYNIMPFGSTQSKL 65
Query: 100 YLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN- 158
+LP DID +V+ + E R V + + V E + + A V ++K + +
Sbjct: 66 FLPTSDIDFSVITN----EYNTRKVLNSISSILSSYVLEDQKRNF-KASVPVLKLTDKQT 120
Query: 159 -IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA-WCYYESRILGAHYGLIS 216
I +DIS N +G + F+E++ I KD +R V+LIK+ C Y+ G +
Sbjct: 121 LIVLDISHNNTSGTKTVDFIEEI---IKKDDRIRRLVLLIKSILCCYDFH--QPANGGLG 175
Query: 217 TYALEMMV 224
TY++ +MV
Sbjct: 176 TYSVFVMV 183
>gi|441597299|ref|XP_003263084.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Nomascus leucogenys]
Length = 666
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 215 HEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 274
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
V G +EE L + D+DSV ++ D VP I+K S +
Sbjct: 275 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 324
Query: 161 VDISFNQMAGLSA 173
VDISFN G+ A
Sbjct: 325 VDISFNVQNGVRA 337
>gi|440637467|gb|ELR07386.1| hypothetical protein GMDG_08401 [Geomyces destructans 20631-21]
Length = 753
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 66/155 (42%), Gaps = 26/155 (16%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFG 93
N YP S + WL E I + +PA V E R +I+ ++R + ++ ++ PFG
Sbjct: 410 NHYP-SSNMGSWLHME--IMDFFHRFKPAEVEENMRGALISDLRRAVQKVWHDADILPFG 466
Query: 94 SVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDV---QYVPAQVK 150
S P YLP D+DL + ++ ++ K +++F +D + + A
Sbjct: 467 SYPAGLYLPTADMDLVFVSRGYMDGGYGKYTNK-------NALFRFRDFLDREKIAAPYS 519
Query: 151 IVKCSVQNIP------------VDISFNQMAGLSA 173
I S +P VD+SF GL A
Sbjct: 520 IEVISKAKVPLVKYIDYYTGLRVDVSFENDTGLIA 554
>gi|334325414|ref|XP_001371537.2| PREDICTED: DNA polymerase sigma-like [Monodelphis domestica]
Length = 860
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 84/194 (43%), Gaps = 20/194 (10%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ +I + +V FGS YLP DIDL
Sbjct: 309 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDL 368
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 369 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 424
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
G+ K RLI KD++ K S +IL+ + + G IS+Y+L +
Sbjct: 425 METGV-------KAARLI-KDYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 476
Query: 223 MVLHRFLDYYNTFD 236
M + FL + D
Sbjct: 477 MAI-SFLQLHPRID 489
>gi|344289184|ref|XP_003416325.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Loxodonta africana]
Length = 595
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 61/133 (45%), Gaps = 15/133 (11%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
L E I + + P EK R EV+N I+ +I + +V FGS YLP DI
Sbjct: 95 LLHEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 154
Query: 107 DLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP 160
DL V G +EE L + D+DSV ++ D VP +K+ S +
Sbjct: 155 DLVVFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVPI-IKLTD-SFTEVK 206
Query: 161 VDISFNQMAGLSA 173
VDISFN G+ A
Sbjct: 207 VDISFNVQNGVRA 219
>gi|296231051|ref|XP_002760982.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Callithrix jacchus]
Length = 664
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 213 HEEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDL 272
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
V G +EE L + D+DSV ++ D VP I+K S +
Sbjct: 273 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 322
Query: 161 VDISFNQMAGLSA 173
VDISFN G+ A
Sbjct: 323 VDISFNVQNGVRA 335
>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
6260]
Length = 588
Score = 48.9 bits (115), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 96/229 (41%), Gaps = 23/229 (10%)
Query: 11 EDLQEELLTSPSHTSSSSSYSSPPNLYPF------SIDAEL--WLLAEERIQEILCTIQP 62
+D ++ + S + + P +LYP+ S E+ WL E I++ + I P
Sbjct: 124 DDYSDDGIISDDQSGNRHIAKEPSSLYPWIKNQDHSKQQEIADWLTLE--IKDFVNYISP 181
Query: 63 AIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLA 121
+ + R VI ++ I ++ EV FGS YLP DID+ V+ E +
Sbjct: 182 SKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVVISRDGDREQRS 241
Query: 122 RF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALCFL 177
R + L ++ E+ A+V IVK NI +D+SF + G+ A
Sbjct: 242 RLYQLSTHLRSKKLAKNIEV----IAKAKVPIVKFVDPDSNIHIDVSFERSNGIDAAI-- 295
Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
K+ + + V+++K + R+ H G + Y+ ++ H
Sbjct: 296 -KIREWLASTPGLRELVLVVKQFL-RSRRLNNVHVGGLGGYSTIILCYH 342
>gi|444720754|gb|ELW61529.1| HEAT repeat-containing protein 3 [Tupaia chinensis]
Length = 1047
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 65/140 (46%), Gaps = 15/140 (10%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 648 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 707
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
V G +EE L + + D+DSV ++ D VP +K+ S + VDI
Sbjct: 708 VFGKWENLPLWTLEEALRKH-----KVADEDSV-KVLDKATVPI-IKLTD-SFTEVKVDI 759
Query: 164 SFNQMAGLSALCFLEKVDRL 183
SFN G+ A ++ +L
Sbjct: 760 SFNVQNGVRAADLIKDFTKL 779
>gi|336380050|gb|EGO21204.1| hypothetical protein SERLADRAFT_476100 [Serpula lacrymans var.
lacrymans S7.9]
Length = 592
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 4/124 (3%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
++ + + P+ V ++ R VI+ + + ++ + +V PFGS K YLPDGDIDL V+
Sbjct: 189 VEAFVNYMSPSPVEDEIRGLVISLVTKAVSSAFPDAQVLPFGSYETKLYLPDGDIDL-VI 247
Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDISFNQMA 169
+++ V L N + + K A+V IVK + + VDIS NQ
Sbjct: 248 QSESMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFVTNHGRLNVDISINQGN 307
Query: 170 GLSA 173
G+ A
Sbjct: 308 GVIA 311
>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
6260]
Length = 588
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 96/229 (41%), Gaps = 23/229 (10%)
Query: 11 EDLQEELLTSPSHTSSSSSYSSPPNLYPF------SIDAEL--WLLAEERIQEILCTIQP 62
+D ++ + S + + P +LYP+ S E+ WL E I++ + I P
Sbjct: 124 DDYSDDGIISDDQSGNRHIAKEPSSLYPWIKNQDHSKQQEIADWLTLE--IKDFVNYISP 181
Query: 63 AIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLA 121
+ + R VI ++ I ++ EV FGS YLP DID+ V+ E +
Sbjct: 182 SKLEITTRNNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVVISRDGDREQRS 241
Query: 122 RF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFNQMAGLSALCFL 177
R + L ++ E+ A+V IVK NI +D+SF + G+ A
Sbjct: 242 RLYQLSTHLRSKKLAKNIEV----IAKAKVPIVKFVDPDSNIHIDVSFERSNGIDAAI-- 295
Query: 178 EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
K+ + + V+++K + R+ H G + Y+ ++ H
Sbjct: 296 -KIREWLASTPGLRELVLVVKQFL-RSRRLNNVHVGGLGGYSTIILCYH 342
>gi|256818780|ref|NP_001035374.2| PAP-associated domain-containing protein 5 isoform a [Homo sapiens]
gi|194374871|dbj|BAG62550.1| unnamed protein product [Homo sapiens]
Length = 698
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 201 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 260
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 261 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 310
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 311 DISFNVQNGVRA 322
>gi|374724400|gb|EHR76480.1| terminal uridylyltransferase [uncultured marine group II
euryarchaeote]
Length = 730
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 87/191 (45%), Gaps = 14/191 (7%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQR-LINGYYGIEVFPFGSVPLK 98
S+ E+ L E I+ +L + P K R+ + +++R L + + +E+ FGS
Sbjct: 36 SVSEEVILNIGEGIEAMLKSEGPTPKQLKAREGLFRHLKRTLEHRFKNVELQQFGSSQSG 95
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ 157
L GD+DL + ++ R V ++L+ D ++D+ +P A+V I+K +
Sbjct: 96 LTLQAGDLDLCLQFKGDIPAKALRQVNRLLKQHD------MEDIVMLPRAKVPIIKFKDE 149
Query: 158 --NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
IPVDIS N AL E + R D + ++ +K W + A G
Sbjct: 150 RTKIPVDISINNTL---ALHNTELLKRYSSCDERIRSVILAVKHWANRRD-VCDASTGTF 205
Query: 216 STYALEMMVLH 226
S+YA ++ +
Sbjct: 206 SSYAWTLLAVQ 216
>gi|256818782|ref|NP_001035375.2| PAP-associated domain-containing protein 5 isoform b [Homo sapiens]
Length = 651
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 201 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 260
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 261 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 310
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 311 DISFNVQNGVRA 322
>gi|297283968|ref|XP_001083145.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Macaca mulatta]
Length = 700
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 203 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 262
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 263 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 312
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 313 DISFNVQNGVRA 324
>gi|410983511|ref|XP_003998082.1| PREDICTED: PAP-associated domain-containing protein 5 [Felis catus]
Length = 514
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 64 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 123
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 124 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 173
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 174 DISFNVQNGVRA 185
>gi|426243516|ref|XP_004015600.1| PREDICTED: PAP-associated domain-containing protein 5 [Ovis aries]
Length = 588
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 60/133 (45%), Gaps = 19/133 (14%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 137 HEEISDFYEYMSPRPEEEKMRMEVVNRIEGVIKELWPSADVQIFGSFKTGLYLPTSDIDL 196
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
V G +EE L + D+DSV ++ D VP I+K S +
Sbjct: 197 VVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVK 246
Query: 161 VDISFNQMAGLSA 173
VDISFN G+ A
Sbjct: 247 VDISFNVQNGVRA 259
>gi|297283970|ref|XP_002802516.1| PREDICTED: PAP-associated domain-containing protein 5 [Macaca
mulatta]
Length = 653
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 62/137 (45%), Gaps = 19/137 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 203 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 262
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 263 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 312
Query: 162 DISFNQMAGLSALCFLE 178
DISFN G+ A ++
Sbjct: 313 DISFNVQNGVRAADLIK 329
>gi|426382137|ref|XP_004057677.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 631
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 244 DISFNVQNGVRA 255
>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
Length = 361
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 85/171 (49%), Gaps = 15/171 (8%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
I P ++R++ I ++ I + +++ FGS K YLP+ DID+ ++ +
Sbjct: 75 IIPTSEEHRRREQAIMRVETFIKEFASEVDIQAFGSFKTKLYLPNADIDVVMIDKSMSAK 134
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK-CSVQN-IPVDISFNQMAGLSALC 175
+L + V + L D+ ++V + A+V I+K V++ DISFNQM GL +
Sbjct: 135 ELYKKVAQSLMKSDK-----FENVNLIANAKVPIIKFFEVESQYQFDISFNQMDGLKQID 189
Query: 176 FLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
+ K + + FK ++++K C + R L Y G I ++ L M+L
Sbjct: 190 EIRKAFTIYPE---FKYLIMILK--CMLKQRELNETYSGGIGSFLLFQMIL 235
>gi|441597295|ref|XP_003263083.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Nomascus leucogenys]
gi|348031139|emb|CCB84642.1| PAP associated domain containing 5 [Homo sapiens]
Length = 631
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 244 DISFNVQNGVRA 255
>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC
30864]
Length = 558
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 67/133 (50%), Gaps = 13/133 (9%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDI 106
E+ + + + I+P + + R+E++ I+ +I G + +EVF GS YLP DI
Sbjct: 213 EQEMYDFVEFIKPTPLEHQMREEIVQRIREVITGAWKHARVEVF--GSFATGLYLPMSDI 270
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDIS 164
D+ V G N ++ + K+LE ++ D VP I+K ++ + VDIS
Sbjct: 271 DIVVFG--NWDQIPLFTLGKLLEESRIAKNVKVIDKTSVP----IIKLADALSGVFVDIS 324
Query: 165 FNQMAGLSALCFL 177
FN +GL + F+
Sbjct: 325 FNLESGLRTVEFI 337
>gi|321263807|ref|XP_003196621.1| hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
gi|317463098|gb|ADV24834.1| Hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
Length = 784
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 28/200 (14%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGD 105
LL EE I + P + R +I I R IN + EV PFGS + YLP GD
Sbjct: 142 LLTEE-INAFYKYVSPTREEFEVRLFMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200
Query: 106 IDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC----------- 154
IDL V + + R + ++ + Q ++ ++ + A+V I+K
Sbjct: 201 IDLVVAHKYLSDANKQRLLAELGKAMRQANITDVVAI-IARARVPIIKFVTLEGKSHVFS 259
Query: 155 --------SVQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYES 205
+ I VDIS NQ G++A + + +D L G R +ILI + +
Sbjct: 260 LAYLTKQEGIGKINVDISLNQGNGVTAGKIINQYLDALPG-----ARQLILIVKYFLSQR 314
Query: 206 RILGAHYGLISTYALEMMVL 225
+ + G + +Y++ MV+
Sbjct: 315 SMNEVYTGGLGSYSVICMVI 334
>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum
NRRL Y-27907]
Length = 615
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 79/184 (42%), Gaps = 11/184 (5%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E I++ + + P+ R V+N ++ I ++ G E FGS YLP
Sbjct: 183 WLTME--IKDFVDYVSPSSDEIVTRNTVVNRLKTQIAKFWPGTEAHVFGSCATDLYLPGS 240
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID+ V+ E+ +R + L ++ E+ VP +K V + I +D
Sbjct: 241 DIDMVVISETGDYENRSRLYQLSSFLRSKKLAKNVEVIANAKVPI-IKFVDPESE-IHID 298
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + G+ A + K LI L R ++LI R+ H G + YA +
Sbjct: 299 VSFERTNGIDAAKRIRKW--LITTPGL--RELVLIVKQFLRSRRLNNVHVGGLGGYATII 354
Query: 223 MVLH 226
M H
Sbjct: 355 MCYH 358
>gi|268637610|ref|XP_002649102.1| PAP/25A-associated domain-containing protein [Dictyostelium
discoideum AX4]
gi|256012839|gb|EEU04050.1| PAP/25A-associated domain-containing protein [Dictyostelium
discoideum AX4]
Length = 938
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 85/175 (48%), Gaps = 15/175 (8%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVE- 117
++P+ + + R+++I + ++ + V FGS ++P DID+ + G N E
Sbjct: 457 LEPSELESRIRQKIIRDLDAIVKSNWPKANVVVFGSFSSNLFIPSSDIDIQISGINNAES 516
Query: 118 -----EDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAG 170
++ R + I+ QDS ++++ A+V I+K S + +DI F+ G
Sbjct: 517 VNKYNQNPIRDLFDIIIRNHQDSFINVRNI--FGAKVPIIKMTSSHSHYNIDICFDTPNG 574
Query: 171 LSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
+ V + + K + R+++LI + +++ + + G I +YAL +MV+
Sbjct: 575 IENTA----VVKGLLKQYKSMRTLLLIIKFFLHQNNLNETYTGGIGSYALALMVV 625
>gi|397498213|ref|XP_003819879.1| PREDICTED: PAP-associated domain-containing protein 5, partial [Pan
paniscus]
Length = 593
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 96 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 155
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 156 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 205
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 206 DISFNVQNGVRA 217
>gi|440900205|gb|ELR51393.1| PAP-associated domain-containing protein 5, partial [Bos grunniens
mutus]
Length = 563
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 66 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 125
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 126 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 175
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 176 DISFNVQNGVRA 187
>gi|59800139|sp|Q8NDF8.2|PAPD5_HUMAN RecName: Full=PAP-associated domain-containing protein 5; AltName:
Full=Terminal uridylyltransferase 3; Short=TUTase 3;
AltName: Full=Topoisomerase-related function protein
4-2; Short=TRF4-2
Length = 572
Score = 48.5 bits (114), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 122 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 181
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 182 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 231
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 232 DISFNVQNGVRA 243
>gi|403292555|ref|XP_003937307.1| PREDICTED: PAP-associated domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 631
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 244 DISFNVQNGVRA 255
>gi|402908342|ref|XP_003916909.1| PREDICTED: PAP-associated domain-containing protein 5 [Papio
anubis]
Length = 605
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 108 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 167
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 168 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 217
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 218 DISFNVQNGVRA 229
>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Metaseiulus occidentalis]
Length = 512
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 59/122 (48%), Gaps = 9/122 (7%)
Query: 61 QPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED 119
+P + R+EV+N ++ ++ + + FGS YLP DIDL +LG E
Sbjct: 106 KPTRTEHQVRQEVVNRVKEVVRQLWPQAQCEVFGSFCTGLYLPTSDIDLVILGDW---ET 162
Query: 120 LARFVC-KILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCF 176
L F K L E S IK + A V IVK + Q N+ VDISFNQ G+ +
Sbjct: 163 LPMFTLHKALIQEKIASASTIKVLDR--ASVPIVKFTEQSTNVKVDISFNQKNGVKSAKL 220
Query: 177 LE 178
++
Sbjct: 221 IK 222
>gi|359319041|ref|XP_535307.4| PREDICTED: PAP-associated domain-containing protein 5 [Canis lupus
familiaris]
Length = 641
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 144 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 203
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 204 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 253
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 254 DISFNVQNGVRA 265
>gi|405123317|gb|AFR98082.1| DNA polymerase sigma [Cryptococcus neoformans var. grubii H99]
Length = 649
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 79/174 (45%), Gaps = 27/174 (15%)
Query: 73 VINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE 131
+I I R IN + EV PFGS + YLP GDIDL V + + R + ++ +
Sbjct: 32 MIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHKYLSDANKQRLLAELGKAM 91
Query: 132 DQDSVFEIKDVQYVPAQVKIVK-------------------CSVQNIPVDISFNQMAGLS 172
Q ++ ++ + A+V I+K ++ I VDIS NQ G++
Sbjct: 92 RQANITDVVAI-IARARVPIIKFVTLEGESHVTSLADSSKQGAIGKINVDISLNQGNGVT 150
Query: 173 ALCFLEK-VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
A + + +D L G R +ILI + + + + G + +Y++ MV+
Sbjct: 151 AGKIINQYLDALPG-----ARQLILIVKYFLSQRSMNEVYTGGLGSYSVICMVI 199
>gi|390477686|ref|XP_002760981.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Callithrix jacchus]
Length = 631
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 134 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 193
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 194 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 243
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 244 DISFNVQNGVRA 255
>gi|332845909|ref|XP_003315148.1| PREDICTED: PAP-associated domain-containing protein 5 [Pan
troglodytes]
Length = 586
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 89 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 148
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 149 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 198
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 199 DISFNVQNGVRA 210
>gi|395510809|ref|XP_003759662.1| PREDICTED: DNA polymerase sigma [Sarcophilus harrisii]
Length = 594
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 120/303 (39%), Gaps = 51/303 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ +I + +V FGS YLP DIDL
Sbjct: 43 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDL 102
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 103 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 158
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
G+ K RLI KD++ K S +IL+ + + G IS+Y+L +
Sbjct: 159 METGV-------KAARLI-KDYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 210
Query: 223 MVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI- 281
M + FL + D A +L ++ E E G +LK I
Sbjct: 211 MAI-SFLQLHPRIDARR---------ADENLGMLLVEFFELYGRNF----NYLKTGIRIK 256
Query: 282 ----YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLG 337
Y ++ + + L I DPL N++GRS SYGA ++
Sbjct: 257 NGGAYIAKEEIMKAMTNGYRPSMLCIEDPLLPGNDVGRS-------------SYGAMQVK 303
Query: 338 EIL 340
++
Sbjct: 304 QVF 306
>gi|380798533|gb|AFE71142.1| PAP-associated domain-containing protein 5 isoform a, partial
[Macaca mulatta]
Length = 618
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 121 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 180
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 181 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 230
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 231 DISFNVQNGVRA 242
>gi|291410211|ref|XP_002721395.1| PREDICTED: DNA polymerase sigma-like [Oryctolagus cuniculus]
Length = 522
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 25 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 84
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 85 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 134
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 135 DISFNVQNGVRA 146
>gi|150951520|ref|XP_001387852.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
gi|149388662|gb|EAZ63829.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
Length = 605
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/186 (24%), Positives = 83/186 (44%), Gaps = 15/186 (8%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E +++ + I P+ + R +IN ++ I+ Y+ E FGS YLP
Sbjct: 177 WLTME--MKDFVNYISPSSEEIRTRNRLINKLKSSISSYWPETETHVFGSSATDLYLPGS 234
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIP 160
DID+ ++ E+ +R + L ++ E+ A+V I+K N+
Sbjct: 235 DIDIVIVSRTGDYENRSRLYQLSSYLRHKGLAKNMEV----IAKAKVPIIKFVDPESNVN 290
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
+D+SF + G+ A +K+ R + + V++IK + R+ H G + YA
Sbjct: 291 IDVSFERRNGIEA---AKKIRRWMTTTPGLRELVLIIKQFL-SSRRLNNVHSGGLGGYAT 346
Query: 221 EMMVLH 226
++ H
Sbjct: 347 IILCYH 352
>gi|327270606|ref|XP_003220080.1| PREDICTED: DNA polymerase sigma-like [Anolis carolinensis]
Length = 686
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 80/174 (45%), Gaps = 20/174 (11%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
RKEV+ I+ +I + +V FGS YLP DIDL V G E + + + L
Sbjct: 212 RKEVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ESPPLQLLEQAL 269
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ + IK + A V I+K + Q + VDISFN G+ K RLI K
Sbjct: 270 RKHNVAEPYSIKVLDK--ATVPIIKLTDQETEVKVDISFNVETGV-------KAARLI-K 319
Query: 187 DHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
D++ K + +IL+ + + G IS+Y+L +M + FL + D
Sbjct: 320 DYMKKYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 372
>gi|335308290|ref|XP_003361170.1| PREDICTED: PAP-associated domain-containing protein 5-like [Sus
scrofa]
Length = 511
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 14 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 73
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 74 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 123
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 124 DISFNVQNGVRA 135
>gi|301756837|ref|XP_002914273.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Ailuropoda melanoleuca]
Length = 593
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 143 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 202
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 203 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 252
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 253 DISFNVQNGVRA 264
>gi|410905163|ref|XP_003966061.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
Length = 778
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 84/197 (42%), Gaps = 26/197 (13%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + I P E R++V+N I+R+I + V FGS YLP DIDL
Sbjct: 238 HEEIMDFFNFISPRPEEEAMRRDVVNRIERVIKDLWPTARVEIFGSFSTGLYLPTSDIDL 297
Query: 109 TVLG---HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDI 163
V G H ++E + + L+ + + IK + A V I+K + + VDI
Sbjct: 298 VVFGKWDHPPLQE-----LEQALKKRNVAGPYPIKVLD--KATVPIIKLTDHETEVKVDI 350
Query: 164 SFNQMAGLSALCFLEKVDRLIGKDHLFKRSV----ILIKAWCYYESRILGAHYGLISTYA 219
SFN + A F+ K +L K +V I + + + G I +Y+
Sbjct: 351 SFNVETAVKAAQFI--------KSYLKKYTVLPPLIFVLKQFLLQRDLNEVFTGGIGSYS 402
Query: 220 LEMMVLHRFLDYYNTFD 236
L +M + FL + D
Sbjct: 403 LILMAI-SFLQLHPRID 418
>gi|281338901|gb|EFB14485.1| hypothetical protein PANDA_002140 [Ailuropoda melanoleuca]
Length = 632
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 135 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 194
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 195 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 244
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 245 DISFNVQNGVRA 256
>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
MF3/22]
Length = 732
Score = 48.1 bits (113), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 52/181 (28%), Positives = 79/181 (43%), Gaps = 18/181 (9%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDI 106
L + ++ L + P V + R V+ I I Y EV PFGS K YLP GDI
Sbjct: 148 LMHKEVEAYLKYVSPTPVEHEVRWMVVQLISSSIKRVYSDSEVLPFGSFGTKLYLPQGDI 207
Query: 107 DLTVLGHQNVEEDLARF----VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIP 160
DL V LA F K L N + + K A+V I+K +
Sbjct: 208 DLVVQSRT-----LASFEKVTALKSLANIVKRTGLADKVTIISQARVPIIKFTTLYGRFA 262
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYA 219
VDIS NQ G+ + ++R + + + V+++K+ + + R L Y G + +YA
Sbjct: 263 VDISMNQSNGVKT---GDMINRFLNEFPALRAIVLIVKS--FLKQRNLNEVYSGGLGSYA 317
Query: 220 L 220
+
Sbjct: 318 I 318
>gi|256818788|ref|NP_001157971.1| PAP-associated domain-containing protein 5 isoform b [Mus musculus]
Length = 637
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 64/142 (45%), Gaps = 19/142 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 183 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 242
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 243 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 292
Query: 162 DISFNQMAGLSALCFLEKVDRL 183
DISFN G+ A ++ +L
Sbjct: 293 DISFNVQNGVRAADLIKDFTKL 314
>gi|354479063|ref|XP_003501733.1| PREDICTED: DNA polymerase sigma-like [Cricetulus griseus]
Length = 624
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 55/195 (28%), Positives = 87/195 (44%), Gaps = 15/195 (7%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
W L EE I + + P R+EV+ I+ ++ + +V FGS YLP
Sbjct: 69 WRLHEE-IIDFYNFMSPCPEEAAMRREVVTRIETVVKDLWPTADVQIFGSFSTGLYLPTS 127
Query: 105 DIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVD 162
DIDL V G E + + + L + IK + A V I+K + Q + VD
Sbjct: 128 DIDLVVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVD 183
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
ISFN G+ A F++ + K L ++++K + R L + G IS+Y+L
Sbjct: 184 ISFNMETGVRAAEFIKN---YMKKYSLLPYLILVLKQ--FLLQRDLNEVFTGGISSYSLI 238
Query: 222 MMVLHRFLDYYNTFD 236
+M + FL + D
Sbjct: 239 LMAIS-FLQLHPRID 252
>gi|119603153|gb|EAW82747.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
gi|119603154|gb|EAW82748.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
Length = 527
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 30 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 89
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 90 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 139
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 140 DISFNVQNGVRA 151
>gi|355710188|gb|EHH31652.1| hypothetical protein EGK_12764, partial [Macaca mulatta]
Length = 564
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 67 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 126
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 127 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 176
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 177 DISFNVQNGVRA 188
>gi|444729764|gb|ELW70170.1| DNA polymerase sigma [Tupaia chinensis]
Length = 729
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 78 RREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 135
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 136 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 190
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 191 YSLLPYLILVLKQF-LLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 238
>gi|395857085|ref|XP_003800943.1| PREDICTED: DNA polymerase sigma [Otolemur garnettii]
Length = 837
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 83/194 (42%), Gaps = 20/194 (10%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + +V FGS YLP DIDL
Sbjct: 286 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 345
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 346 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 401
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
G+ A F+ K+++ K S +IL+ + + G IS+Y+L +
Sbjct: 402 METGVRAAEFI--------KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 453
Query: 223 MVLHRFLDYYNTFD 236
M + FL + D
Sbjct: 454 MAIS-FLQLHPRID 466
>gi|219518398|gb|AAI44798.1| Papd5 protein [Mus musculus]
Length = 590
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 64/142 (45%), Gaps = 19/142 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 136 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 195
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 196 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 245
Query: 162 DISFNQMAGLSALCFLEKVDRL 183
DISFN G+ A ++ +L
Sbjct: 246 DISFNVQNGVRAADLIKDFTKL 267
>gi|432884542|ref|XP_004074488.1| PREDICTED: uncharacterized protein LOC101158959 [Oryzias latipes]
Length = 421
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 7/137 (5%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E + + + P RKEV+ I+ +I + +V FGS YLP DIDL
Sbjct: 243 HEEVMDFYKYMSPRPEEAAMRKEVVKRIETIIKEQWPSADVQIFGSFSTGLYLPTSDIDL 302
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + F IK + A V I+K + Q + VDISFN
Sbjct: 303 VVFG--KWERPPLQELEQALRKHNVAEPFSIKVLD--KATVPIIKLTDQETEVKVDISFN 358
Query: 167 QMAGLSALCFLEKVDRL 183
G+ A F++ +L
Sbjct: 359 VETGVKAASFIKDYVKL 375
>gi|224112707|ref|XP_002316267.1| predicted protein [Populus trichocarpa]
gi|222865307|gb|EEF02438.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 47.8 bits (112), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 129/310 (41%), Gaps = 48/310 (15%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDI 106
E +++IL IQP R +VI ++ ++ G V PFGS + GD+
Sbjct: 2 EPTLKDILNGIQPLREDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNLFTRWGDL 61
Query: 107 DLTVL---------GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSV 156
D++++ + +++L V K L + +Q++P A+V I+K
Sbjct: 62 DISIVLSNGSYISSAGKRRKQNLLEDVLKALRQRGG-----WQRLQFIPNARVPILKFEN 116
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
+I D+S + M GL FL ++ + D F+ V+L+K W + I G ++
Sbjct: 117 ASISCDVSIDNMQGLMKSKFLFWINEI---DRRFRDMVLLVKEWAKTHN-INNPKTGSLN 172
Query: 217 TYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLK 276
+Y+L ++V+ F C+ P + L EI V +D + E +
Sbjct: 173 SYSLSLLVIFHF----------QTCV----PAILPPLKEIYPRNVIDDLTGVRTDAE--R 216
Query: 277 KCREIYSVPITAL---------ENTGHEFAIKHL-NIVDPLKDNNNLGRSVSKGNFHRIR 326
+ EI + I+ N+ E I L I DP + N R+VS N +I
Sbjct: 217 RIGEICAANISRYRSNKSRAINRNSLSELFISFLTKIEDPFEQPENTARAVSAANLMKIS 276
Query: 327 CALSYGAQRL 336
A+ RL
Sbjct: 277 EAIQTTHHRL 286
>gi|396490001|ref|XP_003843230.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
gi|312219809|emb|CBX99751.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
Length = 717
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 100/243 (41%), Gaps = 50/243 (20%)
Query: 42 DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI-----EVFPFGSVP 96
D E WL E I + + P ++R ++ +Q ++ GY+ + FGS P
Sbjct: 348 DPEKWLHNE--ILDFYDFVAPKPYEHEQRNLLVQRVQSVL-GYHRFPQDNGRILCFGSFP 404
Query: 97 LKTYLPDGDIDLTV--------------LGHQNVEEDLARFVCKILENEDQDSVFEIKDV 142
YLP D+DL + +N L + V +L+ ++ F
Sbjct: 405 AGLYLPTADMDLVYTSDRHFNGGPPVMDVTARNATAPLLKGVRNVLQR--RNMAFGAISC 462
Query: 143 QYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-LCFLEKVDRLIGKDHLFKRSVILIKA 199
Y A+V +VK SV + VDISF ++G+ A F + D+ D ++ V L+K
Sbjct: 463 IYG-AKVPLVKFTDSVTRLQVDISFENLSGMQAQATFAQWKDKY--PDMIYM--VALLKQ 517
Query: 200 WCYYESRILGAHYGLISTYALEMMVLH----------------RFLDYY-NTFDWDNYCI 242
+ + H G I +A+ +++H FLDYY N FD + I
Sbjct: 518 FLVMRG-LNEVHTGGIGGFAIICLIVHYIHQAGKAENLAELFKGFLDYYGNKFDLTKHRI 576
Query: 243 SIN 245
+N
Sbjct: 577 QMN 579
>gi|326676716|ref|XP_686065.4| PREDICTED: DNA polymerase sigma [Danio rerio]
Length = 706
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 89/200 (44%), Gaps = 26/200 (13%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV++ I+ +I + +V FGS +LP DIDL
Sbjct: 211 HEEIMDFYNFMSPRPEEATMRQEVVDRIESVIKELWPTADVQIFGSFSTGLFLPTSDIDL 270
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYV-PAQVKIVKCSVQ--NI 159
V G Q +E+ L + SV E ++ + A V I+K + Q +
Sbjct: 271 VVFGKWEKPPLQQLEQALRKH-----------SVAEPYSIKVLDKATVPIIKLTDQETEV 319
Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
VDISFN G+ A F+++ + K + + ++K + + + G IS+Y+
Sbjct: 320 KVDISFNVETGIKAASFIKE---YVKKYTVLPYLIFVLKQFL-LQRDLNEVFTGGISSYS 375
Query: 220 LEMMVLHRFLDYYNTFDWDN 239
L +MV+ FL + D N
Sbjct: 376 LILMVI-SFLQLHPRIDTRN 394
>gi|351706462|gb|EHB09381.1| DNA polymerase sigma [Heterocephalus glaber]
Length = 542
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 80/172 (46%), Gaps = 16/172 (9%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G + R ++L
Sbjct: 11 RREVVQRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG------EWERPPLQLL 64
Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
E + SV E ++ + A V I+K + Q + VDISFN G+ A F++
Sbjct: 65 EQALRKHSVAEPCSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNY---- 120
Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
K++ +IL+ + + G IS+Y+L +M + FL + D
Sbjct: 121 MKEYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|119603155|gb|EAW82749.1| PAP associated domain containing 5, isoform CRA_b [Homo sapiens]
Length = 374
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 30 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 89
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 90 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 139
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 140 DISFNVQNGVRA 151
>gi|222619531|gb|EEE55663.1| hypothetical protein OsJ_04061 [Oryza sativa Japonica Group]
Length = 461
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 85/372 (22%), Positives = 146/372 (39%), Gaps = 92/372 (24%)
Query: 39 FSIDAEL-WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN------GYYGIEVFP 91
FS+ + + + E+ ++ IL I+P E R++ ++ IQ L N G P
Sbjct: 22 FSMTEHINYDVVEQCVKNILSLIKPV---EDDRRKRLSAIQELSNSIPKVAALRGAVFKP 78
Query: 92 FGSVPLKTYLPDGDIDLTV------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYV 145
FGS Y GD+D++V + + ++ + R + ++L+N VQ+V
Sbjct: 79 FGSFVSNLYSNSGDLDISVHLPNNSIISKKKKQYVLRELMRVLQNRGVAGY-----VQFV 133
Query: 146 P-AQVKIVK--CSVQNIPVDISFNQMAGL---SALCFLEKVDRLIGKDHLFKRSVILIKA 199
P A+V +++ + I DIS N G C++ +D G V+LIK
Sbjct: 134 PFARVPVLQYVSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------MVLLIKE 187
Query: 200 WCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAE 259
W ++ I G +++Y+L ++VL F C P + L EI
Sbjct: 188 WAKAQN-INDPKTGTLNSYSLCLLVLFHF----------QTC----EPAILPPLKEIYEG 232
Query: 260 TVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN-IVDPLKDNNNLGRSVS 318
+E E+ + E +HL+ + DP++ +N R+V
Sbjct: 233 NIEEGIAEMTVYDE-------------------------EHLDEVKDPIERPDNAARAVD 267
Query: 319 KGNFHRIRCALSYGAQRLG-----------EILTLP--GHCLGMGLEKFFI-----NTLE 360
RI A + ++ E+L P G LG+ + + + NT
Sbjct: 268 LKGLERIAGAFTAANRKFASLQHAKRNDLLEMLCTPAVGSKLGIRVSQEVMANSCTNTPR 327
Query: 361 RNGKGLRPDVQV 372
RN RP QV
Sbjct: 328 RNRHHGRPTEQV 339
>gi|402871130|ref|XP_003899535.1| PREDICTED: DNA polymerase sigma [Papio anubis]
Length = 775
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 83/194 (42%), Gaps = 20/194 (10%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + +V FGS YLP DIDL
Sbjct: 224 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 283
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 284 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 339
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
G+ A F+ K+++ K S +IL+ + + G IS+Y+L +
Sbjct: 340 METGVRAAEFI--------KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 391
Query: 223 MVLHRFLDYYNTFD 236
M + FL + D
Sbjct: 392 MAI-SFLQLHPRID 404
>gi|119603156|gb|EAW82750.1| PAP associated domain containing 5, isoform CRA_c [Homo sapiens]
Length = 371
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 61/132 (46%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 30 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 89
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + + D+DSV ++ D VP I+K S + V
Sbjct: 90 VFGKWENLPLWTLEEALRKH-----KVADEDSV-KVLDKATVP----IIKLTDSFTEVKV 139
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 140 DISFNVQNGVRA 151
>gi|431900711|gb|ELK08155.1| DNA polymerase sigma [Pteropus alecto]
Length = 539
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVGRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens
LYAD-421 SS1]
Length = 647
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 68/154 (44%), Gaps = 13/154 (8%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
++ + + P + ++ R + I R I+ Y +V PFGS K YLP GDIDL +
Sbjct: 162 VEAFVDYMSPTPIEDEVRSLSVQLIARAISKSYPDAKVLPFGSYETKLYLPSGDIDLVIY 221
Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMA 169
H + D V L N + + + A+V I+K + VDIS NQ
Sbjct: 222 SHSMMRMDKVS-VLHSLANIMKRAGITDRVTIIAKAKVPIIKFVTAHGRFSVDISVNQGN 280
Query: 170 GLSALCFLEKVDR---------LIGKDHLFKRSV 194
G+ +++ R LI K+ L +RS+
Sbjct: 281 GVDTGKMVKQFLRELPALRSLVLIIKNFLSQRSM 314
>gi|341895116|gb|EGT51051.1| hypothetical protein CAEBREN_16945 [Caenorhabditis brenneri]
Length = 901
Score = 47.0 bits (110), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQY 144
I+V FGS+ +LP DID+ V + + D + LE ++ + +
Sbjct: 129 IKVSMFGSLRTTLFLPTSDIDVLVECDEWIGTPGDWLTETARGLEIDNIAESVSVYGGAF 188
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
VP VK+V + + +DISFN + G+ A +++KV L + V+L+K + +Y
Sbjct: 189 VPI-VKMVDRDTR-LSIDISFNTVQGVRAASYIDKVKEEFP---LIEPLVLLLKQFLHY- 242
Query: 205 SRILGAHY-GLISTYALEMMVLHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVE 262
R L + G +S+Y L +++++ F Y N Y +N + E+ + +E
Sbjct: 243 -RNLNQTFTGGLSSYGLVLLLVNFFQLYALNMRHRTIYDRGVNLGHLLLRFLEVYS--LE 299
Query: 263 NDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIK-----HLNIVDPLKDNNNLGRSV 317
+ +E+ +SP +C Y VP +A G + +K +L + DPL N++GRS
Sbjct: 300 FNYEEIGISP---GQC--CY-VPKSA---AGARYGLKRAQPGNLALEDPLLTANDVGRST 350
Query: 318 SKGNFHRIRCALSYGAQRLGEILTL 342
NF I A Q L +TL
Sbjct: 351 Y--NFSSIANAFGQAFQILLVAVTL 373
>gi|255566462|ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis]
gi|223536493|gb|EEF38140.1| zinc finger protein, putative [Ricinus communis]
Length = 493
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 82/170 (48%), Gaps = 25/170 (14%)
Query: 71 KEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVL---------GHQNVEEDLA 121
K+VI I+ L G V PFGS + GD+D++++ + ++++
Sbjct: 36 KDVIASIESL----RGATVEPFGSFVSNLFTRWGDLDISIMLANGSYISSAAKKRKQNVL 91
Query: 122 RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGLSALCFLE 178
R K L + + +Q+VP A+V ++K QNI D+S + + G FL
Sbjct: 92 REFHKALRQKG-----GWRRLQFVPNARVPLLKFESGRQNISCDVSIDNLQGQIKSNFLF 146
Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRF 228
++++ G+ F+ V+L+K W + I G +++Y+L ++V+ F
Sbjct: 147 WLNQIDGR---FRDMVLLVKEWAKAHN-INNPKTGTLNSYSLSLLVIFHF 192
>gi|341883718|gb|EGT39653.1| hypothetical protein CAEBREN_22894 [Caenorhabditis brenneri]
Length = 901
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 87 IEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQY 144
I+V FGS+ +LP DID+ V + + D + LE ++ + +
Sbjct: 129 IKVSMFGSLRTTLFLPTSDIDVLVECDEWIGTPGDWLTETARGLEIDNIAESVSVYGGAF 188
Query: 145 VPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
VP VK+V + + +DISFN + G+ A +++KV L + V+L+K + +Y
Sbjct: 189 VPI-VKMVDRDTR-LSIDISFNTVQGVRAASYIDKVKEEFP---LIEPLVLLLKQFLHY- 242
Query: 205 SRILGAHY-GLISTYALEMMVLHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVE 262
R L + G +S+Y L +++++ F Y N Y +N + E+ + +E
Sbjct: 243 -RNLNQTFTGGLSSYGLVLLLVNFFQLYALNMRHRTIYDRGVNLGHLLLRFLEVYS--LE 299
Query: 263 NDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIK-----HLNIVDPLKDNNNLGRSV 317
+ +E+ +SP +C Y VP +A G + +K +L + DPL N++GRS
Sbjct: 300 FNYEEIGISP---GQC--CY-VPKSA---AGARYGLKRAQPGNLALEDPLLTANDVGRST 350
Query: 318 SKGNFHRIRCALSYGAQRLGEILTL 342
NF I A Q L +TL
Sbjct: 351 Y--NFSSIANAFGQAFQILLVAVTL 373
>gi|431914108|gb|ELK15367.1| PAP-associated domain-containing protein 5 [Pteropus alecto]
Length = 530
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 65/148 (43%), Gaps = 19/148 (12%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFG 93
LYP + + E I + + P EK R EV++ I+ +I + +V FG
Sbjct: 17 QLYPGLVRSAQTGRLHEEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFG 76
Query: 94 SVPLKTYLPDGDIDLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPA 147
S YLP DIDL V G +EE L + D+DSV ++ D VP
Sbjct: 77 SFKTGLYLPTSDIDLVVFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP- 129
Query: 148 QVKIVKC--SVQNIPVDISFNQMAGLSA 173
I+K S + VDISFN G+ A
Sbjct: 130 ---IIKLTDSFTEVKVDISFNVQNGVRA 154
>gi|293332275|ref|NP_001169645.1| hypothetical protein [Zea mays]
gi|224030617|gb|ACN34384.1| unknown [Zea mays]
gi|414879730|tpg|DAA56861.1| TPA: hypothetical protein ZEAMMB73_892019 [Zea mays]
Length = 574
Score = 47.0 bits (110), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 87/190 (45%), Gaps = 21/190 (11%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY---GIEVFPFGSVPLKTYLPDGDIDLT 109
I++IL I+P KR I ++ I+ G V PFGS Y GD+DL+
Sbjct: 15 IKDILALIKPVEDDRSKRLSTIQELENCIHSLASLSGAAVKPFGSFVSDLYSKSGDLDLS 74
Query: 110 V-LG-------HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK-CSVQ-N 158
V G ++ +++ R V K L + V +Q++P A+V +++ S Q
Sbjct: 75 VQFGNGSNHPINKKKKQNALRDVRKALLSR---GVTGYMQMQFIPHARVPVLQYVSKQFG 131
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
I DIS AG V+ + D F V+LIK W ++ I G +++Y
Sbjct: 132 ISCDISIGNFAGRIKSKIFYWVNTV---DERFGDMVLLIKEWAKAQN-INDPKSGTLNSY 187
Query: 219 ALEMMVLHRF 228
+L ++VL+ F
Sbjct: 188 SLCLLVLYHF 197
>gi|358055188|dbj|GAA98957.1| hypothetical protein E5Q_05645 [Mixia osmundae IAM 14324]
Length = 813
Score = 47.0 bits (110), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 83/188 (44%), Gaps = 9/188 (4%)
Query: 45 LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPD 103
+ +L +E +Q + P+ R VI+ ++ + + ++ PFGS + YLP
Sbjct: 280 IQMLTKE-LQAFKAYVTPSRAEHAFRGHVIDQVRNALRQIWADTDLQPFGSYLTQLYLPG 338
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPV 161
GDIDL +L + +R + +I + +D+ V A+V IVK + +
Sbjct: 339 GDIDLVMLSATAASQTPSRVLHRIAQIM-RDANIGYDFVVISRAKVPIVKFISTTGGFNI 397
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
DIS NQ G+ A V R+I + ++ Y R + + G + +Y+L
Sbjct: 398 DISLNQPGGIRAGTV---VQRMIDHNGGEAARTLIYCIKLYLSIRGMSEVFTGGLGSYSL 454
Query: 221 EMMVLHRF 228
MV+ F
Sbjct: 455 ICMVISFF 462
>gi|126139251|ref|XP_001386148.1| Poly(A) polymerase PAPalpha (Polynucleotide adenylyltransferase
alpha) [Scheffersomyces stipitis CBS 6054]
gi|126093430|gb|ABN68119.1| Poly(A) polymerase PAPalpha (Polynucleotide adenylyltransferase
alpha) [Scheffersomyces stipitis CBS 6054]
Length = 556
Score = 47.0 bits (110), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 48/226 (21%), Positives = 90/226 (39%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
+ KKR EV+ Q+++ + G +VF FGS L Y P D+D
Sbjct: 43 ATKKRVEVLTQFQKMVQEFVFTVSKKKNMSDGMAKDAGGKVFTFGSYRLGVYGPGSDLDT 102
Query: 109 TVLGHQNV-EEDLARFVCKILENEDQ--------DSVFEIKDVQY--VPAQVKIVKCSVQ 157
V+ Q+V ED +I+ + D+ I ++Y VP + +V
Sbjct: 103 LVVVPQHVSREDFFTVFAEIIRKRPELEEIVPVPDAFVPIIKIEYDGVPLDLIFACLNVP 162
Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
IP+D++ + + L+ +++ +L+ K +FK ++ IK W ++
Sbjct: 163 RIPLDMTLDDKNLLRNLDERDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKMWAQQKA- 221
Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ G +G A M+V + +F + Y ++W
Sbjct: 222 VYGNVFGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFNIYTKWNW 267
>gi|449326076|gb|AGE92663.1| PAPD7 [Homo sapiens]
Length = 772
Score = 47.0 bits (110), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 83/194 (42%), Gaps = 20/194 (10%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + +V FGS YLP DIDL
Sbjct: 221 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 280
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 281 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 336
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEM 222
G+ A F+ K+++ K S +IL+ + + G IS+Y+L +
Sbjct: 337 METGVRAAEFI--------KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLIL 388
Query: 223 MVLHRFLDYYNTFD 236
M + FL + D
Sbjct: 389 MAI-SFLQLHPRID 401
>gi|218189365|gb|EEC71792.1| hypothetical protein OsI_04417 [Oryza sativa Indica Group]
Length = 557
Score = 47.0 bits (110), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 141/362 (38%), Gaps = 91/362 (25%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN------GYYGIEVFPFGSVPLKTYL 101
+ E+ ++ IL I+P E R++ ++ IQ L N G PFGS Y
Sbjct: 9 VVEQCVKNILSLIKPV---EDDRRKRLSAIQELSNSIPKVAALRGAVFKPFGSFVSNLYS 65
Query: 102 PDGDIDLTV------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVK- 153
GD+D++V + + ++ + R + ++L+N VQ++P A+V +++
Sbjct: 66 NSGDLDISVQLPNNSIISKKKKQYVLRELMRVLQNRGVAGY-----VQFIPFARVPVLQY 120
Query: 154 -CSVQNIPVDISFNQMAGL---SALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
+ I DIS N G C++ +D G V+LIK W ++ I
Sbjct: 121 VSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------MVLLIKEWAKAQN-IND 173
Query: 210 AHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELL 269
G +++Y+L ++VL F C P + L EI +E E+
Sbjct: 174 PKTGTLNSYSLCLLVLFHF----------QTC----EPAILPPLKEIYEGNIEEGIAEMT 219
Query: 270 LSPEFLKKCREIYSVPITALENTGHEFAIKHLNIV-DPLKDNNNLGRSVSKGNFHRIRCA 328
+ E +HL+ V DP++ +N R+V RI A
Sbjct: 220 VYDE-------------------------EHLDEVEDPIERPDNAARAVGLKGLERIAGA 254
Query: 329 LSYGAQRLG-----------EILTLP--GHCLGMGLEKFFI-----NTLERNGKGLRPDV 370
+ ++ E+L P G LG+ + + + NT RN RP
Sbjct: 255 FTAANRKFASLQHAKRNDLLEMLCTPAVGSKLGIRVSQEVMANSCTNTPRRNRHHGRPTE 314
Query: 371 QV 372
QV
Sbjct: 315 QV 316
>gi|335304248|ref|XP_003134203.2| PREDICTED: DNA polymerase sigma [Sus scrofa]
Length = 625
Score = 46.6 bits (109), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 83/191 (43%), Gaps = 14/191 (7%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + EV FGS YLP DIDL
Sbjct: 74 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTAEVQIFGSFSTGLYLPTSDIDL 133
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L IK + A V I+K + Q + VDISFN
Sbjct: 134 VVFG--KWERPPLQLLEQALRKHKVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 189
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
G+ A F++ + K L ++++K + R L + G IS+Y+L +M +
Sbjct: 190 METGVRAAEFIKN---YMKKYSLLPYLILVLKQ--FLLQRDLNEVFTGGISSYSLILMAI 244
Query: 226 HRFLDYYNTFD 236
FL + D
Sbjct: 245 S-FLQLHPRID 254
>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
in germ line development protein 4; AltName:
Full=Germline development defective-4
gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
Length = 845
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 76/300 (25%), Positives = 134/300 (44%), Gaps = 24/300 (8%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDID 107
E I ++ I+P + + R +V ++ + + I++ FGS+ +LP DID
Sbjct: 82 EEIVDMYHWIKPNEIESRLRTKVFEKVRDSVLRRWKQKTIKISMFGSLRTNLFLPTSDID 141
Query: 108 LTVLGHQNVEE--DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISF 165
+ V V D + LE ++ + +VP VK+V + + +DISF
Sbjct: 142 VLVECDDWVGTPGDWLAETARGLEADNIAESVMVYGGAFVPI-VKMVDRDTR-LSIDISF 199
Query: 166 NQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMV 224
N + G+ A ++ KV L + V+L+K + +Y R L + G +S+Y L +++
Sbjct: 200 NTVQGVRAASYIAKVKEEFP---LIEPLVLLLKQFLHY--RNLNQTFTGGLSSYGLVLLL 254
Query: 225 LHRFLDY-YNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYS 283
++ F Y N Y +N + E+ ++E + +E+ +SP +C Y
Sbjct: 255 VNFFQLYALNMRSRTIYDRGVNLGHLLLRFLELY--SLEFNFEEMGISP---GQC--CYI 307
Query: 284 VPITALENTGHEFAI-KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTL 342
+ GH+ A +L + DPL N++GRS NF I A Q L +TL
Sbjct: 308 PKSASGARYGHKQAQPGNLALEDPLLTANDVGRSTY--NFSSIANAFGQAFQILLVAVTL 365
>gi|402083045|gb|EJT78063.1| DNA polymerase sigma [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 732
Score = 46.6 bits (109), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 35/144 (24%), Positives = 67/144 (46%), Gaps = 21/144 (14%)
Query: 45 LWLLAEERIQEILCTIQPAIVSEKKRK----EVINYIQRLINGYYGIEVFPFGSVPLKTY 100
+WL ++ I++ ++P EK R+ E++ +++R N V+PFGS Y
Sbjct: 400 VWL--DKEIKDFYDYVKPTHFEEKLRQSLVDELVTHVRRTWND---ASVYPFGSFKSGLY 454
Query: 101 LPDGDIDLTVLGHQNVEEDLARFV--------CKILENEDQDSVFEIKDVQYVPAQVKIV 152
LP GD+DL + + +A++ + +E ++ + A+V +V
Sbjct: 455 LPTGDMDLVFCSDKYLSRHIAQYTPKKQVFHFARFVEKRGLAHQHRVERIHK--ARVPLV 512
Query: 153 KC--SVQNIPVDISFNQMAGLSAL 174
K + + VDISF G++A+
Sbjct: 513 KYVDARTGLKVDISFENSTGITAV 536
>gi|74003095|ref|XP_545180.2| PREDICTED: DNA polymerase sigma [Canis lupus familiaris]
Length = 672
Score = 46.6 bits (109), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 12/190 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + +V FGS YLP DIDL
Sbjct: 121 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDL 180
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 181 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 236
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
G+ A E + + K L ++++K + + + G IS+Y+L +M +
Sbjct: 237 METGVRA---AELIKNYMKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS 292
Query: 227 RFLDYYNTFD 236
FL + D
Sbjct: 293 -FLQLHPRID 301
>gi|47209824|emb|CAF91228.1| unnamed protein product [Tetraodon nigroviridis]
Length = 964
Score = 46.6 bits (109), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 63/135 (46%), Gaps = 13/135 (9%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + I P E R++V+N I+++I + +V FGS YLP DIDL
Sbjct: 451 HEEIMDFFNFISPRPEEEAMRRDVVNRIEKVIKDLWPTAQVEIFGSFSTGLYLPTSDIDL 510
Query: 109 TVLG---HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDI 163
V G H ++E + + L+ + + IK + A V I+K + + VDI
Sbjct: 511 VVFGKWDHPPLQE-----LEQALKKRNVAGPYPIKVLD--KATVPIIKLTDHETEVKVDI 563
Query: 164 SFNQMAGLSALCFLE 178
SFN + A F++
Sbjct: 564 SFNVETAVKAAQFIK 578
>gi|157819111|ref|NP_001100803.1| DNA polymerase sigma [Rattus norvegicus]
gi|149032753|gb|EDL87608.1| polymerase (DNA directed) sigma (predicted) [Rattus norvegicus]
Length = 541
Score = 46.6 bits (109), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPSADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|440905114|gb|ELR55540.1| DNA polymerase sigma [Bos grunniens mutus]
Length = 542
Score = 46.6 bits (109), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|256818784|ref|NP_001157969.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
gi|256818786|ref|NP_001157970.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
Length = 680
Score = 46.6 bits (109), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 183 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 242
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 243 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 292
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 293 DISFNVQNGVRA 304
>gi|237512970|ref|NP_001153629.1| DNA polymerase sigma [Bos taurus]
Length = 541
Score = 46.6 bits (109), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor
FP-101664 SS1]
Length = 660
Score = 46.6 bits (109), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 72/162 (44%), Gaps = 18/162 (11%)
Query: 43 AELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYL 101
AE++ E ++ + I P + ++ R V+ + R + Y +V PFGS K YL
Sbjct: 164 AEMYARIE--VEAFVKYISPTPIEDEVRSLVVALVSRAVTRTYTDAQVLPFGSYETKLYL 221
Query: 102 PDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNI 159
P GDIDL + D V L N + + + A+V I+K +
Sbjct: 222 PLGDIDLVIYSQSMARMDRVS-VLHSLANIVKRAGITDRVTIIAKAKVPIIKFVTTHGRF 280
Query: 160 PVDISFNQMAGLSA----LCFLEKVDRLIGKDHLFKRSVILI 197
VDIS NQ G++A FLE++ L RS++LI
Sbjct: 281 SVDISINQGNGVTAGKMVKQFLEELPAL--------RSLVLI 314
>gi|281354437|gb|EFB30021.1| hypothetical protein PANDA_012597 [Ailuropoda melanoleuca]
Length = 542
Score = 46.6 bits (109), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 77/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A E + + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRA---AELIKNYMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|449465848|ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
Length = 464
Score = 46.2 bits (108), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 88/193 (45%), Gaps = 16/193 (8%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRL---INGYYGIEVFPFGSVPLKTYLPD 103
L + I++IL ++P R +VIN ++ + I G + PFGS +
Sbjct: 4 LTLDRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRW 63
Query: 104 GDIDLTVLGHQNVEEDLA-----RFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCS-- 155
GD+DL+V + A + + + ++N + + K +Q +P A+V I+K
Sbjct: 64 GDLDLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYK-LQLIPHARVPILKIEHI 122
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
NI DIS + + G L V+ + G+ F V+L+K W I + G
Sbjct: 123 QHNISCDISIDNLVGQIKSKILLWVNEIDGR---FHDMVLLVKEWAKAHD-INNSKQGTF 178
Query: 216 STYALEMMVLHRF 228
++Y+L ++V+ F
Sbjct: 179 NSYSLSLLVIFHF 191
>gi|395510432|ref|XP_003759479.1| PREDICTED: poly(A) RNA polymerase GLD2 [Sarcophilus harrisii]
Length = 729
Score = 46.2 bits (108), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 61/126 (48%), Gaps = 6/126 (4%)
Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
DGD+ L V ++ AR++ +++ + I+ Q +PA+V IVK V +
Sbjct: 464 DGDLCLVVKEEPVNQKTEARYILSLVQKHFCTRLCYIERPQLIPAKVPIVKFRDKVSCVE 523
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
D++ N + G+ L L + + V+++K W + I A G +++Y+L
Sbjct: 524 FDLNVNNIVGIRNTFLLRTYAYLESR---VRPLVLVVKKWASHHE-INDASRGTLNSYSL 579
Query: 221 EMMVLH 226
+MVLH
Sbjct: 580 VLMVLH 585
>gi|60392891|sp|Q68ED3.2|PAPD5_MOUSE RecName: Full=PAP-associated domain-containing protein 5; AltName:
Full=Topoisomerase-related function protein 4-2;
Short=TRF4-2
gi|148878177|gb|AAI45738.1| Papd5 protein [Mus musculus]
gi|219519562|gb|AAI44797.1| Papd5 protein [Mus musculus]
Length = 633
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 136 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 195
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 196 VFGKWENLPLWTLEEALRKHKVA-----DEDSV-KVLDKATVP----IIKLTDSFTEVKV 245
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 246 DISFNVQNGVRA 257
>gi|198469802|ref|XP_001355128.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
gi|198147042|gb|EAL32185.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
Length = 1202
Score = 46.2 bits (108), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 131/331 (39%), Gaps = 61/331 (18%)
Query: 22 SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
+H + + Y P P E + E I+ + P R EV+ I+ ++
Sbjct: 284 NHMAMIAKYKGEPWRKPDYPYGEGVVGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIENVV 343
Query: 82 NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD-SVFEI 139
+ + V FGS +LP DIDL VLG + E L + LE E + + E
Sbjct: 344 HTIWPQAVVEIFGSFRTGLFLPTSDIDLVVLG---LWEKLP---LRTLEYELVNRGIAEA 397
Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
V+ + A V I+K + + + VDISFN +G+ + ++K R ++GK L +
Sbjct: 398 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 457
Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSL 253
+L++ + G IS+Y+L +M CIS +
Sbjct: 458 FLLLRD-------LNEVFTGGISSYSLILM-----------------CISF-----LQLH 488
Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN----------- 302
P + N G LL E + + I +++N G A L
Sbjct: 489 PRGIYHDTTNLGVLLLEFFELYGRRFNYMRIGI-SIKNGGRYMAKDELQRDMVDGHRPSL 547
Query: 303 --IVDPLKDNNNLGRSVSKGNFH---RIRCA 328
I DPL N++GRS S G FH +CA
Sbjct: 548 LCIEDPLTPGNDIGRS-SYGVFHVQQAFKCA 577
>gi|49899785|gb|AAH76872.1| LOC445836 protein, partial [Xenopus laevis]
Length = 563
Score = 46.2 bits (108), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 83/185 (44%), Gaps = 25/185 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 71 EEIIDFYKYMSPRPEEEKMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 130
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D++SV ++ D VP I+K S + V
Sbjct: 131 VFGKWENLPLWTLEEALRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKV 180
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
DISFN G+ A ++ I K + V+++K + R L + G I +Y+L
Sbjct: 181 DISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 235
Query: 221 EMMVL 225
+M +
Sbjct: 236 FLMAV 240
>gi|330792667|ref|XP_003284409.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
gi|325085656|gb|EGC39059.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
Length = 1460
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 85/166 (51%), Gaps = 17/166 (10%)
Query: 72 EVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKIL 128
+VI +I+ ++ + ++ FGS +LP DID+ V N ++ A+F+ K+L
Sbjct: 1091 QVIGWIRAVVKKLWSHADLDLFGSFMTGLWLPSSDIDIVVNYGNNMSIKPKNAQFLLKVL 1150
Query: 129 ENEDQDSV--FEIKDVQYVPAQVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFLEK- 179
E + ++ + F + V A++ ++K + +NI VDISF + G++A +
Sbjct: 1151 EKQIRNDLDGFILSMVCIPSAKIPVIKLVTTENISVDISFRESPTSIHTGIAARDLIADC 1210
Query: 180 VDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
V ++G L+ +++L W E + + G +S+Y L +M++
Sbjct: 1211 VKDVVG---LYPLAIVL--KWFLRERGLNNTYTGGLSSYCLVLMLI 1251
>gi|426246851|ref|XP_004017201.1| PREDICTED: LOW QUALITY PROTEIN: DNA polymerase sigma [Ovis aries]
Length = 602
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 80/180 (44%), Gaps = 32/180 (17%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEEDLAR 122
R+EV+ I+ ++ + +V FGS YLP DIDL V G Q +E+ L +
Sbjct: 11 RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALRK 70
Query: 123 FVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKV 180
N + ++ D VP I+K + Q + VDISFN G+ A F+
Sbjct: 71 ------HNVAEPGSIKVLDKATVP----IIKLTDQETEVKVDISFNMETGVRAAEFI--- 117
Query: 181 DRLIGKDHLFKRS----VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
K+++ K S +IL+ + + G IS+Y+L +M + FL + D
Sbjct: 118 -----KNYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|351712688|gb|EHB15607.1| PAP-associated domain-containing protein 5, partial [Heterocephalus
glaber]
Length = 599
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 102 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 161
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 162 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 211
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 212 DISFNVQNGVRA 223
>gi|301776096|ref|XP_002923468.1| PREDICTED: DNA polymerase sigma-like [Ailuropoda melanoleuca]
Length = 656
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 82/190 (43%), Gaps = 12/190 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + +V FGS YLP DIDL
Sbjct: 105 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDL 164
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 165 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 220
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
G+ A E + + K L ++++K + + + G IS+Y+L +M +
Sbjct: 221 METGVRA---AELIKNYMKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS 276
Query: 227 RFLDYYNTFD 236
FL + D
Sbjct: 277 -FLQLHPRID 285
>gi|338718783|ref|XP_001501238.3| PREDICTED: DNA polymerase sigma [Equus caballus]
Length = 574
Score = 46.2 bits (108), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 83/190 (43%), Gaps = 12/190 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P R+EV+ I+ ++ + +V FGS YLP DIDL
Sbjct: 23 HEEIIDFYNFMSPCPEEAAMRREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDL 82
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFN 166
V G E + + + L + IK + A V I+K + Q + VDISFN
Sbjct: 83 VVFG--KWERPPLQLLEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFN 138
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
G+ A F++ + K L ++++K + + + G IS+Y+L +M +
Sbjct: 139 METGVRAAEFIKN---YMKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAI- 193
Query: 227 RFLDYYNTFD 236
FL + D
Sbjct: 194 SFLQLHPRID 203
>gi|281306711|ref|NP_941002.2| DNA polymerase sigma isoform 1 [Mus musculus]
gi|148705067|gb|EDL37014.1| polymerase (DNA directed) sigma [Mus musculus]
Length = 542
Score = 46.2 bits (108), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|162317662|gb|AAI56330.1| PAP associated domain containing 5 [synthetic construct]
gi|162318878|gb|AAI57080.1| PAP associated domain containing 5 [synthetic construct]
Length = 442
Score = 46.2 bits (108), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 57/123 (46%), Gaps = 19/123 (15%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH----- 113
+ P EK R EV+N I+ +I + +V FGS YLP DIDL V G
Sbjct: 1 MSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP 60
Query: 114 -QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAG 170
+EE L + D+DSV ++ D VP I+K S + VDISFN G
Sbjct: 61 LWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNG 110
Query: 171 LSA 173
+ A
Sbjct: 111 VRA 113
>gi|384246771|gb|EIE20260.1| Nucleotidyltransferase [Coccomyxa subellipsoidea C-169]
Length = 454
Score = 46.2 bits (108), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 59/260 (22%), Positives = 109/260 (41%), Gaps = 54/260 (20%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKT 99
S++ E ++ I + + + P R + +Q + +V+PFGS
Sbjct: 186 SVNPECTARLQQEIVDFVANVAPTWEESNLRDAALGRVQGACAMMHLYDVYPFGSKASGL 245
Query: 100 YLPDGDIDLTVLGHQNVEED---------------LARFVCKILENEDQDSVFEIKDVQY 144
L + DID+ VLG +D L + V ++ + F I+
Sbjct: 246 ELWNSDIDVVVLGIVEPSKDNLGYTTEEKVPVNDVLGKIVQQLRRSNSVRKTFHIRQ--- 302
Query: 145 VPAQVKIVKC-SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYY 203
++V I+KC +V+ + VD+S N G+ A FL +D+ + L + ++LIKA
Sbjct: 303 --SRVPIIKCTTVEGVDVDVSVNGDRGICAAQFL--IDQQARRPAL-RPLILLIKAVL-- 355
Query: 204 ESRILG---AHYGLISTYALEMMVLHR--------------------FLDYYNT-FDWDN 239
++LG G + +++L MV+ FL Y T F++D
Sbjct: 356 --KVLGLGDVSQGGLGSFSLANMVIAHLQEEEKVGRGQENLGVSLLAFLLRYGTYFNYDQ 413
Query: 240 YCISI--NGPVAISSLPEIV 257
+ ++I G V+ +++P V
Sbjct: 414 HVVAIGRGGIVSRTAVPGAV 433
>gi|432104646|gb|ELK31258.1| DNA polymerase sigma [Myotis davidii]
Length = 513
Score = 46.2 bits (108), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 81/172 (47%), Gaps = 16/172 (9%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G R ++L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG------KWERPPLQLL 64
Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
E + +V E ++ + A V I+K + Q + VDISFN G+ A F++ +
Sbjct: 65 EQALRKHNVAEPGSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YM 121
Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
K L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 122 KKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAI-SFLQLHPRID 171
>gi|281306714|ref|NP_001162602.1| DNA polymerase sigma isoform 2 [Mus musculus]
gi|74220251|dbj|BAE31304.1| unnamed protein product [Mus musculus]
Length = 541
Score = 46.2 bits (108), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|417402533|gb|JAA48112.1| Putative dna polymerase sigma [Desmodus rotundus]
Length = 542
Score = 46.2 bits (108), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPGSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|344255499|gb|EGW11603.1| DNA polymerase sigma [Cricetulus griseus]
Length = 534
Score = 46.2 bits (108), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 2 RREVVTRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 59
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 60 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 114
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 115 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 162
>gi|426199822|gb|EKV49746.1| hypothetical protein AGABI2DRAFT_63272 [Agaricus bisporus var.
bisporus H97]
Length = 481
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 77/185 (41%), Gaps = 12/185 (6%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYLPDGDI 106
+ + ++ + P + ++ R+ + I R I + G +VFPFGS K YLP GDI
Sbjct: 142 MLHKEVEAFTKWMAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDI 201
Query: 107 DLTV----LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIP 160
DL + + + N L + +V I A+V IVK
Sbjct: 202 DLVIVSDSMAYSNKSSVLHSLASVLRRAGIASNVTVI-----AKAKVPIVKFVTIHGRFN 256
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
VDIS NQ G+ ++ + + L RS++LI + + G + +Y++
Sbjct: 257 VDISINQTNGIVGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSI 316
Query: 221 EMMVL 225
+ +
Sbjct: 317 VCLAI 321
>gi|60392923|sp|Q6PB75.1|PAPD7_MOUSE RecName: Full=DNA polymerase sigma; AltName: Full=PAP-associated
domain-containing protein 7
gi|37590586|gb|AAH59846.1| Polymerase (DNA directed) sigma [Mus musculus]
Length = 542
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|255550972|ref|XP_002516534.1| Poly(A) polymerase beta, putative [Ricinus communis]
gi|223544354|gb|EEF45875.1| Poly(A) polymerase beta, putative [Ricinus communis]
Length = 754
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 56/251 (22%), Positives = 102/251 (40%), Gaps = 53/251 (21%)
Query: 35 NLYPFSIDAELWLLAEERI--QEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVF 90
+L F +D+ L+ EE + +E+LC I IV + K+ Y ++++ + +F
Sbjct: 48 DLVKFLVDSGLYESKEENLKREEVLCRID-QIVKDWIKQLTRQRGYSDQMVDEANAV-IF 105
Query: 91 PFGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-A 147
FGS L + P DID +G + N EED + IL D EI ++Q VP A
Sbjct: 106 TFGSYRLGVHGPGADIDTLCVGPSYVNREEDFFILLHDILAEMD-----EITELQPVPDA 160
Query: 148 QVKIVKCSVQNIPVDISFNQMA-------------------------GLSALCFLEKVDR 182
V ++K Q I +D+ + ++ L+ +++ +
Sbjct: 161 HVPVMKFKFQGISIDLLYASISLPVVPEDLDISHGSVLYDVDEQTVRSLNGCRVADQILK 220
Query: 183 LIGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTY-------ALEMMVLHRF 228
L+ F+ ++ +K W + G ++ L+ A+ M++ RF
Sbjct: 221 LVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVARLCQLYPNAIPSMLVSRF 280
Query: 229 LDYYNTFDWDN 239
Y + W N
Sbjct: 281 FRVYTQWRWPN 291
>gi|51328369|gb|AAH80314.1| Papd5 protein [Mus musculus]
Length = 583
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 86 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 145
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 146 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 195
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 196 DISFNVQNGVRA 207
>gi|395839409|ref|XP_003792582.1| PREDICTED: PAP-associated domain-containing protein 5 [Otolemur
garnettii]
Length = 629
Score = 46.2 bits (108), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 132 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 191
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 192 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 241
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 242 DISFNVQNGVRA 253
>gi|326927263|ref|XP_003209812.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Meleagris gallopavo]
Length = 571
Score = 46.2 bits (108), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 58/199 (29%), Positives = 90/199 (45%), Gaps = 26/199 (13%)
Query: 37 YPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSV 95
Y F + ++L EE I + + P E+ R EV+N I+ +I + +V FGS
Sbjct: 60 YVFPVWVPEFVLHEE-IIDFYKYMSPRHEEERMRMEVVNRIENVIKELWPNADVQIFGSF 118
Query: 96 PLKTYLPDGDIDLTVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQV 149
YLP DIDL V G +EE L + D++SV ++ D VP
Sbjct: 119 KTGLYLPTSDIDLVVFGKWETLPLWTLEEALRKHNVA-----DENSV-KVLDKATVP--- 169
Query: 150 KIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRI 207
I+K S + VDISFN G+ A ++ I K + V+++K + R
Sbjct: 170 -IIKLTDSFTEVKVDISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRD 223
Query: 208 LGAHY-GLISTYALEMMVL 225
L + G I +Y+L +M +
Sbjct: 224 LNEVFTGGIGSYSLFLMAV 242
>gi|403282221|ref|XP_003932554.1| PREDICTED: DNA polymerase sigma [Saimiri boliviensis boliviensis]
Length = 542
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|284507293|ref|NP_001165276.1| DNA polymerase sigma isoform 2 [Homo sapiens]
gi|219518042|gb|AAI43883.1| POLS protein [Homo sapiens]
Length = 541
Score = 46.2 bits (108), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|5902142|ref|NP_008930.1| DNA polymerase sigma isoform 1 [Homo sapiens]
gi|397472993|ref|XP_003808011.1| PREDICTED: DNA polymerase sigma [Pan paniscus]
gi|60392922|sp|Q5XG87.2|PAPD7_HUMAN RecName: Full=DNA polymerase sigma; AltName: Full=DNA polymerase
kappa; AltName: Full=LAK-1; AltName: Full=PAP-associated
domain-containing protein 7; AltName: Full=Terminal
uridylyltransferase 5; Short=TUTase 5; AltName:
Full=Topoisomerase-related function protein 4-1;
Short=TRF4-1
gi|5139669|dbj|BAA24434.2| LAK-1 [Homo sapiens]
gi|47777665|gb|AAT38110.1| polymerase (DNA directed) sigma [Homo sapiens]
gi|109658946|gb|AAI17138.1| Polymerase (DNA directed) sigma [Homo sapiens]
gi|116496943|gb|AAI26107.1| Polymerase (DNA directed) sigma [Homo sapiens]
gi|119628506|gb|EAX08101.1| polymerase (DNA directed) sigma [Homo sapiens]
gi|127799472|gb|AAH84567.2| Polymerase (DNA directed) sigma [Homo sapiens]
gi|158260737|dbj|BAF82546.1| unnamed protein product [Homo sapiens]
gi|313883146|gb|ADR83059.1| PAP associated domain containing 7 [synthetic construct]
gi|410210372|gb|JAA02405.1| PAP associated domain containing 7 [Pan troglodytes]
gi|410259248|gb|JAA17590.1| PAP associated domain containing 7 [Pan troglodytes]
gi|410303968|gb|JAA30584.1| PAP associated domain containing 7 [Pan troglodytes]
gi|410337733|gb|JAA37813.1| PAP associated domain containing 7 [Pan troglodytes]
Length = 542
Score = 46.2 bits (108), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|386782099|ref|NP_001248228.1| DNA polymerase sigma [Macaca mulatta]
gi|355691204|gb|EHH26389.1| DNA polymerase sigma [Macaca mulatta]
gi|355749808|gb|EHH54146.1| DNA polymerase sigma [Macaca fascicularis]
gi|380809116|gb|AFE76433.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
gi|383415413|gb|AFH30920.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
gi|384945014|gb|AFI36112.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
Length = 542
Score = 46.2 bits (108), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|326917206|ref|XP_003204892.1| PREDICTED: DNA polymerase sigma-like [Meleagris gallopavo]
Length = 429
Score = 46.2 bits (108), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 81/171 (47%), Gaps = 14/171 (8%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ +I + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ + IK + A V I+K + Q + VDISFN G+ A F+++ + K
Sbjct: 69 RKHNVAEPYSIKVLD--KATVPIIKLTDQETEVKVDISFNVETGVKAARFIKE---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + R L + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFLL--QRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|387273371|gb|AFJ70180.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
Length = 542
Score = 45.8 bits (107), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|410949807|ref|XP_003981609.1| PREDICTED: DNA polymerase sigma [Felis catus]
Length = 542
Score = 45.8 bits (107), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 77/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A E + + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRA---AELIKNYMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|118794510|ref|XP_321547.3| AGAP001564-PA [Anopheles gambiae str. PEST]
gi|19572384|emb|CAD27930.1| putative DNA topoisomerase [Anopheles gambiae]
gi|116116329|gb|EAA01236.4| AGAP001564-PA [Anopheles gambiae str. PEST]
Length = 1039
Score = 45.8 bits (107), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 115/271 (42%), Gaps = 37/271 (13%)
Query: 70 RKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R V++ I++++ N + V FGS YLP DIDL V+G + + L
Sbjct: 292 RVMVVSRIEKIVQNLWPSARVEMFGSFRTGLYLPTSDIDLVVIGQWTM------LPLRTL 345
Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
E E + E V+ + A V IVK + + + VDISFN +G+ + ++ R
Sbjct: 346 EMELISQGIAEPNSVRVLDKASVPIVKLTDRQTQVKVDISFNMESGVQSAKLIKGFKR-- 403
Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISI 244
++ ++L+ + + G IS+Y+L +M + FL ++
Sbjct: 404 --EYPVLEKLVLVLKQFLLQRDLNEVFTGGISSYSLILMCI-SFLQQHH----------- 449
Query: 245 NGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKH---- 300
P A S+L ++ E E G + ++K + S E E H
Sbjct: 450 QKPNACSNLGVLLIEFFELYGRKF----NYMKIGISVKSGRYIPKEELQREMIDGHRPSL 505
Query: 301 LNIVDPLKDNNNLGRSVSKGNFHRIRCALSY 331
L I DPL N++GRS S G H ++ A Y
Sbjct: 506 LCIEDPLTPGNDIGRS-SYGALH-VKQAFEY 534
>gi|409081996|gb|EKM82354.1| hypothetical protein AGABI1DRAFT_52475, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 559
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 77/185 (41%), Gaps = 12/185 (6%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN-GYYGIEVFPFGSVPLKTYLPDGDI 106
+ + ++ + P + ++ R+ + I R I + G +VFPFGS K YLP GDI
Sbjct: 141 MLHKEVEAFTKWMAPTPIEDEIRELTVQMISRAITTAFSGSKVFPFGSYETKLYLPSGDI 200
Query: 107 DLTV----LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIP 160
DL + + + N L + +V I A+V IVK
Sbjct: 201 DLVIVSDSMAYSNKSSVLHSLASVLRRAGIASNVTVI-----AKAKVPIVKFVTIHGRFN 255
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
VDIS NQ G+ ++ + + L RS++LI + + G + +Y++
Sbjct: 256 VDISINQTNGIVGGQVIKGFLQNLVTGGLALRSLVLITKLFLSQRSMNEVFTGGLGSYSI 315
Query: 221 EMMVL 225
+ +
Sbjct: 316 VCLAI 320
>gi|357131279|ref|XP_003567266.1| PREDICTED: poly(A) RNA polymerase protein cid1-like [Brachypodium
distachyon]
Length = 595
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 80/190 (42%), Gaps = 17/190 (8%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQ---RLINGYYGIEVFPFGSVPLKTYLPDGDI 106
E+ I+EIL I+PA V KR I + + + G PFGS Y GD+
Sbjct: 9 EKCIKEILSQIKPAEVDRNKRLSAIKELDISIQSVAALKGAAAKPFGSFLSNLYSKSGDL 68
Query: 107 DLTV--LGHQNV------EEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
DL+V + N+ ++ + R + K L+ E VP ++ V S
Sbjct: 69 DLSVQLMNSSNLPVSKKKKQSILRVLRKALQRNGVAGYMEFIPHARVPV-LQYVSNSF-G 126
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
I D+S + G + L D F V+LIK W ++ I G +++Y
Sbjct: 127 ISCDLSIDNYPGRIKSRIFYWISTL---DERFGDMVLLIKEWAKCQN-INDPKTGTLNSY 182
Query: 219 ALEMMVLHRF 228
+L ++VL F
Sbjct: 183 SLCLLVLFHF 192
>gi|345325980|ref|XP_001507597.2| PREDICTED: PAP-associated domain-containing protein 5-like
[Ornithorhynchus anatinus]
Length = 578
Score = 45.8 bits (107), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 43/130 (33%), Positives = 60/130 (46%), Gaps = 15/130 (11%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 81 EEINDFYEYMSPRPEEEKMRMEVVNRIENVIKELWPTADVQIFGSFKTGLYLPTSDIDLV 140
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
V G +EE L + + D+ SV ++ D VP +K+ S + VDI
Sbjct: 141 VFGKWENLPLWTLEEALRKH-----KVADEHSV-KVLDKATVPI-IKLTD-SFTEVKVDI 192
Query: 164 SFNQMAGLSA 173
SFN G+ A
Sbjct: 193 SFNVQNGVKA 202
>gi|291411166|ref|XP_002721862.1| PREDICTED: DNA polymerase sigma [Oryctolagus cuniculus]
Length = 542
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKS---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|432908932|ref|XP_004078069.1| PREDICTED: uncharacterized protein LOC101163355 [Oryzias latipes]
Length = 779
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 84/200 (42%), Gaps = 32/200 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I + + P E R++V+N I+ +I + V FGS YLP DIDL
Sbjct: 218 HEEILDFFHFMSPKPEEEAMRRDVVNKIEGVIKDLWPTARVEIFGSFSTGLYLPTSDIDL 277
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIP 160
V G Q +E+ L + C + + +K + A V I+K + Q +
Sbjct: 278 VVFGKWEHPPLQELEQALKK--CNVA------GPYPVKVLDK--ASVPIIKLTDQESKVK 327
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV----ILIKAWCYYESRILGAHYGLIS 216
VDISFN + A F+ K +L K +V I + + + G IS
Sbjct: 328 VDISFNVETAVKAAQFI--------KSYLKKYAVLPPLIFVLKQFLLQRDLNEVFTGGIS 379
Query: 217 TYALEMMVLHRFLDYYNTFD 236
+Y+L +M + FL + D
Sbjct: 380 SYSLILMAI-SFLQLHPRID 398
>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
HHB-10118-sp]
Length = 478
Score = 45.8 bits (107), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 60/139 (43%), Gaps = 12/139 (8%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
+ + ++ + I P ++ R ++ I R + + V PFGS K YLP GDI
Sbjct: 152 MLHKEVEAFVKYISPTQEEDEIRSLIVESISRAVTKAFPDARVLPFGSYETKLYLPLGDI 211
Query: 107 DLTV----LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IP 160
DL + + + N L + D V I A+V I+K ++
Sbjct: 212 DLVIESDSMAYNNKVNVLQALATTMKRAGITDKVTII-----AKAKVPIIKFVTRHGRFS 266
Query: 161 VDISFNQMAGLSALCFLEK 179
VDIS NQM G+ A +++
Sbjct: 267 VDISLNQMNGVKAGTMIKR 285
>gi|395735628|ref|XP_002815457.2| PREDICTED: DNA polymerase sigma [Pongo abelii]
Length = 542
Score = 45.8 bits (107), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 69 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|354474676|ref|XP_003499556.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Cricetulus griseus]
Length = 464
Score = 45.8 bits (107), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 60/132 (45%), Gaps = 19/132 (14%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV++ I+ +I + +V FGS YLP DIDL
Sbjct: 14 EEISDFYEYMSPRPEEEKMRMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 73
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+DSV ++ D VP I+K S + V
Sbjct: 74 VFGKWENLPLWTLEEALRKHKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKV 123
Query: 162 DISFNQMAGLSA 173
DISFN G+ A
Sbjct: 124 DISFNVQNGVRA 135
>gi|61098442|ref|NP_001012968.1| DNA polymerase sigma [Gallus gallus]
gi|53130894|emb|CAG31774.1| hypothetical protein RCJMB04_10o8 [Gallus gallus]
Length = 429
Score = 45.8 bits (107), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 77/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ +I + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ + IK + A V I+K + Q + VDISFN G+ A F+++ K
Sbjct: 69 RKHNVAEPYSIKVLD--KATVPIIKLTDQETEVKVDISFNVETGVKAARFIKEY----MK 122
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
+ +IL+ + + G IS+Y+L +M + FL + D
Sbjct: 123 KYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|296475642|tpg|DAA17757.1| TPA: DNA polymerase sigma [Bos taurus]
Length = 541
Score = 45.8 bits (107), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 81/172 (47%), Gaps = 16/172 (9%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G R ++L
Sbjct: 11 RREVVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWE------RPPLQLL 64
Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLI 184
E + +V E ++ + A V I+K + Q + VDISFN G+ A F++ +
Sbjct: 65 EQALRKHNVAEPGSIKVLDKATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YM 121
Query: 185 GKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
K L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 122 KKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|449493801|ref|XP_002190556.2| PREDICTED: DNA polymerase sigma [Taeniopygia guttata]
Length = 429
Score = 45.8 bits (107), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 81/171 (47%), Gaps = 14/171 (8%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ +I + +V FGS YLP DIDL V G E + + + L
Sbjct: 11 RREVVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 68
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ + IK + A V I+K + Q + VDISFN G+ A F+++ + K
Sbjct: 69 RKHNVAEPYSIKVLD--KATVPIIKLTDQETEVKVDISFNVETGVKAARFIKE---YMKK 123
Query: 187 DHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + R L + G IS+Y+L +M + FL + D
Sbjct: 124 YSLLPYLILVLKQFLL--QRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 171
>gi|344230457|gb|EGV62342.1| hypothetical protein CANTEDRAFT_126141 [Candida tenuis ATCC 10573]
Length = 615
Score = 45.8 bits (107), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 126/305 (41%), Gaps = 42/305 (13%)
Query: 32 SPPNLYPF------SIDAEL--WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING 83
+P +LYP+ S E+ WL E I++ + I P+ R V+ +++ I
Sbjct: 157 TPTSLYPWIKAHDHSKQKEIADWLTME--IKDFVSYISPSKEEIMARNSVVKTLKQQIKV 214
Query: 84 YY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIK 140
+ E FGS YLP DID+ V+ E+ + + L ++ K
Sbjct: 215 CWPDAEAHVFGSFATDLYLPGSDIDMVVVSKNGDCENRHKLYQLSSFLRSKKL-----AK 269
Query: 141 DVQYVP-AQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK-VDRLIGKDHLFKRSVIL 196
D++ + A+V I+K NI +DISF + GL A + K ++ G R ++L
Sbjct: 270 DIEVIAGAKVPIIKFVDPKTNIHLDISFERTNGLDAARRIRKWLETTAG-----LRELVL 324
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEI 256
+ ++ H G + YA ++ H F+ + +N S N + E+
Sbjct: 325 VVKQFLRSRKLNNVHVGGLGGYATIILCYH-FIKMHPRVSTENMTASENLGTLLIEFFEL 383
Query: 257 VAETVENDG-----DELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNN 311
D D SP++LKK Y P +L + + FA L + DP N
Sbjct: 384 YGRNFSFDNLVIAFDGKTDSPKYLKK----YKYP--SLIASRNPFA---LVVQDPDDPGN 434
Query: 312 NLGRS 316
N+ RS
Sbjct: 435 NITRS 439
>gi|195163618|ref|XP_002022646.1| GL14678 [Drosophila persimilis]
gi|194104669|gb|EDW26712.1| GL14678 [Drosophila persimilis]
Length = 1197
Score = 45.8 bits (107), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 131/331 (39%), Gaps = 61/331 (18%)
Query: 22 SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
+H + + Y P P E + E I+ + P R EV+ I+ ++
Sbjct: 285 NHMAMIAKYKGEPWRKPDYPYGEGVVGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIENVV 344
Query: 82 NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQD-SVFEI 139
+ + V FGS +LP DIDL VLG + E L + LE E + + E
Sbjct: 345 HTIWPQAVVEIFGSFRTGLFLPTSDIDLVVLG---LWEKLP---LRTLEYELVNRGIAEA 398
Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
V+ + A V I+K + + + VDISFN +G+ + ++K R ++GK L +
Sbjct: 399 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 458
Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSL 253
+L++ + G IS+Y+L +M CIS +
Sbjct: 459 FLLLRD-------LNEVFTGGISSYSLILM-----------------CISF-----LQLH 489
Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN----------- 302
P + N G LL E + + I +++N G A L
Sbjct: 490 PRGIYHDTTNLGVLLLEFFELYGRRFNYMRIGI-SIKNGGRYMAKDELQRDMVDGHRPSL 548
Query: 303 --IVDPLKDNNNLGRSVSKGNFH---RIRCA 328
I DPL N++GRS S G FH +CA
Sbjct: 549 LCIEDPLTPGNDIGRS-SYGVFHVQQAFKCA 578
>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora
B]
Length = 547
Score = 45.8 bits (107), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 55/122 (45%), Gaps = 14/122 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNV-- 116
I P ++ R V+ I+R I + +V PFGS K YLP GDIDL + H N
Sbjct: 183 ISPTPQEDEVRSLVVELIRRAITRQFPDAQVLPFGSYETKLYLPLGDIDLVI--HSNTMA 240
Query: 117 ---EEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSV--QNIPVDISFNQMAGL 171
+E++ R + L +I A+V IVK VDIS NQ G+
Sbjct: 241 YSDKENVLRALANTLRRAGITDNVKI----IAKAKVPIVKFVTIHGRFSVDISINQGNGV 296
Query: 172 SA 173
+A
Sbjct: 297 AA 298
>gi|359486610|ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera]
gi|296086183|emb|CBI31624.3| unnamed protein product [Vitis vinifera]
Length = 453
Score = 45.8 bits (107), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 45/188 (23%), Positives = 83/188 (44%), Gaps = 24/188 (12%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
+++IL I P+ R ++I + ++ G V PFGS Y GD+D++
Sbjct: 11 LKDILLVINPSREDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNLYTQWGDLDIS 70
Query: 110 V---------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ-- 157
+ + ++ L V L ++ + +Q++P A+V I+K
Sbjct: 71 IELPNGAYISSAGKRHKQTLLGHVLNALRSKG-----GWRKLQFIPNARVPIIKFESYHP 125
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
NI D+S N + G FL + G D F+ V+L+K W I + G +++
Sbjct: 126 NISCDVSINNLKGQMKSKFLFWIS---GIDGRFRDLVLLVKEWARAHD-INNSKTGTLNS 181
Query: 218 YALEMMVL 225
Y+L ++V+
Sbjct: 182 YSLSLLVV 189
>gi|5565687|gb|AAD45198.1|AF089896_1 topoisomerase-related function protein [Homo sapiens]
Length = 517
Score = 45.8 bits (107), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 47/170 (27%), Positives = 78/170 (45%), Gaps = 12/170 (7%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ I+ ++ + +V FGS YLP DIDL V G E + + + L
Sbjct: 51 RREVVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKW--ERPPLQLLEQAL 108
Query: 129 ENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGK 186
+ IK + A V I+K + Q + VDISFN G+ A F++ + K
Sbjct: 109 RKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKN---YMKK 163
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 164 YSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 211
>gi|448531596|ref|XP_003870285.1| Pap1 poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380354639|emb|CCG24155.1| Pap1 poly(A) polymerase [Candida orthopsilosis]
Length = 557
Score = 45.4 bits (106), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 57/224 (25%)
Query: 69 KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
KR EV+ +Q+L + G ++F FGS L Y P DID L V
Sbjct: 51 KRVEVLTTLQKLTEDFVYEVSLRKNMTQGMAKDAGGKIFTFGSYRLGVYGPSSDIDALVV 110
Query: 111 LGHQNVEEDLARFVCKIL----ENEDQDSVFE----IKDVQYVPAQVKIV--KCSVQNIP 160
+ ED KIL E E+ +SV E I +++ + ++ K + +P
Sbjct: 111 VPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVPIIKLEFAGISIDLLFAKLDIPRVP 170
Query: 161 VDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
D++ + M L+ +++ RL+ K +FK ++ IK W + R +
Sbjct: 171 HDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKPTVFKNALRFIKMWA--QQRAVY 228
Query: 210 AH-YGLISTYALEMMV---------------LHRFLDYYNTFDW 237
A+ YG A M+V L +F Y+ ++W
Sbjct: 229 ANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWNW 272
>gi|406701338|gb|EKD04487.1| hypothetical protein A1Q2_01263 [Trichosporon asahii var. asahii
CBS 8904]
Length = 624
Score = 45.4 bits (106), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 78/170 (45%), Gaps = 23/170 (13%)
Query: 70 RKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTV----LGHQNVEEDLARFV 124
RK +I+ I +I + V PFGS + YLP GDIDL V L +N L +
Sbjct: 126 RKTMIDLITHIIRKEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLSEKNKVTMLHQLA 185
Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEK-VD 181
+ N ++V I A+V I+K + I VDIS NQ G+SA+ + +
Sbjct: 186 RMMRGNHITETVAVI-----TRAKVPIIKFVTAEGGINVDISLNQTNGVSAVKIVNHYLK 240
Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY-----ALEMMVLH 226
L G L +++IKA+ S + + G + +Y AL + +H
Sbjct: 241 ALPGAREL----ILVIKAFLSQRS-MNEVYTGGLGSYSVICLALSFLQMH 285
>gi|324975490|gb|ADY62673.1| PAPa [Candida orthopsilosis]
Length = 372
Score = 45.4 bits (106), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 57/224 (25%)
Query: 69 KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
KR EV+ +Q+L + G ++F FGS L Y P DID L V
Sbjct: 46 KRVEVLTTLQKLTEDFVYEVSLRKNMTQGMAKDAGGKIFTFGSYRLGVYGPSSDIDALVV 105
Query: 111 LGHQNVEEDLARFVCKIL----ENEDQDSVFE----IKDVQYVPAQVKIV--KCSVQNIP 160
+ ED KIL E E+ +SV E I +++ + ++ K + +P
Sbjct: 106 VPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVPIIKLEFAGISIDLLFAKLDIPRVP 165
Query: 161 VDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
D++ + M L+ +++ RL+ K +FK ++ IK W + R +
Sbjct: 166 HDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKPTVFKNALRFIKMWA--QQRAVY 223
Query: 210 AH-YGLISTYALEMMV---------------LHRFLDYYNTFDW 237
A+ YG A M+V L +F Y+ ++W
Sbjct: 224 ANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWNW 267
>gi|324975506|gb|ADY62687.1| PAPa [Candida orthopsilosis]
Length = 552
Score = 45.4 bits (106), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 54/224 (24%), Positives = 90/224 (40%), Gaps = 57/224 (25%)
Query: 69 KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
KR EV+ +Q+L + G ++F FGS L Y P DID L V
Sbjct: 46 KRVEVLTTLQKLTEDFVYEVSLRKNMTQGMAKDAGGKIFTFGSYRLGVYGPSSDIDALVV 105
Query: 111 LGHQNVEEDLARFVCKIL----ENEDQDSVFE----IKDVQYVPAQVKIV--KCSVQNIP 160
+ ED KIL E E+ +SV E I +++ + ++ K + +P
Sbjct: 106 VPRHVTREDFFTVFEKILRGRPELEEINSVKEAFVPIIKLEFAGISIDLLFAKLDIPRVP 165
Query: 161 VDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILG 209
D++ + M L+ +++ RL+ K +FK ++ IK W + R +
Sbjct: 166 HDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKPTVFKNALRFIKMWA--QQRAVY 223
Query: 210 AH-YGLISTYALEMMV---------------LHRFLDYYNTFDW 237
A+ YG A M+V L +F Y+ ++W
Sbjct: 224 ANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWNW 267
>gi|449017212|dbj|BAM80614.1| hypothetical protein CYME_CMK272C [Cyanidioschyzon merolae strain
10D]
Length = 1647
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 46/127 (36%), Gaps = 26/127 (20%)
Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
+V+C + N L CFL + D LIG+ HL R +IL+K W + S
Sbjct: 1160 VVRCRTNGLTTQFLLNPAVALCRSCFLVECDELIGRRHLLIRCLILLKVW-WRHSLATAQ 1218
Query: 211 HYGLISTYALEMM------------------------VLHRFLDYYN-TFDWDNYCISIN 245
L+S + ++ VL +Y DW +SI
Sbjct: 1219 ARALLSPLSGSLVSPFLALLLLSYLNCRGLPGDEPAHVLQGLFSFYGFDMDWSRCGMSIY 1278
Query: 246 GPVAISS 252
GP I S
Sbjct: 1279 GPFDIQS 1285
>gi|401882466|gb|EJT46724.1| hypothetical protein A1Q1_04689 [Trichosporon asahii var. asahii
CBS 2479]
Length = 631
Score = 45.4 bits (106), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 51/170 (30%), Positives = 78/170 (45%), Gaps = 23/170 (13%)
Query: 70 RKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTV----LGHQNVEEDLARFV 124
RK +I+ I +I + V PFGS + YLP GDIDL V L +N L +
Sbjct: 126 RKTMIDLITHIIRKEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLSEKNKVTMLHQLA 185
Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDISFNQMAGLSALCFLEK-VD 181
+ N ++V I A+V I+K + I VDIS NQ G+SA+ + +
Sbjct: 186 RMMRGNHITETVAVI-----TRAKVPIIKFVTAEGGINVDISLNQTNGVSAVKIVNHYLK 240
Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY-----ALEMMVLH 226
L G L +++IKA+ S + + G + +Y AL + +H
Sbjct: 241 ALPGAREL----ILVIKAFLSQRS-MNEVYTGGLGSYSVICLALSFLQMH 285
>gi|427795543|gb|JAA63223.1| Putative pap-associated domain-containing protein 5, partial
[Rhipicephalus pulchellus]
Length = 627
Score = 45.4 bits (106), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 76/177 (42%), Gaps = 9/177 (5%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
+ I++ +QP + R VI I+ +I G + EV FGS YLP DID+
Sbjct: 144 DEIEDFYRYMQPTPAEHQMRLGVIQRIKDVILGLWPQAEVEIFGSFRTGLYLPTSDIDVV 203
Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMA 169
VLG + + ++ D VP VK+ + VDISFN
Sbjct: 204 VLGKWETLPMWTLEKALLSHGIAEPQSIKVLDKASVPI-VKLTDAKT-TVKVDISFNMNN 261
Query: 170 GLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
G+ + C ++ K + V+++K + R L + G IS+Y+L +M +
Sbjct: 262 GVKSACLIQSFKE---KFPALPKLVLVLKQ--FLLQRDLNEVFTGGISSYSLILMTV 313
>gi|42565594|ref|NP_190161.2| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|30793987|gb|AAP40443.1| unknown protein [Arabidopsis thaliana]
gi|110739217|dbj|BAF01523.1| hypothetical protein [Arabidopsis thaliana]
gi|332644545|gb|AEE78066.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 682
Score = 45.4 bits (106), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 92/210 (43%), Gaps = 27/210 (12%)
Query: 36 LYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI--EVFP-- 91
L + ID+ + L ++ + ++ C+ +P RKE++ + + YG E P
Sbjct: 30 LKRYKIDSYILLDLDKVLNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEESSPVL 89
Query: 92 --FGSVPLKTYLPDGDIDLTV-LGHQNVE-------EDLARFVCKI--LENEDQDSVFEI 139
+GS + Y D+D+++ G+ E E L RF K+ L+ E Q +
Sbjct: 90 EAYGSFVMDMYSSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQ-----V 144
Query: 140 KDVQYV-PAQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
K+V+ + A+V IVK S Q V D+S G+ + + ++ G+ F++ +L
Sbjct: 145 KNVESIFSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIISQIDGR---FQKLCLL 201
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLH 226
+K W H L S ++ LH
Sbjct: 202 VKHWAKAHEVNSALHRTLNSVSITLLVALH 231
>gi|452988727|gb|EME88482.1| hypothetical protein MYCFIDRAFT_159726 [Pseudocercospora fijiensis
CIRAD86]
Length = 658
Score = 45.4 bits (106), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 14/158 (8%)
Query: 32 SPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGY---YGIE 88
+P N Y F+ + L L E I + + P + R++++ + I+G+ I
Sbjct: 196 APANGYDFAANVGLHLHRE--ILDFFDYVSPRDYEKNCREDLVKRVDYFISGFGRGRSIS 253
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL-----ARFVCKILENE-DQDSVFEIKDV 142
V PFGS YL D DIDL Q + +R L NE + + + +DV
Sbjct: 254 VQPFGSYASGLYLTDADIDLVATSSQYMRNGTKVFCQSRTQMNKLSNELRKTGMAKNRDV 313
Query: 143 -QYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFL 177
+V ++K S NI VDISF +GL+ + L
Sbjct: 314 ITLTRTKVPLIKFVDSRTNIKVDISFENDSGLNTIPTL 351
>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 316
Score = 45.4 bits (106), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 47/188 (25%), Positives = 87/188 (46%), Gaps = 10/188 (5%)
Query: 39 FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPL 97
ID +L + E + + + P+ ++ R +V I+ +I G + VF FGS
Sbjct: 6 LKIDWQL-SMTETSTHDFVNFVTPSKEDKEIRNKVATSIEEVIKGVFPDCHVFVFGSCAT 64
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ 157
LP+ DIDL V E + V + + + ++ VP +KI
Sbjct: 65 GLNLPNSDIDLIVYQPDVSESRMITKVADAIVRQKKCKTIDVLKNTKVPL-IKITDSEF- 122
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLIS 216
+ VDISFN+ G+ + ++++ ++ + K ++++K C+ +SR L Y G +
Sbjct: 123 GVNVDISFNRTNGVYCVKLVKQLLQMFPE---LKPLMMVLK--CFLKSRQLNEPYSGGVG 177
Query: 217 TYALEMMV 224
++ L MMV
Sbjct: 178 SFLLTMMV 185
>gi|195115910|ref|XP_002002499.1| GI12386 [Drosophila mojavensis]
gi|193913074|gb|EDW11941.1| GI12386 [Drosophila mojavensis]
Length = 348
Score = 45.4 bits (106), Expect = 0.100, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 88/186 (47%), Gaps = 25/186 (13%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG---IEVFPFGSVPLKTYLPDGDI 106
E I+ + P R E+++ ++R+I G + +E+F GS L LP+ DI
Sbjct: 27 HEEIEHFYRYMLPTPTEHAARIELLSRVERVIQGLWPEALVEIF--GSFRLGINLPNSDI 84
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDS------VFEIKDVQYVPAQVKIVKCSVQNIP 160
DL VLG E L + LE+E + S ++ D VP ++ C ++
Sbjct: 85 DLVVLG---CWEHLP---LRSLESELRSSGIVLPGTLQVVDTAAVPI-IRFTDCET-HLK 136
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYA 219
VDISFN G+ + E + + + + + + V+++K + E R L + G IS+Y
Sbjct: 137 VDISFNMPNGIDS---SELIKKFLHEHPVLGKLVLVLKQ--FLEQRNLNSTLNGGISSYN 191
Query: 220 LEMMVL 225
L +M +
Sbjct: 192 LIIMCI 197
>gi|310799736|gb|EFQ34629.1| hypothetical protein GLRG_09773 [Glomerella graminicola M1.001]
Length = 756
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 103/245 (42%), Gaps = 48/245 (19%)
Query: 42 DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLK 98
D +WL E I + ++P + R +++ ++R + Y +V PFGS
Sbjct: 423 DMSVWLHKE--IIDFYEVVRPRDFEHEMRTQLVERLRRSLKTSHFYKDCDVRPFGSYMSG 480
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARFV-CKILENED----QDSVFEIKDVQYVP-AQVKIV 152
YLP D+DL V ++ + F K L N Q+ V ++++ A+V +V
Sbjct: 481 LYLPTADMDLVVCARSWLDGAHSNFFGMKALRNFGKFLAQNKVTHYNTMEFIASAKVPLV 540
Query: 153 KC--SVQNIPVDISFNQMAGLSAL---------------------CFL------EKVDRL 183
K ++ + VDISF+++ G A+ FL E V+
Sbjct: 541 KYIDNITGLRVDISFDRLDGPQAVKTFAEWKEQYPAMPILVTMIKHFLAMRGLNEPVNGG 600
Query: 184 IGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYY-NTFDWDNYCI 242
IG + V +++ +SR LI + L M++ FLD Y N FD+ N I
Sbjct: 601 IGSFTVTCMVVSMLQLMPQVQSR------NLIPEHHLGEMMME-FLDLYGNRFDYVNTAI 653
Query: 243 SINGP 247
+N P
Sbjct: 654 RMNPP 658
>gi|7019641|emb|CAB75788.1| putative protein [Arabidopsis thaliana]
Length = 690
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 92/210 (43%), Gaps = 27/210 (12%)
Query: 36 LYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI--EVFP-- 91
L + ID+ + L ++ + ++ C+ +P RKE++ + + YG E P
Sbjct: 30 LKRYKIDSYILLDLDKVLNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEESSPVL 89
Query: 92 --FGSVPLKTYLPDGDIDLTV-LGHQNVE-------EDLARFVCKI--LENEDQDSVFEI 139
+GS + Y D+D+++ G+ E E L RF K+ L+ E Q +
Sbjct: 90 EAYGSFVMDMYSSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQ-----V 144
Query: 140 KDVQYV-PAQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
K+V+ + A+V IVK S Q V D+S G+ + + ++ G+ F++ +L
Sbjct: 145 KNVESIFSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIISQIDGR---FQKLCLL 201
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLH 226
+K W H L S ++ LH
Sbjct: 202 VKHWAKAHEVNSALHRTLNSVSITLLVALH 231
>gi|297823863|ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
lyrata]
gi|297325653|gb|EFH56073.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
lyrata]
Length = 500
Score = 45.4 bits (106), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 81/191 (42%), Gaps = 30/191 (15%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
+QEIL I+P R VI+ ++ ++ G V PFGS + GD+DL+
Sbjct: 11 LQEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNLFTRWGDLDLS 70
Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVFEIK------------DVQYV-PAQVKIVKC-- 154
V DL + + Q +Q+V A+V I+K
Sbjct: 71 V--------DLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKVVS 122
Query: 155 SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
Q I DIS + + GL FL + + G+ F+ V+L+K W + I + G
Sbjct: 123 GHQRIACDISIDNLDGLLKSRFLFWISEIDGR---FRDLVLLVKEWAKAHN-INDSKNGT 178
Query: 215 ISTYALEMMVL 225
++Y+L ++V+
Sbjct: 179 FNSYSLSLLVI 189
>gi|449282422|gb|EMC89255.1| PAP-associated domain-containing protein 5, partial [Columba livia]
Length = 501
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 83/185 (44%), Gaps = 25/185 (13%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P E+ R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 3 EEIIDFYKYMSPRPEEERMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 62
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D++SV ++ D VP I+K S + V
Sbjct: 63 VFGKWETLPLWTLEEALRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKV 112
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
DISFN G+ A ++ I K + V+++K + R L + G I +Y+L
Sbjct: 113 DISFNVQNGVKAAQLIKD---FIKKYPVLPYLVLVLKQFLL--QRDLNEVFTGGIGSYSL 167
Query: 221 EMMVL 225
+M +
Sbjct: 168 FLMAV 172
>gi|194769888|ref|XP_001967033.1| GF21834 [Drosophila ananassae]
gi|190622828|gb|EDV38352.1| GF21834 [Drosophila ananassae]
Length = 1092
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 131/331 (39%), Gaps = 61/331 (18%)
Query: 22 SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
+H + Y P P E + E I+ + P R EV+ I+ ++
Sbjct: 327 NHREMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 386
Query: 82 NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
+ + V FGS +LP DIDL VLG + E L + LE E + E
Sbjct: 387 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLG---LWEKLP---LRTLEFELVSRGIAEA 440
Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
V+ + A V I+K + + ++ VDISFN +G+ + ++K R ++GK L +
Sbjct: 441 CTVRVLDKASVPIIKLTDRETHVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 500
Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSL 253
+L++ + G IS+Y+L +M CIS +
Sbjct: 501 FLLLRD-------LNEVFTGGISSYSLILM-----------------CISF-----LQLH 531
Query: 254 PEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN----------- 302
P + + N G LL E + + I+ ++N G + L
Sbjct: 532 PRGIYQDTANLGVLLLEFFELYGRRFNYMKIGIS-IKNGGRYMPKEDLQRDMVDGHRPSL 590
Query: 303 --IVDPLKDNNNLGRSVSKGNFH---RIRCA 328
I DPL N++GRS S G FH +CA
Sbjct: 591 LCIEDPLTPGNDIGRS-SYGVFHVQQAFKCA 620
>gi|194891045|ref|XP_001977427.1| GG18260 [Drosophila erecta]
gi|190649076|gb|EDV46354.1| GG18260 [Drosophila erecta]
Length = 1008
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 88/197 (44%), Gaps = 24/197 (12%)
Query: 37 YPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSV 95
YP+ D + L E I+ + P R EV+ I+ +++ + V FGS
Sbjct: 250 YPYG-DGVIGL--HEEIEHFYQYVLPTACEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSF 306
Query: 96 PLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVK 153
+LP DIDL VLG + + LE E + E V+ + A V I+K
Sbjct: 307 RTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEACTVRVLDKASVPIIK 360
Query: 154 CSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRIL 208
+ + + VDISFN +G+ + ++K R ++GK L + +L++ +
Sbjct: 361 LTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQFLLLRD-------LN 413
Query: 209 GAHYGLISTYALEMMVL 225
G IS+Y+L +M +
Sbjct: 414 EVFTGGISSYSLILMCI 430
>gi|312085976|ref|XP_003144894.1| hypothetical protein LOAG_09318 [Loa loa]
Length = 554
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)
Query: 139 IKDVQYVPAQVKIVKCSV----QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV 194
K QY+PA+V I+K + + + +DI+ N +AG+ L R+ D F
Sbjct: 322 FKTCQYIPAKVPIIKLEMAAPYEELEIDINCNNVAGIYNSHLLHYYSRV---DDRFPALC 378
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+L+K W + I A G +++Y+L +MVLH
Sbjct: 379 LLVKHWA-INAGINNAMMGTLNSYSLILMVLH 409
>gi|119482616|ref|XP_001261336.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
NRRL 181]
gi|119409491|gb|EAW19439.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
NRRL 181]
Length = 701
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 101/246 (41%), Gaps = 43/246 (17%)
Query: 42 DAELWLLAEE-------RIQ-EILCT---IQPAIVSEKKRKEVINYIQRLING-YYGIEV 89
DA W+L E R+ EIL ++P + R ++I +Q YYG+++
Sbjct: 242 DATPWMLLMEPSLHLGSRLHNEILSFYHWVKPMQYEQIVRADLITRLQVAFQSRYYGVQL 301
Query: 90 FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF---------VCKILENEDQDSVFEIK 140
PFGS YLP DIDL +L + + F L+N + I+
Sbjct: 302 RPFGSFASGLYLPTADIDLVLLSSNFMRNGIKTFGERKGQIYAFAAFLKNLEIAVPNSIE 361
Query: 141 DVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-----------------LCFLEKVD 181
+ + A+V I+K + + VD+SF+ +GL A + +++
Sbjct: 362 TIAH--ARVPILKFVDKMTGLRVDLSFDNDSGLIANNTFQNWKSEYPAMPVIVAVVKQFL 419
Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYC 241
L G + + + C S + +G + L +++ F Y NTFD++N
Sbjct: 420 LLRGLNEVPTGGLGGFSITCLVTSLLQHLPHGHAAP-NLGSILMDFFEFYGNTFDFENVG 478
Query: 242 ISINGP 247
I +N P
Sbjct: 479 IRLNPP 484
>gi|327278603|ref|XP_003224050.1| PREDICTED: PAP-associated domain-containing protein 5-like [Anolis
carolinensis]
Length = 665
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 81/184 (44%), Gaps = 23/184 (12%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P ++ R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 171 EEINDFYKYMSPRPEEQRMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 230
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
V G +EE L + D+ SV ++ D VP I+K S + V
Sbjct: 231 VFGKWETLPLWTLEEALRKHNV-----ADKGSV-KVLDKATVP----IIKLTDSFTEVKV 280
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
DISFN G+ A ++ I K + V+++K + + + G I +Y+L
Sbjct: 281 DISFNVQNGVKAADLIKD---FIKKYPVLPYLVLVLKQFL-LQRDLNEVFTGGIGSYSLF 336
Query: 222 MMVL 225
+M +
Sbjct: 337 LMAV 340
>gi|239835858|gb|ACS29269.1| pap1 [Meyerozyma guilliermondii]
Length = 564
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/226 (19%), Positives = 90/226 (39%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ +KR E++ +Q+++ + G ++F FGS L Y P DID
Sbjct: 43 ATRKRVEILALLQKMVQEFVYTVSKKKNMSDGMAKDAGGKIFTFGSYRLGVYGPGSDIDT 102
Query: 108 LTVLGHQNVEEDLARFVCKILENEDQ--------DSVFEIKDVQYVPAQVKIV--KCSVQ 157
L V+ V ED +++ + D+ I +++ + ++ + +V
Sbjct: 103 LIVVPKHVVREDFFTIFDQMIRQRPELEEITAVPDAFVPIIMIEFSGISIDLIFARLNVS 162
Query: 158 NIPVDISF-----------NQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
+P+D++ N M L+ +++ +L+ K +FK ++ IK W +
Sbjct: 163 RVPLDMTLEDNNLLKNIDENDMRALNGTRVTDQILQLVPKVTVFKHALRCIKLWA-QQRA 221
Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ G +G A M+V + +F + Y ++W
Sbjct: 222 VYGNMFGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFNIYTKWNW 267
>gi|345563688|gb|EGX46674.1| hypothetical protein AOL_s00097g578 [Arthrobotrys oligospora ATCC
24927]
Length = 1706
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 15/146 (10%)
Query: 40 SIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLK 98
S+ +WL E I + I+P R +++ ++ LI + ++ FGS +
Sbjct: 1266 SLYTSVWLHKE--IVDFHDYIKPQQYEHIVRHDLVRRLRELIQSRFDDADIQAFGSFAAE 1323
Query: 99 TYLPDGDIDLTVLGHQNVEEDLARF--------VCKILENEDQDSVFEIKDVQYVPAQVK 150
YLP D+D+ +L Q +E ++ + +IL N D + V + A+V
Sbjct: 1324 IYLPTSDMDIVLLSRQYLETSRPKYDSVNEIRKLARILRNSDLPQSGTL--VPILSAKVP 1381
Query: 151 IVKCS--VQNIPVDISFNQMAGLSAL 174
I+K + + VDISF +GL A+
Sbjct: 1382 IIKYKDRLTGLSVDISFENPSGLIAI 1407
>gi|312077329|ref|XP_003141256.1| PAP/25A associated domain-containing protein [Loa loa]
gi|307763579|gb|EFO22813.1| PAP/25A associated domain-containing protein [Loa loa]
Length = 419
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 63/295 (21%), Positives = 132/295 (44%), Gaps = 20/295 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E + ++ ++P+ + + R V ++ ++ + ++ FGS+ +LP DID+
Sbjct: 66 HEELLDLYAWLKPSPLEKALRLRVFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDIDV 125
Query: 109 TVLGHQNVEE-DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
V EE L + + E+ +S+ + D +VP VK+V + I +DISFN
Sbjct: 126 VVESDLVSEEPPLWKTAVALKESGITESI-NVLDKAFVPI-VKMVDKDTK-IYLDISFNT 182
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
+ G+ + F+E + + + + V+++K + + ++ G +S+Y L +M++
Sbjct: 183 VQGVRSARFIEDMKM---RYPVLEPLVLVLKQFL-MQRQLNQVFTGGLSSYGLILMLIS- 237
Query: 228 FLDYYNTFDWDNYCIS-INGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPI 286
FL + ++D+ I+ +N V + S ++ + L + C++ V +
Sbjct: 238 FLQLHPSYDYSYKGITEVNMGVLLLSFLQLYGQEFNYMKTALRIHSGGAYVCKDEILVQM 297
Query: 287 TALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
N+ L I DPL+ N++GR N +R A + L +
Sbjct: 298 NRPSNS-------MLCIEDPLQPGNDIGR--CSHNIQLVRQAFEHAFATLCAVFV 343
>gi|344231415|gb|EGV63297.1| Poly(A) polymerase PAPalpha [Candida tenuis ATCC 10573]
Length = 560
Score = 44.7 bits (104), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 48/228 (21%), Positives = 89/228 (39%), Gaps = 59/228 (25%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
+ +KR E++N Q+L+ + G ++F FGS L Y P DID
Sbjct: 43 ATRKRVEILNLFQKLVQEFVHKVSKSKNMSEGMAKDAGGKIFTFGSYRLGVYGPGSDIDT 102
Query: 109 TVL--GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKI--------VKCSVQN 158
V+ H E+ + F + E + + + + D +VP +KI + C+ N
Sbjct: 103 LVVVPKHVTREDFFSVFESIVRERPELEEITSVPDA-FVPI-IKIEFDGISIDLICATLN 160
Query: 159 IPV--------------DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
+P ++ M L+ +++ +L+ K +FK ++ IK W
Sbjct: 161 VPQVPLNMTLDDKNLLRNLDERDMRALNGTRVTDEILQLVPKPTVFKHALRCIKMWAQAR 220
Query: 205 SRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ + G +G A MMV + +F + Y ++W
Sbjct: 221 A-VYGNVFGFPGGVAWAMMVARICQLYPNAVGAVIVDKFFNIYTKWNW 267
>gi|156096867|ref|XP_001614467.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803341|gb|EDL44740.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 567
Score = 44.7 bits (104), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 56/225 (24%), Positives = 100/225 (44%), Gaps = 32/225 (14%)
Query: 21 PSHTSSSSS--YSSPPNLYPFS-IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYI 77
PS T S + +PP+ YP S +DAEL ++ ++P+ K + ++
Sbjct: 212 PSDTPQSDTPPSDTPPSEYPPSALDAEL--------SKLEVALRPSQNDVNSMKTFLAFL 263
Query: 78 QRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT----VLGHQNVEEDLARFVCKILENED 132
Q+ IN +Y V PFGS+ + + DID+ +L + + + +C IL N
Sbjct: 264 QKEINKHYKNCHVTPFGSIINGFWTRNSDIDICIQIPILLSRKDQITFLKKICLIL-NSF 322
Query: 133 QDSVFEIKDVQYVPAQVKIVKCSVQN------IPVDISFNQMAGLSALCFLEKVDRLIGK 186
D + E Q A+V I+ ++ + DIS N + A+ + + + +
Sbjct: 323 NDGIIE----QRFSAKVPIIHFYCKSLRHSFELSCDISVNNIL---AVVNSKLIQKYVSI 375
Query: 187 DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
D + I +K W I G +S+++L +M++H FL Y
Sbjct: 376 DKRLQLMGIALKYWS-KNRNINDRSKGFLSSFSLILMIIH-FLQY 418
>gi|449448062|ref|XP_004141785.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) polymerase-like [Cucumis
sativus]
Length = 772
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 73/153 (47%), Gaps = 17/153 (11%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVFP 91
L F +DA L+ EE + +E+L I IV + K+ + Y +++ + +F
Sbjct: 48 LEKFLVDAGLYESKEESAKREEVLSRIG-QIVKDWVKQLTRIKGYTDQMVEDANAV-IFT 105
Query: 92 FGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQ 148
FGS L + P DID +G + N EED + ILE + E+ ++Q VP A
Sbjct: 106 FGSYRLGVHGPGADIDTLCVGPSYVNREEDFFYMLHNILEEME-----EVSELQPVPDAH 160
Query: 149 VKIVKCSVQNIPVDISFNQMAGLSALCFLEKVD 181
V ++K I +D+ + A +S L E +D
Sbjct: 161 VPVMKFKFDGISIDLLY---ASISCLVVPEDLD 190
>gi|403338429|gb|EJY68454.1| hypothetical protein OXYTRI_10932 [Oxytricha trifallax]
Length = 1545
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 114/280 (40%), Gaps = 62/280 (22%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDS-----VFEIKDV 142
EV +GS K LP DIDL + + + V I+ E Q + E+K V
Sbjct: 1195 EVKVYGSHATKLCLPWSDIDLVIKTNSTDHYSTPKHVLSIITRELQSDHTTKWIQEVKFV 1254
Query: 143 QYVPAQVKIVKCSV-----------QNI---------PVDISFNQMA-GLSALCFLEKVD 181
+ V VKC + QNI P I Q+ + L ++ V
Sbjct: 1255 ENASVPVVKVKCQIDHIMQTSGLASQNISKYQTFLEQPFSIDITQLTDNHNGLECVKLVQ 1314
Query: 182 RLIGKDHLFKRSVILIKAW---CYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWD 238
+ ++ + + ++++K + C Y ++G IS+YAL +M++
Sbjct: 1315 EFLSENEVIEPLILVLKQYLKVCQYND----PYFGGISSYALFLMIV------------- 1357
Query: 239 NYCISINGPVAIS--SLPEIVAETVENDGDELLLSPEFLKKCREIYS-VPITALENTGH- 294
+Y SI P IS +L I+ + GD F + IY+ +P E T H
Sbjct: 1358 SYLQSIQAPKLISQVNLGHILISFFQFYGD-------FQYQSYGIYTHLPGKISEKTNHY 1410
Query: 295 ---EFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSY 331
F + + I DPL +NN+G+S K F+ I+ + +
Sbjct: 1411 AIVNFLTQTVQIDDPLHVHNNVGKSSFK--FYEIKDSFKF 1448
>gi|384249905|gb|EIE23385.1| hypothetical protein COCSUDRAFT_41642 [Coccomyxa subellipsoidea
C-169]
Length = 758
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 46/185 (24%), Positives = 83/185 (44%), Gaps = 29/185 (15%)
Query: 62 PAIVSEKKRKEVINYIQRLI----NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLG----- 112
P +R++++ + ++ +G+ + V P+GS Y P GD+D+++ G
Sbjct: 39 PGPQDAARRRQILEKMGGIVGLGLDGHTELRVEPYGSFVSGLYAPTGDLDISIEGFCGKE 98
Query: 113 ---------HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPV 161
++ + L R + K LE + I+ + + A+V I+K + IP
Sbjct: 99 GRGRDVRDMGKSAKAALLRALSKKLERSRLHRGY-IQRILH--ARVPILKLVWAESGIPC 155
Query: 162 DISFNQMAGLSALCF-LEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
D+S G S F E V L+ D F++ + +IK W + A G +T+AL
Sbjct: 156 DVSV----GSSNSRFKAEVVKALVRLDGRFEQMLRVIKVWSGAHG-LNDASNGTFNTFAL 210
Query: 221 EMMVL 225
+MVL
Sbjct: 211 SLMVL 215
>gi|115395044|ref|XP_001213471.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114193040|gb|EAU34740.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 680
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 58/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R+++I+ +Q YYG+++ PFGS YLP DIDL +L
Sbjct: 268 VKPVDYEQIVREDLISRLQSAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFTRN 327
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L N D I+ + + A+V I+K + + VD+SF+
Sbjct: 328 GIKTFGERKGQIYAFAAFLRNLDIAVPGSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 385
Query: 168 MAGLSA 173
+GL A
Sbjct: 386 DSGLIA 391
>gi|347830146|emb|CCD45843.1| hypothetical protein [Botryotinia fuckeliana]
Length = 788
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 65/137 (47%), Gaps = 10/137 (7%)
Query: 45 LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
LWL E I + ++P E R++++ ++ YY ++ PFGS P YLP
Sbjct: 450 LWLHKE--IMDFYYHVKPRKFEEVIRQKLLVDLRNNFRRYYPDADILPFGSFPAGLYLPT 507
Query: 104 GDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
D+D+ ++ + ++ ++ + ++ N ++D + + A+V + K ++
Sbjct: 508 ADMDVVMVSDEFMDGGWPKYSSKSHIWRVKSNIERDRLNTGRIDVITSAKVPLAKWVDAI 567
Query: 157 QNIPVDISFNQMAGLSA 173
+ VD+SF GL A
Sbjct: 568 TGLKVDMSFENDTGLIA 584
>gi|392595411|gb|EIW84734.1| hypothetical protein CONPUDRAFT_47123 [Coniophora puteana
RWD-64-598 SS2]
Length = 663
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 60/129 (46%), Gaps = 14/129 (10%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
++ + + P + ++ R V+ + + + + +V PFGS K YLP GDIDL
Sbjct: 155 VEAFVDYMSPTSIEDEIRGLVVKLVGKAVTSAFPDAKVLPFGSYGTKLYLPSGDIDLV-- 212
Query: 112 GHQNVEEDLARFVCK-----ILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDIS 164
+E D ++V K L N + + K A+V IVK ++ + VDIS
Sbjct: 213 ----IESDSMQYVPKNSVLHSLANVLKRAGIADKVTIIAKAKVPIVKFITRHGRLNVDIS 268
Query: 165 FNQMAGLSA 173
NQ GL A
Sbjct: 269 INQSNGLVA 277
>gi|296194961|ref|XP_002745186.1| PREDICTED: DNA polymerase sigma [Callithrix jacchus]
Length = 866
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 80/175 (45%), Gaps = 12/175 (6%)
Query: 65 VSEKKRKEVINYIQRLINGYYGI-EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF 123
V + + +EV+ I+ ++ + +V FGS YLP DIDL V G E +
Sbjct: 331 VFKGQLREVVKRIETVVKDLWPFFQVQIFGSFSTGLYLPTSDIDLVVFG--KWERPPLQL 388
Query: 124 VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVD 181
+ + L + IK + A V I+K + Q + VDISFN G+ A F++
Sbjct: 389 LEQALRKHNVAEPCSIKVLDK--ATVPIIKLTDQETEVKVDISFNMETGVRAAEFIKNY- 445
Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD 236
+ K L ++++K + + + G IS+Y+L +M + FL + D
Sbjct: 446 --MKKYSLLPYLILVLKQFL-LQRDLNEVFTGGISSYSLILMAIS-FLQLHPRID 496
>gi|242081815|ref|XP_002445676.1| hypothetical protein SORBIDRAFT_07g024040 [Sorghum bicolor]
gi|241942026|gb|EES15171.1| hypothetical protein SORBIDRAFT_07g024040 [Sorghum bicolor]
Length = 690
Score = 44.3 bits (103), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 43/177 (24%), Positives = 75/177 (42%), Gaps = 21/177 (11%)
Query: 41 IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIE-----VFPFGSV 95
ID L + EE +Q+ ++QP V + R +IN ++ G +G V PFGS
Sbjct: 55 IDPSLLPVLEELLQDTYASLQPQPVDYEHRYHMINIFNKIAEGIFGKNNGLPIVEPFGSF 114
Query: 96 PLKTYLPDGDIDLTVLGHQNVEEDLAR----FVCKILEN-----EDQDSVFEIKDVQYVP 146
+ + P D+DL++ + + + R + + L N + Q + + V
Sbjct: 115 IMDLFTPKSDLDLSINFNTDTNDQYPRRNKIYAIRKLANVLFSHQRQGLCHGVSPI--VT 172
Query: 147 AQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
A+V ++K Q V DIS G+S + + + D F+ L+K W
Sbjct: 173 ARVPVLKVIDQKTGVECDISVENKDGMSRSVIFKFISSI---DKRFRILCYLMKFWA 226
>gi|195480074|ref|XP_002101128.1| GE15792 [Drosophila yakuba]
gi|194188652|gb|EDX02236.1| GE15792 [Drosophila yakuba]
Length = 1014
Score = 44.3 bits (103), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 90/212 (42%), Gaps = 21/212 (9%)
Query: 22 SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
+H + Y P P E + E I+ + P R EV+ I+ ++
Sbjct: 233 NHMEMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 292
Query: 82 NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
+ + V FGS +LP DIDL VLG + + LE E + E
Sbjct: 293 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEA 346
Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRS 193
V+ + A V I+K + + + VDISFN +G+ + ++K R ++GK L +
Sbjct: 347 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQ 406
Query: 194 VILIKAWCYYESRILGAHYGLISTYALEMMVL 225
+L++ + G IS+Y+L +M +
Sbjct: 407 FLLLRD-------LNEVFTGGISSYSLILMCI 431
>gi|170109615|ref|XP_001886014.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164638944|gb|EDR03218.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 397
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 121/298 (40%), Gaps = 43/298 (14%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVL 111
++ + I P+ V ++ R ++ I + + V PFGS K YLP GDIDL +L
Sbjct: 106 VKAFVHWISPSPVEDEVRGLIVTQISNTVKASFPDARVLPFGSYETKLYLPLGDIDLVIL 165
Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--IPVDISFNQMA 169
+ V L N + S A+V IVK + VDIS NQ
Sbjct: 166 SDSMAYSNKVN-VLHALANTLKRSGVTSHVTIIAKAKVPIVKFVTTHGRFHVDISLNQSN 224
Query: 170 GLSAL----CFLEKV--DRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM 223
GL + FL+ + + GK + RS++++ + + + G + +Y++
Sbjct: 225 GLLSGKIINGFLKDMHGNGAEGKGSMALRSLVMVTKAFLTQRSMNEVYTGGLGSYSI--- 281
Query: 224 VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEF--LKKCREI 281
C++++ + P+I ++ + + +L+ EF L C
Sbjct: 282 ----------------VCLAVS---FLQMHPKIRNGEIDPEKNLGVLAMEFFELYGCYFN 322
Query: 282 YSVPITALENTGHEFAIKHLNIVD-------PLKDNNNLGRSVSKGN--FHRIRCALS 330
Y +L + G F+ + D L+D + +SKG+ FH++R A +
Sbjct: 323 YDEVGISLRDGGMYFSKRKRGWYDYDRRGILSLEDPADPSNDISKGSYGFHKVRTAFA 380
>gi|315054687|ref|XP_003176718.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
gi|311338564|gb|EFQ97766.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
Length = 627
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 32/140 (22%), Positives = 71/140 (50%), Gaps = 16/140 (11%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 272 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 331
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVF----------EIKDVQYVPAQVKIVKC-- 154
DL +L Q + + +++C+ + + + + I+ + + A+V I+K
Sbjct: 332 DLVLLSRQFMSSN-RKYICQKVRDIYSFAEYIRGLGIAAPGSIETIAH--ARVPIIKFVD 388
Query: 155 SVQNIPVDISFNQMAGLSAL 174
S+ + VD+SF+ +GL+A+
Sbjct: 389 SLTGLKVDLSFDNSSGLAAI 408
>gi|393912435|gb|EJD76738.1| hypothetical protein LOAG_16408 [Loa loa]
Length = 430
Score = 43.9 bits (102), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)
Query: 139 IKDVQYVPAQVKIVKCSV----QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV 194
K QY+PA+V I+K + + + +DI+ N +AG+ L R+ D F
Sbjct: 198 FKTCQYIPAKVPIIKLEMAAPYEELEIDINCNNVAGIYNSHLLHYYSRV---DDRFPALC 254
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+L+K W + I A G +++Y+L +MVLH
Sbjct: 255 LLVKHWA-INAGINNAMMGTLNSYSLILMVLH 285
>gi|365758850|gb|EHN00675.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 108/246 (43%), Gaps = 35/246 (14%)
Query: 4 MSAFLEMEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAEL--------WLLAEERIQE 55
+S FL M D E +T S S +++S+ YP+ + WL +E I++
Sbjct: 128 VSGFL-MPDQYE--ITKNSTQSHENTHSTSNTEYPWIRNHNHSRQRKIADWLTSE--IKD 182
Query: 56 ILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDGDIDLTVLGHQ 114
+ I P+ K R I+ +++ + + ++ FGS YLP DID +
Sbjct: 183 FVHYISPSKSEIKCRNRTIDKLRQAVKKLWSDADLHVFGSFATDLYLPGSDIDCVINSRH 242
Query: 115 NVEED------LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFN 166
+ +ED LAR+ L+NE E+ V +V I+K + + +D+SF
Sbjct: 243 HDKEDRNYIYELARY----LKNEGLAIRMEV----IVRTRVPIIKFIEPLSQLHIDVSFE 294
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+ GL A + R +D R ++L+ + R+ H G + + + + +++
Sbjct: 295 RTNGLEAARLI----REWLRDSPGLRELVLVIKQFLHSRRLNNVHTGGLGGFTV-ICLVY 349
Query: 227 RFLDYY 232
FL+ +
Sbjct: 350 SFLNMH 355
>gi|154323524|ref|XP_001561076.1| hypothetical protein BC1G_00161 [Botryotinia fuckeliana B05.10]
Length = 692
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 65/137 (47%), Gaps = 10/137 (7%)
Query: 45 LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
LWL E I + ++P E R++++ ++ YY ++ PFGS P YLP
Sbjct: 410 LWLHKE--IMDFYYHVKPRKFEEVIRQKLLVDLRNNFRRYYPDADILPFGSFPAGLYLPT 467
Query: 104 GDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
D+D+ ++ + ++ ++ + ++ N ++D + + A+V + K ++
Sbjct: 468 ADMDVVMVSDEFMDGGWPKYSSKSHIWRVKSNIERDRLNTGRIDVITSAKVPLAKWVDAI 527
Query: 157 QNIPVDISFNQMAGLSA 173
+ VD+SF GL A
Sbjct: 528 TGLKVDMSFENDTGLIA 544
>gi|297819094|ref|XP_002877430.1| hypothetical protein ARALYDRAFT_484955 [Arabidopsis lyrata subsp.
lyrata]
gi|297323268|gb|EFH53689.1| hypothetical protein ARALYDRAFT_484955 [Arabidopsis lyrata subsp.
lyrata]
Length = 680
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 46/207 (22%), Positives = 87/207 (42%), Gaps = 21/207 (10%)
Query: 36 LYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGI--EVFP-- 91
L + ID+ + L ++ + ++ C+ +P RKE++ + + YG E P
Sbjct: 30 LKRYKIDSYILLDLDKVLNDVYCSFRPVSADYDTRKELVKNLNAMAIDIYGNSEESSPVL 89
Query: 92 --FGSVPLKTYLPDGDIDLTV--------LGHQNVEEDLARFVCKILENEDQDSVFEIKD 141
+GS + Y D+D+++ L + E L RF K+ + + V ++
Sbjct: 90 EAYGSFVMDMYSSQSDLDVSINFGNGTPELPREKKLEILKRFAKKLRSLQGEGHVKNVES 149
Query: 142 VQYVPAQVKIVKCSVQNIPV--DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKA 199
+ A+V IVK S Q V D+S G+ + + ++ G+ F++ +L+K
Sbjct: 150 I--FSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIISQIDGR---FQKLCMLVKH 204
Query: 200 WCYYESRILGAHYGLISTYALEMMVLH 226
W H L S ++ LH
Sbjct: 205 WAKAHEVNSALHRTLNSVSITLLVALH 231
>gi|326470670|gb|EGD94679.1| topoisomerase TRF4 [Trichophyton tonsurans CBS 112818]
Length = 631
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 276 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 335
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
DL +L Q + + +++C+ + + + VP A+V I+K ++
Sbjct: 336 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPGSIETIAHARVPIIKFVDAL 394
Query: 157 QNIPVDISFNQMAGLSAL 174
+ VD+SF+ +GL+A+
Sbjct: 395 TGLKVDLSFDNSSGLAAI 412
>gi|327307976|ref|XP_003238679.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
gi|326458935|gb|EGD84388.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
Length = 631
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 31/138 (22%), Positives = 69/138 (50%), Gaps = 12/138 (8%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 276 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 335
Query: 107 DLTVLGHQNVEEDLARFVCK-------ILENEDQDSVFEIKDVQYVP-AQVKIVKC--SV 156
DL +L Q + + +++C+ E + + ++ + A+V I+K ++
Sbjct: 336 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPRSIETIAHARVPIIKFVDAL 394
Query: 157 QNIPVDISFNQMAGLSAL 174
+ VD+SF+ +GL+A+
Sbjct: 395 TGLKVDLSFDNSSGLTAI 412
>gi|326479586|gb|EGE03596.1| topoisomerase TRF4 [Trichophyton equinum CBS 127.97]
Length = 631
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 276 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 335
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
DL +L Q + + +++C+ + + + VP A+V I+K ++
Sbjct: 336 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPGSIETIAHARVPIIKFVDAL 394
Query: 157 QNIPVDISFNQMAGLSAL 174
+ VD+SF+ +GL+A+
Sbjct: 395 TGLKVDLSFDNSSGLAAI 412
>gi|63101121|gb|AAY33178.1| PAPa [Candida parapsilosis]
gi|354544642|emb|CCE41367.1| hypothetical protein CPAR2_303560 [Candida parapsilosis]
Length = 552
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 81/228 (35%), Gaps = 65/228 (28%)
Query: 69 KRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID-LTV 110
KR EV+ +Q+L + G ++F FGS L Y P DID L V
Sbjct: 46 KRVEVLTTLQKLTEQFVYEVSLQRNMSQGMAKDAGGKIFTFGSYKLGVYGPSSDIDALVV 105
Query: 111 LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFNQ-- 167
+ +D KIL E++++ V A V I+K I +D+ F +
Sbjct: 106 VPRHVTRDDFFTVFEKILRGRQ-----ELEEINCVKEAFVPIIKLEFAGISIDLLFAKLD 160
Query: 168 -----------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
M L+ +++ RL+ K +FK ++ +K W +
Sbjct: 161 IPRVPHDLTLDDKNLLKNIDEKDMRALNGTRVTDEILRLVPKSTVFKNALRFVKMWA-QQ 219
Query: 205 SRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
I YG A M+V L +F Y+ + W
Sbjct: 220 RAIYANVYGFPGGVAWAMLVARICQLYPNAVSAVILEKFFQIYSQWSW 267
>gi|302657626|ref|XP_003020531.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
gi|291184373|gb|EFE39913.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
Length = 577
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 222 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 281
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
DL +L Q + + +++C+ + + + VP A+V I+K ++
Sbjct: 282 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGLAVPGSIETIAHARVPIIKFVDAL 340
Query: 157 QNIPVDISFNQMAGLSAL 174
+ VD+SF+ +GL+A+
Sbjct: 341 TGLKVDLSFDNSSGLAAI 358
>gi|170059968|ref|XP_001865594.1| sigma DNA polymerase [Culex quinquefasciatus]
gi|167878539|gb|EDS41922.1| sigma DNA polymerase [Culex quinquefasciatus]
Length = 618
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 49/185 (26%), Positives = 86/185 (46%), Gaps = 17/185 (9%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGD 105
++ E I++ + P R +V+ I+ ++ N + V FGS YLP D
Sbjct: 1 MILHEEIEQFYAHMIPTGTEHTLRVQVVARIEAIVLNLWPMARVEMFGSFRTGLYLPTSD 60
Query: 106 IDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPV 161
IDL V+G + + LENE + E V+ + A V IVK + + + V
Sbjct: 61 IDLVVIGR------WEKLPLRTLENELISRGIAEPMSVRVLDKASVPIVKLTDRETQVKV 114
Query: 162 DISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYAL 220
DISFN +G+ + ++ R + + + V+++K + R L + G IS+Y+L
Sbjct: 115 DISFNMQSGVQSAELIKDFKR---QYPVLAKLVLVLKQFLL--QRDLNEVFTGGISSYSL 169
Query: 221 EMMVL 225
+M +
Sbjct: 170 ILMCI 174
>gi|195447518|ref|XP_002071250.1| GK25231 [Drosophila willistoni]
gi|194167335|gb|EDW82236.1| GK25231 [Drosophila willistoni]
Length = 1048
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 53/211 (25%), Positives = 89/211 (42%), Gaps = 21/211 (9%)
Query: 23 HTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLIN 82
H + Y P P E + E I+ + P R EV+ I+ +++
Sbjct: 260 HMEMIAKYKGTPWRKPDYHYGEGAIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEGVVH 319
Query: 83 GYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEIK 140
+ V FGS +LP DIDL VLG + + LE E + E
Sbjct: 320 SIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEAC 373
Query: 141 DVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR---LIGKDHLFKRSV 194
V+ + A V I+K + + + VDISFN +G+ + ++K R ++GK L +
Sbjct: 374 TVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKRDYPVLGKLVLVLKQF 433
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVL 225
+L++ + G IS+Y+L +M +
Sbjct: 434 LLLRD-------LNEVFTGGISSYSLILMCI 457
>gi|170584484|ref|XP_001897029.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158595564|gb|EDP34107.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 747
Score = 43.5 bits (101), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 48/92 (52%), Gaps = 8/92 (8%)
Query: 139 IKDVQYVPAQVKIVKCSV----QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSV 194
K QY+PA+V I+K + + + +DI+ N +AG+ L R+ D F
Sbjct: 515 FKTCQYIPAKVPIIKLEMAAPYEELEIDINCNNVAGIYNSHLLHYYSRV---DDRFPALC 571
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+L+K W + I A G +++Y+L +MVLH
Sbjct: 572 LLVKHWA-INAGINNAMMGTLNSYSLILMVLH 602
>gi|302508901|ref|XP_003016411.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
gi|291179980|gb|EFE35766.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
Length = 577
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 32/138 (23%), Positives = 68/138 (49%), Gaps = 12/138 (8%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 222 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQACFGQRFYGAKLYAFGSFASGLYLPTADM 281
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SV 156
DL +L Q + + +++C+ + + + VP A+V I+K ++
Sbjct: 282 DLVLLSRQFMSSN-RKYICQKVREIYSFAEYIRGQGIAVPGSIETIAHARVPIIKFVDAL 340
Query: 157 QNIPVDISFNQMAGLSAL 174
+ VD+SF+ +GL+A+
Sbjct: 341 TGLKVDLSFDNSSGLAAI 358
>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
Length = 464
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 4/129 (3%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
+ + ++ + I P ++ R ++ I R + + +V PFGS K YLP GDI
Sbjct: 140 MLQRDVEAFIDYISPTPAEDEIRGLIVQLISRAVTQAFPDAQVLPFGSYETKLYLPLGDI 199
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDIS 164
DL + D V L N + + + A+V I+K + VDIS
Sbjct: 200 DLVIQSPSMAYSDKVT-VLHALANTMRRAGITDRVTIVAKAKVPIIKFITTHGRFAVDIS 258
Query: 165 FNQMAGLSA 173
NQ G++A
Sbjct: 259 LNQTNGVAA 267
>gi|449480779|ref|XP_004155993.1| PREDICTED: poly(A) polymerase-like [Cucumis sativus]
Length = 758
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 58/250 (23%), Positives = 100/250 (40%), Gaps = 53/250 (21%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVFP 91
L F +DA L+ EE + +E+L I IV + K+ + Y +++ +F
Sbjct: 48 LEKFLVDAGLYESKEESAKREEVLSRIG-QIVKDWVKQLTRIKGYTDQMVEDANAA-IFT 105
Query: 92 FGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQ 148
FGS L + P DID +G + N EED + ILE + E+ ++Q VP A
Sbjct: 106 FGSYRLGVHGPGADIDTLCVGPSYVNREEDFFYMLHNILEEME-----EVSELQPVPDAH 160
Query: 149 VKIVKCSVQNIPVDISFNQMAGL--------SALCFLEKVD-----------------RL 183
V ++K I +D+ + ++ L S + L VD +L
Sbjct: 161 VPVMKFKFDGISIDLLYASISCLVVPEDLDISDVSVLYNVDEPTVRSLNGCRVADQILKL 220
Query: 184 IGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTYALEM-------MVLHRFL 229
+ F+ ++ +K W + G ++ L+ ++ M+L RF
Sbjct: 221 VPNVESFRTALRCLKFWAKRRGVYSNVTGFLGGVNWALLVARVCQLYPNAVPSMLLSRFF 280
Query: 230 DYYNTFDWDN 239
Y + W N
Sbjct: 281 RVYTLWRWPN 290
>gi|340371638|ref|XP_003384352.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Amphimedon queenslandica]
Length = 462
Score = 43.5 bits (101), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 74/146 (50%), Gaps = 16/146 (10%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYV-- 145
+V+PFGS YLP DID+ VLG E LA + LE+ + I+D V
Sbjct: 133 QVYPFGSFCTNLYLPTSDIDIVVLG-----EWLALPLFS-LEDAFLKAQIAIEDSIMVLD 186
Query: 146 PAQVKIVKCSVQ--NIPVDISFNQMAGL-SALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
V I+K + + + VDISFNQ G+ SA + V + +L +++K +
Sbjct: 187 KTTVPIIKFTDRETEVKVDISFNQETGIYSANLICQYVQQFPYLPYL----ALIVKQFL- 241
Query: 203 YESRILGAHYGLISTYALEMMVLHRF 228
+ ++ +YG I++Y+L +M++ F
Sbjct: 242 AQRQLNEVYYGGINSYSLILMLVSFF 267
>gi|324975502|gb|ADY62684.1| PAPa [Candida orthopsilosis]
Length = 547
Score = 43.5 bits (101), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 45/226 (19%), Positives = 88/226 (38%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ KKR EV++ Q L+ + G ++F FGS L Y P DID
Sbjct: 42 ATKKRAEVLSLFQTLVQQFVYEVSISKNMSDGMAKDAGGKIFTFGSYRLGVYGPSSDIDA 101
Query: 108 LTVLGHQNVEEDLARFVCKILENEDQ--------DSVFEIKDVQY--VPAQVKIVKCSVQ 157
L V+ ED KI+ + D+ I +++ + + + + +V
Sbjct: 102 LVVVPRHVTREDFFTTFDKIIRQRSELQEINGVSDAFVPIIKLEFDGISLDLIMARLNVP 161
Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
+P+D++ + + L+ +++ +L+ K +FK ++ IK W E
Sbjct: 162 RVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVPKPGVFKHALRCIKLWA-QERA 220
Query: 207 ILGAHYGLISTYALEMM---------------VLHRFLDYYNTFDW 237
+ G +G A M+ ++ +F + Y ++W
Sbjct: 221 VYGNVFGFPGGVAWAMLTARICQLYPNAVSAVIVEKFFNIYTKWNW 266
>gi|363751202|ref|XP_003645818.1| hypothetical protein Ecym_3523 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889452|gb|AET39001.1| Hypothetical protein Ecym_3523 [Eremothecium cymbalariae
DBVPG#7215]
Length = 683
Score = 43.5 bits (101), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 47/190 (24%), Positives = 83/190 (43%), Gaps = 18/190 (9%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E I++ + I P +KR + I I++ + ++ ++ FGS YLP
Sbjct: 187 WLTLE--IKDFVSYISPNREEIRKRNDAITKIRKAVKSFWPDSDLHCFGSYATDLYLPGS 244
Query: 105 DIDLTV---LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNI 159
DID V G ++ + L F + +N V I A+V I+K V I
Sbjct: 245 DIDCVVNSKSGDKDNKNALYSFASYLRKNGLASQVSVI-----AKARVPIIKFVEPVSQI 299
Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
+D+SF + G+ A + +G + V+ IK + Y R+ H G + ++
Sbjct: 300 HIDVSFERTNGVDAAKIIRG---WLGDTPGLRELVLTIKQFLYAR-RLNDVHIGGLGGFS 355
Query: 220 LEMMVLHRFL 229
+ + + + FL
Sbjct: 356 I-ICLTYSFL 364
>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
Full=Caffeine-induced death protein 14; AltName:
Full=Polynucleotide adenylyltransferase cid14
gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
Length = 684
Score = 43.5 bits (101), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 33/222 (14%)
Query: 39 FSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYI-QRLINGYYGIEVFPFGSVPL 97
+ ++ E+ + + I + I P RK +++ I Q ++ + + ++ FGS
Sbjct: 231 YKVEREVSRIFHQDILHFIDYITPTPEEHAVRKTLVSRINQAVLQKWPDVSLYVFGSFET 290
Query: 98 KTYLPDGDIDLTVLG----HQNVEED---LARFVCKI-LENEDQDSVFEIKDVQYV-PAQ 148
K YLP D+DL ++ ++ ++D LA + K+ L +E VQ + A
Sbjct: 291 KLYLPTSDLDLVIISPEHHYRGTKKDMFVLAHHLKKLKLASE----------VQVITTAN 340
Query: 149 VKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
V I+K + + VDISFNQ GL C + V+ + K + VI+IK + R
Sbjct: 341 VPIIKFVDPLTKVHVDISFNQPGGLKT-CLV--VNGFMKKYPALRPLVIIIKH--FLNMR 395
Query: 207 ILGAHY-GLISTYALEMMV-----LHRFLDYYNTFDWDNYCI 242
L + G +S+YA+ +V LH L + + DN+ +
Sbjct: 396 ALNEVFLGGLSSYAIVCLVVSFLQLHPRLSTGSMREEDNFGV 437
>gi|449296989|gb|EMC93008.1| hypothetical protein BAUCODRAFT_49110, partial [Baudoinia
compniacensis UAMH 10762]
Length = 611
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 63/140 (45%), Gaps = 18/140 (12%)
Query: 52 RIQEILCT----IQPAIVSEKKRKEVINYIQRLI-----NGYYGIEVFPFGSVPLKTYLP 102
R+ + +C +QP E+ R+++I+ ++R I G +G ++ FGS YLP
Sbjct: 288 RLHKEICDFFNYVQPKNFEEEARRDLISRVERAILTSNIAGSHGTKIHCFGSFAAGLYLP 347
Query: 103 DGDIDLTVLGHQNVEEDLARF------VCKILENEDQDSVFEIKDVQYV-PAQVKIVKC- 154
D+DL + ++ L + ++ + + V VQ V A+V IVK
Sbjct: 348 TADMDLVAISSTFQQQGLRTVGQSGTQMHRLSAHLTRSGVAAAGSVQVVLHAKVPIVKFI 407
Query: 155 -SVQNIPVDISFNQMAGLSA 173
I VDISF G+ A
Sbjct: 408 DKKTGIKVDISFENRTGIVA 427
>gi|392867853|gb|EAS33553.2| topoisomerase TRF4 [Coccidioides immitis RS]
Length = 680
Score = 43.5 bits (101), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 40/221 (18%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +RL+ N + G ++ FGS YLP D+DL +L + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
+F+C+ +++ + + I+D + VP A+V I+K + + VD+SF+
Sbjct: 318 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 375
Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGL-ISTY 218
+GL+A ++ K+H + +L++ + LG + + T
Sbjct: 376 NSGLAANRTFQQ-----WKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGLGGFSIICLVTS 430
Query: 219 ALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
L+ + VL F D+Y N FD+ I +N P
Sbjct: 431 LLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 471
>gi|389748468|gb|EIM89645.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
Length = 479
Score = 43.5 bits (101), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 56/130 (43%), Gaps = 4/130 (3%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
L + + I P V ++ R V+ IQR I+ + +V FGS K YLP GDI
Sbjct: 102 LLHREVDAFVRYISPTPVEDEIRSLVVLQIQRCISSKFPDAKVRSFGSYETKLYLPLGDI 161
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDIS 164
DL ++ D V + N + + + A+V IVK + VDIS
Sbjct: 162 DLVIISKSMAYSDRVT-VLHAVANTLRTAGITDRVSVIAKAKVPIVKFVTTFGRFAVDIS 220
Query: 165 FNQMAGLSAL 174
N G+ A+
Sbjct: 221 INMSNGVEAI 230
>gi|392580130|gb|EIW73257.1| hypothetical protein TREMEDRAFT_22292, partial [Tremella
mesenterica DSM 1558]
Length = 303
Score = 43.5 bits (101), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 48/94 (51%), Gaps = 17/94 (18%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTVL-------GHQNVEEDLARFVCKILENEDQDSVFEIKD 141
V PFGS + YLP GDIDL V +N+ DLAR + + D+V I
Sbjct: 62 VTPFGSWQTQLYLPQGDIDLVVTHPTLTEHNKKNLLNDLARTMRYAMIT---DNVVVIS- 117
Query: 142 VQYVPAQVKIVKCSVQN--IPVDISFNQMAGLSA 173
A+V I+K ++ + VDIS NQ+ G+SA
Sbjct: 118 ----KARVPIIKFVTKHGKLNVDISLNQVNGISA 147
>gi|324975520|gb|ADY62700.1| PAPa [Candida orthopsilosis]
Length = 547
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 46/226 (20%), Positives = 88/226 (38%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ KKR EV++ Q L+ + G ++F FGS L Y P DID
Sbjct: 42 ATKKRAEVLSLFQTLVQQFVYEVSISKNMSDGMAKDAGGKIFTFGSYRLGVYGPSSDIDA 101
Query: 108 LTVLGHQNVEEDLARFVCKILE--------NEDQDSVFEIKDVQY--VPAQVKIVKCSVQ 157
L V+ ED KI+ N D+ I +++ + + + + +V
Sbjct: 102 LVVVPRHVTREDFFTTFDKIIRQRPELQEINGVSDAFVPIIKLEFDGISLDLIMARLNVP 161
Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
+P+D++ + + L+ +++ +L+ K +FK ++ IK W E
Sbjct: 162 RVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVPKPGVFKHALRCIKLWA-QERA 220
Query: 207 ILGAHYGLISTYALEMM---------------VLHRFLDYYNTFDW 237
+ G +G A M+ ++ +F + Y ++W
Sbjct: 221 VYGNVFGFPGGVAWAMLTARICQLYPNAVSAVIVEKFFNIYTKWNW 266
>gi|303323645|ref|XP_003071814.1| PAP/25A associated domain containing protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240111516|gb|EER29669.1| PAP/25A associated domain containing protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 680
Score = 43.5 bits (101), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 40/221 (18%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +RL+ N + G ++ FGS YLP D+DL +L + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
+F+C+ +++ + + I+D + VP A+V I+K + + VD+SF+
Sbjct: 318 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 375
Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGL-ISTY 218
+GL+A ++ K+H + +L++ + LG + + T
Sbjct: 376 NSGLAANRTFQQ-----WKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTS 430
Query: 219 ALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
L+ + VL F D+Y N FD+ I +N P
Sbjct: 431 LLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 471
>gi|70987233|ref|XP_749095.1| topoisomerase family protein TRF4 [Aspergillus fumigatus Af293]
gi|66846725|gb|EAL87057.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
Af293]
Length = 702
Score = 43.1 bits (100), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 100/246 (40%), Gaps = 43/246 (17%)
Query: 42 DAELWLLAEE-------RIQ-EILCT---IQPAIVSEKKRKEVINYIQRLING-YYGIEV 89
DA W+L E R+ EIL ++P + R ++I +Q YYG+++
Sbjct: 243 DATPWMLLMEPSLHLGSRLHNEILSFYHWVKPMQYEQIVRADLITRLQVAFQSRYYGVQL 302
Query: 90 FPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF---------VCKILENEDQDSVFEIK 140
PFGS YLP DIDL +L + + F L+N + I+
Sbjct: 303 RPFGSFASGLYLPTADIDLVLLSSNFMRNGIKTFGERKGQIYAFAAFLKNLEIAVPNSIE 362
Query: 141 DVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-----------------LCFLEKVD 181
+ + A+V I+K + + VD+SF+ +GL A + +++
Sbjct: 363 TIAH--ARVPILKFVDKMTGLRVDLSFDNDSGLIANNTFQNWKSEYPAMPVIVAVVKQFL 420
Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYC 241
L G + + + C S + +G + L +++ F Y N FD++N
Sbjct: 421 LLRGLNEVPTGGLGGFSITCLVTSLLQHLPHGHTAP-NLGSILMDFFEFYGNNFDFENVG 479
Query: 242 ISINGP 247
I +N P
Sbjct: 480 IRLNPP 485
>gi|224064673|ref|XP_002197521.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Taeniopygia guttata]
Length = 443
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 19/121 (15%)
Query: 67 EKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEED 119
E R EV+N I+ +I + +V FGS YLP DIDL V G +EE
Sbjct: 8 ETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLPLWTLEEA 67
Query: 120 LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFL 177
L + D++SV ++ D VP I+K S + VDISFN G+ A +
Sbjct: 68 LRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNGVKAAQLI 117
Query: 178 E 178
+
Sbjct: 118 K 118
>gi|344301689|gb|EGW31994.1| Poly(A) polymerase PAPalpha [Spathaspora passalidarum NRRL Y-27907]
Length = 556
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 44/226 (19%), Positives = 88/226 (38%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ KKR EV+ Q+++ + G +VF FGS L Y P DID
Sbjct: 43 ATKKRAEVLALFQKMVQEFVYTVSKSKNMSDGMAKDAGGKVFTFGSYRLGVYGPGSDIDT 102
Query: 108 LTVLGHQNVEEDLARFVCKILENEDQ--------DSVFEIKDVQY--VPAQVKIVKCSVQ 157
L V+ ED +I+ + D+ I +++ + + + + +V
Sbjct: 103 LVVVPKHVTREDFFTVFEQIIRKRPELQEIASVPDAFVPIIKIEFDGISIDLILARLNVP 162
Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
+P+D++ + + L+ +++ +L+ K +FK ++ IK W +
Sbjct: 163 RVPLDMTLDDKNLLKNIDERDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKLWAQQRA- 221
Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ G +G A M+V + +F + Y ++W
Sbjct: 222 VYGNVFGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFNIYTKWNW 267
>gi|320035002|gb|EFW16944.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
Length = 680
Score = 43.1 bits (100), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 99/221 (44%), Gaps = 40/221 (18%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +RL+ N + G ++ FGS YLP D+DL +L + +
Sbjct: 258 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 317
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
+F+C+ +++ + + I+D + VP A+V I+K + + VD+SF+
Sbjct: 318 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 375
Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGL-ISTY 218
+GL+A ++ K+H + +L++ + LG + + T
Sbjct: 376 NSGLAANRTFQQ-----WKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTS 430
Query: 219 ALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
L+ + VL F D+Y N FD+ I +N P
Sbjct: 431 LLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 471
>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 642
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 46/196 (23%), Positives = 88/196 (44%), Gaps = 24/196 (12%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ +++ + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKSEIKCRNRTIDKLRQAVKQLWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEED------LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
DID + + +ED LAR+ L+NE E+ V +V I+K +
Sbjct: 233 DIDCVINSRHHDKEDRNYIYELARY----LKNEGLAIRMEV----IVRTRVPIIKFIEPL 284
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
+ +D+SF + GL A + R +D R ++L+ + R+ H G +
Sbjct: 285 SQLHIDVSFERTNGLEAARLI----REWLRDSPGLRELVLVIKQFLHSRRLNNVHTGGLG 340
Query: 217 TYALEMMVLHRFLDYY 232
+ + + +++ FL+ +
Sbjct: 341 GFTV-ICLVYSFLNMH 355
>gi|221055315|ref|XP_002258796.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193808866|emb|CAQ39569.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 548
Score = 43.1 bits (100), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 98/214 (45%), Gaps = 32/214 (14%)
Query: 31 SSPPNLYPFS-IDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIE 88
S PP YP S +DAEL ++ ++P+ K + ++Q+ IN +Y
Sbjct: 205 SPPPGEYPPSALDAEL--------NKLEIALRPSQNDVNSIKTFLAFLQKEINKHYKNCH 256
Query: 89 VFPFGSVPLKTYLPDGDIDLT----VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQY 144
V PFGS+ + + DID+ +L + + + +C IL N + D + E Q
Sbjct: 257 VTPFGSIINGFWTRNSDIDICIQIPILLSRKDQITFLKKICLILNNFN-DGIIE----QR 311
Query: 145 VPAQVKIVKCSVQN------IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIK 198
A+V I+ ++ + DIS N + A+ + + + + D + I +K
Sbjct: 312 FSAKVPIIHFYCKSLRHSFELSCDISVNNIL---AVINSKLIQKYVSIDRRLQLMGIALK 368
Query: 199 AWCYYESR-ILGAHYGLISTYALEMMVLHRFLDY 231
W +SR I G +S+++L +M++H FL Y
Sbjct: 369 YWS--KSRNINDRSKGFLSSFSLILMIIH-FLQY 399
>gi|68482706|ref|XP_714750.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
gi|3334283|sp|O42617.1|PAP_CANAL RecName: Full=Poly(A) polymerase PAPalpha; AltName:
Full=Polynucleotide adenylyltransferase alpha
gi|2696030|dbj|BAA23802.1| poly A polymerase [Candida albicans]
gi|5771514|gb|AAD51412.1| unknown [Candida albicans]
gi|46436342|gb|EAK95706.1| hypothetical protein CaO19.10713 [Candida albicans SC5314]
Length = 558
Score = 43.1 bits (100), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 47/227 (20%), Positives = 90/227 (39%), Gaps = 57/227 (25%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
+ KKR EV+ QRL+ + G +VF FGS L Y P DID
Sbjct: 43 ATKKRVEVLTLFQRLVQEFVYTVSKSKNMSDSMAQDAGGKVFTFGSYRLGVYGPGSDIDT 102
Query: 109 TVLGHQNVEED--LARFVCKILENEDQDSVFEIKDVQYVP----------AQVKIVKCSV 156
V+ ++V D + F I + + + + + D YVP + + + ++
Sbjct: 103 LVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDA-YVPIIKLEFDGISIDLIMARLNI 161
Query: 157 QNIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
+P+D++ + + L+ +++ +L+ K +FK ++ IK W +
Sbjct: 162 PRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKLWA-QQR 220
Query: 206 RILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ G +G A M+V + +F + Y ++W
Sbjct: 221 AVYGNIFGFPGGVAWAMLVARICQLYPNAVSSAIVEKFFNIYTKWNW 267
>gi|449472874|ref|XP_004176276.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Taeniopygia guttata]
Length = 490
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 56/121 (46%), Gaps = 19/121 (15%)
Query: 67 EKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEED 119
E R EV+N I+ +I + +V FGS YLP DIDL V G +EE
Sbjct: 8 ETMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLPLWTLEEA 67
Query: 120 LARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFL 177
L + D++SV ++ D VP I+K S + VDISFN G+ A +
Sbjct: 68 LRKHNV-----ADENSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNGVKAAQLI 117
Query: 178 E 178
+
Sbjct: 118 K 118
>gi|195131647|ref|XP_002010257.1| GI15833 [Drosophila mojavensis]
gi|193908707|gb|EDW07574.1| GI15833 [Drosophila mojavensis]
Length = 807
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 118/305 (38%), Gaps = 61/305 (20%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDL 108
E I + P R EV+ I+ +++ + V FGS +LP DIDL
Sbjct: 301 HEEIDHFYQYVLPTACEHAIRNEVVKRIESVVHSIWPQAVVEIFGSFRTGLFLPTSDIDL 360
Query: 109 TVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
VLG + + LE E + E V+ + A V I+K + + + VDIS
Sbjct: 361 VVLGLWE------KLPLRTLEFELVARGIAEACTVRVLDKASVPIIKLTDRETQVKVDIS 414
Query: 165 FNQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALE 221
FN +G+ + +++ R ++GK L + +L++ + G IS+Y+L
Sbjct: 415 FNMQSGVQSAELIKQFKRDYPVLGKLVLVLKQFLLLRD-------LNEVFTGGISSYSLI 467
Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREI 281
+M CIS + P + N G LL E +
Sbjct: 468 LM-----------------CISF-----LQLHPRGIYHDTTNLGVLLLEFFELYGQSFNY 505
Query: 282 YSVPITALENTGHEFAIKHLN-------------IVDPLKDNNNLGRSVSKGNFH---RI 325
++ I +++N G L I DPL N++GRS S G H
Sbjct: 506 INIGI-SIKNGGRYMPKDELQRDMVDGHRPSLLCIEDPLTPGNDIGRS-SYGALHVRQAF 563
Query: 326 RCALS 330
RCA S
Sbjct: 564 RCAFS 568
>gi|119188673|ref|XP_001244943.1| hypothetical protein CIMG_04384 [Coccidioides immitis RS]
Length = 785
Score = 43.1 bits (100), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 100/222 (45%), Gaps = 42/222 (18%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +RL+ N + G ++ FGS YLP D+DL +L + +
Sbjct: 363 VKPRPFEDVIRTDLITRFERLMQNRFPGSQLHAFGSYASGLYLPVADVDLVLLSRSFIRQ 422
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
+F+C+ +++ + + I+D + VP A+V I+K + + VD+SF+
Sbjct: 423 G-RKFLCQKIKDIYSLTAY-IRDTEIAVPGSIETIAHARVPIIKFVDRLTGLKVDLSFDN 480
Query: 168 MAGLSALCFLEKVDRLIGKDHL--------FKRSVILIKAWCYYESRILGAHYGLIS--T 217
+GL+A ++ K+H + +L++ + LG + +I T
Sbjct: 481 NSGLAANRTFQQ-----WKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGLGG-FSIICLVT 534
Query: 218 YALEMM-----------VLHRFLDYY-NTFDWDNYCISINGP 247
L+ + VL F D+Y N FD+ I +N P
Sbjct: 535 SLLQHLPHGMSEPNLGGVLMDFFDFYGNKFDFSTVGIELNPP 576
>gi|238882575|gb|EEQ46213.1| Poly(A) polymerase [Candida albicans WO-1]
Length = 558
Score = 43.1 bits (100), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 47/227 (20%), Positives = 90/227 (39%), Gaps = 57/227 (25%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
+ KKR EV+ QRL+ + G +VF FGS L Y P DID
Sbjct: 43 ATKKRVEVLTLFQRLVQEFVYTVSKSKNMSDSMAQDAGGKVFTFGSYRLGVYGPGSDIDT 102
Query: 109 TVLGHQNVEED--LARFVCKILENEDQDSVFEIKDVQYVP----------AQVKIVKCSV 156
V+ ++V D + F I + + + + + D YVP + + + ++
Sbjct: 103 LVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDA-YVPIIKLEFDGISIDLIMARLNI 161
Query: 157 QNIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYES 205
+P+D++ + + L+ +++ +L+ K +FK ++ IK W +
Sbjct: 162 PRVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKLWA-QQR 220
Query: 206 RILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ G +G A M+V + +F + Y ++W
Sbjct: 221 AVYGNIFGFPGGVAWAMLVARICQLYPNAVSSAIVEKFFNIYTKWNW 267
>gi|195393956|ref|XP_002055618.1| GJ18687 [Drosophila virilis]
gi|194150128|gb|EDW65819.1| GJ18687 [Drosophila virilis]
Length = 1111
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 79/183 (43%), Gaps = 19/183 (10%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDL 108
E I + P R EV+ I+ +++ + V FGS +LP DIDL
Sbjct: 270 HEEIDHFYQYVLPTPCEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPTSDIDL 329
Query: 109 TVLGHQNVEEDLARFVCK---ILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISF 165
VLG + E L + + N + + D VP +K+ Q + VDISF
Sbjct: 330 VVLG---LWEKLPLRTLEFELVSRNIAEACTVRVLDKASVPI-IKLTDRETQ-VKVDISF 384
Query: 166 NQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
N +G+ + +++ R ++GK L + +L++ + G IS+Y+L +
Sbjct: 385 NMQSGVQSAELIKQFKRDYPVLGKLVLVLKQFLLLR-------DLNEVFTGGISSYSLIL 437
Query: 223 MVL 225
M +
Sbjct: 438 MCI 440
>gi|452839453|gb|EME41392.1| hypothetical protein DOTSEDRAFT_46399 [Dothistroma septosporum
NZE10]
Length = 754
Score = 43.1 bits (100), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 82/186 (44%), Gaps = 35/186 (18%)
Query: 10 MEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKK 69
+ D+ +E L P+HT+ + P + + + L + I + ++P E+
Sbjct: 290 LGDIVDEWL--PNHTNPTPWIQPDP-----AFTSSVGLKLHKEILDFYDYVRPHRHEEEL 342
Query: 70 RKEVINYIQRLINGYY-------GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLAR 122
R +I+ +QR + + IEV FGS P YLP D+DL L ++ L R
Sbjct: 343 RAGIIDRLQRDLQYFRQIGPNVNKIEVRSFGSFPAGLYLPTADMDLVALSSDYLDHGLKR 402
Query: 123 FVCKILENEDQDSVFEIKD-------------VQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+C+I ++ ++++ D + A+V +VK I VD+SF
Sbjct: 403 -LCQIRKH-----MWKMSDHFNRSRLPAPGTVAPVIGAKVPLVKFVDGHTGIKVDLSFEN 456
Query: 168 MAGLSA 173
+GL+A
Sbjct: 457 DSGLTA 462
>gi|410730487|ref|XP_003671423.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
gi|401780241|emb|CCD26180.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
Length = 582
Score = 42.7 bits (99), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 43/185 (23%), Positives = 79/185 (42%), Gaps = 15/185 (8%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL E I++ + I P+ K+R + I I+ +N ++ + FGS YLP
Sbjct: 168 WLTME--IRDFVSYISPSKEEIKERNDTIGRIRDAVNHFWNDANLHVFGSYATDLYLPGS 225
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
DID ++ + ++ + + L+ + E+ A+V I+K I
Sbjct: 226 DIDCVIISEKGDKDSRSSLYALANFLKKRGLATDIEV----IAKARVPIIKFIDPRSKIH 281
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
+D+SF ++ GL A K+ R D R + LI + R+ H G + +++
Sbjct: 282 IDVSFERINGLEAA----KLIREWLNDTPGLREITLIVKQFLHARRLNNVHTGGLGGFSI 337
Query: 221 EMMVL 225
+V
Sbjct: 338 ICLVF 342
>gi|45554544|ref|NP_996381.1| Trf4-1, isoform E [Drosophila melanogaster]
gi|161077643|ref|NP_996382.2| Trf4-1, isoform C [Drosophila melanogaster]
gi|442615523|ref|NP_996383.2| Trf4-1, isoform H [Drosophila melanogaster]
gi|442615525|ref|NP_572490.2| Trf4-1, isoform I [Drosophila melanogaster]
gi|442615527|ref|NP_001259340.1| Trf4-1, isoform J [Drosophila melanogaster]
gi|33636605|gb|AAQ23600.1| RE04457p [Drosophila melanogaster]
gi|45446864|gb|AAS65290.1| Trf4-1, isoform E [Drosophila melanogaster]
gi|158031751|gb|AAS65289.2| Trf4-1, isoform C [Drosophila melanogaster]
gi|440216540|gb|AAS65288.2| Trf4-1, isoform H [Drosophila melanogaster]
gi|440216541|gb|AAF46390.3| Trf4-1, isoform I [Drosophila melanogaster]
gi|440216542|gb|AGB95183.1| Trf4-1, isoform J [Drosophila melanogaster]
Length = 1001
Score = 42.7 bits (99), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 86/209 (41%), Gaps = 15/209 (7%)
Query: 22 SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
+H + Y P P E + E I+ + P R EV+ I+ ++
Sbjct: 244 NHMEMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 303
Query: 82 NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
+ + V FGS +LP DIDL VLG + + LE E + E
Sbjct: 304 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEA 357
Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
V+ + A V I+K + + + VDISFN +G+ + ++K R D+ ++L
Sbjct: 358 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKR----DYPVLEKLVL 413
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL 225
+ + G IS+Y+L +M +
Sbjct: 414 VLKQFLLLRDLNEVFTGGISSYSLILMCI 442
>gi|147787660|emb|CAN69576.1| hypothetical protein VITISV_028613 [Vitis vinifera]
Length = 192
Score = 42.7 bits (99), Expect = 0.58, Method: Composition-based stats.
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 236 DWDNYCISINGPVAISSLPEIVAETVENDGDELLL 270
DWD++C+S+ GPV ISSLP+ E ELLL
Sbjct: 148 DWDSFCVSLWGPVPISSLPDATTEPPRQGSRELLL 182
>gi|195355122|ref|XP_002044042.1| GM21691 [Drosophila sechellia]
gi|194129295|gb|EDW51338.1| GM21691 [Drosophila sechellia]
Length = 1000
Score = 42.7 bits (99), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 78/181 (43%), Gaps = 15/181 (8%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDL 108
E I+ + P R EV+ I+ +++ + V FGS +LP DIDL
Sbjct: 273 HEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPTSDIDL 332
Query: 109 TVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
VLG + + LE E + E V+ + A V I+K + + + VDIS
Sbjct: 333 VVLGL------WEKLPLRTLEFELVSRGIAEACTVRVLDKASVPIIKLTDRETQVKVDIS 386
Query: 165 FNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV 224
FN +G+ + ++K R D+ ++L+ + G IS+Y+L +M
Sbjct: 387 FNMQSGVQSAELIKKFKR----DYPVLEKLVLVLKQFLLLRDLNEVFTGGISSYSLILMC 442
Query: 225 L 225
+
Sbjct: 443 I 443
>gi|50557292|ref|XP_506054.1| YALI0F30525p [Yarrowia lipolytica]
gi|49651924|emb|CAG78867.1| YALI0F30525p [Yarrowia lipolytica CLIB122]
Length = 716
Score = 42.7 bits (99), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 64/141 (45%), Gaps = 15/141 (10%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E + + + I P K R++++ ++ +NG +G +V GS +LP
Sbjct: 248 WL--QEEVMDFVKYISPTQSEIKARQDLVERVRGAVNGLWGDAKVHVLGSFTTNMHLPQS 305
Query: 105 DIDLTVL---GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYV-PAQVKIVKC--SVQN 158
DIDL V GH + + I S ++ ++Q + A+V I+K S
Sbjct: 306 DIDLVVCSPHGHYGERACIYQLSSVIR------SRMKVAELQTITKARVPIIKFIDSRTG 359
Query: 159 IPVDISFNQMAGLSALCFLEK 179
+ VDISF + G+ + K
Sbjct: 360 VHVDISFEKDGGIKTASTITK 380
>gi|323307579|gb|EGA60848.1| Trf5p [Saccharomyces cerevisiae FostersO]
Length = 575
Score = 42.7 bits (99), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 108 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 165
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 166 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 223
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 224 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 278
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 279 CLVYSFLNMH 288
>gi|256271295|gb|EEU06367.1| Trf5p [Saccharomyces cerevisiae JAY291]
gi|349580652|dbj|GAA25811.1| K7_Trf5p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 642
Score = 42.7 bits (99), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|299752783|ref|XP_002911796.1| Trf5 [Coprinopsis cinerea okayama7#130]
gi|298409998|gb|EFI28302.1| Trf5 [Coprinopsis cinerea okayama7#130]
Length = 816
Score = 42.4 bits (98), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 54/127 (42%), Gaps = 4/127 (3%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDI 106
+ + ++ + I P V ++ R ++ I + + V PFGS K YLP GDI
Sbjct: 277 MMHKEVEAFVKWISPTPVEDEIRGLIVKQIAVTVQSKFPDASVLPFGSYETKLYLPMGDI 336
Query: 107 DLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK--CSVQNIPVDIS 164
DL +L + V L N + + + A+V IVK + VDIS
Sbjct: 337 DLVILSESMAYSNKVS-VLHTLANTLKRAGITSRVTVIAKARVPIVKFVTTHGRFNVDIS 395
Query: 165 FNQMAGL 171
NQ GL
Sbjct: 396 INQENGL 402
>gi|190409265|gb|EDV12530.1| hypothetical protein SCRG_03424 [Saccharomyces cerevisiae RM11-1a]
Length = 642
Score = 42.4 bits (98), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|348690009|gb|EGZ29823.1| hypothetical protein PHYSODRAFT_466551 [Phytophthora sojae]
Length = 666
Score = 42.4 bits (98), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 35/145 (24%), Positives = 68/145 (46%), Gaps = 10/145 (6%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL-----ARFVCKILENEDQDSVFEIKDV 142
+V FGS +LP D+DL +L +V + + + K+LE + + D
Sbjct: 202 KVETFGSYSTGIWLPSSDVDLVILDVVDVNDSQLTAKHLKELAKVLEKKKWVESLLVLDT 261
Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWC 201
VP +K+V ++P+DI+F A S L + + R L+ +++ +
Sbjct: 262 AKVPV-LKLVSAET-SVPIDITFESAATHSGLLARDLIKRYADNMPELYPLAIVFKQ--L 317
Query: 202 YYESRILGAHYGLISTYALEMMVLH 226
E + A+ G +S+Y++ +M++H
Sbjct: 318 LRERDLNDAYTGGLSSYSVVLMIIH 342
>gi|81360384|ref|NP_014100.2| non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
S288c]
gi|148887014|sp|P48561.2|TRF5_YEAST RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
Full=Topoisomerase 1-related protein TRF5
gi|151944249|gb|EDN62528.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
gi|285814367|tpg|DAA10261.1| TPA: non-canonical poly(A) polymerase TRF5 [Saccharomyces
cerevisiae S288c]
Length = 642
Score = 42.4 bits (98), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|207341968|gb|EDZ69877.1| YNL299Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 642
Score = 42.4 bits (98), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|241955483|ref|XP_002420462.1| poly(A) polymerase, putative; polynucleotide adenylyltransferase,
putative [Candida dubliniensis CD36]
gi|223643804|emb|CAX41541.1| poly(A) polymerase, putative [Candida dubliniensis CD36]
Length = 558
Score = 42.4 bits (98), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 47/226 (20%), Positives = 91/226 (40%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
+ KKR EV+ Q+L+ + G +VF FGS L Y P DID
Sbjct: 43 ATKKRVEVLTLFQQLVQEFVYTVSKSKNMSDAMAQDAGGKVFTFGSYRLGVYGPGSDIDT 102
Query: 109 TVLGHQNVEED-----LARFVCKILENED----QDSVFEIKDVQY--VPAQVKIVKCSVQ 157
V+ ++V D A + K E E+ D+ I +++ + + + K ++
Sbjct: 103 LVVVPKHVTRDDFFSVFADIIRKRPELEEIACVPDAYVPIIKIEFDGISIDLIMAKLNIP 162
Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
+P+D++ + + L+ +++ +L+ K +FK ++ IK W +
Sbjct: 163 RVPLDLTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHALRCIKMWA-QQRA 221
Query: 207 ILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ G +G A M+V + +F + Y ++W
Sbjct: 222 VYGNIFGFPGGVAWAMLVARICQLYPNAVSSVIVEKFFNIYTKWNW 267
>gi|392296994|gb|EIW08095.1| Trf5p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 608
Score = 42.4 bits (98), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 141 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 198
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 199 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 256
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 257 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 311
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 312 CLVYSFLNMH 321
>gi|242014489|ref|XP_002427922.1| PAP-associated domain-containing protein, putative [Pediculus
humanus corporis]
gi|212512406|gb|EEB15184.1| PAP-associated domain-containing protein, putative [Pediculus
humanus corporis]
Length = 531
Score = 42.4 bits (98), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 85/179 (47%), Gaps = 11/179 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
+ I E + P + R +V++ I+ I+ + +V FGS YLP DIDL
Sbjct: 116 HDEIIEFYNYMSPRKCEHELRLKVVHKIKSAIHELWPQAKVEVFGSFRTGLYLPTSDIDL 175
Query: 109 TVLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
V+G L + K+LENE + ++ D VP + ++ ++ VDISFN
Sbjct: 176 VVIGCWPTLP-LWKLEQKLLENEIAEQHTIKVLDKASVP--IIKLRDKESDVKVDISFNM 232
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMMVL 225
+G+ + ++ + K + + V+++K + R L + G IS+Y+L +M +
Sbjct: 233 SSGVQSAQLIKTFKK---KFPVLAKLVLVLKQ--FLLQRDLNEVFTGGISSYSLILMTI 286
>gi|301119917|ref|XP_002907686.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106198|gb|EEY64250.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 647
Score = 42.4 bits (98), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 74/173 (42%), Gaps = 14/173 (8%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL-----ARFVCKILENEDQDSVFEIKDV 142
+V FGS +LP D+DL +L V + R + K+LE + D
Sbjct: 189 KVETFGSYSTGIWLPSSDVDLVILDVVEVNDSKLTAKHLRQLAKVLEKKKWVESLLCLDA 248
Query: 143 QYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWC 201
VP +K+V ++P+DI+F A S L + + R L+ +++ +
Sbjct: 249 AKVPV-LKLVSAET-SVPIDITFESAATHSGLLARDLIKRYADNMPELYPLAIVFKQ--L 304
Query: 202 YYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLP 254
E + A+ G +S+Y++ +M++H F + D C A SLP
Sbjct: 305 LRERDLNDAYTGGLSSYSVVLMIIH-FSQLWRNGD---LCFEAASIYASGSLP 353
>gi|159123133|gb|EDP48253.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
A1163]
Length = 703
Score = 42.4 bits (98), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 49/217 (22%), Positives = 89/217 (41%), Gaps = 32/217 (14%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +Q YYG+++ PFGS YLP DIDL +L +
Sbjct: 273 VKPMQYEQIVRADLITRLQVAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSSNFMRN 332
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L+N + I+ + + A+V I+K + + VD+SF+
Sbjct: 333 GIKTFGERKGQIYAFAAFLKNLEIAVPNSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 390
Query: 168 MAGLSA-----------------LCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
+GL A + +++ L G + + + C S +
Sbjct: 391 DSGLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGGLGGFSITCLVTSLLQHL 450
Query: 211 HYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGP 247
+G + L +++ F Y N FD++N I +N P
Sbjct: 451 PHGHTAP-NLGSILMDFFEFYGNNFDFENVGIRLNPP 486
>gi|259149072|emb|CAY82314.1| Trf5p [Saccharomyces cerevisiae EC1118]
Length = 642
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|170592851|ref|XP_001901178.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158591245|gb|EDP29858.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 421
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 131/295 (44%), Gaps = 20/295 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E + ++ ++P+ + R V ++ ++ + ++ FGS+ +LP DID+
Sbjct: 68 HEELLDLYAWLKPSPLERALRLRVFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDIDV 127
Query: 109 TVLGHQNVEE-DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
V EE L + + E+ +S+ + D +VP VK+V + I +DISFN
Sbjct: 128 VVESDLVSEEPPLWKTAIALKESGITESI-NVLDKAFVPI-VKMVDKDTK-IYLDISFNT 184
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
+ G+ + F+E + + + + V+++K + + ++ G +S+Y L +M++
Sbjct: 185 VQGVRSAKFIEDMKM---RYPVLEPLVLVLKQFL-MQRQLNQVFTGGLSSYGLILMLIS- 239
Query: 228 FLDYYNTFDWDNYCIS-INGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPI 286
FL + ++D+ I+ +N V + + ++ + L + C++ V +
Sbjct: 240 FLQLHPSYDYSYKRITEVNMGVLLLNFLQLYGQEFNYMKTALRIHSGGAYVCKDEILVQM 299
Query: 287 TALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
N+ L I DPL+ N++GR N +R A + L +
Sbjct: 300 NRPSNS-------MLCIEDPLQPGNDIGR--CSHNIQLVRQAFEHAFATLCAVFV 345
>gi|323303298|gb|EGA57094.1| Trf5p [Saccharomyces cerevisiae FostersB]
Length = 419
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 126/300 (42%), Gaps = 28/300 (9%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 141 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 198
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 199 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 256
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 257 VSFERTXGLEA----AKLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 311
Query: 223 MVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFL----KKC 278
+++ FL+ + ++ + N V + E+ + D + +S + K C
Sbjct: 312 CLVYSFLNMHPRIKSNDIDVLDNLGVLLIDFFELYGKNFGYDDVAISISDGYASYIPKSC 371
Query: 279 REIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGE 338
LE + +F+ L I DP NNN+ R N I+ L + + +G+
Sbjct: 372 -------WRTLEPSRSKFS---LAIQDPGDXNNNISR--GSFNMKDIKKXLCWCLRTVGQ 419
>gi|225681443|gb|EEH19727.1| polymerase sigma [Paracoccidioides brasiliensis Pb03]
Length = 684
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 12/125 (9%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R ++I +QRL +YG ++ FGS YLP D+DL +L Q + +
Sbjct: 268 VKPKSFEHLIRNDLIVRLQRLFERRHYGSQLHAFGSFASGLYLPTADMDLVLLSRQFLRQ 327
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SVQNIPVDISFNQM 168
+ + E + + D+ VP A+V I+K + + VD+SF+
Sbjct: 328 NRKTLCQRTREIYSFTAYLKDLDIA-VPGSIETIAHAKVPIIKFVDRLTGLKVDLSFDNS 386
Query: 169 AGLSA 173
GL+A
Sbjct: 387 TGLAA 391
>gi|161077646|ref|NP_001096913.1| Trf4-1, isoform F [Drosophila melanogaster]
gi|161077648|ref|NP_001096914.1| Trf4-1, isoform G [Drosophila melanogaster]
gi|158031752|gb|ABW09362.1| Trf4-1, isoform F [Drosophila melanogaster]
gi|158031753|gb|ABW09363.1| Trf4-1, isoform G [Drosophila melanogaster]
Length = 802
Score = 42.4 bits (98), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 86/209 (41%), Gaps = 15/209 (7%)
Query: 22 SHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI 81
+H + Y P P E + E I+ + P R EV+ I+ ++
Sbjct: 45 NHMEMIAKYKGEPWRKPDYPYGEGVIGLHEEIEHFYQYVLPTPCEHAIRNEVVKRIEAVV 104
Query: 82 NGYYGIEVFP-FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE-DQDSVFEI 139
+ + V FGS +LP DIDL VLG + + LE E + E
Sbjct: 105 HSIWPQAVVEIFGSFRTGLFLPTSDIDLVVLGL------WEKLPLRTLEFELVSRGIAEA 158
Query: 140 KDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
V+ + A V I+K + + + VDISFN +G+ + ++K R D+ ++L
Sbjct: 159 CTVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAELIKKFKR----DYPVLEKLVL 214
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL 225
+ + G IS+Y+L +M +
Sbjct: 215 VLKQFLLLRDLNEVFTGGISSYSLILMCI 243
>gi|365763604|gb|EHN05131.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 42.4 bits (98), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|440796505|gb|ELR17614.1| nucleotidyltransferase domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 911
Score = 42.4 bits (98), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 79/169 (46%), Gaps = 12/169 (7%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
+ P+ ++ R++VI I +++ + +++ FGS YLP DIDL ++G
Sbjct: 272 VSPSAEEKQMREDVIARISKVVETLWPSVQLRVFGSCATDIYLPTSDIDLCIMGANACSP 331
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLE 178
+ L V I + VP +K+V + + VDISF+ G + + ++
Sbjct: 332 SPIDELASALRRRSMGRVQAIATAR-VPI-IKLVDAATGCL-VDISFDVPTGPAHINLIK 388
Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESR--ILGAHYGLISTYALEMMVL 225
R + ++ K +LIK YY + + + G + +YAL +M++
Sbjct: 389 ---RYLDEEPSVKPLALLIK---YYLKQFGMNEPYTGGLGSYALIIMII 431
>gi|124806252|ref|XP_001350671.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|23496797|gb|AAN36351.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 1172
Score = 42.4 bits (98), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 77/144 (53%), Gaps = 15/144 (10%)
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFE---IKDVQYVPAQ 148
FGS + + DID+ + + ++ DL + K+ +N +FE IK + V A+
Sbjct: 454 FGSCNNDIDIYNSDIDICI--YNTIQSDLIN-IRKLYKNMLHHPLFENVSIKKI--VHAK 508
Query: 149 VKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
V I+KC + + VDISFNQ++ L++ + I K+ L K +I +K + Y+
Sbjct: 509 VPIIKCFFNYTKLSVDISFNQLSALTSTI---QTQTYIQKNPLIKYILIFLKMFL-YQHD 564
Query: 207 ILGAHYGLISTYALEMMVLHRFLD 230
+ A G IS++ + +++L +FL+
Sbjct: 565 LNDASKGGISSFNI-LLLLAQFLN 587
>gi|323346953|gb|EGA81231.1| Trf5p [Saccharomyces cerevisiae Lalvin QA23]
Length = 616
Score = 42.4 bits (98), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|295671679|ref|XP_002796386.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226283366|gb|EEH38932.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 725
Score = 42.4 bits (98), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 12/125 (9%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R ++I +QRL +YG ++ FGS YLP D+DL +L Q + +
Sbjct: 306 VKPKSFEHLIRNDLIVRLQRLFERRHYGSQLHAFGSFASGLYLPTADMDLVLLSRQFLRQ 365
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SVQNIPVDISFNQM 168
+ + E + + D+ VP A+V I+K + + VD+SF+
Sbjct: 366 NRKTLCQRTREIYSFTAYLKDLDIA-VPGSIETIAHAKVPIIKFVDRLTGLKVDLSFDNS 424
Query: 169 AGLSA 173
GL+A
Sbjct: 425 TGLAA 429
>gi|115441021|ref|NP_001044790.1| Os01g0846500 [Oryza sativa Japonica Group]
gi|56784029|dbj|BAD82657.1| unknown protein [Oryza sativa Japonica Group]
gi|56784702|dbj|BAD81828.1| unknown protein [Oryza sativa Japonica Group]
gi|113534321|dbj|BAF06704.1| Os01g0846500 [Oryza sativa Japonica Group]
gi|222619532|gb|EEE55664.1| hypothetical protein OsJ_04062 [Oryza sativa Japonica Group]
Length = 381
Score = 42.4 bits (98), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 33/198 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPD 103
E+ ++IL I+P E R + I IQ L + Y G V PFGS + Y
Sbjct: 13 EKCTEDILSLIKPV---EGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKS 69
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI----------KDVQYVP-AQVKIV 152
GD+D++V E I + + QD++ E+ + ++++P A+V ++
Sbjct: 70 GDLDVSV-------ELFNALNLPISKRKKQDTLREVRRALQKRGIARHMEFIPNARVPVL 122
Query: 153 K--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
+ + I DIS + G ++ L D F V+L+K W ++ I
Sbjct: 123 QYVSNQYGISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDP 178
Query: 211 HYGLISTYALEMMVLHRF 228
G +++Y+L ++VL F
Sbjct: 179 KNGTLNSYSLCLLVLFHF 196
>gi|226288571|gb|EEH44083.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 684
Score = 42.0 bits (97), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 58/125 (46%), Gaps = 12/125 (9%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R ++I +QRL +YG ++ FGS YLP D+DL +L Q + +
Sbjct: 268 VKPKSFEHLIRNDLIVRLQRLFERRHYGSQLHAFGSFASGLYLPTADMDLVLLSRQFLRQ 327
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYVP--------AQVKIVKC--SVQNIPVDISFNQM 168
+ + E + + D+ VP A+V I+K + + VD+SF+
Sbjct: 328 NRKTLCQRTREIYSFTAYLKDLDIA-VPGSIETIAHAKVPIIKFVDRLTGLKVDLSFDNS 386
Query: 169 AGLSA 173
GL+A
Sbjct: 387 TGLAA 391
>gi|323331833|gb|EGA73245.1| Trf5p [Saccharomyces cerevisiae AWRI796]
Length = 453
Score = 42.0 bits (97), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|344255571|gb|EGW11675.1| PAP-associated domain-containing protein 5 [Cricetulus griseus]
Length = 382
Score = 42.0 bits (97), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 57/123 (46%), Gaps = 19/123 (15%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH------QNVEEDLAR 122
R EV++ I+ +I + +V FGS YLP DIDL V G +EE L +
Sbjct: 2 RMEVVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLPLWTLEEALRK 61
Query: 123 FVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKV 180
D+DSV ++ D VP I+K S + VDISFN G+ A ++
Sbjct: 62 HKV-----ADEDSV-KVLDKATVP----IIKLTDSFTEVKVDISFNVQNGVRAADLIKDF 111
Query: 181 DRL 183
+L
Sbjct: 112 TKL 114
>gi|323335976|gb|EGA77253.1| Trf5p [Saccharomyces cerevisiae Vin13]
Length = 576
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|56784701|dbj|BAD81827.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 408
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 75/320 (23%), Positives = 124/320 (38%), Gaps = 72/320 (22%)
Query: 85 YGIEVFPFGSVPLKTYLPDGDIDLTV------LGHQNVEEDLARFVCKILENEDQDSVFE 138
YG PFGS Y GD+D++V + + ++ + R + ++L+N
Sbjct: 7 YGAVFKPFGSFVSNLYSNSGDLDISVHLPNNSIISKKKKQYVLRELMRVLQNRGVAGY-- 64
Query: 139 IKDVQYVP-AQVKIVK--CSVQNIPVDISFNQMAGL---SALCFLEKVDRLIGKDHLFKR 192
VQ+VP A+V +++ + I DIS N G C++ +D G
Sbjct: 65 ---VQFVPFARVPVLQYVSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGD------ 115
Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
V+LIK W ++ I G +++Y+L ++VL F C P +
Sbjct: 116 MVLLIKEWAKAQN-INDPKTGTLNSYSLCLLVLFHF----------QTC----EPAILPP 160
Query: 253 LPEIVAETVEND-GDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLN-IVDPLKDN 310
L EI +E L L+ L +Y +HL+ + DP++
Sbjct: 161 LKEIYEGNIEEGIAAYLYLNSLHLHTEMTVYDE--------------EHLDEVKDPIERP 206
Query: 311 NNLGRSVSKGNFHRIRCALSYGAQRLG-----------EILTLP--GHCLGMGLEKFFI- 356
+N R+V RI A + ++ E+L P G LG+ + + +
Sbjct: 207 DNAARAVDLKGLERIAGAFTAANRKFASLQHAKRNDLLEMLCTPAVGSKLGIRVSQEVMA 266
Query: 357 ----NTLERNGKGLRPDVQV 372
NT RN RP QV
Sbjct: 267 NSCTNTPRRNRHHGRPTEQV 286
>gi|1050861|gb|AAC49099.1| Ynl0440p [Saccharomyces cerevisiae]
gi|1302392|emb|CAA96217.1| TRF5 [Saccharomyces cerevisiae]
Length = 625
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 346 CLVYSFLNMH 355
>gi|67539048|ref|XP_663298.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
gi|40743597|gb|EAA62787.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
gi|259484832|tpe|CBF81391.1| TPA: topoisomerase family protein TRF4, putative (AFU_orthologue;
AFUA_7G04130) [Aspergillus nidulans FGSC A4]
Length = 694
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 71/171 (41%), Gaps = 21/171 (12%)
Query: 15 EELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVI 74
+E PS T + PP L+ L + I ++P R++++
Sbjct: 228 DEWKLRPSETGTPWLSLMPPTLH-------LGTRLHDEILSFYHWVKPVRYEHIVRQDLV 280
Query: 75 NYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF---------V 124
+Q YYG+E+ FGS YLP+ DIDL +L + F
Sbjct: 281 ARLQAAFQSRYYGVEIHAFGSFASGLYLPNADIDLVLLSTSFRRTGVKTFGERKGQIYAF 340
Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA 173
L+N++ ++ + + A+V I+K + + VD+SF+ +GL A
Sbjct: 341 SAFLKNQNIAVPGSVETIAH--ARVPILKFVDKLTGLKVDLSFDNDSGLIA 389
>gi|345326611|ref|XP_003431063.1| PREDICTED: hypothetical protein LOC100681530 [Ornithorhynchus
anatinus]
Length = 411
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 6/69 (8%)
Query: 460 HPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRARNPESLNHVPVSKLRR 519
H Y QV S FS RG G + P+ IE++ P R P+S NH+ V RR
Sbjct: 209 HSY--QVFESPFSVVNFSSQRGQGVFSPQTI----IEVDSPGVCRVPKSDNHIAVEDKRR 262
Query: 520 KIDKAERLP 528
+++++ R P
Sbjct: 263 RVEESRRAP 271
>gi|118376886|ref|XP_001021625.1| Poly polymerase central domain containing protein [Tetrahymena
thermophila]
gi|89303391|gb|EAS01379.1| Poly polymerase central domain containing protein [Tetrahymena
thermophila SB210]
Length = 707
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 21/143 (14%)
Query: 52 RIQEILCTIQPAI-VSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV 110
+IQEI+ I ++E+K E + ++Q V PFGS L DGD+DL
Sbjct: 176 KIQEIVNQWMVNININERKIPEQVAHMQLA-------RVLPFGSYFLGVSSVDGDVDLVC 228
Query: 111 LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQY----VPAQVKIVKCSVQNIPVDISFN 166
+ V+ RF K + D + + V+Y + A+V I++ +Q +P+DISF
Sbjct: 229 IAPNFVD----RF--KHFNGQLYDMISNMDGVEYHNNVIDAKVPIMQFEIQGVPIDISF- 281
Query: 167 QMAGLSALCFLEKVDRLIGKDHL 189
A L + +D+ I HL
Sbjct: 282 --AQLDVETLPDDIDKQIPDTHL 302
>gi|66821005|ref|XP_644037.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
gi|60472363|gb|EAL70316.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
Length = 1880
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 16/145 (11%)
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKILENE-DQDSVFEIKDVQYVP-A 147
+GS +LP DID+ V +N ++ A+F+ K+LE + D I + +P A
Sbjct: 1518 YGSFMTGLWLPSSDIDIVVNYGKNMAIKPKNAQFLLKVLEKQFRTDLNGMILTMLCIPSA 1577
Query: 148 QVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAW 200
++ ++K + +NI VDISF + G++A + V +IG L+ +++L W
Sbjct: 1578 KIPVIKLVTTENISVDISFRESPTSVHTGIAARDLVADCVREIIG---LYPLAIVL--KW 1632
Query: 201 CYYESRILGAHYGLISTYALEMMVL 225
E + + G +S+Y L +M++
Sbjct: 1633 FLRERGLNNTYTGGLSSYCLVLMLI 1657
>gi|157133264|ref|XP_001656207.1| sigma DNA polymerase, putative [Aedes aegypti]
gi|108870912|gb|EAT35137.1| AAEL012692-PA [Aedes aegypti]
Length = 491
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 81/168 (48%), Gaps = 17/168 (10%)
Query: 62 PAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDL 120
P R +V+ I++++ N + V FGS YLP DIDL V+G + E L
Sbjct: 3 PTQTEHALRVQVVARIEQIVLNLWPAARVEMFGSFRTGLYLPTSDIDLVVIG---LWEKL 59
Query: 121 ARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISFNQMAGLSALCF 176
+ LENE + E V+ + A V IVK + + + VDISFN +G+ +
Sbjct: 60 P---LRTLENELINRGIAEPMSVRVLDKASVPIVKLTDRETQVKVDISFNMQSGVQSAEL 116
Query: 177 LEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALEMM 223
++ R K + + V+++K + R L + G IS+Y+L +M
Sbjct: 117 IKDFKR---KYPVLAKLVLVLKQFLL--QRDLNEVFTGGISSYSLILM 159
>gi|189204442|ref|XP_001938556.1| PAP-associated domain containing protein 5 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985655|gb|EDU51143.1| PAP-associated domain containing protein 5 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 599
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 60/269 (22%), Positives = 108/269 (40%), Gaps = 47/269 (17%)
Query: 15 EELLTSPSHTSSSSSYSSPPNLY-PFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEV 73
++ L P+ TS+ + S ++Y S + + WL E I + + P +R +
Sbjct: 224 DDWLPLPNQTSTP--WLSAEDVYVHLSGEPDKWLHNE--ILDFYDFVAPKAYEHDQRNRL 279
Query: 74 INYIQRLIN----GYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILE 129
+N I + + + FGS P+ YLP D+DL + ++ V + +
Sbjct: 280 VNRINSALGQRRFAHDNGRLLCFGSFPVGLYLPTADLDLVYVSDRHYRG--GEPVIDVSD 337
Query: 130 NEDQDSVF-----EIKD-------VQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSA-L 174
+ + +KD Q + A+V I+K + + VDISF ++G+ A
Sbjct: 338 RKASKGILFKAARRLKDTDIPQGYAQVINAKVPIIKFQDKLTKLQVDISFENLSGVQAQA 397
Query: 175 CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH-------- 226
F+E + D ++ V L+K + + H G I Y++ +++
Sbjct: 398 TFVEWKAKY--PDMIYM--VALMKQFLVMRG-LNEVHTGGIGGYSIICLIVSYLQLEKKP 452
Query: 227 --------RFLDYYNTFDWDNYCISINGP 247
FL YY FD Y I +N P
Sbjct: 453 ENLGECFLGFLKYYGNFDLARYRIQMNPP 481
>gi|156057775|ref|XP_001594811.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980]
gi|154702404|gb|EDO02143.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 794
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/137 (23%), Positives = 62/137 (45%), Gaps = 10/137 (7%)
Query: 45 LWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPD 103
WL E I + ++P E R+ ++ ++ YY ++ FGS P YLP
Sbjct: 459 FWLHKE--IMDFYHYVKPRKFEEVIRQRLLVNLRENFRKYYRDADILAFGSFPAGLYLPT 516
Query: 104 GDIDLTVLGHQNVEEDLARF-----VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SV 156
D+D+ ++ +Q ++ ++ + ++ N ++D + + V A+V + K
Sbjct: 517 ADMDVVMVSNQYMDGGYPKYNSKSHIWRVKGNVEKDRLNTGRIEVIVGAKVPLAKWVDGF 576
Query: 157 QNIPVDISFNQMAGLSA 173
+ VD+SF GL A
Sbjct: 577 TGLKVDMSFENDTGLIA 593
>gi|115673160|ref|XP_796681.2| PREDICTED: uncharacterized protein LOC592046 [Strongylocentrotus
purpuratus]
Length = 830
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 54/115 (46%), Gaps = 11/115 (9%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKIL 128
R+EV+ IQ ++ + +V +GS YLP DIDL + G D+ L
Sbjct: 156 RREVVQRIQGIVRSIWPKAKVEIYGSTRTMLYLPTSDIDLVLFG------DIGESPFFRL 209
Query: 129 ENE-DQDSVFEIKDVQYV-PAQVKIVKCS--VQNIPVDISFNQMAGLSALCFLEK 179
NE ++ + E ++ + A V IVK + V + VDISFN G +E+
Sbjct: 210 GNELEKSGIAEQGSIKVLDKASVPIVKLTDNVTKVRVDISFNMQTGTDCAKLIEE 264
>gi|356569346|ref|XP_003552863.1| PREDICTED: poly(A) RNA polymerase cid11-like [Glycine max]
Length = 415
Score = 41.6 bits (96), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 86/192 (44%), Gaps = 26/192 (13%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
+ +IL + P + R +IN ++ ++ G V PFGS + GD+D++
Sbjct: 11 VNDILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNLFTRWGDLDIS 70
Query: 110 V----------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSV-- 156
+ G + + L V K L + S ++Q++ A+V I+K
Sbjct: 71 IELSNGLHISSAGKKQKQTFLGD-VLKALRMKGGGS-----NLQFISNARVPILKFKSYR 124
Query: 157 QNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIS 216
Q + DIS N + G L ++++ G+ F+ V+L+K W +I + G +
Sbjct: 125 QGVSCDISINNLPGQMKSKILLWINKIDGR---FRHMVLLVKEWAKAH-KINNSKAGTFN 180
Query: 217 TYALEMMVLHRF 228
+Y+L ++V+ F
Sbjct: 181 SYSLSLLVIFYF 192
>gi|21240658|gb|AAM44369.1| putative protein contains similarity to topoisomerase-related
function protein [Dictyostelium discoideum]
Length = 1880
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 74/145 (51%), Gaps = 16/145 (11%)
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKILENE-DQDSVFEIKDVQYVP-A 147
+GS +LP DID+ V +N ++ A+F+ K+LE + D I + +P A
Sbjct: 1518 YGSFMTGLWLPSSDIDIVVNYGKNMAIKPKNAQFLLKVLEKQFRTDLNGMILTMLCIPSA 1577
Query: 148 QVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFLEK-VDRLIGKDHLFKRSVILIKAW 200
++ ++K + +NI VDISF + G++A + V +IG L+ +++L W
Sbjct: 1578 KIPVIKLVTTENISVDISFRESPTSVHTGIAARDLVADCVREIIG---LYPLAIVL--KW 1632
Query: 201 CYYESRILGAHYGLISTYALEMMVL 225
E + + G +S+Y L +M++
Sbjct: 1633 FLRERGLNNTYTGGLSSYCLVLMLI 1657
>gi|402592089|gb|EJW86018.1| hypothetical protein WUBG_03069 [Wuchereria bancrofti]
Length = 360
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 37/203 (18%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDI 106
L+ EE IQ +Q + R ++ ++R++ G + P GS DI
Sbjct: 71 LIREEAIQRHNSYLQTL----QYRNSIVVGLRRVLARNRGQYLVPVGSTASGLASGISDI 126
Query: 107 DLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIK-----------------DVQYVPA 147
DL L + E L R + K EN + + E+K D+ + A
Sbjct: 127 DLVYLSTTDEKQRETLLRLLSK--ENYRRTFMTEVKTRIEESSLCKEFDWSQTDIIH-TA 183
Query: 148 QVKIVKCSVQ-NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC--YYE 204
+V I+ Q ++ VDI F + A + CF+ + + + V L+ W + E
Sbjct: 184 RVPILHLQTQKHMQVDIQFEKYASIRNTCFVRHCAQ-------YDKRVALLNMWAQKWLE 236
Query: 205 S-RILGAHYGLISTYALEMMVLH 226
S R+ + +GL STY + M+VLH
Sbjct: 237 SQRLKDSKHGLFSTYHVLMLVLH 259
>gi|402583139|gb|EJW77083.1| PAP/25A associated domain-containing protein, partial [Wuchereria
bancrofti]
Length = 416
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 131/295 (44%), Gaps = 20/295 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E + ++ ++P+ + R V ++ ++ + ++ FGS+ +LP DID+
Sbjct: 2 HEELLDLYAWLKPSPLERALRLRVFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDIDV 61
Query: 109 TVLGHQNVEE-DLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
V EE L + + E+ +S+ + D +VP VK+V + I +DISFN
Sbjct: 62 VVESDLVSEEPPLWKTAIALKESGITESI-NVLDKAFVPI-VKMVDKDTK-IYLDISFNT 118
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
+ G+ + F+E + + + + V+++K + + ++ G +S+Y L +M++
Sbjct: 119 VQGVRSAKFIEDMKM---RYPVLEPLVLVLKQFL-MQRQLNQVFTGGLSSYGLILMLIS- 173
Query: 228 FLDYYNTFDWDNYCIS-INGPVAISSLPEIVAETVENDGDELLLSPEFLKKCREIYSVPI 286
FL + ++D+ I+ +N V + + ++ + L + C++ V +
Sbjct: 174 FLQLHPSYDYSYKRITEVNMGVLLLNFLQLYGQEFNYMKTALRIHSGGAYVCKDEILVQM 233
Query: 287 TALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILT 341
N+ L I DPL+ N++GR N +R A + L +
Sbjct: 234 NRPSNS-------MLCIEDPLQPGNDIGR--CSHNIQLVRQAFEHAFATLCAVFV 279
>gi|1493831|gb|AAC49397.1| Trf5p [Saccharomyces cerevisiae]
Length = 625
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 85/188 (45%), Gaps = 12/188 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 175 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 232
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 233 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 290
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 291 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 345
Query: 223 MVLHRFLD 230
+++ FL+
Sbjct: 346 CLVYSFLN 353
>gi|357620388|gb|EHJ72600.1| hypothetical protein KGM_01329 [Danaus plexippus]
Length = 564
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 123/309 (39%), Gaps = 68/309 (22%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I+ + P+ R V+ I+ I + V FGS YLP DIDL
Sbjct: 115 HEEIEHFYMYMSPSETEHLVRTTVVTRIRSAILSLWPQARVEVFGSFRTGLYLPTSDIDL 174
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIP 160
V+G +E +L V + + +D V E A V IVK + ++
Sbjct: 175 VVIGQWEKLPLWTLEREL---VAQDIAEQDSIKVLE-------KATVPIVKMTDKYSDVK 224
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
VDISFN +G+ + +++ + R V+++K + + + G IS+Y+L
Sbjct: 225 VDISFNMSSGVKSAELIKQFKEQYPE---LSRLVMVLKQFL-LQRDLNEVFTGGISSYSL 280
Query: 221 EMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
+M CIS + + PE + + N G L+ EF +
Sbjct: 281 ILM-----------------CISF---LQLHPRPERLRQR-HNLGVLLI---EFFELYGR 316
Query: 281 IYSVPITALE-----------------NTGHEFAIKHLNIVDPLKDNNNLGRSVSKGNFH 323
++ TA+ N GH ++ L I DPL N++GRS S G
Sbjct: 317 KFNYVKTAIRVKNGGSYVSKDEISKEMNDGHRPSL--LCIEDPLTPGNDIGRS-SYGAI- 372
Query: 324 RIRCALSYG 332
+++ A YG
Sbjct: 373 QVKQAFDYG 381
>gi|392598007|gb|EIW87329.1| Nucleotidyltransferase [Coniophora puteana RWD-64-598 SS2]
Length = 383
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 43/167 (25%), Positives = 74/167 (44%), Gaps = 15/167 (8%)
Query: 67 EKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTV-LGHQNVEEDLARFV 124
E R++V + R++ N + EV FGS LP GD+D+ + L H + +L +
Sbjct: 91 EYMREQVRALVHRILRNRFRRSEVNVFGSAAQGLSLPGGDVDIVLSLPHADSRSELKTSL 150
Query: 125 CKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIP------VDISFNQMAGLSALCFLE 178
++ + V +V++ Q +I S Q +P DI N GL AL +E
Sbjct: 151 FQLAAMLKRTGVASDVEVRH---QARIPIISFQTVPELCTLRFDIGVNNEDGLKALPIVE 207
Query: 179 KVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
K +DH + ++L+ + A G I+++ L MV+
Sbjct: 208 KY----MQDHPVLKPLMLVLKSFLSRRGLSSASNGGINSFVLLCMVV 250
>gi|350638915|gb|EHA27270.1| hypothetical protein ASPNIDRAFT_44704 [Aspergillus niger ATCC 1015]
Length = 682
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R++++ +Q YYG+++ PFGS YLP DIDL +L +
Sbjct: 272 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 331
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L+N + I+ + + A+V I+K + + VD+SF+
Sbjct: 332 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 389
Query: 168 MAGLSA 173
+GL A
Sbjct: 390 DSGLVA 395
>gi|317034217|ref|XP_001396205.2| topoisomerase family protein TRF4 [Aspergillus niger CBS 513.88]
Length = 682
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R++++ +Q YYG+++ PFGS YLP DIDL +L +
Sbjct: 272 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 331
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L+N + I+ + + A+V I+K + + VD+SF+
Sbjct: 332 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 389
Query: 168 MAGLSA 173
+GL A
Sbjct: 390 DSGLVA 395
>gi|328713984|ref|XP_001948412.2| PREDICTED: PAP-associated domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 595
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 43/179 (24%), Positives = 81/179 (45%), Gaps = 12/179 (6%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
+ I++ + R +V+ + +I+ + +V FGS YLP D+DL
Sbjct: 119 HKEIEDFFAYMSATPEEHSMRLDVLERVTNVIHAEWPKAKVEVFGSFRTGLYLPTSDMDL 178
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCS--VQNIPVDISFN 166
V+G + L +L+N + ++ D VP IVK + ++ VDISFN
Sbjct: 179 VVIGEWDTLP-LHSLAQALLKNSVCNENIQVLDKASVP----IVKMTDKATDVRVDISFN 233
Query: 167 QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
G+ + E + + + + V+++K + + + YG IS+Y+L +MV+
Sbjct: 234 MNNGVKS---AEMIKHYMDVFPMLPKLVLVLKQFL-AQRDMNEVFYGGISSYSLILMVV 288
>gi|312384741|gb|EFR29395.1| hypothetical protein AND_01692 [Anopheles darlingi]
Length = 1101
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 45/181 (24%), Positives = 81/181 (44%), Gaps = 15/181 (8%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDL 108
+ I + + P R V+N I++++ N + V FGS YLP DIDL
Sbjct: 272 HQEIDQFYNHMIPTPTEHALRVMVVNRIEQIVLNLWPSARVEMFGSFRTGLYLPTSDIDL 331
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDS-VFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
V+G + + LE E + + E V+ + A V IVK + + + VDIS
Sbjct: 332 VVIGQ------WEKLPLRTLEMELINRYIAEPNSVRVLDKASVPIVKLTDRETQVKVDIS 385
Query: 165 FNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV 224
FN +G+ + +++ +D+ ++L+ + + G IS+Y+L +M
Sbjct: 386 FNMESGVQSAKLIKEYK----QDYPVLEKLVLVLKQFLLQRDLNEVFTGGISSYSLILMC 441
Query: 225 L 225
+
Sbjct: 442 I 442
>gi|83765069|dbj|BAE55212.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391870459|gb|EIT79642.1| DNA polymerase sigma [Aspergillus oryzae 3.042]
Length = 390
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +Q YYG+++ FGS YLP DIDL +L +
Sbjct: 74 VKPVEYEQIVRADLIERLQTAFQSRYYGVQLRAFGSFASGLYLPTADIDLVLLSSNFMRH 133
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L N D I+ + + A+V I+K + + VD+SF+
Sbjct: 134 GIKTFGERKGQIYAFSAFLRNLDIAVPGSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 191
Query: 168 MAGLSA 173
+GL A
Sbjct: 192 DSGLVA 197
>gi|328872412|gb|EGG20779.1| hypothetical protein DFA_00644 [Dictyostelium fasciculatum]
Length = 1767
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 78/149 (52%), Gaps = 16/149 (10%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQN--VEEDLARFVCKILENEDQDSV--FEIKDVQ 143
E+ +GS +LP DID+ V ++N + A+F+ ++LE + + + F + +
Sbjct: 1420 EIDIYGSFMTGLWLPSSDIDIVVDYNKNMAIRPKNAQFLLEVLEKQIRHDLNSFILTMLC 1479
Query: 144 YVPAQVKIVK-CSVQNIPVDISFNQ-----MAGLSALCFL-EKVDRLIGKDHLFKRSVIL 196
A++ ++K + +NI VDISF + G++A + + V ++IG L+ +++L
Sbjct: 1480 IPSAKIPVIKLVTKENIAVDISFRESPTSPHTGIAARDLVADNVKKVIG---LYPLAIVL 1536
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVL 225
W + + + G +S+Y L +M++
Sbjct: 1537 --KWFLRDRGLNNTYSGGLSSYCLVLMLI 1563
>gi|449461393|ref|XP_004148426.1| PREDICTED: poly(A) polymerase-like [Cucumis sativus]
Length = 863
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 59/267 (22%), Positives = 105/267 (39%), Gaps = 63/267 (23%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVI-NYIQRLINGYYGIEVFPF 92
L F +D+EL+ EE + +E+L I + S K+ ++ Y ++++ + +F F
Sbjct: 42 LEKFLVDSELYESKEEAAKREEVLGRIDQIVKSWVKQLTLLRGYTEQMVEDANAV-IFTF 100
Query: 93 GSVPLKTYLPDGDIDLTVLGHQNV-EEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
GS L + P DID +G V ED + IL + E+ D+Q VP A V
Sbjct: 101 GSYRLGVHGPGSDIDTLCVGPSYVNREDFFIILHNILAEME-----EVTDLQPVPDAHVP 155
Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVD----------------------------R 182
+++ I +D+ + A +S L E +D R
Sbjct: 156 VMRFKFLGISIDLLY---ASISLLVVPEDLDISHGSVLYNVDEQTVRSLNGCRVADQILR 212
Query: 183 LIGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTYALEM-------MVLHRF 228
L+ F+ ++ +K W + G ++ L+ ++ M++ RF
Sbjct: 213 LVPNVEHFRTTLRCLKFWAKRRGVYSNVTGFLGGVNWALLVAQVCQLYPNAIPSMLVSRF 272
Query: 229 LDYYNTFDWDNYCISINGPVAISSLPE 255
Y + W N PV + S+ E
Sbjct: 273 FRVYTQWRWPN-------PVMLCSIEE 292
>gi|218189366|gb|EEC71793.1| hypothetical protein OsI_04418 [Oryza sativa Indica Group]
Length = 381
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 33/198 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPD 103
E+ ++IL I+P E R + I IQ L + Y G V PFGS + Y
Sbjct: 13 EKCTEDILSLIKPV---EGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKS 69
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI----------KDVQYVP-AQVKIV 152
GD+D++V E I + + QD++ E+ + ++++P A+V ++
Sbjct: 70 GDLDVSV-------ELFNALNLPISKRKKQDTLREVRRALQKRGIARHMEFIPNARVPVL 122
Query: 153 K--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
+ + I DIS + G ++ L D F V+L+K W ++ I
Sbjct: 123 QYVSNQYGISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDP 178
Query: 211 HYGLISTYALEMMVLHRF 228
G +++Y+L ++VL F
Sbjct: 179 KNGTLNSYSLCLLVLCHF 196
>gi|405977000|gb|EKC41473.1| Poly(A) polymerase gamma [Crassostrea gigas]
Length = 264
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/190 (22%), Positives = 84/190 (44%), Gaps = 38/190 (20%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTV 110
E + I ++ I ++K INY++ IN + G +V FGS L + DID
Sbjct: 51 EILNRINTLVRKWIKDVSRQKSEINYLEYKINAFGG-KVCIFGSYRLGAHTKGADIDTVC 109
Query: 111 L--GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFNQ 167
+ GH + + + FV ++L++ E+KD++ + A V ++K + I +D+ F +
Sbjct: 110 IAPGHVDRSDFFSSFV-EVLKHTP-----EVKDLRAIEEAYVPLIKMTFDGIELDMVFAR 163
Query: 168 MA-------------------------GLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
+ L+ +++ L+ H F+R+++ IK W
Sbjct: 164 LTLPQIPDEIDLRDESLLRHLNTKCVRSLNGCRVTDEILHLVPNQHNFRRTLLAIKIWAK 223
Query: 203 YE---SRILG 209
+ S++LG
Sbjct: 224 KKGIYSKVLG 233
>gi|242051340|ref|XP_002463414.1| hypothetical protein SORBIDRAFT_02g043400 [Sorghum bicolor]
gi|241926791|gb|EER99935.1| hypothetical protein SORBIDRAFT_02g043400 [Sorghum bicolor]
Length = 460
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/171 (23%), Positives = 73/171 (42%), Gaps = 22/171 (12%)
Query: 62 PAIVSEKKRKEVINYIQRLI--------------NGYYGIEVFPFGSVPLKTYLPDGDID 107
P+ EKKR++VI +++++ G V +GS L + P+ DID
Sbjct: 102 PSPEDEKKREKVIRELKKIVMHWANAVAYEQSVPQGLATATVLTYGSYTLGAHGPESDID 161
Query: 108 LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
+ +G FV E + V E++ ++ A+V +++ I VD ++ Q
Sbjct: 162 VLCVGPCIATLQYHFFVVLRQLLEGRPEVSELQTIE--KAKVPLMRFRFTGIAVDFTYAQ 219
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
+ + AL + + R I L+ R L C+Y G H +++ Y
Sbjct: 220 LPVIDALKKFQALLRCI---KLWARKRGLH---CHYLGFFAGIHLAILAAY 264
>gi|170593787|ref|XP_001901645.1| hypothetical protein Bm1_50885 [Brugia malayi]
gi|158590589|gb|EDP29204.1| hypothetical protein Bm1_50885 [Brugia malayi]
Length = 380
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 37/203 (18%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDI 106
L+ EE IQ +Q + R ++ ++R++ G + P GS DI
Sbjct: 71 LIREEAIQRHNSYLQTL----QYRNSIVVGLRRVLARNRGQYLVPVGSTASGLASGISDI 126
Query: 107 DLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIK-----------------DVQYVPA 147
DL L + E L R + K EN + + E+K D+ + A
Sbjct: 127 DLVYLSTTDEKQRETLLRLLSK--ENYRRTFMTEVKTRIEESSLCKEFDWSQTDIIHT-A 183
Query: 148 QVKIVKCSVQ-NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC--YYE 204
+V I+ Q ++ VDI F + A + CF+ + + + V L+ W + E
Sbjct: 184 RVPILHLQTQKHMQVDIQFEKYASIRNTCFVRHCAQ-------YDKRVALLNMWAQKWLE 236
Query: 205 S-RILGAHYGLISTYALEMMVLH 226
S R+ + +GL STY + M+VLH
Sbjct: 237 SQRLKDSKHGLFSTYHVLMLVLH 259
>gi|124802317|ref|XP_001347437.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|23495017|gb|AAN35350.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 615
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/179 (24%), Positives = 83/179 (46%), Gaps = 20/179 (11%)
Query: 59 TIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT----VLGH 113
+++P+I K +N++Q IN +Y V PFGSV ++ + DID+ +L +
Sbjct: 289 SLKPSINDINNMKIFLNFLQNEINKHYKNSYVTPFGSVINGFWMRNSDIDICIQIPILLN 348
Query: 114 QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN------IPVDISFNQ 167
+ + + +C +L N + + V E Q A+V I+ N + DIS N
Sbjct: 349 RKDQITFLKKICLLLNNFN-NGVIE----QRFSAKVPIIHFYCNNREKSFELSCDISVNN 403
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+ + ++K + D + I++K W + I G +S+++L +M++H
Sbjct: 404 ILAVINSKLIQKY---VAIDKRLQTMGIVLKYWSKIRN-INDRSKGFLSSFSLILMIIH 458
>gi|358373036|dbj|GAA89636.1| topoisomerase family protein Trf4 [Aspergillus kawachii IFO 4308]
Length = 682
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R++++ +Q YYG+++ PFGS YLP DIDL +L +
Sbjct: 272 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 331
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L+N + I+ + + A+V I+K + + VD+SF+
Sbjct: 332 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 389
Query: 168 MAGLSA 173
+GL A
Sbjct: 390 DSGLVA 395
>gi|323352825|gb|EGA85127.1| Trf5p [Saccharomyces cerevisiae VL3]
Length = 510
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 46/190 (24%), Positives = 86/190 (45%), Gaps = 12/190 (6%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL +E I++ + I P+ K R I+ ++R + + ++ FGS YLP
Sbjct: 141 WLTSE--IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGS 198
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID V +ED + + L+N+ E+ VP +K ++ Q + +D
Sbjct: 199 DIDCVVNSRNRDKEDRNYIYELARHLKNKGLAIRMEVIVKTRVPI-IKFIEPQSQ-LHID 256
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
+SF + GL A K+ R +D R ++LI + R+ H G + + + +
Sbjct: 257 VSFERTNGLEAA----KLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTV-I 311
Query: 223 MVLHRFLDYY 232
+++ FL+ +
Sbjct: 312 CLVYSFLNMH 321
>gi|395783917|ref|ZP_10463765.1| ribonuclease P protein component [Bartonella melophagi K-2C]
gi|395425185|gb|EJF91355.1| ribonuclease P protein component [Bartonella melophagi K-2C]
Length = 123
Score = 41.2 bits (95), Expect = 1.9, Method: Composition-based stats.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 460 HPYASQVRASTFSTDEGEKSRGTGTYIPERTRPPPIEINRPVRAR-----NPESLNHVPV 514
HP + RA + GEK RG + + R +EI P+ AR ++ N V
Sbjct: 5 HPRRIRKRADFLAVRTGEKRRGPLFLLEIKPRKQTVEIKHPLVARVGFTVTRKNGNAVKR 64
Query: 515 SKLRRKIDKAERLPSRDNLESG 536
++++R++ +A R+ +DN+E+G
Sbjct: 65 NRIKRRLREAVRVGIKDNIEAG 86
>gi|296821440|ref|XP_002850129.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
gi|238837683|gb|EEQ27345.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
Length = 620
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 14/139 (10%)
Query: 48 LAEERIQEILCTIQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDI 106
+ + I + ++P + R+++I+ +Q +YG +++ FGS YLP D+
Sbjct: 265 MLHKEILDFYHWVKPTEYEDIVRRDLISRLQSCFGQRFYGSKLYAFGSFASGLYLPTADM 324
Query: 107 DLTVLGHQNVEEDLARFVCKI---------LENEDQDSVFEIKDVQYVPAQVKIVKC--S 155
DL +L Q + + K+ L + I+ + + A+V I+K
Sbjct: 325 DLVLLSRQFITSNRKTICQKVRDIYSFAGYLRGLGIAAPGSIETIAH--ARVPIIKFVDG 382
Query: 156 VQNIPVDISFNQMAGLSAL 174
+ + VD+SF+ +GL+A+
Sbjct: 383 LTGLKVDLSFDNNSGLAAI 401
>gi|158296263|ref|XP_316693.3| AGAP006659-PA [Anopheles gambiae str. PEST]
gi|157016427|gb|EAA11489.4| AGAP006659-PA [Anopheles gambiae str. PEST]
Length = 703
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 68/142 (47%), Gaps = 12/142 (8%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTV--LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP 146
++ FGS+ L D D+D + + E+ + + + D+D+ F K V+ +
Sbjct: 117 IYEFGSIKSGLLLKDSDLDFYIHYAREKTEREEQIKLIHVVCSRMDRDTSFTGK-VKILG 175
Query: 147 AQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWCYY 203
A+V +++ N+ DI+F+ G C+ K I K D + +++K W
Sbjct: 176 AKVPLLRAVHVRTNLQCDINFSNARG----CYNSKFIHAIMKFDERIHQLTVMVKFWAQC 231
Query: 204 ESRILGAHYGLISTYALEMMVL 225
+ IL AH+ + ++Y L MMV+
Sbjct: 232 -AHILTAHHQM-NSYCLIMMVI 251
>gi|345491496|ref|XP_001605928.2| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 1
[Nasonia vitripennis]
gi|345491498|ref|XP_003426625.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 2
[Nasonia vitripennis]
Length = 683
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 52/90 (57%), Gaps = 6/90 (6%)
Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
+++++ + A+V I+K SV N+ VD+++N + G+ L R+ D + V++
Sbjct: 428 VRELELIQAKVPILKIHDSVYNLDVDLNYNNVVGIRNTHLLYCYSRI---DWRVRPLVLV 484
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLH 226
+K W ++ I A + +S+Y+L +MV+H
Sbjct: 485 VKMWAQCQN-INNARHMTMSSYSLVLMVIH 513
>gi|134080951|emb|CAK41466.1| unnamed protein product [Aspergillus niger]
Length = 614
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R++++ +Q YYG+++ PFGS YLP DIDL +L +
Sbjct: 204 VKPVRYEQIVREDLVARLQAAFQSRYYGVQLRPFGSFASGLYLPTADIDLVLLSTNFMRN 263
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L+N + I+ + + A+V I+K + + VD+SF+
Sbjct: 264 GIKTFGERKGQIYAFSAFLKNLNIAVPNSIETIAH--ARVPILKFVDKLTGLRVDLSFDN 321
Query: 168 MAGLSA 173
+GL A
Sbjct: 322 DSGLVA 327
>gi|324975487|gb|ADY62671.1| PAPa [Candida metapsilosis]
Length = 547
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 45/226 (19%), Positives = 87/226 (38%), Gaps = 55/226 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ KKR EV++ Q L+ + G ++F FGS L Y P DID
Sbjct: 42 ATKKRAEVLSLFQTLVQEFVYEVSKSKNMSDGMSRDAGGKIFTFGSYRLGVYGPSSDIDA 101
Query: 108 LTVLGHQNVEEDLARFVCKILE--------NEDQDSVFEIKDVQY--VPAQVKIVKCSVQ 157
L V+ ED +I+ N D+ I +++ + + + + +V
Sbjct: 102 LVVVPRHVTREDFFTTFDQIIRKRPELQEINGVSDAFVPIIKLEFDGISLDLIMARLNVP 161
Query: 158 NIPVDISFN-----------QMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESR 206
+P+D++ + + L+ +++ +L+ K +FK ++ IK W E
Sbjct: 162 RVPLDMTLDDKNLLKNLDERDLRSLNGTRVTDEILQLVPKPGVFKHALRCIKLWA-QERA 220
Query: 207 ILGAHYGLISTYALEMM---------------VLHRFLDYYNTFDW 237
+ G +G A M+ ++ +F Y ++W
Sbjct: 221 VYGNVFGFPGGVAWAMLTARICQLYPNAVSSVIVEKFFSIYTKWNW 266
>gi|125538688|gb|EAY85083.1| hypothetical protein OsI_06439 [Oryza sativa Indica Group]
Length = 683
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVIN--YIQRLINGYYGIEVFP 91
L F ++A L+ EE + +E+L I IV E +K I Y ++++ + +F
Sbjct: 44 LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 101
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
FGS L + P DID +G V+ + FV + + E+ ++Q VP A V
Sbjct: 102 FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVTLYGALSEME---EVTELQPVPDAHVP 158
Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
++K + +P+D+ + ++ S LC +++ RL+
Sbjct: 159 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 218
Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
+F++++ +K W + G + L+ ++ M++ RF
Sbjct: 219 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 278
Query: 232 YNTFDWDN 239
+ + W N
Sbjct: 279 FTQWQWPN 286
>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
Length = 531
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 45/194 (23%), Positives = 87/194 (44%), Gaps = 20/194 (10%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E I++ + I P+ + R + I ++ + + G ++ FGS YLP
Sbjct: 100 WLTLE--IRDFVAYISPSRQEIELRNKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGS 157
Query: 105 DIDLTVLGHQNVEEDLARF--VCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
DID + +E+ + + L+N + E+ VP +K V+ + Q I VD
Sbjct: 158 DIDCVINSKTGDKENRSSLYELAHFLKNRKLATQVEVIAKARVPI-IKFVEPTSQ-IHVD 215
Query: 163 ISFNQMAGLSAL----CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
+SF + GL A +L++ L R ++LI + R+ H G + +
Sbjct: 216 VSFERTNGLEAAKLIRSWLQQTPGL--------RELVLIVKQFLHARRLNNVHTGGLGGF 267
Query: 219 ALEMMVLHRFLDYY 232
++ + +++ FL+ +
Sbjct: 268 SI-ICLVYAFLNLH 280
>gi|146086153|ref|XP_001465471.1| RNA polymerase II [Leishmania infantum JPCM5]
gi|134069569|emb|CAM67892.1| RNA polymerase II [Leishmania infantum JPCM5]
Length = 382
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 72/191 (37%), Gaps = 20/191 (10%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI--------K 140
V P+GS+ T DGD D V E C I + + +I
Sbjct: 76 VLPYGSIVSGTSFTDGDADYIVSFPIVSESTRQSGACVIARERQEKCLSDIFSHIRKCNS 135
Query: 141 DVQYVP-----AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
DV+ P A+V IV+ ++ F+ L L + + D + V+
Sbjct: 136 DVELHPQRIFRARVPIVQYVRKSAQESTKFDLSLSLDGLKNSLLLRHYMAGDPRLRLGVL 195
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPE 255
K W E +IL A G IS YAL +M +H D T +S + +
Sbjct: 196 GAKQWG-REQQILNARRGWISPYALSIMYIHFMKDTGRT------ALSFDEEAVSQRVNA 248
Query: 256 IVAETVENDGD 266
IV+ E++GD
Sbjct: 249 IVSTAAESEGD 259
>gi|325183687|emb|CCA18146.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 737
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 77/163 (47%), Gaps = 19/163 (11%)
Query: 73 VINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE--DLARFVCK---- 126
V +Q L ++ + FGS +LP D+DL +L NV E DL V
Sbjct: 220 VTKSVQEL---WHDSTIETFGSYSTGLWLPTSDVDLVIL---NVLENCDLRTTVANLGKL 273
Query: 127 --ILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLI 184
+L +D + + + VP +K+V V +IP+DISF + S L + V +
Sbjct: 274 ADVLSQQDWVASIVVLETAKVPV-IKLVSKDV-SIPIDISFETSSTHSGLLARDLVSLYV 331
Query: 185 GK-DHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+ L+ ++I+ + E + A+ G +S+Y++ +M++H
Sbjct: 332 KELPELYPAAIIMKQ--LLRERDLNDAYTGGLSSYSIVLMLIH 372
>gi|350415058|ref|XP_003490519.1| PREDICTED: PAP-associated domain-containing protein 5-like [Bombus
impatiens]
Length = 572
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 70/283 (24%), Positives = 119/283 (42%), Gaps = 46/283 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDL 108
E I++ + P+ R V+ I+++I + + +V FGS YLP DIDL
Sbjct: 122 HEEIEDFFAYMCPSNEEHSLRMRVVKRIEQVIYDLWQDSKVEVFGSFRTGLYLPTSDIDL 181
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
V+G + +E L + +N + S ++ D VP VK+ + I VD
Sbjct: 182 VVIGMWTNLPLRTLERAL------LDQNIAEPSSIKVLDKASVPI-VKLTDKETE-IKVD 233
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
ISFN G+ + E ++ + + ++ V+++K + R L + G IS+Y+L
Sbjct: 234 ISFNMNNGVKS---AELINSFKKRFPVLEKLVMVLKQ--FLLQRDLNEVFTGGISSYSLI 288
Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPE 273
+M + FL + + YC S N V + E+ DG + E
Sbjct: 289 LMTI-SFLQLHPR--QNAYCSSANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEE 345
Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
+ + GH ++ L I DPL N++GRS
Sbjct: 346 VQRDMID------------GHRPSL--LCIEDPLTPGNDIGRS 374
>gi|169598326|ref|XP_001792586.1| hypothetical protein SNOG_01964 [Phaeosphaeria nodorum SN15]
gi|160704377|gb|EAT90176.2| hypothetical protein SNOG_01964 [Phaeosphaeria nodorum SN15]
Length = 575
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 65/141 (46%), Gaps = 5/141 (3%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPLKTYLPDGDI 106
L +R +E+L +Q + +R + I + G ++F FGS L + P DI
Sbjct: 44 LEGNKRREEVLAHVQKVVEEFVRRAGKLKGIAQSTLDVAGGKIFFFGSYALGVHGPTSDI 103
Query: 107 D--LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDIS 164
D + V H +E+ F E + + + E+ V+ A V I+K + + +D+
Sbjct: 104 DTLIVVPKHVFIEDFFKIFPSIFRELSNAEDISELVPVE--DAYVPIIKMEYRGVSLDLL 161
Query: 165 FNQMAGLSAL-CFLEKVDRLI 184
F M +S++ LE VD+ +
Sbjct: 162 FASMPTMSSVPKELETVDKSV 182
>gi|320583303|gb|EFW97518.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Ogataea
parapolymorpha DL-1]
Length = 538
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 59/142 (41%), Gaps = 5/142 (3%)
Query: 35 NLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFG 93
N + I+ WL E I++ + I P+ K R + ++ I G + EV FG
Sbjct: 100 NDHSRQIEISDWLTME--IKDFIKYISPSAEEIKARNNTVGKLRDCITGMWPDAEVHCFG 157
Query: 94 SVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVK 153
S YLP DID+ V+ + R L + ++ + A+V I+K
Sbjct: 158 SFATDLYLPGSDIDMVVVSKARNNKYDNRSSLYQLSSYIRNHRLGVNVEAIAKAKVPIIK 217
Query: 154 C--SVQNIPVDISFNQMAGLSA 173
I +DISF + G+ A
Sbjct: 218 FVDPATKIHIDISFERTNGIKA 239
>gi|198426610|ref|XP_002126682.1| PREDICTED: similar to PAP associated domain containing 4 [Ciona
intestinalis]
Length = 713
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 75/168 (44%), Gaps = 22/168 (13%)
Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQ--DSVFEIKDVQYVPAQVKIVKCS--VQN 158
+ D DL ++ + + V K+L Q ++ +K++Q + A V I+K +
Sbjct: 440 NSDADLCLVFDPRNKTVNRKTVLKMLNRMKQLLNNAHFVKNLQLIYATVPILKFEDRISG 499
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
+ D++ N + G+ L R D + V+ IK W + + I A G +S+Y
Sbjct: 500 MECDLNVNNLTGIRNSFLLLAYARC---DPRVRPMVLCIKEWAHVNN-INSAQLGTLSSY 555
Query: 219 ALEMMVLH-------RFLDYYNTFDWDNYCISINGPV-----AISSLP 254
AL +MVLH R + + DN+ S N P+ ++SLP
Sbjct: 556 ALVLMVLHYLQIVKPRVIPSFQALHKDNF--SSNLPIHCLGEKVASLP 601
>gi|242054959|ref|XP_002456625.1| hypothetical protein SORBIDRAFT_03g039650 [Sorghum bicolor]
gi|241928600|gb|EES01745.1| hypothetical protein SORBIDRAFT_03g039650 [Sorghum bicolor]
Length = 257
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 86/194 (44%), Gaps = 31/194 (15%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPDGDI 106
I++IL TI P V + KRK ++ IQ L + Y G V PFGS Y GD+
Sbjct: 15 IEDILSTINP--VEDDKRKR-LSAIQELADSIYSVGPLRGAAVKPFGSFLSNLYAKSGDL 71
Query: 107 DLTVLGH--------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ 157
D++V + +++ R + K L+ E ++P A+V I+K
Sbjct: 72 DVSVDLRNGSRLPISKKKKQNALRELMKALQMRGVARCME-----FIPTARVPILKYMSN 126
Query: 158 N--IPVDISFNQMAGLSALCFLEKVDRLIGK-DHLFKRSVILIKAWCYYESRILGAHYGL 214
+ I D+S N G ++ IG D F V+L+K W + I G
Sbjct: 127 HFGISCDVSVNNYPGQ----IKSRILYWIGTIDERFGDMVLLVKEWAKARN-INDPKNGT 181
Query: 215 ISTYALEMMVLHRF 228
+++Y+L ++V+ F
Sbjct: 182 LNSYSLCLLVIFHF 195
>gi|49388677|dbj|BAD25861.1| putative poly(A) polymerase [Oryza sativa Japonica Group]
gi|125581374|gb|EAZ22305.1| hypothetical protein OsJ_05959 [Oryza sativa Japonica Group]
Length = 610
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVI--NYIQRLINGYYGIEVFP 91
L F ++A L+ EE + +E+L I IV E +K I Y ++++ + +F
Sbjct: 45 LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 102
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
FGS L + P DID +G V+ + FV + + E+ ++Q VP A V
Sbjct: 103 FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVMLYGALSEME---EVTELQPVPDAHVP 159
Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
++K + +P+D+ + ++ S LC +++ RL+
Sbjct: 160 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 219
Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
+F++++ +K W + G + L+ ++ M++ RF
Sbjct: 220 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 279
Query: 232 YNTFDWDN 239
+ + W N
Sbjct: 280 FTQWQWPN 287
>gi|255081724|ref|XP_002508084.1| predicted protein [Micromonas sp. RCC299]
gi|226523360|gb|ACO69342.1| predicted protein [Micromonas sp. RCC299]
Length = 1070
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 76/355 (21%), Positives = 129/355 (36%), Gaps = 74/355 (20%)
Query: 3 EMSAFLEMEDLQEELLTSPSHTSSSSSYSSPPNLYPFSIDAELWLLAEERIQEILCTIQP 62
E + FL+ D + + L+ SH + EL+ ++ T P
Sbjct: 602 EDAEFLDASDYERDSLSGDSHE-----------------NCELYGTLHSQVSSFASTAGP 644
Query: 63 AIVSEKKRKEVINYIQRLINGYY---GIEVFPFGSVPLKTYLPDGDIDLTVLG------- 112
+ R V + I Y +EVF G+ L L DIDL VLG
Sbjct: 645 DDSDLQDRLVVQKKVSEAITALYPEARLEVFGSGATGL--ALKCADIDLVVLGVGPEPSA 702
Query: 113 -----HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQ 167
+N EDL R + KI ++ ++ + +V A+V I+K + VD++
Sbjct: 703 GGGGFSRNDREDLVRMLRKIEKSLRREKIVWKANV-ISTAKVPIIKMNAGRYAVDLTVGA 761
Query: 168 MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHR 227
GLSA+ +++ DR+ D + R +IL+ + A G Y L +V+
Sbjct: 762 SNGLSAVQWIK--DRV--NDFVAMRPLILVLKKLLKTHHLDDASTGGCGGYLLVSLVVSH 817
Query: 228 ---------------------FLDYY-NTFDWDNYCIS-------------INGPVAISS 252
FL + N FD+ ++ + GP A
Sbjct: 818 LRQSGEPGRSANPNLGALLLGFLRRFGNEFDYSRMAVAAGRPSGVIRAKDLVVGPGAFGM 877
Query: 253 LPEIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGHEFAIKHLNIVDPL 307
P I+AE + G + + K+ R ++ + + G ++ L PL
Sbjct: 878 RPMILAEDPQETGRNITGAAYRFKEVRALFRSALEYMSAVGELAFLQELASPAPL 932
>gi|327349744|gb|EGE78601.1| topoisomerase TRF4 [Ajellomyces dermatitidis ATCC 18188]
Length = 681
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R ++I +QR +YG ++ FGS YLP D+DL +L Q + +
Sbjct: 263 VKPKPFENSIRNDLIARLQRHFERRHYGSQLRAFGSFASGLYLPTADMDLVLLSRQFIRQ 322
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
D + +C+ E +KD+ VP A+V I+K + + VD+SF+
Sbjct: 323 D-RKVLCQ-RPREIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380
Query: 168 MAGLSA 173
G++A
Sbjct: 381 STGIAA 386
>gi|146184040|ref|XP_001027646.2| Chitinase class I family protein [Tetrahymena thermophila]
gi|146143378|gb|EAS07404.2| Chitinase class I family protein [Tetrahymena thermophila SB210]
Length = 463
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 45/169 (26%), Positives = 77/169 (45%), Gaps = 13/169 (7%)
Query: 80 LINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI 139
L++ G EV FGS + YLP+ DID+ ++ + L + V + N D ++E
Sbjct: 192 LLDAVPGCEVKTFGSFSTELYLPNSDIDMVIVKDDIQNKSLYKKVADKIMN--CDDIYE- 248
Query: 140 KDVQYVP-AQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
++ V A+V I+K + I DISFN+ G+ L ++K G + + ++
Sbjct: 249 -NINLVTNAKVPIIKFVEKETQINFDISFNKEDGVKQLSEVKK-----GLELYPEMKYLI 302
Query: 197 IKAWCYYESRILGAHY-GLISTYALEMMVLHRFLDYYNTFDWDNYCISI 244
+ C R L Y G I ++ L M+L D ++ +N I
Sbjct: 303 MVMKCILRQRDLHETYSGGIGSFLLFCMILAFLRDLRRQYEKENRVQEI 351
>gi|156096154|ref|XP_001614111.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802985|gb|EDL44384.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 808
Score = 40.4 bits (93), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 75/159 (47%), Gaps = 17/159 (10%)
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVK 150
PFGS +P D+D+ LG EDL + D V +I+ ++ A+VK
Sbjct: 426 PFGSFVTGLSIPGSDLDVCFLGIP--LEDLDALLIISYALVKLDIVADIRLIK--DARVK 481
Query: 151 IVKC--SVQNIPVDISFNQMAGLSALCFLE-KVDRLIGKDHLFKRSVILIKAWCYYESRI 207
I+K + VD+ NQ++ F++ K+ + I + VIL+K + +R
Sbjct: 482 ILKYIDRETGVQVDVCTNQLSSRQTTDFIKSKLQKFI----YMRPLVILLK--FFLNTRN 535
Query: 208 LGAHY-GLISTYALEMMVLHRFLDYYNTFDWD---NYCI 242
L Y G I ++ L MVLH + +TFDW+ N C+
Sbjct: 536 LNETYIGGIGSFLLCCMVLHFLQLHPSTFDWNVFSNSCL 574
>gi|255723652|ref|XP_002546758.1| Poly(A) polymerase PAPa [Candida tropicalis MYA-3404]
gi|240130534|gb|EER30100.1| Poly(A) polymerase PAPa [Candida tropicalis MYA-3404]
Length = 557
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 49/231 (21%), Positives = 85/231 (36%), Gaps = 65/231 (28%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ KKR EV+N +Q++ + G ++F FGS L Y P DID
Sbjct: 43 ATKKRVEVLNILQKMTEDFVFKVSIKKNMSEGMAKDAGGKIFTFGSYRLGVYGPGSDIDT 102
Query: 108 LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFN 166
L V+ D ++L+ D E++++ VP A V I+K I +D+ F
Sbjct: 103 LIVVPKHVTRNDFFETFYELLKTRD-----ELEEIAPVPDAFVPIIKIEFAGISIDLIFA 157
Query: 167 Q-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
+ + L+ +++ L+ K +FK ++ IK W
Sbjct: 158 RLDIPRVSRDLTLDDKNLLKNIDEKDLRALNGTRVTDEILTLVPKQTVFKHALRCIKMWA 217
Query: 202 YYESRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
+ I +G A M+V + +F Y+ + W
Sbjct: 218 QQRA-IYANVFGFPGGVAWAMLVARICQLYPNAVSAVIIEKFFHIYSQWAW 267
>gi|239615134|gb|EEQ92121.1| topoisomerase TRF4 [Ajellomyces dermatitidis ER-3]
Length = 681
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R ++I +QR +YG ++ FGS YLP D+DL +L Q + +
Sbjct: 263 VKPKPFENSIRNDLIARLQRHFERRHYGSQLRAFGSFASGLYLPTADMDLVLLSRQFIRQ 322
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
D + +C+ E +KD+ VP A+V I+K + + VD+SF+
Sbjct: 323 D-RKVLCQ-RPREIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380
Query: 168 MAGLSA 173
G++A
Sbjct: 381 STGIAA 386
>gi|168031583|ref|XP_001768300.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680478|gb|EDQ66914.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 787
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 91/205 (44%), Gaps = 21/205 (10%)
Query: 32 SPPNLYPFSIDAELWLLA-----EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY- 85
+P P++ + W+ + + I + + P ++ R+ + + ++ +
Sbjct: 181 APKEAPPWARGRDSWIQSPLLQLHQEIVDFCEFVAPTEEEQQMRETAVERVSGVVQSIWP 240
Query: 86 GIEVFPFGSVPLKTYLPDGDIDLTVLGHQ-NVEEDLARFVCKILENEDQDSVFEIKDVQY 144
+V FGS YLP D+D+ VL +D + + K L K++Q
Sbjct: 241 HSQVKVFGSFATGLYLPTSDVDVVVLDSGCTALQDGLKALAKALTRGHVG-----KNIQV 295
Query: 145 V-PAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
+ A+V I+K +V NIP DISF+ G A F++ +G + +++K
Sbjct: 296 IGKARVPIIKFVETVSNIPFDISFDVANGPEAADFIKAA---MGAIPPLRPLCLVLK--I 350
Query: 202 YYESRILGAHY-GLISTYALEMMVL 225
+ + R L Y G I +YAL +M+L
Sbjct: 351 FLQQRELNEVYQGGIGSYALLVMLL 375
>gi|240276761|gb|EER40272.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus H143]
Length = 680
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R +++ +QR +YG ++ FGS YLP D+DL + Q V +
Sbjct: 263 VKPKSFEHSIRNDLVERLQRHFERRHYGSQLCAFGSFASGLYLPTADMDLVLCSRQFVRQ 322
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
D + +C+ E +KD+ VP A+V I+K + + VD+SF+
Sbjct: 323 D-RKVLCQ-RPREIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380
Query: 168 MAGLSA 173
G++A
Sbjct: 381 STGIAA 386
>gi|325095205|gb|EGC48515.1| topoisomerase TRF4 [Ajellomyces capsulatus H88]
Length = 443
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R +++ +QR +YG ++ FGS YLP D+DL + Q V +
Sbjct: 26 VKPKSFEHSIRNDLVERLQRHFERRHYGSQLCAFGSFASGLYLPTADMDLVLCSRQFVRQ 85
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
D + +C+ E +KD+ VP A+V I+K + + VD+SF+
Sbjct: 86 D-RKVLCQ-RPREIHSFAAYLKDLDIPVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 143
Query: 168 MAGLSA 173
G++A
Sbjct: 144 STGIAA 149
>gi|58220773|gb|AAW68018.1| putative polynucleotide adenylyltransferase [Oryza sativa Indica
Group]
Length = 652
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVIN--YIQRLINGYYGIEVFP 91
L F ++A L+ EE + +E+L I IV E +K I Y ++++ + +F
Sbjct: 13 LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 70
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
FGS L + P DID +G V+ + FV + + E+ ++Q VP A V
Sbjct: 71 FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVTLYGALSEME---EVTELQPVPDAHVP 127
Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
++K + +P+D+ + ++ S LC +++ RL+
Sbjct: 128 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 187
Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
+F++++ +K W + G + L+ ++ M++ RF
Sbjct: 188 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 247
Query: 232 YNTFDWDN 239
+ + W N
Sbjct: 248 FTQWQWPN 255
>gi|19075773|ref|NP_588273.1| poly(A) polymerase Cid12 [Schizosaccharomyces pombe 972h-]
gi|74582471|sp|O74518.1|CID12_SCHPO RecName: Full=Poly(A) RNA polymerase cid12; Short=PAP; AltName:
Full=Caffeine-induced death protein 12; AltName:
Full=Polynucleotide adenylyltransferase cid12
gi|3426138|emb|CAA20372.1| poly(A) polymerase Cid12 [Schizosaccharomyces pombe]
Length = 336
Score = 40.4 bits (93), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 83/177 (46%), Gaps = 14/177 (7%)
Query: 52 RIQEILCTIQPAIVSEKKRKEVINYIQRLINGY-YGIEVFPFGSVPLKTYLPDGDIDLTV 110
R+ L + P I K RK ++ +Q I E+ +GS+ + T L D+D+++
Sbjct: 23 RLYSFLEFVSPKIEELKYRKLLLEKLQTHIREVVLDAELQVYGSMYIGTTLSISDVDVSL 82
Query: 111 LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQV-KIVKCSVQNIPVDISFNQMA 169
+ E + R + + D D+ F + A+V +I V I VD++F
Sbjct: 83 KSPRVGELEKRRVTMVLRKYLDADADF------HSSARVPRINLVDVSGIGVDLTF---- 132
Query: 170 GLSALCFLEKVDRLIGKDH-LFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVL 225
G C ++ + ++H +F R ++L+K W + E + H+G I++ AL M++
Sbjct: 133 GNDKACRTAELQKAYNEEHPIFGRLLMLLKHWLF-ERDLENVHHGGIASCALSYMLI 188
>gi|225556159|gb|EEH04448.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus G186AR]
Length = 680
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 59/126 (46%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P R +++ +QR +YG ++ FGS YLP D+DL + Q V +
Sbjct: 263 VKPKSFEHSIRNDLVERLQRHFERRHYGSQLCAFGSFASGLYLPTADMDLVLCSRQFVRQ 322
Query: 119 DLARFVCKILENEDQDSVFEIKDVQY-VP--------AQVKIVKC--SVQNIPVDISFNQ 167
D + +C+ E +KD+ VP A+V I+K + + VD+SF+
Sbjct: 323 D-RKVLCQ-RPREIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRLTGLKVDLSFDN 380
Query: 168 MAGLSA 173
G++A
Sbjct: 381 STGIAA 386
>gi|398015076|ref|XP_003860728.1| hypothetical protein, conserved [Leishmania donovani]
gi|322498950|emb|CBZ34023.1| hypothetical protein, conserved [Leishmania donovani]
Length = 382
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 71/191 (37%), Gaps = 20/191 (10%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI--------K 140
V P+GS+ T DGD D V E C I + + +I
Sbjct: 76 VLPYGSIVSGTSFTDGDADYIVSFPIVSESTRQSGACVIARERQEKCLSDIFSHIRKCNS 135
Query: 141 DVQYVP-----AQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVI 195
DV+ P A+V IV+ ++ F+ L L + + D + V+
Sbjct: 136 DVELHPQRIFRARVPIVQYVRKSAQESTKFDLSLSLDGLKNSLLLRHYMAGDPRLRLGVL 195
Query: 196 LIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISSLPE 255
K W E +IL A G IS YAL +M +H D T +S + +
Sbjct: 196 GAKQWG-REQQILNARRGWISPYALSIMYIHFMKDTGRT------ALSFDEEAVSQRVNA 248
Query: 256 IVAETVENDGD 266
IV+ E +GD
Sbjct: 249 IVSTAAEAEGD 259
>gi|401421907|ref|XP_003875442.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491679|emb|CBZ26952.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 382
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 47/194 (24%), Positives = 76/194 (39%), Gaps = 26/194 (13%)
Query: 89 VFPFGSVPLKTYLPDGDIDLT----------------VLGHQNVEEDLARFVCKILENED 132
V P+GS+ T DGD D V+G + E+ L+ I +
Sbjct: 76 VLPYGSIVSGTSFTDGDADYIVSFPTLSENTRQSGACVIGRERQEKFLSDIFSHIRKC-- 133
Query: 133 QDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKR 192
+S E+ + A+V IV+ ++ F+ L L + + + D +
Sbjct: 134 -NSDVELHPQRIFRARVPIVQYVRKSPKESTKFDLSLSLDGLKNSLLLRQYMAGDPRLRL 192
Query: 193 SVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCISINGPVAISS 252
V+ K W E +IL A G IS YAL +M +H D T +S +
Sbjct: 193 GVLGAKQWG-REQQILNARRGWISPYALSIMYIHFMKDTGRT------GLSFDEEAVSQR 245
Query: 253 LPEIVAETVENDGD 266
+ IV+ E++GD
Sbjct: 246 VNAIVSTAAESEGD 259
>gi|242061050|ref|XP_002451814.1| hypothetical protein SORBIDRAFT_04g008100 [Sorghum bicolor]
gi|241931645|gb|EES04790.1| hypothetical protein SORBIDRAFT_04g008100 [Sorghum bicolor]
Length = 672
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 103/250 (41%), Gaps = 53/250 (21%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSE--KKRKEVINYIQRLINGYYGIEVFP 91
L F ++A L+ EE + +++LC I IV E K+ Y +L+ + +F
Sbjct: 42 LEKFLVEAGLYEGKEESAKREDVLCEIG-QIVKEWVKQLTSKKGYADQLVEQANAV-LFT 99
Query: 92 FGSVPLKTYLPDGDIDLTVLG--HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQ 148
FGS L + P+ DID +G + N EED + IL ++ E+ ++Q VP A
Sbjct: 100 FGSYRLGVHGPEADIDTLCVGPSYVNREEDFFVTLHGILAEKE-----EVTELQPVPDAH 154
Query: 149 VKIVKCSVQNIPVDISFNQM-----------AGLSALC--------------FLEKVDRL 183
V ++K + I +D+ + + + S LC +++ RL
Sbjct: 155 VPVLKFKFRGISIDLLYASLSLSVIPADFDISQGSVLCDVDEATVRSLNGCRVADQILRL 214
Query: 184 IGKDHLFKRSVILIKAWCYYESR-------ILGAHYGLISTYALEM-------MVLHRFL 229
+ F+ ++ +K W + G ++ L+ ++ M++ RF
Sbjct: 215 VPNVENFRTTLRCLKYWAKRRGVYSNITGFLGGVNWALLVARVCQLYPNAVPSMLVSRFF 274
Query: 230 DYYNTFDWDN 239
+ + W N
Sbjct: 275 RVFTQWQWPN 284
>gi|317139015|ref|XP_001817214.2| topoisomerase family protein TRF4 [Aspergillus oryzae RIB40]
Length = 659
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +Q YYG+++ FGS YLP DIDL +L +
Sbjct: 247 VKPVEYEQIVRADLIERLQTAFQSRYYGVQLRAFGSFASGLYLPTADIDLVLLSSNFMRH 306
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L N D I+ + + A+V I+K + + VD+SF+
Sbjct: 307 GIKTFGERKGQIYAFSAFLRNLDIAVPGSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 364
Query: 168 MAGLSA 173
+GL A
Sbjct: 365 DSGLVA 370
>gi|303389764|ref|XP_003073114.1| DNA polymerase sigma [Encephalitozoon intestinalis ATCC 50506]
gi|303302258|gb|ADM11754.1| DNA polymerase sigma [Encephalitozoon intestinalis ATCC 50506]
Length = 378
Score = 40.0 bits (92), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 15/135 (11%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIEVFPFGSVPLKTYLPDGDIDLTV- 110
+ ++ + P RK + I+RLI EV PFGS +P DID+ +
Sbjct: 70 LHQLYQKLAPTTTEINSRKYIFEKIKRLIVREIPNAEVEPFGSYTTGLIIPSSDIDINIQ 129
Query: 111 LGHQNVEEDLARFVCKI---LENED---QDSVFEIKDVQYVPAQVKIVKCS--VQNIPVD 162
LG+ + +E R++ KI + D ++++F I+ ++ I+K S V +D
Sbjct: 130 LGNNHDKEYSNRYLSKIKSLMLKADFIRKETLFHIRK-----CRIPILKFSDKVFGFKID 184
Query: 163 ISFNQMAGLSALCFL 177
IS NQ G+ A F+
Sbjct: 185 ISVNQTNGIEAAKFV 199
>gi|238482039|ref|XP_002372258.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
NRRL3357]
gi|220700308|gb|EED56646.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
NRRL3357]
Length = 608
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 56/126 (44%), Gaps = 14/126 (11%)
Query: 60 IQPAIVSEKKRKEVINYIQRLING-YYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
++P + R ++I +Q YYG+++ FGS YLP DIDL +L +
Sbjct: 196 VKPVEYEQIVRADLIERLQTAFQSRYYGVQLRAFGSFASGLYLPTADIDLVLLSSNFMRH 255
Query: 119 DLARF---------VCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQ 167
+ F L N D I+ + + A+V I+K + + VD+SF+
Sbjct: 256 GIKTFGERKGQIYAFSAFLRNLDIAVPGSIETIAH--ARVPILKFVDKMTGLRVDLSFDN 313
Query: 168 MAGLSA 173
+GL A
Sbjct: 314 DSGLVA 319
>gi|297598868|ref|NP_001046352.2| Os02g0227400 [Oryza sativa Japonica Group]
gi|255670737|dbj|BAF08266.2| Os02g0227400, partial [Oryza sativa Japonica Group]
Length = 602
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 52/248 (20%), Positives = 102/248 (41%), Gaps = 49/248 (19%)
Query: 36 LYPFSIDAELWLLAEE--RIQEILCTIQPAIVSEKKRKEVIN--YIQRLINGYYGIEVFP 91
L F ++A L+ EE + +E+L I IV E +K I Y ++++ + +F
Sbjct: 37 LEKFLVEAGLYEGEEESAKREEVLREID-QIVKEWVKKVTIQKGYSEQMVKEANAV-LFT 94
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
FGS L + P DID +G V+ + FV + + E+ ++Q VP A V
Sbjct: 95 FGSYRLGVHGPGADIDALCIGPSYVKREEEFFVMLYGALSEME---EVTELQPVPDAHVP 151
Query: 151 IVKCSVQNIPVDISFNQ-----------MAGLSALC--------------FLEKVDRLIG 185
++K + +P+D+ + ++ S LC +++ RL+
Sbjct: 152 VMKFKFRGLPIDLLYASVSLPVIPPDFDISQGSVLCDVDEATVRSLNGCRVADQILRLVP 211
Query: 186 KDHLFKRSVILIKAWCYYE-------SRILGAHYGLISTYALEM-------MVLHRFLDY 231
+F++++ +K W + G + L+ ++ M++ RF
Sbjct: 212 NAEIFRKTLRCLKYWAQRRGVYSNVTGLLGGVSWALLVARVCQLYPNAVPSMLVSRFFRV 271
Query: 232 YNTFDWDN 239
+ + W N
Sbjct: 272 FTQWQWPN 279
>gi|260818300|ref|XP_002604321.1| hypothetical protein BRAFLDRAFT_88611 [Branchiostoma floridae]
gi|229289647|gb|EEN60332.1| hypothetical protein BRAFLDRAFT_88611 [Branchiostoma floridae]
Length = 482
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 66/157 (42%), Gaps = 27/157 (17%)
Query: 411 VTAQF-SPPSS------PSQIPSRSAWDVLCQYVQGKGNLVYQWGTEVFVPRLPFCHPYA 463
VT QF SPPS +P S+W VL Q ++ + V Q F+P FCHP
Sbjct: 30 VTNQFVSPPSHWLYGHLMQLLPDESSWPVLKQMMRDYSSHVLQLWVGPFLPLFIFCHPET 89
Query: 464 SQVRASTFSTDEGEKSRGTGTYIPERTRPPP--IEINRPVRARNPESLNHVPVSKLRRKI 521
S+ ST S D S G+ + R P EI +P +S N + I
Sbjct: 90 SRAILST-SGDGLLLSHGSKWHRNRRLLTPAFHFEILKPYVTLFSQSTNVL--------I 140
Query: 522 DKAERLPSRDNLESGCSLNVENGCSLNMENS---CSL 555
DK + L SGCS+ + SL +S CSL
Sbjct: 141 DKWTK------LGSGCSVEMFEHVSLMTLDSILKCSL 171
>gi|221058739|ref|XP_002260015.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193810088|emb|CAQ41282.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 821
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 75/153 (49%), Gaps = 16/153 (10%)
Query: 91 PFGSVPLKTYLPDGDIDLTVLGHQNVE-EDLARFVCKILENEDQDSVFEIKDVQYVPAQV 149
PFGS +P D+D+ L N+ EDL + D V +I+ ++ A+V
Sbjct: 435 PFGSFVTGLSIPGSDLDVCFL---NIPLEDLDALLLIAYALVKLDMVTDIRLIK--DARV 489
Query: 150 KIVKCSVQ--NIPVDISFNQMAGLSALCFLE-KVDRLIGKDHLFKRSVILIKAWCYYESR 206
KI+K + + + VD+ NQ++ F++ K+++ I + VIL+K + +R
Sbjct: 490 KILKYTDKETGVQVDVCTNQLSSRQTTDFIKSKMEKYI----YLRPLVILLK--FFLNTR 543
Query: 207 ILGAHY-GLISTYALEMMVLHRFLDYYNTFDWD 238
L Y G I ++ L MVLH + TFDW+
Sbjct: 544 NLNETYIGGIGSFMLCCMVLHFLQLHPTTFDWN 576
>gi|215707095|dbj|BAG93555.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 88/198 (44%), Gaps = 33/198 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY------GIEVFPFGSVPLKTYLPD 103
E+ ++IL I+P E R + I IQ L + Y G V PFGS + Y
Sbjct: 13 EKCTEDILSLIKPV---EGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYAKS 69
Query: 104 GDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEI----------KDVQYVP-AQVKIV 152
GD+D++V E I + + QD++ E+ + ++++P A+V ++
Sbjct: 70 GDLDVSV-------ELFNALNLPISKRKKQDTLREVRRALQKRGIARHMEFIPNARVPVL 122
Query: 153 K--CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
+ + I DIS + G ++ L D F V+L+K W ++ I
Sbjct: 123 QYVSNQYGISCDISISNYPGRIKSKIFYWINTL---DDRFGDMVLLVKEWAKAQN-INDP 178
Query: 211 HYGLISTYALEMMVLHRF 228
G +++Y+L ++VL F
Sbjct: 179 KNGTLNSYSLCLLVLFHF 196
>gi|147900520|ref|NP_001087078.1| PAP associated domain containing 4 a [Xenopus laevis]
gi|51234260|gb|AAT98005.1| cytoplasmic poly(A) polymerase [Xenopus laevis]
Length = 509
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 6/126 (4%)
Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
D D+ L + + AR + +L + I+ Q++ A+V IVK V
Sbjct: 240 DADLCLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERPQFIRAKVPIVKFRDKVSGAE 299
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
D++ N + G+ L L D + V++IK W + I A G +S+Y +
Sbjct: 300 FDLNVNNVVGIRNTFLLRTYAYL---DKRVRPLVLVIKKWANHHG-INDASRGTLSSYTI 355
Query: 221 EMMVLH 226
+MVLH
Sbjct: 356 VLMVLH 361
>gi|148230683|ref|NP_001086580.1| poly(A) RNA polymerase GLD2-B [Xenopus laevis]
gi|82182837|sp|Q6DFA8.1|GLD2B_XENLA RecName: Full=Poly(A) RNA polymerase GLD2-B; Short=xGLD-2; AltName:
Full=PAP-associated domain-containing protein 4-B
gi|49903424|gb|AAH76832.1| MGC83852 protein [Xenopus laevis]
Length = 509
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 55/126 (43%), Gaps = 6/126 (4%)
Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
D D+ L + + AR + +L + I+ Q++ A+V IVK V
Sbjct: 240 DADLCLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERPQFIRAKVPIVKFRDKVSGAE 299
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
D++ N + G+ L L D + V++IK W + I A G +S+Y +
Sbjct: 300 FDLNVNNVVGIRNTFLLRTYAYL---DKRVRPLVLVIKKWANHHG-INDASRGTLSSYTI 355
Query: 221 EMMVLH 226
+MVLH
Sbjct: 356 VLMVLH 361
>gi|82541397|ref|XP_724941.1| caffeine-induced death protein 1 [Plasmodium yoelii yoelii 17XNL]
gi|23479769|gb|EAA16506.1| caffeine-induced death protein 1 [Plasmodium yoelii yoelii]
Length = 534
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 48/193 (24%), Positives = 88/193 (45%), Gaps = 23/193 (11%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E + ++ ++P+ K K +N++Q IN + V PFGSV ++ + DID+
Sbjct: 190 EELNKLEIALRPSQNDIKNIKMFLNFLQNEINKQFKNCHVTPFGSVINGFWMKNSDIDIC 249
Query: 110 ----VLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIV-------KCSVQN 158
+L ++ + + + +C IL N + + E Q A+V I+ K S Q
Sbjct: 250 IQIPILLNRKDQINFLKKICLILNNY-HNGIIE----QRFSAKVPIIHFYCDDHKNSFQ- 303
Query: 159 IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTY 218
+ DIS N + + ++K + D + I +K W + I G +S++
Sbjct: 304 LSCDISVNNILAVINSKLIQK---YVSIDKRLQLMGIALKYWS-KKRNINDRSKGFLSSF 359
Query: 219 ALEMMVLHRFLDY 231
+L +M +H FL Y
Sbjct: 360 SLILMAIH-FLQY 371
>gi|301615931|ref|XP_002937419.1| PREDICTED: PAP-associated domain-containing protein 5-like [Xenopus
(Silurana) tropicalis]
Length = 572
Score = 40.0 bits (92), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 158 EEIMDFYKYMSPRPEEEKMRMEVVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLV 217
Query: 110 VLG 112
V G
Sbjct: 218 VFG 220
>gi|134026254|gb|AAI36216.1| papd4 protein [Xenopus (Silurana) tropicalis]
Length = 523
Score = 40.0 bits (92), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 77/186 (41%), Gaps = 19/186 (10%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL--------KTYLP 102
++I E+ +Q + KK+ +QR I ++FP + L T
Sbjct: 199 KQILELFQALQQQVCDLKKKDICRAELQREIQ-----QIFPQSRLYLVGSSLNGFGTRSS 253
Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
D D+ L + + AR + +L + I+ Q++ A+V IVK V
Sbjct: 254 DADLCLVLKDEPMNQHTEARHILSLLHKHFYTRLSYIERPQFIKAKVPIVKFRDKVSGAE 313
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
D++ N + G+ L + ++ + V++IK W Y + A G +S+Y L
Sbjct: 314 FDLNVNNVVGIRNTFLLRTYAYI---ENRVRPLVLVIKMWANYHG-LNDASRGTLSSYTL 369
Query: 221 EMMVLH 226
+M LH
Sbjct: 370 VLMALH 375
>gi|332030078|gb|EGI69903.1| PAP-associated domain-containing protein 5 [Acromyrmex echinatior]
Length = 662
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 124/297 (41%), Gaps = 33/297 (11%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I++ + P+ R V+ I+ +I + +V FGS YLP DIDL
Sbjct: 208 HEEIEDFFSYMCPSHEEHVLRLRVVKRIENVIYDLWPDSKVEVFGSFRTGLYLPTSDIDL 267
Query: 109 TVLGHQNVEEDLARFVCKILENEDQD-SVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDIS 164
V+G + LE D ++ E ++ + A V IVK + + I VDIS
Sbjct: 268 VVIGM------WTNLPLRTLERALLDRNIAEPSSIKVLDKASVPIVKLTDKESEIKVDIS 321
Query: 165 FNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMV 224
FN G+ + + R + + ++ V+++K + + + G IS+Y+L +M
Sbjct: 322 FNMNNGVKSAELINSYKR---QYPVLEKLVMVLKQFL-LQRDLNEVFTGGISSYSLILMT 377
Query: 225 LHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--NDGDELLLSPEFLKKCREIY 282
+ FL + D +C + N V + E+ G + +++ K EI
Sbjct: 378 I-SFLQLHPRKDI--HCPNTNLGVLLIEFLELYGRKFNYVKTGIRIKDGGQYISK-EEIQ 433
Query: 283 SVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS-----VSKGNFHRIRCALSYGAQ 334
I GH ++ L I DPL N++GRS K F+ C LS A
Sbjct: 434 RDMI-----DGHRPSL--LCIEDPLTPGNDIGRSSYGALYVKSAFNWAYCILSQAAN 483
>gi|410075647|ref|XP_003955406.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
gi|372461988|emb|CCF56271.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
Length = 630
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 86/192 (44%), Gaps = 22/192 (11%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYG-IEVFPFGSVPLKTYLPDG 104
WL E I++ + I P+ K R + ++ + R I + E+ FGS YLP
Sbjct: 168 WLTLE--IKDFVAYISPSREEIKLRNKAVSKLGRAIKELWSDSELLVFGSYATDLYLPGS 225
Query: 105 DIDLTVLG------HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN 158
DID V H++ +LARF+ K ++ + E+ VP +K ++
Sbjct: 226 DIDCVVNSASGNKEHRSYLYELARFLKK----KNLATSIEVIARARVPI-IKFIEPE-SG 279
Query: 159 IPVDISFNQMAGLSALCFL-EKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
+ +DISF + G+ A + E +D G R ++LI R+ H G +
Sbjct: 280 VHIDISFERTNGVEAAKLIREWLDMTPG-----LRELVLIVKQFLTARRLNDVHTGGLGG 334
Query: 218 YALEMMVLHRFL 229
+++ + +++ FL
Sbjct: 335 FSI-ICLVYSFL 345
>gi|146180895|ref|XP_001021673.2| Poly polymerase central domain containing protein [Tetrahymena
thermophila]
gi|146144391|gb|EAS01427.2| Poly polymerase central domain containing protein [Tetrahymena
thermophila SB210]
Length = 1765
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 102/264 (38%), Gaps = 58/264 (21%)
Query: 85 YGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARF--VCKILENEDQ-DSVFEIKD 141
+G E+ FGS LK D DID L V+ D F + IL+N + +F IK+
Sbjct: 1184 FGGELLFFGSYKLKVNSHDSDIDTVCLVPNFVDRDQHFFGDLVNILQNRKEVTELFPIKE 1243
Query: 142 VQYVPAQVKIVKCSVQNIPVDISFN----QMAG----LSALCFLEKVDR----------- 182
A V I+K I D+SF Q G L++ L +D
Sbjct: 1244 -----ASVPIIKMKFSGILFDLSFARAPPQYNGENIDLTSNQMLLSMDERSYRSFNGCRV 1298
Query: 183 ---LIGKDHL-----FKRSVILIKAWCYYE---SRILGAHYGLISTYALEMM-------- 223
LI KD++ FK ++ IK W S I+G G+ T + +
Sbjct: 1299 AEWLIRKDNIQNYENFKITLKCIKLWATNRGIYSNIIGYLSGIGWTILVAKICQLYPNYP 1358
Query: 224 ---VLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVENDGDELLLSPEFLKKCRE 280
++ RF Y+ + W + PV + + E VA T N D+ PEF +
Sbjct: 1359 PNQLIERFFFLYSHWKWHDL------PVYVEHIQEDVANTKLN-ADQ--WRPEFRDRNSM 1409
Query: 281 IYSVPITALENTGHEFAIKHLNIV 304
+ P NT H + L I+
Sbjct: 1410 MIITPCFPCMNTAHNVSPTTLRII 1433
>gi|118404514|ref|NP_001072915.1| poly(A) RNA polymerase GLD2 [Xenopus (Silurana) tropicalis]
gi|123906238|sp|Q0VFA3.1|GLD2_XENTR RecName: Full=Poly(A) RNA polymerase GLD2; AltName:
Full=PAP-associated domain-containing protein 4
gi|110645459|gb|AAI18910.1| PAP associated domain containing 4 [Xenopus (Silurana) tropicalis]
Length = 528
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 77/186 (41%), Gaps = 19/186 (10%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFPFGSVPL--------KTYLP 102
++I E+ +Q + KK+ +QR I ++FP + L T
Sbjct: 204 KQILELFQALQQQVCDLKKKDICRAELQREIQ-----QIFPQSRLYLVGSSLNGFGTRSS 258
Query: 103 DGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIP 160
D D+ L + + AR + +L + I+ Q++ A+V IVK V
Sbjct: 259 DADLCLVLKDEPMNQHTEARHILSLLHKHFYTRLSYIERPQFIKAKVPIVKFRDKVSGAE 318
Query: 161 VDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYAL 220
D++ N + G+ L + ++ + V++IK W Y + A G +S+Y L
Sbjct: 319 FDLNVNNVVGIRNTFLLRTYAYI---ENRVRPLVLVIKMWANYHG-LNDASRGTLSSYTL 374
Query: 221 EMMVLH 226
+M LH
Sbjct: 375 VLMALH 380
>gi|154337346|ref|XP_001564906.1| RNA polymerase II [Leishmania braziliensis MHOM/BR/75/M2904]
gi|134061944|emb|CAM38985.1| RNA polymerase II [Leishmania braziliensis MHOM/BR/75/M2904]
Length = 374
Score = 39.7 bits (91), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 80/200 (40%), Gaps = 38/200 (19%)
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVF----------- 137
V P+GS+ T L DGD D V E C ++E E Q+ +
Sbjct: 68 VLPYGSIVSGTSLRDGDADYIVSFPLASESTCQSIAC-VIERERQEKLLSDIFVHIRKNN 126
Query: 138 ---EIKDVQYVPAQVKIVK------CSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDH 188
E+ + A+V IV+ C + + +S + GL L + + D
Sbjct: 127 RDDELYPQRIFRARVPIVQYVRKSACEISKFDICLS---LGGLKNSLLLRQ---YMAGDP 180
Query: 189 LFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY--YNTFDWDNYCISING 246
+ V+ K W + +IL G IS YAL +M +H F+ TF +D +S
Sbjct: 181 RLRLGVLGAKQWG-RDHQILNTRRGWISPYALSIMYIH-FMKSTGRTTFAFDEEAVS--- 235
Query: 247 PVAISSLPEIVAETVENDGD 266
+ EI++ E++GD
Sbjct: 236 ----QRVNEIISTAAESEGD 251
>gi|449514383|ref|XP_004177159.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase GLD2
[Taeniopygia guttata]
Length = 509
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 95/204 (46%), Gaps = 19/204 (9%)
Query: 35 NLYPFSIDAELWLLAEERIQEIL-----CTIQPAIVSEKK--RKEVINYIQRLINGYYGI 87
L+P +A L + ++ +++L C QP+ + K+ R E+ IQ + +
Sbjct: 167 TLFPDPTEARLPVAKDQLSKQVLEWFQVCRQQPSDLDRKELCRTELQREIQLI---FPQS 223
Query: 88 EVFPFGSV--PLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFE-IKDVQY 144
+F GS T DGD+ L V ++ AR + +++ + I+ Q
Sbjct: 224 RLFLVGSSLNGFGTRTSDGDLCLVVKEEPVNQKTEARRILSLVQKLFTTKLSSYIERPQL 283
Query: 145 VPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCY 202
+ A+V IVK V N+ D++ N + G+ FL + I ++ + V+++K W
Sbjct: 284 IRAKVPIVKFRDKVSNVDFDLNVNNVIGIRN-TFLLRSYAFI--ENRVRPLVLVVKKWAS 340
Query: 203 YESRILGAHYGLISTYALEMMVLH 226
+ I A G +++Y+L +MVLH
Sbjct: 341 FHE-INDASRGTLNSYSLVLMVLH 363
>gi|195045442|ref|XP_001991977.1| GH24509 [Drosophila grimshawi]
gi|193892818|gb|EDV91684.1| GH24509 [Drosophila grimshawi]
Length = 489
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 81/183 (44%), Gaps = 21/183 (11%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP-FGSVPLKTYLPDGDIDLT 109
E I+ + P R EV+ I+ ++ + V FGS +LP DIDL
Sbjct: 270 EEIEHFYQYVLPTPCEHAIRNEVVKRIEAVVQSIWPQAVVEIFGSFRTGLFLPTSDIDLV 329
Query: 110 VLGHQNVEEDLARFVCKILENE-DQDSVFEIKDVQYV-PAQVKIVKCSVQ--NIPVDISF 165
VLG + + LE E + E V+ + A V I+K + + + VDISF
Sbjct: 330 VLGLWE------KLPLRTLEFELVSRGIAEACTVRVLDKASVPIIKLTDRETQVKVDISF 383
Query: 166 NQMAGLSALCFLEKVDR---LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM 222
N +G+ + +++ R ++GK L + +L++ + G IS+Y+L +
Sbjct: 384 NMQSGVQSAELIKQFKREFPVLGKLVLVLKQFLLLRD-------LNEVFTGGISSYSLIL 436
Query: 223 MVL 225
M +
Sbjct: 437 MCI 439
>gi|145550572|ref|XP_001460964.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428796|emb|CAK93567.1| unnamed protein product [Paramecium tetraurelia]
Length = 463
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 44/196 (22%), Positives = 95/196 (48%), Gaps = 28/196 (14%)
Query: 47 LLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYYGIEVFP---------FGSVPL 97
++++++ + +L ++ +++ K+K+ +N ++ LI+ ++ P FGS
Sbjct: 131 MISKDQDKVLLEILKQNEMTDDKKKKFLNSLKSLIHFLKQLDYDPFNAEIKYCIFGSQLN 190
Query: 98 KTYLPDGDIDLTVLGHQNVEEDLA-RFVCKILENEDQDSVFEIKD-VQYVPAQVKIVKCS 155
+ D+D T L + V+E +A +++ + +E DQ+ F+I + V++ V ++
Sbjct: 191 GFGTKESDVDFTFLTNSYVDERIALKYLRQEIEAVDQNK-FKINELVEFARIAVMKIQDQ 249
Query: 156 VQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLI 215
I +D+ FN + G+ L + G + ++ IL+K W G G+I
Sbjct: 250 TNKIEIDMCFNNLLGVINTKLL---NAYAGLNEKVQQGGILLKLW--------GKKQGII 298
Query: 216 -----STYALEMMVLH 226
S+YA+ +M LH
Sbjct: 299 NKNCFSSYAILIMWLH 314
>gi|350406748|ref|XP_003487869.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Bombus
impatiens]
Length = 655
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 67/238 (28%), Positives = 102/238 (42%), Gaps = 51/238 (21%)
Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
I+ ++ + A+V I+K S+QN+ VD++ N G+ L R+ D + V++
Sbjct: 411 IEQLELIQAKVPILKFYDSIQNLEVDLNCNNAVGIRNTHLLYCYSRI---DWRVRPLVLV 467
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLD--------------YYNTF--DWDNY 240
+K W ++ I A IS+Y+L +MVLH FL Y F D +
Sbjct: 468 VKLWAQSQN-INDAKNMTISSYSLVLMVLH-FLQCGVNPPVLPCLHSLYKGKFAPHTDIH 525
Query: 241 CISI----NGPVAI------SSLPEIVAE----TVENDGDELLLSPEFLKK-----CREI 281
CI I N PV+I +L E+ E V D ++ +S K CR
Sbjct: 526 CIDIQEELNIPVSILRPKNRQTLGELFVEFFRYYVMFDFNQYAISVRLANKIAIEECRRA 585
Query: 282 YSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRSVSKGN-FHRIRCALSYGAQRLGE 338
S +N H++ K+L I +P D N RSV + F RI+ Q L E
Sbjct: 586 RS-----YKNDPHQW--KYLCIEEPF-DLTNTARSVYDPDVFARIKHVFDCTYQNLKE 635
>gi|320038983|gb|EFW20918.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
Length = 590
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 56/122 (45%), Gaps = 25/122 (20%)
Query: 68 KKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDLTV 110
+KR ++++ IQR+ + G ++FP+GS L Y P DID V
Sbjct: 48 EKRNQMLHLIQRVTVEFVKTVGRRKGLSQAAIDVSGGKIFPYGSYRLGVYGPGSDIDTLV 107
Query: 111 LG--HQNVEEDLARFVCKILENEDQDSVFEIKDV--QYVPAQVKIVKCSVQNIPVDISFN 166
+G H +E+ F + +DS+ ++ V YVP I+K + I +D+ F
Sbjct: 108 VGPKHVAIEDFFLDFPPTLERLAPKDSIEKLTPVPDAYVP----IIKLELSGISIDLIFA 163
Query: 167 QM 168
++
Sbjct: 164 RL 165
>gi|145525609|ref|XP_001448621.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416176|emb|CAK81224.1| unnamed protein product [Paramecium tetraurelia]
Length = 364
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 13/148 (8%)
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVK 150
FGS + YLP+ DID+ ++ ++ L + V + Q + FE +V + A+V
Sbjct: 108 FGSFKTRMYLPNADIDIVMIETSCTQKQLFKKVAARMMK--QTNKFE--NVNLIANAKVP 163
Query: 151 IVK-CSVQN-IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRIL 208
I+K V++ D+SFNQ+ GL + LEK L + +L+ C R L
Sbjct: 164 IIKFVEVESQYHFDLSFNQLDGLKQIEELEKAFELYP-----ELKFLLMTLKCVLRQRDL 218
Query: 209 GAHY-GLISTYALEMMVLHRFLDYYNTF 235
Y G + ++ L M+L ++ F
Sbjct: 219 NETYSGGVGSFLLFQMILAFLREFRKDF 246
>gi|145546801|ref|XP_001459083.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426906|emb|CAK91686.1| unnamed protein product [Paramecium tetraurelia]
Length = 364
Score = 39.7 bits (91), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 66/146 (45%), Gaps = 9/146 (6%)
Query: 92 FGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFV-CKILENEDQDSVFEIKDVQYVPAQVK 150
FGS + YLP+ DID+ ++ ++ L + V K+++ ++ + VP +K
Sbjct: 108 FGSFKTRMYLPNADIDMVMIETSCTQKQLFKKVAAKMMKQTNKFENVNLIANAKVPI-IK 166
Query: 151 IVKCSVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGA 210
++ Q D+SFNQ+ GL + LEK + + +L+ C R L
Sbjct: 167 FIEVESQ-YHFDLSFNQLDGLKQIEELEKAFEIYP-----ELKFLLMTLKCVLRQRDLNE 220
Query: 211 HY-GLISTYALEMMVLHRFLDYYNTF 235
Y G + ++ L M+L +Y F
Sbjct: 221 TYSGGVGSFLLFQMILAFLREYRKDF 246
>gi|303317852|ref|XP_003068928.1| poly(A) polymerase pla1, putative [Coccidioides posadasii C735
delta SOWgp]
gi|240108609|gb|EER26783.1| poly(A) polymerase pla1, putative [Coccidioides posadasii C735
delta SOWgp]
Length = 599
Score = 39.7 bits (91), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 56/122 (45%), Gaps = 25/122 (20%)
Query: 68 KKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDLTV 110
+KR ++++ IQR+ + G ++FP+GS L Y P DID V
Sbjct: 48 EKRNQMLHLIQRVTVEFVKTVGRRKGLSQAAIDVSGGKIFPYGSYRLGVYGPGSDIDTLV 107
Query: 111 LG--HQNVEEDLARFVCKILENEDQDSVFEIKDV--QYVPAQVKIVKCSVQNIPVDISFN 166
+G H +E+ F + +DS+ ++ V YVP I+K + I +D+ F
Sbjct: 108 VGPKHVAIEDFFLDFPPTLERLAPKDSIEKLTPVPDAYVP----IIKLELSGISIDLIFA 163
Query: 167 QM 168
++
Sbjct: 164 RL 165
>gi|198451695|ref|XP_001358484.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
gi|198131603|gb|EAL27623.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
Length = 400
Score = 39.7 bits (91), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 88/218 (40%), Gaps = 39/218 (17%)
Query: 60 IQPAIVSEKKRKEVINYIQ-RLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEE 118
IQ R E + I+ L++ + G V FGS LP DIDL V ++
Sbjct: 39 IQSTATEFYLRAEAVRCIEDMLVSIWPGASVEVFGSFRTGLNLPLSDIDLVVENRKSY-- 96
Query: 119 DLARFVCKILENEDQDSVFEIKDVQYV----PAQVKIVKCS--VQNIPVDISFNQMAGLS 172
+ + E E + + D Q V A V +VK + V I DISFN +G+
Sbjct: 97 ---WYTPPLYELESELVARGVADPQTVNVVDTAAVPVVKFTERVSQIKFDISFNVGSGVK 153
Query: 173 ALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMM--------- 223
A E + LI + + V+++K + G IS+Y + +M
Sbjct: 154 A---AELIKDLIQEFPDLPKLVMVLKQFLAMHGLNDVYRSGGISSYGITLMCIGFLQHQS 210
Query: 224 --------------VLHRFLDYYN-TFDWDNYCISING 246
+L +FL+YY FD++ Y IS+ G
Sbjct: 211 QSNKKYNNNNRLGILLLKFLEYYGRKFDFNKYAISVTG 248
>gi|448086239|ref|XP_004196053.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
gi|359377475|emb|CCE85858.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
Length = 558
Score = 39.3 bits (90), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 46/203 (22%), Positives = 77/203 (37%), Gaps = 50/203 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ +KR EV+N +Q++ + G ++F FGS L Y P DID
Sbjct: 43 ATRKRVEVLNELQKMAEKFVYEVSLQKNMSDGMARDAGGKIFTFGSYRLGVYGPGSDIDA 102
Query: 108 LTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQNIPVDISFN 166
L V+ D KIL E++++ VP A V I+K + I +D+ F
Sbjct: 103 LVVVPKHVTRNDFFEVFEKILRERK-----ELEEIAAVPDAYVPIIKMVLNGISIDLIFA 157
Query: 167 Q-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
+ + L+ +++ L+ K +FK ++ IK W
Sbjct: 158 RLDVPKVPKDLKLDDKNLLRNIDEKDLRSLNGTRVTDEILTLVPKPTVFKHALRCIKMWA 217
Query: 202 YYESRILGAHYGLISTYALEMMV 224
+ I +G A M+V
Sbjct: 218 QQRA-IYANVFGFPGGVAWAMLV 239
>gi|255732153|ref|XP_002551000.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
gi|240131286|gb|EER30846.1| Poly(A) polymerase PAPalpha [Candida tropicalis MYA-3404]
Length = 558
Score = 39.3 bits (90), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 52/238 (21%), Positives = 86/238 (36%), Gaps = 79/238 (33%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDIDL 108
+ KKR EV+ Q+L+ + G +VF FGS L Y P DID
Sbjct: 44 ATKKRVEVLTLFQKLVQEFVYTVSKSKNMSDGMAKDSGGKVFTFGSYRLGVYGPGSDIDT 103
Query: 109 TVLGHQNVEEDLARFVCKILENEDQDSVF--------EIKDVQYVP-AQVKIVKCSVQNI 159
V+ ++V D D SVF E++++ VP A V I+K I
Sbjct: 104 LVVVPKHVTRD------------DFFSVFPDIIRKRPELEEIACVPDAFVPIIKLEFDGI 151
Query: 160 PVDISFNQ-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSV 194
+D+ + + L+ +++ +L+ K +FK ++
Sbjct: 152 SIDLIMARLNVPRVPLEMTLDDKNLLKNLDEKDLRSLNGTRVTDEILQLVPKPTVFKHAL 211
Query: 195 ILIKAWCYYESRILGAHYGLISTYALEMMV---------------LHRFLDYYNTFDW 237
IK W + + G YG A M+V + +F Y ++W
Sbjct: 212 RCIKLWA-QQRAVYGNVYGFPGGVAWAMLVARICQLYPNAVSAVIVEKFFSIYTKWNW 268
>gi|196004468|ref|XP_002112101.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
gi|190586000|gb|EDV26068.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
Length = 289
Score = 39.3 bits (90), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 92/192 (47%), Gaps = 20/192 (10%)
Query: 42 DAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTY 100
D+ L L EE I + I P + R+ V+ ++ +I + ++V FGS Y
Sbjct: 4 DSSLARLHEE-IIDFYQYISPRPEEKNMRETVVEGVKEVILTLWPHVQVEVFGSFRTGLY 62
Query: 101 LPDGDIDLTVLG--HQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVPAQVKIVKCSV 156
LP DIDL + G + EDL + ++++E D+D++ I + A V I+K +
Sbjct: 63 LPTSDIDLVIFGIDGKGAFEDLEK---ALMQHEVCDRDNIKCIHN-----AMVPIIKLTE 114
Query: 157 Q--NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGL 214
+ N +DI FN + + + + I K K V+++K + ++ + G
Sbjct: 115 KTCNYKMDIEFNIENSVKS---ADIIQTYIRKYEPLKYLVLVLKQFL-FQRELNEVFSGG 170
Query: 215 ISTYALEMMVLH 226
+S+Y L MMV++
Sbjct: 171 VSSYTLVMMVVN 182
>gi|19115813|ref|NP_594901.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe 972h-]
gi|15213942|sp|O13833.2|CID1_SCHPO RecName: Full=Poly(A) RNA polymerase protein cid1; AltName:
Full=Caffeine-induced death protein 1
gi|393715400|pdb|4E7X|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715401|pdb|4E7X|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715402|pdb|4E7X|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715403|pdb|4E7X|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715405|pdb|4E80|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715406|pdb|4E80|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715407|pdb|4E80|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715408|pdb|4E80|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715409|pdb|4E8F|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715410|pdb|4E8F|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|4324457|gb|AAD16889.1| caffeine-induced death protein 1 [Schizosaccharomyces pombe]
gi|5524947|emb|CAB50789.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe]
Length = 405
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 62/146 (42%), Gaps = 16/146 (10%)
Query: 88 EVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPA 147
E+ FGS+ L + D+DL VL V+ D + E FE K +Q A
Sbjct: 84 ELVAFGSLESGLALKNSDMDLCVLMDSRVQSDT---IALQFYEELIAEGFEGKFLQ--RA 138
Query: 148 QVKIVKCSVQ-------NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAW 200
++ I+K + + DI FN + L +L D K V+L+K W
Sbjct: 139 RIPIIKLTSDTKNGFGASFQCDIGFNNRLAIHNTLLLSSYTKL---DARLKPMVLLVKHW 195
Query: 201 CYYESRILGAHYGLISTYALEMMVLH 226
+I ++G +S+Y +MVL+
Sbjct: 196 A-KRKQINSPYFGTLSSYGYVLMVLY 220
>gi|418617333|ref|ZP_13180231.1| hypothetical protein SEVCU120_1888 [Staphylococcus epidermidis
VCU120]
gi|374818626|gb|EHR82776.1| hypothetical protein SEVCU120_1888 [Staphylococcus epidermidis
VCU120]
Length = 671
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 55 EILCTIQPAIVSEKKRKEVINYIQRLIN-----GYYGIEVFPFGSVPLKTYLPDGDIDLT 109
EIL +I+ +S KKR ++ Y +R +N Y I +F S+P T LP ++DL
Sbjct: 258 EILRSIKKKNISWKKRLKIDKYGKRFLNFNNTLSKYAISIFSEKSLPF-TLLPKENLDLF 316
Query: 110 VLGHQNVEEDLARFVCKILENEDQDSVF 137
+ ++ V+ + + EN ++D VF
Sbjct: 317 INDYKTVKNYSKGLIQYLNENSEKDIVF 344
>gi|270009939|gb|EFA06387.1| hypothetical protein TcasGA2_TC009265 [Tribolium castaneum]
Length = 513
Score = 39.3 bits (90), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 85/201 (42%), Gaps = 20/201 (9%)
Query: 30 YSSPPNLYPFSIDAELWLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLI-NGYYGIE 88
+ P++Y I L + I + P R +V+ I+++I + + +
Sbjct: 97 WRKDPHVYARGI-----LGLHQEIDHFYQYMSPTRTEHLVRDDVVCRIKQIILSKWPEAQ 151
Query: 89 VFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCKILENE--DQDSVFEIKDVQYVP 146
V FGS YLP DIDL V+G + + LE E D D E
Sbjct: 152 VEVFGSYRTGLYLPTSDIDLVVIG------KWSNLPLRTLEQEFLDNDVAQENSIKVLDK 205
Query: 147 AQVKIVKCSVQN--IPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYE 204
A V IVK + + I VDISFN G+ + E + I + + + V ++K + E
Sbjct: 206 ASVPIVKLTDKQTEIKVDISFNMSNGVKS---AELIKTYIHQFPVLPKLVYVLKQFL-LE 261
Query: 205 SRILGAHYGLISTYALEMMVL 225
+ G IS+Y+L +M +
Sbjct: 262 RDLNEVFTGGISSYSLILMCI 282
>gi|47208265|emb|CAF92498.1| unnamed protein product [Tetraodon nigroviridis]
Length = 297
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 55/121 (45%), Gaps = 21/121 (17%)
Query: 60 IQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH----- 113
I P EK R EV++ I+ +I+ + EV FGS YLP DIDL V G
Sbjct: 1 ISPRPEEEKMRLEVVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVVFGKWETLP 60
Query: 114 -QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGLS 172
+EE L + + + +E+ V + A V + I VDISFN +G+
Sbjct: 61 LWTLEEALRK---RKVADENSIKVLD-------KATVSLFSL----IFVDISFNMKSGVK 106
Query: 173 A 173
A
Sbjct: 107 A 107
>gi|448081751|ref|XP_004194965.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
gi|359376387|emb|CCE86969.1| Piso0_005494 [Millerozyma farinosa CBS 7064]
Length = 558
Score = 39.3 bits (90), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 47/203 (23%), Positives = 78/203 (38%), Gaps = 50/203 (24%)
Query: 66 SEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGDID- 107
+ +KR EV+N +Q++ + G ++F FGS L Y P DID
Sbjct: 43 ATRKRVEVLNELQKMAERFVYEVSLQKNMSDGMARDAGGKIFTFGSYRLGVYGPGSDIDA 102
Query: 108 LTVLGHQNVEEDLARFVCKIL-ENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFN 166
L V+ D KIL E +D + + + D YVP I+K + I +D+ F
Sbjct: 103 LVVVPKHVTRNDFFEVFEKILRERKDLEEIAAVPDA-YVP----IIKMILNGISIDLIFA 157
Query: 167 Q-------------------------MAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWC 201
+ + L+ +++ L+ K +FK ++ IK W
Sbjct: 158 RLDVPKVPKDLKLDDKNLLRNIDEKDLRSLNGTRVTDEILTLVPKPTVFKHALRCIKMWA 217
Query: 202 YYESRILGAHYGLISTYALEMMV 224
+ I +G A M+V
Sbjct: 218 QQRA-IYANVFGFPGGVAWAMLV 239
>gi|355756767|gb|EHH60375.1| hypothetical protein EGM_11721, partial [Macaca fascicularis]
Length = 483
Score = 39.3 bits (90), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 12/112 (10%)
Query: 51 ERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLT 109
E I + + P EK R EV+N I+ +I + +V FGS YLP DIDL
Sbjct: 21 EEISDFYEYMSPRPEEEKMRMEVVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLV 80
Query: 110 VLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQY--VPAQVKIVK 153
V G +EE L + + +ED V + V+Y +P V ++K
Sbjct: 81 VFGKWENLPLWTLEEALRKHK---VADEDSVKVLDKATVKYPVLPYLVLVLK 129
>gi|219130012|ref|XP_002185169.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403348|gb|EEC43301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 425
Score = 39.3 bits (90), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 111/256 (43%), Gaps = 50/256 (19%)
Query: 11 EDLQEELLTSPSHTSSSSSYSSPPNLYPFSI---DAELWLLAEERIQEILCTIQPAIVSE 67
ED+ E++ TS +SS PP + I A I + + ++P
Sbjct: 37 EDVDEDMDTS----TSSEDDPLPPWIEHHDIGLQKAHAATALHNEIVQFVRLMEPMPEEI 92
Query: 68 KKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEED------- 119
++R++++ ++ L++ + +V FGS +LP D+DL V+ ++ ++
Sbjct: 93 RQREDLVKRVEELVHRTFDNAQVHVFGSQATGLFLPSSDVDLLVITNEKANDETSQPDRQ 152
Query: 120 ----------LARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSVQ--NIPVDISFN 166
L RF + E D + E+ ++ + +V +VK + NI VD+ F+
Sbjct: 153 EDWQKPSGSPLDRFESVLRE----DWLMELSYLEVIGNTKVPLVKFTHAPTNISVDVCFD 208
Query: 167 QMAGLSAL----CFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
Q +G A +LE + L R + + + + +R L Y G + +Y L+
Sbjct: 209 QESGPGAAQLMKTYLEALPPL--------RPLTFVLKY-FLSARGLNEPYSGGVGSYLLQ 259
Query: 222 MMVL----HRFLDYYN 233
+M++ HR D YN
Sbjct: 260 LMIVSFLQHRERDAYN 275
>gi|380015769|ref|XP_003691868.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Apis
florea]
Length = 652
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 51/95 (53%), Gaps = 7/95 (7%)
Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
I+ ++ + A+V I+K S+QN+ VD++ N G+ L R+ D + V++
Sbjct: 408 IEQLELIQAKVPILKFHDSIQNLEVDLNCNNAVGIRNTHLLYCYSRI---DWRVRPLVLV 464
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
+K W + I A IS+Y+L +MV+H FL Y
Sbjct: 465 VKLWAQSQD-INDAKNMTISSYSLVLMVIH-FLQY 497
>gi|430813412|emb|CCJ29233.1| unnamed protein product [Pneumocystis jirovecii]
Length = 398
Score = 38.9 bits (89), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 87/190 (45%), Gaps = 23/190 (12%)
Query: 70 RKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDLTVLGH----QNVEEDLARFV 124
R+ V+ I L+ ++ +++ FGS YLP DIDL +L +DL + +
Sbjct: 107 RELVVQRINALVQKHWKNVQLCAFGSFDTMLYLPTSDIDLVILSLGPRIYETRKDLHK-L 165
Query: 125 CKILENEDQDSVFEIKDVQYVP-AQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVD 181
+ L + KD+Q + A V I+K ++ I VDISFN+ GL + +++
Sbjct: 166 SRYLRCSNV-----AKDIQVITGASVPIIKFIDTLTQIHVDISFNKPGGLVSANIIKQ-- 218
Query: 182 RLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEM----MVLHRFLDYYNT-FD 236
+ + + K V+ IK + + H L+S E ++L F + Y F+
Sbjct: 219 -YMKEHYALKPLVMFIKHFLNMRG-LNERHPKLLSKEIKEQDNLGVLLMEFFELYGKLFN 276
Query: 237 WDNYCISING 246
++ ISIN
Sbjct: 277 YNEVGISINN 286
>gi|133919900|emb|CAL91353.1| cytoplasmic poly(A) polymerase [Xenopus laevis]
Length = 466
Score = 38.9 bits (89), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 9/116 (7%)
Query: 113 HQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAG 170
+QN E AR + +L + I+ Q++ A+V IVK V D++ N + G
Sbjct: 210 NQNTE---ARHILSLLHKHFYTRLSYIERPQFIRAKVPIVKFRDKVSGAEFDLNVNNVVG 266
Query: 171 LSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLH 226
+ L L D + V++IK W + I A G +S+Y + +MVLH
Sbjct: 267 IRNTFLLRTYAYL---DKRVRPLVLVIKKWANHHG-INDASRGTLSSYTIVLMVLH 318
>gi|145530898|ref|XP_001451221.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418865|emb|CAK83824.1| unnamed protein product [Paramecium tetraurelia]
Length = 583
Score = 38.9 bits (89), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 138/350 (39%), Gaps = 69/350 (19%)
Query: 75 NYIQRLIN--------GYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCK 126
N Q+LI+ Y G ++ +GS + LP DIDL V+ N E + +
Sbjct: 275 NAFQKLIDEISAVVSATYAGAQIDVYGSYATELCLPHSDIDL-VIKISNQHEKFVTDILQ 333
Query: 127 ILENEDQDSVF--EIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR 182
+E E + F E K V V KC+ Q N +DIS + C ++ R
Sbjct: 334 RIEVELKKCKFIEETKCVTQSTTPVLRAKCNKQYMNKRLDISIQETKHNGLQCV--QLIR 391
Query: 183 LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFD--WDNY 240
K++ + + LI ++S + + G +S+Y L +M++ FL Y D W
Sbjct: 392 KYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGLSSYGLILMIV-SFLQSYQNQDKSWPTI 450
Query: 241 CISINGPVAISSLP------EIVAETVENDGDELLLSPEFLKKCREIYSVPITALENTGH 294
G + I L + +T+ D PE ++ I P H
Sbjct: 451 -----GTLLIEFLNVYGCELDYAGKTICPD------QPEVFEQETTIIFDP--------H 491
Query: 295 EFAI---KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGL 351
FA + L I+DPL NN+GR N +++ A + +L LT L
Sbjct: 492 NFAYCQQQSLVIIDPLNPQNNVGR--PSYNVAKLKLAFTVAFSKL---LTFDSSTQQYPL 546
Query: 352 EKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYG 401
KFF N+ + ++Q +H+ L Y +Y + +++G
Sbjct: 547 -KFFFNSAQ--------NIQCSLHSV---------LVNQYKAYADQIMFG 578
>gi|258563094|ref|XP_002582292.1| Poly(A) polymerase PAPalpha [Uncinocarpus reesii 1704]
gi|237907799|gb|EEP82200.1| Poly(A) polymerase PAPalpha [Uncinocarpus reesii 1704]
Length = 616
Score = 38.9 bits (89), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 60/125 (48%), Gaps = 21/125 (16%)
Query: 63 AIVSEKKRKEVINYIQRLINGYY-----------------GIEVFPFGSVPLKTYLPDGD 105
A+++ RK++++ +QR+ + G ++F FGS L Y P D
Sbjct: 60 ALLNINPRKQILHLLQRVTIEFIKSVGRKKGLSQGAIDAAGGKIFTFGSYRLGVYGPGSD 119
Query: 106 IDLTVLGHQNV--EEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDI 163
ID V+G ++V E+ A F + + +DS+ ++ V A V I+K + I +D+
Sbjct: 120 IDTLVVGPKHVTREDFFADFPSTLEQLAPKDSIEKLTSVP--DAFVPIMKLELSGISIDL 177
Query: 164 SFNQM 168
F ++
Sbjct: 178 IFARL 182
>gi|145478719|ref|XP_001425382.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392452|emb|CAK57984.1| unnamed protein product [Paramecium tetraurelia]
Length = 592
Score = 38.9 bits (89), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 140/350 (40%), Gaps = 69/350 (19%)
Query: 75 NYIQRLIN--------GYYGIEVFPFGSVPLKTYLPDGDIDLTVLGHQNVEEDLARFVCK 126
N Q+LI+ Y G ++ +GS + LP DIDL V+ N E + +
Sbjct: 284 NAFQKLIDEISGVVSATYAGAQIDVYGSYATELCLPHSDIDL-VIKISNQHEKFVTDILQ 342
Query: 127 ILENEDQDSVF--EIKDVQYVPAQVKIVKCSVQ--NIPVDISFNQMAGLSALCFLEKVDR 182
+E E + F E K V V KC+ Q N +DIS + C ++ R
Sbjct: 343 RIEVELKKCKFIEETKCVTQSTTPVLRAKCNKQYMNKRLDISIQETKHNGLQCV--QLIR 400
Query: 183 LIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYALEMMVLHRFLDYYNTFDWDNYCI 242
K++ + + LI ++S + + G +S+Y L +M++ FL Y D
Sbjct: 401 KYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGLSSYGLILMIV-SFLQSYQNQDK----- 454
Query: 243 SINGPVAISSLPEIVAETVENDGDELLLS--------PEFLKKCREIYSVPITALENTGH 294
N P ++ ++ E + G EL + PE ++ I P H
Sbjct: 455 --NWP----TIGTLLIEFLNVYGCELDYAGKTICPDQPEVFEQETTIIFDP--------H 500
Query: 295 EFAI---KHLNIVDPLKDNNNLGRSVSKGNFHRIRCALSYGAQRLGEILTLPGHCLGMGL 351
FA + L I+DPL NN+GR N +++ A + +L LT L
Sbjct: 501 NFAYCQQQSLVIIDPLNPQNNVGR--PSYNVAKLKLAFTVAFSKL---LTFDSSNQQYPL 555
Query: 352 EKFFINTLERNGKGLRPDVQVPIHAFGSGKSEAADLSGDYDSYYNGLLYG 401
KFF N+ + ++Q +H+ L Y +Y + +++G
Sbjct: 556 -KFFFNSAQ--------NIQCSLHSV---------LVNQYKAYADQIMFG 587
>gi|383852647|ref|XP_003701838.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Megachile rotundata]
Length = 573
Score = 38.9 bits (89), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 69/283 (24%), Positives = 118/283 (41%), Gaps = 46/283 (16%)
Query: 50 EERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDGDIDL 108
E I++ + P+ R V+ I+++I + +V FGS YLP DIDL
Sbjct: 122 HEEIEDFFAYMCPSNEEHSLRIRVVKRIEQVIYDLWPDSKVEVFGSFRTGLYLPTSDIDL 181
Query: 109 TVLGH------QNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVD 162
V+G + +E L + +N + S ++ D VP VK+ + I VD
Sbjct: 182 VVIGMWTNLPLRTLERAL------LDQNIAEPSSIKVLDKASVPI-VKLTDKETE-IKVD 233
Query: 163 ISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHY-GLISTYALE 221
ISFN G+ + E ++ + + ++ V+++K + R L + G IS+Y+L
Sbjct: 234 ISFNMSNGVKS---AELINSFKKRYPVLEKLVMVLKQ--FLLQRDLNEVFTGGISSYSLI 288
Query: 222 MMVLHRFLDYYNTFDWDNYCISINGPVAISSLPEIVAETVE--------NDGDELLLSPE 273
+M + FL + + YC + N V + E+ DG + E
Sbjct: 289 LMTI-SFLQLHPR--QNAYCSNANLGVLLIEFLELYGRKFNYVKTGIRVKDGGTYISKEE 345
Query: 274 FLKKCREIYSVPITALENTGHEFAIKHLNIVDPLKDNNNLGRS 316
+ + GH ++ L I DPL N++GRS
Sbjct: 346 VQRDMID------------GHRPSL--LCIEDPLTPGNDIGRS 374
>gi|449669254|ref|XP_002166747.2| PREDICTED: poly(A) RNA polymerase GLD2-A-like [Hydra
magnipapillata]
Length = 437
Score = 38.9 bits (89), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 139 IKDVQYVPAQVKIVKC--SVQNIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVIL 196
I+++Q++ A+V I+K +V DI+ N G+ L ++ D + ++
Sbjct: 213 IRNIQFIRAKVPILKFKDTVSGCDCDINTNNSIGIRNTHLLRTYSKI---DDRVRPLIMA 269
Query: 197 IKAWCYYESRILGAHYGLISTYALEMMVLHRFLDY 231
+K W S I A G +S+Y+L MMV+H Y
Sbjct: 270 VKHWAKSRS-INDASQGTLSSYSLVMMVIHYLQSY 303
>gi|145535359|ref|XP_001453418.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421129|emb|CAK86021.1| unnamed protein product [Paramecium tetraurelia]
Length = 634
Score = 38.9 bits (89), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 106/248 (42%), Gaps = 33/248 (13%)
Query: 53 IQEILCTIQPAIVSEKKRKEV-INYIQRLINGYYGIEVFPFGSVPLKTYLPDGDIDLTVL 111
I E+ IQ + RK++ +N + LI Y + ++P+GS L D+D+ +
Sbjct: 308 IDELSNYIQQKSYQNRIRKQICLNRLNYLIYTNYHLNMYPYGSFETGLDLEISDVDVGIW 367
Query: 112 GHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQNIPVDISFNQMAGL 171
G QN+ ++L N + + F IK + + +Q+ I+K + P +++ +
Sbjct: 368 GSQNLSYSQIVSFLQLLNNTLKQTPFLIKS-KLIQSQMPILKLELN--PKSSFYDEDQHI 424
Query: 172 S---ALCFLEKVDRLIGKDHLFKRSVILIK-AWCYYESRILGA-HYGLISTYALE----- 221
+ L++ D+ I + F +I + W Y S + H G ST ++
Sbjct: 425 QQNWSYFHLDQNDQGIIQSSQFLGKIIQVDITWIYQWSNVYNNPHLGFASTTIVKDWVSR 484
Query: 222 -------MMVLHRFLDYYNTFDW-----DNYCISINGPVAISSLPEIVAETVENDGDELL 269
M++L + N D ++C+SI + + + +ND ++L
Sbjct: 485 FVWYRDIMLILKHLVKSKNLNDAHTGGISSFCLSI-----MLAAIYMCKHYTQNDKKQIL 539
Query: 270 LSPEFLKK 277
L +FLKK
Sbjct: 540 L--DFLKK 545
>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
Length = 573
Score = 38.9 bits (89), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 81/186 (43%), Gaps = 17/186 (9%)
Query: 46 WLLAEERIQEILCTIQPAIVSEKKRKEVINYIQRLINGYY-GIEVFPFGSVPLKTYLPDG 104
WL E I++ + I P+ + R + I+ I+ + + ++ FGS YLP
Sbjct: 170 WLTFE--IKDFVAYISPSREEIEVRNQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGS 227
Query: 105 DIDLTV---LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVPAQVKIVKCSVQN--I 159
DID + LG + +L + + ++ EI+ V A+V I+K N I
Sbjct: 228 DIDCVITSELGGKESRNNLFSLASHL---KKKNLATEIEVV--AKARVPIIKFVEPNSGI 282
Query: 160 PVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLISTYA 219
+D+SF + GL A K+ R D R ++LI + R+ H G + ++
Sbjct: 283 HIDVSFERTNGLEAA----KLIREWLNDTPGLRELVLIVKQFLHSRRLNNVHTGGLGGFS 338
Query: 220 LEMMVL 225
+ +V
Sbjct: 339 IICLVF 344
>gi|356537950|ref|XP_003537469.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Glycine max]
Length = 328
Score = 38.9 bits (89), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 43/191 (22%), Positives = 85/191 (44%), Gaps = 24/191 (12%)
Query: 53 IQEILCTIQPAIVSEKKRKEVINYIQRLING---YYGIEVFPFGSVPLKTYLPDGDIDLT 109
+ +IL + P + R +IN + ++ G V P+GS + GD+D++
Sbjct: 11 VNDILRVVTPLQEDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNLFTRWGDLDIS 70
Query: 110 V---------LGHQNVEEDLARFVCKILENEDQDSVFEIKDVQYVP-AQVKIVKCSV--Q 157
+ + ++ L V K L + S ++Q++ A+V I+K Q
Sbjct: 71 IELSNGLHISSAGKKQKQTLLGEVLKALRMKGGGS-----NLQFISNARVPILKFKSYRQ 125
Query: 158 NIPVDISFNQMAGLSALCFLEKVDRLIGKDHLFKRSVILIKAWCYYESRILGAHYGLIST 217
+ DIS N + G L ++++ G+ F+ V+L+K W +I + G ++
Sbjct: 126 GVSCDISINNLPGQMKSKILLWINKIDGR---FRHMVLLVKEWAKAH-KINNSKAGTFNS 181
Query: 218 YALEMMVLHRF 228
Y+L ++V+ F
Sbjct: 182 YSLSLLVIFYF 192
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.135 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,021,094,170
Number of Sequences: 23463169
Number of extensions: 496789835
Number of successful extensions: 1165967
Number of sequences better than 100.0: 611
Number of HSP's better than 100.0 without gapping: 159
Number of HSP's successfully gapped in prelim test: 452
Number of HSP's that attempted gapping in prelim test: 1164856
Number of HSP's gapped (non-prelim): 985
length of query: 659
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 510
effective length of database: 8,863,183,186
effective search space: 4520223424860
effective search space used: 4520223424860
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)