BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013307
(445 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255566462|ref|XP_002524216.1| zinc finger protein, putative [Ricinus communis]
gi|223536493|gb|EEF38140.1| zinc finger protein, putative [Ricinus communis]
Length = 493
Score = 556 bits (1433), Expect = e-156, Method: Compositional matrix adjust.
Identities = 284/488 (58%), Positives = 343/488 (70%), Gaps = 56/488 (11%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60
M +++VLEPIL+D L ++ PLREDW R K+I +L++V+ S+ESLRGATVEPFGSFVSNL
Sbjct: 1 MNAHSVLEPILRDTLEVIKPLREDWAVRSKIIEELKDVIASIESLRGATVEPFGSFVSNL 60
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
F+RWGDLDISI L+NGS ISSA KK KQ++L + +ALRQKGG+RRLQFV +ARVP+LKF
Sbjct: 61 FTRWGDLDISIMLANGSYISSAAKKRKQNVLREFHKALRQKGGWRRLQFVPNARVPLLKF 120
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
E+ QNISCD+SIDNL GQIKS FLFW++QIDGRFRDMVLLVKEWAKAH+INNPKTGT N
Sbjct: 121 ESGRQNISCDVSIDNLQGQIKSNFLFWLNQIDGRFRDMVLLVKEWAKAHNINNPKTGTLN 180
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
SYSLSLLV+FHFQTCVPAILPPLK+IYP N+VDDL GVR AE +I E C NIAR+ SD
Sbjct: 181 SYSLSLLVIFHFQTCVPAILPPLKEIYPRNVVDDLTGVRTVAEERIKETCNANIARYMSD 240
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
KYR +NRSSL+ LF+SF KFSG+SLKA++LGIC FTGQW IRS RWLP + LFIED
Sbjct: 241 KYRAVNRSSLSELFISFFAKFSGISLKAADLGICTFTGQWLDIRSTMRWLPKTYALFIED 300
Query: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVR 360
PFEQPEN+ARAVS NL KI+ AF+ T+ +L NQ R +LL +L RP IL +PVR
Sbjct: 301 PFEQPENAARAVSAGNLVKIAEAFQTTYHKLVLANQNRTSLLGTLVRPEILNCIAGTPVR 360
Query: 361 YANYNNGHRRA-RPQSHKSVNSPLQAQHQSHNA------------KKENRPNRSMSQQSV 407
+Y + H ++ PQ KS+ S Q QHQ N ++E P+ S SQ V
Sbjct: 361 NLSYTSLHYQSTHPQISKSMYSSPQVQHQFQNMRQEKHQKIFTAQRQEKHPHSSNSQYRV 420
Query: 408 QQ--------------HQS-----------------------------QPVRQINGQVQQ 424
Q H+S +P + +GQ QQ
Sbjct: 421 QNTRLEKHPSYLAKQGHESHPENTRLERHPNYFAMQKQESNVNTSTRKKPAQYYHGQGQQ 480
Query: 425 IWRPKSDG 432
+WRPKSDG
Sbjct: 481 LWRPKSDG 488
>gi|359486610|ref|XP_002277771.2| PREDICTED: poly(A) RNA polymerase GLD2-like [Vitis vinifera]
gi|296086183|emb|CBI31624.3| unnamed protein product [Vitis vinifera]
Length = 453
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 263/452 (58%), Positives = 327/452 (72%), Gaps = 21/452 (4%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60
M ++NVLE +LKDIL ++NP REDW R ++I+D R V+SVESLRGATVEPFGSF+SNL
Sbjct: 1 MSTFNVLEIVLKDILLVINPSREDWAIRNQLIADFRTAVDSVESLRGATVEPFGSFLSNL 60
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
+++WGDLDISIEL NG+ ISSAGK+ KQ+LLG +L ALR KGG+R+LQF+ +ARVPI+KF
Sbjct: 61 YTQWGDLDISIELPNGAYISSAGKRHKQTLLGHVLNALRSKGGWRKLQFIPNARVPIIKF 120
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
E+ H NISCD+SI+NL GQ+KSKFLFWIS IDGRFRD+VLLVKEWA+AHDINN KTGT N
Sbjct: 121 ESYHPNISCDVSINNLKGQMKSKFLFWISGIDGRFRDLVLLVKEWARAHDINNSKTGTLN 180
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
SYSLSLLV+FH QTC PAILPPLK+IYPGN+ DDL GVRA E QI E A NI RF D
Sbjct: 181 SYSLSLLVVFHLQTCRPAILPPLKEIYPGNVADDLIGVRAVVEGQIEETSAANINRFKRD 240
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
+ R NRSSL+ LF+SFL KF ++ +ASE GICP+TGQW I SN RW+P + LF+ED
Sbjct: 241 RSRAPNRSSLSELFISFLAKFVDITSRASEQGICPYTGQWVDIDSNMRWMPRTYELFVED 300
Query: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVR 360
PFEQPEN+AR V + L +IS AF+ TH RLTS NQ +++L+ +L RP I QF +P R
Sbjct: 301 PFEQPENTARGVRSRQLQRISEAFQTTHQRLTSANQDQHSLIDTLVRPQIAQFIRRAPSR 360
Query: 361 YAN-YNNGHRRARPQSHKSVNSPLQAQHQSHNAKKENRPN-------------------R 400
++ Y + R P NSPLQ Q+ N + ++RPN R
Sbjct: 361 NSSAYGRNNSRTYPSVPNVANSPLQFQNDFQNRRPQSRPNTTSQRSAPVQARPNSVTMQR 420
Query: 401 SMSQQSVQQHQSQPVRQ-INGQVQQIWRPKSD 431
SM + + V+Q Q Q++WRP+SD
Sbjct: 421 SMYTRPGSSTVQRSVQQATQSQSQRVWRPRSD 452
>gi|449465848|ref|XP_004150639.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
gi|449516431|ref|XP_004165250.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cucumis sativus]
Length = 464
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 261/459 (56%), Positives = 319/459 (69%), Gaps = 38/459 (8%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
L+ ++KDIL ++ PL++DW R +VI++LR VV+S+ESLRGAT+EPFGSFVSNLFSRWG
Sbjct: 5 TLDRVIKDILRVVEPLQDDWTARFQVINELRNVVQSIESLRGATIEPFGSFVSNLFSRWG 64
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLD+S++L+NGS S+AGKK KQ+LL D+ A R+ G + +LQ + HARVPILK E I
Sbjct: 65 DLDLSVQLNNGSYTSTAGKKRKQTLLRDIQNASRKNGRWYKLQLIPHARVPILKIEHIQH 124
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
NISCDISIDNL GQIKSK L W+++IDGRF DMVLLVKEWAKAHDINN K GTFNSYSLS
Sbjct: 125 NISCDISIDNLVGQIKSKILLWVNEIDGRFHDMVLLVKEWAKAHDINNSKQGTFNSYSLS 184
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI 245
LLV+FHFQTC PAI PPL+DIYPGN+VD+LKGVRA E +IA CA NIARF S R
Sbjct: 185 LLVIFHFQTCSPAIFPPLRDIYPGNVVDNLKGVRAEVENEIARTCATNIARFKS---RTA 241
Query: 246 NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
NRSSL+ LFVSFL KFS +S KASELGICP+TGQW I SN RWLP + +F+EDPFEQP
Sbjct: 242 NRSSLSELFVSFLAKFSDISSKASELGICPYTGQWLKIESNMRWLPKTYAIFVEDPFEQP 301
Query: 306 ENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFF----GESPVRY 361
EN+ARA++ + L +IS AF MTH RLTS Q R ++L+ LARP I Q G +
Sbjct: 302 ENTARAINARQLMRISEAFRMTHLRLTSVYQNRSSILNDLARPQISQLIINSSGSASAPA 361
Query: 362 ANYNNGHRRARPQSHKS-VNSPLQ-AQHQSHN-----------AKKENRPNRSMSQ-QSV 407
N N + RPQ H++ V P QHQ N A P+ SQ +
Sbjct: 362 FNVEN-YTPIRPQVHQARVMQPRPWIQHQFQNNIPRFNMGNFPAINSQAPHAGTSQSHPL 420
Query: 408 QQHQS----------------QPVRQINGQVQQIWRPKS 430
QH++ +P + +GQ QQ WRP+S
Sbjct: 421 VQHKTPKTKRIVSSPNVLNVGEPSKTYSGQGQQKWRPRS 459
>gi|356569346|ref|XP_003552863.1| PREDICTED: poly(A) RNA polymerase cid11-like [Glycine max]
Length = 415
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 243/431 (56%), Positives = 311/431 (72%), Gaps = 17/431 (3%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60
M +++ L+ ++ DIL ++ P++EDWE R +I+DLR +VESVESLRGATVEPFGSFVSNL
Sbjct: 1 MSTHSTLDIVVNDILRVVTPVQEDWEIRFAIINDLRSIVESVESLRGATVEPFGSFVSNL 60
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
F+RWGDLDISIELSNG ISSAGKK KQ+ LGD+L+ALR KGG LQF+++ARVPILKF
Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTFLGDVLKALRMKGGGSNLQFISNARVPILKF 120
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
++ Q +SCDISI+NL GQ+KSK L WI++IDGRFR MVLLVKEWAKAH INN K GTFN
Sbjct: 121 KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
SYSLSLLV+F+FQTC+PAI PPLKDIYPGN+VDDL GVR++AE IA+ C NI RF S+
Sbjct: 181 SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMVDDLIGVRSDAENLIAQTCDANINRFISN 240
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
+ R INR S+A LFV F+ KF+ + A ++GICP++G+WE I N WLP + +F+ED
Sbjct: 241 RARSINRKSVAELFVEFIGKFAKMDSMAVKMGICPYSGKWEQIEDNMIWLPKTYAIFVED 300
Query: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVR 360
PFEQP+N+AR+VS L KI+ AF TH LTSTNQ + +LLS++A +++
Sbjct: 301 PFEQPQNTARSVSAGQLKKITEAFARTHDLLTSTNQNQISLLSNMAPAHVIRCITRP--- 357
Query: 361 YANYNNGH-RRARPQSHKSVNSPLQAQHQSHNAKKENRPNRSMSQQSVQQHQSQPVRQIN 419
Y G+ +PQ +++ LQ+Q N + N S S+ H+
Sbjct: 358 ---YGGGYFHPTQPQVQRAIRPQLQSQRHFQNVSQGTSSNSSSSKGHTLVHRG------- 407
Query: 420 GQVQQIWRPKS 430
QQIWRPKS
Sbjct: 408 ---QQIWRPKS 415
>gi|356537950|ref|XP_003537469.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Glycine max]
Length = 328
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 210/324 (64%), Positives = 263/324 (81%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60
M ++++L+ ++ DIL ++ PL+EDWE R +I+D R +VESVESLRGATVEP+GSFVSNL
Sbjct: 1 MSTHSMLDIVVNDILRVVTPLQEDWEIRFAIINDFRSIVESVESLRGATVEPYGSFVSNL 60
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
F+RWGDLDISIELSNG ISSAGKK KQ+LLG++L+ALR KGG LQF+++ARVPILKF
Sbjct: 61 FTRWGDLDISIELSNGLHISSAGKKQKQTLLGEVLKALRMKGGGSNLQFISNARVPILKF 120
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
++ Q +SCDISI+NL GQ+KSK L WI++IDGRFR MVLLVKEWAKAH INN K GTFN
Sbjct: 121 KSYRQGVSCDISINNLPGQMKSKILLWINKIDGRFRHMVLLVKEWAKAHKINNSKAGTFN 180
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
SYSLSLLV+F+FQTC+PAI PPLKDIYPGN++DDL G+R++AE IAE C NI RF S+
Sbjct: 181 SYSLSLLVIFYFQTCIPAIFPPLKDIYPGNMIDDLIGIRSDAENLIAETCDANINRFISN 240
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
+ R INR S+A LFV F+ KF+ + A E+GICP+TG+WE I N WLP + +F+ED
Sbjct: 241 RARSINRKSVAELFVDFVGKFAKMDSMAVEMGICPYTGKWEQIEDNMIWLPKTYAIFVED 300
Query: 301 PFEQPENSARAVSEKNLAKISNAF 324
PFEQP+N+AR+VS L KI+ F
Sbjct: 301 PFEQPQNTARSVSAGQLKKITETF 324
>gi|79571331|ref|NP_181504.2| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|53850481|gb|AAU95417.1| At2g39740 [Arabidopsis thaliana]
gi|55733735|gb|AAV59264.1| At2g39740 [Arabidopsis thaliana]
gi|330254623|gb|AEC09717.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 511
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 236/424 (55%), Positives = 303/424 (71%), Gaps = 21/424 (4%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L+P L++IL ++ P R D +TR+ VI LR+V++SVE LRGATV+PFGSFVSNLF+RWGD
Sbjct: 7 LDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGD 66
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LDIS++L +GS I GKK KQ+LLG LLRALR G + +LQFV HARVPILK + HQ
Sbjct: 67 LDISVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQR 126
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
ISCDISIDNL G +KS+FLFWIS+IDGRFRD+VLLVKEWAKAH+IN+ KTGTFNSYSLSL
Sbjct: 127 ISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSL 186
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
LV+FHFQTCVPAILPPL+ IYP + VDDL GVR AE IA++ A NIARF S++ + +N
Sbjct: 187 LVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVN 246
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
RSSL+ L VSF KFS +++KA E G+CPFTG+WE I SNT WLP + LF+EDPFEQP
Sbjct: 247 RSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKTYSLFVEDPFEQPV 306
Query: 307 NSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGES---PVRY-- 361
N+AR+VS +NL +I+ F++T RL S R +++ L I + + P ++
Sbjct: 307 NAARSVSRRNLDRIAQVFQITSRRLVS-ECNRNSIIGILTGQHIQESLYRTISLPSQHHA 365
Query: 362 ---ANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKK--------ENRPNRSMSQQSVQQH 410
N N H +ARPQ+ + Q QS+N ++RP ++ +Q + +
Sbjct: 366 NGMHNVRNLHGQARPQNQQM----QQNWSQSYNTPNPPHWPPLTQSRPQQNWTQNNPRNL 421
Query: 411 QSQP 414
Q QP
Sbjct: 422 QGQP 425
>gi|110735731|dbj|BAE99845.1| hypothetical protein [Arabidopsis thaliana]
Length = 511
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 236/424 (55%), Positives = 302/424 (71%), Gaps = 21/424 (4%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L+P L++IL ++ P R D +TR+ VI LR+V++SVE LRGATV+PFGSFVSNLF+RWGD
Sbjct: 7 LDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGD 66
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LDIS++L +GS I GKK KQ LLG LLRALR G + +LQFV HARVPILK + HQ
Sbjct: 67 LDISVDLFSGSSILFTGKKQKQILLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQR 126
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
ISCDISIDNL G +KS+FLFWIS+IDGRFRD+VLLVKEWAKAH+IN+ KTGTFNSYSLSL
Sbjct: 127 ISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSL 186
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
LV+FHFQTCVPAILPPL+ IYP + VDDL GVR AE IA++ A NIARF S++ + +N
Sbjct: 187 LVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVN 246
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
RSSL+ L VSF KFS +++KA E G+CPFTG+WE I SNT WLP + LF+EDPFEQP
Sbjct: 247 RSSLSELLVSFFAKFSDINVKAQEFGVCPFTGRWETISSNTTWLPKTYSLFVEDPFEQPV 306
Query: 307 NSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGES---PVRY-- 361
N+AR+VS +NL +I+ F++T RL S R +++ L I + + P ++
Sbjct: 307 NAARSVSRRNLDRIAQVFQITSRRLVS-ECNRNSIIGILTGQHIQESLYRTISLPSQHHA 365
Query: 362 ---ANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKK--------ENRPNRSMSQQSVQQH 410
N N H +ARPQ+ + Q QS+N ++RP ++ +Q + +
Sbjct: 366 NGMHNVRNLHGQARPQNQQM----QQNWSQSYNTPNPPHWPPLTQSRPQQNWTQNNPRNL 421
Query: 411 QSQP 414
Q QP
Sbjct: 422 QGQP 425
>gi|297823863|ref|XP_002879814.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
lyrata]
gi|297325653|gb|EFH56073.1| hypothetical protein ARALYDRAFT_321659 [Arabidopsis lyrata subsp.
lyrata]
Length = 500
Score = 433 bits (1114), Expect = e-119, Method: Compositional matrix adjust.
Identities = 230/424 (54%), Positives = 302/424 (71%), Gaps = 21/424 (4%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L+P L++IL ++ P R DW+TR++VI LR+V+++VE LRGATV+PFGSFVSNLF+RWGD
Sbjct: 7 LDPTLQEILQVIKPTRADWDTRIRVIDQLRDVLQTVECLRGATVQPFGSFVSNLFTRWGD 66
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LD+S++L +GS I GKK KQ+LL LLRALR G + +LQFV HARVPILK + HQ
Sbjct: 67 LDLSVDLFSGSSILFTGKKQKQTLLRHLLRALRASGLWYKLQFVIHARVPILKVVSGHQR 126
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
I+CDISIDNL G +KS+FLFWIS+IDGRFRD+VLLVKEWAKAH+IN+ K GTFNSYSLSL
Sbjct: 127 IACDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKNGTFNSYSLSL 186
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
LV+FH QTCVPAILPPL+ IYP + VDDL GVR AE IA++ A NIARF + + +N
Sbjct: 187 LVIFHLQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKLNTAKSVN 246
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
RSSL+ L VSF KFS ++LKA ELG+CPFTG+WE+I SNT WLP + LF+EDPFEQP
Sbjct: 247 RSSLSELLVSFYAKFSDINLKAQELGVCPFTGRWENISSNTTWLPKTYSLFVEDPFEQPV 306
Query: 307 NSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYA---- 362
N+AR+VS +NL +I+ F++T RL S + R +++ L I + + ++
Sbjct: 307 NAARSVSRRNLDRIAQVFQITSRRLVS-DCNRNSIIGVLTGQHIQESLHRTISLHSQQHA 365
Query: 363 ----NYNNGHRRARPQSHKSVNSPLQAQHQSHNAKK--------ENRPNRSMSQQSVQQH 410
N N H +AR Q+ + Q QS+N + ++RP ++ Q +++
Sbjct: 366 NSMHNVRNLHGQARHQNQQM----QQNWSQSYNTQNPPYWPPPTQSRPQQNWPQNNLRNL 421
Query: 411 QSQP 414
Q QP
Sbjct: 422 QGQP 425
>gi|224112707|ref|XP_002316267.1| predicted protein [Populus trichocarpa]
gi|222865307|gb|EEF02438.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 215/339 (63%), Positives = 256/339 (75%), Gaps = 39/339 (11%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
LEP LKDIL + PLREDW R KVI +L +VV+SVESLRG+TVEPFGSFVSNLF+RWGD
Sbjct: 1 LEPTLKDILNGIQPLREDWVVRFKVIEELEDVVKSVESLRGSTVEPFGSFVSNLFTRWGD 60
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LDISI LSNGS ISSAGK+ KQ+LL D+L+ALRQ+GG++RLQF+ +ARVPILKFE + +
Sbjct: 61 LDISIVLSNGSYISSAGKRRKQNLLEDVLKALRQRGGWQRLQFIPNARVPILKFE--NAS 118
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
ISCD+SIDN+ G +KSKFLFWI++ID RFRDMVLLVKEWAK H+INNPKTG+ NSYSLSL
Sbjct: 119 ISCDVSIDNMQGLMKSKFLFWINEIDRRFRDMVLLVKEWAKTHNINNPKTGSLNSYSLSL 178
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
LV+FHFQTCVPAILPPLK+IYP N++DDL GVR +AER+I EICA NI+R+ S+K R IN
Sbjct: 179 LVIFHFQTCVPAILPPLKEIYPRNVIDDLTGVRTDAERRIGEICAANISRYRSNKSRAIN 238
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
R+SL+ LF+SFL K IEDPFEQPE
Sbjct: 239 RNSLSELFISFLTK-------------------------------------IEDPFEQPE 261
Query: 307 NSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSL 345
N+ARAVS NL KIS A + TH RL + NQ + + L L
Sbjct: 262 NTARAVSAANLMKISEAIQTTHHRLVTANQNQISFLGML 300
>gi|2642156|gb|AAB87123.1| hypothetical protein [Arabidopsis thaliana]
Length = 474
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 214/424 (50%), Positives = 275/424 (64%), Gaps = 58/424 (13%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L+P L++IL ++ P R D +TR+ VI LR+V++SVE LRGATV+PFGSFVSNLF+RWGD
Sbjct: 7 LDPTLQEILQVIKPTRADRDTRITVIDQLRDVLQSVECLRGATVQPFGSFVSNLFTRWGD 66
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LDIS++L +GS I GKK KQ+LLG LLRALR G + +LQFV HARVPILK + HQ
Sbjct: 67 LDISVDLFSGSSILFTGKKQKQTLLGHLLRALRASGLWYKLQFVIHARVPILKVVSGHQR 126
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
ISCDISIDNL G +KS+FLFWIS+IDGRFRD+VLLVKEWAKAH+IN+ KTGTFNSYSLSL
Sbjct: 127 ISCDISIDNLDGLLKSRFLFWISEIDGRFRDLVLLVKEWAKAHNINDSKTGTFNSYSLSL 186
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
LV+FHFQTCVPAILPPL+ IYP + VDDL GVR AE IA++ A NIARF S++ + +N
Sbjct: 187 LVIFHFQTCVPAILPPLRVIYPKSAVDDLTGVRKTAEESIAQVTAANIARFKSERAKSVN 246
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
RSSL+ L VSF K +EDPFEQP
Sbjct: 247 RSSLSELLVSFFAK-------------------------------------VEDPFEQPV 269
Query: 307 NSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGES---PVRY-- 361
N+AR+VS +NL +I+ F++T RL S R +++ L I + + P ++
Sbjct: 270 NAARSVSRRNLDRIAQVFQITSRRLVSEC-NRNSIIGILTGQHIQESLYRTISLPSQHHA 328
Query: 362 ---ANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKK--------ENRPNRSMSQQSVQQH 410
N N H +ARPQ+ + Q QS+N ++RP ++ +Q + +
Sbjct: 329 NGMHNVRNLHGQARPQNQQM----QQNWSQSYNTPNPPHWPPLTQSRPQQNWTQNNPRNL 384
Query: 411 QSQP 414
Q QP
Sbjct: 385 QGQP 388
>gi|115441021|ref|NP_001044790.1| Os01g0846500 [Oryza sativa Japonica Group]
gi|56784029|dbj|BAD82657.1| unknown protein [Oryza sativa Japonica Group]
gi|56784702|dbj|BAD81828.1| unknown protein [Oryza sativa Japonica Group]
gi|113534321|dbj|BAF06704.1| Os01g0846500 [Oryza sativa Japonica Group]
gi|222619532|gb|EEE55664.1| hypothetical protein OsJ_04062 [Oryza sativa Japonica Group]
Length = 381
Score = 323 bits (829), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 168/366 (45%), Positives = 236/366 (64%), Gaps = 7/366 (1%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+Y+VLE +DIL ++ P+ D R+ I +L + + S +LRGA+V+PFGSFVS L++
Sbjct: 8 NYDVLEKCTEDILSLIKPVEGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYA 67
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
+ GDLD+S+EL N + + +K KQ L ++ RAL+++G R ++F+ +ARVP+L++ +
Sbjct: 68 KSGDLDVSVELFNALNLPISKRK-KQDTLREVRRALQKRGIARHMEFIPNARVPVLQYVS 126
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
ISCDISI N G+IKSK +WI+ +D RF DMVLLVKEWAKA +IN+PK GT NSY
Sbjct: 127 NQYGISCDISISNYPGRIKSKIFYWINTLDDRFGDMVLLVKEWAKAQNINDPKNGTLNSY 186
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
SL LLVLFHFQTC PAILPPLK+IY GN+++D+ G E+ + E+C+ NI RF
Sbjct: 187 SLCLLVLFHFQTCEPAILPPLKEIYEGNIMEDISGRAYYNEKHLDEVCSINIERFRRQNM 246
Query: 243 RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
+ N+SSL+HL SF KF + + ++ I +TG+ E I+ N RW+ ++ LF+EDPF
Sbjct: 247 GQRNQSSLSHLLASFFHKFFRIDALSDKV-ISTYTGRLERIQDNPRWMDKSYSLFVEDPF 305
Query: 303 EQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT---RYALLSSLARPFILQFFG--ES 357
E+P+N+ARAV I NAF + S R LLS L P + G S
Sbjct: 306 EKPDNAARAVGSFEFQDIVNAFSNASNKFVSDAHALTDRNGLLSLLCTPDVGSKLGGRAS 365
Query: 358 PVRYAN 363
RY N
Sbjct: 366 ASRYTN 371
>gi|357131279|ref|XP_003567266.1| PREDICTED: poly(A) RNA polymerase protein cid1-like [Brachypodium
distachyon]
Length = 595
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 162/352 (46%), Positives = 231/352 (65%), Gaps = 3/352 (0%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
+ LE +K+IL + P D R+ I +L ++SV +L+GA +PFGSF+SNL+S+
Sbjct: 6 DALEKCIKEILSQIKPAEVDRNKRLSAIKELDISIQSVAALKGAAAKPFGSFLSNLYSKS 65
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDLD+S++L N S + + KK KQS+L L +AL++ G ++F+ HARVP+L++ +
Sbjct: 66 GDLDLSVQLMNSSNLPVSKKK-KQSILRVLRKALQRNGVAGYMEFIPHARVPVLQYVSNS 124
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
ISCD+SIDN G+IKS+ +WIS +D RF DMVLL+KEWAK +IN+PKTGT NSYSL
Sbjct: 125 FGISCDLSIDNYPGRIKSRIFYWISTLDERFGDMVLLIKEWAKCQNINDPKTGTLNSYSL 184
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
LLVLFHFQTC PAILPPLKDIY GN+ +D + E + +CA NIA+F S +
Sbjct: 185 CLLVLFHFQTCEPAILPPLKDIYEGNITEDFTDMTLYDEEHLDMVCAANIAKFESQNKEQ 244
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
N SSL L +F +KF ++ +++ I +TGQ E I+ N W+ ++ LFIEDP E+
Sbjct: 245 RNESSLCQLLATFFDKFCHINAITNDV-ISTYTGQLEKIQDNPNWMKKSYSLFIEDPVER 303
Query: 305 PENSARAVSEKNLAKISNAFEMTHFRLTSTNQT-RYALLSSLARPFILQFFG 355
P+N+ARAV + L +I++AF T+ + S +T + LL L P + G
Sbjct: 304 PDNAARAVGVRGLLQIASAFNDTNRKFVSLERTEKNDLLGMLCTPDVCSKLG 355
>gi|218189366|gb|EEC71793.1| hypothetical protein OsI_04418 [Oryza sativa Indica Group]
Length = 381
Score = 320 bits (821), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 167/366 (45%), Positives = 235/366 (64%), Gaps = 7/366 (1%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+Y+VLE +DIL ++ P+ D R+ I +L + + S +LRGA+V+PFGSFVS L++
Sbjct: 8 NYDVLEKCTEDILSLIKPVEGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYA 67
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
+ GDLD+S+EL N + + +K KQ L ++ RAL+++G R ++F+ +ARVP+L++ +
Sbjct: 68 KSGDLDVSVELFNALNLPISKRK-KQDTLREVRRALQKRGIARHMEFIPNARVPVLQYVS 126
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
ISCDISI N G+IKSK +WI+ +D RF DMVLLVKEWAKA +IN+PK GT NSY
Sbjct: 127 NQYGISCDISISNYPGRIKSKIFYWINTLDDRFGDMVLLVKEWAKAQNINDPKNGTLNSY 186
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
SL LLVL HFQTC PAILPPLK+IY GN+++D+ G E+ + E+C+ NI RF
Sbjct: 187 SLCLLVLCHFQTCEPAILPPLKEIYEGNIMEDISGRAYYNEKHLDEVCSINIERFRRQNM 246
Query: 243 RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
+ N+SSL+HL SF KF + + ++ I +TG+ E I+ N RW+ ++ LF+EDPF
Sbjct: 247 GQRNQSSLSHLLASFFHKFFRIDALSDKV-ISTYTGRLERIQDNPRWMDKSYSLFVEDPF 305
Query: 303 EQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT---RYALLSSLARPFILQFFG--ES 357
E+P+N+ARAV I NAF + S R LLS L P + G S
Sbjct: 306 EKPDNAARAVGSFEFQDIVNAFSNASNKFVSDAHALTDRNGLLSLLCTPDVGSKLGGRAS 365
Query: 358 PVRYAN 363
RY N
Sbjct: 366 ASRYTN 371
>gi|293332275|ref|NP_001169645.1| hypothetical protein [Zea mays]
gi|224030617|gb|ACN34384.1| unknown [Zea mays]
gi|414879730|tpg|DAA56861.1| TPA: hypothetical protein ZEAMMB73_892019 [Zea mays]
Length = 574
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 166/377 (44%), Positives = 233/377 (61%), Gaps = 11/377 (2%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
VL +KDIL ++ P+ +D R+ I +L + S+ SL GA V+PFGSFVS+L+S+ G
Sbjct: 10 VLNKCIKDILALIKPVEDDRSKRLSTIQELENCIHSLASLSGAAVKPFGSFVSDLYSKSG 69
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETI 123
DLD+S++ NGS KK KQ+ L D+ +AL +G GY ++QF+ HARVP+L++ +
Sbjct: 70 DLDLSVQFGNGSN-HPINKKKKQNALRDVRKALLSRGVTGYMQMQFIPHARVPVLQYVSK 128
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
ISCDISI N G+IKSK +W++ +D RF DMVLL+KEWAKA +IN+PK+GT NSYS
Sbjct: 129 QFGISCDISIGNFAGRIKSKIFYWVNTVDERFGDMVLLIKEWAKAQNINDPKSGTLNSYS 188
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR 243
L LLVL+HFQT P ILPPL +IY GN+ D+ E+ + E+CA NI RF
Sbjct: 189 LCLLVLYHFQTSEPPILPPLNEIYEGNIAGDVTEAALFDEQHLDEVCAANIERFRLQNKG 248
Query: 244 KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFE 303
+ N +S L +F +KF+ ++ + I ++GQ E I++N W+ ++ LF+EDP E
Sbjct: 249 RRNETSTCRLLGTFFQKFAHINALTDNV-ISTYSGQIERIQNNPSWMRKSYHLFVEDPVE 307
Query: 304 QPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT-RYALLSSLARPFILQFFGESPVRYA 362
+P+N+ARAVS K L I+ AF + S R LL+ L P + G VR
Sbjct: 308 RPDNAARAVSMKGLDLIAIAFNDACHKFKSLEHIDRNELLALLCTPVVRLKLGVR-VREN 366
Query: 363 NY-----NNGHRRARPQ 374
+Y NN H R R Q
Sbjct: 367 SYPKSPRNNVHSRIRGQ 383
>gi|218189365|gb|EEC71792.1| hypothetical protein OsI_04417 [Oryza sativa Indica Group]
Length = 557
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 118/227 (51%), Positives = 160/227 (70%), Gaps = 3/227 (1%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+Y+V+E +K+IL ++ P+ +D R+ I +L + V +LRGA +PFGSFVSNL+S
Sbjct: 6 NYDVVEQCVKNILSLIKPVEDDRRKRLSAIQELSNSIPKVAALRGAVFKPFGSFVSNLYS 65
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
GDLDIS++L N S IS KK KQ +L +L+R L+ +G +QF+ ARVP+L++ +
Sbjct: 66 NSGDLDISVQLPNNSIIS---KKKKQYVLRELMRVLQNRGVAGYVQFIPFARVPVLQYVS 122
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
ISCDIS++N G+IKSK WIS +D RF DMVLL+KEWAKA +IN+PKTGT NSY
Sbjct: 123 NTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGDMVLLIKEWAKAQNINDPKTGTLNSY 182
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
SL LLVLFHFQTC PAILPPLK+IY GN+ + + + E + E+
Sbjct: 183 SLCLLVLFHFQTCEPAILPPLKEIYEGNIEEGIAEMTVYDEEHLDEV 229
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT-RYALLSSLARPFILQFFG 355
+EDP E+P+N+ARAV K L +I+ AF + + S R LL L P + G
Sbjct: 229 VEDPIERPDNAARAVGLKGLERIAGAFTAANRKFASLQHAKRNDLLEMLCTPAVGSKLG 287
>gi|222619531|gb|EEE55663.1| hypothetical protein OsJ_04061 [Oryza sativa Japonica Group]
Length = 461
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 119/227 (52%), Positives = 159/227 (70%), Gaps = 3/227 (1%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+Y+V+E +K+IL ++ P+ +D R+ I +L + V +LRGA +PFGSFVSNL+S
Sbjct: 29 NYDVVEQCVKNILSLIKPVEDDRRKRLSAIQELSNSIPKVAALRGAVFKPFGSFVSNLYS 88
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
GDLDIS+ L N S IS KK KQ +L +L+R L+ +G +QFV ARVP+L++ +
Sbjct: 89 NSGDLDISVHLPNNSIIS---KKKKQYVLRELMRVLQNRGVAGYVQFVPFARVPVLQYVS 145
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
ISCDIS++N G+IKSK WIS +D RF DMVLL+KEWAKA +IN+PKTGT NSY
Sbjct: 146 NTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGDMVLLIKEWAKAQNINDPKTGTLNSY 205
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
SL LLVLFHFQTC PAILPPLK+IY GN+ + + + E + E+
Sbjct: 206 SLCLLVLFHFQTCEPAILPPLKEIYEGNIEEGIAEMTVYDEEHLDEV 252
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT-RYALLSSLARPFILQFFG 355
++DP E+P+N+ARAV K L +I+ AF + + S R LL L P + G
Sbjct: 252 VKDPIERPDNAARAVDLKGLERIAGAFTAANRKFASLQHAKRNDLLEMLCTPAVGSKLG 310
>gi|242054959|ref|XP_002456625.1| hypothetical protein SORBIDRAFT_03g039650 [Sorghum bicolor]
gi|241928600|gb|EES01745.1| hypothetical protein SORBIDRAFT_03g039650 [Sorghum bicolor]
Length = 257
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 121/244 (49%), Positives = 172/244 (70%), Gaps = 9/244 (3%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+Y++L+ ++DIL +NP+ +D R+ I +L + + SV LRGA V+PFGSF+SNL++
Sbjct: 7 NYDLLKACIEDILSTINPVEDDKRKRLSAIQELADSIYSVGPLRGAAVKPFGSFLSNLYA 66
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
+ GDLD+S++L NGS + + KK KQ+ L +L++AL+ +G R ++F+ ARVPILK+ +
Sbjct: 67 KSGDLDVSVDLRNGSRLPISKKK-KQNALRELMKALQMRGVARCMEFIPTARVPILKYMS 125
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
H ISCD+S++N GQIKS+ L+WI ID RF DMVLLVKEWAKA +IN+PK GT NSY
Sbjct: 126 NHFGISCDVSVNNYPGQIKSRILYWIGTIDERFGDMVLLVKEWAKARNINDPKNGTLNSY 185
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
SL LLV+FHFQTC PAILPPLKD++ +D++ + +C FS KY
Sbjct: 186 SLCLLVIFHFQTCEPAILPPLKDVFDVKAAEDMQNWCFMMRTNVDALC------FS--KY 237
Query: 243 RKIN 246
RK++
Sbjct: 238 RKVS 241
>gi|215707095|dbj|BAG93555.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 113/213 (53%), Positives = 158/213 (74%), Gaps = 1/213 (0%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+Y+VLE +DIL ++ P+ D R+ I +L + + S +LRGA+V+PFGSFVS L++
Sbjct: 8 NYDVLEKCTEDILSLIKPVEGDRNKRIYAIQELADTIYSAGALRGASVKPFGSFVSQLYA 67
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
+ GDLD+S+EL N + + +K KQ L ++ RAL+++G R ++F+ +ARVP+L++ +
Sbjct: 68 KSGDLDVSVELFNALNLPISKRK-KQDTLREVRRALQKRGIARHMEFIPNARVPVLQYVS 126
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
ISCDISI N G+IKSK +WI+ +D RF DMVLLVKEWAKA +IN+PK GT NSY
Sbjct: 127 NQYGISCDISISNYPGRIKSKIFYWINTLDDRFGDMVLLVKEWAKAQNINDPKNGTLNSY 186
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL 215
SL LLVLFHFQTC PAILPPLK+IY GN+++D+
Sbjct: 187 SLCLLVLFHFQTCEPAILPPLKEIYEGNIMEDI 219
>gi|56784701|dbj|BAD81827.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 408
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 172/310 (55%), Gaps = 61/310 (19%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRR 106
GA +PFGSFVSNL+S GDLDIS+ L N S IS KK KQ +L +L+R L+ +G
Sbjct: 8 GAVFKPFGSFVSNLYSNSGDLDISVHLPNNSIIS---KKKKQYVLRELMRVLQNRGVAGY 64
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+QFV ARVP+L++ + ISCDIS++N G+IKSK WIS +D RF DMVLL+KEWA
Sbjct: 65 VQFVPFARVPVLQYVSNTFGISCDISVNNYPGRIKSKIFCWISSLDVRFGDMVLLIKEWA 124
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQI 226
KA +IN+PKTGT NSYSL LLVLFHFQTC PAILPPLK+IY GN+ E I
Sbjct: 125 KAQNINDPKTGTLNSYSLCLLVLFHFQTCEPAILPPLKEIYEGNI-----------EEGI 173
Query: 227 AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN 286
A A+L+++ L + +++ E
Sbjct: 174 A-----------------------AYLYLNSLHLHTEMTVYDEE---------------- 194
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT-RYALLSSL 345
H ++DP E+P+N+ARAV K L +I+ AF + + S R LL L
Sbjct: 195 -------HLDEVKDPIERPDNAARAVDLKGLERIAGAFTAANRKFASLQHAKRNDLLEML 247
Query: 346 ARPFILQFFG 355
P + G
Sbjct: 248 CTPAVGSKLG 257
>gi|302786866|ref|XP_002975204.1| hypothetical protein SELMODRAFT_442740 [Selaginella moellendorffii]
gi|300157363|gb|EFJ23989.1| hypothetical protein SELMODRAFT_442740 [Selaginella moellendorffii]
Length = 373
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 144/385 (37%), Positives = 204/385 (52%), Gaps = 34/385 (8%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P ++D+E R+ ++ L ++ ++ +G ++PFGSF+SNL++ WGDLDI++ +
Sbjct: 7 LQPTQQDFEARVDILRRLEFLIREIDVCKGLAIKPFGSFLSNLYTPWGDLDITLMPLEPA 66
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
+S + KK K +L + AL Q GG R+Q + RVP+L FE ISCDIS+ N
Sbjct: 67 PLSRS-KKTK--ILKSIHDALLQAGGAIRVQVLFRPRVPLLMFEDAWWRISCDISVSNTD 123
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
KS L I +D R R ++ LVK WAKA IN+PK GT NSY+LSLLV+FH QT P
Sbjct: 124 AVFKSHALGLIVGMDLRCRQLIFLVKCWAKAQCINDPKMGTLNSYALSLLVIFHLQTRNP 183
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSF 257
ILPPL I G A+ + IA F+ + K N SS+A LFVSF
Sbjct: 184 PILPPLSAIIGQG------GASADGFHYLNR-----IAEFTERGFGKGNTSSVAELFVSF 232
Query: 258 LEKFSGL-SLKASELGICPFTGQW-EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+FS + L L +C F GQW + ++ W N+ L +EDPF+ EN +R+V +
Sbjct: 233 FGQFSAVEELWIQGLAVCTFRGQWGDKTTTDPAWASKNYALLVEDPFDLSENCSRSVHQG 292
Query: 316 NLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYANYNNGHRRARPQS 375
+L + AF +TH L ++ Y L L E+ R+ H RP
Sbjct: 293 SLQHVCKAFRLTHELL--CDKFSYWFLGKLK---------ETLFRWP-----HASPRPCF 336
Query: 376 HKSVNSPLQA-QHQSHNAKKEN-RP 398
H S Q +H S NA ++N RP
Sbjct: 337 HPSRPGIEQLREHVSLNASRKNDRP 361
>gi|168036791|ref|XP_001770889.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677753|gb|EDQ64219.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1171
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/358 (35%), Positives = 191/358 (53%), Gaps = 35/358 (9%)
Query: 2 GSYNVLEPI---LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVS 58
G ++EP+ L DI L P +D+ R VI L ++V S++S +G V PFGSF S
Sbjct: 407 GDSKIMEPLERMLDDIYNSLQPTEDDYRRRQLVIERLNDLVRSLDSCQGVEVVPFGSFES 466
Query: 59 NLFSRWGDLDISIELSNGSCISSA-GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPI 117
N ++ GDLD+S+E +S K K +L + RAL + G RR+Q +AHARVP+
Sbjct: 467 NFYTACGDLDLSLEFPVDQDVSPTFTKSKKVKVLKSVERALGRSGVARRIQLIAHARVPL 526
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL---------------- 161
L F ISCDIS+DN KS+ L WI+ +D R R ++ +
Sbjct: 527 LMFVDSELKISCDISVDNGSALFKSRVLRWITDMDPRCRKLIFMYSLQLPSLSQPNFLSK 586
Query: 162 ------------VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+K WAKA IN+PK GT NSY+LSLLV+FH QT P ILPP K +
Sbjct: 587 RLISLSMLLAVQIKCWAKAQCINDPKLGTLNSYALSLLVVFHLQTRSPPILPPFKTLLGE 646
Query: 210 NLVDDLKG-VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGL-SLK 267
+ + G + +A+ Q + C I S+ + + N+ S+ LF+SF +F+ + SL
Sbjct: 647 HTSMPVAGKLNKDAQLQQMQECYGRIQALVSEGFGQDNKCSIGQLFLSFFGQFASVKSLW 706
Query: 268 ASELGICPFTGQW-EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ L + PF G+W + +N W + + +EDPF++ +N AR++ + L I N+F
Sbjct: 707 VNGLAVSPFWGEWGDSTTTNPAWNRKQYAMRVEDPFDRMDNCARSIQDAGLPIICNSF 764
>gi|302791355|ref|XP_002977444.1| hypothetical protein SELMODRAFT_417492 [Selaginella moellendorffii]
gi|300154814|gb|EFJ21448.1| hypothetical protein SELMODRAFT_417492 [Selaginella moellendorffii]
Length = 479
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 139/414 (33%), Positives = 206/414 (49%), Gaps = 51/414 (12%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+ E +L L P ++D+E R+ ++ L ++ ++ +G ++PFGSF+SNL++ WG
Sbjct: 30 LFEGLLMATANQLQPTQQDFEARVDILRRLEYLIREIDVCKGLAIKPFGSFLSNLYTPWG 89
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI++ + S S KK K +L + AL Q GG R+Q + RVP+L FE
Sbjct: 90 DLDITL-MPLESAPLSRSKKTK--ILKSIHDALLQAGGAIRVQVLFRPRVPLLMFEDAWW 146
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
ISCDIS+ N KS L I +D R R ++ LVK WAKA IN+PK GT NSY+LS
Sbjct: 147 RISCDISVSNTDAVFKSHALGLIVGMDLRCRQLIFLVKCWAKAQCINDPKMGTLNSYALS 206
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI 245
LLV+FH Q + A D + + IA F+ + K
Sbjct: 207 LLVIFHLQILLAA--------------DGFQYLS-------------RIAEFTERGFGKG 239
Query: 246 NRSSLAHLFVSFLEKFSGL-SLKASELGICPFTGQW-EHIRSNTRWLPNNHPLFIEDPFE 303
N SS+A LFVSF +FS + L L +C F GQW + ++ W N+ + +EDPF+
Sbjct: 240 NTSSVAELFVSFFGQFSAVEELWIQGLAVCTFRGQWGDKTTTDPSWASKNYAMLVEDPFD 299
Query: 304 QPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYAN 363
EN AR+V + +L + AF ++H L F F G+ ++
Sbjct: 300 LSENCARSVHQGSLQHVCKAFRLSH--------------ELLCDKFSYWFLGK--LKETL 343
Query: 364 YNNGHRRARPQSHKSVNSPLQA-QHQSHNAKKEN-RPNR-SMSQQSVQQHQSQP 414
+ H RP H S Q +H S NA ++N RP + ++ V+Q +P
Sbjct: 344 FRWPHASPRPCFHPSRPGIEQLREHASLNASRKNDRPEKQKKGKKQVRQRVPRP 397
>gi|302764122|ref|XP_002965482.1| hypothetical protein SELMODRAFT_439274 [Selaginella moellendorffii]
gi|300166296|gb|EFJ32902.1| hypothetical protein SELMODRAFT_439274 [Selaginella moellendorffii]
Length = 417
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 174/321 (54%), Gaps = 29/321 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATV-EPFGSFVSNLFSRWGDLDI 69
++++LG L P +ED + R +++ V+ ++L G++V PFGS+V+N F+ DLD+
Sbjct: 49 IEEVLGDLEPSQEDRDARAAIVASFDSFVK--QTLSGSSVVAPFGSYVTNTFTCDSDLDL 106
Query: 70 SIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
S+ ++ + +S K + L+A+ + Y ++Q + +A VP++KF I C
Sbjct: 107 SLYVNRMNPLSREEKLYFLKRVTTSLQAMHAR--YDQIQPIYNATVPVVKFVDRKTGIQC 164
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
D+S+DN G KS L +S ID RFR + LL+K+WAK+H+IN+ GT +SY ++LL +
Sbjct: 165 DLSVDNKDGASKSLVLAALSSIDKRFRPLCLLLKKWAKSHEINDASAGTLSSYVITLLAI 224
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA-FNIAR---FSSDKYRKI 245
FH QTC P +LPPL I G +D A C+ F AR F R +
Sbjct: 225 FHLQTCSPPVLPPLSMIIGGLDLD-------------ASYCSGFISARAKAFQGFGARNM 271
Query: 246 NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLP-NNHPLFIEDPFEQ 304
+R ++ LF +F K + + E G+C T H + ++W +N + +ED ++
Sbjct: 272 DR--ISELFRTFFVKITAVKALWQE-GLCAST---YHAQWISKWPSFHNGCICVEDFYDP 325
Query: 305 PENSARAVSEKNLAKISNAFE 325
N+A+AV+ K+ I E
Sbjct: 326 SRNAAKAVTPKDFECIYQCLE 346
>gi|302823109|ref|XP_002993209.1| hypothetical protein SELMODRAFT_431345 [Selaginella moellendorffii]
gi|300138979|gb|EFJ05729.1| hypothetical protein SELMODRAFT_431345 [Selaginella moellendorffii]
Length = 420
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/321 (33%), Positives = 173/321 (53%), Gaps = 28/321 (8%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATV-EPFGSFVSNLFSRWGDLDI 69
+++ILG L P +ED + R +++ V+ ++L G++V PFGS+V+N F+ DLD+
Sbjct: 49 IEEILGDLEPSQEDRDARAAIVASFDSFVK--QTLSGSSVVAPFGSYVTNTFTCDSDLDL 106
Query: 70 SIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
S+ ++ + +S K + L+A+ + Y ++Q + A VP++KF I C
Sbjct: 107 SLYVNRMNPLSREEKLYFLKRVTTSLQAMHAR--YDQIQPIYKATVPVVKFVDRKTGIQC 164
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
D+S+DN G KS L +S ID RFR + LL+K+WAK+H+IN+ GT +SY ++LL +
Sbjct: 165 DLSVDNKDGASKSLVLAALSSIDKRFRPLCLLLKKWAKSHEINDASAGTLSSYVITLLAI 224
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA-FNIAR---FSSDKYRKI 245
FH QTC P +LPPL I G + D A C+ F AR F R +
Sbjct: 225 FHLQTCSPPVLPPLSMIIGGLDLRD------------ASYCSGFISARAKAFQGFGARNM 272
Query: 246 NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLP-NNHPLFIEDPFEQ 304
+R ++ LF +F K + + E G+C T H + ++W +N + +ED ++
Sbjct: 273 DR--ISELFRTFFVKITAVKALWQE-GLCAST---YHAQWISKWPSFHNGCICVEDFYDP 326
Query: 305 PENSARAVSEKNLAKISNAFE 325
N+A+AV+ K+ I E
Sbjct: 327 SRNAAKAVTPKDFEYIYQCLE 347
>gi|296082631|emb|CBI21636.3| unnamed protein product [Vitis vinifera]
Length = 555
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 160/321 (49%), Gaps = 28/321 (8%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRG--ATVEPFGSFVSNLFSR 63
V E +L ++L + +P D+ R+ +I + + + VE FGSF+ ++FS
Sbjct: 32 VFEELLHNVLHIRHPKPIDYFNRIDLIRIFNVISKEIHGNGDDFTIVEGFGSFLMDMFSA 91
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
DLD+SI N S K++ Q+L L+AL++ G + + ARVPILK
Sbjct: 92 GSDLDLSINFGNYEVEVSRAKRI-QTLRKFEKKLKALQRIGHVSNVILITGARVPILKIT 150
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181
I CDIS++N G KS+ + +S ID RF+ + L+K WAKAHDIN+ K T NS
Sbjct: 151 DRGTGIECDISVENRDGIAKSRIIRMVSSIDHRFQKLSFLMKAWAKAHDINSSKEHTLNS 210
Query: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDK 241
S+ LLV FH QT P ILPP I D++ V N +
Sbjct: 211 LSIILLVAFHLQTRDPPILPPFSVILKDG--SDMETVTKNVINFLG-------------- 254
Query: 242 YRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIRSNTRWLPNNHPLFIE 299
Y ++N+ SLA LFV+ L K + S+ G+C + G W + W + +E
Sbjct: 255 YGEVNKESLAELFVTLLLKLQSIETLWSK-GLCASIYDGSWIY----KTWDSGVGCINVE 309
Query: 300 DPFEQPENSARAVSEKNLAKI 320
D ++ +N ARAV+ K + KI
Sbjct: 310 DFTDRSQNVARAVATKQVTKI 330
>gi|384249905|gb|EIE23385.1| hypothetical protein COCSUDRAFT_41642 [Coccomyxa subellipsoidea
C-169]
Length = 758
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 102/324 (31%), Positives = 161/324 (49%), Gaps = 29/324 (8%)
Query: 19 NPLREDWETRMKVISDLREVVESVESLRGAT---VEPFGSFVSNLFSRWGDLDISIEL-- 73
P +D R +++ + +V L G T VEP+GSFVS L++ GDLDISIE
Sbjct: 38 TPGPQDAARRRQILEKMGGIVGL--GLDGHTELRVEPYGSFVSGLYAPTGDLDISIEGFC 95
Query: 74 ---SNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRRLQFVAHARVPILKFETIHQNI 127
G + GK K +LL L + L R GY +Q + HARVPILK I
Sbjct: 96 GKEGRGRDVRDMGKSAKAALLRALSKKLERSRLHRGY--IQRILHARVPILKLVWAESGI 153
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
CD+S+ + + K++ + + ++DGRF M+ ++K W+ AH +N+ GTFN+++LSL+
Sbjct: 154 PCDVSVGSSNSRFKAEVVKALVRLDGRFEQMLRVIKVWSGAHGLNDASNGTFNTFALSLM 213
Query: 188 VLFHFQTCVPAILPPLKDIY-PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
VLFH Q PA+LPPL +++ + + + E + AF A ++ R N
Sbjct: 214 VLFHLQLRRPAVLPPLHELFRDAHDATFTRPLHLGQEISPGMLEAFQAAAEATP--RSGN 271
Query: 247 RSSLAHLFVSFLEKFSGL------SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
S+A L SF +F+ + + ++ + G W + R + +ED
Sbjct: 272 DESVAELLASFFARFAAATTRWQRAPECCDVRASTWCGAWSY-----RPWAKAYMAAVED 326
Query: 301 PFEQPENSARAVSEKNLAKISNAF 324
PF+ +N AR + L I+ F
Sbjct: 327 PFDSSDNCARTIGRDRLPYITRCF 350
>gi|7019641|emb|CAB75788.1| putative protein [Arabidopsis thaliana]
Length = 690
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 162/333 (48%), Gaps = 30/333 (9%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVV-----ESVESLRGATVEPFGSFVSNLF 61
L+ +L D+ P+ D+ TR +++ +L + +S ES +E +GSFV +++
Sbjct: 43 LDKVLNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEES--SPVLEAYGSFVMDMY 100
Query: 62 SRWGDLDISIELSNGSCISSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
S DLD+SI NG+ KK++ LR+L+ +G + ++ + A+VPI+KF
Sbjct: 101 SSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQVKNVESIFSAKVPIVKF 160
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ CD+S++N G + S+ + ISQIDGRF+ + LLVK WAKAH++N+ T N
Sbjct: 161 SDQGTGVECDLSVENKDGILNSQIVRIISQIDGRFQKLCLLVKHWAKAHEVNSALHRTLN 220
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
S S++LLV H QT P ILPP + + D V A++ +
Sbjct: 221 SVSITLLVALHLQTQNPPILPPFSMLLKDGM--DPPNVEKRAQKFL-------------- 264
Query: 241 KYRKINRSSLAHLFVSFLEKFSGL------SLKASELGICPFTGQWEHIRSNTRWLPNNH 294
+ + N+ SL LF +F K + L S L + +W+ + + +
Sbjct: 265 NWGQRNQESLGRLFATFFIKLQSVEFLWRQGLCVSVLNGLWISKKWKKVGVGSISVSYKK 324
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
+ED +N AR V+ KI ++ T
Sbjct: 325 LYSVEDFTNISQNVARRVNGAGAKKIYSSINRT 357
>gi|297819094|ref|XP_002877430.1| hypothetical protein ARALYDRAFT_484955 [Arabidopsis lyrata subsp.
lyrata]
gi|297323268|gb|EFH53689.1| hypothetical protein ARALYDRAFT_484955 [Arabidopsis lyrata subsp.
lyrata]
Length = 680
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 94/331 (28%), Positives = 161/331 (48%), Gaps = 34/331 (10%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESV---ESLRGATVEPFGSFVSNLFSR 63
L+ +L D+ P+ D++TR +++ +L + + +E +GSFV +++S
Sbjct: 43 LDKVLNDVYCSFRPVSADYDTRKELVKNLNAMAIDIYGNSEESSPVLEAYGSFVMDMYSS 102
Query: 64 WGDLDISIELSNGSCISSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
DLD+SI NG+ KK++ LR+L+ +G + ++ + A+VPI+KF
Sbjct: 103 QSDLDVSINFGNGTPELPREKKLEILKRFAKKLRSLQGEGHVKNVESIFSAKVPIVKFSD 162
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
+ CD+S++N G + S+ + ISQIDGRF+ + +LVK WAKAH++N+ T NS
Sbjct: 163 QGTGVECDLSVENKDGILNSQIVRIISQIDGRFQKLCMLVKHWAKAHEVNSALHRTLNSV 222
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
S++LLV H QT P ILPP ++ + D V A++ + +
Sbjct: 223 SITLLVALHLQTQNPPILPPFSMLFKDGI--DPPNVEKRAQKFL--------------NW 266
Query: 243 RKINRSSLAHLFVSFLEKFSGL------SLKASELGICPFTGQWEHIRSNTRWLPNNHPL 296
+ N+ SL LF +F K + L S L + +W+ + + +
Sbjct: 267 GQRNQESLGRLFATFFIKLQSVEFLWRQGLCVSVLNGLWISKKWKKVGVGS--------I 318
Query: 297 FIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
+ED +N AR V+ KI ++ T
Sbjct: 319 SVEDFTNVSQNVARRVNGAGAKKIYSSINRT 349
>gi|42565594|ref|NP_190161.2| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|30793987|gb|AAP40443.1| unknown protein [Arabidopsis thaliana]
gi|110739217|dbj|BAF01523.1| hypothetical protein [Arabidopsis thaliana]
gi|332644545|gb|AEE78066.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 682
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 162/333 (48%), Gaps = 38/333 (11%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVV-----ESVESLRGATVEPFGSFVSNLF 61
L+ +L D+ P+ D+ TR +++ +L + +S ES +E +GSFV +++
Sbjct: 43 LDKVLNDVYCSFRPVSADYNTRKELVKNLNTMALDIYGKSEES--SPVLEAYGSFVMDMY 100
Query: 62 SRWGDLDISIELSNGSCISSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
S DLD+SI NG+ KK++ LR+L+ +G + ++ + A+VPI+KF
Sbjct: 101 SSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGEGQVKNVESIFSAKVPIVKF 160
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ CD+S++N G + S+ + ISQIDGRF+ + LLVK WAKAH++N+ T N
Sbjct: 161 SDQGTGVECDLSVENKDGILNSQIVRIISQIDGRFQKLCLLVKHWAKAHEVNSALHRTLN 220
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
S S++LLV H QT P ILPP + + D V A++ +
Sbjct: 221 SVSITLLVALHLQTQNPPILPPFSMLLKDGM--DPPNVEKRAQKFL-------------- 264
Query: 241 KYRKINRSSLAHLFVSFLEKFSGL------SLKASELGICPFTGQWEHIRSNTRWLPNNH 294
+ + N+ SL LF +F K + L S L + +W+ + +
Sbjct: 265 NWGQRNQESLGRLFATFFIKLQSVEFLWRQGLCVSVLNGLWISKKWKKVGVGS------- 317
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
+ +ED +N AR V+ KI ++ T
Sbjct: 318 -ISVEDFTNISQNVARRVNGAGAKKIYSSINRT 349
>gi|145350831|ref|XP_001419800.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580032|gb|ABO98093.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 633
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 151/313 (48%), Gaps = 40/313 (12%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSN------------------GSCISSAGKKV 86
G V PFGS+VS S D+DIS+++ G + ++
Sbjct: 6 FEGVRVAPFGSYVSAFHSAGSDIDISLQIDKNGPWYDEKEEAQARRSQRGGVRARRQQRQ 65
Query: 87 KQSLLGDLLRA----LRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
++ LLR LR + YR +Q ++ ARVP++KF+ ++CD+ I+N G KS
Sbjct: 66 GRTKRAQLLRKVASELRYRN-YRDVQLISKARVPLIKFKDPQTGVACDVCIEN-DGVYKS 123
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L ++ ID R+RD+V L+K WAK +D+NN G+FNSYSL LL + H Q ILPP
Sbjct: 124 AVLGVVADIDQRYRDLVFLIKLWAKHYDVNNAMEGSFNSYSLCLLCMHHLQRRPVPILPP 183
Query: 203 --LKDIYPGNLVDDLKGVRANAERQIAEICAFNI-----ARFSSDKYRKI---NRSSLAH 252
L + +LV+ K R E +E F+ AR SD R I N +LA
Sbjct: 184 TMLLTLPRPDLVESEK--RELEEHLKSEDDQFDTWKVSKARVVSDASRDIAAENTETLAE 241
Query: 253 LFVSFLEKFSGL-SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARA 311
LFVSF + L + + + G + + W +PL +EDPF +N ARA
Sbjct: 242 LFVSFFAHLCAIKDLFRNAVNASTYHGTF---IVGSSWQAFKYPLGVEDPFAAGDNVARA 298
Query: 312 VSEKNLAKISNAF 324
V + + NAF
Sbjct: 299 VQMRTRDYVLNAF 311
>gi|242081815|ref|XP_002445676.1| hypothetical protein SORBIDRAFT_07g024040 [Sorghum bicolor]
gi|241942026|gb|EES15171.1| hypothetical protein SORBIDRAFT_07g024040 [Sorghum bicolor]
Length = 690
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/341 (30%), Positives = 162/341 (47%), Gaps = 27/341 (7%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSR 63
VLE +L+D L P D+E R +I+ ++ E + VEPFGSF+ +LF+
Sbjct: 62 VLEELLQDTYASLQPQPVDYEHRYHMINIFNKIAEGIFGKNNGLPIVEPFGSFIMDLFTP 121
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
DLD+SI + + + ++ L ++L + +++G + + ARVP+LK
Sbjct: 122 KSDLDLSINFNTDTNDQYPRRNKIYAIRKLANVLFSHQRQGLCHGVSPIVTARVPVLKVI 181
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181
+ CDIS++N G +S +IS ID RFR + L+K WAK HD+N PK T +S
Sbjct: 182 DQKTGVECDISVENKDGMSRSVIFKFISSIDKRFRILCYLMKFWAKVHDVNCPKDRTMSS 241
Query: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDK 241
++ LV FH QT P ILP I L D A+ E+ ++ F +
Sbjct: 242 MAIISLVAFHLQTRRPPILPAFSAI----LKDGTD--FASIEKNVSLFEGFGDS------ 289
Query: 242 YRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIRSNTRWLPNNHPLFIE 299
N+ S+ LFVS + K + E G+C F G W + W L +E
Sbjct: 290 ----NKESITELFVSLMNKLVSVE-GLWEQGLCASNFEGSW----ISKTWAKGVGNLNVE 340
Query: 300 DPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYA 340
D ++ +N AR+V K + KI T L+ ++ + A
Sbjct: 341 DFLDRSQNFARSVGMKEMQKICECLRATVSDLSKFSKGKIA 381
>gi|308807933|ref|XP_003081277.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases (ISS) [Ostreococcus tauri]
gi|116059739|emb|CAL55446.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases (ISS) [Ostreococcus tauri]
Length = 761
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 180/389 (46%), Gaps = 68/389 (17%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
VL+ L+ I+ L ++ R +++ + ++ S G V PFGS+VS S
Sbjct: 101 VLDAELRRIVNSLKTSPQEDAKRQTLMNKFKSMIGS--RFEGVRVAPFGSYVSAFHSAGS 158
Query: 66 DLDISIELSN------------------GSCISSAGKKVKQSLLGDLLRA----LRQKGG 103
D+DIS+++ G + ++ ++ LLR LR +
Sbjct: 159 DIDISLQIDKDGPWYDEKEEAQARRSQRGGVRARRQQRQGRTKRAQLLRKVASELRYRN- 217
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
YR +Q ++ ARVP++KF+ H ++CD+ I+N G KS L I+ ID R+RD+V L+K
Sbjct: 218 YRDVQLISKARVPLIKFKDPHTGVACDVCIEN-DGVYKSAVLGVIADIDQRYRDLVFLIK 276
Query: 164 EWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY---------------- 207
WAK +D+NN G+FNSYSL LLV+ H Q +LPP +
Sbjct: 277 LWAKHYDVNNALEGSFNSYSLCLLVMHHLQRRRVPVLPPTMQLTLPRWELVQSEEKELDE 336
Query: 208 ----PGNLVDDLKGVRA----NAERQIAEICAFNIARFSSDK----YRKINRSSLAHLFV 255
+ D K +A +A R IA + ++ +DK + K N +LA LFV
Sbjct: 337 HVSCEDDEFDTWKVSKARVVSDASRDIAAV------KYRADKLFVGFGKHNTETLAELFV 390
Query: 256 SFLEKFSGL-SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
SF + + + L + G++ + W +PL +EDPF +N ARAV
Sbjct: 391 SFFAQLCAVKGFFRNALNASTYHGRF---IVGSSWNAFKYPLGLEDPFAAGDNVARAVQM 447
Query: 315 KNLAKISNAFEMT----HFRLTSTNQTRY 339
+ + AF H L ++++T++
Sbjct: 448 RTRDYVLGAFPAACAELHRILHASDETQF 476
>gi|115477819|ref|NP_001062505.1| Os08g0559900 [Oryza sativa Japonica Group]
gi|45736111|dbj|BAD13142.1| unknown protein [Oryza sativa Japonica Group]
gi|45736157|dbj|BAD13203.1| unknown protein [Oryza sativa Japonica Group]
gi|113624474|dbj|BAF24419.1| Os08g0559900 [Oryza sativa Japonica Group]
Length = 581
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSR 63
VLE +L ++ +L P +D+E R +I ++ E + + VE FGSF +LF+
Sbjct: 61 VLEDLLIELYAILRPKPDDYEQRHLMIDVFNKIAEEIYGKKKGFPVVEAFGSFTMDLFTS 120
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRRLQFVAHARVPILKF 120
DLD+S+ N S +K K S++ +L + L ++ G + V A+VP+LK
Sbjct: 121 KSDLDLSVNF-NADFHSQFARKDKISVIRNLAKVLYAHQRNGRCHGVLPVVTAKVPVLKV 179
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ CDIS++N G +S IS ID RF+ + L+K WAKAHD+N P+ T +
Sbjct: 180 IDKGTGVECDISVENKDGMSRSMIFKLISSIDERFQILCYLMKFWAKAHDVNCPRDRTMS 239
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
S ++ LV FH QT P ILP + D ++ N + F S
Sbjct: 240 SMAIISLVAFHLQTRRPPILPAFSALLKDG--PDFPSIQRNVSL---------VEGFGSR 288
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKAS-ELGICP--FTGQWEHIRSNTRWLPNNHPLF 297
N+ S+A LFVS + K LS++ E G+C F G W ++ R + N L
Sbjct: 289 -----NKESVAELFVSLMSKL--LSVEGLWEQGLCASNFEGSWI-FKTWERGVGN---LS 337
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTS 333
+ED ++ +N ARAV ++ + KIS + L +
Sbjct: 338 VEDFLDRSQNFARAVGKEEMQKISECIRVAVLNLNN 373
>gi|222641019|gb|EEE69151.1| hypothetical protein OsJ_28282 [Oryza sativa Japonica Group]
Length = 609
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 121/433 (27%), Positives = 194/433 (44%), Gaps = 37/433 (8%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSR 63
VLE +L ++ +L P +D+E R +I ++ E + + VE FGSF +LF+
Sbjct: 60 VLEDLLIELYAILRPKPDDYEQRHLMIDVFNKIAEEIYGKKKGFPVVEAFGSFTMDLFTS 119
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRRLQFVAHARVPILKF 120
DLD+S+ N S +K K S++ +L + L ++ G + V A+VP+LK
Sbjct: 120 KSDLDLSVNF-NADFHSQFARKDKISVIRNLAKVLYAHQRNGRCHGVLPVVTAKVPVLKV 178
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ CDIS++N G +S IS ID RF+ + L+K WAKAHD+N P+ T +
Sbjct: 179 IDKGTGVECDISVENKDGMSRSMIFKLISSIDERFQILCYLMKFWAKAHDVNCPRDRTMS 238
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
S ++ LV FH QT P ILP + D ++ N + F S
Sbjct: 239 SMAIISLVAFHLQTRRPPILPAFSALLKDG--PDFPSIQRNVSL---------VEGFGSR 287
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKAS-ELGICP--FTGQWEHIRSNTRWLPNNHPLF 297
N+ S+A LFVS + K LS++ E G+C F G W ++ R + N L
Sbjct: 288 -----NKESVAELFVSLMSKL--LSVEGLWEQGLCASNFEGSW-IFKTWERGVGN---LS 336
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGES 357
+ED ++ +N ARAV ++ + KIS + L + + + + P + E
Sbjct: 337 VEDFLDRSQNFARAVGKEEMQKISECIRVAVLNLNNFFRGK------IDAPKLKNLLFEP 390
Query: 358 PVRYANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKKENRPNRSMSQQSVQQHQSQPVRQ 417
P + +N + + + P Q AK P + QQ +H P
Sbjct: 391 PHQDELISNPSLKRPKRKDHPTHGPESNPQQQKKAKHIIGPESNQKQQKKVKHTVNPGPA 450
Query: 418 INGQVQQIWRPKS 430
+ + RP +
Sbjct: 451 ASRSATNLHRPTA 463
>gi|218201608|gb|EEC84035.1| hypothetical protein OsI_30272 [Oryza sativa Indica Group]
Length = 580
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 165/336 (49%), Gaps = 31/336 (9%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSR 63
VLE +L ++ +L P +D+E R +I ++ E + + VE FGSF +LF+
Sbjct: 60 VLEDLLIELYAILRPKPDDYEQRHLMIDVFNKIAEEIYGKKKGFPVVEAFGSFTMDLFTS 119
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRRLQFVAHARVPILKF 120
DLD+S+ N S +K K S++ +L + L ++ G + V A+VP+LK
Sbjct: 120 KSDLDLSVNF-NADFHSQFARKDKISVIRNLAKVLYAHQRNGRCHGVLPVVTAKVPVLKV 178
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ CDIS++N G +S IS ID RF+ + L+K WAKAHD+N P+ T +
Sbjct: 179 IDKGTGVECDISVENKDGVSRSMIFKLISSIDERFQILCYLMKFWAKAHDVNCPRDRTMS 238
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
S ++ LV FH QT P ILP + D ++ N + F S
Sbjct: 239 SMAIISLVAFHLQTRRPPILPAFSALLKDG--PDFPSIQRNVSL---------VEGFGSR 287
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKAS-ELGICP--FTGQWEHIRSNTRWLPNNHPLF 297
N+ S+A LFVS + K LS++ E G+C F G W ++ R + N L
Sbjct: 288 -----NKESVAELFVSLMSKL--LSVEGLWEQGLCASNFEGSWI-FKTWERGVGN---LS 336
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTS 333
+ED ++ +N ARAV ++ + KIS + L +
Sbjct: 337 VEDFLDRSQNFARAVGKEEMQKISECIRVAVLNLNN 372
>gi|356518820|ref|XP_003528075.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Glycine max]
Length = 342
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 92/269 (34%), Positives = 136/269 (50%), Gaps = 24/269 (8%)
Query: 49 TVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQ 108
VE +GSFV ++F D+D+S+ +N +S K L++++ KG LQ
Sbjct: 87 VVEEYGSFVMDMFDGKSDIDLSLNFNNSIEVSRQKKISALYRFNKKLQSIQSKGHVTGLQ 146
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ ARVPI+K I CD+S+DN G KS + IS ID RFR + L+K WAK
Sbjct: 147 LIFSARVPIIKVTDSGTGIECDLSVDNRDGINKSHIIRAISDIDERFRKLCFLMKSWAKV 206
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
HDIN+PK T +S+S+ V FH QT P ILPP + +G ++ E
Sbjct: 207 HDINSPKDSTLSSFSIVSFVAFHLQTRNPPILPPFSILLK-------EGDNPAYVAKVVE 259
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIRSN 286
FN Y K N+ SLA LF++ L K + + + G C + G W ++S
Sbjct: 260 -TYFN--------YGKQNKESLAMLFITLLVKLASVE-NLWQKGFCASLYEGSW-ILKS- 307
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEK 315
W ++ + +ED ++ +N ARAV +K
Sbjct: 308 --W-KCSYSMSVEDFIDRSQNVARAVRKK 333
>gi|226530311|ref|NP_001142471.1| uncharacterized protein LOC100274680 [Zea mays]
gi|195604758|gb|ACG24209.1| hypothetical protein [Zea mays]
Length = 690
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 191/418 (45%), Gaps = 35/418 (8%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSR 63
L+ +L+DI L P D+E R +++ ++V + ++ VEPFGSF +LF+
Sbjct: 62 ALDDLLQDIYESLQPQPVDYEHRNLMVNVFNKIVGEIFGKNNELPIVEPFGSFTMDLFTP 121
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRRLQFVAHARVPILKF 120
DLD+S+ N +K K S + L L ++ G + + ARVP+LK
Sbjct: 122 QSDLDLSVNF-NTDANDQYPRKNKISAIRKLAHVLFSHQRHGRCYGVSPIVTARVPVLKV 180
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ CDIS++N G +S +IS ID RFR + L+K WAK HD+N PK T +
Sbjct: 181 IDKGTGVECDISVENKDGMSRSAIFKFISSIDKRFRILCYLMKFWAKVHDVNCPKDRTMS 240
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
S ++ LV FH QT P ILP + L D A+ E+ ++ F +
Sbjct: 241 SMAIISLVSFHLQTRCPPILPAFSAV----LKDGTD--FASIEKNVSLFQGFGHS----- 289
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIRSNTRWLPNNHPLFI 298
N+ S+A LFVS + K + E G+C F G W + W L +
Sbjct: 290 -----NKESIAELFVSLMSKLVSVE-GLWEQGLCASNFEGTW----ISKTWAKGVGNLNV 339
Query: 299 EDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESP 358
ED ++ +N AR+V K + KI + L+ ++ A + + + + +
Sbjct: 340 EDFLDRSQNFARSVGVKEMQKICECLRASVSDLSKFSKGEIA--APKLKALLFKPLNQVN 397
Query: 359 VRYANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKKENR----PNRSMSQQSVQQHQS 412
+ +R R +K+ +P+Q + A ++++ P + ++++ Q +S
Sbjct: 398 PVIKPHQKTIKRKRTNPNKTRTNPIQKNAKKKKALEQDKLAISPGQKDGKKTLDQEKS 455
>gi|326514790|dbj|BAJ99756.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 151/327 (46%), Gaps = 29/327 (8%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRG--ATVEPFGSFVSNLFSRWGDLDIS 70
+ ML P D+E R +I ++ + + + VEPFGSF +LF+ DLD+S
Sbjct: 67 ETYAMLRPKPVDYEQRRTMIDVFNKIAKDIFGKKDEFPVVEPFGSFTMDLFTTKSDLDLS 126
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQ---KGGYRRLQFVAHARVPILKFETIHQNI 127
+ SN +K K S++ + L++ +G + V A VP+LK +
Sbjct: 127 VNFSN-DMDGQFARKDKISVIRKFAKVLQKHQSRGRCYGVLPVVSALVPVLKVTDKGTGV 185
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
CDIS++N G +S +S ID RF+ + L+K WAK HD+N PK T +S + L
Sbjct: 186 ECDISVENKDGMSRSMIFKLVSSIDERFQILCYLMKFWAKTHDVNCPKDRTMSSMVIISL 245
Query: 188 VLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINR 247
V FH QT P ILP + G L D A+ +R + F IN+
Sbjct: 246 VAFHLQTRHPPILP----AFSGLLKDGAD--FASVQRNVVLFKGFG----------SINK 289
Query: 248 SSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
S+A LFVS + K + E G+C F G W + W L +ED ++
Sbjct: 290 ESVAELFVSLMSKLVAVK-DLWEQGLCASNFDGFW----ISKTWKRGIGNLSVEDFLDRS 344
Query: 306 ENSARAVSEKNLAKISNAFEMTHFRLT 332
+N AR+V + + I + T +LT
Sbjct: 345 QNFARSVGKMEMQNICECLKDTVSKLT 371
>gi|357142242|ref|XP_003572505.1| PREDICTED: terminal uridylyltransferase 7-like [Brachypodium
distachyon]
Length = 563
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 159/328 (48%), Gaps = 30/328 (9%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+I ML P D+E R +I ++ + V ++ R VE FGSF +LF+ DLD+S
Sbjct: 62 EIYAMLRPKPVDYEQRHIMIDVFNKIAKDVCGKNNRFPVVEAFGSFTMDLFTAKSDLDLS 121
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQ---KGGYRRLQFVAHARVPILKFETIHQNI 127
+ S + K S++ + LRQ +G + V +A VP+LK +
Sbjct: 122 VNFSADR-DGEFDRNKKISVIRKFAKVLRQHQSRGRCYGVLPVVNAIVPVLKVTDKGTGV 180
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
CDIS++N G +S +S ID RFR + L+K WAK+HD+N P+ T +S ++ L
Sbjct: 181 ECDISVENKDGMSRSMIFKLVSSIDERFRILCYLMKFWAKSHDVNCPRDRTMSSMAIISL 240
Query: 188 VLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINR 247
V FH QT P ILP L+ D + A+ +R N++ F R N+
Sbjct: 241 VAFHLQTRRPPILPAF-----SRLLKDGADI-ASIQR--------NVSLFEGFGSR--NK 284
Query: 248 SSLAHLFVSFLEKFSGLSLKAS-ELGICP--FTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
S+A LFVS + K LS++ E G+C G W + R + N L +ED ++
Sbjct: 285 ESVAELFVSLMSKL--LSVQGLWEQGLCASNLEGSWILKMTWDRGIGN---LAVEDFLDR 339
Query: 305 PENSARAVSEKNLAKISNAFEMTHFRLT 332
+N AR+V + + I T +LT
Sbjct: 340 NQNFARSVGKVEMQTICECLRDTVCKLT 367
>gi|218202670|gb|EEC85097.1| hypothetical protein OsI_32469 [Oryza sativa Indica Group]
Length = 553
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 147/316 (46%), Gaps = 33/316 (10%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSRWGDLDISI 71
+ ML P D+E R ++ + + VE FGSF +LF+ DLD+S+
Sbjct: 75 VYTMLRPKPLDYEQRTTLVHVFNNIANQIFGNNNGFPVVEAFGSFTMDLFTPRSDLDLSV 134
Query: 72 ELSNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
+ + A KK ++ +L + ++ G + + V ARVPI+ I C
Sbjct: 135 NFTANTDDQYARKKKISAIRKFAKVLYSHQRNGIFCGVLPVVTARVPIVNVIDRGTGIEC 194
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
DI+++N G +S +IS +D RF+ + LVK WAK HD+N+P+ T +S S+ LV
Sbjct: 195 DITVENKDGMTRSMIFKFISSLDPRFQILSYLVKFWAKIHDVNSPRERTLSSMSIVSLVA 254
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
FH QT P ILPPL + D + V N AF + + N+ +
Sbjct: 255 FHLQTRDPPILPPLSALLKDG--SDFESVERNT-------LAFK-------GFGRTNKET 298
Query: 250 LAHLFVSFLEKFSGLSLKASEL---GICP--FTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
+A LFVS + K L A L G+C F W + W L +ED ++
Sbjct: 299 VAELFVSLISKL----LSAESLWEHGLCASNFEASW----ISKTWKKGIGNLNVEDFLDR 350
Query: 305 PENSARAVSEKNLAKI 320
+N AR+V +K + KI
Sbjct: 351 SQNFARSVGKKEMQKI 366
>gi|357160218|ref|XP_003578694.1| PREDICTED: terminal uridylyltransferase 7-like [Brachypodium
distachyon]
Length = 538
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 150/319 (47%), Gaps = 31/319 (9%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSRWGDLDISI 71
+ +L P D+E R ++ E+ + V+ FGSF +LF+ DLD+S+
Sbjct: 72 VYTVLRPKAVDYEQRNTLVDVFNEMTNKIFGNNNGFPVVQAFGSFTMDLFTPRSDLDLSV 131
Query: 72 ELSNGSCISSAGKKVKQSLL---GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNIS 128
S + A KK K S++ +L +L++ G Y + V ARVPI+ I
Sbjct: 132 NFSAETEDQCARKK-KISVIRKFAKVLYSLQRNGVYCGVLPVLSARVPIINVIDRGTGIE 190
Query: 129 CDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV 188
CDISI+N G +S +IS +D RF+ + L+K WAK HD+N+P T +S S+ LV
Sbjct: 191 CDISIENKDGMTRSMVFKFISSLDERFQILSYLMKIWAKIHDVNSPSKQTMSSMSIISLV 250
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS 248
FH QT P ILP + D K V N F + F S N+
Sbjct: 251 AFHLQTRHPPILPAFSALLKDG--SDFKSVEKN---------IFLLKGFGS-----TNKE 294
Query: 249 SLAHLFVSFLEKFSGLSLKAS-ELGICP--FTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
S+A LFVS + K LS+++ E G+C F W + W L +ED ++
Sbjct: 295 SVAELFVSLISKL--LSVESLWEHGLCASNFEASW----ISKTWKKGVGNLSVEDFLDRS 348
Query: 306 ENSARAVSEKNLAKISNAF 324
+N ARAV + KI N
Sbjct: 349 QNFARAVGKAEKQKICNCL 367
>gi|222642142|gb|EEE70274.1| hypothetical protein OsJ_30421 [Oryza sativa Japonica Group]
Length = 553
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 147/316 (46%), Gaps = 33/316 (10%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSNLFSRWGDLDISI 71
+ ML P D+E R ++ + + VE FGSF +LF+ DLD+S+
Sbjct: 75 VYTMLRPKPLDYEQRTTLVHVFNNIANQIFGNNNGFPVVEAFGSFTMDLFTPRSDLDLSV 134
Query: 72 ELSNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
+ + A KK ++ +L + ++ G + + V ARVPI+ I C
Sbjct: 135 NFTANTDDQYARKKKISAIRKFTKVLYSHQRNGIFCGVLPVVTARVPIVNVIDRGTGIEC 194
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
DI+++N G +S +IS +D RF+ + LVK WAK HD+N+P+ T +S S+ LV
Sbjct: 195 DITVENKDGMTRSMIFKFISSLDPRFQILSYLVKFWAKIHDVNSPRERTLSSMSIVSLVA 254
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
FH QT P ILPPL + D + V N AF + + N+ +
Sbjct: 255 FHLQTRDPPILPPLSALLKDG--SDFESVERNT-------LAFK-------GFGRTNKET 298
Query: 250 LAHLFVSFLEKFSGLSLKASEL---GICP--FTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
+A LFVS + K L A L G+C F W + W L +ED ++
Sbjct: 299 VAELFVSLISKL----LSAESLWEHGLCASNFEASW----ISKTWKKGIGNLNVEDFLDR 350
Query: 305 PENSARAVSEKNLAKI 320
+N AR+V +K + KI
Sbjct: 351 SQNFARSVGKKEMQKI 366
>gi|413921759|gb|AFW61691.1| hypothetical protein ZEAMMB73_856825 [Zea mays]
Length = 604
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 169/377 (44%), Gaps = 43/377 (11%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRR 106
VEPFGSF +LF+ DLD+S+ N +K K S + L L ++ G
Sbjct: 22 VEPFGSFTMDLFTPQSDLDLSVNF-NTDANDQYPRKNKISAIRKLAHVLFSHQRHGRCYG 80
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+ + ARVP+LK + CDIS++N G +S +IS ID RFR + L+K WA
Sbjct: 81 VSPIVTARVPVLKVIDKGTGVECDISVENKDGMSRSAIFKFISSIDKRFRILCYLMKFWA 140
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQI 226
K HD+N PK T +S ++ LV FH QT P ILP + L D A+ E+ +
Sbjct: 141 KVHDVNCPKDRTMSSMAIISLVSFHLQTRCPPILPAFSAV----LKDGTD--FASIEKNV 194
Query: 227 AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIR 284
+ F + N+ S+A LFVS + K + E G+C F G W
Sbjct: 195 SLFQGFGHS----------NKESIAELFVSLMSKLVSVE-GLWEQGLCASNFEGTW---- 239
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSS 344
+ W L +ED ++ +N AR+V K + KI + L+ ++ A
Sbjct: 240 ISKTWAKGVGNLNVEDFLDRSQNFARSVGVKEMQKICECLRASVSDLSKFSKGEIAAPKL 299
Query: 345 LARPFILQFFGESPVRYAN-----YNNGHRRARPQSHKSVNSPLQAQHQSHNAKKENR-- 397
A F P+ N + +R R +K+ +P+Q + A ++++
Sbjct: 300 KALLF-------KPLNQVNPVIKPHQKTIKRKRTNPNKTRTNPIQKNAKKKKALEQDKLA 352
Query: 398 --PNRSMSQQSVQQHQS 412
P + ++++ Q +S
Sbjct: 353 ISPGQKDGKKTLDQEKS 369
>gi|412990896|emb|CCO18268.1| predicted protein [Bathycoccus prasinos]
Length = 860
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 171/398 (42%), Gaps = 71/398 (17%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
LE L+ + M D R +++ L V+ + T++PFGSFVS ++ D
Sbjct: 118 LERALQKCVQMQKATASDDVKRERLLKKLETVLTA--RFDAVTIDPFGSFVSAFHTKNSD 175
Query: 67 LDISIELSNGS-------------CISSAGKKVKQ---------SLLGDLLRALRQKGGY 104
+D+S+ + S S A + Q LL LR + Y
Sbjct: 176 IDVSLTIHPSSQWYNEEEERKYRDAQSGAPRPRAQRRQHRTKRVQLLAKFASELRWRK-Y 234
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+Q +AHARVP++KF ++CD+ + N G KS L +++ D +RD+V VK
Sbjct: 235 DDVQLIAHARVPLVKFRDPETGVACDVCVHN-DGVYKSAVLGFVADHDRLYRDLVFCVKM 293
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI-YPG------------NL 211
WAK ++N+ GTFNSYSL LL LF Q I PP+ +I P N
Sbjct: 294 WAKNWNVNDAINGTFNSYSLCLLALFTLQRH--GICPPMANITLPDEESLEKEMQRVQNE 351
Query: 212 VDDLKGV-----------RANAERQIAEICAFNIARFSSDKYRKI---NRSSLAHLFVSF 257
++ K + RA+A+R I + +DKY N+ +LA LFV F
Sbjct: 352 CEETKELGKPREVSHERKRADAQRNPHAI------KPKADKYHSYSSGNQKTLAELFVDF 405
Query: 258 LEKFSGLS-LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
S + L A L + G+W W + + +EDPF +N ARAV ++
Sbjct: 406 FVTLSAVEPLWAKGLVASTYAGRWT---CGCSWPLRKYKIGVEDPFASGDNVARAVQRRS 462
Query: 317 LAKISNAFE---MTHFRL---TSTNQTRYALLSSLARP 348
+ A MT R+ + Q A++ L P
Sbjct: 463 APVVFGAIRGAAMTVKRILWAENDEQFEMAMMDMLGDP 500
>gi|359480663|ref|XP_002272983.2| PREDICTED: uncharacterized protein LOC100247367 [Vitis vinifera]
Length = 482
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/255 (34%), Positives = 123/255 (48%), Gaps = 30/255 (11%)
Query: 75 NGSCISSAGKKVKQSLLGD----LLRALRQKGGYRRLQ---FVAHARVPILKFETIHQNI 127
+G + ++G LL D +LRA G + + ARVPILK I
Sbjct: 24 DGCTLGNSGHLGIGELLRDHYDKVLRAFSNHAGIGHVSNVILITGARVPILKITDRGTGI 83
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
CDIS++N G KS+ + +S ID RF+ + L+K WAKAHDIN+ K T NS S+ LL
Sbjct: 84 ECDISVENRDGIAKSRIIRMVSSIDHRFQKLSFLMKAWAKAHDINSSKEHTLNSLSIILL 143
Query: 188 VLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINR 247
V FH QT P ILPP I D++ V N + Y ++N+
Sbjct: 144 VAFHLQTRDPPILPPFSVILKDG--SDMETVTKNVINFLG--------------YGEVNK 187
Query: 248 SSLAHLFVSFLEKFSGLSLKASELGICP--FTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
SLA LFV+ L K + S+ G+C + G W + W + +ED ++
Sbjct: 188 ESLAELFVTLLLKLQSIETLWSK-GLCASIYDGSWIY----KTWDSGVGCINVEDFTDRS 242
Query: 306 ENSARAVSEKNLAKI 320
+N ARAV+ K + KI
Sbjct: 243 QNVARAVATKQVTKI 257
>gi|440800601|gb|ELR21637.1| nucleotidyltransferase domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 976
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/337 (27%), Positives = 156/337 (46%), Gaps = 27/337 (8%)
Query: 10 ILKDIL---GMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
I KD+L L P ++ + ++ VI L+ +V ++ A + FGS + + D
Sbjct: 348 IAKDMLLSFETLRPSDQEMQAKLDVIKRLQRIVGNLWPGYQAKLNLFGSSANGFCLKNSD 407
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LDI + + AG K K ++ + R LR+ + + ++HA VPI+KFE
Sbjct: 408 LDICMTIDK-----RAGTKKK--IVNRIARVLREHK-MKDVTALSHASVPIVKFEDPLSK 459
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
SCDI I+N+ + + S++D R + VK WAK ++ P TGT +SY+ L
Sbjct: 460 FSCDICINNILALHNTHMIAQYSRVDSRLLQLGYFVKHWAKCRKLDEPYTGTLSSYAWIL 519
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD----KY 242
LV+ Q P +LP L+ + P DL+G + + + N +S D ++
Sbjct: 520 LVINFLQQRSPPVLPCLQRVAPSG---DLRG-----DVPVVMVKGHNCYYYSDDIRRLRF 571
Query: 243 RKINRSSLAHLFVSFL----EKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFI 298
R N+ +LA L + F E+F + S T + + + + NH I
Sbjct: 572 RSQNQETLAELLLEFFYLYAEEFDYEHMVVSVRRGTMLTKKEKRWDKLPKTVKENHWFSI 631
Query: 299 EDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTN 335
EDPF+ + R V + NL I + F + LT+T+
Sbjct: 632 EDPFDLTHDLGRVVDQDNLKAIQHEFRRAYTLLTTTS 668
>gi|326534154|dbj|BAJ89427.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 544
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 149/331 (45%), Gaps = 31/331 (9%)
Query: 2 GSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGA--TVEPFGSFVSN 59
G L +L ++ +L P D+E R ++ ++ + V+ FGSF +
Sbjct: 64 GRLPALNDLLLEVYAVLRPKPVDYEQRNALVDVFNKMATRIFGNDNGFPVVQAFGSFTMD 123
Query: 60 LFSRWGDLDISIELS---NGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVP 116
LF+ DLD+S+ S C K +L +L++ G Y + V A+VP
Sbjct: 124 LFTPKSDLDLSVNFSAEIEDQC-PRKKKMKVVRKFAKVLYSLQRDGIYCGVLPVVSAKVP 182
Query: 117 ILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
I+ I CDI+++N G +S + IS +D RF+ + LVK WAK HD+N+P
Sbjct: 183 IVNVIDRGTGIECDITVENKDGMTRSMIVKLISSLDERFQILSYLVKTWAKIHDVNSPTA 242
Query: 177 GTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
T +S S+ LV FH QT P ILP L+ D A+ E+ I F
Sbjct: 243 QTMSSMSIISLVAFHLQTRHPPILPAFS-----ALLKDGSDF-ASVEKNILLFKGFG--- 293
Query: 237 FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS-ELGICP--FTGQWEHIRSNTRWLPNN 293
N+ S+A LFV+ + K LS+++ E G+C F W + W
Sbjct: 294 -------STNKESVAELFVTLMSKL--LSVESLWEHGLCASNFEASW----ISKTWKKGI 340
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L +ED ++ +N ARAV + + KI
Sbjct: 341 GNLSVEDFLDRSQNFARAVGKTQMQKICTCL 371
>gi|302823113|ref|XP_002993211.1| hypothetical protein SELMODRAFT_431350 [Selaginella moellendorffii]
gi|300138981|gb|EFJ05731.1| hypothetical protein SELMODRAFT_431350 [Selaginella moellendorffii]
Length = 237
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 119/233 (51%), Gaps = 29/233 (12%)
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV- 162
Y ++Q + A VP++KF I CD+S+DN G KS L +S ID RFR + LLV
Sbjct: 5 YDQIQPIYKATVPVVKFVDRKTGIQCDLSVDNKDGASKSLVLAALSSIDKRFRPLCLLVP 64
Query: 163 -----KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
K+WAK+H+IN+ GT +SY ++LL +FH QTC P +LPPL I G + D
Sbjct: 65 EVLNLKKWAKSHEINDASAGTLSSYVITLLAIFHLQTCSPPVLPPLSMIIGGLDLRD--- 121
Query: 218 VRANAERQIAEICA-FNIAR---FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI 273
A C+ F AR F R ++R ++ LF +F K + + E G+
Sbjct: 122 ---------ASYCSGFISARAKAFQGFGARNMDR--ISELFRTFFVKITAVKPLWQE-GL 169
Query: 274 CPFTGQWEHIRSNTRWLP-NNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
C T H + ++W +N + +ED ++ N+A+ V+ K+ I E
Sbjct: 170 CAST---YHAQWISKWPSFHNGCICVEDFYDPSRNAAKGVTPKDFECIYQGLE 219
>gi|303274753|ref|XP_003056692.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461044|gb|EEH58337.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 298
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 67/183 (36%), Positives = 96/183 (52%), Gaps = 28/183 (15%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGS-------CISSAGKKVKQSLLGDLLRALR 99
G T+ PFGSFVS + D+DIS+E++ S +A + R L+
Sbjct: 62 GVTLRPFGSFVSVFHTASSDIDISLEVAPSSKWYDPKEMGPAAAAAAPGARGAGRNRRLQ 121
Query: 100 QKGGY--RRLQF------------------VAHARVPILKFETIHQNISCDISIDNLCGQ 139
Q GY R++Q +AH RVP++KF+ ++CD+ + N G
Sbjct: 122 QPRGYKSRKVQLLHKVASELRYQAFSEVNLIAHTRVPLIKFKDPQTGVNCDVCVGN-DGV 180
Query: 140 IKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAI 199
KS L ++ +D R+RD+V LVK WAK D N+ G+FNS++LSL+ LFH QT P I
Sbjct: 181 YKSACLGAMANLDSRYRDLVFLVKMWAKNFDCNDATAGSFNSFALSLMSLFHLQTRSPPI 240
Query: 200 LPP 202
LPP
Sbjct: 241 LPP 243
>gi|240255510|ref|NP_190162.4| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|332644547|gb|AEE78068.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 474
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 110/194 (56%), Gaps = 8/194 (4%)
Query: 20 PLREDWETRMKVISDLREVV-----ESVESLRGATVEPFGSFVSNLFSRWGDLDISIELS 74
P+ D+ TR +++ +L + +S ES +E +GSF N FS DLD+SI S
Sbjct: 56 PVSADYNTRKELVKNLNAMAIDIFGKSEES--SPVLEAYGSFAMNTFSSQKDLDVSINFS 113
Query: 75 NGSCISSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISI 133
+G+ KK++ + LR+L +G R + + ARVPI++F I CD+++
Sbjct: 114 SGTSEFYREKKLEILTRFATKLRSLEGQGFVRNVVPILSARVPIVRFCDQGTGIECDLTV 173
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
++ G + S+ + ISQID RF+ + LL+K WA+AH +NN T NS S+++LV H Q
Sbjct: 174 ESKDGILTSQIIRIISQIDDRFQKLCLLIKHWARAHGVNNASHNTLNSISITMLVAHHLQ 233
Query: 194 TCVPAILPPLKDIY 207
T P ILPP ++
Sbjct: 234 TQSPPILPPFSTLF 247
>gi|413921758|gb|AFW61690.1| hypothetical protein ZEAMMB73_856825 [Zea mays]
Length = 260
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 113/236 (47%), Gaps = 23/236 (9%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL---RQKGGYRR 106
VEPFGSF +LF+ DLD+S+ N +K K S + L L ++ G
Sbjct: 22 VEPFGSFTMDLFTPQSDLDLSVNF-NTDANDQYPRKNKISAIRKLAHVLFSHQRHGRCYG 80
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+ + ARVP+LK + CDIS++N G +S +IS ID RFR + L+K WA
Sbjct: 81 VSPIVTARVPVLKVIDKGTGVECDISVENKDGMSRSAIFKFISSIDKRFRILCYLMKFWA 140
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQI 226
K HD+N PK T +S ++ LV FH QT P ILP + D A+ E+ +
Sbjct: 141 KVHDVNCPKDRTMSSMAIISLVSFHLQTRCPPILPAFSAVLKDG--TDF----ASIEKNV 194
Query: 227 AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQW 280
+ F + N+ S+A LFVS + K + E G+C F G W
Sbjct: 195 SLFQGFGHS----------NKESIAELFVSLMSKLVSVE-GLWEQGLCASNFEGTW 239
>gi|388518457|gb|AFK47290.1| unknown [Lotus japonicus]
Length = 182
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/179 (39%), Positives = 98/179 (54%), Gaps = 20/179 (11%)
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
ASELGICP+TGQWE I++N WLP + +F+EDPFEQP+N+AR+VS L+KIS AF T
Sbjct: 5 ASELGICPYTGQWEQIKNNMIWLPKTYAIFVEDPFEQPQNTARSVSAGQLSKISEAFLRT 64
Query: 328 HFRLTSTNQTRYALLSSLARPFILQFFGESPVRYANYNNG-----------------HRR 370
+ LTS NQ + +LL+ LA P + + + PV NYN G H R
Sbjct: 65 YSVLTSKNQNQNSLLTFLAPPEVSRLIIK-PV-IPNYNGGGGYFHPPQLPLHVQRAEHPR 122
Query: 371 ARPQSHKSVNSPLQAQHQSHNAKKENRPNRSMSQQSVQQHQSQPVRQINGQVQQIWRPK 429
+ +S + V++ Q N + ++ ++ S P Q+ Q QQIWR K
Sbjct: 123 HQHRSQRRVHNGSQGTTNGQNMEAKDPIAKTQKGNSNSSTSKVPPVQVR-QGQQIWRQK 180
>gi|405976862|gb|EKC41341.1| Poly(A) RNA polymerase gld-2-like protein A [Crassostrea gigas]
Length = 367
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 137/292 (46%), Gaps = 25/292 (8%)
Query: 52 PFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVA 111
P GS +S + D+D+ + ++ +K + + L++ K + R V
Sbjct: 89 PMGSTMSGFGTMKSDMDMCLMITE----DGVDQKREAPEILYLIQKALYKCSFVRESTVI 144
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
A+VPIL+F + D++++N G + L + D R R +VL +K+WA+ HDI
Sbjct: 145 RAKVPILRFNDLISKAQVDLNVNNGVGIRNTHLLKYYCMTDWRVRPLVLYIKKWARFHDI 204
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
N+ T +SYSL L+++ + Q C P +LP L+++YP R + I E+
Sbjct: 205 NDASKATISSYSLCLMLIHYLQYACSPPVLPSLQELYPE---------RFDGTLDIRELK 255
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL 290
+ + SD N S+ LF+ FL +S + E IC G+ + R+
Sbjct: 256 FDDTVSYKSD-----NGQSVGELFLGFLAYYSN-KYRFEEDCICIREGRRYSLDDYMRFK 309
Query: 291 PNN---HPLFIEDPFEQPENSARAVSEKNL-AKISNAFEMTHFRLTSTNQTR 338
N PL IE+PF+ N+AR+ +KN+ ++ F+ ++ L R
Sbjct: 310 NENFQLQPLCIEEPFDL-SNTARSCHDKNIFNRVKRVFKKSYQELQKKKDVR 360
>gi|307105741|gb|EFN53989.1| hypothetical protein CHLNCDRAFT_135962 [Chlorella variabilis]
Length = 1405
Score = 106 bits (264), Expect = 2e-20, Method: Composition-based stats.
Identities = 70/184 (38%), Positives = 104/184 (56%), Gaps = 23/184 (12%)
Query: 36 REVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLG--- 92
+EV+ SVE+ GSFVS L++ GDLD+SIE + S ++G+ + ++ G
Sbjct: 111 QEVLPSVET---------GSFVSGLYTPQGDLDLSIE-GDASWEDASGRLQQVTVDGMER 160
Query: 93 ----DLLRALRQKGGYRRLQF-----VAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
LRAL + +RL + HARVPILKF I + D+ I KS
Sbjct: 161 EMKVRFLRALASRIQAKRLSLGQVDRILHARVPILKFRDI-SGLDFDVGIGGSHALFKST 219
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL 203
+ ++Q D RF +V L+K WA+ HD+N+ GT NS++L+LL++FH QT PA+LPPL
Sbjct: 220 VMGLLAQYDWRFGALVRLLKLWARQHDVNDSTNGTLNSFALTLLLVFHLQTRRPAVLPPL 279
Query: 204 KDIY 207
++
Sbjct: 280 CQLF 283
>gi|449669254|ref|XP_002166747.2| PREDICTED: poly(A) RNA polymerase GLD2-A-like [Hydra
magnipapillata]
Length = 437
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 111/225 (49%), Gaps = 16/225 (7%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L ++ + LR R +QF+ A+VPILKF+ CDI+ +N G + L S+
Sbjct: 200 LKEIQKLLRYMSCIRNIQFI-RAKVPILKFKDTVSGCDCDINTNNSIGIRNTHLLRTYSK 258
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPG 209
ID R R +++ VK WAK+ IN+ GT +SYSL ++V+ + Q+ C P +L P++ YP
Sbjct: 259 IDDRVRPLIMAVKHWAKSRSINDASQGTLSSYSLVMMVIHYLQSYCRPPVLTPIQQEYPQ 318
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS 269
D R + ++ F A K N S L F K+ L K
Sbjct: 319 YFSLD---------RNVDDLPMFEPALLIPCNCSK-NEQSHGELLFGFF-KYYSLEFKGD 367
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
E+ I GQ SN W N+ + IE+PF++ N+ARAV E
Sbjct: 368 EMVISVRLGQATPRSSNAIW--NDAYICIEEPFDRT-NTARAVHE 409
>gi|390177938|ref|XP_003736525.1| GA27190 [Drosophila pseudoobscura pseudoobscura]
gi|388859261|gb|EIM52598.1| GA27190 [Drosophila pseudoobscura pseudoobscura]
Length = 1277
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 138/299 (46%), Gaps = 37/299 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQ--FVA 111
GS +SN S+ D+D+ + G S + + L ++R+L G R Q +
Sbjct: 912 GSSISNFGSKCSDMDMCMV---GYSNPSLDPRTEAVLHLQMMRSLLS--GTNRFQDFHLI 966
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F + DI+ +N G + L+ SQ++ R R M L VK+WA+ H+I
Sbjct: 967 EARVPILRFTDSQHKVEIDINFNNSVGIRNTHLLYCYSQLEWRVRPMALAVKQWAQHHNI 1026
Query: 172 NNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
NN K T +SYSL L+V+ Q C P ++P L +YP +L ++ + E+
Sbjct: 1027 NNAKNMTISSYSLMLMVIHFLQAGCSPPVIPCLHSLYPQKF--ELLDNSSSGYVDMNEVM 1084
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASEL-----GICPFTGQWEHIRS 285
A Y N +L L + FL +S + + G+ P E R+
Sbjct: 1085 A---------PYESQNTQNLGELMLQFLHYYSTFEFRKHAISIRTGGLLPI----ELCRA 1131
Query: 286 NTRWLPNNH-----PLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTSTNQTR 338
T P N L IE+PF+Q N+AR+V + + ++ F + RL ST R
Sbjct: 1132 AT--APKNDIHQWIELCIEEPFDQ-TNTARSVYDPDTFERVRAIFLCSFRRLESTRNLR 1187
>gi|345491496|ref|XP_001605928.2| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 1
[Nasonia vitripennis]
gi|345491498|ref|XP_003426625.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 2
[Nasonia vitripennis]
Length = 683
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 148/305 (48%), Gaps = 44/305 (14%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG-YR 105
G+T+ FGS VS D+D+ + + + S + G+ + + L ++ LR+ G Y
Sbjct: 378 GSTLNGFGSNVS-------DVDMCLHVRDTSNVDQRGEAIYR--LEQIMMCLRRSGKPYV 428
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
R + A+VPILK N+ D++ +N+ G + L+ S+ID R R +VL+VK W
Sbjct: 429 RELELIQAKVPILKIHDSVYNLDVDLNYNNVVGIRNTHLLYCYSRIDWRVRPLVLVVKMW 488
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNL--VDDLKGVRAN 221
A+ +INN + T +SYSL L+V+ HF C PA+LP L +++ G D+ + +
Sbjct: 489 AQCQNINNARHMTMSSYSLVLMVI-HFLQCGVTPAVLPCLHNLFKGKFHPFSDIHSIDIH 547
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASE---L 271
E I + N +L L + F + ++ +S++ ++ +
Sbjct: 548 EELNIP-----------NGALHPRNTQTLGELLIEFFKYYNTFDYEHYAISVRVADKIPI 596
Query: 272 GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFR 330
C + +++ ++ L IE+PF+ N+AR+V + N I F+ T+ R
Sbjct: 597 ETCRYVRSFKNDPHQWKY------LCIEEPFDF-TNTARSVYDPNAFQMIKEIFKQTYHR 649
Query: 331 LTSTN 335
L TN
Sbjct: 650 LKKTN 654
>gi|168040900|ref|XP_001772931.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675842|gb|EDQ62333.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 172
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 92/173 (53%), Gaps = 20/173 (11%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
V A VP++KF +H NI CD+S++N+ G +KS+ + ++ID R+R + L+K WAKA+
Sbjct: 2 VMKAAVPVVKFVEVHTNIECDVSMENMDGVLKSELIGIFTKIDLRYRQLCFLLKAWAKAY 61
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
++N+ K GT NS S+ L FH QT P ILP + G R + +
Sbjct: 62 NVNDSKKGTLNSLSIIFLAAFHLQTRSPPILPSFSALLEG--------------RSLPLV 107
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--FTGQW 280
+N+ + + + N+ +L LF SF KF + E G+C + G+W
Sbjct: 108 SMWNLV---NHGFGRDNKETLGQLFGSFFTKFLAVE-SLWEQGLCASVYEGKW 156
>gi|195145633|ref|XP_002013796.1| GL23210 [Drosophila persimilis]
gi|194102739|gb|EDW24782.1| GL23210 [Drosophila persimilis]
Length = 1280
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 138/299 (46%), Gaps = 37/299 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQ--FVA 111
GS +SN S+ D+D+ + G S + + L ++R+L G R Q +
Sbjct: 915 GSSISNFGSKCSDMDMCMV---GYTNPSLDPRTEAVLHLQMMRSLLS--GTNRFQDFHLI 969
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F + DI+ +N G + L+ SQ++ R R M L VK+WA+ H+I
Sbjct: 970 EARVPILRFSDSQHKVEIDINFNNSVGIRNTHLLYCYSQLEWRVRPMALAVKQWAQHHNI 1029
Query: 172 NNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
NN K T +SYSL L+V+ Q C P ++P L +YP +L ++ + E+
Sbjct: 1030 NNAKNMTISSYSLMLMVIHFLQAGCSPPVIPCLHSLYPQKF--ELLDNSSSGYVDMNEVM 1087
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASEL-----GICPFTGQWEHIRS 285
A Y N +L L + FL +S + + G+ P E R+
Sbjct: 1088 A---------PYESQNTQNLGELMLQFLHYYSVFEFRKYAISIRTGGLLPI----ELCRA 1134
Query: 286 NTRWLPNNH-----PLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTSTNQTR 338
T P N L IE+PF+Q N+AR+V + + ++ F + RL ST R
Sbjct: 1135 AT--APKNDIHQWIELCIEEPFDQ-TNTARSVYDPDTFERVRAIFLCSFRRLESTRNLR 1190
>gi|66816699|ref|XP_642359.1| hypothetical protein DDB_G0278425 [Dictyostelium discoideum AX4]
gi|60470405|gb|EAL68385.1| hypothetical protein DDB_G0278425 [Dictyostelium discoideum AX4]
Length = 1090
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 147/326 (45%), Gaps = 40/326 (12%)
Query: 35 LREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDL 94
L+ +V + G + FGS + + + GD+DI + + S +S ++ +
Sbjct: 780 LQNLVSKIFPNSGVKLHLFGSSANGMSLKNGDIDICMVIDQSSEGTS-------DVIIER 832
Query: 95 LRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR 154
L + + G++++ + ARVPI+KF+ + +SCDI ++N ++ + S ID R
Sbjct: 833 LAEMLKINGFQKILAIPTARVPIVKFKDPNTGLSCDICMNNRLAIYNTRLVQDYSMIDER 892
Query: 155 FRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK---------- 204
+ +V +VK WAK IN P GT +SY+ LV+ QT P ILP L+
Sbjct: 893 MKPLVYVVKRWAKRRKINEPSLGTLSSYAYINLVISFLQTRQPPILPCLQELANGPKLIN 952
Query: 205 -----DIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLE 259
D+ P +VD G I+++ F SD + S + H F ++
Sbjct: 953 GKEYGDLLPDVMVD---GFNCKYYNDISKLVG-----FGSDNKETLG-SLVFHFFQTYSR 1003
Query: 260 KFSGLSLKASELGICPF---TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
+FS ++ S P + WE I + + IEDPFE N AR V+ N
Sbjct: 1004 EFSFMNQVVSIRTGSPIQKSSKTWESIAKKSHYW-----FSIEDPFETTHNLARVVNRPN 1058
Query: 317 LAKISNAFEMTHFRLTSTNQTRYALL 342
L+ I + ++L S N + +L
Sbjct: 1059 LSIIISELNRG-YKLLSKNSNLHKVL 1083
>gi|114797027|gb|ABI79451.1| polymerase beta nucleotidyltransferase [Chlamydomonas reinhardtii]
Length = 924
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/374 (25%), Positives = 167/374 (44%), Gaps = 48/374 (12%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
S + L +L+ ++ +P + + R ++I ++ ++ ++ V+P+GSFVS ++
Sbjct: 6 SADELAAVLEQVVQATSPTVDAYRMRTRLIDSIQGALKHRIGVQDLHVQPYGSFVSQQYN 65
Query: 63 RWGDLDISI---------------ELSNGSC------ISSAGKKVKQSLLGDLLRALRQK 101
DLD+++ E+ G + K+ K +LL D L
Sbjct: 66 AGSDLDLALCGYIPAAKLKPAALAEIYRGEPEEELVPLHKLDKRTKANLLRDAGYRLAGS 125
Query: 102 GGYRR--LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMV 159
G R ++FV HARVPI+KF I D+ + N K+ + +++I+ F +
Sbjct: 126 GVASRDSMEFVLHARVPIVKFADRATGIEVDLCLGNAATSFKAWSVARVAEINPAFGRLY 185
Query: 160 LLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ---TCVPAILPPLKD-IYPGNLVDDL 215
+VK WAKAH IN+ + FNS+ L+L+V++ Q + A+LPPL + +Y +D
Sbjct: 186 KVVKLWAKAHGINDGASHMFNSWCLTLVVMYFLQQYPSREQALLPPLCELLYEKRPAEDS 245
Query: 216 KGVRANAERQIAEICAFNIARFS--SDKYRKINRSSLAHLFVSFLEK----FSGLSLKAS 269
+ E+ R S + Y L LF F+++ GL L A
Sbjct: 246 PRLMQKGAELPPEVLKVMEQRASQAAKVYGARPCPPLLELFRDFVQQCGRNLRGL-LAAQ 304
Query: 270 E-----LGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE-----KNLAK 319
E I F GQ H +R + + L +EDP++ +N+AR +
Sbjct: 305 ESFRRGTRISAFYGQLLH----SRPYASQYVLCVEDPYDDNDNTARTFGTWDGHPGTIHY 360
Query: 320 ISNAFEMTHFRLTS 333
+++ FE T RL S
Sbjct: 361 VTSVFERTARRLNS 374
>gi|19112002|ref|NP_595210.1| poly(A) polymerase Cid11 (predicted) [Schizosaccharomyces pombe
972h-]
gi|74626844|sp|O74326.1|CID11_SCHPO RecName: Full=Poly(A) RNA polymerase cid11; Short=PAP; AltName:
Full=Caffeine-induced death protein 11; AltName:
Full=Polynucleotide adenylyltransferase cid11
gi|3367789|emb|CAA20054.1| poly(A) polymerase Cid11 (predicted) [Schizosaccharomyces pombe]
Length = 478
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 113/460 (24%), Positives = 183/460 (39%), Gaps = 76/460 (16%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVE--PFGSFVSNLFSRWGDLDISIELSN 75
L P E+ R + + LR ++ + ++ A ++ FGS +NL + D+D+
Sbjct: 58 LKPSNEEVSRRQQFVDKLRTILST--EIKDAKLDLFVFGSTENNLAIQQSDVDV------ 109
Query: 76 GSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDN 135
CI + G K S L L G +++ V+ ARVPI+K +I CD++I+N
Sbjct: 110 --CIITNGSKYLNSTCQ--LAQLLYSYGMKQIVCVSRARVPIVKIWDPQFDIHCDLNINN 165
Query: 136 LCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLFHFQT 194
+I +K L ID R R + L++K WAK + + +GT SY++S +++ QT
Sbjct: 166 DVAKINTKMLRLFVSIDPRVRPLGLIIKYWAKQRALCDAAGSGTITSYTISCMLVNFLQT 225
Query: 195 CVPAILPPLKDIYPGN----LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSL 250
P ILP + D+ + VDD+ G + A +N++SL
Sbjct: 226 RNPPILPAMLDLMSNDDNKMFVDDIVGFKEKA---------------------TLNKTSL 264
Query: 251 AHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSAR 310
L + F + G S + + +G + + + N+ L +E+PF N A
Sbjct: 265 GRLLIDFF-YYYGFSFNYLDSVVSVRSGTVLNKQEKGWAMEVNNSLCVEEPFNTARNLAN 323
Query: 311 AVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYANYNNGHRR 370
++ + + F+ FRL S N L L E+ Y N N
Sbjct: 324 TADNPSVKGLQSEFQRA-FRLMSENNACERLCKICEEYQFLDITNEA--NYGNTNTPFNT 380
Query: 371 AR----------------------PQSHKSVNS--------PLQAQHQSHNAKKENRPNR 400
A PQ S P ++ HQS+ K NR +
Sbjct: 381 AYESFGCNHTVLPEAAAYPKPYYPPQITLSDGGNMNFLYYIPDESNHQSY-ENKANRDSD 439
Query: 401 SMSQQSVQQHQSQPVRQINGQVQQIWRPKSDGSQQPSPAN 440
Q S+ Q + P I Q +W P D S P+ N
Sbjct: 440 FQGQTSLTQGSAPPWHYIPCQSWLVWYPSEDAS-NPASGN 478
>gi|330791565|ref|XP_003283863.1| hypothetical protein DICPUDRAFT_26598 [Dictyostelium purpureum]
gi|325086249|gb|EGC39642.1| hypothetical protein DICPUDRAFT_26598 [Dictyostelium purpureum]
Length = 255
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 121/259 (46%), Gaps = 35/259 (13%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
GY+++ + ARVPI+KF+ N+SCDI ++NL ++ + S+ID R + +V +V
Sbjct: 6 GYQKILAIPTARVPIVKFKDPSTNLSCDICMNNLLAIYNTRLVQDYSKIDERMKPLVYVV 65
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP----------GNLV 212
K WAK IN P GT +SY LV+ QT P ILP L+++ G L+
Sbjct: 66 KRWAKRRKINEPSLGTLSSYGYINLVISFLQTRDPPILPCLQELANGPKIINGKEYGELL 125
Query: 213 DD--LKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLA----HLFVSFLEKFSGLSL 266
DD + G I+++ F + N+ SL H F ++ +FS ++
Sbjct: 126 DDVMVDGFNCKYYNDISKLIGFGLQ----------NKESLGSLVFHFFQTYSREFSFMNQ 175
Query: 267 KASELGICPF---TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
S P + WE I + + IEDPFE N AR V+ NL+ I +
Sbjct: 176 VVSIRNGSPILKSSKTWESISKKSHYW-----FSIEDPFEITHNLARVVNRPNLSIIISE 230
Query: 324 FEMTHFRLTSTNQTRYALL 342
++L S N + +L
Sbjct: 231 LNRA-YKLLSKNSNLHKVL 248
>gi|334185748|ref|NP_001190015.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|332644546|gb|AEE78067.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 614
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 115/245 (46%), Gaps = 46/245 (18%)
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVK---------QSLLGDLL-------------- 95
+++S DLD+SI NG+ KK++ +SL G +
Sbjct: 2 DMYSSQSDLDVSINFGNGTSEIPREKKLEILKRFAKKLRSLQGKIFVPFFLLLSCMPIPL 61
Query: 96 ------RALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
+ ++G + ++ + A+VPI+KF + CD+S++N G + S+ + IS
Sbjct: 62 LFSIYTKNPNREGQVKNVESIFSAKVPIVKFSDQGTGVECDLSVENKDGILNSQIVRIIS 121
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
QIDGRF+ + LLVK WAKAH++N+ T NS S++LLV H QT P ILPP +
Sbjct: 122 QIDGRFQKLCLLVKHWAKAHEVNSALHRTLNSVSITLLVALHLQTQNPPILPPFSMLLKD 181
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS 269
+ D V A++ + + + N+ SL LF +F K +
Sbjct: 182 GM--DPPNVEKRAQKFL--------------NWGQRNQESLGRLFATFFIKLQSVEFLWR 225
Query: 270 ELGIC 274
+ G+C
Sbjct: 226 Q-GLC 229
>gi|297819098|ref|XP_002877432.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323270|gb|EFH53691.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 189
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/108 (42%), Positives = 73/108 (67%)
Query: 100 QKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMV 159
++G R ++ + ARVPI+KF + +I CD+S++N G +KS+ + ISQ DG+F+ +
Sbjct: 53 REGHVRNVESIFTARVPIVKFCDLGTSIECDLSVENKVGNLKSQIIRIISQTDGKFQKLC 112
Query: 160 LLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+LVK WAKAH++N+ T NS+S++LL H QT P+ILPP ++
Sbjct: 113 MLVKHWAKAHEVNSTLHRTLNSFSITLLAALHLQTQNPSILPPFSTLF 160
>gi|194743090|ref|XP_001954033.1| GF18072 [Drosophila ananassae]
gi|190627070|gb|EDV42594.1| GF18072 [Drosophila ananassae]
Length = 709
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 136/297 (45%), Gaps = 41/297 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +SN S+ D+DI + I + V L+R L + + A
Sbjct: 323 GSSISNFGSKCSDMDICMLACTNPNIDPRMEAVYNL---QLMRELLNPTNVFQDFNLIEA 379
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ++ R R M L VK+WA+ H+INN
Sbjct: 380 RVPILRFTDRQHKVEVDINFNNSVGIRNTHLLYCYSQLEWRVRPMALTVKQWAQYHNINN 439
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
K T +SYSLSL+V+ Q V P ++P L +YP + + C F
Sbjct: 440 AKNMTISSYSLSLMVIHFLQAGVNPPVIPCLHKLYPDKF-------------GLLQPCDF 486
Query: 233 NIARFSS--DKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWEHI 283
+ Y+ N SL L +SFL +S +S++ G+ P E
Sbjct: 487 GYVDMNEVMGPYQSENNQSLGELMLSFLHYYSIFEYGKYAISIRVG--GVLPV----EVC 540
Query: 284 RSNTRWLPNN-----HPLFIEDPFEQPENSARAVSE-KNLAKISNAFEMTHFRLTST 334
R++ P N + L IE+PF+Q N+AR+V + + +I F ++ RL ST
Sbjct: 541 RASNA--PKNDIHQWNELCIEEPFDQ-TNTARSVYDSETFERIRAIFLASYRRLEST 594
>gi|159474620|ref|XP_001695423.1| polyadenylate polymerase beta nucleotidyltransferase [Chlamydomonas
reinhardtii]
gi|158275906|gb|EDP01681.1| polyadenylate polymerase beta nucleotidyltransferase [Chlamydomonas
reinhardtii]
Length = 711
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 167/375 (44%), Gaps = 48/375 (12%)
Query: 2 GSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
S + L +L+ ++ +P + + R ++I ++ ++ ++ V+P+GSFVS +
Sbjct: 5 ASADELAAVLEQVVQATSPTVDAYRMRTRLIDSIQGALKHRIGVQDLHVQPYGSFVSQQY 64
Query: 62 SRWGDLDISI---------------ELSNGSC------ISSAGKKVKQSLLGDLLRALRQ 100
+ DLD+++ E+ G + K+ K +LL D L
Sbjct: 65 NAGSDLDLALCGYIPAAKLKPAALAEIYRGEPEEELVPLHKLDKRTKANLLRDAGYRLAG 124
Query: 101 KGGYRR--LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
G R ++FV HARVPI+KF I D+ + N K+ + +++I+ F +
Sbjct: 125 SGVASRDSMEFVLHARVPIVKFADRATGIEVDLCLGNAATSFKAWSVARVAEINPAFGRL 184
Query: 159 VLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ---TCVPAILPPLKD-IYPGNLVDD 214
+VK WAKAH IN+ + FNS+ L+L+V++ Q + A+LPPL + +Y +D
Sbjct: 185 YKVVKLWAKAHGINDGASHMFNSWCLTLVVMYFLQQYPSREQALLPPLCELLYEKRPAED 244
Query: 215 LKGVRANAERQIAEICAFNIARFS--SDKYRKINRSSLAHLFVSFLEK----FSGLSLKA 268
+ E+ R S + Y L LF F+++ GL L A
Sbjct: 245 SPRLMQKGAELPPEVLKVMEQRASQAAKVYGARPCPPLLELFRDFVQQCGRNLRGL-LAA 303
Query: 269 SE-----LGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE-----KNLA 318
E I F GQ H +R + + L +EDP++ +N+AR +
Sbjct: 304 QESFRRGTRISAFYGQLLH----SRPYASQYVLCVEDPYDDNDNTARTFGTWDGHPGTIH 359
Query: 319 KISNAFEMTHFRLTS 333
+++ FE T RL S
Sbjct: 360 YVTSVFERTARRLNS 374
>gi|281211278|gb|EFA85443.1| Regulator of nonsense transcripts-like protein [Polysphondylium
pallidum PN500]
Length = 1412
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 143/310 (46%), Gaps = 47/310 (15%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQS-----LLGDLLRALRQK 101
G+ ++P+GSFV+ + + D+D+ C S G + L+ + +++
Sbjct: 1114 GSILKPYGSFVNGVQTASSDIDL--------CFSVVGVSTDTNDKLFHLMKRVALRIKKN 1165
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
Y+ + + ARVPI+KF+ I IS D+ +N S + + ID R + ++LL
Sbjct: 1166 TSYQLEKIIRFARVPIIKFKDIENEISFDMCFNNSMPVGNSLLIKEYTMIDVRAKVLMLL 1225
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK----DIYPGNLVDDLKG 217
+K WA DIN+ GT +SYS L+V+F+ Q+ P +LP L+ + P N +
Sbjct: 1226 IKYWASRKDINDASMGTLSSYSWLLMVIFYLQSINPPVLPNLQSTLINTAPKNAI----- 1280
Query: 218 VRANAERQIAEICAFNIARFSSDK---YRKINRSSLAHLFVSFLEKFSGLSLKASELGIC 274
+ ++ +R + F S + ++ N SL LF F +S S L I
Sbjct: 1281 ISSSEDRWL----------FLSSQALNFKSTNTMSLFQLFSGFFSFYSRFDF--SNLLIT 1328
Query: 275 PFTGQWEHIRSNTR-WLPNN--HPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRL 331
G +IR T+ +L NN + IEDPF N A +V NAF++ + L
Sbjct: 1329 IKQGCLTNIRMATKIFLDNNGKQNICIEDPFTPQNNPAASVGR-------NAFDVILYEL 1381
Query: 332 TSTNQTRYAL 341
S Q AL
Sbjct: 1382 KSAEQKLSAL 1391
>gi|195053534|ref|XP_001993681.1| GH21064 [Drosophila grimshawi]
gi|193895551|gb|EDV94417.1| GH21064 [Drosophila grimshawi]
Length = 578
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 156/345 (45%), Gaps = 44/345 (12%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
R ++T+M++ + +V +V G + GS +S + D+DI + + + +
Sbjct: 189 RTIFKTKMRLWRFIYKVTMAVYPRYGVYL--VGSSISFFGCKCSDMDICMLACTNANMDT 246
Query: 82 AGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ + L+ ++L A Q + ++ ARVPIL+F N+ DI+ +N G
Sbjct: 247 RTEAIYHLQLMREMLNATEQFQDFNLIE----ARVPILRFMDRRHNVEVDINFNNSVGIR 302
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAI 199
+ L+ SQ++ R R + L +K+WA+ H+INN K T +SYSL L+V+ Q V P +
Sbjct: 303 NTHLLYCYSQLEWRLRPIALTIKQWAQHHNINNAKNMTISSYSLMLMVIHFLQAGVNPPV 362
Query: 200 LPPLKDIYPGNLV----DDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
LP L +YP +D V N EI Y+ NR SL L +
Sbjct: 363 LPCLHKMYPEKFCILQPNDFGYVDMN------EIMP---------PYKSENRQSLGELLL 407
Query: 256 SFLEKFS-------GLSLKASELGICPFTGQWEHIRSNTRWLPNNH---PLFIEDPFEQP 305
FL+ +S +S++ G+ P E R+ + H L IE+PF+
Sbjct: 408 GFLQYYSIFDYGKFAISIRIG--GMLPV----ESCRAAKAVKNDVHQWNQLCIEEPFDL- 460
Query: 306 ENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFI 350
N+AR+V + + K A +T + L + ++ P I
Sbjct: 461 TNTARSVYDAEIFKRIRAIFLTSYNLLESTHNLISIFDGYEGPVI 505
>gi|223944817|gb|ACN26492.1| unknown [Zea mays]
gi|414884677|tpg|DAA60691.1| TPA: hypothetical protein ZEAMMB73_903036 [Zea mays]
Length = 251
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 103/189 (54%), Gaps = 5/189 (2%)
Query: 16 GMLNPLREDWETRMKVISDLREVV-ESVESLRGA-TVEPFGSFVSNLFSRWGDLDISIEL 73
+L P D++ R ++ R++V + S G+ VEPFGSF +LF+ DLD+SI
Sbjct: 55 AILRPKPLDYDQRNTLVDVFRKMVNQRFGSNSGSPVVEPFGSFTMDLFTPHSDLDLSINF 114
Query: 74 SNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
S + K++ + +L + ++ G + + + ARVPILK + CDI
Sbjct: 115 SANTDEQYTRKQMISIIKKFSKVLFSYQRSGIFCGVLPIVSARVPILKVIDRGTGVECDI 174
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
S++N G +S + ++S +D RF+ + LVK WAK HD+N P+ T +S S+ LV FH
Sbjct: 175 SVENKDGMTRSMIIKFVSSLDERFQILSYLVKFWAKVHDLNTPRQLTMSSMSIISLVAFH 234
Query: 192 FQTCVPAIL 200
Q C P IL
Sbjct: 235 LQ-CRPGIL 242
>gi|195572796|ref|XP_002104381.1| GD20928 [Drosophila simulans]
gi|194200308|gb|EDX13884.1| GD20928 [Drosophila simulans]
Length = 1338
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 142/318 (44%), Gaps = 50/318 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S S+ D+DI + + + V L + L + ++ + A
Sbjct: 956 GSSISYFGSKCSDMDICMLACTNPNVDPRTEAVYH--LHVMKELLGRTNMFQDFNLI-EA 1012
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ+D R R M L VK+WA+ H+INN
Sbjct: 1013 RVPILRFTDRCHKVEVDINFNNSVGIRNTHLLYCYSQLDWRVRPMALTVKQWAQYHNINN 1072
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL----VDDLKGVRANAERQIAE 228
K T +SYSL L+V+ Q P +LP L ++YP +D V N E
Sbjct: 1073 AKNMTISSYSLMLMVIHFLQVGASPPVLPCLHNLYPDKFGLLQPNDFGYVDMN------E 1126
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWE 281
+ A Y+ N SL L + FL +S +S++ G+ P E
Sbjct: 1127 VMA---------PYQSDNSQSLGDLLLGFLRYYSVFEYGKYAISIRVG--GVLPI----E 1171
Query: 282 HIRSNT-------RWLPNNHPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTS 333
R+ T +W+ L IE+PF+Q N+AR+V + + +I F ++ RL S
Sbjct: 1172 VCRAATAPKNDIHQWI----ELCIEEPFDQ-TNTARSVYDTDTFERIKTIFVASYRRLES 1226
Query: 334 TNQTRYALLSSLARPFIL 351
T R A+ P IL
Sbjct: 1227 TRNLR-AIFEEYDGPTIL 1243
>gi|195330939|ref|XP_002032160.1| GM26408 [Drosophila sechellia]
gi|194121103|gb|EDW43146.1| GM26408 [Drosophila sechellia]
Length = 1338
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 142/318 (44%), Gaps = 50/318 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S S+ D+DI + + + V L + L + ++ + A
Sbjct: 956 GSSISYFGSKCSDMDICMLACTNPNVDPRTEAVYH--LHVMKELLGRTNMFQDFNLI-EA 1012
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ+D R R M L VK+WA+ H+INN
Sbjct: 1013 RVPILRFTDRCHKVEVDINFNNSVGIRNTHLLYCYSQLDWRVRPMALTVKQWAQYHNINN 1072
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL----VDDLKGVRANAERQIAE 228
K T +SYSL L+V+ Q P +LP L ++YP +D V N E
Sbjct: 1073 AKNMTISSYSLMLMVIHFLQVGASPPVLPCLHNLYPDKFGLLQPNDFGYVDMN------E 1126
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWE 281
+ A Y+ N SL L + FL +S +S++ G+ P E
Sbjct: 1127 VMA---------PYQSDNSQSLGDLLLGFLRYYSVFEYGKYAISIRVG--GVLPI----E 1171
Query: 282 HIRSNT-------RWLPNNHPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTS 333
R+ T +W+ L IE+PF+Q N+AR+V + + +I F ++ RL S
Sbjct: 1172 VCRAATAPKNDIHQWI----ELCIEEPFDQ-TNTARSVYDTDTFERIKTIFVASYRRLES 1226
Query: 334 TNQTRYALLSSLARPFIL 351
T R A+ P IL
Sbjct: 1227 TRNLR-AIFEEYDGPTIL 1243
>gi|45550788|ref|NP_651012.2| Gld2, isoform B [Drosophila melanogaster]
gi|442620418|ref|NP_001262829.1| Gld2, isoform C [Drosophila melanogaster]
gi|74868425|sp|Q9VD44.3|GLD2A_DROME RecName: Full=Poly(A) RNA polymerase gld-2 homolog A; Short=DmGLD2
gi|45446588|gb|AAF55959.3| Gld2, isoform B [Drosophila melanogaster]
gi|440217741|gb|AGB96209.1| Gld2, isoform C [Drosophila melanogaster]
Length = 1364
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 143/316 (45%), Gaps = 46/316 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S S+ D+DI + I S + V L + L + ++ + A
Sbjct: 981 GSSISYFGSKCSDMDICMLACTNPNIDSRMEAVYH--LHVMKELLGRTNMFQDFNLI-EA 1037
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ+D R R M L VK+WA+ H+INN
Sbjct: 1038 RVPILRFTDRCHKVEVDINFNNSVGIRNTHLLYCYSQLDWRVRPMALTVKQWAQYHNINN 1097
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL----VDDLKGVRANAERQIAE 228
K T +SYSL L+V+ Q P +LP L ++YP +D V N E
Sbjct: 1098 AKNMTISSYSLMLMVIHFLQVGASPPVLPCLHNLYPEKFGLLQPNDFGYVDMN------E 1151
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWE 281
+ A Y+ N +L L +SFL +S +S++ G+ P E
Sbjct: 1152 VMA---------PYQSDNSQTLGDLLLSFLHYYSVFDYGKYAISIRVG--GVLPI----E 1196
Query: 282 HIRSNTRWLPNN-----HPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTSTN 335
R+ T P N + L IE+PF+Q N+AR+V + + +I F ++ RL ST
Sbjct: 1197 VCRAAT--APKNDIHQWNELCIEEPFDQ-TNTARSVYDTDTFERIKTIFVASYRRLDSTR 1253
Query: 336 QTRYALLSSLARPFIL 351
A+ P IL
Sbjct: 1254 NLS-AIFEDYDGPTIL 1268
>gi|202027840|gb|ACH95257.1| AT19242p [Drosophila melanogaster]
Length = 1366
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 143/316 (45%), Gaps = 46/316 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S S+ D+DI + I S + V L + L + ++ + A
Sbjct: 983 GSSISYFGSKCSDMDICMLACTNPNIDSRMEAVYH--LHVMKELLGRTNMFQDFNLI-EA 1039
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ+D R R M L VK+WA+ H+INN
Sbjct: 1040 RVPILRFTDRCHKVEVDINFNNSVGIRNTHLLYCYSQLDWRVRPMALTVKQWAQYHNINN 1099
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL----VDDLKGVRANAERQIAE 228
K T +SYSL L+V+ Q P +LP L ++YP +D V N E
Sbjct: 1100 AKNMTISSYSLMLMVIHFLQVGASPPVLPCLHNLYPEKFGLLQPNDFGYVDMN------E 1153
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWE 281
+ A Y+ N +L L +SFL +S +S++ G+ P E
Sbjct: 1154 VMA---------PYQSDNSQTLGDLLLSFLHYYSVFDYGKYAISIRVG--GVLPI----E 1198
Query: 282 HIRSNTRWLPNN-----HPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTSTN 335
R+ T P N + L IE+PF+Q N+AR+V + + +I F ++ RL ST
Sbjct: 1199 VCRAAT--APKNDIHQWNELCIEEPFDQ-TNTARSVYDTDTFERIKTIFVASYRRLDSTR 1255
Query: 336 QTRYALLSSLARPFIL 351
A+ P IL
Sbjct: 1256 NLS-AIFEDYDGPTIL 1270
>gi|302784064|ref|XP_002973804.1| hypothetical protein SELMODRAFT_100054 [Selaginella moellendorffii]
gi|300158136|gb|EFJ24759.1| hypothetical protein SELMODRAFT_100054 [Selaginella moellendorffii]
Length = 341
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 138/310 (44%), Gaps = 25/310 (8%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ R K S L + E L G + FGS V+ D+D+
Sbjct: 26 LIPTEEEEVRRRKFFSKLESLFE--RELPGTRLFLFGSCVNAFGVCNSDIDV-------- 75
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C+S ++ + L + + + +Q + HARVPI+KF ISCDI ++N
Sbjct: 76 CLSVDEEEPNKIELVVQMATILESDAMLNVQALTHARVPIVKFTEPATGISCDICVNNTL 135
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ SK L +QID R R + +VK WAK +N+ GT +SY+ L+ + Q P
Sbjct: 136 AVVNSKLLHDYAQIDVRLRQLAFMVKHWAKRRQVNDTYRGTLSSYAYVLMCIHFLQQRRP 195
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSF 257
ILP L+++ P V + +R Q+ + F +D N+ +L L +F
Sbjct: 196 PILPCLQEMRPTYEV-KVGSIRCAYYDQVE-----TLRDFGAD-----NKETLGELLTAF 244
Query: 258 LEKFSGLSLKASELGICPFTGQW--EHIRSNTRWLPNN-HPLFIEDPFEQPENSARAVSE 314
+ + + I TG + ++ + TR + N H + IEDPFE + R V +
Sbjct: 245 FD-YWACQHDYNHSVISVRTGGYLSKNEKEWTRRIGNERHLICIEDPFEVTHDLGRVVDK 303
Query: 315 KNLAKISNAF 324
++ + F
Sbjct: 304 HSIKALRAEF 313
>gi|302803680|ref|XP_002983593.1| hypothetical protein SELMODRAFT_118423 [Selaginella moellendorffii]
gi|300148836|gb|EFJ15494.1| hypothetical protein SELMODRAFT_118423 [Selaginella moellendorffii]
Length = 341
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 138/310 (44%), Gaps = 25/310 (8%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ R K S L + E L G + FGS V+ D+D+
Sbjct: 26 LIPTEEEEVRRRKFFSKLESLFE--RELPGTRLFLFGSCVNAFGVCNSDIDV-------- 75
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C+S ++ + L + + + +Q + HARVPI+KF ISCDI ++N
Sbjct: 76 CLSVDEEEPNKIELVVQMATILESDAMLNVQALTHARVPIVKFTEPATGISCDICVNNTL 135
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ SK L +QID R R + +VK WAK +N+ GT +SY+ L+ + Q P
Sbjct: 136 AVVNSKLLHDYAQIDVRLRQLAFMVKHWAKRRQVNDTYRGTLSSYAYVLMCIHFLQQRRP 195
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSF 257
ILP L+++ P V + +R Q+ + F +D N+ +L L +F
Sbjct: 196 PILPCLQEMRPTYEV-KVGSIRCAYYDQVE-----TLRDFGAD-----NKETLGELLTAF 244
Query: 258 LEKFSGLSLKASELGICPFTGQW--EHIRSNTRWLPNN-HPLFIEDPFEQPENSARAVSE 314
+ + + I TG + ++ + TR + N H + IEDPFE + R V +
Sbjct: 245 FD-YWACQHDYNHSVISVRTGGYLSKNEKEWTRRIGNERHLICIEDPFEVTHDLGRVVDK 303
Query: 315 KNLAKISNAF 324
++ + F
Sbjct: 304 HSIKALRAEF 313
>gi|198468732|ref|XP_002134103.1| GA26658 [Drosophila pseudoobscura pseudoobscura]
gi|198146546|gb|EDY72730.1| GA26658 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/241 (29%), Positives = 124/241 (51%), Gaps = 28/241 (11%)
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
K+ + ++ +L +++ +K R + ARVPIL+F+ I I D++ +N G + +
Sbjct: 126 KRAEALIILNLFQSVLKKTVVFRDFNLIEARVPILRFKDILNAIEVDLNFNNCVGIMNTY 185
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPP 202
L +Q+D R R +V++VK WA+ HDIN+ K T +SYSL L+VL + Q C P +LP
Sbjct: 186 LLQLYAQLDWRTRPLVVVVKLWAQYHDINDAKRMTISSYSLVLMVLHYLQNGCTPHVLPC 245
Query: 203 LKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS-DKYRKINRSSLAHLFVSFLEKF 261
L+ +YP Q+ + F++ + D Y NR +L LF+ F + +
Sbjct: 246 LQTLYPEKF-------------QLGQQDCFDLNLIETIDPYPTQNRQTLGELFLGFFKYY 292
Query: 262 SGLSLKASEL-----GICPFTG--QWEHIRSN-TRWLPNNHPLFIEDPFEQPENSARAVS 313
S + + G+ P + Q + +++ +W L IE+PF+ N+AR+V
Sbjct: 293 SSFDFRNHAISVRTGGVLPVSACRQAKSFKNDPYQW----KELNIEEPFDL-SNTARSVY 347
Query: 314 E 314
+
Sbjct: 348 D 348
>gi|119713752|gb|ABL97801.1| hypothetical protein ALOHA_HF1029C11.0028 [uncultured marine
bacterium HF10_29C11]
Length = 677
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 133/298 (44%), Gaps = 49/298 (16%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYR 105
+ A VE FGS V+NL GDLD+ + N + +KV + + G L + G
Sbjct: 52 KNAMVEAFGSSVTNLSIGTGDLDLCLSFKNKTP-----RKVLRKISGVL-----HEEGME 101
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+Q + AR+PI+KF+ + DIS+DN S L +Q D R R +V +VK W
Sbjct: 102 NIQLIPKARIPIVKFKDPRSGLDVDISLDNRLAIYNSHLLKSYAQED-RLRRLVHMVKYW 160
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
A INN G+ +SY+ +LL + H Q PA+ P ++ P + +G
Sbjct: 161 ASRRGINNAFDGSLSSYAWTLLTIQHAQLVQPALAPNRQENCPSKPL-SFQG-------- 211
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFT 277
F++ F+ D ++ N SLA L +SF ++++ +S++ +
Sbjct: 212 ----KTFDVG-FNDDDFKTENTQSLASLLISFFDRYATRWDWESMVVSIRNGK-AHSTKA 265
Query: 278 GQWEHIRSNTRWLP-----------NNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+W H T LP H + IEDPF+ + +R V + I + F
Sbjct: 266 KKWNH----TGPLPLEVVTGVDDGRMEHVMPIEDPFDHEHDLSRVVRAEGAMSIQDEF 319
>gi|443712902|gb|ELU05986.1| hypothetical protein CAPTEDRAFT_208596 [Capitella teleta]
Length = 456
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 151/322 (46%), Gaps = 30/322 (9%)
Query: 21 LREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
L+ + +MK+ + V++ V G + GS ++ D+D+ + LS+
Sbjct: 146 LQSLYVKKMKLRDAIYAVMKGVFPYCGLYI--VGSSMNGFGDMESDMDLCLMLSHSQI-- 201
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
KK +L L ALR +++ + A+VPIL+F N+ CDI+I+N G
Sbjct: 202 -DQKKDATEILRLLHTALRHCKFLSQVRII-RAKVPILRFVDRISNVECDININNQVGIR 259
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAI 199
+ L SQ+D R +V VK WA+A +IN+ G+ +SYSL L+VL + Q C P +
Sbjct: 260 NTHLLSAYSQMDARIVPLVKTVKRWARAQNINDASQGSVSSYSLVLMVLHYLQYGCSPPV 319
Query: 200 LPPLKDIYPG--NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSF 257
+P L+ YP N D++ + N E Y N S+ LF+ F
Sbjct: 320 IPSLQQKYPHKFNSDQDIRRITLNDEL---------------PTYTSPNEQSIGELFLGF 364
Query: 258 LEKFSGL-SLKASELGICPFTGQWEHI--RSNTRWLPNN-HPLFIEDPFEQPENSARAVS 313
LE ++ + ++ + + T H+ + T PN L IE+PF N+AR+V
Sbjct: 365 LEYYAVIFDFESDCISVRLGTKIPRHVAMKQCTENSPNQWKCLCIEEPFNL-SNTARSVF 423
Query: 314 EKNL-AKISNAFEMTHFRLTST 334
+ + +I + F TH + S+
Sbjct: 424 DITVFQRILHVFRKTHLHIRSS 445
>gi|307106545|gb|EFN54790.1| hypothetical protein CHLNCDRAFT_134748 [Chlorella variabilis]
Length = 826
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 90/341 (26%), Positives = 144/341 (42%), Gaps = 44/341 (12%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG- 65
L+ L+ + L P E+ +M+ ++ ++++ GA V FGS + L R
Sbjct: 480 LDAALRQMADSLMPTPEERAAQMEAFEWVKSLLQA--RYPGAGVHLFGSVANGLSVRHNN 537
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
D+D+ +EL AGK ++G+L+ A G + + ARVP++KF
Sbjct: 538 DIDVCLELEG--VDDQAGKAEVAGVVGELMEA----AGMAEVLPLPKARVPVVKFVVPRT 591
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
D++++NL I +K + ID R +V LVK WAK +N+P GT +SY
Sbjct: 592 GTKVDVTVNNLLACINTKLVADYCAIDARLAALVALVKHWAKQRAVNDPYRGTLSSYCYV 651
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI 245
L+ + QT +LP L+ + P R + E C NI + +
Sbjct: 652 LMCIHLLQTRPTPVLPALQQLQP--------TFRRAVGQWTCEFCD-NIEALRG--FGAV 700
Query: 246 NRSSLAHLFVSFLEKFS-----GLSLKASELGICPF---------TGQWEHIRSN----- 286
N SLA L +F E ++ + + LG C G H+ S
Sbjct: 701 NCESLAQLVWAFFEYWAWRHNYSHDVVSVRLGACLHKDDKDWTRRIGNERHLASGGPAAC 760
Query: 287 ---TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
R LP++ + IEDPFE + R V + A + F
Sbjct: 761 PPACRCLPSD--VCIEDPFELSHDLGRTVDRQTRAVLHKEF 799
>gi|17508045|ref|NP_491834.1| Protein MUT-2, isoform a [Caenorhabditis elegans]
gi|351062121|emb|CCD70041.1| Protein MUT-2, isoform a [Caenorhabditis elegans]
Length = 441
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 157/364 (43%), Gaps = 60/364 (16%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+N+L ++D +E++ +M L+ ++ + P GS V+ L +
Sbjct: 42 DFNILSISMQDHFDTTKQPKEEFGKKMDWCYQLKNIISKNNPTWLFNIVPTGSTVTGLAT 101
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF------------- 109
+ DLD++I I A + ++Q G + ++ +R +Q
Sbjct: 102 KNSDLDVAIH------IPQAARVLEQEERGRNITDDERQASWREIQLEILQIVRLNLQND 155
Query: 110 --------------VAHARVPILKFETIHQNISCDISI--DNLCGQIKSKFLF-WISQID 152
+ A++ ILK T+ I CDIS+ D + + FL ++ ID
Sbjct: 156 EQINSRINWEHGIQLVQAQIQILKVMTV-DGIDCDISVVMDRFLSSMHNSFLIRHLAHID 214
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGN 210
GRF + +VK+WA + + +PK G FNSY+L LLV+ HF C P ILP L++I+ +
Sbjct: 215 GRFAPLCAIVKQWAASTKVKDPKDGGFNSYALVLLVI-HFLQCGTFPPILPNLQEIFKKD 273
Query: 211 LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASE 270
A ++ I F N + LA LF+ FL +S + K +
Sbjct: 274 ------NFIAWDDKVYPSILNFGAPLPKPLPRIAPNNAPLARLFIEFLYYYSMFNFKENY 327
Query: 271 LGICPF------TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+G P T Q +RS+T N + I+DPF++ N R V + L +I +
Sbjct: 328 IGARPVMVMDRRTSQNNMVRSST-----NKEVCIQDPFDE-HNPGRTV--RTLNRIKDVM 379
Query: 325 EMTH 328
T+
Sbjct: 380 RSTY 383
>gi|213405609|ref|XP_002173576.1| caffeine-induced death protein [Schizosaccharomyces japonicus
yFS275]
gi|212001623|gb|EEB07283.1| caffeine-induced death protein [Schizosaccharomyces japonicus
yFS275]
Length = 445
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 148/341 (43%), Gaps = 51/341 (14%)
Query: 7 LEPILKDILGMLNPLR-EDWETRMK--VISDLREVVESVESLRGATVEPFGSFVSNLFSR 63
L+P+ +L + +R D E R K +++ L+ VV SV A + FGS S L +
Sbjct: 46 LDPVTCFLLSTYDDVRVSDDELREKDAIMNLLKHVVHSVRP--EADIVAFGSIQSGLALK 103
Query: 64 WGDLDISIELSNGSCISSAGKKVKQ--SLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
D+D I L + G+++++ S + AL +G Y R AR+PI+K
Sbjct: 104 NSDIDACILLPD------IGEEMEEFASECFERFTALGFEGKYLR-----KARIPIIKLL 152
Query: 122 TIHQN-----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
+ +N CDI +N + L S ID R + + +LVK WAK IN+P
Sbjct: 153 SDTKNRYYYGFQCDIGFNNQLAIYNTSLLHQYSLIDPRCKQLAILVKYWAKQKRINSPYY 212
Query: 177 GTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIA 235
GT +SY L+VLF+ V PA+LP L+D + +Q + FN+
Sbjct: 213 GTLSSYGYVLMVLFYLIHVVRPAVLPNLQD---------------SPHKQDLYVEGFNVG 257
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKF--------SGLSLKASELGICPFTGQW----EHI 283
+ N SL L F F S +S++ + W EH
Sbjct: 258 FVRGTTVARRNTESLPQLLAGFYGFFAHEFNYRESVISIRQPGGLLKKVDKDWTLAKEHT 317
Query: 284 RSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
S + + + + L IEDPFE N R VS+ L +I F
Sbjct: 318 GSADQVIKDRYVLAIEDPFEITHNVGRTVSKAGLFEIRGEF 358
>gi|334185750|ref|NP_001190016.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|332644548|gb|AEE78069.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 447
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 110/209 (52%), Gaps = 23/209 (11%)
Query: 20 PLREDWETRMKVISDLREVV-----ESVESLRGATVEPFGSFVSNLFSRWGDLDISIELS 74
P+ D+ TR +++ +L + +S ES +E +GSF N FS DLD+SI S
Sbjct: 56 PVSADYNTRKELVKNLNAMAIDIFGKSEES--SPVLEAYGSFAMNTFSSQKDLDVSINFS 113
Query: 75 NGSCISSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISI 133
+G+ KK++ + LR+L +G R + + ARVPI++F I CD+++
Sbjct: 114 SGTSEFYREKKLEILTRFATKLRSLEGQGFVRNVVPILSARVPIVRFCDQGTGIECDLTV 173
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLL---------------VKEWAKAHDINNPKTGT 178
++ G + S+ + ISQID RF+ + LL +K WA+AH +NN T
Sbjct: 174 ESKDGILTSQIIRIISQIDDRFQKLCLLHCQLFSTNVNIVICQIKHWARAHGVNNASHNT 233
Query: 179 FNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
NS S+++LV H QT P ILPP ++
Sbjct: 234 LNSISITMLVAHHLQTQSPPILPPFSTLF 262
>gi|157112713|ref|XP_001657612.1| poly(a) polymerase cid (pap) (caffein-induced death protein) [Aedes
aegypti]
gi|108877960|gb|EAT42185.1| AAEL006249-PA [Aedes aegypti]
Length = 1143
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 137/300 (45%), Gaps = 45/300 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY---RRLQF- 109
GS +S S D+D+ C+ V + G+ L L Q Y F
Sbjct: 848 GSSISGFASDSSDVDM--------CLVCRSNTVPFDMRGEALFQLGQLKNYFMNINTHFE 899
Query: 110 ---VAHARVPILKF-ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
V A+VPIL+F ET H + D++ +N G + LF SQ+D R R + L+VK W
Sbjct: 900 EFSVIQAKVPILRFRETAHSTV-IDLNFNNSVGIRNTHLLFMYSQLDWRLRPLALVVKLW 958
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAER 224
A+ H+IN+ K T +SYSL L+V+ Q V P +LP L +YP V
Sbjct: 959 AQHHNINDAKNMTISSYSLVLMVIHFLQYGVSPPVLPCLHAMYPDKFV------------ 1006
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGL-----SLKASELGICPFTGQ 279
++++I ++ + D Y N S+L LFV FLE ++ ++ + P
Sbjct: 1007 RMSDISTIDLME-TIDPYSSDNHSTLGELFVQFLEYYANFDYAHYAISVRTASVIPI--- 1062
Query: 280 WEHIRSNTRWLPNNH---PLFIEDPFEQPENSARAVSEKNL-AKISNAFEMTHFRLTSTN 335
E R + + H L IE+PF+ N+AR+V + ++ +I + F RL N
Sbjct: 1063 -ESARVARSYKNDPHHWRQLCIEEPFDL-TNTARSVFDADIFEQIKSVFSTCWRRLKENN 1120
>gi|198426610|ref|XP_002126682.1| PREDICTED: similar to PAP associated domain containing 4 [Ciona
intestinalis]
Length = 713
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 135/278 (48%), Gaps = 27/278 (9%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G++V FG S+ DL + + N + +K +L + + L
Sbjct: 428 LVGSSVNGFGRLNSD-----ADLCLVFDPRNKT----VNRKTVLKMLNRMKQLLNNAHFV 478
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+ LQ + +A VPILKFE + CD++++NL G S L ++ D R R MVL +KE
Sbjct: 479 KNLQLI-YATVPILKFEDRISGMECDLNVNNLTGIRNSFLLLAYARCDPRVRPMVLCIKE 537
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAER 224
WA ++IN+ + GT +SY+L L+VL + Q P ++P + ++ N +L + E+
Sbjct: 538 WAHVNNINSAQLGTLSSYALVLMVLHYLQIVKPRVIPSFQALHKDNFSSNLP-IHCLGEK 596
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR 284
+A + F +S+ N S ++ L + FS + + T H
Sbjct: 597 -VASLPMF----YSN------NTSPVSQLLKGWFNYFSTFDFANKVISVRLGTSYNVHTM 645
Query: 285 SNTR-WLPNNHPLFIEDPFEQPENSARAV-SEKNLAKI 320
SN++ WL + IE+PF+Q N ARAV S K ++I
Sbjct: 646 SNSKAWL--KKCVKIEEPFDQ-TNVARAVQSGKQFSQI 680
>gi|328875539|gb|EGG23903.1| Putative caffeine-induced death protein 1 [Dictyostelium
fasciculatum]
Length = 968
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 127/293 (43%), Gaps = 46/293 (15%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS + + + GD+DI C+ ++ + L + ++ + ++ +
Sbjct: 639 FGSSANGMSLKGGDIDI--------CMLIDDSFGDTDIVIEKLATMLKQNHFTKVLAIPS 690
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+KF+ N+SCDI I+N ++ + S ID R R +V +VK WAK IN
Sbjct: 691 ARVPIVKFKDQVHNLSCDICINNKLAIYNTRLVEDYSCIDDRMRPLVYVVKRWAKRRKIN 750
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD-----------IYPGNLVD-DLKGVRA 220
P TGT +SY+ +V+ Q+ P +LP L+ +Y NL D + G
Sbjct: 751 EPFTGTLSSYAYINMVISFLQSREPPVLPCLQQLAFGATSINGKVYGDNLADVTVDGYNC 810
Query: 221 NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELG 272
+ + F K N+ +L L +F E ++ +S++
Sbjct: 811 KYYNDLHNLTGFG----------KHNKETLGELVFAFFEYYARRFNYVTDVVSIRTGH-- 858
Query: 273 ICPFTGQ-WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
P T + WE I + + IEDPFE N AR V +L I + F
Sbjct: 859 TLPKTSKTWESINKKSHYY-----FSIEDPFEITHNLARVVKRSHLTMIISEF 906
>gi|195390269|ref|XP_002053791.1| GJ23150 [Drosophila virilis]
gi|194151877|gb|EDW67311.1| GJ23150 [Drosophila virilis]
Length = 588
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 152/330 (46%), Gaps = 45/330 (13%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
R ++T+M++ + +V +V G + GS +S S+ D+DI + I
Sbjct: 189 RHIFKTKMRLWRFIYKVTMAVYPRYGVYL--VGSSISYFGSKCSDMDICMLACTNPNIDP 246
Query: 82 AGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ V ++ ++L A Q + ++ ARVPIL+F + DI+ +N G
Sbjct: 247 RMEAVYHLQIMREMLNATEQFQEFNLIE----ARVPILRFTDRRHKVEVDINFNNSVGIR 302
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAI 199
+ L+ SQ++ R R + L +K+WA+ H+INN K T +SYSL L+V+ Q V P +
Sbjct: 303 NTHLLYCYSQLEWRLRPIALTIKQWAQYHNINNAKNMTISSYSLMLMVIHFLQAGVNPPV 362
Query: 200 LPPLKDIYPGNLV----DDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
LP L +YP D V N E+ A Y+ N +L L +
Sbjct: 363 LPCLHKMYPEKFCILQPSDFGYVDMN------EVMA---------PYQSDNHQTLGELLL 407
Query: 256 SFLEKFS-------GLSLKASELGICPFTGQWEHIRSNTRWLPNNH---PLFIEDPFEQP 305
SFL +S +S++ G+ P E R++ + H L IE+PF+
Sbjct: 408 SFLHYYSIFEYGKFAISIRVG--GVLPV----ETCRASKAVKNDIHQWNELCIEEPFDL- 460
Query: 306 ENSARAVSEKN-LAKISNAFEMTHFRLTST 334
N+AR+V + + +I F ++ RL ST
Sbjct: 461 TNTARSVYDPDTFDRIRAIFLASYSRLEST 490
>gi|414884676|tpg|DAA60690.1| TPA: hypothetical protein ZEAMMB73_903036 [Zea mays]
Length = 262
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 99/183 (54%), Gaps = 4/183 (2%)
Query: 16 GMLNPLREDWETRMKVISDLREVV-ESVESLRGA-TVEPFGSFVSNLFSRWGDLDISIEL 73
+L P D++ R ++ R++V + S G+ VEPFGSF +LF+ DLD+SI
Sbjct: 55 AILRPKPLDYDQRNTLVDVFRKMVNQRFGSNSGSPVVEPFGSFTMDLFTPHSDLDLSINF 114
Query: 74 SNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
S + K++ + +L + ++ G + + + ARVPILK + CDI
Sbjct: 115 SANTDEQYTRKQMISIIKKFSKVLFSYQRSGIFCGVLPIVSARVPILKVIDRGTGVECDI 174
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
S++N G +S + ++S +D RF+ + LVK WAK HD+N P+ T +S S+ LV FH
Sbjct: 175 SVENKDGMTRSMIIKFVSSLDERFQILSYLVKFWAKVHDLNTPRQLTMSSMSIISLVAFH 234
Query: 192 FQT 194
Q
Sbjct: 235 LQV 237
>gi|330801448|ref|XP_003288739.1| hypothetical protein DICPUDRAFT_92151 [Dictyostelium purpureum]
gi|325081215|gb|EGC34739.1| hypothetical protein DICPUDRAFT_92151 [Dictyostelium purpureum]
Length = 312
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 124/268 (46%), Gaps = 32/268 (11%)
Query: 94 LLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDG 153
++ ++ +K Y + ++H R+P++K NI+ D+ ++NL SK + S ID
Sbjct: 24 IVSSILRKNNYENIITLSHTRIPLIKLFDPEYNINIDLCLNNLLAIENSKLIKSYSSIDP 83
Query: 154 RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
RF+ + +L+K WAKA +IN+ + +SYS + LV+F+ QTC P +LP L
Sbjct: 84 RFQVLYMLIKAWAKAKEINDAADESLSSYSYANLVIFYLQTCTPPVLPCLH--------- 134
Query: 214 DLKGVRANAERQIAEICAFNIARFSSDK----YRKINRSSLAHLFVSFLEKFSGLSLKAS 269
K + +R + ++ F D + N ++ LF FL +S K
Sbjct: 135 --KNTESLPKRTVEN----SVVAFHQDPKALGFVTKNTLTVGELFYDFLCFYSTFDFK-- 186
Query: 270 ELGICPFTGQWEHIRSNTR-WLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTH 328
IC G +++ + L ++I+DPF N ++++EKN K+
Sbjct: 187 RYAICINKGHMVELKNCQKELLVAPACIYIQDPFIFDFNPGKSMTEKNFTKL-------- 238
Query: 329 FRLTSTNQTRYALLSSLARPFILQFFGE 356
LT N+T Y + + + Q F E
Sbjct: 239 --LTEINKTIYIISNGIKDYGFDQIFSE 264
>gi|312085976|ref|XP_003144894.1| hypothetical protein LOAG_09318 [Loa loa]
Length = 554
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/313 (29%), Positives = 152/313 (48%), Gaps = 45/313 (14%)
Query: 27 TRMKVISDLREVVESVESLR-GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
R K+++ +R+V + + G+TV GS+ S++ DL I N S +
Sbjct: 249 VRQKLLTLIRQVYKDSNLIAVGSTVNGCGSYNSDM-----DLCICQPYKNHSF------E 297
Query: 86 VKQSLLGDLLRALRQK------GGYRRLQFVAHARVPILKFETI--HQNISCDISIDNLC 137
+S +LR L +K ++ Q++ A+VPI+K E ++ + DI+ +N+
Sbjct: 298 ANRSYSIHVLRKLHKKFVTDWRQMFKTCQYIP-AKVPIIKLEMAAPYEELEIDINCNNVA 356
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC-- 195
G S L + S++D RF + LLVK WA INN GT NSYSL L+VL HF C
Sbjct: 357 GIYNSHLLHYYSRVDDRFPALCLLVKHWAINAGINNAMMGTLNSYSLILMVL-HFLQCGA 415
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
+P +LP L+ +YP NA + + F + R+ N ++ L +
Sbjct: 416 LPPVLPNLQFLYPSLF---------NATCSLDSLELFRDLPYPLPP-REFNTETVGELLI 465
Query: 256 SFLEKFSGLSLKASELGI---CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV 312
+F + ++ K + I C + G+ E + NT +FIE+P++Q +N+AR V
Sbjct: 466 AFFDYYAHFDFKNKAISIRNGCVY-GR-ELLADNTMRF----KIFIEEPYDQ-KNTARCV 518
Query: 313 SE-KNLAKISNAF 324
+ +NL I AF
Sbjct: 519 TSIENLQLIREAF 531
>gi|195165356|ref|XP_002023505.1| GL20158 [Drosophila persimilis]
gi|194105610|gb|EDW27653.1| GL20158 [Drosophila persimilis]
Length = 1338
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/299 (27%), Positives = 140/299 (46%), Gaps = 35/299 (11%)
Query: 84 KKVKQSLLGDLLRA-LRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K+ + ++ +L ++ LR+ +R + ARVPIL+F+ I I D++ +N G + +
Sbjct: 960 KRAEALMILNLFQSVLRKTVVFRDFNLI-EARVPILRFKDILNEIEVDLNFNNCVGIMNT 1018
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILP 201
L +Q+D R R +V++VK WA+ HDIN+ K T +SYSL L+VL + Q C P +LP
Sbjct: 1019 YLLQLYAQLDWRTRPLVVVVKLWAQYHDINDAKRMTISSYSLVLMVLHYLQNGCTPHVLP 1078
Query: 202 PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS-DKYRKINRSSLAHLFVSFLEK 260
L+ +YP Q+ + F++ + D Y N +L LF F +
Sbjct: 1079 CLQTLYPEKF-------------QLGQQDCFDLNLIETIDPYPTQNHQTLGELFQGFFKY 1125
Query: 261 FSGLSLKASEL-----GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE- 314
+S + + G+ P + +SN L IE+PF+ N+AR+V +
Sbjct: 1126 YSCFDFRNHAISVRTGGVLPVSA-CRLAKSNKNDAYQWKELNIEEPFDL-SNTARSVYDF 1183
Query: 315 KNLAKISNAFEMTHFRLTSTNQTRYAL-LSSLARPFILQFFGESPVRYANYNNGHRRAR 372
++ F + S L ++S+ PF F G Y++GH + +
Sbjct: 1184 ATFERVKATF------VASARAVEQTLDINSVFSPF---FLGNQFANQPQYSHGHAQGQ 1233
>gi|358253165|dbj|GAA52296.1| poly(A) RNA polymerase gld-2 homolog A [Clonorchis sinensis]
Length = 972
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 134/287 (46%), Gaps = 31/287 (10%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ S D+D+ + +++ KK ++L LL LR+ R L + A
Sbjct: 684 GSSMNGFGSDESDMDMCLTVTSRDLTQ---KKEAFAVLSQLLPPLRKCSFIRNLHLI-RA 739
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
+VPILKF + CD++++N+ G + L ++ID R R + + VK WA+ DI++
Sbjct: 740 KVPILKFRDTLAGVDCDLNVNNVVGIYNTHLLAMYTRIDWRVRPLGMFVKYWAQRMDIHD 799
Query: 174 PKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPG--NLVDDLKGVRANAERQIAEIC 230
G ++Y L L+++ + Q C P ILP L+ YP N L + E AE+
Sbjct: 800 GSRGRLSTYPLLLMLIHYLQAGCTPPILPNLQAKYPKVFNYARPLCELDMRLELPWAEL- 858
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL 290
R N +SLA LFV F++ ++ + + G I R L
Sbjct: 859 ------------RSNNPASLAELFVGFIQYYTN-EFDFTRWAVSVRHGAPLPIDVAIRRL 905
Query: 291 PNNH--------PLFIEDPFEQPENSARAV-SEKNLAKISNAFEMTH 328
P + +F+E+PF Q N+AR++ + L +I AF TH
Sbjct: 906 PPHEQAHTARSFKVFVEEPFCQ-SNAARSLHGDDVLNRIRQAFIKTH 951
>gi|296089114|emb|CBI38817.3| unnamed protein product [Vitis vinifera]
Length = 989
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 129/286 (45%), Gaps = 35/286 (12%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ + + + I+ + +K L D+L Q + +Q +
Sbjct: 257 YGSCANSFGVSKSDIDVCLAIDDAD-INKSEFLLK---LADIL----QSDNLQNVQALTR 308
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K + ISCDI I+N+ + +K L +QID R R + +VK WAK+ +N
Sbjct: 309 ARVPIVKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRGVN 368
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI-CA 231
GT +SY+ L+ + Q C PAILP L+G++ + +I CA
Sbjct: 369 ETYQGTLSSYAYVLMCIHFLQQCKPAILPC------------LQGMQTTYSVTVDDIQCA 416
Query: 232 FNIARFSSDKYRKINRSSLAHLFVSFLEKFS--------GLSLKASELGICPFTGQWEHI 283
F + N+ S+A L +F ++ +S++ + I W
Sbjct: 417 FFDQVERLRHFGSHNKESIAQLVWAFFNYWAYHHDYANDVISIRTGSI-ISKREKDWTRR 475
Query: 284 RSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHF 329
+ N R H + IEDPFE + R V + ++ + FE +
Sbjct: 476 KGNDR-----HLICIEDPFEISHDLGRVVDKFSIKVLREEFERAAY 516
>gi|393912435|gb|EJD76738.1| hypothetical protein LOAG_16408 [Loa loa]
Length = 430
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/313 (29%), Positives = 152/313 (48%), Gaps = 45/313 (14%)
Query: 27 TRMKVISDLREVVESVESLR-GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
R K+++ +R+V + + G+TV GS+ S++ DL I N S +
Sbjct: 125 VRQKLLTLIRQVYKDSNLIAVGSTVNGCGSYNSDM-----DLCICQPYKNHS------FE 173
Query: 86 VKQSLLGDLLRALRQK------GGYRRLQFVAHARVPILKFETI--HQNISCDISIDNLC 137
+S +LR L +K ++ Q++ A+VPI+K E ++ + DI+ +N+
Sbjct: 174 ANRSYSIHVLRKLHKKFVTDWRQMFKTCQYIP-AKVPIIKLEMAAPYEELEIDINCNNVA 232
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC-- 195
G S L + S++D RF + LLVK WA INN GT NSYSL L+VL HF C
Sbjct: 233 GIYNSHLLHYYSRVDDRFPALCLLVKHWAINAGINNAMMGTLNSYSLILMVL-HFLQCGA 291
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
+P +LP L+ +YP NA + + F + R+ N ++ L +
Sbjct: 292 LPPVLPNLQFLYPSLF---------NATCSLDSLELFRDLPYPLPP-REFNTETVGELLI 341
Query: 256 SFLEKFSGLSLKASELGI---CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV 312
+F + ++ K + I C + G+ E + NT +FIE+P++Q +N+AR V
Sbjct: 342 AFFDYYAHFDFKNKAISIRNGCVY-GR-ELLADNTM----RFKIFIEEPYDQ-KNTARCV 394
Query: 313 SE-KNLAKISNAF 324
+ +NL I AF
Sbjct: 395 TSIENLQLIREAF 407
>gi|358056067|dbj|GAA97964.1| hypothetical protein E5Q_04644 [Mixia osmundae IAM 14324]
Length = 780
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 136/299 (45%), Gaps = 42/299 (14%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQS---LLGDLLRALRQKGG 103
GA + PFGS + R D+D+ C S ++ ++S L+ L R + Q+
Sbjct: 107 GAKLLPFGSMANGFALRNSDMDLC-------CFRSETERPQRSSSELVEILGRIIEQETD 159
Query: 104 YRRLQFVAHARVPILKFET-----IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
+ ++ + AR+PI+K + + CDI DN ++ L +++D R R +
Sbjct: 160 FE-VKMLPRARIPIIKLSKPPSPGVPFGLQCDIGFDNRLAMENTRLLLTYARVDPRLRTI 218
Query: 159 VLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDIYPGNL--VDD 214
VL +K W KA IN+P TGT +SY LLV+ HF T V PA+LP L+ + P V++
Sbjct: 219 VLFLKVWTKARKINSPYTGTLSSYGYVLLVI-HFLTNVRKPAVLPNLQRLPPPRAIPVEE 277
Query: 215 LKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC 274
L E +I ++ + ++ S+ L V+F +S A ++ I
Sbjct: 278 L-------EIDGHDIYFYDDLDSLDAVWSGTSKESVGELLVAFFRYYSQEFRYAWDV-IS 329
Query: 275 PFTGQWEHIRSNTRWLPNNHP-------------LFIEDPFEQPENSARAVSEKNLAKI 320
P T + + W + HP L IEDPF+ N AR V++ + I
Sbjct: 330 PRTEGGILTKESKGWQADLHPEDSLMGGPRELNKLCIEDPFQTDYNVARTVTKDGIFTI 388
>gi|380015769|ref|XP_003691868.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Apis
florea]
Length = 652
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/299 (28%), Positives = 139/299 (46%), Gaps = 39/299 (13%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G+T+ FGS S D+D+ + L + + + + L +L+ L++
Sbjct: 359 LVGSTMNGFGSDNS-------DVDMCL-LVRHTEMDQRNEAIGH--LEQILKCLKRCDFI 408
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+L+ + A+VPILKF QN+ D++ +N G + L+ S+ID R R +VL+VK
Sbjct: 409 EQLELI-QAKVPILKFHDSIQNLEVDLNCNNAVGIRNTHLLYCYSRIDWRVRPLVLVVKL 467
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLV--DDLKGVRAN 221
WA++ DIN+ K T +SYSL L+V+ Q V P +LP L +Y G D+ +
Sbjct: 468 WAQSQDINDAKNMTISSYSLVLMVIHFLQYGVNPPVLPCLHSLYEGKFTPHTDIHCIDIQ 527
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPF 276
E I R NR SL LF+ F + + + P
Sbjct: 528 EELDIP-----------VSVLRPKNRQSLGELFIEFFRYYVMFDFNQYAISVRLANKIPI 576
Query: 277 TGQWEHIRSNTRWLPNNHP---LFIEDPFEQPENSARAVSEKNL-AKISNAFEMTHFRL 331
E R + + H L IE+PF+ N+AR+V + ++ A+I F+ T+ +L
Sbjct: 577 ----EECRRARSYKNDPHQWKYLCIEEPFDL-TNTARSVYDPDVFARIKYVFDCTYQKL 630
>gi|194911252|ref|XP_001982316.1| GG11113 [Drosophila erecta]
gi|190656954|gb|EDV54186.1| GG11113 [Drosophila erecta]
Length = 1345
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 137/299 (45%), Gaps = 45/299 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S S+ D+DI + I + V +++AL + + + A
Sbjct: 962 GSSISYFGSKCSDMDICMLACTNPNIDPRMEAVYHL---QVMKALLSRTDIFQDFNLIEA 1018
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ++ R R M L VK+WA+ H+INN
Sbjct: 1019 RVPILRFTDRCHKVEVDINFNNSVGIRNTHLLYCYSQLEWRVRPMALTVKQWAQYHNINN 1078
Query: 174 PKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPGNL----VDDLKGVRANAERQIAE 228
K T +SYSL L+V+ Q P +LP L +YP +D V N E
Sbjct: 1079 AKNMTISSYSLMLMVIHFLQAGATPPVLPCLHKLYPDKFGLLQPNDFGYVDMN------E 1132
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWE 281
+ A Y+ N SL L ++FL +S +S++ G+ P E
Sbjct: 1133 VMA---------PYQSENSQSLGELLLNFLHYYSVFEYGKYAISIRVG--GVLPI----E 1177
Query: 282 HIRSNTRWLPNN-----HPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTST 334
R+ T P N + L IE+PF+Q N+AR+V + + +I F ++ RL ST
Sbjct: 1178 VCRAAT--APKNDIHQWNELCIEEPFDQ-TNTARSVYDTDTFERIKAIFVASYRRLEST 1233
>gi|297824611|ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp.
lyrata]
gi|297326027|gb|EFH56447.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp.
lyrata]
Length = 757
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 144/313 (46%), Gaps = 29/313 (9%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ E + ++++ L +V + A + +GS ++ D+D+
Sbjct: 438 LIPAEEELEKQRQLMAHLENLV--AKEWPHAKLYLYGSCANSFGFPKSDIDV-------- 487
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C++ G + +S + L + + + +Q + ARVPI+K ISCDI I+N+
Sbjct: 488 CLAIEGDDINKSEMLLKLAEMLESDNLQNVQALTRARVPIVKLMDPVTGISCDICINNVL 547
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ +K L +QID R R + +VK WAK+ +N GT +SY+ L+ + Q P
Sbjct: 548 AVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRRVNETYQGTLSSYAYVLMCIHFLQQRRP 607
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFV 255
ILP L+++ P VR + R CA+ N+ R + + NR ++A L
Sbjct: 608 PILPCLQEMEP------TYSVRVDNIR-----CAYFDNVDRLRN--FGSSNRETIAELVW 654
Query: 256 SFLEKFSGLSLKASELGICPFTGQWEHIRSN--TRWLPNN-HPLFIEDPFEQPENSARAV 312
F ++ A + + TG R TR + N+ H + IEDPFE + R V
Sbjct: 655 GFFNYWAYAHDYAYNV-VSVRTGSILGKREKDWTRRVGNDRHLICIEDPFETSHDLGRVV 713
Query: 313 SEKNLAKISNAFE 325
+ ++ + FE
Sbjct: 714 DKFSIRVLREEFE 726
>gi|170584484|ref|XP_001897029.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158595564|gb|EDP34107.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 747
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 158/345 (45%), Gaps = 57/345 (16%)
Query: 27 TRMKVISDLREVVESVESLR-GATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS--SAG 83
R K+++ +R+V + + G+TV GS+ S++ DL I N S + S
Sbjct: 442 VRQKLLALVRQVYKDSNLIAVGSTVNGCGSYNSDM-----DLCICQPYKNHSFEANRSYS 496
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI--HQNISCDISIDNLCGQIK 141
V + L + RQ ++ Q++ A+VPI+K E ++ + DI+ +N+ G
Sbjct: 497 IHVLRKLHKKFVTDWRQM--FKTCQYIP-AKVPIIKLEMAAPYEELEIDINCNNVAGIYN 553
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAI 199
S L + S++D RF + LLVK WA INN GT NSYSL L+VL HF C +P +
Sbjct: 554 SHLLHYYSRVDDRFPALCLLVKHWAINAGINNAMMGTLNSYSLILMVL-HFLQCGALPPV 612
Query: 200 LPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD-----KYRKINRSSLAHLF 254
LP L+ +YP NA C+ + D R+ N ++ L
Sbjct: 613 LPNLQFLYPSLF---------NA------TCSLDSLELFRDLPHPLPPREFNTETVGELL 657
Query: 255 VSFLEKFSGLSLKASELGI---CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARA 311
++F + F+ K + I C ++ + + NT +FIE+P++Q N+AR
Sbjct: 658 IAFFDYFAHFDFKNKAISIRNGCVYSR--DLLADNTM----RFKIFIEEPYDQ-RNTARC 710
Query: 312 VSE-KNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFG 355
V+ +NL I AF R A L + A P L+ G
Sbjct: 711 VTSIENLQLIREAF----------TSARNAFLQTWAGPPNLECIG 745
>gi|348528609|ref|XP_003451809.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Oreochromis niloticus]
Length = 481
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 129/267 (48%), Gaps = 33/267 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVA 111
GS ++ L SR D DI C+ G K +L LG LL+ + R Q +
Sbjct: 203 GSSMNGLGSRCSDADI--------CLVIKGNKKPDALRVLGRLLKLFKTLSYVERNQLI- 253
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW-ISQIDGRFRDMVLLVKEWAKAHD 170
A+VPIL+F ++ D++++N G I++ FL + D R R M+L++K+WA+ ++
Sbjct: 254 RAKVPILRFREKGSDLEFDLNVNNTVG-IRNTFLLRSYAYADLRVRPMILVIKKWARYNN 312
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG--NLVDDLKGVRANAERQIAE 228
IN+ GT +SY+L L+VL + QT +LP L+ YP N + DL V + I
Sbjct: 313 INDASKGTLSSYTLVLMVLHYLQTLSEPVLPSLQRDYPESFNPLMDLDMV-PEGPKHIP- 370
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
Y N+SSL L + FL K+ + I + ++
Sbjct: 371 ------------PYISRNKSSLGELLLGFL-KYYATEFSWDKQVISVREARAFPKNNSKE 417
Query: 289 WLPNNHPLFIEDPFEQPENSARAVSEK 315
W NN + +E+PFE+ N ARAV EK
Sbjct: 418 W--NNKFICVEEPFER-NNVARAVHEK 441
>gi|395510432|ref|XP_003759479.1| PREDICTED: poly(A) RNA polymerase GLD2 [Sarcophilus harrisii]
Length = 729
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 139/294 (47%), Gaps = 46/294 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH- 112
GS ++ +R D D+ C+ + V Q + +L QK RL ++
Sbjct: 452 GSSLNGFGTRSSDGDL--------CLVVKEEPVNQKTEARYILSLVQKHFCTRLCYIERP 503
Query: 113 ----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL+VK+WA
Sbjct: 504 QLIPAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLESRVRPLVLVVKKWAS 562
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H+IN+ GT NSYSL L+VL + QT ILP L+ YP + +
Sbjct: 563 HHEINDASRGTLNSYSLVLMVLHYLQTLPEPILPSLQKNYPESFSSSM------------ 610
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
++ + A F+ Y NRS+L L + FL+ + A+E ++ Q +R
Sbjct: 611 QLHLVHQAPFTIPPYLSKNRSALGDLLLGFLKYY------ATEFD---WSSQMISVR-EA 660
Query: 288 RWLP-------NNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRLTS 333
+ LP N + +E+PF+ N+ARAV EK I + F + RL S
Sbjct: 661 KALPRPDGVEWRNKFICVEEPFDG-TNTARAVHEKQKFDMIKDEFLKSWHRLKS 713
>gi|195453234|ref|XP_002073698.1| GK14246 [Drosophila willistoni]
gi|194169783|gb|EDW84684.1| GK14246 [Drosophila willistoni]
Length = 1448
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/325 (28%), Positives = 146/325 (44%), Gaps = 41/325 (12%)
Query: 25 WETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGK 84
++T+M++ + V G + GS +S S+ D+DI + I +
Sbjct: 976 YKTKMRLWRSIYSVTMDTYPRYGLYL--VGSSISYFGSKCSDMDICMLACTNPNIDPRME 1033
Query: 85 KVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
V L+ +LL + + ++ ARVPIL+F + DI+ +N G +
Sbjct: 1034 AVYHLQLMRELLSSTDMFQDFNLIE----ARVPILRFTDRRHKVEVDINFNNSVGIRNTH 1089
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPP 202
L+ SQ++ R R + L VK+WA+ H+INN K T +SYSL L+V+ + Q V P +LP
Sbjct: 1090 LLYCYSQLEWRVRPIALTVKQWAQYHNINNAKNMTISSYSLMLMVIHYLQAGVSPPVLPC 1149
Query: 203 LKDIYPGNL----VDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFL 258
L +YP +D V N EI I + SD NR L L + FL
Sbjct: 1150 LHKLYPEKFGLLQPNDFGYVDMN------EI----IGPYQSD-----NRQPLGELLLGFL 1194
Query: 259 EKFSGLSLKASEL-----GICPFTGQWEHIRSNTRWLPNNH---PLFIEDPFEQPENSAR 310
+S + G+ P + RS+ + H L IE+PF+Q N+AR
Sbjct: 1195 HYYSVFEYSKYVISIRVGGVLPV----DVCRSSKAAKNDIHQWNELCIEEPFDQ-TNTAR 1249
Query: 311 AVSEK-NLAKISNAFEMTHFRLTST 334
+V + +I F ++ RL ST
Sbjct: 1250 SVYDPITFDRIRTIFLASYRRLEST 1274
>gi|195112618|ref|XP_002000869.1| GI10468 [Drosophila mojavensis]
gi|193917463|gb|EDW16330.1| GI10468 [Drosophila mojavensis]
Length = 455
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 151/323 (46%), Gaps = 31/323 (9%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
R ++T+M++ + +V +V G + GS +S S+ D+DI + I
Sbjct: 58 RHIFKTKMRLWRFIYKVTMAVYPRYGVYL--VGSSISFFGSKCSDMDICMLACTNHNIDP 115
Query: 82 AGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ V ++ ++L A Q + ++ ARVPIL+F + DI+ +N G
Sbjct: 116 RMEAVYHLQIMREMLNATEQFQEFNLIE----ARVPILRFTDRRHKVEVDINFNNSVGIR 171
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAI 199
+ L+ SQ++ R R + L +K+WA+ H+INN K T +SYSL L+V+ Q+ V P +
Sbjct: 172 NTHLLYCYSQLEWRLRPIALTIKQWAQYHNINNAKNMTISSYSLMLMVIHFLQSGVNPPV 231
Query: 200 LPPLKDIYPGNLV----DDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
LP L +YP D V N E+ + + SD ++ + L L
Sbjct: 232 LPCLHKLYPEKFSILQPTDFGYVDMN------EV----MTPYQSDNHQTLGELLLDFLHY 281
Query: 256 SFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNH---PLFIEDPFEQPENSARAV 312
L ++S ++ G+ P E RS+ + H L IE+PF+ N+AR+V
Sbjct: 282 YSLFEYSKFAISIRVGGVLPV----ETCRSSKAAKNDIHQWNELCIEEPFDL-TNTARSV 336
Query: 313 SEKN-LAKISNAFEMTHFRLTST 334
+ + +I F ++ RL ST
Sbjct: 337 YDPDTFDRIRAIFLASYSRLEST 359
>gi|291225474|ref|XP_002732726.1| PREDICTED: polyA polymerase CID, putative-like [Saccoglossus
kowalevskii]
Length = 577
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 137/305 (44%), Gaps = 37/305 (12%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
+ G+T+ FG+ + DLD+ + +S SS K + LL ++R ++Q
Sbjct: 291 ITGSTLNGFGT-------KQSDLDMCLMVSLPQ--SSPPKTLILRLLHRIMRQIQQDIPS 341
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+ V ARVPIL+F CDI+I+N G + L S+ D R ++L +K+
Sbjct: 342 CKSLLVIRARVPILRFTDTKSGFDCDININNATGIRNTHLLQAYSKCDWRVAPLMLTIKQ 401
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPGNLVDDLKGVRANAE 223
WA A++IN+ T +SY+L+L+VL + Q C PA+LP L+ ++P D +
Sbjct: 402 WASANNINDASQSTLSSYTLALMVLHYLQVGCNPAVLPALQQLHPYYFRSDSDVKNLFPQ 461
Query: 224 RQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHI 283
+ ++ +YR N SLA L F + + + F Q I
Sbjct: 462 NSLETDLPDHL------RYRSQNTQSLAGLLRGFFYYY---------VNVYKFAQQVISI 506
Query: 284 RSNTRW----LPNNHPLF------IEDPFEQPENSARAVS-EKNLAKISNAFEMTHFRLT 332
R + L + H L+ IE+PF+ N+AR V E +I AF T ++
Sbjct: 507 RLGITYPKQSLTSVHDLYNWKYICIEEPFDS-NNTARPVHLESKYLEIIQAFSNTFKKIK 565
Query: 333 STNQT 337
T
Sbjct: 566 KATST 570
>gi|18406841|ref|NP_566048.1| nucleotidyltransferase-like protein [Arabidopsis thaliana]
gi|13430538|gb|AAK25891.1|AF360181_1 unknown protein [Arabidopsis thaliana]
gi|14532746|gb|AAK64074.1| unknown protein [Arabidopsis thaliana]
gi|20197056|gb|AAC06161.2| expressed protein [Arabidopsis thaliana]
gi|330255483|gb|AEC10577.1| nucleotidyltransferase-like protein [Arabidopsis thaliana]
Length = 764
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 144/317 (45%), Gaps = 29/317 (9%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIEL 73
I L P E+ E + ++++ L +V + A + +GS ++ D+D+
Sbjct: 441 IYKSLIPAEEELEKQRQLMAHLENLV--AKEWPHAKLYLYGSCANSFGFPKSDIDV---- 494
Query: 74 SNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISI 133
C++ G + +S + L + + + +Q + ARVPI+K ISCDI I
Sbjct: 495 ----CLAIEGDDINKSEMLLKLAEILESDNLQNVQALTRARVPIVKLMDPVTGISCDICI 550
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
+N+ + +K L +QID R R + +VK WAK+ +N GT +SY+ L+ + Q
Sbjct: 551 NNVLAVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRRVNETYQGTLSSYAYVLMCIHFLQ 610
Query: 194 TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLA 251
P ILP L+++ P VR + R C + N+ R + + NR ++A
Sbjct: 611 QRRPPILPCLQEMEP------TYSVRVDNIR-----CTYFDNVDRLRN--FGSNNRETIA 657
Query: 252 HLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN--TRWLPNN-HPLFIEDPFEQPENS 308
L F ++ A + + TG R TR + N+ H + IEDPFE +
Sbjct: 658 ELVWGFFNYWAYAHDYAYNV-VSVRTGSILGKREKDWTRRVGNDRHLICIEDPFETSHDL 716
Query: 309 ARAVSEKNLAKISNAFE 325
R V + ++ + FE
Sbjct: 717 GRVVDKFSIRVLREEFE 733
>gi|356506330|ref|XP_003521938.1| PREDICTED: uncharacterized protein LOC100818029 [Glycine max]
Length = 731
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 145/317 (45%), Gaps = 29/317 (9%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIEL 73
I G L P E+ + ++++ L ++V + + + +GS ++ D+D+
Sbjct: 416 IYGSLIPPEEEKLKQKQLVAILEKLVS--KEWPTSNLYLYGSCANSFGVSKSDIDV---- 469
Query: 74 SNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISI 133
C++ +++S + L + Q + +Q + ARVPI+K ISCDI I
Sbjct: 470 ----CLAIEEADMEKSKIIMKLADILQSDNLQNVQALTRARVPIVKLMDPVTGISCDICI 525
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
+NL + +K L + ID R R + ++K WAK+ +N GT +SY+ L+ + Q
Sbjct: 526 NNLLAVVNTKLLRDYAHIDPRLRQLAFIIKHWAKSRRVNETYHGTLSSYAYVLMCIHFLQ 585
Query: 194 TCVPAILPPLKDIYP--GNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLA 251
PAILP L+++ VDD V Q+ ++C F + N+ S+A
Sbjct: 586 MRRPAILPCLQEMETTYSVTVDD---VHCAYFDQVEKLCDFG----------RHNKESIA 632
Query: 252 HLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN--TRWLPNN-HPLFIEDPFEQPENS 308
L F ++ A+ + I TG R TR + N+ H + IEDPFE +
Sbjct: 633 QLVRGFFHYWAYCHDYANTV-ISVRTGSIISKREKDWTRRIGNDRHLICIEDPFEISHDL 691
Query: 309 ARAVSEKNLAKISNAFE 325
R V + ++ + FE
Sbjct: 692 GRVVDKHSIKVLREEFE 708
>gi|19527122|ref|NP_598666.1| poly(A) RNA polymerase GLD2 [Mus musculus]
gi|81879697|sp|Q91YI6.1|GLD2_MOUSE RecName: Full=Poly(A) RNA polymerase GLD2; Short=mGLD-2; AltName:
Full=PAP-associated domain-containing protein 4
gi|16741658|gb|AAH16629.1| PAP associated domain containing 4 [Mus musculus]
gi|148668622|gb|EDL00941.1| PAP associated domain containing 4, isoform CRA_a [Mus musculus]
gi|148668623|gb|EDL00942.1| PAP associated domain containing 4, isoform CRA_a [Mus musculus]
Length = 484
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGARSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNTVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
DIN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 DINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSTSVQLHLVHH 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N SSL L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNTQMISVREAKAIPRPDDMEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|356522696|ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812787 [Glycine max]
Length = 732
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 143/317 (45%), Gaps = 29/317 (9%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIEL 73
I G L P E+ + K+++ L ++V + A + +GS ++ D+D+
Sbjct: 417 IYGSLIPPEEEKLKQKKLVALLEKLVS--KEWPTAKLYLYGSCANSFGVSKSDIDV---- 470
Query: 74 SNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISI 133
C++ +++S + L + Q + +Q + ARVPI+K ISCDI I
Sbjct: 471 ----CLAIEEADMEKSKIIMKLADILQSDNLQNVQALTRARVPIVKLMDPVTGISCDICI 526
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
+NL + +K L + ID R R + ++K WAK+ +N GT +SY+ L+ + Q
Sbjct: 527 NNLLAVVNTKLLRDYAHIDPRLRQLAFIIKHWAKSRRVNETYHGTLSSYAYVLMCIHFLQ 586
Query: 194 TCVPAILPPLKDIYP--GNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLA 251
PAILP L+++ VDD+ CA+ + + N+ S+A
Sbjct: 587 MRRPAILPCLQEMETTYSVTVDDIH-------------CAYFDQVEKLSDFGRHNKESIA 633
Query: 252 HLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN--TRWLPNN-HPLFIEDPFEQPENS 308
L F ++ A+ + I TG R TR + N+ H + IEDPFE +
Sbjct: 634 QLVRGFFHYWAYCHDYANTV-ISVRTGSIISKREKDWTRRIGNDRHLICIEDPFEISHDL 692
Query: 309 ARAVSEKNLAKISNAFE 325
R V + ++ + FE
Sbjct: 693 GRVVDKHSIKVLREEFE 709
>gi|328787132|ref|XP_393329.3| PREDICTED: poly(A) RNA polymerase gld-2 homolog A [Apis mellifera]
Length = 373
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 120/253 (47%), Gaps = 29/253 (11%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L +L+ L++ +L+ + A+VPILKF QN+ D++ +N G + L+ S+
Sbjct: 116 LEQILKCLKRCDFIEQLELI-QAKVPILKFHDSIQNLEVDLNCNNAVGIRNTHLLYCYSR 174
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPG 209
ID R R +VL+VK WA++ DIN+ K T +SYSL L+V+ Q V P +LP L +Y G
Sbjct: 175 IDWRVRPLVLVVKLWAQSQDINDAKNMTISSYSLVLMVIHFLQYGVNPPVLPCLHSLYEG 234
Query: 210 NLV--DDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLK 267
D+ + E I R NR SL LF+ F +
Sbjct: 235 KFTPHTDIHCIDIQEELDIP-----------VSVLRPKNRQSLGELFIEFFRYYVMFDFN 283
Query: 268 ASELGI-----CPFTGQWEHIRSNTRWLPNNHP---LFIEDPFEQPENSARAVSEKNL-A 318
+ + P E R + + H L IE+PF+ N+AR+V + ++ A
Sbjct: 284 QYAISVRLANKIPI----EECRRARSYKNDPHQWKYLCIEEPFDL-TNTARSVYDPDVFA 338
Query: 319 KISNAFEMTHFRL 331
+I F+ T+ +L
Sbjct: 339 RIKYVFDCTYQKL 351
>gi|147782453|emb|CAN77386.1| hypothetical protein VITISV_006352 [Vitis vinifera]
Length = 720
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 125/286 (43%), Gaps = 35/286 (12%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ C++ + +S L + Q + +Q +
Sbjct: 442 YGSCANSFGVSKSDIDV--------CLAIDDADINKSEFLLKLADILQSDNLQNVQALTR 493
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K + ISCDI I+N+ + +K L +QID R R + +VK WAK+ +N
Sbjct: 494 ARVPIVKLKDPVTGISCDICINNVLAVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRGVN 553
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI-CA 231
GT +SY+ L+ + Q PAILP L+G++ + +I CA
Sbjct: 554 ETYQGTLSSYAYVLMCIHFLQQXKPAILPC------------LQGMQTTXSVTVDDIQCA 601
Query: 232 FNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTG--------QWEHI 283
F + N+ S+A L +F ++ A+++ I TG W
Sbjct: 602 FFDQVERLRHFGSHNKESIAQLVWAFFNYWAYHHDYANDV-ISIRTGSIISKREKDWTRR 660
Query: 284 RSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHF 329
+ N R H + IEDPFE + R V + ++ + FE +
Sbjct: 661 KGNDR-----HLICIEDPFEISHDLGRVVDKFSIKVLREEFERAAY 701
>gi|47225120|emb|CAF98747.1| unnamed protein product [Tetraodon nigroviridis]
Length = 540
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 150/322 (46%), Gaps = 52/322 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS- 80
E+ R V S LR++ + T++PFGS V N F + G DLD+ ++L +
Sbjct: 172 ENSRLRFLVCSLLRDLAATY--FPQCTIKPFGSSV-NGFGKLGCDLDMILDLDGIRSMKP 228
Query: 81 ---------------SAGKKVKQSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIH 124
S+ + V QS+L + +L Q G +Q + +AR P+L+F
Sbjct: 229 KPKSGLSLEFQLKRVSSDRVVTQSVLSVIGESLDQFAPGCVGVQKILNARCPLLRFAHQP 288
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYS 183
CD++ +N ++ L+ ++D R R +V V+ WA+AH++ + G + ++S
Sbjct: 289 SGFQCDLTANNRVAVKSTELLYLYGELDPRVRFLVFTVRCWARAHNVTSNIPGAWITNFS 348
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF--NIARFSSDK 241
L+++VLF Q P I+P L D LK + A A+R CA N F SD
Sbjct: 349 LTVMVLFFLQKRNPPIIPTL---------DHLKQLAAPADR-----CAIDGNDCTFVSD- 393
Query: 242 YRKI----NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF 297
+ KI N +L HL F + ++ S++ I TG+ +H P PL
Sbjct: 394 FSKIPLQQNSDTLEHLLRDFFDFYATFPF--SKMSINIRTGREQHK-------PEVAPLH 444
Query: 298 IEDPFEQPENSARAVSEKNLAK 319
I++PFE N ++ V+ L +
Sbjct: 445 IQNPFEPSLNVSKNVNMSQLDR 466
>gi|350406748|ref|XP_003487869.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like [Bombus
impatiens]
Length = 655
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 145/315 (46%), Gaps = 41/315 (13%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G+T+ FGS S D+D+ + L + + + + L +L+ L++
Sbjct: 362 LVGSTMNGFGSDNS-------DVDMCL-LVRHTEMDQRNEAIGH--LEQILKCLKRCDFI 411
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+L+ + A+VPILKF QN+ D++ +N G + L+ S+ID R R +VL+VK
Sbjct: 412 EQLELI-QAKVPILKFYDSIQNLEVDLNCNNAVGIRNTHLLYCYSRIDWRVRPLVLVVKL 470
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDIYPGNLV--DDLKGVRA 220
WA++ +IN+ K T +SYSL L+VL HF C P +LP L +Y G D+ +
Sbjct: 471 WAQSQNINDAKNMTISSYSLVLMVL-HFLQCGVNPPVLPCLHSLYKGKFAPHTDIHCIDI 529
Query: 221 NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-CPFTGQ 279
E I R NR +L LFV F + + +
Sbjct: 530 QEELNIP-----------VSILRPKNRQTLGELFVEFFRYYVMFDFNQYAISVRLANKIA 578
Query: 280 WEHIRSNTRWLPNNHP---LFIEDPFEQPENSARAVSEKNL-AKISNAFEMTHFRLTSTN 335
E R + + H L IE+PF+ N+AR+V + ++ A+I + F+ T+ L +
Sbjct: 579 IEECRRARSYKNDPHQWKYLCIEEPFDL-TNTARSVYDPDVFARIKHVFDCTYQNLKEHH 637
Query: 336 QTRYALLSSLARPFI 350
LAR FI
Sbjct: 638 --------DLARIFI 644
>gi|363744221|ref|XP_003643001.1| PREDICTED: poly(A) RNA polymerase GLD2 [Gallus gallus]
Length = 505
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 130/274 (47%), Gaps = 25/274 (9%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL--RQKG 102
L G+++ FG+ S+ GDL + I+ +C +K + + L++ L +
Sbjct: 218 LVGSSLNGFGTRTSD-----GDLCLVIKEEPVTCFYKVNQKTEARHILSLVQKLFSTKLS 272
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLL 161
Y + A+VPI+KF + D++++N+ G I++ FL + I+ R R +VL+
Sbjct: 273 SYIERPQLIRAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYIEKRVRPLVLV 331
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
VK+WA HDIN+ GT +SYSL L+VL + QT ILP L+ YP + +
Sbjct: 332 VKKWASFHDINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKNYPESFDPTM------ 385
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
++ + A + Y N SSL L + F K+ S I +
Sbjct: 386 ------QLHLVHQAPCTIPPYLSKNESSLGELLIGFF-KYYATEFDWSHQMISVREAKAI 438
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ W N + +E+PF++ N+ARAV EK
Sbjct: 439 PRPDDIEW--RNKYICVEEPFDR-TNTARAVHEK 469
>gi|328871491|gb|EGG19861.1| Regulator of nonsense transcripts [Dictyostelium fasciculatum]
Length = 1534
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 127/272 (46%), Gaps = 37/272 (13%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISS-AGKKVKQSLLGDLLRALRQKGGYRRLQ 108
V+P+GSFV+ + DLD+ C S+ K Q +LL L+ ++ +
Sbjct: 1241 VKPYGSFVNGIQLESSDLDV--------CFSTREDMKTAQ----ELLFVLKDSKHFKLEK 1288
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ +RVPILKF NIS D+ +N S + + +D R + ++LLVK W+
Sbjct: 1289 IIQFSRVPILKFTDTLHNISYDMCFNNRLAIGNSLLIQSYANMDPRAKQLMLLVKYWSSQ 1348
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY----PGNLVDDLKGVRANAER 224
DIN+ GT +SYS +V+F+ QT P +LP L+ I P LV R
Sbjct: 1349 KDINDASVGTLSSYSWLNMVVFYLQTIQPPVLPSLQQIDSSTPPNRLV-----------R 1397
Query: 225 QIAEICAFNIARFSS-DKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHI 283
+ + F + + D + L + F SF KF+ S L I G+ +I
Sbjct: 1398 SVVDGWKFLDPKMTGFDSKNTMTVFQLLYGFFSFYSKFN-----FSNLLISIRLGKPTNI 1452
Query: 284 R-SNTRWLPNNHP--LFIEDPFEQPENSARAV 312
R ++ +L ++H + IEDPFE N A +V
Sbjct: 1453 RMASKEYLDHHHKRHICIEDPFETSHNPAASV 1484
>gi|224063941|ref|XP_002301312.1| predicted protein [Populus trichocarpa]
gi|222843038|gb|EEE80585.1| predicted protein [Populus trichocarpa]
Length = 281
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 120/252 (47%), Gaps = 17/252 (6%)
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C++ ++ +S + L + Q G + +Q + ARVPI+K ISCDI I+N+
Sbjct: 43 CLAIEDAEINKSEVLLKLADILQSGNLQNVQALTRARVPIVKLMDPATGISCDICINNVL 102
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ +K L +QID R R + +VK WAK+ +N GT +SY+ L+ + Q P
Sbjct: 103 AVVNTKLLRDYAQIDVRLRQLAFIVKHWAKSRGVNATYQGTLSSYAYVLMCIHFLQQRRP 162
Query: 198 AILPPLKDIYPGN--LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
AILP L+++ VDD++ CA+ + N+ ++A L
Sbjct: 163 AILPCLQEMRTTYSVTVDDIQ-------------CAYFDQVEKLRGFGSRNKETIARLVW 209
Query: 256 SFLEKFS-GLSLKASELGICPFTGQWEHIRSNTRWLPNN-HPLFIEDPFEQPENSARAVS 313
+F ++ G + + + + +H + TR + N+ H + IEDPFE + R V
Sbjct: 210 AFFNYWAYGHDYANAVISVRTGSILSKHEKEWTRRIGNDRHLICIEDPFEISHDLGRVVD 269
Query: 314 EKNLAKISNAFE 325
+ ++ + FE
Sbjct: 270 KFSIKVLREEFE 281
>gi|170031333|ref|XP_001843540.1| Poly(A) RNA polymerase, mitochondrial [Culex quinquefasciatus]
gi|167869800|gb|EDS33183.1| Poly(A) RNA polymerase, mitochondrial [Culex quinquefasciatus]
Length = 573
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 169/351 (48%), Gaps = 51/351 (14%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVE-PFGSFVSNLFSRWG-DLDISIELSNGSCI-- 79
D R++ ++ +R+V S++ + V PFGS V N F + G DLDI ++L + + +
Sbjct: 146 DVGKRLRFLA-VRQVESSLQGMFPQAVAFPFGSSV-NGFGKMGCDLDIILDLDSEANLKQ 203
Query: 80 --------------SSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
S+ +V++ L +GD+L+ G ++ + ARVPI+K+
Sbjct: 204 SKSSRLVFHTKAANSNERTQVQRQLESIGDVLQLFLP--GVNSVRRILKARVPIIKYHHE 261
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSY 182
H ++ D++++N+ G S+ L+ QID R + + V+ WA+A + N G + ++
Sbjct: 262 HLDLEIDLTMNNMTGVHMSELLYLFGQIDPRVQPLTCCVRRWAQAVGLTNHAPGYWITNF 321
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
SL++LV++ Q ILPP+ ++ DL+ +E QI+ C+F + S +
Sbjct: 322 SLTMLVMYFLQQLPEPILPPVNRLFANATRSDLR----ISEDQIS--CSF-LRDLSKLDF 374
Query: 243 RKINRSSLAHLFVSFLEKFSGLSL--KASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
+ N + L L + F E +S +A L I + P++ PL+I +
Sbjct: 375 KTTNATPLDDLLLQFFEFYSHFDFNQRAISLNI-----------GASILKPDHSPLYIVN 423
Query: 301 PFEQPENSARAVS--EKNLAKIS--NAFEM--THFRLTSTNQTRYALLSSL 345
P E N A+ V+ E L +I NA + TH R T+ + L+S L
Sbjct: 424 PLETVLNVAKNVNLEETELFRIQVRNALWLLDTHDRATTATGDEWGLVSLL 474
>gi|242060262|ref|XP_002451420.1| hypothetical protein SORBIDRAFT_04g001830 [Sorghum bicolor]
gi|241931251|gb|EES04396.1| hypothetical protein SORBIDRAFT_04g001830 [Sorghum bicolor]
Length = 647
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 138/318 (43%), Gaps = 28/318 (8%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESV-ESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
+L + L+ E R K + + +SV + A + +GS ++ + D+D+ +E
Sbjct: 328 LLALYESLKPSEEHRSKQKQLVDSLAKSVSKEWPNAQMHLYGSCANSFGTSHSDVDVCLE 387
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ G+ + VK L D+LR G+ ++ + ARVPI++ SCDI
Sbjct: 388 METGT-QDAIEVLVK---LADVLRT----DGFENVEAITSARVPIVRMSDPGSGFSCDIC 439
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
I+NL +K L +QID R + LVK WAK +N GT +SY+ L+ +
Sbjct: 440 INNLLAVANTKLLKDYAQIDQRLLQLAFLVKHWAKQRGVNETYRGTLSSYAYVLMCINFL 499
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q C P ILP L+ + P + + G ++ ++ F N++S+A
Sbjct: 500 QQCEPKILPCLQAMEPTYKL-TVDGTECAYFDKVDQLQGFGAD----------NKASVAE 548
Query: 253 LFVSFLEKFSG-----LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
L F ++ + + LG T + R TR + H + IEDPFE +
Sbjct: 549 LLWGFFHYWASQHHYKRDVISVRLGK---TISKQEKRWTTRVGNDRHLVCIEDPFEVSHD 605
Query: 308 SARAVSEKNLAKISNAFE 325
R V + + + E
Sbjct: 606 LGRVVDRQTIRILREEME 623
>gi|395825552|ref|XP_003785992.1| PREDICTED: poly(A) RNA polymerase GLD2 [Otolemur garnettii]
Length = 484
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/273 (28%), Positives = 130/273 (47%), Gaps = 36/273 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + LL R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLLHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIAG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT NSYSL L+VL + QT ILP L+ +YP + + + +
Sbjct: 319 QINDASRGTLNSYSLVLMVLHYLQTLPEPILPSLQKMYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW 289
N+ + S N S+L L + FL+ + A+E +T Q +R +
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYY------ATEFD---WTSQMISVRE-AKA 416
Query: 290 LP-------NNHPLFIEDPFEQPENSARAVSEK 315
+P N + +E+PF+ N+ARAV EK
Sbjct: 417 IPRPDGIEWRNKYICVEEPFDG-TNTARAVHEK 448
>gi|56605820|ref|NP_001008373.1| poly(A) RNA polymerase GLD2 [Rattus norvegicus]
gi|81883541|sp|Q5U315.1|GLD2_RAT RecName: Full=Poly(A) RNA polymerase GLD2; AltName:
Full=PAP-associated domain-containing protein 4
gi|55249699|gb|AAH85771.1| PAP associated domain containing 4 [Rattus norvegicus]
gi|149059029|gb|EDM10036.1| PAP associated domain containing 4, isoform CRA_a [Rattus
norvegicus]
gi|149059030|gb|EDM10037.1| PAP associated domain containing 4, isoform CRA_a [Rattus
norvegicus]
Length = 484
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGARSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNTVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSTSVQLHLVHH 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N SSL L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNTQMISVREAKAIPRPDDMEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|195502498|ref|XP_002098250.1| GE10276 [Drosophila yakuba]
gi|194184351|gb|EDW97962.1| GE10276 [Drosophila yakuba]
Length = 1341
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/299 (29%), Positives = 136/299 (45%), Gaps = 45/299 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S S+ D+DI + I + V +++AL + + + A
Sbjct: 958 GSSISYFGSKCSDMDICMLACTNPNIDPRMEAVYHL---QVMKALLSRTNMFQDFNLIEA 1014
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVPIL+F + DI+ +N G + L+ SQ++ R R M L VK+WA+ H+INN
Sbjct: 1015 RVPILRFTDRCHKVEVDINFNNSVGIRNTHLLYCYSQLEWRVRPMALTVKQWAQYHNINN 1074
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL----VDDLKGVRANAERQIAE 228
K T +SYSL L+V+ Q P +LP L ++P +D V N E
Sbjct: 1075 AKNMTISSYSLMLMVIHFLQVGASPPVLPCLHKLHPDKFGLLQPNDFGYVDMN------E 1128
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWE 281
+ A Y+ N SL L ++FL +S +S++ G+ P E
Sbjct: 1129 VMA---------PYQSENSQSLGELLLNFLHYYSVFEYGKYAISIRVG--GVLPI----E 1173
Query: 282 HIRSNTRWLPNN-----HPLFIEDPFEQPENSARAVSEKN-LAKISNAFEMTHFRLTST 334
R+ P N + L IE+PF+Q N+AR+V + + +I F ++ RL ST
Sbjct: 1174 KCRAAA--APKNDIHQWNELCIEEPFDQ-TNTARSVYDTDTFERIKAIFVASYRRLEST 1229
>gi|392575623|gb|EIW68756.1| hypothetical protein TREMEDRAFT_39663 [Tremella mesenterica DSM
1558]
Length = 800
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 143/334 (42%), Gaps = 51/334 (15%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++++E A + FGS ++ R D+D+ +
Sbjct: 25 LLPTNEELHVKEEVRGLIEKLIKTIEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I + ++ S ++ L ++ ++ + AR+PILK + I+CDI
Sbjct: 77 LIDDSEANIEPSHFVAMIADLLERETNFDVKPLPKARIPILKLNLKASTALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
I+N ++ L + ID R R +VL +K WAK IN+P GT +SY +L+VL++
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWAKRRRINSPYRGTLSSYGFTLMVLYY 196
Query: 192 F-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E Q + F+ +++ IN
Sbjct: 197 LVHVKQPPVLPNLQRIAP---------LRPMTEEQYTLEGKNVYFFDDVETLRNEWSSIN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L + F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLIDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGVNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L IEDPFE N AR V++ L I F
Sbjct: 306 -----LCIEDPFETTYNVARTVTKDGLYTIRGEF 334
>gi|344252353|gb|EGW08457.1| Poly(A) RNA polymerase GLD2 [Cricetulus griseus]
Length = 484
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 136/293 (46%), Gaps = 39/293 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGARSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVYKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNTVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSTSVQLHLVHH 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N SSL L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNTQMISVREAKAIPRPDDIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRLTS 333
N + +E+PF+ N+ARAV EK I + F + RL S
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEKQKFDMIKDQFLKSWHRLKS 467
>gi|395327709|gb|EJF60106.1| hypothetical protein DICSQDRAFT_137682 [Dichomitus squalens
LYAD-421 SS1]
Length = 1165
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 152/330 (46%), Gaps = 65/330 (19%)
Query: 29 MKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISSA 82
M V D+R+++E + +R T+EP FGS + + D+D+ + +G +S+A
Sbjct: 62 MAVKEDVRKLLERL--IR--TIEPDSRLLSFGSSANGFSLKNSDMDLCCLIDSGERLSAA 117
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF-----ETIHQNISCDISIDNLC 137
+++GDLL ++ ++ + HAR+PI+K + I+CDI +N
Sbjct: 118 DLV---TMVGDLL----ERETKFHVKPLPHARIPIVKLNLDPSPALPFGIACDIGFENRL 170
Query: 138 GQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLF---HFQ 193
++ L + +D R R MVL +K W+K IN+P GT +SY LLV++ H +
Sbjct: 171 ALENTRLLMCYASVDPARVRTMVLFLKVWSKRRKINSPYKGTLSSYGYVLLVIYFLMHVK 230
Query: 194 TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS-----DKYRKINRS 248
T P +LP L+ + P +R +E + + + +NI F +++ N
Sbjct: 231 T--PPVLPNLQQMPP---------LRPISEEE-SHLNGYNIWFFDDIELLRQRWKSANTD 278
Query: 249 SLAHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWLPNNH 294
++A L + F + +S GL LK + G W +S+ +
Sbjct: 279 TVAELLIDFFKFYSRDFAYNTAVASIRAGL-LKKEDKG-------WATEQSDIGTSRERN 330
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L IEDPFE N AR V++ L I F
Sbjct: 331 RLCIEDPFETDFNVARCVTKDGLYTIRGEF 360
>gi|194220100|ref|XP_001918375.1| PREDICTED: poly(A) RNA polymerase GLD2 [Equus caballus]
Length = 484
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVIKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEVDLNVNNIVG-IRNTFLLRAYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N SSL L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNSQMISVREAKAVPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|340721317|ref|XP_003399069.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 1
[Bombus terrestris]
Length = 649
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 145/315 (46%), Gaps = 41/315 (13%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G+T+ FGS S D+D+ + L + + + + L +L+ L++
Sbjct: 356 LVGSTMNGFGSDNS-------DVDMCL-LVRHTEMDQRNEAIGH--LEQILKCLKRCDFI 405
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+L+ + A+VPILKF QN+ D++ +N G + L+ S+ID R R +VL+VK
Sbjct: 406 EQLELI-QAKVPILKFYDSIQNLEVDLNCNNAVGIRNTHLLYCYSRIDWRVRPLVLVVKL 464
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDIYPGNLV--DDLKGVRA 220
WA++ +IN+ K T +SYSL L+V+ HF C P +LP L +Y G D+ +
Sbjct: 465 WAQSQNINDAKNMTISSYSLVLMVI-HFLQCGVNPPVLPCLHSLYKGKFAPHTDIHCIDI 523
Query: 221 NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-CPFTGQ 279
E I R NR +L LFV F + + +
Sbjct: 524 QEELNIP-----------ISILRPKNRQTLGELFVEFFRYYVMFDFNQYAISVRLANKIA 572
Query: 280 WEHIRSNTRWLPNNHP---LFIEDPFEQPENSARAVSEKNL-AKISNAFEMTHFRLTSTN 335
E R + + H L IE+PF+ N+AR+V + ++ A+I + F+ T+ L +
Sbjct: 573 IEECRRARSYKNDPHQWKYLCIEEPFDL-TNTARSVYDPDVFARIKHVFDCTYQNLKEHH 631
Query: 336 QTRYALLSSLARPFI 350
LAR FI
Sbjct: 632 --------DLARIFI 638
>gi|317106636|dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]
Length = 748
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 130/278 (46%), Gaps = 27/278 (9%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ + + N I+ + +K L D+L Q + +Q +
Sbjct: 470 YGSCANSFGVLKSDIDVCLAIQNAD-INKSEVLLK---LADIL----QSDNLQNVQALTR 521
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K ISCDI I+N+ + +K L+ +QID R R + +VK WAK+ +N
Sbjct: 522 ARVPIVKLMDPVTGISCDICINNVLAVVNTKLLWDYAQIDVRLRQLAFIVKHWAKSRGVN 581
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP--GNLVDDLKGVRANAERQIAEIC 230
GT +SY+ L+ + Q PAILP L+++ VDD++ C
Sbjct: 582 ETYHGTLSSYAYVLMCIHFLQQRRPAILPCLQEMEATYSVAVDDIQ-------------C 628
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN--TR 288
A+ + N+ ++A L +F ++ A+ + I TG R TR
Sbjct: 629 AYFDQVEKLRGFGSRNKETIAQLVWAFFNYWAYRHDYANAV-ISIRTGSIISKREKDWTR 687
Query: 289 WLPNN-HPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
+ N+ H + IEDPFE + R V + ++ + FE
Sbjct: 688 RIGNDRHLICIEDPFEISHDLGRVVDKYSIKVLREEFE 725
>gi|340721319|ref|XP_003399070.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 2
[Bombus terrestris]
Length = 655
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 90/315 (28%), Positives = 145/315 (46%), Gaps = 41/315 (13%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G+T+ FGS S D+D+ + L + + + + L +L+ L++
Sbjct: 362 LVGSTMNGFGSDNS-------DVDMCL-LVRHTEMDQRNEAIGH--LEQILKCLKRCDFI 411
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+L+ + A+VPILKF QN+ D++ +N G + L+ S+ID R R +VL+VK
Sbjct: 412 EQLELI-QAKVPILKFYDSIQNLEVDLNCNNAVGIRNTHLLYCYSRIDWRVRPLVLVVKL 470
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDIYPGNLV--DDLKGVRA 220
WA++ +IN+ K T +SYSL L+V+ HF C P +LP L +Y G D+ +
Sbjct: 471 WAQSQNINDAKNMTISSYSLVLMVI-HFLQCGVNPPVLPCLHSLYKGKFAPHTDIHCIDI 529
Query: 221 NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-CPFTGQ 279
E I R NR +L LFV F + + +
Sbjct: 530 QEELNIP-----------ISILRPKNRQTLGELFVEFFRYYVMFDFNQYAISVRLANKIA 578
Query: 280 WEHIRSNTRWLPNNHP---LFIEDPFEQPENSARAVSEKNL-AKISNAFEMTHFRLTSTN 335
E R + + H L IE+PF+ N+AR+V + ++ A+I + F+ T+ L +
Sbjct: 579 IEECRRARSYKNDPHQWKYLCIEEPFDL-TNTARSVYDPDVFARIKHVFDCTYQNLKEHH 637
Query: 336 QTRYALLSSLARPFI 350
LAR FI
Sbjct: 638 --------DLARIFI 644
>gi|114052222|ref|NP_001039826.1| poly(A) RNA polymerase GLD2 [Bos taurus]
gi|122135693|sp|Q2HJ44.1|GLD2_BOVIN RecName: Full=Poly(A) RNA polymerase GLD2; AltName:
Full=PAP-associated domain-containing protein 4
gi|87578209|gb|AAI13320.1| PAP associated domain containing 4 [Bos taurus]
gi|296483646|tpg|DAA25761.1| TPA: poly(A) RNA polymerase GLD2 [Bos taurus]
gi|440893587|gb|ELR46294.1| Poly(A) RNA polymerase GLD2 [Bos grunniens mutus]
Length = 484
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
DIN+ GT +SYSL L+VL + QT ILP ++ IYP + + + +
Sbjct: 319 DINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYP-------ESFSPSIQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|321459353|gb|EFX70407.1| hypothetical protein DAPPUDRAFT_11736 [Daphnia pulex]
Length = 470
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/318 (28%), Positives = 147/318 (46%), Gaps = 44/318 (13%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLD--ISIELSNGSCI 79
R W T +++ L + S++ L PFGSFV+ DLD ISIE + GS I
Sbjct: 148 RLRWLTSIQIEQTLSRLFPSLQVL------PFGSFVNGSARNGCDLDMAISIEGNFGSEI 201
Query: 80 SSAGKKV--KQSLLGDLLRALRQK-------------GGYRRLQFVAHARVPILKFETIH 124
+ + + D R QK G ++Q + ARVPI+KF
Sbjct: 202 MALQSPLIFQAKAAIDNYRLQTQKHLEFFADVVQNYTTGCVQIQRIMRARVPIIKFHHEF 261
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGT-FNSYS 183
+ CDIS+ +L G S+ L+ +ID RF +V V++WA + +P G +++
Sbjct: 262 TGVDCDISM-SLSGVFMSELLYLYDKIDWRFCPLVTAVRQWAAWAKLTSPTPGNQITNFT 320
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAE-RQIAEI-CAFNIARFSSDK 241
L+++V+F Q P ILP L ++ +K R + + RQ +I C F +
Sbjct: 321 LTIMVVFFLQRRTPPILPTLGEM--------IKLARPHVDTRQTEDINCTFLRDPSVFQE 372
Query: 242 YRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDP 301
+++N SL LF+ FL F S +E I TG H + +++ PL++ +P
Sbjct: 373 RKELNSESLEDLFMGFLRFFG--SFDFNERSISVITGD-SHRKFDSK------PLYVINP 423
Query: 302 FEQPENSARAVSEKNLAK 319
E+ N +R V+ L +
Sbjct: 424 LERELNVSRNVTLTELTR 441
>gi|410927408|ref|XP_003977141.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Takifugu
rubripes]
Length = 542
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 151/317 (47%), Gaps = 44/317 (13%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS- 80
E+ R V S +R++ + T++PFGS V N F + G DLD+ +++ +G+ IS
Sbjct: 175 ENSRLRFLVCSLIRDLASTY--FPECTIKPFGSSV-NGFGKLGCDLDMILDI-DGTSISK 230
Query: 81 --------------SAGKKVKQSLLGDLLRAL-RQKGGYRRLQFVAHARVPILKFETIHQ 125
S+ + V QS+L + +L R G +Q + +AR P+L+F
Sbjct: 231 VKSGLSMEFQLKRVSSERVVTQSMLSVIGESLDRFAPGCVGIQKILNARCPLLRFAHQPS 290
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSL 184
CD++ +N ++ L+ ++D R R +V V+ WA+ H+I + G + ++SL
Sbjct: 291 GFQCDLTANNRVAVKSTELLYLYGELDPRVRFLVFTVRCWARVHNITSNIPGAWITNFSL 350
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKY 242
+++VLF Q P I+P L D LK + A+R + E C F + FS
Sbjct: 351 TVMVLFFLQKRNPPIIPTL---------DHLKKLAGPADRSVVEGNDCTF-VRDFSKVLL 400
Query: 243 RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
+K N ++L L F + ++ S + I TG+ +H P PL I++PF
Sbjct: 401 QK-NSNTLEDLLREFFDFYATFPF--SNMSINIRTGKEQHK-------PEVAPLHIQNPF 450
Query: 303 EQPENSARAVSEKNLAK 319
E N ++ V+ L +
Sbjct: 451 ESSLNISKNVNNSQLDR 467
>gi|348557261|ref|XP_003464438.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Cavia porcellus]
Length = 484
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 136/291 (46%), Gaps = 39/291 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILMLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWANHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
NI + S N SSL L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNIPPYLSK-----NESSLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRL 331
N + +E+PF+ N+ARAV EK I + F + RL
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEKQKFDMIKDQFLKSFHRL 465
>gi|311249764|ref|XP_003123800.1| PREDICTED: poly(A) RNA polymerase GLD2 [Sus scrofa]
Length = 484
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 131/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRTSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ +YP + L + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKMYPESFSPAL-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
+N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APYNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKFICVEEPFDG-TNTARAVHEK 448
>gi|256016593|emb|CAR63592.1| hypothetical protein [Angiostrongylus cantonensis]
Length = 694
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 139/291 (47%), Gaps = 38/291 (13%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGS-CISSAGKKVKQSLLGDLLRALRQKGGYR 105
G+T+ GS+ S D+D+ + +S G+ + + + L L +R K R
Sbjct: 410 GSTINGCGSYNS-------DMDLCLHISMGAEKMYPSERTYAVKTLHRLNSIIRGKPSLR 462
Query: 106 RL---QFVAHARVPILK--FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
R+ V A+VPI+K ++ + D++++N+ G S + S +D RF + L
Sbjct: 463 RIVRRSEVIPAKVPIIKMALHPPYEGLELDVNVNNIAGIYNSHLIHHYSLLDQRFPAVCL 522
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVR 219
LVK WA + I + G+FNSYSL LLVL +FQ V PA+LP L+ +YP
Sbjct: 523 LVKHWAITNGIGDASAGSFNSYSLILLVLHYFQCGVKPAVLPNLQYLYPDKF-------- 574
Query: 220 ANAERQIAEICAF-NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---CP 275
+ E+ F + R R N S+ L + F ++ + + + C
Sbjct: 575 -GCMPPLNELNLFQTLQRLPP---RMQNNQSIGELLIGFFHYYAAFDFENVAISMRDACM 630
Query: 276 FTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFE 325
F+ R+N + + +FIE+PF+ +N+AR V++ + +I +AF+
Sbjct: 631 FS------RANMAPEASIYRVFIEEPFDG-KNTARCVTKSYTIGRIRHAFK 674
>gi|392587440|gb|EIW76774.1| hypothetical protein CONPUDRAFT_110417 [Coniophora puteana
RWD-64-598 SS2]
Length = 860
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 164/349 (46%), Gaps = 65/349 (18%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRW 64
L D + L P E+ M V D+R+++E + +R T+EP FGS + R
Sbjct: 39 LFDFVIQLLPTAEE----MAVKEDVRKLLERL--IR--TIEPDSRLLSFGSTANGFSLRN 90
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET-- 122
D+D+ + + +SS+ ++LGDLL ++ ++ +AHAR+PI+K
Sbjct: 91 SDMDLCCLIDSEERLSSSDMV---TMLGDLL----ERETKFHVKPLAHARIPIVKLSLDP 143
Query: 123 ---IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGT 178
+ I+CDI +N ++ L + ID R R +VL +K W+K IN+P +GT
Sbjct: 144 SPGLPLGIACDIGFENRLALENTRLLMCYAMIDPTRVRTLVLFLKVWSKRRKINSPYSGT 203
Query: 179 FNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLV---------------DDLKGVR--- 219
+SY LLV+ F P +LP L+ + P + DD++ +R
Sbjct: 204 LSSYGYVLLVIYFLVHVKNPPVLPNLQQMPPMRPISQDDTHVGGYNTWFFDDIELLRQRW 263
Query: 220 --ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFT 277
+N E +AE+C FN+ F +R +R + V+ + +GL LK G
Sbjct: 264 QSSNTE-SVAELCVFNLIDF----FRYYSRDFSYNTGVASIR--AGL-LKKESKG----- 310
Query: 278 GQWEHIRSNTRW--LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
W++ S++++ + IEDPFE N +R V+++ L I F
Sbjct: 311 --WQNDLSSSKYNDARERNRFCIEDPFETDFNVSRCVTKEGLYLIRGEF 357
>gi|426232500|ref|XP_004010260.1| PREDICTED: poly(A) RNA polymerase GLD2 [Ovis aries]
Length = 484
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
DIN+ GT +SYSL L+VL + QT ILP ++ IYP + + + +
Sbjct: 319 DINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYP-------ESFSPSIQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NDSNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|270358653|gb|ACZ81442.1| Cid1 [Cryptococcus heveanensis]
Length = 738
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 155/358 (43%), Gaps = 53/358 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++++E A + FGS ++ R D+D+ + + + +
Sbjct: 25 LLPTSEELNVKEEVRGLIEKLIKTLEP--SARLLSFGSSCNSFGLRNSDMDLVVLIDDPN 82
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
AG V+ + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 83 ATIDAGNFVES------MAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGFTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPMEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L + F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLIDFFRFFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL-LSSLAR 347
L IEDPFE N AR V++ L I F M R+ + R L L+ L R
Sbjct: 306 -----LCIEDPFEVSYNVARTVTKDGLYTIRGEF-MRATRILTQRPDRAVLALAELCR 357
>gi|351697179|gb|EHB00098.1| Poly(A) RNA polymerase GLD2 [Heterocephalus glaber]
Length = 484
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 136/291 (46%), Gaps = 39/291 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILLLVHKHFCTRLSGYIDRPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWANHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSSAIQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
NI + S N SSL L + FL+ ++ +S++ ++ P +W
Sbjct: 372 TPCNIPPYLSK-----NESSLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRL 331
N + +E+PF+ N+ARAV EK I + F + RL
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEKQKFDMIKDQFLKSFHRL 465
>gi|291394943|ref|XP_002713979.1| PREDICTED: PAP associated domain containing 4 [Oryctolagus
cuniculus]
Length = 484
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRTSDGDLCLVVKEEPSFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIAG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
DIN+ GT NSYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 DINDASRGTLNSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESTLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|324502316|gb|ADY41019.1| Poly(A) RNA polymerase gld-2, partial [Ascaris suum]
Length = 1110
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 126/264 (47%), Gaps = 36/264 (13%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
E R +++S ++ V E A + GS V+ + D+D+ + L + +
Sbjct: 870 EVRERLLSLVKTVYED------ANIVAVGSTVNGCGAYNSDMDLCLCLPDAIYGYDTDRD 923
Query: 86 VKQSLLGDLLRALR-QKGGYRRLQFVAHARVPILKFETIHQ--NISCDISIDNLCGQIKS 142
+L + R L Q G R A+VPILK E ++ + DI+ +N+ G S
Sbjct: 924 YGVRVLKKVFRVLAYQSNGLVRKCHCIPAKVPILKLEMGNEYSELEIDINCNNVAGIYNS 983
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAIL 200
L + ++ID RF + LLVK WA IN+ +GTFNSYSL LLVL HF C +P +L
Sbjct: 984 HLLHYYARIDDRFPALCLLVKHWAINAGINDAMSGTFNSYSLILLVL-HFLQCATMPPVL 1042
Query: 201 PPLK----DIYPGNL-VDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
P L+ DI+ G+ +D+L+ R N+ + R++NR+++ L +
Sbjct: 1043 PNLQVLHPDIFNGHCGLDNLELFR-------------NLPPLPT---RELNRNTVGELLI 1086
Query: 256 SFLEKFSGLSLKASELGI---CPF 276
+F + ++ + I C F
Sbjct: 1087 AFFDYYAKFDFVNKAISIHRGCVF 1110
>gi|403256383|ref|XP_003920859.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 484
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 128/274 (46%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSPAVQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|255088563|ref|XP_002506204.1| predicted protein [Micromonas sp. RCC299]
gi|226521475|gb|ACO67462.1| predicted protein [Micromonas sp. RCC299]
Length = 88
Score = 91.3 bits (225), Expect = 8e-16, Method: Composition-based stats.
Identities = 44/89 (49%), Positives = 57/89 (64%), Gaps = 1/89 (1%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVP++KF + CD+ + N G KS L ++ +D R+RD+V LVK WAK D N
Sbjct: 1 ARVPLIKFRDPRTGVKCDVCVGN-DGVYKSAVLGAMADLDSRYRDLVFLVKMWAKNFDCN 59
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+ G+FNSYSLSL+ LFH QT P ILP
Sbjct: 60 DATAGSFNSYSLSLMSLFHLQTRSPPILP 88
>gi|256071408|ref|XP_002572032.1| poly(A) polymerase cid (pap) (caffein-induced death protein)
[Schistosoma mansoni]
gi|350645034|emb|CCD60264.1| poly(A) polymerase cid (pap) (caffein-induced death protein),
putative [Schistosoma mansoni]
Length = 410
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 151/324 (46%), Gaps = 40/324 (12%)
Query: 20 PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCI 79
PL+ ++ ++ +++ L V+ V A + GS ++ S D+D+ C+
Sbjct: 94 PLK--YKKKICLLNALHMVISGV--FENAGLYIVGSSINGFGSNQSDMDM--------CL 141
Query: 80 SSAGKKVKQS-----LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISID 134
+ + Q +L LL++LR+ + A+VPI+KF + I CD++++
Sbjct: 142 LVTSRDLHQKNEATFILSRLLQSLRKCRFLHNFTLI-RAKVPIIKFRDKYAGIDCDLNVN 200
Query: 135 NLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT 194
N+ G + L +++D R R + + +K WA+ DI++ + G ++Y L L+++ + QT
Sbjct: 201 NVIGLYNTHLLAMYAKVDWRVRPLGIFIKHWAQCMDIHDAQRGRLSTYCLLLMLIHYLQT 260
Query: 195 -CVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHL 253
C+P +LP L++ +P + + Q+ ++ + N ++L L
Sbjct: 261 ACIPPVLPNLQEKFPEVFNYTIPPYELDMNGQLPW-----------NELKSTNFNNLGEL 309
Query: 254 FVSFLEKFS--------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
F F+ ++ ++++ + + T + I + N +FIE+PF Q
Sbjct: 310 FNGFIHYYTNEFNFNKWAITIRHNRPFMKNITMKQLPIYEQNYIMNRNCKIFIEEPFSQ- 368
Query: 306 ENSARAVSEKNLAK-ISNAFEMTH 328
N+AR++ N+ I AF T+
Sbjct: 369 TNAARSIHSDNIVSYIKQAFIKTN 392
>gi|170042745|ref|XP_001849075.1| poly(A) polymerase cid [Culex quinquefasciatus]
gi|167866218|gb|EDS29601.1| poly(A) polymerase cid [Culex quinquefasciatus]
Length = 1185
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 102/211 (48%), Gaps = 17/211 (8%)
Query: 54 GSFVSNLFSRWGDLDISIEL-SNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
GS +S S D+D+ + +N G+ + Q LG L Y V
Sbjct: 959 GSTISGFASDNSDVDMCLVCRANTIPFDMRGEALFQ--LGQLKNYFMNINTYFEEFSVIQ 1016
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPIL+F I D++ +N G + L+ SQ+D R R + L+VK WA+ H+IN
Sbjct: 1017 AKVPILRFRDSTNCIVVDLNYNNCVGIRNTHLLYCYSQLDWRLRPLTLVVKLWAQHHNIN 1076
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA 231
+ K T +SYSL L+V+ Q V P ILP L +YP V ++++I
Sbjct: 1077 DAKNMTISSYSLVLMVIHFLQYGVSPPILPCLHAMYPDKFV------------RMSDISN 1124
Query: 232 FNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
++ + + Y+ N SL LF+ FLE ++
Sbjct: 1125 LDLTE-TMEPYKNENAQSLGELFMQFLEYYA 1154
>gi|332224822|ref|XP_003261567.1| PREDICTED: poly(A) RNA polymerase GLD2 [Nomascus leucogenys]
Length = 484
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF+ + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFKDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|328769555|gb|EGF79599.1| hypothetical protein BATDEDRAFT_89693 [Batrachochytrium
dendrobatidis JAM81]
Length = 1081
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 101/193 (52%), Gaps = 15/193 (7%)
Query: 27 TRMKVISDLREVVESVESL------RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
TR+ D RE V+ V+ + A V FGS V+NL D+D++IE+S IS
Sbjct: 607 TRLAGQPDRREFVDKVQKILNQAYNNTAHVYLFGSSVNNLGLNTSDVDMTIEIS-PELIS 665
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ K L G +LRA GG + + ++HARVPI KF + DI++ + G
Sbjct: 666 NHKAKNMHHLAG-ILRA----GGMKEVVAISHARVPICKFYDPKLCVHADINVGHSLGVY 720
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLVLFHFQTCVPAI 199
S L + +D R + +LL+K W+KA D+NNP + GT +SY+ S++ + + Q +
Sbjct: 721 NSALLKAYTLLDPRVKPFILLIKLWSKARDLNNPSSGGTLSSYAYSIMAIAYMQKL--GL 778
Query: 200 LPPLKDIYPGNLV 212
LP L+ P V
Sbjct: 779 LPSLQLAVPPGTV 791
>gi|387762840|ref|NP_001248668.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|355691430|gb|EHH26615.1| Poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|380787389|gb|AFE65570.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|380787397|gb|AFE65574.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|383410315|gb|AFH28371.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|383410317|gb|AFH28372.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|384945590|gb|AFI36400.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|384945592|gb|AFI36401.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
Length = 484
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNVVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSSAIQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|224127462|ref|XP_002320080.1| predicted protein [Populus trichocarpa]
gi|222860853|gb|EEE98395.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 129/278 (46%), Gaps = 27/278 (9%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ C++ ++K+S + L + Q + +Q +
Sbjct: 19 YGSCANSFGVSKSDIDV--------CLTIEDAEIKKSEVLLKLADILQADNLQNVQALTR 70
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K ISCDI ++N+ + +K L +QID R R + +VK WAK+ +N
Sbjct: 71 ARVPIVKLMDPVTGISCDICLNNVLAVVNTKLLRDYAQIDVRLRQLAFVVKHWAKSRGVN 130
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP--GNLVDDLKGVRANAERQIAEIC 230
GT +SY+ L+ + Q PAILP L+++ +VDD++ C
Sbjct: 131 ATYQGTLSSYAYVLMCIHFLQQRRPAILPCLQEMGTTYSAIVDDIR-------------C 177
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN--TR 288
A+ + N+ ++A L +F ++ A+ + I TG R TR
Sbjct: 178 AYFDQVEKLRGFGSRNKETIAQLVWAFFNYWAYRHDYANGV-ISVRTGSIISKREKDWTR 236
Query: 289 WLPNN-HPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
+ N+ H + IEDPFE + R V + ++ + FE
Sbjct: 237 RIGNDRHLICIEDPFEISHDLGRVVDKFSIKVLREEFE 274
>gi|389738915|gb|EIM80110.1| hypothetical protein STEHIDRAFT_126102 [Stereum hirsutum FP-91666
SS1]
Length = 1326
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 158/339 (46%), Gaps = 49/339 (14%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRW 64
L D + L P E+ + V D+R+++E + +R T+EP FGS + R
Sbjct: 45 LYDFVIQLLPTPEE----LSVKEDVRKLLERL--IR--TIEPDSRLLSFGSTANGFSLRN 96
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE--- 121
D+D+ + + +S+A ++LGDLL ++ ++ + HAR+PI+K
Sbjct: 97 SDMDLCCLIDSDERLSAADLV---TMLGDLL----ERETKFHVKPLPHARIPIVKLSLDP 149
Query: 122 --TIHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGT 178
+ I+CDI +N ++ L+ + ID R R +VL +K W K IN+P GT
Sbjct: 150 SPGLPLGIACDIGFENRLALENTRLLYCYAMIDPTRVRTLVLFLKVWCKRRKINSPYQGT 209
Query: 179 FNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA--FNIA 235
+SY LLV+ F P +LP L+ + P L+ + ++ + I E F+
Sbjct: 210 LSSYGYVLLVIYFLVHVKNPPVLPNLQQMPP------LRPI-SHEDSHIGEHNTWFFDDI 262
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFS-------GL-SLKASELGICPFTGQWEHIRSNT 287
+++ N S+A L + F + FS G+ S++A L T W++
Sbjct: 263 ELLRQRWQSANTDSVAQLLIDFFKYFSRDFLYNTGVASIRAGLLKK--ETKGWQNDLDPL 320
Query: 288 RWLPN--NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
R+ + + L IEDPFEQ N AR V+ L I F
Sbjct: 321 RYKDSRERNRLCIEDPFEQDYNVARCVTRDGLYLIRGEF 359
>gi|159481933|ref|XP_001699029.1| hypothetical protein CHLREDRAFT_177575 [Chlamydomonas reinhardtii]
gi|158273292|gb|EDO99083.1| predicted protein [Chlamydomonas reinhardtii]
Length = 500
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 146/359 (40%), Gaps = 78/359 (21%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESV-ESLRGATVEPFGSFVSNLFSRWG 65
+E L+ + P +ED R I LR ++ S+ + + P+GSF+S+ +S
Sbjct: 1 MEAFLQQLAKAREPSQEDGVARQATIERLRSLLPSIFPAPSNLHLVPYGSFLSSCYSHGS 60
Query: 66 DLDISI-------ELSNGSCI----SSAGKKV------KQSLLGDLLRALRQKGGYRRLQ 108
DLD+++ L+ G + + G +V + LLR L + R +
Sbjct: 61 DLDLALAGGVAEAHLAPGPAVELPPGAWGGEVLPLESLSEEQFAALLRRLADELEVRGVT 120
Query: 109 F-----VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
+ ARVPI+KF I CDI + K + + + +V LVK
Sbjct: 121 AGPVTRILEARVPIIKFVESSSGIECDICVTTRGCDFKGAVMRLLHGLQPGLAPLVRLVK 180
Query: 164 EWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP--AILPPLKDIY-----------PGN 210
WAKAHDIN+ GT NS+SL+L+ LF Q P A+LPPL ++ G
Sbjct: 181 LWAKAHDINSAHCGTLNSWSLALMALFSMQA-YPEGALLPPLWRLFHDSEPPLSAAGTGR 239
Query: 211 LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASE 270
+ D KGV+ A +A K ++L H +F
Sbjct: 240 PLQD-KGVQPEAMLAVARA-----------KCTGKGAAALLHPRCAFAA----------- 276
Query: 271 LGICPFTGQWEHIRSNTRW-------------LPNNHPLFIEDPFEQPENSARAVSEKN 316
TGQW ++ W P + +E+PF+ +N+AR+V ++
Sbjct: 277 -----ITGQWRDNAAHRNWRVSPWLGRGYTARFPRAYVAAVEEPFDCHDNTARSVGIRD 330
>gi|167555095|ref|NP_776158.2| poly(A) RNA polymerase GLD2 [Homo sapiens]
gi|167555097|ref|NP_001107865.1| poly(A) RNA polymerase GLD2 [Homo sapiens]
gi|167555099|ref|NP_001107866.1| poly(A) RNA polymerase GLD2 [Homo sapiens]
gi|74737798|sp|Q6PIY7.1|GLD2_HUMAN RecName: Full=Poly(A) RNA polymerase GLD2; Short=hGLD-2; AltName:
Full=PAP-associated domain-containing protein 4;
AltName: Full=Terminal uridylyltransferase 2;
Short=TUTase 2
gi|45708664|gb|AAH26061.1| PAPD4 protein [Homo sapiens]
Length = 484
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|417401754|gb|JAA47745.1| Putative polya rna polymerase gld2 isoform 2 [Desmodus rotundus]
Length = 484
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRTSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVDFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + ++ + Q+
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYPESFS---PTIQLHLVHQVPS- 374
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
Y N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 375 --------DVPPYLSKNESTLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|21755154|dbj|BAC04629.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|428181303|gb|EKX50167.1| hypothetical protein GUITHDRAFT_103980 [Guillardia theta CCMP2712]
Length = 760
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 149/330 (45%), Gaps = 44/330 (13%)
Query: 19 NPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSC 78
+P RE+ ++ V + + +++ S + ++V+ FGS SNL S+ D+DI + +
Sbjct: 399 SPPREEMVRKVSVCTSVSKIIAG--SYQRSSVQMFGSSGSNLCSKGSDVDICLLIPEEEI 456
Query: 79 ISSAGKKVKQS-----LLGDLLRALRQKGGYRRLQFVAHARVPILKFET---IHQNISCD 130
+A + K + L+G L L + G ++ + +ARVPI+KF+ + + CD
Sbjct: 457 QRNAKGQRKTARFRYFLIG--LAKLLTRQGMMNVEPLPNARVPIIKFQARDGLDFSFDCD 514
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLF 190
+ ++N+ I + LF + +D R R +++ +K W K I+N G +SY+ SL+V+
Sbjct: 515 LCVNNVLACINTNLLFTYTMLDARVRPLIMCIKHWVKQRQIHNAFRGYLSSYTYSLMVIQ 574
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----- 245
+ Q ILP L+++ + R + A C + YR +
Sbjct: 575 YLQ--YERILPCLQNL-------KREEARQKNDSSFAVQCEGK--EYDCYFYRNVESLAG 623
Query: 246 ---NRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNNH 294
N SL L V F +S +S+++ L G W+ +N N H
Sbjct: 624 ERNNPCSLGLLLVGFFHFYSNVFSIGEGVVSIRSGRLLKKTAKG-WDVPGNNK----NRH 678
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ + R V E + I F
Sbjct: 679 VICIEDPFDVNLDLGRYVDENTVKDIEMEF 708
>gi|431907865|gb|ELK11472.1| Poly(A) RNA polymerase GLD2 [Pteropus alecto]
Length = 483
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 200 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 259
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 260 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 317
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + + +
Sbjct: 318 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYP-------ESFSPAIQLHLVHQ 370
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 371 APSNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 425
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 426 -----------NKYICVEEPFDG-TNTARAVHEK 447
>gi|344272684|ref|XP_003408161.1| PREDICTED: poly(A) RNA polymerase GLD2 [Loxodonta africana]
Length = 485
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 128/274 (46%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 202 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 261
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 262 -RAKVPIVKFRDKVNCVEFDLNVNNVVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 319
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP L+ IYP + +
Sbjct: 320 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ESFSPAIHLHLVHQ 372
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
NI + S N S+L L + FL+ ++ +S+ ++ P +W
Sbjct: 373 APCNIPPYLSK-----NESNLGDLLLGFLKYYATEFDWDSQMISVHEAKAIPRPDGIEWR 427
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + IE+PF+ +N+ARAV EK
Sbjct: 428 -----------NKYICIEEPFDG-KNTARAVHEK 449
>gi|410948876|ref|XP_003981153.1| PREDICTED: poly(A) RNA polymerase GLD2 [Felis catus]
Length = 484
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYP-------ESFSPAIQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|402871961|ref|XP_003899913.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 2 [Papio anubis]
gi|355750026|gb|EHH54364.1| Poly(A) RNA polymerase GLD2 [Macaca fascicularis]
Length = 484
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNVVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|345798653|ref|XP_849698.2| PREDICTED: poly(A) RNA polymerase GLD2 isoform 2 [Canis lupus
familiaris]
Length = 484
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|115480789|ref|NP_001063988.1| Os09g0570600 [Oryza sativa Japonica Group]
gi|52077188|dbj|BAD46233.1| unknown protein [Oryza sativa Japonica Group]
gi|113632221|dbj|BAF25902.1| Os09g0570600 [Oryza sativa Japonica Group]
Length = 310
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 86/174 (49%), Gaps = 21/174 (12%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
V ARVPI+ I CDI+++N G +S +IS +D RF+ + LVK WAK H
Sbjct: 115 VVTARVPIVNVIDRGTGIECDITVENKDGMTRSMIFKFISSLDPRFQILSYLVKFWAKIH 174
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
D+N+P+ T +S S+ LV FH QT P ILPPL + D + V N
Sbjct: 175 DVNSPRERTLSSMSIVSLVAFHLQTRDPPILPPLSALLKDG--SDFESVERNT------- 225
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEK-FSGLSLKASELGICP--FTGQW 280
AF + + N+ ++A LFVS + K S SL E G+C F W
Sbjct: 226 LAFK-------GFGRTNKETVAELFVSLISKLLSAESLW--EHGLCASNFEASW 270
>gi|332821218|ref|XP_001139111.2| PREDICTED: poly(A) RNA polymerase GLD2 isoform 7 [Pan troglodytes]
gi|397503433|ref|XP_003822328.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 2 [Pan paniscus]
gi|410226656|gb|JAA10547.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410226658|gb|JAA10548.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410226660|gb|JAA10549.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410261728|gb|JAA18830.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410261730|gb|JAA18831.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410261732|gb|JAA18832.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410297018|gb|JAA27109.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410297020|gb|JAA27110.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410297022|gb|JAA27111.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410352387|gb|JAA42797.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410352389|gb|JAA42798.1| PAP associated domain containing 4 [Pan troglodytes]
gi|410352393|gb|JAA42800.1| PAP associated domain containing 4 [Pan troglodytes]
Length = 484
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 319 QINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APSNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|218184236|gb|EEC66663.1| hypothetical protein OsI_32948 [Oryza sativa Indica Group]
Length = 586
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 125/257 (48%), Gaps = 29/257 (11%)
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C+S K++ + + L + G R +Q + ARVPI+K + +SCDI ++NL
Sbjct: 323 CLSIDEKEMSKVDIILKLAHILHAGNLRNIQALTRARVPIVKLMDPNTGLSCDICVNNLL 382
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ +K L S+ID R R + +VK WAK+ +N GT +SY+ ++ + + Q+
Sbjct: 383 AVVNTKLLRDYSRIDKRLRPLAFIVKHWAKSRCVNETYQGTLSSYAYVIMCIHYLQS--Q 440
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN------RSSLA 251
ILP L+++ P V V N ICA+ D+ K+N + +L+
Sbjct: 441 RILPCLQEMEPTYYVT----VDNN-------ICAY------FDQVDKLNGFGAQCKDTLS 483
Query: 252 HLFVSFLEKFSGLSLKASELGICPFTGQW--EHIRSNTRWLPNN-HPLFIEDPFEQPENS 308
L F ++ + ++ I TG+ ++++ TR + N+ H + IEDPFE +
Sbjct: 484 RLLWGFF-RYWAYAHNYTKDVISIRTGRTISKNMKDWTRRIGNDRHLICIEDPFETSHDL 542
Query: 309 ARAVSEKNLAKISNAFE 325
R V +++ + FE
Sbjct: 543 GRVVDNRSIWALREEFE 559
>gi|281350291|gb|EFB25875.1| hypothetical protein PANDA_004320 [Ailuropoda melanoleuca]
Length = 460
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 129/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYP-------ESFSPAIQLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|393233523|gb|EJD41094.1| PAP/OAS1 substrate-binding domain-containing protein, partial
[Auricularia delicata TFB-10046 SS5]
Length = 420
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 151/339 (44%), Gaps = 56/339 (16%)
Query: 29 MKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+ + D+R+++E + +R T+EP FGS + R D+D+ + +A
Sbjct: 48 LAIKEDVRKLLEKL--IR--TIEPDSRLMAFGSTANGFSLRNSDMDLCCLIDAAKPPLNA 103
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET-----IHQNISCDISIDNLC 137
V+ L+GDLL ++ ++ + HAR+PI+K + I+CDI +N
Sbjct: 104 SDLVQ--LVGDLL----ERETKFAVKTLPHARIPIIKLSLAPSPGLPFGIACDIGFENRL 157
Query: 138 GQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-FHFQTC 195
++ L + +D R R MVL +K W+K IN+P GT +SY LLV+ F
Sbjct: 158 ALENTRMLHTYASLDPARVRTMVLFLKVWSKRRKINSPYEGTLSSYGYVLLVIYFLVHVK 217
Query: 196 VPAILPPLKDI------------YPGN---LVDDLKGVRANAERQ----IAEICAFNIAR 236
P +LP ++ I Y GN DD+ +R + Q +AE+C F+I R
Sbjct: 218 SPPVLPNIQQIPPPTPRTHEQTHYAGNNIWFFDDIDTLRHRWQSQNTQSVAELCVFSIPR 277
Query: 237 FSS--DKYRKINRSSLAHLFVSFLEKF---SGLS------LKASELGICPFTGQWEHIRS 285
S +R SL LF + F +G++ L SE G +T
Sbjct: 278 PLSCMAGHRAGRCRSLVDLFRYYSRDFPYNTGVASIRMGPLTKSEKG---WTADVSRPSR 334
Query: 286 NTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + + L IEDPFE N R V+++ L I F
Sbjct: 335 SYSSRRDGNRLCIEDPFETDFNVGRCVTKEGLYLIRGEF 373
>gi|301761674|ref|XP_002916258.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Ailuropoda
melanoleuca]
Length = 484
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 130/274 (47%), Gaps = 38/274 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGYRRLQFV 110
GS ++ +R D D+ + + C +K + + L+ R G R Q +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSGYIERPQLI 260
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAH 169
A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL++K+WA H
Sbjct: 261 -RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLVIKKWASHH 318
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+IN+ GT +SYSL L+VL + QT ILP ++ IYP + + + +
Sbjct: 319 EINDASRGTLSSYSLVLMVLHYLQTLPEPILPSIQKIYPESFSPAI-------QLHLVHQ 371
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWE 281
N+ + S N S+L L + FL+ ++ +S++ ++ P +W
Sbjct: 372 APCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR 426
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
N + +E+PF+ N+ARAV EK
Sbjct: 427 -----------NKYICVEEPFDG-TNTARAVHEK 448
>gi|56566263|gb|AAN75184.2| CID1 [Cryptococcus neoformans var. grubii]
gi|405119913|gb|AFR94684.1| cid1 [Cryptococcus neoformans var. grubii H99]
Length = 727
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 150/357 (42%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ + + + S
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVVLIDDPS 82
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
G V+ + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 83 AKIDPGNFVES------MAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|133919902|emb|CAL91354.1| cytoplasmic poly(A) polymerase [Mus musculus]
Length = 480
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/226 (30%), Positives = 111/226 (49%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNTVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA HDIN+ GT +SYSL L+VL + QT ILP L+ IYP +
Sbjct: 303 LVLVIKKWASHHDINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ES 355
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + + N+ + S N SSL L + FL+ ++ +S++ +
Sbjct: 356 FSTSVQLHLVHHAPCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNTQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDDMEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|301114963|ref|XP_002999251.1| Poly(A) polymerase, putative [Phytophthora infestans T30-4]
gi|262111345|gb|EEY69397.1| Poly(A) polymerase, putative [Phytophthora infestans T30-4]
Length = 558
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 145/332 (43%), Gaps = 31/332 (9%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGAT--VEPFGSFVS 58
MG+ N + + D + +L L + + + +R V+ + + T V PFGS S
Sbjct: 1 MGAANSVRKLTIDSIALLEQLEPN-KAELAAKRAVRRRVQQLLQQKWPTCRVLPFGSSES 59
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL-RQKGGYRRLQFVAHARVPI 117
L D+D+ I + + + G+ Q + L A R G ++ L+FV ARVP+
Sbjct: 60 GLGFGGCDVDLGIYFEDVD-VDAQGQFSPQERVNLLATACERLSGAFQVLEFVRSARVPV 118
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
+K + ++CD+ + + + + L + Q+D R R +V VK WAK IN+ G
Sbjct: 119 IKLWDTKRQVACDVCVGGINALLNTALLKYYGQVDPRVRPLVFAVKYWAKQRGINDSANG 178
Query: 178 TFNSYSLSLLVLFHFQTC-----VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
T +SY +LL++F+ Q+ +P +L +D+ V L + + AF
Sbjct: 179 TLSSYGYTLLLIFYLQSHYAEMQLPEVLSLFQDLQSQTKVSVL----------LERMQAF 228
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPN 292
S + +S+ L F + F + + TG+ + T+W
Sbjct: 229 PTIELPS-TFGTSEMNSVGALLAGFFD-FYARRFNMEDDVVSIRTGR--ALSKTTKW--- 281
Query: 293 NHP----LFIEDPFEQPENSARAVSEKNLAKI 320
+HP L IEDPFE + R + + ++
Sbjct: 282 SHPVSWRLSIEDPFELAHDVGRVIFHRKCQEL 313
>gi|392563461|gb|EIW56640.1| hypothetical protein TRAVEDRAFT_127187 [Trametes versicolor
FP-101664 SS1]
Length = 1046
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 172/384 (44%), Gaps = 55/384 (14%)
Query: 29 MKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISSA 82
M V D+R+++E + +R T+EP FGS + + D+D+ + +G +++A
Sbjct: 62 MAVKEDVRKLLERL--IR--TIEPDSRLLSFGSSANGFSLKNSDMDLCCLIDSGERLNAA 117
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDISIDNLC 137
+++GDLL ++ ++ + HAR+PI+K + I+CDI +N
Sbjct: 118 DLV---TMVGDLL----ERETKFHVKPLPHARIPIVKLTLDPSPALPFGIACDIGFENRL 170
Query: 138 GQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-FHFQTC 195
++ L + ID R R MVL +K W+K IN+P GT +SY LLV+ F
Sbjct: 171 ALENTRLLMCYASIDPARVRTMVLFLKVWSKRRKINSPYKGTLSSYGYVLLVIYFLVHVK 230
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS-----DKYRKINRSSL 250
P +LP L+ + P +R +E + + + FNI F +++ N ++
Sbjct: 231 NPPVLPNLQQMPP---------LRPISEEE-SHLNGFNIWFFDDIELLRQRWKSSNTDTV 280
Query: 251 AHLFVSFLEKFS-------GL-SLKASELGICPFTGQW--EHIRSNTRWLPNNHPLFIED 300
A L + F + +S G+ S++A L T W E + + + L IED
Sbjct: 281 AELLIEFFKFYSRDFAYNTGVASIRAGLLKKD--TKGWLSEMLLKDYGTGRERNRLCIED 338
Query: 301 PFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL-LSSLARPFILQFFGESPV 359
PFE N AR V++ L I F M R+ R L L+ L + G P
Sbjct: 339 PFETDFNVARCVTKDGLYTIRGEF-MRASRILQARPDRAILALAQLCEERKDETLGPPPT 397
Query: 360 RYANYNNGHRRAR--PQSHKSVNS 381
R + N R PQ+ +V S
Sbjct: 398 RQRSSNGPPRLTSIPPQTPYTVGS 421
>gi|320166348|gb|EFW43247.1| PAP/25A associated domain-containing protein [Capsaspora owczarzaki
ATCC 30864]
Length = 687
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 138/311 (44%), Gaps = 31/311 (9%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLR----GATVEPFGSFVSN 59
++ LE L +IL + P D+ I L++ VE + R A++E FGS V++
Sbjct: 351 HSALEHDLAEILERITPTTRDFHR----IVLLQQRVEQLLQHRFPEWDASIEMFGSSVNS 406
Query: 60 LFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILK 119
R D D+ C+ + K ++ + + LRQ ++ +A A+VPI+K
Sbjct: 407 FNLRDADADM--------CVYVNDAQSKTLVIRRIAKYLRQH--MTKVACIAGAKVPIVK 456
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F SCD+S++ + G + S+ L + ID R + +VK WAK IN+P +GT
Sbjct: 457 FFDPESQTSCDLSVNQVLGILNSRMLRTYATIDPRVYRLGRIVKLWAKNRQINDPPSGTL 516
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS 239
+SY+ +LVL+ Q ++P L++ P + + + I +++
Sbjct: 517 SSYAFVMLVLYFLQRRPVPVVPCLQEALPPDFPHGPQPYFVTDDNSINAAFVGDLSSIER 576
Query: 240 DKYRKINRS--SLAHLFVSFLEKFSGLSLKASELGIC-----PFTGQWEHIRSNTRWLPN 292
Y +R+ SLA L ++F F E I P T N+ L
Sbjct: 577 AGYGAASRNTESLAELVLAFF-NFYAYDFTHHEQVITIRTLRPVTLDQTSFGKNSGVL-- 633
Query: 293 NHPLFIEDPFE 303
L I+DPF+
Sbjct: 634 ---LHIQDPFD 641
>gi|67901522|ref|XP_681017.1| hypothetical protein AN7748.2 [Aspergillus nidulans FGSC A4]
gi|40742346|gb|EAA61536.1| hypothetical protein AN7748.2 [Aspergillus nidulans FGSC A4]
gi|259484098|tpe|CBF80028.1| TPA: PAP/25A associated domain family (AFU_orthologue;
AFUA_5G07790) [Aspergillus nidulans FGSC A4]
Length = 999
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 120/257 (46%), Gaps = 30/257 (11%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQ 108
V FGS + L S D+DI CI++ K+++ LL D+L K G R+
Sbjct: 79 VHVFGSSGNKLCSSDSDVDI--------CITTTCKELEHVCLLADVL----AKNGMERVV 126
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
++HA+VPI+K ++CD++++N ++ + +ID R R + +++K W K
Sbjct: 127 CISHAKVPIVKIWDPELRLACDMNVNNTMALENTRMVRTYVEIDERVRPLAMIIKHWTKR 186
Query: 169 HDINNPK-TGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
+N+ GT +SY+ L++ QT P ILP L+ R + +R A
Sbjct: 187 RILNDAGLGGTLSSYTWICLIINFLQTREPPILPSLQ-------------ARPHKKRLTA 233
Query: 228 E--ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRS 285
+ +C+F+ S Y K N+ SL LF F ++ G L + I G+
Sbjct: 234 DGLVCSFDDDLDSLVGYGKQNKQSLGELFFQFF-RYYGHELDFEKYVISVREGRLISKEG 292
Query: 286 NTRWLPNNHPLFIEDPF 302
L N+ L +E+PF
Sbjct: 293 KGWHLLQNNRLCVEEPF 309
>gi|297610194|ref|NP_001064267.2| Os10g0188300 [Oryza sativa Japonica Group]
gi|110288742|gb|ABG65960.1| PAP/25A associated domain containing protein, expressed [Oryza
sativa Japonica Group]
gi|215697928|dbj|BAG92107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255679257|dbj|BAF26181.2| Os10g0188300 [Oryza sativa Japonica Group]
Length = 320
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 125/257 (48%), Gaps = 29/257 (11%)
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C+S K++ + + L + G R +Q + ARVPI+K + +SCDI ++NL
Sbjct: 57 CLSIDEKEMSKVDIILKLAHILHAGNLRNIQALTRARVPIVKLMDPNTGLSCDICVNNLL 116
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ +K L S+ID R R + +VK WAK+ +N GT +SY+ ++ + + Q+
Sbjct: 117 AVVNTKLLRDYSRIDKRLRPLAFIVKHWAKSRCVNETYQGTLSSYAYVIMCIHYLQS--Q 174
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN------RSSLA 251
ILP L+++ P V V N ICA+ D+ K+N + +L+
Sbjct: 175 RILPCLQEMEPTYYV----TVDNN-------ICAY------FDQVDKLNGFGAQCKDTLS 217
Query: 252 HLFVSFLEKFSGLSLKASELGICPFTGQW--EHIRSNTRWLPNN-HPLFIEDPFEQPENS 308
L F ++ + ++ I TG+ ++++ TR + N+ H + IEDPFE +
Sbjct: 218 RLLWGFF-RYWAYAHNYTKDVISIRTGRTISKNMKDWTRRIGNDRHLICIEDPFETSHDL 276
Query: 309 ARAVSEKNLAKISNAFE 325
R V +++ + FE
Sbjct: 277 GRVVDNRSIWALREEFE 293
>gi|56566240|gb|AAN75161.2| CID1 [Cryptococcus neoformans var. grubii]
Length = 727
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 150/357 (42%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ + + + S
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVVLIDDPS 82
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
G V+ + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 83 AKIDPGNFVES------MAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|302677318|ref|XP_003028342.1| hypothetical protein SCHCODRAFT_112660 [Schizophyllum commune H4-8]
gi|300102030|gb|EFI93439.1| hypothetical protein SCHCODRAFT_112660 [Schizophyllum commune H4-8]
Length = 1016
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 141/327 (43%), Gaps = 57/327 (17%)
Query: 29 MKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISSA 82
M V D+R+++E + +R T+EP FGS + R D+D+ C+ +
Sbjct: 57 MSVKEDVRKLLERL--IR--TIEPESRLLSFGSTANGFSLRNSDMDLC-------CLIDS 105
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDISIDNLC 137
G+++ S L +L L ++ ++ + HAR+PI+K + I+CDI +N
Sbjct: 106 GERLSASDLVTMLGDLLERETKFHVKPLPHARIPIVKLSLDPSPGLPLGIACDIGFENRL 165
Query: 138 GQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF-QTC 195
++ L + ID R R +VL +K W+K IN+P GT +SY LLV+F
Sbjct: 166 ALENTRLLMCYAMIDPTRVRTLVLFLKVWSKRRKINSPYKGTLSSYGYVLLVIFFLVHVK 225
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
PA+LP L+ + P + + + I F+ ++ N S+A L +
Sbjct: 226 NPAVLPNLQQMPPLRPIS-----KEDTHLGDKNIWFFDDIDVLRQRWHSENTESVAELLI 280
Query: 256 SFLEKFS--------------GLSLKASELG----ICPFTGQWEHIRSNTRWLPNNHPLF 297
F +S GL LK G + P G+ R R L
Sbjct: 281 DFFRYYSKDFLYNTGVASIRAGL-LKKDAKGWQNDLSP--GRVNDARERNR-------LC 330
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAF 324
IEDPFE N AR V++ L I F
Sbjct: 331 IEDPFETDFNVARCVTKDGLYTIRGEF 357
>gi|365982357|ref|XP_003668012.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
gi|343766778|emb|CCD22769.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
Length = 684
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 108/201 (53%), Gaps = 16/201 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R K IS LR+ V+ + S + + FGS+ ++L+ D+D
Sbjct: 209 IKDFVSYISPSREEIELRNKTISKLRKAVKELWS--DSQLHIFGSYATDLYLPGSDIDCV 266
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L DL R L+QKG +++ +A ARVPI+KF I D
Sbjct: 267 VN-------SKMGDKEQRQYLYDLARHLKQKGLTSQVEVIAKARVPIIKFVEKSSQIHID 319
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+S G R+++L+VK++ A +N+ TG +++ LV
Sbjct: 320 VSFERTNGVEAAKLIREWLSATPG-LRELILIVKQFLSARRLNDVHTGGLGGFTIICLV- 377
Query: 190 FHFQTCVPAI----LPPLKDI 206
+ F + P I + PL+++
Sbjct: 378 YSFLSMHPRIKTNDIDPLENL 398
>gi|193594236|ref|XP_001948967.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 1
[Acyrthosiphon pisum]
Length = 586
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 115/247 (46%), Gaps = 48/247 (19%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQI----KSKFLFWISQIDGRFRDMVLLVKEW 165
+ +A+VPIL+F I C + +D C + + L+ S++D R R +V+ +K W
Sbjct: 358 IVYAKVPILRFRWIGDG-GCKMDVDFCCNNVVGIRNTHLLYCYSRLDYRVRPLVVTIKLW 416
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
A H+IN+PK T +SYSL L+V+ Q+ P +LP L+ IY G++ ++
Sbjct: 417 ASHHNINDPKKMTLSSYSLVLMVINFLQSITPPVLPSLQCIY---------GMKFSSFTD 467
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI----------CP 275
I + + S +R N+ SL L + F E ++ + + + C
Sbjct: 468 IEFVHMHE--QLPSSGWRSDNKQSLGELLLQFFEYYNDFNFYKHAVSVRMGSPIPLESCR 525
Query: 276 FT-------GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL-AKISNAFEMT 327
GQW+ I IE+PFE+ N+AR+V+ N+ A+I A +
Sbjct: 526 MADAVKNNPGQWKFIG-------------IEEPFEK-TNTARSVNNHNVFAQIKEAITNS 571
Query: 328 HFRLTST 334
+ +L T
Sbjct: 572 YNQLKET 578
>gi|293331075|ref|NP_001169620.1| uncharacterized protein LOC100383501 [Zea mays]
gi|224030451|gb|ACN34301.1| unknown [Zea mays]
gi|413935342|gb|AFW69893.1| hypothetical protein ZEAMMB73_444453 [Zea mays]
gi|414881287|tpg|DAA58418.1| TPA: hypothetical protein ZEAMMB73_118166 [Zea mays]
Length = 607
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 124/287 (43%), Gaps = 45/287 (15%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ + D+D+ +E+ G+ SA + + + L D+LRA G+ ++ +
Sbjct: 328 YGSCANSFGTSHSDVDVCLEMETGA--ESAVEVLVR--LADVLRA----DGFENVEAITG 379
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI++ SCDI I+NL ++ L ++ID R + LVK WAK +N
Sbjct: 380 ARVPIVRMSDPGSGFSCDICINNLLAVANTRLLKDYARIDERLLQLAFLVKHWAKQRGVN 439
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG-------------NLVDDLKGVR 219
GT +SY+ L+ + Q P ILP L+ + P + VD L+G
Sbjct: 440 EAYRGTLSSYAYVLMCINFLQLREPRILPCLQAMEPTYTLTVDGTECAYFDRVDQLQGFG 499
Query: 220 ANAERQIAEIC-AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTG 278
A + I E+ F S +Y++ + G +++ E G
Sbjct: 500 AGNKASIGELLWGFFHYWASQHRYKR-----------DVISVRLGKTIRKQEKG------ 542
Query: 279 QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
W TR + H + IEDPFE + R V + + I E
Sbjct: 543 -W-----TTRVGGDRHLMCIEDPFETGHDLGRVVDRQTIWIIREEME 583
>gi|194889311|ref|XP_001977058.1| GG18455 [Drosophila erecta]
gi|190648707|gb|EDV45985.1| GG18455 [Drosophila erecta]
Length = 1361
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 114/231 (49%), Gaps = 25/231 (10%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F+ I I D++ +N G IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 1090 ARVPILRFKDITNGIEVDLNFNNCVG-IKNTYLLQLYAQMDWRTRPLVVIVKLWAQYHDI 1148
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
N+ K T +SYSL L+VL + Q CVP +LP L +YP Q+ +
Sbjct: 1149 NDAKRMTISSYSLVLMVLHYLQHACVPHVLPCLHSLYPEKF-------------QLGQPD 1195
Query: 231 AFNIARFSS-DKYRKINRSSLAHLFVSFLEKFSGLSLKASEL-----GICPFTGQWEHIR 284
++ + Y+ +N +L + F + +S + + GI P + +
Sbjct: 1196 CLDLDLIEPIEPYQALNTQTLGEHLLGFFKYYSSFDFRNFAISIRTGGILPVS-TCRMAK 1254
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRLTST 334
S + L IE+PF+ N+AR+V + ++ F ++ RL T
Sbjct: 1255 SPKNDVYQWKELNIEEPFDL-SNTARSVYDAPTFERVKAVFLVSARRLDHT 1304
>gi|326435530|gb|EGD81100.1| hypothetical protein PTSG_11137 [Salpingoeca sp. ATCC 50818]
Length = 272
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 117/254 (46%), Gaps = 22/254 (8%)
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
Q V ARVPI K D++ N+ G + ++ + +++D RFR + LVK WAK
Sbjct: 29 QVVKTARVPIAKLIDKKTGTEVDVNCANVLGLVNTRLIRTYTKVDDRFRHLGFLVKLWAK 88
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
A ++N+ GT +SY+ ++ + + Q C P +LP L+D + ++ R
Sbjct: 89 ACNLNDASMGTLSSYAWLIMTIHYLQRCDPPVLPNLQD----------RRIKGPPRRYHG 138
Query: 228 EICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRS 285
CA+ ++++ + ++ N S+ L+ F + + G ++ C T + + +
Sbjct: 139 FDCAYCSDLSKL-QEVWKSRNTQSIGELYYGFFDYYCGFDFD-RDIITCHTTKR-KPKAA 195
Query: 286 NTRWLPN--NHPLFIEDPFEQPENSARAVSEKNLAKIS----NAFEMTHFRLTSTNQTRY 339
RW N P+ I+DP E+ N + +S +A +A++ FR Y
Sbjct: 196 EKRWSRNYAARPMAIQDPIEKSHNLGKGISRLGVAVAHQHDRHAYDHHEFR-HGRRTNHY 254
Query: 340 ALLSSLARPFILQF 353
+S +P + QF
Sbjct: 255 ERRNSRLKPLLDQF 268
>gi|321263195|ref|XP_003196316.1| hypothetical Protein CGB_I0640W [Cryptococcus gattii WM276]
gi|54112170|gb|AAV28772.1| CID1p [Cryptococcus gattii]
gi|317462791|gb|ADV24529.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 728
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I KV + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKVDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|328718959|ref|XP_003246627.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog A-like isoform 2
[Acyrthosiphon pisum]
Length = 612
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 115/247 (46%), Gaps = 48/247 (19%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQI----KSKFLFWISQIDGRFRDMVLLVKEW 165
+ +A+VPIL+F I C + +D C + + L+ S++D R R +V+ +K W
Sbjct: 384 IVYAKVPILRFRWIGDG-GCKMDVDFCCNNVVGIRNTHLLYCYSRLDYRVRPLVVTIKLW 442
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
A H+IN+PK T +SYSL L+V+ Q+ P +LP L+ IY G++ ++
Sbjct: 443 ASHHNINDPKKMTLSSYSLVLMVINFLQSITPPVLPSLQCIY---------GMKFSSFTD 493
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI----------CP 275
I + + S +R N+ SL L + F E ++ + + + C
Sbjct: 494 IEFVHMHE--QLPSSGWRSDNKQSLGELLLQFFEYYNDFNFYKHAVSVRMGSPIPLESCR 551
Query: 276 FT-------GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL-AKISNAFEMT 327
GQW+ I IE+PFE+ N+AR+V+ N+ A+I A +
Sbjct: 552 MADAVKNNPGQWKFIG-------------IEEPFEK-TNTARSVNNHNVFAQIKEAITNS 597
Query: 328 HFRLTST 334
+ +L T
Sbjct: 598 YNQLKET 604
>gi|341886819|gb|EGT42754.1| hypothetical protein CAEBREN_19005 [Caenorhabditis brenneri]
Length = 459
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 122/270 (45%), Gaps = 20/270 (7%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GSF + DLD +I++ S AG K + L + L ++ ++ V
Sbjct: 114 GSFAAGFDIPSSDLDFTIKVE-----SLAGCKTPAAKLNIIKEKLAKEQEAFNVKRVVGG 168
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
RVP+L + I D++IDN ++ ++FL W SQ+D R +V VK WA +
Sbjct: 169 RVPVLVLQHRATQIDVDVTIDNDTPKLNTQFLIWYSQVDARVAPLVRAVKYWASETGVEC 228
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLK-GVRANAERQIAEICA 231
K G NS+S+ LLV+ Q V PA+LP L++ +P + ++K + R +AE
Sbjct: 229 SKKGRLNSFSICLLVIHFLQKGVSPAVLPNLQETFP-EINGEIKISADPSKRRHLAE--- 284
Query: 232 FNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---CPFTGQWEHIRSNTR 288
+ + N SL L++ F + + + I ++ RS T+
Sbjct: 285 ----DLRRQGWSQQNTDSLGALYLGFFQYYRKFDFTTRWISIKRGTSLVKRYAKDRSPTQ 340
Query: 289 WLPNNHPLFIEDPFE-QPENSARAVSEKNL 317
P + + +EDPF P N A V + ++
Sbjct: 341 VHPRGY-IVVEDPFLITPWNCAGTVRQGDI 369
>gi|322800718|gb|EFZ21622.1| hypothetical protein SINV_01930 [Solenopsis invicta]
Length = 546
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 142/306 (46%), Gaps = 34/306 (11%)
Query: 50 VEPFGSFVSNLFSRWGDLDISI--ELSNGSCISS----AGKKVK-------QSLLGDLLR 96
V PFGS ++ + DLD+ + + +N S I+S K +K + LG L
Sbjct: 196 VLPFGSSINGFGRKRCDLDLLLVPDGNNESNIASRLVFHTKSMKHNDRNETKEFLGILAN 255
Query: 97 ALRQ-KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRF 155
++ G ++ + ARVPI+KF + + CD+S N+ ++ L ++D R
Sbjct: 256 GMQYFIPGVYNVRKILEARVPIIKFRYDYTHTECDLSAINMTAIYMTELLNLYGEMDWRV 315
Query: 156 RDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDD 214
R +V+ ++ WAK+ +I + G + ++ L+LLVLF+ Q ILP LK + +D
Sbjct: 316 RPLVITIRVWAKSQEITSDVPGQWITNFPLTLLVLFYLQQ--KKILPSLKMLKTYATRND 373
Query: 215 LKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELG 272
++ AE I C F +I + +D K N+ SL L F E +S + G
Sbjct: 374 MR----TAENGID--CTFLRDINKLPADYKYKSNQDSLETLLYGFFEYYSTFDFHVN--G 425
Query: 273 ICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLT 332
IC G IR P+ PL I +P E N + VS L +I + L
Sbjct: 426 ICIREGV--QIRK-----PSRSPLHITNPLETTLNVCKNVSLYELNRIITKLHDAIYALE 478
Query: 333 STNQTR 338
+++++R
Sbjct: 479 TSDKSR 484
>gi|32564963|ref|NP_871880.1| Protein MUT-2, isoform b [Caenorhabditis elegans]
gi|351062122|emb|CCD70042.1| Protein MUT-2, isoform b [Caenorhabditis elegans]
Length = 338
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 131/305 (42%), Gaps = 46/305 (15%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
+N+L ++D +E++ +M L+ ++ + P GS V+ L +
Sbjct: 42 DFNILSISMQDHFDTTKQPKEEFGKKMDWCYQLKNIISKNNPTWLFNIVPTGSTVTGLAT 101
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF------------- 109
+ DLD++I I A + ++Q G + ++ +R +Q
Sbjct: 102 KNSDLDVAIH------IPQAARVLEQEERGRNITDDERQASWREIQLEILQIVRLNLQND 155
Query: 110 --------------VAHARVPILKFETIHQNISCDISI--DNLCGQIKSKFLF-WISQID 152
+ A++ ILK T+ I CDIS+ D + + FL ++ ID
Sbjct: 156 EQINSRINWEHGIQLVQAQIQILKVMTV-DGIDCDISVVMDRFLSSMHNSFLIRHLAHID 214
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGN 210
GRF + +VK+WA + + +PK G FNSY+L LLV+ HF C P ILP L++I+ +
Sbjct: 215 GRFAPLCAIVKQWAASTKVKDPKDGGFNSYALVLLVI-HFLQCGTFPPILPNLQEIFKKD 273
Query: 211 LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASE 270
A ++ I F N + LA LF+ FL +S + K +
Sbjct: 274 ------NFIAWDDKVYPSILNFGAPLPKPLPRIAPNNAPLARLFIEFLYYYSMFNFKENY 327
Query: 271 LGICP 275
+G P
Sbjct: 328 IGARP 332
>gi|449544109|gb|EMD35083.1| hypothetical protein CERSUDRAFT_54323, partial [Ceriporiopsis
subvermispora B]
Length = 556
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 154/352 (43%), Gaps = 44/352 (12%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRW 64
L D + L P +++ + V D+R+++E + T+EP FGS + R
Sbjct: 46 LIDFVVQLLPTQDE----LAVKEDVRKLLERLIQ----TIEPESRLLSFGSTANGFSLRN 97
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET-- 122
D+D+ C+ + ++ S L +L L ++ ++ + HAR+PI+K
Sbjct: 98 SDMDLC-------CLIDSEDRLSASDLVTMLGDLLERETKFHVKPLPHARIPIVKLTLDP 150
Query: 123 ---IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGT 178
+ I+CDI +N ++ L + ID R R MVL +K W+K IN+P GT
Sbjct: 151 SPGLPFGIACDIGFENRLALENTRLLMCYAMIDPARVRTMVLFLKVWSKRRKINSPYKGT 210
Query: 179 FNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLV--DDLKGVRANAERQIAEICAFNIA 235
+SY LLV+ F P +LP L+ + P + +D N R ++ F+
Sbjct: 211 LSSYGYVLLVIYFLVHVKSPPVLPNLQQMSPLRPISHEDTHLNGYNIWR-VSWTLFFDDI 269
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFS-----GLSLKASELGICPFTGQ-WE---HIRSN 286
K++ N ++A L + F + +S + + + G+ + W+ H R
Sbjct: 270 ELLRQKWKSSNTETVAELLIGFFKFYSREFAYNIGVASIRDGLLAKESKGWQSELHERGT 329
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTR 338
R + L IEDPFE N AR V+ L I F M R+ S R
Sbjct: 330 PR---ERNRLCIEDPFETDFNVARCVTRDGLYTIRGEF-MRALRILSARPER 377
>gi|281211277|gb|EFA85442.1| adenylyl cyclase [Polysphondylium pallidum PN500]
Length = 1439
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 147/332 (44%), Gaps = 42/332 (12%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIEL------SNGSC------ISSAGKKVKQSLLGDL 94
G+ ++P+GSFV+ + + D+D+ + +N ++ + KK K L +
Sbjct: 1120 GSKLKPYGSFVNGVQTASSDIDVCFSVVGVPTDTNSKLLHLMKRVAISIKKSKYPLPATI 1179
Query: 95 LRALRQK------GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+ L + Y + + +RVPIL+F+ I +IS D+ +N S +
Sbjct: 1180 SQFLTYQFIYISDTSYELEKIIRFSRVPILRFKDIGSDISFDMCFNNSLPVGNSLLIKEY 1239
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP 208
+ ID R + ++LL+K WA DIN+ GT +SYS +V+F+ Q P +LP L+
Sbjct: 1240 TMIDARAKVLMLLIKYWASRKDINDASMGTLSSYSWLNMVIFYLQCVSPPVLPCLQSTLT 1299
Query: 209 GNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKA 268
+ +++E + + + ++ N SL LF F +S
Sbjct: 1300 NTTPKS--SIISSSEDGWKFLNSLTL------NFKSTNTMSLFQLFSGFFSFYSRFDF-- 1349
Query: 269 SELGICPFTGQWEHIRSNTR-WLPNNHP--LFIEDPFEQPENSARAVSEKNLAKISNAFE 325
+ L I G +IR T+ +L +N + IEDPF +N A +V NAF+
Sbjct: 1350 ANLLITIKRGCLTNIRMATKLYLDHNRKQNICIEDPFNPQQNPAASVGR-------NAFD 1402
Query: 326 MTHFRLTSTNQTRYALLSSLARPFILQFFGES 357
+ + L S Q LSSL + F ES
Sbjct: 1403 VILYELKSAEQK----LSSLKSNETVDVFMES 1430
>gi|334362793|gb|AEG78588.1| CID1 [Cryptococcus gattii]
Length = 728
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I KV + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKVDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|24641449|ref|NP_572766.1| wispy [Drosophila melanogaster]
gi|74871733|sp|Q9VYS4.1|GLD2B_DROME RecName: Full=Poly(A) RNA polymerase gld-2 homolog B; AltName:
Full=Protein wispy
gi|7292717|gb|AAF48114.1| wispy [Drosophila melanogaster]
gi|443906779|gb|AGD79330.1| RE03648p1 [Drosophila melanogaster]
Length = 1373
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 66/98 (67%), Gaps = 3/98 (3%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F+ I I D++ +N G IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 1086 ARVPILRFKDISNGIEVDLNFNNCVG-IKNTYLLQLYAQMDWRTRPLVVIVKLWAQYHDI 1144
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYP 208
N+ K T +SYSL L+VL + Q CVP +LP L +YP
Sbjct: 1145 NDAKRMTISSYSLVLMVLHYLQHACVPHVLPCLHSLYP 1182
>gi|393219460|gb|EJD04947.1| PAP/OAS1 substrate-binding domain-containing protein, partial
[Fomitiporia mediterranea MF3/22]
Length = 547
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/374 (27%), Positives = 161/374 (43%), Gaps = 68/374 (18%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNL 60
L L D + L P E+ + V D+R+++E + +R T+EP FGS +
Sbjct: 51 LSQCLLDFVIQLLPTVEE----LAVKEDVRKLLERL--IR--TIEPESRLLSFGSTANGF 102
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
R D+D+ C+ + +++ S L ++ L ++ ++ + HAR+PI+K
Sbjct: 103 SLRNSDMDMC-------CLIDSDQRLSASDLVTMVGDLLERETKFHVKPLPHARIPIVKL 155
Query: 121 ET-----IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNP 174
+ I+CDI +N ++ L S +D R R +VL +K W+K IN+P
Sbjct: 156 SLDPSPGLPLGIACDIGFENRLALENTRLLLCYSMVDPTRVRTLVLFLKVWSKRRKINSP 215
Query: 175 KTGTFNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA--EICA 231
GT +SY LLV+ F P +LP L+ I P L + A E +A I
Sbjct: 216 YEGTLSSYGYVLLVIYFLVHVKNPPVLPNLQQIPP------LHPIPAE-EHHLAGRNIWF 268
Query: 232 FNIARFSSDKYRKINRSSLAHLFVSFLEKF-------SGL-SLKASELGICPFTGQWEHI 283
F+ +++ N S+A L + F + SG+ S++A L
Sbjct: 269 FDDIELLRQRWKSQNSESVAELLIDFFRYYAKDFTYNSGVASIRAGLLK----------- 317
Query: 284 RSNTRWLPNNHPLF----------IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTS 333
+ + W +N P F IEDPFE N AR V+ L I F M R+
Sbjct: 318 KESKGWQGDNDPRFKDGRERNRLCIEDPFETDYNVARCVTRDGLYVIRGEF-MRASRILQ 376
Query: 334 TNQTR-YALLSSLA 346
Q R Y L+ L
Sbjct: 377 HRQERGYQALAQLC 390
>gi|54112136|gb|AAV28739.1| CID1p [Cryptococcus gattii]
Length = 728
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I KV + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKVDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|46403037|gb|AAS92532.1| CID1 [Cryptococcus gattii]
Length = 728
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I KV + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKVDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|388856182|emb|CCF50173.1| related to caffeine-induced death protein 1 Cid1 [Ustilago hordei]
Length = 1208
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 98/366 (26%), Positives = 150/366 (40%), Gaps = 66/366 (18%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L PIL P E++ + L + V GA + FGS + R D
Sbjct: 386 LSPIL--------PTEEEYRIKEATRRQLERLSNRVSP--GAKLLAFGSMANGFALRNSD 435
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE----- 121
+D+ + G + L+ L + +R++ + + + AR+PI+K
Sbjct: 436 MDLCCLMGKGDDGHPTTQHTASELVEILGQLIREETDFNVMP-LPKARIPIIKINRSPTA 494
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDG-RFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ I+CDI +N ++ L + +D R R +VL VK WAK +N+P GT +
Sbjct: 495 DLPYEIACDIGFENRLALENTRLLLSYAMVDPPRLRTLVLFVKVWAKRRKLNSPYMGTLS 554
Query: 181 SYSLSLLVLFHF-QTCVPAIL------PPLKDIYPGNLV---------DDLKGVRANAER 224
SY +LLVLF PA+L PP + + P +V DD+ +R
Sbjct: 555 SYGYTLLVLFFLAHVKKPAVLPNLQRMPPTRPMEPEEMVLNGNNIYFYDDVAALRKEWSS 614
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG----LSLKASELGICPFTGQW 280
Q E N+ L H F F ++FS +SLK SE G+
Sbjct: 615 QNTE----NVGEL------------LIHFFRYFSKEFSYSRDVISLK-SETGLV------ 651
Query: 281 EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTST-NQTRY 339
+ + W N L IEDPF+ N +R V++ L I F LT+T Q
Sbjct: 652 --SKDSMDW---NAELCIEDPFQMGYNVSRTVTKDGLYTIRGEFMRASRILTNTRGQKVS 706
Query: 340 ALLSSL 345
AL++ L
Sbjct: 707 ALIAEL 712
>gi|334362821|gb|AEG78615.1| CID1 [Cryptococcus gattii]
Length = 728
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I KV + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKVDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|341895680|gb|EGT51615.1| CBN-CID-1 protein [Caenorhabditis brenneri]
Length = 1489
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS ++ L D+DI + +G + + K+ +L + LR+ G +R+Q +
Sbjct: 1116 FGSVMTGLSVNCSDIDICLRFGDGD-VPPKDRTPKEVILK-VEEVLRKCGMVKRVQAIVT 1173
Query: 113 ARVPILKFE---TIHQNISCDISIDNLCGQIKSKFL--FWISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF+ + + DIS N+ + L + + D RF + L +K+WAK
Sbjct: 1174 AKVPIVKFQLRLKTGEMVDADISYYNILAIYNTALLREYTLWTPDSRFAKLALFIKKWAK 1233
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN-----LVDDLKGVRANA 222
+ DI + G+ +SY+ +L++ + Q C P +LP L++ + + LVD+ A
Sbjct: 1234 SCDIGDASRGSLSSYAHIILLISYLQNCDPPVLPRLQEDFRSDNDEKRLVDNWNTSYAQV 1293
Query: 223 ERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLK------ASELGICPF 276
E ++ + N + N+ + A L + + + +S + E+ +
Sbjct: 1294 EDELVQ----NWPK---------NKETCAQLLIGYFDYYSRYDFRNFVVQCRREMILSKM 1340
Query: 277 TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
W P+ +EDPF+ N + V++K I F
Sbjct: 1341 EKDWP------------RPICVEDPFDLNHNLSSGVTKKMFVFIMKVF 1376
>gi|222612546|gb|EEE50678.1| hypothetical protein OsJ_30926 [Oryza sativa Japonica Group]
Length = 828
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 134/282 (47%), Gaps = 37/282 (13%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ C+S K++ + + L + G R +Q +
Sbjct: 548 YGSCANSFGFSNSDIDL--------CLSIDEKEMSKVDIILKLAHILHAGNLRNIQALTR 599
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K + +SCDI ++NL + +K L S+ID R R + +VK WAK+ +N
Sbjct: 600 ARVPIVKLMDPNTGLSCDICVNNLLAVVNTKLLRDYSRIDKRLRPLAFIVKHWAKSRCVN 659
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
GT +SY+ ++ + + Q+ ILP L+++ P V V N ICA+
Sbjct: 660 ETYQGTLSSYAYVIMCIHYLQS--QRILPCLQEMEPTYYVT----VDNN-------ICAY 706
Query: 233 NIARFSSDKYRKIN------RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW--EHIR 284
D+ K+N + +L+ L F ++ + ++ I TG+ ++++
Sbjct: 707 ------FDQVDKLNGFGAQCKDTLSRLLWGFF-RYWAYAHNYTKDVISIRTGRTISKNMK 759
Query: 285 SNTRWLPNN-HPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
TR + N+ H + IEDPFE + R V +++ + FE
Sbjct: 760 DWTRRIGNDRHLICIEDPFETSHDLGRVVDNRSIWALREEFE 801
>gi|195044023|ref|XP_001991738.1| GH11902 [Drosophila grimshawi]
gi|193901496|gb|EDW00363.1| GH11902 [Drosophila grimshawi]
Length = 610
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 90/336 (26%), Positives = 152/336 (45%), Gaps = 52/336 (15%)
Query: 8 EPILKDILGMLNPLR-EDWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWG 65
E I + IL + R D RM+ ++ L +V +++ + A +PFGS V N F + G
Sbjct: 175 ESIEQQILKLYEHTRLNDLGVRMRFLAAL-QVQQAISGMFPDALAQPFGSSV-NGFGKMG 232
Query: 66 -DLDISIELSNGSCIS---------------------SAGKKVKQ---SLLGDLLRALRQ 100
DLD+ + + I+ S G+ Q +GDLL
Sbjct: 233 CDLDLILRFDGETTITDGQEMSANEPSRLIYHTKENMSNGRSQTQRQMECIGDLLHLFLP 292
Query: 101 KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
G ++ + ARVPI+K+ H N+ D+S+ NL G S+ L+ ++D R R +
Sbjct: 293 --GVCHVRRILQARVPIIKYHHEHLNLEVDLSMSNLTGFYMSELLYMFGELDPRVRPLTF 350
Query: 161 LVKEWAKAHDINNPKTGTFNS-YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVR 219
++ WA++ + NP G + S +SL+ LV++ Q ILP + G LV K
Sbjct: 351 SIRRWAQSCGLTNPSPGRWISNFSLTCLVMYFLQQLRQPILPAI-----GALV---KAAN 402
Query: 220 ANAERQIAEICAFNIARFSSDK--YRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFT 277
A+ R + AR +D+ +R N S+L+ L + F E +S + +
Sbjct: 403 ASDVRITEDGINCTFAR-DTDRLGFRSRNTSNLSELLLQFFEFYSQFDFHNRAISL---- 457
Query: 278 GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R+ T+ P++ ++I +P EQ N ++ VS
Sbjct: 458 ---NEGRALTK--PDHSAMYIANPLEQLLNVSKNVS 488
>gi|195356065|ref|XP_002044502.1| GM13245 [Drosophila sechellia]
gi|194131804|gb|EDW53738.1| GM13245 [Drosophila sechellia]
Length = 430
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 120/245 (48%), Gaps = 30/245 (12%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F+ I I D++ +N C IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 148 ARVPILRFKDISNGIEVDLNFNN-CVGIKNTYLLQLYAQMDWRTRPLVVIVKLWAQYHDI 206
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
N+ K T +SYSL L+VL + Q CVP +LP L +YP Q+ +
Sbjct: 207 NDAKRMTISSYSLVLMVLHYLQHACVPHVLPCLHSMYPEKF-------------QLGQQD 253
Query: 231 AFNIARFSS-DKYRKINRSSLAHLFVSFLEKFSGLSLKASEL-----GICPFTGQWEHIR 284
++ + Y+ +N +L + F + +S + + G+ P + +
Sbjct: 254 CLDLDLIEPIEPYQALNTQTLGEHLLGFFKYYSSFDFRNFAISIRTGGVLPVS-TCRMAK 312
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRLTSTNQTRYALLS 343
S + L IE+PF+ N+AR+V + ++ F ++ RL T L+
Sbjct: 313 SPKNDVYQWKELNIEEPFDL-SNTARSVYDAPTFERVKAVFLVSARRLDHTLD-----LA 366
Query: 344 SLARP 348
++ RP
Sbjct: 367 TIFRP 371
>gi|357138525|ref|XP_003570842.1| PREDICTED: poly(A) RNA polymerase cid11-like [Brachypodium
distachyon]
Length = 566
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 142/324 (43%), Gaps = 38/324 (11%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESV-ESLRGATVEPFGSFVSNLFSRWGDLDISI 71
D+L + L+ E + K + + +S+ + A + +GS ++ + D+D+ +
Sbjct: 246 DLLSLYESLKPSEEHKSKQTQLIDSLAKSLSKEWPNARLHLYGSCANSFGTSHSDVDVCL 305
Query: 72 ELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
E+ G+ + ++ Q L D+L + ++ + ARVPI++ SCDI
Sbjct: 306 EIEIGT---ESTVEILQRL-ADILHG----DNFDDVEAITSARVPIVRMLDPGSGFSCDI 357
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
I+NL +K L +QIDGR + +VK WAK +N GT +SY+ L+ +
Sbjct: 358 CINNLFAVANTKLLKDYAQIDGRLLQLASIVKHWAKLRGVNETYRGTLSSYAYVLMCISF 417
Query: 192 FQTCVPAILPPLKDIYPGNL--VDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
Q P ILP L+ + P + VDD K + Q+ + A N+ S
Sbjct: 418 LQLREPKILPCLQAMDPTYIMVVDDTKCTYFDDIHQLHDFGAE-------------NKES 464
Query: 250 LAHLFVSFLEKFS--------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDP 301
+A L +F ++ +S++ ++ I W TR + H + IEDP
Sbjct: 465 IAELLWAFFHYWAFQHDYRKDVISIRMGKI-ISKKEKDW-----TTRVGNDRHLMCIEDP 518
Query: 302 FEQPENSARAVSEKNLAKISNAFE 325
FE + R V + + I FE
Sbjct: 519 FEISHDLGRVVDRQTIRIIIEEFE 542
>gi|19115813|ref|NP_594901.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe 972h-]
gi|15213942|sp|O13833.2|CID1_SCHPO RecName: Full=Poly(A) RNA polymerase protein cid1; AltName:
Full=Caffeine-induced death protein 1
gi|393715400|pdb|4E7X|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715401|pdb|4E7X|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715402|pdb|4E7X|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715403|pdb|4E7X|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715405|pdb|4E80|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715406|pdb|4E80|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715407|pdb|4E80|C Chain C, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715408|pdb|4E80|D Chain D, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715409|pdb|4E8F|A Chain A, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|393715410|pdb|4E8F|B Chain B, Structural Basis For The Activity Of A Cytoplasmic Rna
Terminal U- Transferase
gi|4324457|gb|AAD16889.1| caffeine-induced death protein 1 [Schizosaccharomyces pombe]
gi|5524947|emb|CAB50789.1| poly(A) polymerase Cid1 [Schizosaccharomyces pombe]
Length = 405
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 125/299 (41%), Gaps = 49/299 (16%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A + FGS S L + D+D+ + + + + + + L+ +
Sbjct: 83 AELVAFGSLESGLALKNSDMDLCVLMDSRVQSDTIALQFYEELIAEGFEG---------- 132
Query: 108 QFVAHARVPILKFETIHQN-----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+F+ AR+PI+K + +N CDI +N + L +++D R + MVLLV
Sbjct: 133 KFLQRARIPIIKLTSDTKNGFGASFQCDIGFNNRLAIHNTLLLSSYTKLDARLKPMVLLV 192
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WAK IN+P GT +SY L+VL++ + I PP ++P L+ LK
Sbjct: 193 KHWAKRKQINSPYFGTLSSYGYVLMVLYYL---IHVIKPP---VFPNLLLSPLK------ 240
Query: 223 ERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASELGIC---- 274
Q + F++ DK I N SSL L F +F + E +
Sbjct: 241 --QEKIVDGFDVG--FDDKLEDIPPSQNYSSLGSLLHGFF-RFYAYKFEPREKVVTFRRP 295
Query: 275 ---------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+T EH S + + + + L IEDPFE N R VS L +I F
Sbjct: 296 DGYLTKQEKGWTSATEHTGSADQIIKDRYILAIEDPFEISHNVGRTVSSSGLYRIRGEF 354
>gi|299740994|ref|XP_002910390.1| CID1p [Coprinopsis cinerea okayama7#130]
gi|298404505|gb|EFI26896.1| CID1p [Coprinopsis cinerea okayama7#130]
Length = 577
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 155/359 (43%), Gaps = 61/359 (16%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNL 60
L L D + L P +E+ M V D+R+++E + +R T+EP FGS +
Sbjct: 53 LSQCLFDFVIQLLPTQEE----MAVKEDVRKLLERL--IR--TIEPDSRLLSFGSTANGF 104
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
R D+D+ + + +S+A ++LGDLL ++ ++ + HAR+PI+K
Sbjct: 105 SLRNSDMDLCCLIDSDDKLSAADLV---TMLGDLL----ERETKFHVKPLPHARIPIVKL 157
Query: 121 ET-----IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNP 174
+ I+CDI +N ++ L + ID R R MVL +K W+K IN+P
Sbjct: 158 TLDPSPGLPHGIACDIGFENRLALENTRLLMCYAMIDPTRVRTMVLFLKVWSKRRKINSP 217
Query: 175 KTGTFNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFN 233
GT +SY LLV+ F P +LP L+ + P + + N I F+
Sbjct: 218 YKGTLSSYGYVLLVIYFLVHVKNPPVLPNLQQMPPLRPI-TTEETHLNGH----NIWFFD 272
Query: 234 IARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASEL------------------GICP 275
++R N S+A L+VS F L L A + G+
Sbjct: 273 DIDLLRQRWRSENTESVAELYVSHPFTFLLLVLIAIPMARMIDFFRYFARDFQYNNGVAS 332
Query: 276 FTG--------QWEHIRSNTRWLPN--NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
W+ + ++R+ P + L IEDPFE N AR V++ L I F
Sbjct: 333 IRAGLLKKDAKGWQSDQYSSRYDPGRERNRLCIEDPFELDYNVARCVTKDGLYTIRGEF 391
>gi|307195642|gb|EFN77484.1| Poly(A) RNA polymerase, mitochondrial [Harpegnathos saltator]
Length = 592
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 139/307 (45%), Gaps = 34/307 (11%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSN------GSCISSAGKKVK-------QSLLGDLLR 96
V PFGS ++ + DLD+ + +N S + K ++ + +G L
Sbjct: 233 VLPFGSSINGFGRKKCDLDLVLVPANIKEDNVNSRLIFHSKTMRINERYETKEFMGILAS 292
Query: 97 ALRQ-KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRF 155
+++ G L+ + ARVPI+KF + + CD+S N+ S+ L +ID R
Sbjct: 293 SMQHFIPGVENLRRILEARVPIIKFNFEYTRLECDLSTTNMSAVYMSELLHLYGEIDWRV 352
Query: 156 RDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDD 214
R +V +++ WAK +I G + ++SLSLLVLF+FQ ILP L+ + DD
Sbjct: 353 RPLVSVIRNWAKVQEITCDSPGPWITNFSLSLLVLFYFQQ--KNILPSLRMLKTYATRDD 410
Query: 215 LKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELG 272
++ + C F +I + ++ K N+ +L L + F E +S G
Sbjct: 411 IRHTENGID------CTFLRDINKLPNEYKYKSNQENLEALLLDFFEFYSLFDFYTK--G 462
Query: 273 ICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLT 332
IC G IR P+ PL I +P E N A+ V+ L ++ F L
Sbjct: 463 ICIREGI--PIRK-----PSRLPLHIVNPLETTLNVAKNVTIYELNRLKEKAHDAIFILE 515
Query: 333 STNQTRY 339
+++++ Y
Sbjct: 516 TSDKSNY 522
>gi|308485806|ref|XP_003105101.1| hypothetical protein CRE_20741 [Caenorhabditis remanei]
gi|308257046|gb|EFP00999.1| hypothetical protein CRE_20741 [Caenorhabditis remanei]
Length = 821
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 101/204 (49%), Gaps = 19/204 (9%)
Query: 113 ARVPI--LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
A+VPI LK I + + DI+++N+ G S + S +D RF + LLVK WA A+
Sbjct: 595 AKVPIIKLKLNGIFKELEVDINVNNIAGIYNSHLTHYYSLVDARFPVLALLVKHWAGANY 654
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
INN + G NSY++ LLV+ HF C PA+LP L+ ++P L I++
Sbjct: 655 INNAQAGYLNSYTVILLVV-HFLQCGVSPAVLPNLQYVFPDKFDKKLPLDELLLYGDISD 713
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
++ N SL LF+ F +S + + I +GQ RS
Sbjct: 714 KLPVSVP----------NTWSLGELFIGFFHYYSNFDFEKYAISI--RSGQVVP-RSLLP 760
Query: 289 WLPNNHPLFIEDPFEQPENSARAV 312
N+P+FIE+PF+ N+AR+V
Sbjct: 761 RDTANYPMFIEEPFDAI-NTARSV 783
>gi|432874392|ref|XP_004072474.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Oryzias latipes]
Length = 483
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 137/301 (45%), Gaps = 43/301 (14%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D E + + L+E ++S+ S+ A + GS ++ L R D D+ C+ G
Sbjct: 177 DLERKEVFRARLQEDIQSIFSV--ARLYLTGSSMNGLGCRSSDADL--------CLVITG 226
Query: 84 KKVKQSL-LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K L + LR L + Y + A+VPILKF+ ++ D++I+N G I++
Sbjct: 227 NKKPDPLSVLSRLRKLFRTLSYVEGTCLIRAKVPILKFKEKGSDLEFDLNINNTVG-IRN 285
Query: 143 KFLFW-ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
FL + D R R M+L+VK+WA H IN+ GT +SY+L L+VL + QT +LP
Sbjct: 286 TFLLRSYAHADPRVRPMILVVKKWACHHQINDASKGTLSSYTLVLMVLHYLQTVRDPVLP 345
Query: 202 PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
L+ +P + EI Y N+SSL L + FL +
Sbjct: 346 SLQRDHPDCFSPCM------------EIDMVPEGSTHVPPYISRNQSSLGELLLGFLRYY 393
Query: 262 SGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP-------LFIEDPFEQPENSARAVSE 314
ASE + Q +R T + P + + +E+PFE+ N ARAV E
Sbjct: 394 ------ASEFS---WDKQVISVREATAF-PKTYAQEWRKKFICVEEPFER-NNVARAVHE 442
Query: 315 K 315
K
Sbjct: 443 K 443
>gi|148230683|ref|NP_001086580.1| poly(A) RNA polymerase GLD2-B [Xenopus laevis]
gi|82182837|sp|Q6DFA8.1|GLD2B_XENLA RecName: Full=Poly(A) RNA polymerase GLD2-B; Short=xGLD-2; AltName:
Full=PAP-associated domain-containing protein 4-B
gi|49903424|gb|AAH76832.1| MGC83852 protein [Xenopus laevis]
Length = 509
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 134/301 (44%), Gaps = 43/301 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH- 112
GS ++ R D D+ C+ + + Q+ + +L K Y RL ++
Sbjct: 228 GSSLNGFGIRSSDADL--------CLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERP 279
Query: 113 ----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF D++++N+ G I++ FL + +D R R +VL++K+WA
Sbjct: 280 QFIRAKVPIVKFRDKVSGAEFDLNVNNVVG-IRNTFLLRTYAYLDKRVRPLVLVIKKWAN 338
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H IN+ GT +SY++ L+VL + QT ILP L+ YP + + +
Sbjct: 339 HHGINDASRGTLSSYTIVLMVLHYLQTLPEPILPSLQKKYP-------ECFDRTMQLHLV 391
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF------SGLSLKASELGICPFTGQWE 281
NI +F S N + L L + FL+ F S + E P T +E
Sbjct: 392 HQAPRNIPQFLSK-----NETPLGDLLLGFLKYFAVEFDWSKDIISLREAKALPRTDDYE 446
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL 341
W N + +E+PF+ N+ARAV EK + A + + N+ Y+L
Sbjct: 447 -------W--RNKYICVEEPFDG-SNTARAVYEKQKFDLIRAEFLKAWVALRDNRDLYSL 496
Query: 342 L 342
L
Sbjct: 497 L 497
>gi|347969656|ref|XP_001231185.2| AGAP003314-PA [Anopheles gambiae str. PEST]
gi|333469670|gb|EAU75994.2| AGAP003314-PA [Anopheles gambiae str. PEST]
Length = 639
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 106/221 (47%), Gaps = 27/221 (12%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL--GDLLRALRQKG 102
L G+T+ FG+ S D+D+ I +G + + +LL + +L
Sbjct: 333 LMGSTISGFGTDTS-------DMDMCIVDIDGPTYCDSRTEALNNLLRVKSFIESL-PTC 384
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+ L + A+VPIL+F + +NI D+SI+N G + L +Q+D R R +VL++
Sbjct: 385 SFEHLDLI-RAKVPILRFRHVEENIDIDLSINNCVGIRNTHLLNCYAQLDERVRPLVLVI 443
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRAN 221
K WA+ H++N+P T +SYSL L+VL Q V PA++P L I+P K N
Sbjct: 444 KLWAQHHNLNDPIHSTMSSYSLVLMVLNFLQCGVTPAVIPCLHRIFPEKFCK--KNFTNN 501
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
+IA +R N +L L + F + ++
Sbjct: 502 LLERIA-------------PHRSDNSDTLGQLLLKFFKYYA 529
>gi|121713318|ref|XP_001274270.1| PAP/25A associated domain family [Aspergillus clavatus NRRL 1]
gi|119402423|gb|EAW12844.1| PAP/25A associated domain family [Aspergillus clavatus NRRL 1]
Length = 1084
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 134/299 (44%), Gaps = 30/299 (10%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
DI L P E + R +++ L ++ R V FGS + L S D+DI
Sbjct: 123 DIYDRLLPSAESDDRRRQLVRKLEKLFNDQWPGRDIKVHVFGSSGNKLCSSDSDVDI--- 179
Query: 73 LSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
CI++ K+++ LL ++L K G R+ V+HA+VPI+K ++CD+
Sbjct: 180 -----CITTTYKELEHVCLLAEVL----AKHGMERVVCVSHAKVPIVKIWDPELRLACDM 230
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLF 190
+++N ++ + +ID R R + +++K W K +N+ GT +SY+ L++
Sbjct: 231 NVNNTLALENTRMVRTYVEIDDRVRPLAMIIKYWTKRRILNDAGLGGTLSSYTWICLIIN 290
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKINRS 248
QT P +LP L+ R + +R A+ +C+F+ S + + N+
Sbjct: 291 FLQTRDPPVLPSLQ-------------ARPHKKRTTADGLVCSFDDDLGSLTGFGRKNKQ 337
Query: 249 SLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
+L L F ++ G L + I G+ L N+ L +E+PF N
Sbjct: 338 TLGELLFHFF-RYYGHELDFEKYVISVREGKLISKEEKGWHLLQNNRLCVEEPFNTSRN 395
>gi|58266784|ref|XP_570548.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134110354|ref|XP_776004.1| hypothetical protein CNBD0540 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50258672|gb|EAL21357.1| hypothetical protein CNBD0540 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|56566306|gb|AAN75730.2| CID1 [Cryptococcus neoformans var. neoformans]
gi|57226781|gb|AAW43241.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 708
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I K+ + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKIDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|56566282|gb|AAN75620.2| CID1 [Cryptococcus neoformans var. neoformans]
Length = 708
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ + +V + ++++ +E A + FGS ++ R D+D+ +
Sbjct: 25 LLPPSEELSVKEEVRCLIEKLIKGLEP--SARLLSFGSSCNSFGLRNSDMDLVV------ 76
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
I K+ + + AL ++ ++ + AR+PILK E + I+CDI
Sbjct: 77 LIDDPNAKIDPGNFVESMAALLERETNFNVKPLPRARIPILKLELAPSPALPFGIACDIG 136
Query: 133 IDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-F 190
I+N ++ L + ID R R +VL +K W+K IN+P GT +SY +L+VL F
Sbjct: 137 IENRLAIENTRLLLTYATIDPARVRTLVLFLKVWSKRRRINSPYRGTLSSYGYTLMVLYF 196
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA----EICAFNIARFSSDKYRKIN 246
P +LP L+ I P +R E ++ + F+ ++ +N
Sbjct: 197 LVHVKQPPVLPNLQRIMP---------MRPLEEEEVMLEGRNVYFFDDVETLRREWSSVN 247
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASEL--------GICPFTGQWEHIRSNTRWL 290
S+ L V F FS LSL+A +L G E R R
Sbjct: 248 FESVGELLVDFFRYFSHDFQFNNSVLSLRAGQLTKESKGWVNDIDVGGLNEMARDRNR-- 305
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLAR 347
L IEDPFE N AR V++ L I F LT + L+ L R
Sbjct: 306 -----LCIEDPFEITYNVARTVTKDGLYTIRGEFMRATRILTQRPERAVLALAELCR 357
>gi|402550493|pdb|4FHX|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - H336n Mutant Bound
To Mgatp
Length = 349
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 127/299 (42%), Gaps = 49/299 (16%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A + FGS S L + D+D+ + + + + + + L+ +
Sbjct: 55 AELVAFGSLESGLALKNSDMDLCVLMDSRVQSDTIALQFYEELIAEGFEG---------- 104
Query: 108 QFVAHARVPILKFETIHQN-----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+F+ AR+PI+K + +N CDI +N + L +++D R + MVLLV
Sbjct: 105 KFLQRARIPIIKLTSDTKNGFGASFQCDIGFNNRLAIHNTLLLSSYTKLDARLKPMVLLV 164
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WAK IN+P GT +SY L+VL++ + I PP ++P L+ LK
Sbjct: 165 KHWAKRKQINSPYFGTLSSYGYVLMVLYYL---IHVIKPP---VFPNLLLSPLK------ 212
Query: 223 ERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASELGIC---- 274
+ +I + F++ DK I N SSL L F +F + E +
Sbjct: 213 QEKIVD--GFDVG--FDDKLEDIPPSQNYSSLGSLLHGFF-RFYAYKFEPREKVVTFRRP 267
Query: 275 ---------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+T EH S + + + + L IEDPFE N R VS L +I F
Sbjct: 268 DGYLTKQEKGWTSATEHTGSADQIIKDRYILAIEDPFEISNNVGRTVSSSGLYRIRGEF 326
>gi|354493358|ref|XP_003508809.1| PREDICTED: poly(A) RNA polymerase GLD2 [Cricetulus griseus]
Length = 480
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 118/245 (48%), Gaps = 36/245 (14%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNTVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H+IN+ GT +SYSL L+VL + QT ILP L+ IYP +
Sbjct: 303 LVLVIKKWASHHEINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ES 355
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + + N+ + S N SSL L + FL+ ++ +S++ +
Sbjct: 356 FSTSVQLHLVHHAPCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNTQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTH 328
+ P +W N + +E+PF+ N+ARAV EK I + F +
Sbjct: 411 KAIPRPDDIEWR-----------NKYICVEEPFDG-TNTARAVHEKQKFDMIKDQFLKSW 458
Query: 329 FRLTS 333
RL S
Sbjct: 459 HRLKS 463
>gi|402550488|pdb|4FH3|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity
gi|402550489|pdb|4FH5|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mgutp Bound
gi|402550490|pdb|4FHP|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Cautp Bound
gi|402550491|pdb|4FHV|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mgctp Bound
gi|402550492|pdb|4FHW|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mggtp Bound
gi|402550494|pdb|4FHY|A Chain A, Crystal Structures Of The Cid1 Poly (U) Polymerase Reveal
The Mechanism For Utp Selectivity - Mg 3'-Datp Bound
Length = 349
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 127/299 (42%), Gaps = 49/299 (16%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A + FGS S L + D+D+ + + + + + + L+ +
Sbjct: 55 AELVAFGSLESGLALKNSDMDLCVLMDSRVQSDTIALQFYEELIAEGFEG---------- 104
Query: 108 QFVAHARVPILKFETIHQN-----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+F+ AR+PI+K + +N CDI +N + L +++D R + MVLLV
Sbjct: 105 KFLQRARIPIIKLTSDTKNGFGASFQCDIGFNNRLAIHNTLLLSSYTKLDARLKPMVLLV 164
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WAK IN+P GT +SY L+VL++ + I PP ++P L+ LK
Sbjct: 165 KHWAKRKQINSPYFGTLSSYGYVLMVLYYL---IHVIKPP---VFPNLLLSPLK------ 212
Query: 223 ERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASELGIC---- 274
+ +I + F++ DK I N SSL L F +F + E +
Sbjct: 213 QEKIVD--GFDVG--FDDKLEDIPPSQNYSSLGSLLHGFF-RFYAYKFEPREKVVTFRRP 267
Query: 275 ---------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+T EH S + + + + L IEDPFE N R VS L +I F
Sbjct: 268 DGYLTKQEKGWTSATEHTGSADQIIKDRYILAIEDPFEISHNVGRTVSSSGLYRIRGEF 326
>gi|133919900|emb|CAL91353.1| cytoplasmic poly(A) polymerase [Xenopus laevis]
Length = 466
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 134/301 (44%), Gaps = 43/301 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH- 112
GS ++ R D D+ C+ + + Q+ + +L K Y RL ++
Sbjct: 185 GSSLNGFGIRSSDADL--------CLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERP 236
Query: 113 ----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF D++++N+ G I++ FL + +D R R +VL++K+WA
Sbjct: 237 QFIRAKVPIVKFRDKVSGAEFDLNVNNVVG-IRNTFLLRTYAYLDKRVRPLVLVIKKWAN 295
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H IN+ GT +SY++ L+VL + QT ILP L+ YP + + +
Sbjct: 296 HHGINDASRGTLSSYTIVLMVLHYLQTLPEPILPSLQKKYP-------ECFDRTMQLHLV 348
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF------SGLSLKASELGICPFTGQWE 281
NI +F S N + L L + FL+ F S + E P T +E
Sbjct: 349 HQAPRNIPQFLSK-----NETPLGDLLLGFLKYFAVEFDWSKDIISLREAKALPRTDDYE 403
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL 341
W N + +E+PF+ N+ARAV EK + A + + N+ Y+L
Sbjct: 404 -------W--RNKYICVEEPFDG-SNTARAVYEKQKFDLIRAEFLKAWVALRDNRDLYSL 453
Query: 342 L 342
L
Sbjct: 454 L 454
>gi|66826987|ref|XP_646848.1| DNA2/NAM7 helicase family protein [Dictyostelium discoideum AX4]
gi|60474984|gb|EAL72920.1| DNA2/NAM7 helicase family protein [Dictyostelium discoideum AX4]
Length = 2523
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 138/309 (44%), Gaps = 27/309 (8%)
Query: 43 ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG 102
E A++ +GSF+S L DLDI+ S+ +K + L + + L +
Sbjct: 2210 EGFATASINLYGSFLSGLSLNDSDLDINF---------SSTQKEDTTHLKQVYKYLNRSQ 2260
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
Y+ ++ A+VPI++F+ I + D+ +++ S L ID R RD+VLLV
Sbjct: 2261 LYKLIEKRTDAKVPIIRFKEISSGVHFDMCFNSMMSYHNSLLLGEYCSIDNRSRDLVLLV 2320
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WA + D+NN TF+S+ L +V+ Q+ P ILP L+ L++ R
Sbjct: 2321 KWWAVSKDLNNAAEKTFSSFCLVNMVIHFLQSLNPPILPNLQTT-SNQLLEKYSTNRNLI 2379
Query: 223 ERQIAEICAFNIARF----SSDKYR-KINRSSLAHLFVSFLEKFSGLSLKAS-------- 269
+ + I + ++ S +K+ K N+ ++A LF F +S + K +
Sbjct: 2380 KLKSQTIVENYLVKYYDWSSFNKFEPKRNKLTIAQLFYQFFYYYSTFNYKENIISISHSS 2439
Query: 270 -ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTH 328
+C + RSN R + + DPF N A ++ +K ++ F M
Sbjct: 2440 GGGSLCENGALLK--RSNIRSRAVRDHIIVLDPFINDRNLASSI-KKTYQRVLMEFIMME 2496
Query: 329 FRLTSTNQT 337
+ L ST T
Sbjct: 2497 YSLRSTKST 2505
>gi|422296106|gb|EKU23405.1| nucleotidyltransferase-like protein [Nannochloropsis gaditana
CCMP526]
Length = 432
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 60/196 (30%), Positives = 91/196 (46%), Gaps = 15/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP-------FGSFVSNLFSR 63
++ +L L P + ++TR V R VE++ RG T P FGS +N +
Sbjct: 1 MESLLPALLPSQACFQTRENV----RARVEALLPSRGPTTFPDGTRLRVFGSSANNFGND 56
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
DLD+ + + S + + L LL A G + AR+PI+ F
Sbjct: 57 AADLDMCVTFPDSSPLPAGSSGEMIEALASLLEA----NGMEDVVARPTARIPIVLFREP 112
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
+ CDIS++N ++ L SQ+D R R + +VK WA+A INN GT +SY+
Sbjct: 113 GTGLDCDISVENPLALRNTQLLHEYSQVDPRVRALAYIVKHWARARKINNASGGTLSSYA 172
Query: 184 LSLLVLFHFQTCVPAI 199
L+VL QT A+
Sbjct: 173 YILMVLHFLQTAAAAV 188
>gi|147900520|ref|NP_001087078.1| PAP associated domain containing 4 a [Xenopus laevis]
gi|51234260|gb|AAT98005.1| cytoplasmic poly(A) polymerase [Xenopus laevis]
Length = 509
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 134/301 (44%), Gaps = 43/301 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH- 112
GS ++ R D D+ C+ + + Q+ + +L K Y RL ++
Sbjct: 228 GSSLNGFGIRSSDADL--------CLVLKEEPMNQNTEARHILSLLHKHFYTRLSYIERP 279
Query: 113 ----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF D++++N+ G I++ FL + +D R R +VL++K+WA
Sbjct: 280 QFIRAKVPIVKFRDKVSGAEFDLNVNNVVG-IRNTFLLRTYAYLDKRVRPLVLVIKKWAN 338
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H IN+ GT +SY++ L+VL + QT ILP L+ YP + + +
Sbjct: 339 HHGINDASRGTLSSYTIVLMVLHYLQTLPEPILPSLQRKYP-------ECFDRTMQLHLV 391
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF------SGLSLKASELGICPFTGQWE 281
NI +F S N + L L + FL+ F S + E P T +E
Sbjct: 392 HQAPRNIPQFLSK-----NETPLGDLLLGFLKYFAVEFDWSKDVISLREAKALPRTDDYE 446
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL 341
W N + +E+PF+ N+ARAV EK + A + + N+ Y+L
Sbjct: 447 -------W--RNKYICVEEPFDG-SNTARAVYEKQKFDLIRAEFLKAWVALRDNRDLYSL 496
Query: 342 L 342
L
Sbjct: 497 L 497
>gi|390136629|pdb|4EP7|A Chain A, Functional Implications From The Cid1 Poly(U) Polymerase
Crystal Structure
gi|390136630|pdb|4EP7|B Chain B, Functional Implications From The Cid1 Poly(U) Polymerase
Crystal Structure
Length = 340
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 127/299 (42%), Gaps = 49/299 (16%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A + FGS S L + D+D+ + + + + + + L+ +
Sbjct: 46 AELVAFGSLESGLALKNSDMDLCVLMDSRVQSDTIALQFYEELIAEGFEG---------- 95
Query: 108 QFVAHARVPILKFETIHQN-----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+F+ AR+PI+K + +N CDI +N + L +++D R + MVLLV
Sbjct: 96 KFLQRARIPIIKLTSDTKNGFGASFQCDIGFNNRLAIHNTLLLSSYTKLDARLKPMVLLV 155
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WAK IN+P GT +SY L+VL++ + I PP ++P L+ LK
Sbjct: 156 KHWAKRKQINSPYFGTLSSYGYVLMVLYYL---IHVIKPP---VFPNLLLSPLK------ 203
Query: 223 ERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASELGIC---- 274
+ +I + F++ DK I N SSL L F +F + E +
Sbjct: 204 QEKIVD--GFDVG--FDDKLEDIPPSQNYSSLGSLLHGFF-RFYAYKFEPREKVVTFRRP 258
Query: 275 ---------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+T EH S + + + + L IEDPFE N R VS L +I F
Sbjct: 259 DGYLTKQEKGWTSATEHTGSADQIIKDRYILAIEDPFEISHNVGRTVSSSGLYRIRGEF 317
>gi|363729640|ref|XP_418580.3| PREDICTED: poly(A) RNA polymerase, mitochondrial [Gallus gallus]
Length = 568
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 77/299 (25%), Positives = 133/299 (44%), Gaps = 53/299 (17%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKKVKQ------------------ 88
+TV+PFGS V N F + G D+D+ ++ + I K+K+
Sbjct: 219 STVKPFGSSV-NTFGKLGCDVDMFLDFHD---IQKHATKMKKGPFEMEYQMKRLPSERLA 274
Query: 89 -----SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
S++GD L GY +Q + +AR P++KF CD+S+ N S+
Sbjct: 275 TQKILSIIGDCLDNFGP--GYSSVQKILNARCPLVKFSHQPTGFQCDLSVSNSIAIRCSE 332
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPP 202
L+ +D R R +V ++ WA+ H + N GT+ ++SL+++++F Q P I+P
Sbjct: 333 LLYIYGCLDPRVRALVFSLRCWARVHGLTNSVPGTWITNFSLTMMIMFFLQKRSPPIIPT 392
Query: 203 LKDIYPGNLVDDLKGVRANAERQI--AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEK 260
L D LK + ++ + C+F ++ S K K N +L L F +
Sbjct: 393 L---------DQLKELADEKDKHVIGGYDCSF-VSDLSKIKPTK-NTETLDELLCDFFQY 441
Query: 261 FSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
F + + L + + P + PL+I +PFEQ N ++ V++ L K
Sbjct: 442 FGNFDFRKNSLNL---------RKGKEVNKPESSPLYIWNPFEQDLNISKNVNQPQLEK 491
>gi|374724400|gb|EHR76480.1| terminal uridylyltransferase [uncultured marine group II
euryarchaeote]
Length = 730
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 126/301 (41%), Gaps = 48/301 (15%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSN---GSCISSAGKKVKQSLLGDLLRALRQK 101
+ ++ FGS S L + GDLD+ ++ + + +KQ + D++
Sbjct: 81 FKNVELQQFGSSQSGLTLQAGDLDLCLQFKGDIPAKALRQVNRLLKQHDMEDIV------ 134
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
+ A+VPI+KF+ I DISI+N ++ L S D R R ++L
Sbjct: 135 -------MLPRAKVPIIKFKDERTKIPVDISINNTLALHNTELLKRYSSCDERIRSVILA 187
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
VK WA D+ + TGTF+SY+ +LL + Q P + P L++ ++V+
Sbjct: 188 VKHWANRRDVCDASTGTFSSYAWTLLAVQALQQATPPVAPVLQEGQERSMVE-------- 239
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTG--- 278
E ++ + S D N SL LFV F+ +++ +S + + TG
Sbjct: 240 VEGTTYDLTMREVDSISMDPK---NNQSLGELFVEFIFQYA-VSWPFKDHVVSVRTGSPV 295
Query: 279 -----QWEH----------IRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
+W+H + TR H L IEDPF+ + +R + I
Sbjct: 296 TRKSKKWKHATPKAEKAVLMEKKTRL--GQHSLPIEDPFDLKHDLSRVLRPAGALDIQEE 353
Query: 324 F 324
F
Sbjct: 354 F 354
>gi|355566405|gb|EHH22784.1| hypothetical protein EGK_06113, partial [Macaca mulatta]
Length = 642
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 119/272 (43%), Gaps = 38/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 90 GDLGKALELAEAPKREKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 147
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL +S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 148 SGLHGDISLSNRLALHNSRFLSLVSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 206
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSD 240
+LLV++ QT P +LP + + + E + E+ C+F R +S
Sbjct: 207 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCSF--PRDASR 253
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWL 290
R N L+ L F S L+ S L + P TG WE +R
Sbjct: 254 LERSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVTGGLPSNLWEGLRLG---- 309
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
P+ ++DPF+ N A V+ + ++ N
Sbjct: 310 ----PMNLQDPFDLSHNVAANVTSRVAGRLQN 337
>gi|431910377|gb|ELK13450.1| U6 snRNA-specific terminal uridylyltransferase 1 [Pteropus alecto]
Length = 871
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 123/275 (44%), Gaps = 44/275 (16%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQS----LLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
GDL ++EL+ + G+K +Q L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKALELAE----ALKGEKTEQGAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKF 371
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ D+S+ N S+FL S++DGR R +V V+ WA+ ++ N
Sbjct: 372 CHRPSGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTVRCWAQGRGLSG-SGPLLN 430
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
+Y+L+LLV++ QT P +LP + + + E + E+ ++ + F D
Sbjct: 431 NYALTLLVIYFLQTREPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRD 478
Query: 241 KYR---KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNT 287
R N+ SL+ L F S L+ S L + P G WE +R
Sbjct: 479 ASRLEPSTNKESLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG- 537
Query: 288 RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
P+ ++DPF+ N A V+ + ++ N
Sbjct: 538 -------PMNLQDPFDLSHNVAANVTSRVAGRLQN 565
>gi|301603642|ref|XP_002931502.1| PREDICTED: terminal uridylyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 1562
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E + + FGS + R
Sbjct: 863 ILDRVCKRCYDELSPNSSEQQNREQILAYLERFIRK-EFNNHSRLCLFGSSKNGFGFRDS 921
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G + + + A+VPI+KFE
Sbjct: 922 DLDICMTLEGHE---NAKKLNCKEIIDGLAKVLKKHPGLKNILAITTAKVPIVKFEHKES 978
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + VK +AK DI + G+ +SY+
Sbjct: 979 GVEGDISLYNTLAQHNTRMLATYAAIDPRVKYLGYTVKFFAKRCDIGDASRGSLSSYAYI 1038
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++IY G + +R + AF + R
Sbjct: 1039 LMVLYFLQQRNPPVIPVLQEIYDG---------QETPQRMVDGWNAFFFDNTEELRNRFP 1089
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
NR S+ L++ FL ++ +S++ +L + F QW +
Sbjct: 1090 SLGKNRESVGELWLGFLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1137
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1138 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1168
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 96/204 (47%), Gaps = 18/204 (8%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
+ED+ R +++ + ++++ L ++ +GS ++ D++I ++
Sbjct: 304 QEDFLLRQEIVKSMEKIIQM--KLPECSLRMYGSSLTRFAFTSSDINIDVKFP------- 354
Query: 82 AGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+K+ Q +L +L L+ Y ++ HA+VP++ + ++C +S N +
Sbjct: 355 --RKMNQPDVLIQVLDILKNCACYTEVESDFHAKVPVVFCKDKKSGLTCKVSAGNDTACL 412
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
++FL I +++ +V+ + WA+ I+ G SY +L+V+F Q P IL
Sbjct: 413 TTEFLEAIGKLEPVLIPLVMAFRYWARLCHIDCQAEGGIPSYCFALMVVFFLQKRQPPIL 472
Query: 201 P----PLKDIYPGNLVDD--LKGV 218
P P D + +DD LKGV
Sbjct: 473 PAYLGPWIDGFESKKLDDYHLKGV 496
>gi|195043314|ref|XP_001991594.1| GH12744 [Drosophila grimshawi]
gi|193901352|gb|EDW00219.1| GH12744 [Drosophila grimshawi]
Length = 1336
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 112/243 (46%), Gaps = 49/243 (20%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPIL+F+ I D++ +N G + L + +Q+D R R +V++VK WA+ HDIN
Sbjct: 1057 ARVPILRFKDRLNGIEVDLNYNNCVGIKNTYLLQFYAQLDWRTRPLVVIVKLWAQYHDIN 1116
Query: 173 NPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYPG--NLVDDLKGVRANAERQIAEI 229
+ K T +SYSL L+VL + Q C+P +LP L+ +YP NL G + + + E
Sbjct: 1117 DAKRMTVSSYSLVLMVLHYLQYACMPRVLPCLQALYPDKFNL-----GQQDCLDLDLIEP 1171
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASEL-----GICPFTG------ 278
+ Y N+ +L + F + +S + + G+ P T
Sbjct: 1172 I---------EPYHTQNKQTLGEHLLGFFKYYSTFDFENYAISIRTGGVLPVTACRLAKS 1222
Query: 279 ------QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRL 331
QW+ + IE+PF+ N+AR+V + ++ F +T RL
Sbjct: 1223 PKNDIHQWKELN-------------IEEPFDL-SNTARSVYDSATFERVKTTFIVTALRL 1268
Query: 332 TST 334
T
Sbjct: 1269 EHT 1271
>gi|427789257|gb|JAA60080.1| Putative polya rna polymerase mitochondrial-like protein
[Rhipicephalus pulchellus]
Length = 536
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 139/315 (44%), Gaps = 56/315 (17%)
Query: 24 DWETRMKVISDLREVVESVESLR-GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
D ETR+ + R+V E + L V PFGS V N F R + DI + S+
Sbjct: 183 DLETRLGFVV-CRQVEEFISGLYPKGQVLPFGSLV-NGFGRH-NCDIDMVYCVPEATESS 239
Query: 83 GKKVKQS----------------LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
G+ Q LGDLL + G +Q + ARVPI+KF+
Sbjct: 240 GQLYFQDKNQAMNDRTLVQRVLETLGDLLHYV--VPGVSEVQRILRARVPIVKFQHNVVG 297
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLS 185
CD++++N+ G S+ L +Q+ ++ V+ WA A + GT+ ++ L+
Sbjct: 298 RECDLTLNNMSGVHMSRLLHSCTQLAPALCPLLFTVRSWAMAQGVTTKVPGTWITNFQLT 357
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI 245
LL +FH Q C +LP L+D+ +D K ++ + +R K +
Sbjct: 358 LLAIFHLQQC--GLLPSLRDL------EDKKRLK-----------TWQKSRLPYGKAEE- 397
Query: 246 NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
L L SF E ++ + K+ GI PF+GQ T P +F+++P ++
Sbjct: 398 ----LEDLLRSFFEYYASFNFKSK--GIAPFSGQ-------TLEKPEYTAMFVQNPLDRQ 444
Query: 306 ENSARAVSEKNLAKI 320
N++R + +L K+
Sbjct: 445 LNASRNIGLSDLKKL 459
>gi|328871484|gb|EGG19854.1| hypothetical protein DFA_06957 [Dictyostelium fasciculatum]
Length = 1635
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 134/294 (45%), Gaps = 34/294 (11%)
Query: 51 EPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL--------RQKG 102
E +GSFV+ + D+D+ + I+++ +++ L+ ++ L + +G
Sbjct: 1327 EAYGSFVNGIQLESSDIDVCFKTD----INTSDPVLRKDLMKSIVTRLYNRKSKRSKLRG 1382
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
Y+ + + +VPI+KF + N+S D+ +N S + ++ID R + ++LLV
Sbjct: 1383 PYQVERVLDSIKVPIIKFRDLRYNVSYDMCFNNRLAIGNSLLVKSYAEIDERAKQLMLLV 1442
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WA DIN+ GT +SY+ +V+F+ QT P +LP L + +
Sbjct: 1443 KYWASRKDINDASGGTLSSYAWLNMVIFYLQTVQPPVLPSLH-----------ANISSKP 1491
Query: 223 ERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ--W 280
Q+ + + + + N+ +L LF F + ++L IC G+
Sbjct: 1492 TNQLVQKDDWKFVDHRHTGFVRQNKKTLFQLFYGFFNFYCKFDF-TNQL-ICIRLGKPTS 1549
Query: 281 EHIRSNTRWLPNNHPLF-IEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTS 333
+ S + NN L IEDPF+ N +V K+S++F++ F TS
Sbjct: 1550 YSLASKSYKDNNNQSLIRIEDPFDTSANPGASV------KLSSSFKIIIFEFTS 1597
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 134/313 (42%), Gaps = 43/313 (13%)
Query: 51 EPFGSFVSNL-FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQ- 108
EP+GSFV+ + D+D+ + S + S ++K L+ ++R L+++ G RR
Sbjct: 540 EPYGSFVNGIQLESSDDIDVCFKTSFDT---SNAIRLK-ILMKSVVRCLKKRKGGRRGNK 595
Query: 109 ----------FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
F + V I++F +S ++S +N S + ++ID R + +
Sbjct: 596 LKGPYSVERIFDSIKEVGIIRFRDYKHRVSFNMSFNNRLAIGNSLLVKSYAEIDERAKQL 655
Query: 159 VLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGV 218
+LLVK WA DIN+ GT +SY+ +V+F+ QT P +LP L N + + V
Sbjct: 656 MLLVKYWASRKDINDASGGTLSSYAWLNMVIFYLQTVQPPVLPSLHSNVSSNCPTN-QPV 714
Query: 219 RANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTG 278
+ + + F R + + N +L LF F + + FT
Sbjct: 715 QKDDWSIKEDEWKFVDHRHTG--FVSQNNKTLFQLFYGFFDFYCKFD----------FTN 762
Query: 279 QWEHIR---SNTRWL---PNNHP-LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRL 331
Q IR S T + NNH + IEDPF+ N +V S +F + F
Sbjct: 763 QLICIRLGKSTTNKIGMDQNNHSQICIEDPFDTSSNLGASVK-------STSFNIIIFEF 815
Query: 332 TSTNQTRYALLSS 344
S L+S+
Sbjct: 816 MSMQSKLLELVSN 828
>gi|190194365|ref|NP_060579.3| poly(A) RNA polymerase, mitochondrial precursor [Homo sapiens]
gi|74753002|sp|Q9NVV4.1|PAPD1_HUMAN RecName: Full=Poly(A) RNA polymerase, mitochondrial; Short=PAP;
AltName: Full=PAP-associated domain-containing protein
1; AltName: Full=Polynucleotide adenylyltransferase;
AltName: Full=Terminal uridylyltransferase 1;
Short=TUTase 1; AltName: Full=mtPAP; Flags: Precursor
gi|7022551|dbj|BAA91641.1| unnamed protein product [Homo sapiens]
gi|34596242|gb|AAQ76801.1| hypothetical protein [Homo sapiens]
gi|63108298|dbj|BAD98252.1| mitochondrial polyA polymerase [Homo sapiens]
gi|119606420|gb|EAW86014.1| PAP associated domain containing 1, isoform CRA_a [Homo sapiens]
gi|119606421|gb|EAW86015.1| PAP associated domain containing 1, isoform CRA_a [Homo sapiens]
Length = 582
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 131/295 (44%), Gaps = 47/295 (15%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQSLL 91
V PFGS V N F + G DLD+ ++L +S+ + + Q +L
Sbjct: 227 VRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSAHKISGNFLMEFQVKNVPSERIATQKIL 285
Query: 92 GDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 SVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGA 345
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 LDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL------ 399
Query: 210 NLVDDLKGVRANAERQIAEI--CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
D LK + ++ + E C F +++R + N +L L F E F +
Sbjct: 400 ---DSLKTLADAEDKCVIEGNNCTFVRDLSRIKPSQ----NTETLELLLKEFFEYFGNFA 452
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 453 FDKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|52545561|emb|CAH56395.1| hypothetical protein [Homo sapiens]
Length = 712
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 126/291 (43%), Gaps = 39/291 (13%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQSLL 91
V PFGS V N F + G DLD+ ++L +S+ + + Q +L
Sbjct: 357 VRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSAHKISGNFLMEFQVKNVPSERIATQKIL 415
Query: 92 GDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 416 SVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGA 475
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 476 LDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL------ 529
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS 269
D LK + ++ + E R S N +L L F E F + +
Sbjct: 530 ---DSLKTLADAEDKCVIEGNNCTFVRDLSRIKPSQNTETLELLLKEFFEYFGNFAFDKN 586
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 587 SINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 628
>gi|395827437|ref|XP_003786909.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase,
mitochondrial [Otolemur garnettii]
Length = 639
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 125/302 (41%), Gaps = 57/302 (18%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG---- 102
TV PFGS V N F + G DLD+ ++L GK LG+ L + K
Sbjct: 228 CTVRPFGSSV-NTFGKLGCDLDMFLDLGE------TGKPSTDKTLGNFLMEFQMKSVPSE 280
Query: 103 --------------------GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
G +Q + +AR P+++F CD++ +N S
Sbjct: 281 RIATQKILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQPSGFQCDLTTNNRIALKSS 340
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILP 201
+ L+ +D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP
Sbjct: 341 ELLYIYGSLDSRVRALVFSIRSWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILP 400
Query: 202 PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFL 258
L DL A+AE + + N F D R N +L L F
Sbjct: 401 TL----------DLLKTLADAEDKC--MIEGNNCTFVRDLNRIHPSGNTETLELLLKEFF 448
Query: 259 EKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLA 318
E F + L I + + P++ PL+I++PFE N ++ VS+ L
Sbjct: 449 EYFGNFAFNKYSLNI---------RQGKEQNKPDSSPLYIQNPFETSLNISKNVSQSQLQ 499
Query: 319 KI 320
K
Sbjct: 500 KF 501
>gi|10433530|dbj|BAB13981.1| unnamed protein product [Homo sapiens]
Length = 582
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 131/295 (44%), Gaps = 47/295 (15%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQSLL 91
V PFGS V N F + G DLD+ ++L +S+ + + Q +L
Sbjct: 227 VRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSAHKISGNFLMEFQVKNVPSERIATQKIL 285
Query: 92 GDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 SVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGA 345
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 LDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL------ 399
Query: 210 NLVDDLKGVRANAERQIAEI--CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
D LK + ++ + E C F +++R + N +L L F E F +
Sbjct: 400 ---DSLKTLADAEDKCVIEGNNCTFVRDLSRIKPSQ----NTETLELLLKEFFEYFGNFA 452
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 453 FDKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|332029843|gb|EGI69712.1| Poly(A) RNA polymerase gld-2-like protein A [Acromyrmex echinatior]
Length = 654
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 145/324 (44%), Gaps = 77/324 (23%)
Query: 47 GATVEPFGSFVSN----LFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG 102
G+T+ FGS S+ L + +LD+ CI+ L ++L+ L+Q
Sbjct: 355 GSTMNGFGSNDSDVDVCLLMKHTELDVR-------CIAIEH-------LLEVLKHLKQSN 400
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+L+ + HA+VPI+ F + + D++ +N G + L+ S++D R + + L+V
Sbjct: 401 FVEQLEII-HAKVPIITFFDVVRKFKIDMNFNNSVGVKNTHLLYCYSKLDWRVKPLALVV 459
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDIYPGNLVDDLKGVRA 220
K WA+ H+INNPK T +SYSL L+V+ HF C P +LP L ++ A
Sbjct: 460 KLWAQWHNINNPKCRTLSSYSLVLMVI-HFLQCGTNPPVLPCLHSMF------------A 506
Query: 221 NAERQIAEICAFNIAR---FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---- 273
N + A+I NI S + N SL L F + + + +
Sbjct: 507 NKFKSDADIYNINIHEDLNIPSSNHLPENHQSLGELLFEFFKYYVEFDFSQYAISVRLAS 566
Query: 274 ------C---------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL- 317
C P+ QW++ L IE+PF+ N+AR+V + ++
Sbjct: 567 KIPKEECRMVQSSKNDPY--QWKY-------------LCIEEPFDL-TNTARSVYDPDMF 610
Query: 318 AKISNAFEMTHFRLTSTNQTRYAL 341
+KI +T+ RL + RY+L
Sbjct: 611 SKIIFILNITYTRL----KRRYSL 630
>gi|321465404|gb|EFX76405.1| hypothetical protein DAPPUDRAFT_306159 [Daphnia pulex]
Length = 258
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 77/262 (29%), Positives = 126/262 (48%), Gaps = 40/262 (15%)
Query: 78 CISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
C+ +G V Q L + +AL+Q +L V A+VPIL+F N+ D++
Sbjct: 4 CLVISGHDVDQRFHALEYLYRVQKALKQCRFLTKLD-VIRAKVPILRFYDSITNLEVDLN 62
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
+N+ G + L +Q+D R R +VL VK WA+ HDIN K+ T +SYSL+L+V+++
Sbjct: 63 FNNIVGIRNTHLLKTYAQLDWRVRPLVLAVKLWARQHDINEAKSMTMSSYSLTLMVIYYL 122
Query: 193 QTCVPA-ILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI---NRS 248
QT V +LP L+ + AER E + F+ ++ + + N
Sbjct: 123 QTGVHVPVLPCLQ--------------KVRAERFWPEGDIRRLQTFTDEELKVLRSNNHM 168
Query: 249 SLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP-----LFIEDPFE 303
+L LF FL+ ++ K G P +++ P N P + IE+PF+
Sbjct: 169 TLGQLFAGFLDYYAH-HFKLG--GTIPLEQCRQYMS------PKNDPHHWKYICIEEPFD 219
Query: 304 QPENSARAVSE-KNLAKISNAF 324
+ N+AR+V + KI + F
Sbjct: 220 R-TNTARSVYDPAAFQKIVDVF 240
>gi|427781625|gb|JAA56264.1| Putative terminal uridylyltransferase 4 [Rhipicephalus pulchellus]
Length = 1570
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 98/204 (48%), Gaps = 7/204 (3%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L + +D++ + P E+ R K++ +L + + + A + +GS +
Sbjct: 841 ILNDVCQDVMKICTPDPEEEACREKLLRELESYIR--KKYKDAKLTLYGSSCNGFGLIRS 898
Query: 66 DLDISIELSNGSCISSAGKK-VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + + S GK + + DL + L +++ + A+VPI+KF
Sbjct: 899 DLDICLTFDH----SKDGKDFCHKEKIMDLAKELNDHKNLKKITPITSAKVPIVKFYHKP 954
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS N Q ++ L SQID R R + K +AK I + G+ +SY+
Sbjct: 955 TQLEGDISFYNTLAQHNTRLLKVYSQIDERVRVLGYTFKHFAKTCAIGDASRGSLSSYAY 1014
Query: 185 SLLVLFHFQTCVPAILPPLKDIYP 208
LL L++ Q C P ++P L+++YP
Sbjct: 1015 ILLTLYYLQQCKPPVIPVLQELYP 1038
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 132/303 (43%), Gaps = 47/303 (15%)
Query: 23 EDWETRMKVISDLREVVESVE-----SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
E+ E R +V+SDL +++ SL G++ FG SN ++I+L+
Sbjct: 359 EEVELRKRVVSDLETFIKATLPDVKLSLHGSSGNGFGLKTSN---------VNIDLT--- 406
Query: 78 CISSAGKKVKQSLL---GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISID 134
GK L GDLL+ + Y ++ ++VP ++F+ + +SC+IS++
Sbjct: 407 ---PLGKADCAQLFVGTGDLLQECPK---YAQVTKDYLSKVPRIRFKEVDSKLSCEISLN 460
Query: 135 NLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT 194
N Q SK L + +D R + + + + WAK ++ GT ++ +++ +F Q
Sbjct: 461 NSNSQKTSKLLDDYASLDRRVKILGVAFRLWAKHCGLDQQDRGTLPPHAFAIMTVFFLQQ 520
Query: 195 CVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLF 254
C PA+LP L ++ G + + R I + N S+ L+
Sbjct: 521 CKPAVLPVLHEMKDGKESESYLKPKDLEGRWICK-----------------NDRSIGQLW 563
Query: 255 VSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
V L +F K ++ +C Q I +W N + IEDP+ N AR++
Sbjct: 564 VELL-RFYATEFKLNKRVVCIRRSQPMLI-VEKKW--NKRYIAIEDPYSCKRNLARSIPS 619
Query: 315 KNL 317
+ +
Sbjct: 620 ERM 622
>gi|297267656|ref|XP_001118438.2| PREDICTED: u6 snRNA-specific terminal uridylyltransferase 1-like
[Macaca mulatta]
Length = 710
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 119/273 (43%), Gaps = 38/273 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 158 GDLGKALELAEAPKREKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 215
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL +S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 216 SGLHGDISLSNRLALHNSRFLSLVSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 274
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSD 240
+LLV++ QT P +LP + + + E + E+ C+F R +S
Sbjct: 275 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCSF--PRDASR 321
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWL 290
R N L+ L F S L+ S L + P TG WE +R
Sbjct: 322 LERSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVTGGLPSNLWEGLRLG---- 377
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 378 ----PMNLQDPFDLSHNVAANVTSRVAGRLQNC 406
>gi|443712766|gb|ELU05930.1| hypothetical protein CAPTEDRAFT_221986, partial [Capitella teleta]
Length = 1259
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 102/204 (50%), Gaps = 7/204 (3%)
Query: 10 ILKDILGML----NPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L ++L M+ P + R V+ ++ E V E+ A + FGS V+ +
Sbjct: 720 MLDNMLWMVPNDFEPSERETFLRKTVLHEMEEYVR--ETFPDAQLSLFGSSVNGFGFKQS 777
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI ++ + S +++ L + L+ G+ + + A+VPI+KF
Sbjct: 778 DLDICLQFKSTPVKDSQSLNCV-AIIETLAQILKIHRGFYNVFAITTAKVPIVKFRHRRS 836
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L S+ID R R + +K +AK DI + G+ +SY+
Sbjct: 837 QLEGDISLYNTLAQHNTRLLQAYSEIDSRVRVLGYTMKVFAKCCDICDASRGSLSSYAFV 896
Query: 186 LLVLFHFQTCVPAILPPLKDIYPG 209
L+V+++ Q C P +LP L+++Y G
Sbjct: 897 LMVIYYLQQCDPPVLPVLQELYTG 920
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 113/285 (39%), Gaps = 46/285 (16%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS + + R D++I I + N I KK+ D LR G + ++
Sbjct: 292 YGSSYTGIALRTSDVNIDITMENSPKIL---KKIY-----DFLRT-DDSGCFCDVRSDFS 342
Query: 113 ARVPILKFET-IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
A+VP + F + + I+I+N SK L ID R + + WA I
Sbjct: 343 AKVPSVLFTMPAYGQTTFQIAINNSPSCKTSKLLQTFVSIDPRVGQLSKCFRWWAHLCSI 402
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA 231
++ G+ +Y+ +++ ++ Q C P +LP L ++ ++ +E + +
Sbjct: 403 DSQDNGSLPAYAFAMMTVYFLQQCQPPVLPVLHELV------NVDPKTPGSEHEYLD--- 453
Query: 232 FNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
SDK ++ N SSLA L++ L +F + IC + RS
Sbjct: 454 ------DSDKLKECWASKNTSSLAELWLQLL-RFYCVDFDMGTYVIC-VRQKRPLPRSEK 505
Query: 288 RWLPN------------NHPL---FIEDPFEQPENSARAVSEKNL 317
+W HPL F DPF + N AR +S +
Sbjct: 506 KWASKRIALEGAYTKLQQHPLSFIFSLDPFIRRRNIARTLSHSQI 550
>gi|307188110|gb|EFN72942.1| Poly(A) RNA polymerase gld-2-like protein A [Camponotus floridanus]
Length = 628
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 115/278 (41%), Gaps = 49/278 (17%)
Query: 79 ISSAGKKVKQSLLGDLLRALRQKGGYRRLQFV-----AHARVPILKFETIHQNISCDISI 133
I S K L+G ++ + F+ HA+VPI+KF QN+ D++
Sbjct: 347 IKSQYPKYGLFLIGSIMNGFGSDNSDVDMYFIDQLELIHAKVPIIKFRDTIQNLKVDLNC 406
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
+N G ++ L+ S++D R R +VL++K WA+ HDINN K T +SYSL L+V+ HF
Sbjct: 407 NNAVGIRNTQLLYCYSKLDWRVRPLVLVIKLWAQHHDINNAKDMTISSYSLVLMVI-HFL 465
Query: 194 TC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLA 251
C P +LP L I+ + I I + S N SL
Sbjct: 466 QCGVNPPVLPCLHSIFEDKFSPHI---------DIHSIDIHEDLKIPSSTRLPENNQSLG 516
Query: 252 HLFVSFLEKFSGLSLKASELGI----------CPFTG-------QWEHIRSNTRWLPNNH 294
L V F + + + + C QW++
Sbjct: 517 ELLVEFFRYYDKFDFRQYAISVRLAKKIPIEECRMVQSLKNDPRQWKY------------ 564
Query: 295 PLFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRL 331
L IE+PF+ N+AR+V ++ +I E H +L
Sbjct: 565 -LCIEEPFDL-TNTARSVYDQVTFLRIQQLIERAHKKL 600
>gi|443894150|dbj|GAC71500.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases [Pseudozyma antarctica T-34]
Length = 1060
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 148/353 (41%), Gaps = 47/353 (13%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L PIL P E++ + L + V G+ + FGS + R D
Sbjct: 336 LSPIL--------PTEEEYRIKEATRRQLERLANRVSP--GSKLLAFGSMANGFALRNSD 385
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE----- 121
+D+ + G + L+ L + +R++ + + + AR+PI+K
Sbjct: 386 MDLCCLIGKGPDGQPTTQHTASELVEILGQLIREETDFTVMP-LPKARIPIIKINRSPTT 444
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ I+CDI +N ++ L + +D R R +VL +K W K +N+P GT +
Sbjct: 445 DLPYEIACDIGFENRLALENTRLLLSYAMVDPTRLRTLVLFLKVWTKRRKLNSPYMGTLS 504
Query: 181 SYSLSLLVLFHFQTCV--PAILPPLKDIYPGN-LVDDLKGVRANAERQIAEICAFNIARF 237
SY +LLVL+ F T V PA+LP L+ + P + D + N I ++
Sbjct: 505 SYGYTLLVLY-FLTHVKKPAVLPNLQRVPPTRPMKPDEMELNGN------NIYFYDDVAT 557
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRW 289
++ N ++ L V F FS +SLK SE G+ P G+ W
Sbjct: 558 LRKEWSSHNTDNVGELLVDFFRYFSKEFSYARDVISLK-SENGLIPKDGKT--------W 608
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALL 342
N L IEDPF+ N +R V++ L I F LT+ + ++L
Sbjct: 609 ---NAELCIEDPFQAGYNVSRTVTKDGLYTIRGEFMRASRYLTNMRGQKISVL 658
>gi|308468493|ref|XP_003096489.1| hypothetical protein CRE_19377 [Caenorhabditis remanei]
gi|308243076|gb|EFO87028.1| hypothetical protein CRE_19377 [Caenorhabditis remanei]
Length = 431
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/334 (28%), Positives = 145/334 (43%), Gaps = 51/334 (15%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
E++ +M + L++ + + P GS V+ L + DLD++I + + I
Sbjct: 52 EEEFVRKMNLCKTLKKAISKHNPDWLFNIVPTGSSVTGLATANSDLDVAIHIPQAALI-- 109
Query: 82 AGKKVKQSLLGDLLRALRQKGGYRRLQF---------------------------VAHAR 114
V+Q G + A +K +R +Q + A+
Sbjct: 110 ----VEQRCKGKKIDAEEKKIMWREMQLNILQIVRLVLVNNEEISQMIDWEEGVNLVQAQ 165
Query: 115 VPILKFETIHQNISCDISI--DNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDI 171
+ ILK +T+ I DIS+ D + + FL + ID RF + +VKEWA + +
Sbjct: 166 IQILKLKTV-DGIEFDISVVMDCFLSSMHNSFLIKHMVLIDHRFGPLCAVVKEWAASTKV 224
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
NPK G FNSY+L LLV+ HF C P +LP L+ +Y K A +E+
Sbjct: 225 KNPKDGGFNSYALVLLVI-HFLQCGTFPPVLPNLQFLYRD------KNFIAMSEKDFPAR 277
Query: 230 CAFNIAR-FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC-PFTGQWEHIRSNT 287
F A F K +K N + +A LF+ FL +S + + I T E S T
Sbjct: 278 LDFGAALPFPLPKIQK-NEAPIARLFLEFLNYYSEFNFDKFYISIKHGKTKIRERSASET 336
Query: 288 RWLPNNHPLFIEDPFEQPENSARAV-SEKNLAKI 320
N ++IEDPF+ N R V S KN+ KI
Sbjct: 337 VQNENRKQVYIEDPFDS-HNPGRTVRSLKNIQKI 369
>gi|324500015|gb|ADY40021.1| Terminal uridylyltransferase 7 [Ascaris suum]
Length = 1444
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/337 (23%), Positives = 141/337 (41%), Gaps = 35/337 (10%)
Query: 110 VAHARVPILKFETIHQ--NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+ +A+VPI+KF H+ + D+S+ N+ ++ L S++D R + + ++VKEWAK
Sbjct: 1043 IPNAKVPIVKFHCQHRYNRLEADVSLYNVLALENTRLLHAYSELDERAKALGVVVKEWAK 1102
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
+I + G+ +SYS ++++ Q P +LP L+ +++G E +I
Sbjct: 1103 CCEIGDASRGSLSSYSFIVMLIHFLQRTTPPVLPFLQ---------EMEGRGRQKEPKIV 1153
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
E C ++ N +S++ L++ FL+ +S I F + IR +
Sbjct: 1154 EDCDVYFCSVEDLEWVTENTASVSELWMGFLDYYS---------RIFDFGAEVVQIRRSE 1204
Query: 288 R-------WLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYA 340
R W P+ IEDPF+ N + V + +A I +F + R
Sbjct: 1205 RLSKLDKGW--QGRPIAIEDPFDLKHNLSSGVHMRTMAYIQRSFIRSRERFARIRCPTKQ 1262
Query: 341 LLSSLARPFILQFFGESPVRYAN--YNNGHRRARPQSHKSVNSPLQA----QHQSHNAKK 394
L ++ I+ FG V N R R H N PL ++ S K
Sbjct: 1263 LNNNSFEALIMDLFGGCRVGAGPPLARNCCYRCRQIGHFVENCPLGQKGGRKNHSGTPAK 1322
Query: 395 ENRPNRSMSQQSVQQHQSQPVRQINGQVQQIWRPKSD 431
E R R + + + ++QI + P +D
Sbjct: 1323 EARRARDANATNRGSENRERIQQIEPTSKIEADPAAD 1359
>gi|114629899|ref|XP_001136690.1| PREDICTED: poly(A) RNA polymerase, mitochondrial isoform 3 [Pan
troglodytes]
gi|397501656|ref|XP_003821496.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Pan paniscus]
gi|410212354|gb|JAA03396.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410212356|gb|JAA03397.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410252822|gb|JAA14378.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410252824|gb|JAA14379.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410252826|gb|JAA14380.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410252828|gb|JAA14381.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410291220|gb|JAA24210.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410291222|gb|JAA24211.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410328579|gb|JAA33236.1| mitochondrial poly(A) polymerase [Pan troglodytes]
gi|410328581|gb|JAA33237.1| mitochondrial poly(A) polymerase [Pan troglodytes]
Length = 582
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 131/295 (44%), Gaps = 47/295 (15%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQSLL 91
V PFGS V N F + G DLD+ ++L +S+ + + Q +L
Sbjct: 227 VRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSAHKTSGNFLMEFQVKNVPSERIATQKIL 285
Query: 92 GDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 SVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGA 345
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 LDSRVRALVFSIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL------ 399
Query: 210 NLVDDLKGVRANAERQIAEI--CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
D LK + ++ + E C F +++R + N +L L F E F +
Sbjct: 400 ---DSLKTLADAEDKCVIEGNNCTFVRDLSRIKPSQ----NTETLELLLKEFFEYFGNFA 452
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 453 FDKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|358343273|ref|XP_003635729.1| Poly(A) RNA polymerase gld-2-like protein [Medicago truncatula]
gi|355501664|gb|AES82867.1| Poly(A) RNA polymerase gld-2-like protein [Medicago truncatula]
Length = 720
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 113/252 (44%), Gaps = 41/252 (16%)
Query: 45 LRGATVEPFGSFVSN----LFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ 100
L G+T+ FGS S+ L R ++D IE S G L +L+ LRQ
Sbjct: 423 LVGSTISGFGSNNSDMDMCLLVRHSEMDQLIE--------SLGH------LERVLKCLRQ 468
Query: 101 KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+ + A+VPILKF+ + D++ +N G + LF +Q+D R R +VL
Sbjct: 469 CSFIKNADLI-QAKVPILKFKDAEHGLEVDLNCNNAVGIRNTHMLFCYAQMDWRVRPLVL 527
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLV--DDLKG 217
+VK WA + IN+ K T +SYSL L+V+ Q V P++LP L ++P DL
Sbjct: 528 IVKLWAASQGINDAKNMTISSYSLVLMVINFLQCGVNPSVLPCLHKLHPSKFQPHTDLHF 587
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASEL----GI 273
+ + E Q + N SL LF +FLE ++ + + G
Sbjct: 588 IDLHEELQ---------------PIKSENNQSLGELFAAFLEYYAQFDYTKNAVSVRTGS 632
Query: 274 CPFTGQWEHIRS 285
C + H RS
Sbjct: 633 CLSIEECRHARS 644
>gi|444317060|ref|XP_004179187.1| hypothetical protein TBLA_0B08530 [Tetrapisispora blattae CBS 6284]
gi|387512227|emb|CCH59668.1| hypothetical protein TBLA_0B08530 [Tetrapisispora blattae CBS 6284]
Length = 461
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 100/191 (52%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P R++ E+R I LR VV+ E A ++ FGS+ ++L+ DLD
Sbjct: 75 IKDFVAYISPNRKEIESRNTAIDKLRSVVK--ELWDDADLQVFGSYATDLYLPGSDLDCV 132
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ + +G K + L L L++K + ++ VAH RVPI+KF + NI D
Sbjct: 133 VN-------TKSGNKGDKKHLYSLATFLKEKIAAKDVEVVAHTRVPIIKFIEPNSNIHID 185
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G +K + W+ G R++VL++K++ A +NN +TG +S+ LV
Sbjct: 186 ISFERTNGLEAAKLIRSWLETTPG-LRELVLIIKQFLHARRLNNVRTGGLGGFSIICLV- 243
Query: 190 FHFQTCVPAIL 200
+ F P IL
Sbjct: 244 YSFLHLHPRIL 254
>gi|357145985|ref|XP_003573837.1| PREDICTED: uncharacterized protein LOC100846935 [Brachypodium
distachyon]
Length = 815
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 130/279 (46%), Gaps = 31/279 (11%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-LGDLLRALRQKGGYRRLQFVA 111
+GS ++ D+D+ + + N KV L L D+L+A G + +Q +
Sbjct: 520 YGSCANSFGFSNSDIDLCLSIDNNEM-----SKVDIILKLADILQA----GNLQNIQALT 570
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
ARVPI+K +SCDI ++NL + +K L +QID R R + +VK WAK+ +
Sbjct: 571 RARVPIVKLMDPDTGLSCDICVNNLLAVVNTKLLRDYAQIDRRLRQLAFIVKHWAKSRRV 630
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL--VDDLKGVRANAERQIAEI 229
N GT +SY+ ++ + Q + ILP L+++ VDD
Sbjct: 631 NETYQGTLSSYAYVIMCIHLLQ--LRRILPCLQEMEATCYVTVDD-------------NH 675
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW--EHIRSNT 287
CA+ + Y N+ +++ L +F ++ ++ I TG+ +H++ T
Sbjct: 676 CAYFDQVDKLNNYGAHNKETISSLLWAFFHYWAYQHDYTKDV-ISIRTGRIISKHMKDWT 734
Query: 288 RWLPNN-HPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
R + N+ H + IEDPFE + R V + ++ + FE
Sbjct: 735 RRVGNDRHLICIEDPFETSHDLGRVVDKFSIKILREEFE 773
>gi|345316663|ref|XP_001511688.2| PREDICTED: poly(A) RNA polymerase GLD2 [Ornithorhynchus anatinus]
Length = 459
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 93/170 (54%), Gaps = 10/170 (5%)
Query: 54 GSFVSNLFSRWGDLDISIELSNG----SCISSAGKKVKQSLLGDLLR---ALRQKGGYRR 106
GS ++ +R D D+ + ++ SC+ +K + + L++ + R R
Sbjct: 209 GSSLNGFGTRSSDGDLCLVVTEEPLFFSCLFQVNQKTEARYILSLVQNHFSTRLSSYIER 268
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEW 165
Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R +VL+VK+W
Sbjct: 269 PQLI-RAKVPIVKFRDKVSCVEFDLNVNNVVG-IRNTFLLRTYAYLENRVRPLVLVVKKW 326
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL 215
A HDIN+ GT NSYSL L+VL + QT +LP L+ YP ++ DL
Sbjct: 327 ASHHDINDASRGTLNSYSLVLMVLHYLQTLPEPVLPSLQKKYPVSVSSDL 376
>gi|449514383|ref|XP_004177159.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase GLD2
[Taeniopygia guttata]
Length = 509
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 128/288 (44%), Gaps = 58/288 (20%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQK--- 101
L G+++ FG+ S+ GDL C+ + V Q + +L QK
Sbjct: 227 LVGSSLNGFGTRTSD-----GDL----------CLVVKEEPVNQKTEARRILSLVQKLFT 271
Query: 102 ---GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW-ISQIDGRFRD 157
Y + A+VPI+KF N+ D++++N+ G I++ FL + I+ R R
Sbjct: 272 TKLSSYIERPQLIRAKVPIVKFRDKVSNVDFDLNVNNVIG-IRNTFLLRSYAFIENRVRP 330
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL+VK+WA H+IN+ GT NSYSL L+VL + QT ILP LK I D
Sbjct: 331 LVLVVKKWASFHEINDASRGTLNSYSLVLMVLHYLQTLPEPILPSLKKITQXECFDPTMQ 390
Query: 218 VR--ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLK 267
+ A R I Y N SSL L + F + ++ +S++
Sbjct: 391 LHFVHQAPRTIP-------------PYVSKNGSSLGDLLIGFFKYYATEFDWSHQMISVR 437
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
++ P +W N + +E+PF+ N+ARAV EK
Sbjct: 438 EAKAIARPDGIEWR-----------NKFICVEEPFDG-TNTARAVHEK 473
>gi|111219431|ref|XP_646847.2| DNA2/NAM7 helicase family protein [Dictyostelium discoideum AX4]
gi|90970906|gb|EAL72919.2| DNA2/NAM7 helicase family protein [Dictyostelium discoideum AX4]
Length = 2314
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 151/348 (43%), Gaps = 37/348 (10%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMK-VISDLREVVESVESLRGATVEPFGSFVSNLFSR 63
N L I+ I +++D R+K +IS E A++ +GSF+S L
Sbjct: 1971 NSLNEIISKIECKKKSIKQDSYNRLKKLIS---------EGFATASINLYGSFLSGLSLN 2021
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
DLDI+ S+ +K + L + + L + Y+ ++ A+VPI++F+ I
Sbjct: 2022 DSDLDINF---------SSTQKEDTTHLKQVYKYLNRSQLYKLIEKRISAKVPIIRFKEI 2072
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
I D+ ++ S L ID R RD+ LLVK WA + D+NN TF+S+
Sbjct: 2073 SSGIHFDMCFHSMMSYHNSLLLGEYCSIDKRCRDLALLVKWWAVSKDLNNAAEKTFSSFC 2132
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARF----SS 239
L +V+ Q+ P ILP L+ L++ R + + I + ++ S
Sbjct: 2133 LVNMVIHFLQSLNPPILPNLQTT-SNQLLEKYSTDRNLIKLKSQTIVENYLVKYYDWSSF 2191
Query: 240 DKYR-KINRSSLAHLFVSFLEKFSGLSLKAS---------ELGICPFTGQWEHIRSNTRW 289
+K+ K N+ ++A LF F +S + K + +C + RS +
Sbjct: 2192 NKFEPKRNKLTIAQLFYQFFYYYSTFNYKENIISISHSSGGGSLCENGALLK--RSTIKG 2249
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT 337
P + + DPF N A ++ +K ++ F M + L ST T
Sbjct: 2250 TPVKGHIIVLDPFINDRNLASSI-KKTYQRVLMEFTMMEYSLRSTKTT 2296
>gi|38197606|gb|AAH61703.1| Mitochondrial poly(A) polymerase [Homo sapiens]
Length = 582
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 126/291 (43%), Gaps = 39/291 (13%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQSLL 91
V PFGS V N F + G DLD+ ++L +S+ + + Q +L
Sbjct: 227 VRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSAHKISGNFLMEFQVKNVPSERIATQKIL 285
Query: 92 GDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 SVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGA 345
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 LDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL------ 399
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS 269
D LK + ++ + E R S N +L L F E F + +
Sbjct: 400 ---DSLKTLADAEDKCVIEGNNRTFVRDLSRIKPSQNTETLELLLKEFFEYFGNFAFDKN 456
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 457 SINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|355562366|gb|EHH18960.1| hypothetical protein EGK_19559 [Macaca mulatta]
Length = 582
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 45/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L +S+ + + Q
Sbjct: 225 CVVRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSTHKTSGNFLMEFQVKNVPSERIATQK 283
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 284 ILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIY 343
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 344 GTLDSRVRALVFGIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 399
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGL 264
D LK + A+AE + I N F D R N +L L F E F
Sbjct: 400 -----DSLKTL-ADAEDKC--IIEGNNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNF 451
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 452 AFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETALNISKNVSQSQLQKF 498
>gi|322785381|gb|EFZ12054.1| hypothetical protein SINV_03147 [Solenopsis invicta]
Length = 659
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 144/322 (44%), Gaps = 74/322 (22%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGS----CISSAGKKVKQSLLGDLLRALRQKG 102
G+T+ FGS S D+DI + + + CI+ L ++L+ L+Q
Sbjct: 361 GSTMNGFGSNDS-------DVDICLLVKHKEMDVRCIAIEH-------LMEVLKHLKQND 406
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+L+ + HA+VPI+ F + D++ +N G + L+ S++D R + + L+V
Sbjct: 407 FVEQLEII-HAKVPIITFFDAARKFKVDMNCNNSVGIRNTHLLYCYSKLDWRVKPLALVV 465
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDIYPGNLVDDLKGVRA 220
K WA+ H+INNPK T +SYSL L+V+ HF C P +LP L ++
Sbjct: 466 KLWAQWHNINNPKCRTLSSYSLVLMVI-HFLQCGTNPPVLPCLHSMF------------V 512
Query: 221 NAERQIAEICAFNIAR---FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---- 273
N R A+I NI SS++ K N SL L F + + + +
Sbjct: 513 NKFRPDADIYNINIHEDLNISSNRLPK-NHQSLGELLFEFFKYYVEFDFSQYAISVRLAS 571
Query: 274 ------CPFTG-------QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL-AK 319
C QW++ L IE+PF+ N+AR+V + ++ +K
Sbjct: 572 KIPKEECRMVQSSKNDPYQWKY-------------LCIEEPFDL-TNTARSVYDPDMFSK 617
Query: 320 ISNAFEMTHFRLTSTNQTRYAL 341
I +T+ RL + RY+L
Sbjct: 618 IIFILNITYTRL----KQRYSL 635
>gi|71021859|ref|XP_761160.1| hypothetical protein UM05013.1 [Ustilago maydis 521]
gi|46100598|gb|EAK85831.1| hypothetical protein UM05013.1 [Ustilago maydis 521]
Length = 1174
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 150/359 (41%), Gaps = 59/359 (16%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L PIL P E++ + L + V GA + FGS + R D
Sbjct: 360 LSPIL--------PTEEEYRIKEATRRQLERLANRVSP--GAKLLAFGSMANGFALRNSD 409
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE----- 121
+D+ + + L+ L + +R++ + + + AR+PI+K
Sbjct: 410 MDLCCLMGKRDDAQPTPQHTASELVEILGQLIREETDFTVMP-LPKARIPIIKISRSPTA 468
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDG-RFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ I+CDI +N ++ L + +D R R +VL +K WAK +N+P GT +
Sbjct: 469 DLPYEIACDIGFENRLALENTRLLLSYAMVDPQRLRTLVLFLKVWAKRRKLNSPYMGTLS 528
Query: 181 SYSLSLLVLFHFQTCV--PAILPPLKDIYP------------GN---LVDDLKGVRANAE 223
SY +L+VLF F T V PA+LP L+ + P GN DD+ +R +
Sbjct: 529 SYGYTLMVLF-FLTHVKKPAVLPNLQRVPPTRPMKPEEMELNGNNIYFYDDVAALRKSWS 587
Query: 224 RQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHI 283
Q E N+ D +R ++ F +SLK SE G+
Sbjct: 588 SQNTE----NVGELLVDFFRYFSK--------EFSYARDVISLK-SETGLL--------S 626
Query: 284 RSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALL 342
+ + W N L IEDPF++ N +R V++ L I F LT+T + ++L
Sbjct: 627 KDSKSW---NAELCIEDPFQEGYNVSRTVTKDGLYTIRGEFIRASRLLTNTRGQKISVL 682
>gi|380790759|gb|AFE67255.1| poly(A) RNA polymerase, mitochondrial precursor [Macaca mulatta]
gi|380808336|gb|AFE76043.1| poly(A) RNA polymerase, mitochondrial precursor [Macaca mulatta]
gi|380808338|gb|AFE76044.1| poly(A) RNA polymerase, mitochondrial precursor [Macaca mulatta]
gi|383419609|gb|AFH33018.1| poly(A) RNA polymerase, mitochondrial precursor [Macaca mulatta]
Length = 582
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 45/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L +S+ + + Q
Sbjct: 225 CVVRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSTHKTSGNFLTEFQVKNVPSERIATQK 283
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 284 ILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIY 343
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 344 GTLDSRVRALVFGIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 399
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGL 264
D LK + A+AE + I N F D R N +L L F E F
Sbjct: 400 -----DSLKTL-ADAEDKC--IIEGNNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNF 451
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 452 AFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETALNISKNVSQSQLQKF 498
>gi|157113025|ref|XP_001657730.1| hypothetical protein AaeL_AAEL000996 [Aedes aegypti]
gi|108883700|gb|EAT47925.1| AAEL000996-PA [Aedes aegypti]
Length = 564
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 126/260 (48%), Gaps = 25/260 (9%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+GD+L+ G ++ + ARVPI+K+ H ++ D++++N+ G S+ L+ Q
Sbjct: 239 IGDVLQLFLP--GVNSVRRILKARVPIIKYHHEHLDLEIDLTMNNMTGVYMSELLYLFGQ 296
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +++WA+A + N G + ++SL++LV++ Q ILPP+ +
Sbjct: 297 IDPRVQPLTFCIRKWAQAVGLTNHAPGYWITNFSLTMLVMYFLQQLKEPILPPINKLVQN 356
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS 269
DL+ E QI C+F + S ++ N S+L L + F E +S
Sbjct: 357 ASPTDLR----ITESQIN--CSF-LRDISKLDFKTSNTSTLEDLLLQFFEFYSHFDFNQR 409
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS--EKNLAKIS--NAFE 325
+ + T + P++ P++I +P E N ++ V+ E L +I NA
Sbjct: 410 AISLNVGTSILK---------PDHSPMYIVNPLETILNVSKNVNLEETELFRIQVRNALW 460
Query: 326 M--THFRLTSTNQTRYALLS 343
+ TH + T+ T + L+S
Sbjct: 461 LLDTHDKSTAAESTEWGLVS 480
>gi|355782716|gb|EHH64637.1| hypothetical protein EGM_17906 [Macaca fascicularis]
Length = 582
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 45/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L +S+ + + Q
Sbjct: 225 CVVRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSTHKTSGNFLMEFQVKNVPSERIATQK 283
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 284 ILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIY 343
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 344 GTLDSRVRALVFGIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 399
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGL 264
D LK + A+AE + I N F D R N +L L F E F
Sbjct: 400 -----DSLKTL-ADAEDKC--IIEGNNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNF 451
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 452 AFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|71027159|ref|XP_763223.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350176|gb|EAN30940.1| hypothetical protein, conserved [Theileria parva]
Length = 471
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 99/190 (52%), Gaps = 9/190 (4%)
Query: 17 MLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNG 76
+L P E +E + ++ L+ ++ES S+ G T+ FGS + L++R D+D+ + + N
Sbjct: 150 VLCPTAEQFEKKRSLMDYLKPLIES--SING-TLHTFGSCDNGLWTRGSDIDLCLVIPNC 206
Query: 77 SCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNL 136
K+ S L +L+++ + ARVPI+K +N CDISI+N
Sbjct: 207 D-----SKRYMLSKL-NLIKSCLSNSSIISKISIISARVPIVKLFDKEENSICDISINNT 260
Query: 137 CGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV 196
S+++ +S++D R + +K WA + INN GT +SY+L L + + Q
Sbjct: 261 IALANSEYVKAMSRLDERVVLLGRFIKYWATSRKINNRAQGTMSSYTLILQLFYFLQNTT 320
Query: 197 PAILPPLKDI 206
P I+PP KDI
Sbjct: 321 PPIIPPFKDI 330
>gi|83772230|dbj|BAE62360.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 493
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 132/303 (43%), Gaps = 38/303 (12%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
++ L P E + R +++ L + R V FGS + L S D+DI
Sbjct: 42 EVYDRLLPSAESDDRRRQLVRKLERLFNEQWPGRDIKVHVFGSSGNKLCSSDSDVDI--- 98
Query: 73 LSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
CI++ K+++Q LL ++L + G R+ V+HA+VPI+K ++CD+
Sbjct: 99 -----CITTTYKELEQVCLLAEVL----ARHGMERVVCVSHAKVPIVKIWDPELQLACDM 149
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLF 190
+++N ++ + +ID R R + +++K W K + + GT +SY+ L++
Sbjct: 150 NVNNTLALDNTRMVRTYVEIDERVRPLAMIIKHWTKRRILCDAGLGGTLSSYTWICLIIN 209
Query: 191 HFQTCVPAILPPL------KDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
QT P ILP L K I P LV C+F+ + Y +
Sbjct: 210 FLQTRNPPILPSLQARPHEKKISPEGLV-----------------CSFDDDLGNLTGYGR 252
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
N+ SL LF F K+ G L + + G+ + L N+ L +E+PF
Sbjct: 253 KNKQSLGDLFFQFF-KYYGHELDYEKYVVSVREGKLISKEAKGWHLLQNNRLCVEEPFNT 311
Query: 305 PEN 307
N
Sbjct: 312 SRN 314
>gi|109088607|ref|XP_001083177.1| PREDICTED: poly(A) RNA polymerase, mitochondrial isoform 2 [Macaca
mulatta]
Length = 582
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 45/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L +S+ + + Q
Sbjct: 225 CVVRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSTHKTSGNFLTEFQVKNVPSERIATQK 283
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 284 ILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIY 343
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 344 GTLDSRVRALVFGIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 399
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGL 264
D LK + A+AE + I N F D R N +L L F E F
Sbjct: 400 -----DSLKTL-ADAEDKC--IIEGNNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNF 451
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 452 AFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|194770583|ref|XP_001967371.1| GF21578 [Drosophila ananassae]
gi|190618051|gb|EDV33575.1| GF21578 [Drosophila ananassae]
Length = 1334
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 66/98 (67%), Gaps = 3/98 (3%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F+ I I D++ +N G IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 1062 ARVPILRFKDILNGIEVDLNFNNCVG-IKNTYLLQLYAQLDWRTRPLVVIVKLWAQFHDI 1120
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYP 208
N+ K T +SYSL L+VL + Q C+P +LP L +YP
Sbjct: 1121 NDAKRMTISSYSLVLMVLHYLQYGCIPHVLPCLHSLYP 1158
>gi|402879903|ref|XP_003903561.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Papio anubis]
Length = 582
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 130/296 (43%), Gaps = 45/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L +S+ + + Q
Sbjct: 225 CVVRPFGSSV-NTFGKLGCDLDMFLDLDETRNLSTHKTSGNFLMEFQVKNVPSERIATQK 283
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 284 ILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIY 343
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 344 GTLDSRVRALVFGIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 399
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGL 264
D LK + A+AE + I N F D R N +L L F E F
Sbjct: 400 -----DSLKTL-ADAEDKC--IIEGNNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNF 451
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 452 AFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQKF 498
>gi|336363388|gb|EGN91781.1| hypothetical protein SERLA73DRAFT_66841 [Serpula lacrymans var.
lacrymans S7.3]
Length = 369
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/340 (26%), Positives = 149/340 (43%), Gaps = 64/340 (18%)
Query: 29 MKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+ V D+R+++E + T+EP FGS + R D+D+ C+ +
Sbjct: 40 IAVKEDVRKLLERLIR----TIEPHSRLLSFGSTANGFSLRNSDMDLC-------CLIDS 88
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET-----IHQNISCDISIDNLC 137
G+++ S L +L L ++ ++ + HAR+PI+K + I+CDI +N
Sbjct: 89 GERLSSSDLVTMLADLLERETKFHVKPLPHARIPIVKLSLDPSPGLPLGIACDIGFENRL 148
Query: 138 GQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-FHFQTC 195
++ L + ID R R +VL +K W+K IN+P GT +SY LLV+ F
Sbjct: 149 ALENTRLLMCYAMIDPTRVRTLVLFLKVWSKRRKINSPYQGTLSSYGYVLLVIYFLVHVK 208
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIA--EICAFNIARFSSDKYRKINRSSLAHL 253
P +LP L+ + P L+ + + + Q+A F+ +++ N S+A L
Sbjct: 209 NPPVLPNLQQMPP------LRPI-SQEDTQLAGYNTWFFDDIELLRQRWQSSNTESVAEL 261
Query: 254 FVSFLEKFS--------------GLSLKAS-----ELGICPFTGQWEHIRSNTRWLPNNH 294
+ F +S GL K S +L +G++ R R+
Sbjct: 262 LIDFFRYYSRDFSYNTGVASIRAGLLKKESKGWQNDLS----SGKYNDARERNRFC---- 313
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTST 334
IEDPFE N AR V++ L I F M R+ ST
Sbjct: 314 ---IEDPFEADFNVARCVTKDGLYLIRGEF-MRASRILST 349
>gi|345569402|gb|EGX52268.1| hypothetical protein AOL_s00043g57 [Arthrobotrys oligospora ATCC
24927]
Length = 747
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 145/364 (39%), Gaps = 65/364 (17%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFS 62
Y + K ++ ++P + E MK S LR + E +L G+ + PFGS VS +
Sbjct: 253 YTQMTDYAKSVVQSISPTPD--EIAMKS-STLRRITEICNNLVPGSRIIPFGSLVSGFAT 309
Query: 63 RWGDLDI-----SIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPI 117
+ D+D+ S+ S S+ ++ L K G+ + + RVPI
Sbjct: 310 KGADMDVIFAHDSLHPQPFSHESNVPVRLANEFL---------KRGFE-VDLLIRTRVPI 359
Query: 118 LKFETIH--------------------------QNISCDISIDNLCGQIKSKFLFWISQI 151
LK +T +NISCDI G S F SQ
Sbjct: 360 LKIKTPSNDPGSRPGSPSAQDALKEDLDEEPWPENISCDIGFKAHLGITNSYFFRTYSQC 419
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGN 210
D RFR+MVL VK+W+K D+N+P GT +SY L+V F P +LP L+ I P
Sbjct: 420 DSRFREMVLFVKQWSKNRDLNSPYFGTLSSYGYVLMVAHFLINIVKPPVLPNLQLIPP-- 477
Query: 211 LVDDLKGVRANAERQIAEICAF-NIARFSSDKY--RKINRSSLAHLFVSFLEKF-SGLSL 266
D + + +I F +IA+ +S + N L L F + F + +
Sbjct: 478 ---DPDTPESELRQDGFDIWYFKDIAKITSGELLPDGKNEMGLGQLIYEFFQYFTTNFNF 534
Query: 267 KASELGICPFTG-------QWEHIR---SNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
+ + I G W R T + + L +EDPFE N R
Sbjct: 535 VSEVVTIRSLGGVMYKQHKGWTSARERVGETTTYQDRYLLALEDPFEITHNVGRTCGGPG 594
Query: 317 LAKI 320
+ +I
Sbjct: 595 VRRI 598
>gi|195438834|ref|XP_002067337.1| GK16234 [Drosophila willistoni]
gi|194163422|gb|EDW78323.1| GK16234 [Drosophila willistoni]
Length = 506
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 136/317 (42%), Gaps = 50/317 (15%)
Query: 24 DWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWGDLDISIE---------- 72
D RM+ ++ L ++ +S+ + A PFGS V+ DLD+ +
Sbjct: 73 DLGVRMRFLAAL-QIQQSISGMFPSAQAVPFGSSVNGFGKMGCDLDLILRFDKERGAKNH 131
Query: 73 ---------------LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPI 117
LSNG S ++ +S+ GDLL G ++ + ARVPI
Sbjct: 132 QQTEPSRLIYHLKENLSNGR---SQTQRQMESI-GDLLHLFLP--GVCHVRRILQARVPI 185
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
+K+ H N+ D+S+ NL G S+ L+ ++D R R + ++ WA++ + NP G
Sbjct: 186 IKYHHEHLNLEVDLSMSNLSGFYMSELLYMFGELDTRVRPLTFTIRRWAQSCGLTNPSPG 245
Query: 178 TF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
+ +++SLS LV+F Q ILP + + DD + C F +
Sbjct: 246 RWISNFSLSCLVIFFLQQLRQPILPSIGSLVKAADADDFRVTEDGIN------CTF-VRD 298
Query: 237 FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPL 296
++ N S L+ L + F E +S + + RS + P++ +
Sbjct: 299 LERLNFQSRNTSKLSELLLQFFEFYSQFDFHNRAISL-------NEARSLAK--PDHSAI 349
Query: 297 FIEDPFEQPENSARAVS 313
+I +P EQ N ++ VS
Sbjct: 350 YIVNPLEQLLNVSKNVS 366
>gi|195446725|ref|XP_002070898.1| GK25423 [Drosophila willistoni]
gi|194166983|gb|EDW81884.1| GK25423 [Drosophila willistoni]
Length = 1383
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 114/243 (46%), Gaps = 49/243 (20%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F +I D++ +N G IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 1100 ARVPILRFRDALNDIEVDLNYNNCVG-IKNTYLLQLYAQLDWRTRPLVVIVKLWAQYHDI 1158
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
N+ K T +SYSL L+V+ + Q C+P +LP L ++P Q+++
Sbjct: 1159 NDAKRMTISSYSLVLMVIHYLQHGCIPHVLPCLHTLFPDKF-------------QLSQQD 1205
Query: 231 AFNIARFSS-DKYRKINRSSLAHLFVSFLEKFS-----GLSLKASELGICPFTG------ 278
++ + Y+ +N+ +L + F + FS L++ G+ P
Sbjct: 1206 CLDLDLIEPIEPYQTLNKQTLGEHLLGFFQYFSQFDFRNLAISIRTGGVLPVNACRLAKA 1265
Query: 279 ------QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE-KNLAKISNAFEMTHFRL 331
QW+ + IE+PF+ N+AR+V + ++ F + RL
Sbjct: 1266 QKNDIHQWKELN-------------IEEPFDL-SNTARSVYDYATFERVKAIFVASAHRL 1311
Query: 332 TST 334
T
Sbjct: 1312 EHT 1314
>gi|156392397|ref|XP_001636035.1| predicted protein [Nematostella vectensis]
gi|156223134|gb|EDO43972.1| predicted protein [Nematostella vectensis]
Length = 418
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/310 (25%), Positives = 132/310 (42%), Gaps = 44/310 (14%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
R +V+ +L + + V A + FGS V+ + DLDI + L + KV
Sbjct: 78 RQEVLRNLEDYIREVYP--AACLYLFGSSVNGFGFKESDLDICMTLDGKTKDDVDPIKV- 134
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ DL + L+Q R + + A+VPI+KF DIS+ N S+ L
Sbjct: 135 ---IHDLSKKLKQHSDIRNVLAITTAKVPIVKFYIRSVKREGDISLYNTLALENSRMLRT 191
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+ +D R R + +K +AK DI + G+ +SY+ L++L + QTC P ++P L++++
Sbjct: 192 YADLDVRVRQLGFTLKIFAKVCDIGDASKGSLSSYAYILMLLHYLQTCQPPVIPILQELH 251
Query: 208 ----PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
P N+++ N + ++ ++ NR S+ L++ FL ++
Sbjct: 252 NGQCPNNMIEGWNCWYYNDLPNLPKV------------WKSKNRESVGLLWLGFLRYYTE 299
Query: 264 LSLKASELGICPFTGQWEH----IRSNTRW-----LPNNHPLFIEDPFEQPENSARAVSE 314
T WEH R + R + H IEDPF N ++
Sbjct: 300 -------------TFDWEHDVVCCRRSKRLTKFEKMWTKHEFAIEDPFNLSHNLGAGITR 346
Query: 315 KNLAKISNAF 324
K IS F
Sbjct: 347 KMATFISLVF 356
>gi|449492186|ref|XP_004186192.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase,
mitochondrial [Taeniopygia guttata]
Length = 387
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 134/297 (45%), Gaps = 47/297 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKK-----------------VKQS 89
+TV+ FGS V N F + G D+D+ ++ + S+ KK Q
Sbjct: 36 STVKLFGSSV-NTFGKLGSDVDMFLDFCDTGKHSTKMKKGPFEMEYQMKRLPSERLATQR 94
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P++KF CD+S+ N S+ L+
Sbjct: 95 ILSVIGDCLDNFGPGCVNVQKILNARCPLVKFSHQPTGFQCDLSVSNSIATRSSELLYIY 154
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+ H + N GT+ ++SL+++V+F Q P I+P L
Sbjct: 155 GCLDSRVRALVFTVRCWARVHGLTNSAPGTWITNFSLTMMVMFFLQRRSPPIIPTL---- 210
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + ++ I I ++ + F SD RKI N +L L F E F
Sbjct: 211 -----DQLKELADEKDKHI--IGGYDCS-FVSD-LRKIKPTKNTETLDVLLGEFFEYFGN 261
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + L + + + + P + PL+I +PFEQ N ++ V++ L K
Sbjct: 262 FDFRKNSLNL----RKGKEVNK-----PESSPLYIWNPFEQDLNISKNVNQPQLEKF 309
>gi|170097539|ref|XP_001879989.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645392|gb|EDR09640.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 901
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 153/341 (44%), Gaps = 53/341 (15%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRW 64
L D + L P E+ M V D+R+++E + +R T+EP FGS + R
Sbjct: 45 LFDFVIQLLPTPEE----MAVKEDVRKLLERL--IR--TIEPDSRLLSFGSTANGFSLRN 96
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE--- 121
D+D+ C+ + +++ + L +L L ++ ++ + HAR+PI+K
Sbjct: 97 SDMDLC-------CLIDSQERLAATDLVTMLGDLLERETKFHVKPLPHARIPIVKLSLDP 149
Query: 122 --TIHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGT 178
+ I+CDI +N ++ L + +D R R MVL +K W+K IN+P GT
Sbjct: 150 SPGLPLGIACDIGFENRLALENTRLLMCYAMVDPTRVRTMVLFLKVWSKRRKINSPYKGT 209
Query: 179 FNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARF 237
+SY LLV+ F P +LP L+ + P L+ + ++ + +N F
Sbjct: 210 LSSYGYVLLVIYFLVHVKNPPVLPNLQQMPP------LRPI----TKEDTHLNGYNTWFF 259
Query: 238 SS-----DKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWEHIRS 285
++ N ++A L + F +S G++ + L F G W++ S
Sbjct: 260 DDIELLRQRWHSENTETVAELLIDFFRYYSRDFSYNTGVASIRAGLLKKDFKG-WQNDLS 318
Query: 286 NTRW--LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+R+ + L IEDPFE N +R V++ L I F
Sbjct: 319 ASRYNDARERNRLCIEDPFETDYNVSRCVTKDGLYTIRGEF 359
>gi|66812982|ref|XP_640670.1| hypothetical protein DDB_G0281431 [Dictyostelium discoideum AX4]
gi|60468695|gb|EAL66697.1| hypothetical protein DDB_G0281431 [Dictyostelium discoideum AX4]
Length = 920
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 149/315 (47%), Gaps = 25/315 (7%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
+ R K DL ++ S+ G + FGS V+++ + D+D+ L K+
Sbjct: 584 DNRFKSFDDLARIL--TRSISGIRLFLFGSSVNSMALKNSDIDVCANL----------KR 631
Query: 86 VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
+ L+ ++ ++ +K GY + ++ +RVP++KF +I D+ I+NL S+ +
Sbjct: 632 NDEKLIF-IISSILKKNGYENIVTISQSRVPLIKFFDAKYDIHIDLCINNLLAIENSRLI 690
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL-K 204
S ID R + +L+K WAK+ +N+ + +SYS + L +F QT P +LP L K
Sbjct: 691 KSYSSIDSRMEPLFMLIKTWAKSKGLNDAAEKSLSSYSYANLTVFFLQTRQPPVLPCLHK 750
Query: 205 DIYPGN---LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI--NRSSLAHLFVSFLE 259
+ P LV+++ + + N + + K+ N+ S+A L F +
Sbjct: 751 GMSPKTKEVLVENVNVSYLDPTIFLNSSNNGNNNNANGNYGFKLNQNKESVAELLFGFFD 810
Query: 260 KFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSARAVSEKNL 317
+S + + I + E ++ +R N+ P ++I+DPF N ++++SEKN
Sbjct: 811 FYSKFDFENWIIDI----RRGEAVQLKSRKEINSTPANIYIQDPFIFDFNPSKSLSEKNF 866
Query: 318 AKISNAFEMTHFRLT 332
K + F L+
Sbjct: 867 IKFNTEVRKAAFLLS 881
>gi|363736637|ref|XP_422476.3| PREDICTED: terminal uridylyltransferase 4 [Gallus gallus]
Length = 1612
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 143/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E A + FGS + R
Sbjct: 914 ILDIVCKRCFDELSPPLSEQQNREQILASLERFIRK-EYNDKARLCLFGSSKNGFGFRDS 972
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G R + + A+VPI+KFE
Sbjct: 973 DLDICMTLEGHE---NAEKLNCKEIIEGLAKVLKKHPGLRNILPITTAKVPIVKFEHRRS 1029
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1030 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1089
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1090 LMVLYFLQQRNPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDDMEELKKRLP 1140
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1141 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1188
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1189 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1219
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 87/179 (48%), Gaps = 10/179 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D+ R ++++++ ++++ + L ++ +GS ++ + D++I I+
Sbjct: 319 DDFRIRQEIVNEMEKIIQ--QPLPDCSLRMYGSCLTRFAFKTSDINIDIKF--------P 368
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K + +L +L L+ Y ++ HA+VP++ + I ++C +S N + +
Sbjct: 369 PKMSQPDVLIQVLEILKNSAVYSEVESDFHAKVPVVFCKDIKSGLTCKVSARNDVACLTT 428
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
L + +++ +VL + WAK I+ G SYS +L+V+F Q P ILP
Sbjct: 429 DLLAALGKLEPVLIPLVLAFRYWAKLCHIDCQAEGGIPSYSFALMVIFFLQQRKPPILP 487
>gi|403223254|dbj|BAM41385.1| uncharacterized protein TOT_030000647 [Theileria orientalis strain
Shintoku]
Length = 523
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 94/190 (49%), Gaps = 9/190 (4%)
Query: 17 MLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNG 76
+LNP E ++ + +I L + +S + T FGS + L+SR D+D+ + + N
Sbjct: 171 ILNPTPEQYQKKQNLIDHLTPIFKSTIDGKLYT---FGSCDNGLWSRGSDIDLCLVVPNC 227
Query: 77 SCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNL 136
K+ S L +L+++ + ARVPI+K + N CDISI+N
Sbjct: 228 D-----SKRYMLSKL-NLIKSCLSNSDIISKISIISARVPIVKLYDMDNNNLCDISINNT 281
Query: 137 CGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV 196
+ S+++ + ID R M +K WA INN GT +SY+L L + ++ Q
Sbjct: 282 VALLNSEYVKTMCNIDSRVVTMGRFIKYWATCRKINNRAEGTMSSYTLILQLFYYLQNRD 341
Query: 197 PAILPPLKDI 206
P I+P LK+I
Sbjct: 342 PPIIPTLKEI 351
>gi|355752030|gb|EHH56150.1| hypothetical protein EGM_05505, partial [Macaca fascicularis]
Length = 642
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 118/272 (43%), Gaps = 38/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 90 GDLGKALELAEAPKREKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 147
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL +S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 148 SGLHGDISLSNRLALHNSRFLSLVSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 206
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSD 240
+LLV++ QT P +LP + + + E + E+ C+F R +S
Sbjct: 207 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCSF--PRDASR 253
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWL 290
R N L+ L F S L+ S L + P G WE +R
Sbjct: 254 LERSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG---- 309
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
P+ ++DPF+ N A V+ + ++ N
Sbjct: 310 ----PMNLQDPFDLSHNVAANVTSRVAGRLQN 337
>gi|261189334|ref|XP_002621078.1| poly(A) polymerase [Ajellomyces dermatitidis SLH14081]
gi|239591655|gb|EEQ74236.1| poly(A) polymerase [Ajellomyces dermatitidis SLH14081]
Length = 1129
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 117/253 (46%), Gaps = 25/253 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E + R K ++ L +++ V FGS + L S D+DI
Sbjct: 148 MRELYHRLLPSEESEQRRSKFVNKLEKLLNKQWPGNNIRVHVFGSSGNKLCSSDSDVDI- 206
Query: 71 IELSNGSCISSAGKKV-KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K++ K +L D L K G R+ V+HARVPI+K ++C
Sbjct: 207 -------CITTTYKELEKVCILADFL----AKSGMERVVCVSHARVPIVKIWDPELRLAC 255
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 256 DMNVNNTLALENTRMIRTYVEIDERVRPLAMIVKYWTKRRILNDAALGGTLSSYTWICLI 315
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS 248
+ QT ILP L++ N +D G ++ + + ++ F K N+S
Sbjct: 316 INFLQTRTIPILPSLQERCAKN-TNDTGGSGSSFDDDLEKLAGFG----------KENKS 364
Query: 249 SLAHLFVSFLEKF 261
+L L F +
Sbjct: 365 TLGQLLFQFFRYY 377
>gi|350589497|ref|XP_003130750.3| PREDICTED: poly(A) RNA polymerase, mitochondrial [Sus scrofa]
Length = 581
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 126/294 (42%), Gaps = 39/294 (13%)
Query: 47 GATVEPFGSFVSNLFSRWG-DLDISI---ELSNGSCISSAG--------------KKVKQ 88
G V PFGS V N F + G DLD+ + E+ N S ++G + V Q
Sbjct: 226 GCAVRPFGSSV-NSFGKLGCDLDMFLDLDEIGNFSAQKASGNFLMEFQVKNVPSERIVTQ 284
Query: 89 SLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 285 KILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYL 344
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDI 206
+D R R +V ++ WA+ H + + G + ++SL+++V+F Q P ILP L
Sbjct: 345 YGALDSRVRALVFSIRCWARVHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL--- 401
Query: 207 YPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSL 266
D LK + ++ I E R + N SL L F E F +
Sbjct: 402 ------DSLKSLADAEDKCIIEGHNCTFVRDLNKIKPSGNTESLELLLKEFFEYFGNFAF 455
Query: 267 KASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I R + P + PL I++PFE N ++ VS+ L K
Sbjct: 456 NKNSINI-------RQGREQNK--PESSPLHIQNPFETSLNISKNVSQSQLQKF 500
>gi|26328863|dbj|BAC28170.1| unnamed protein product [Mus musculus]
Length = 279
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 110/226 (48%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N G I++ FL + ++ R R
Sbjct: 44 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNTVG-IRNTFLLRTYAYLENRVRP 101
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA HDIN+ GT +SYSL L+VL + QT ILP L+ IY +
Sbjct: 102 LVLVIKKWASHHDINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYT-------ES 154
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + + N+ + S N SSL L + FL+ ++ +S++ +
Sbjct: 155 FSTSVQLHLVHHAPCNVPPYLSK-----NESSLGDLLLGFLKYYATEFDWNTQMISVREA 209
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 210 KAIPRPDDMEWR-----------NKYICVEEPFDG-TNTARAVHEK 243
>gi|328711103|ref|XP_001945875.2| PREDICTED: poly(A) RNA polymerase, mitochondrial-like
[Acyrthosiphon pisum]
Length = 557
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 157/347 (45%), Gaps = 58/347 (16%)
Query: 6 VLEPILKDI-LGMLNPLRE------------DWETRMKVISDLREVVESVESLRG----A 48
++EP +KDI L L+ L D TR++ ++ ++ +E+ +L+G +
Sbjct: 156 MIEPNIKDIELNQLDSLSNQMLLLLKRTQLNDIGTRLRFLTAMQ--IEN--ALKGIFPLS 211
Query: 49 TVEPFGSFVSNLFSRWGDLDISIE------------LSNGSCISSAGKKVKQSL--LGDL 94
V PFGS V++ D+D+ I + +G C+S+ + ++++ LGDL
Sbjct: 212 RVLPFGSSVNSFGKIGSDIDLVIMDSQTTENETSRLIYHGKCVSNGRTQTQRNIEILGDL 271
Query: 95 LRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR 154
L+ G R++ + ARVPI+K+ + CD+++ N S+ L+ D R
Sbjct: 272 LQLFL--PGCSRVKRITQARVPIVKYSQDFVGVECDLAVSNETAVNMSELLYIFGNFDYR 329
Query: 155 FRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
R +V VK WAK ++ N G + ++SL+LLVLF+ Q I+P ++ +
Sbjct: 330 VRPLVFTVKMWAKEINLTNDTPGRWITNFSLTLLVLFYLQQ--EKIIPDIQTL------- 380
Query: 214 DLKGVRANAERQIAEICAFNIARFSSDKYRKI-NRSSLAHLFVSFLEKFSGLSLKASELG 272
+K R N R E R +S R + N+ +L L + F E F+ + +
Sbjct: 381 -VKQARNNDVRITNEGINCTFLRDASKLPRVVENKKTLDQLLLGFFEYFASFDFNTNAIS 439
Query: 273 ICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ T P L+I +P E N ++ +S + + +
Sbjct: 440 LN---------FGKTINKPEYSALYIVNPLEVHFNVSKNISSEEVER 477
>gi|449268208|gb|EMC79078.1| Terminal uridylyltransferase 4 [Columba livia]
Length = 1593
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 143/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E A + FGS + R
Sbjct: 904 ILDIVCKRCFDELSPPLSEQQNREQILASLERFIRK-EYNDKARLCLFGSSKNGFGFRDS 962
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G R + + A+VPI+KFE
Sbjct: 963 DLDICMTLEGHE---NAEKLNCKEIIEGLAKVLKKHPGLRNILPITTAKVPIVKFEHRRS 1019
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1020 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1079
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1080 LMVLYFLQQRNPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDDMEELKKRLP 1130
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1131 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1178
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1179 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1209
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/199 (21%), Positives = 94/199 (47%), Gaps = 14/199 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D+ R ++++++ ++++ + L ++ +GS ++ + D++I I+
Sbjct: 309 DDFRIRQEIVNEMEKIIQ--QPLPDCSLRMYGSCLTRFAFKTSDINIDIKF--------P 358
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K + +L +L L+ Y ++ HA+VP++ + + ++C +S N + +
Sbjct: 359 PKMSQPDVLIQVLEILKNSDVYSDVESDFHAKVPVVFCKDVKSGLTCKVSARNDVACLTT 418
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ +VL + WA+ I+ G SYS +L+V+F Q P ILP
Sbjct: 419 DLLAALGKLEPVLIPLVLAFRYWARLCHIDCQAEGGIPSYSFALMVIFFLQQRKPCILPS 478
Query: 203 LKDIYPGNLVDDLKGVRAN 221
Y GN ++ R +
Sbjct: 479 ----YLGNWIEGFDSKRPD 493
>gi|296206391|ref|XP_002750184.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Callithrix
jacchus]
Length = 582
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 134/296 (45%), Gaps = 51/296 (17%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS---------------------AGKKVK 87
V PFGS V N F + G DLD+ ++L+ +S+ A +K+
Sbjct: 227 VRPFGSSV-NTFGKLGCDLDMFLDLNETRSLSTHKTSGNFLMEFQVKNVPSERIATQKI- 284
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
S++G+ L G +Q + +AR P+++F + CD++ +N S+ L+
Sbjct: 285 LSVIGECLDNF--SPGCVGVQKILNARCPLVRFSHQASGLQCDLTTNNRIALTSSELLYI 342
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDI 206
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 343 YGTLDSRVRALVFTVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL--- 399
Query: 207 YPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSG 263
D L+ + A+AE + I N F D R N +L L F E F
Sbjct: 400 ------DSLQTL-ADAEDKC--IIESNNCTFVRDLNRIKPSENTETLELLLKEFFEYFGN 450
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ V++ L K
Sbjct: 451 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVNQSQLQK 497
>gi|449508859|ref|XP_002193471.2| PREDICTED: terminal uridylyltransferase 4 [Taeniopygia guttata]
Length = 1623
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 143/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E A + FGS + R
Sbjct: 914 ILDIVCKRCFDELSPPLSEQQNREQILASLERFIRK-EYNDKARLCLFGSSKNGFGFRDS 972
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G R + + A+VPI+KFE
Sbjct: 973 DLDICMTLEGHE---NAEKLNCKEIIEGLAKVLKKHPGLRNILPITTAKVPIVKFEHRRS 1029
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1030 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1089
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1090 LMVLYFLQQRNPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDDMEELKKRLP 1140
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1141 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1188
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1189 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1219
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/199 (21%), Positives = 93/199 (46%), Gaps = 14/199 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED+ R +++ ++ ++++ + L ++ +GS ++ + D++I I+
Sbjct: 319 EDFRIRQEIVKEMEKIIQ--QPLPDCSLRMYGSCLTRFAFKTSDVNIDIKF--------P 368
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K + +L +L L+ Y ++ HA+VP++ + + ++C +S N + +
Sbjct: 369 PKMSQPDVLIQVLEILKNSAVYSDVESDFHAKVPVVFCKDVKSGLTCKVSARNDVACLTT 428
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ +VL + WA+ I+ G SYS +L+V+F Q P ILP
Sbjct: 429 DLLAALGKLEPVLIPLVLAFRYWARLCHIDCQAEGGIPSYSFALMVIFFLQQRKPRILPS 488
Query: 203 LKDIYPGNLVDDLKGVRAN 221
Y GN ++ R +
Sbjct: 489 ----YLGNWIEGFDSKRPD 503
>gi|239609033|gb|EEQ86020.1| poly(A) polymerase [Ajellomyces dermatitidis ER-3]
gi|327354327|gb|EGE83184.1| Poly(A) polymerase [Ajellomyces dermatitidis ATCC 18188]
Length = 1129
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 66/253 (26%), Positives = 117/253 (46%), Gaps = 25/253 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E + R K ++ L +++ V FGS + L S D+DI
Sbjct: 148 MRELYHRLLPSEESEQRRSKFVNKLEKLLNKQWPGNNIRVHVFGSSGNKLCSSDSDVDI- 206
Query: 71 IELSNGSCISSAGKKV-KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K++ K +L D L K G R+ V+HARVPI+K ++C
Sbjct: 207 -------CITTTYKELEKVCILADFL----AKSGMERVVCVSHARVPIVKIWDPELRLAC 255
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 256 DMNVNNTLALENTRMIRTYVEIDERVRPLAMIVKYWTKRRILNDAALGGTLSSYTWICLI 315
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS 248
+ QT ILP L++ N +D G ++ + + ++ F K N+S
Sbjct: 316 INFLQTRTIPILPSLQERCAKN-TNDTGGSGSSFDDDLEKLAGFG----------KENKS 364
Query: 249 SLAHLFVSFLEKF 261
+L L F +
Sbjct: 365 TLGELLFQFFRYY 377
>gi|296194251|ref|XP_002744874.1| PREDICTED: poly(A) RNA polymerase GLD2 [Callithrix jacchus]
Length = 480
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 110/226 (48%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP +
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ES 355
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + N+ + S N S+L L + FL+ ++ +S++ +
Sbjct: 356 FSPAVQLHLVHQAPCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDGIEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|242016216|ref|XP_002428725.1| polyA polymerase CID, putative [Pediculus humanus corporis]
gi|212513410|gb|EEB15987.1| polyA polymerase CID, putative [Pediculus humanus corporis]
Length = 596
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 136/300 (45%), Gaps = 41/300 (13%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G+T+ FGS S D+D+ + L + + + V S LG + + L+
Sbjct: 317 LVGSTMSGFGSNDS-------DVDMCL-LVRHTEMDQKNEAV--SHLGQISKYLKNCDFV 366
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
+++ + A+VPILKF ++ D++ +N G + L+ SQ+D R R +VL+VK
Sbjct: 367 DQVELI-QAKVPILKFRSL--GFEVDLNCNNAVGIRNTHLLYCYSQLDWRVRPLVLIVKL 423
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPG--NLVDDLKGVRAN 221
WA +IN+ K T +SYSL+L+V+ + Q V P +LP L D+Y N + D+ + +
Sbjct: 424 WAAKQNINDAKNMTISSYSLALMVIHYLQCGVSPPVLPCLHDVYKEKFNPLSDINQIDLH 483
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
E + Y N SL L V FL ++ S I G
Sbjct: 484 EELK---------------PYNSQNEQSLGELLVGFLLYYANFDY--SVYAISVRLGSKV 526
Query: 282 HIRSNTRWLP-NNHP-----LFIEDPFEQPENSARAVSEK-NLAKISNAFEMTHFRLTST 334
+I R N P L IE+PF+ N+ARAV +I F +H L T
Sbjct: 527 NIEECRRAKSLKNEPHQWKYLCIEEPFDLT-NTARAVYNAIRFQRIKKVFYESHKYLEKT 585
>gi|198470955|ref|XP_001355451.2| GA10992 [Drosophila pseudoobscura pseudoobscura]
gi|198145697|gb|EAL32510.2| GA10992 [Drosophila pseudoobscura pseudoobscura]
Length = 660
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 141/321 (43%), Gaps = 57/321 (17%)
Query: 24 DWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWG-DLDISIELS--NGSCI 79
D RM+ ++ L +V +++ + A PFGS V N F + G DLD+ + G
Sbjct: 196 DLGVRMRFLAAL-QVQQAISGMFPAAQAHPFGSSV-NGFGKMGCDLDLILRFDGETGGRK 253
Query: 80 SSAGK-------KVKQSL-------------LGDLLRALRQKGGYRRLQFVAHARVPILK 119
S+G+ K++L GD+L G ++ + ARVPI+K
Sbjct: 254 QSSGEPPSRLIYHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIK 311
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
+ H N+ D+S+ NL G S+ L+ +ID R R + ++ WA+A + NP G +
Sbjct: 312 YHHEHLNLEVDLSMSNLSGFYMSELLYMFGEIDSRVRPLTFSIRRWAQACGLTNPSPGRW 371
Query: 180 -NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLK----GVRANAERQIAEICAFNI 234
+++SL+ LV++ Q ILP + + D++ G+ R + +
Sbjct: 372 ISNFSLTCLVMYFLQQLRQPILPTIGALVKAAEAKDVRVTEDGINCTFGRDLERV----- 426
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL--PN 292
++ N SSL+ L + F E +S + + + R L P+
Sbjct: 427 ------GFQSRNTSSLSELLLQFFEFYSQFDFHNRAISL-----------NEGRQLSKPD 469
Query: 293 NHPLFIEDPFEQPENSARAVS 313
+ ++I +P EQ N ++ VS
Sbjct: 470 HSAMYIVNPLEQLLNVSKNVS 490
>gi|326921606|ref|XP_003207048.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Meleagris
gallopavo]
Length = 544
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 135/299 (45%), Gaps = 53/299 (17%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKKVKQ------------------ 88
++V+PFGS V N F + G D+D+ ++ + I K+K+
Sbjct: 195 SSVKPFGSSV-NTFGKLGCDVDMFLDFRD---IQKHPTKMKKGPFEMEYQMKRLPSERLA 250
Query: 89 -----SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
S++GD L GY +Q + +AR P++KF CD+S+ N S+
Sbjct: 251 TQKILSIIGDCLDNF--GPGYSSVQKILNARCPLVKFSHQPTGFQCDLSVSNSIAIKCSE 308
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPP 202
L+ +D R R +V ++ WA+ H + N GT+ ++SL+++++F Q P I+P
Sbjct: 309 LLYIYGCLDPRVRALVFSLRCWARVHGLTNSVPGTWITNFSLTMMIMFFLQKRSPPIIPT 368
Query: 203 LKDIYPGNLVDDLKGVRANAERQI--AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEK 260
L D LK + ++ + C+F ++ S K K N +L L F +
Sbjct: 369 L---------DQLKELADEKDKDVIGGYDCSF-VSDLSKIKPTK-NTETLDELLCDFFQY 417
Query: 261 FSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
F + + + + + + + P + PL+I +PFEQ N ++ V++ L K
Sbjct: 418 FGNFDFRKNSINL----RKGKEVNK-----PESSPLYIWNPFEQDLNISKNVNQPQLEK 467
>gi|195393058|ref|XP_002055171.1| GJ19221 [Drosophila virilis]
gi|194149681|gb|EDW65372.1| GJ19221 [Drosophila virilis]
Length = 618
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 150/349 (42%), Gaps = 61/349 (17%)
Query: 24 DWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS- 80
D RM+ ++ L +V ES+ + A +PFGS V N F + G DLD+ + + +
Sbjct: 200 DLGVRMRFLAAL-QVQESISGMFPDALAQPFGSSV-NGFGKMGCDLDLILRFDGKTTANG 257
Query: 81 ------------------SAGKKVKQ---SLLGDLLRALRQKGGYRRLQFVAHARVPILK 119
S G+ Q +GD+L G ++ + ARVPI+K
Sbjct: 258 LDSQREASRLIYHTKENLSNGRSQTQRQMECIGDVLHLFLP--GVCHVRRILQARVPIIK 315
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
+ H ++ D+S+ NL G S+ L+ ++D R R + V+ WA++ + NP G +
Sbjct: 316 YHHEHLDLEVDLSMSNLTGFYMSELLYMFGELDSRVRPLTFSVRRWAQSCGLTNPSPGRW 375
Query: 180 -NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLK----GVRANAERQIAEICAFNI 234
++SL+ LV+F Q ILP + + D++ G+ R + +
Sbjct: 376 ITNFSLTCLVMFFMQQLRQPILPAIGALAKAASATDIRVTEDGINCTFARDMERV----- 430
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL--PN 292
++ N SSL+ L + F E +S + + + R L P+
Sbjct: 431 ------GFQSRNTSSLSELLLQFFEFYSQFDFHNRAISL-----------NEGRALAKPD 473
Query: 293 NHPLFIEDPFEQPENSARAVS----EKNLAKISNAFEMTHFRLTSTNQT 337
+ ++I +P EQ N ++ VS E+ ++ NA M + +T T
Sbjct: 474 HSAMYIVNPLEQLLNVSKNVSLEECERLRIEVRNAAWMLESEVENTTLT 522
>gi|403256381|ref|XP_003920858.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 1 [Saimiri
boliviensis boliviensis]
Length = 480
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 110/226 (48%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP +
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ES 355
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + N+ + S N S+L L + FL+ ++ +S++ +
Sbjct: 356 FSPAVQLHLVHQAPCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDGIEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|213407290|ref|XP_002174416.1| Poly(A) RNA polymerase cid11 [Schizosaccharomyces japonicus yFS275]
gi|212002463|gb|EEB08123.1| Poly(A) RNA polymerase cid11 [Schizosaccharomyces japonicus yFS275]
Length = 459
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/305 (25%), Positives = 127/305 (41%), Gaps = 47/305 (15%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
R+ ++S L V+++ + FGS SNL R D+DI CI + + K
Sbjct: 77 RIALLSKLSRVLQTNFPEEDIELTTFGSTESNLALRRSDVDI--------CIQTHSRTSK 128
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
L R L ++G + + ARVPI+K I+CDI+++N + + +
Sbjct: 129 LQTTCQLARLLHEEG-LVNIVCIPRARVPIVKAWDPSLGIACDINLNNSLAKTNTAMIKA 187
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+ D R R M LL+K WAK N K G +SY+++ ++L + Q P ILP + +
Sbjct: 188 CVEYDARIRPMALLIKHWAKCRKFNGTKGKGVLSSYTITCMLLNYLQLTDPPILPSMVAL 247
Query: 207 YPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSL 266
Q + C + ++N S+ LF+ F E + G
Sbjct: 248 ------------------QQNDYCKPKV---------QLNEKSIGSLFIGFFE-YYGYRF 279
Query: 267 KASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEM 326
+ I G ++ N+ L +E+PF N+ R NLA +N E+
Sbjct: 280 EYESYVISVKQGYLLSKKTKVWDSDQNNILCVEEPF----NNMR-----NLANTANEMEV 330
Query: 327 THFRL 331
RL
Sbjct: 331 MGLRL 335
>gi|409076172|gb|EKM76545.1| hypothetical protein AGABI1DRAFT_78278, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 541
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 163/394 (41%), Gaps = 59/394 (14%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS + R D+D+ C+ + +++ + L +L L ++ ++ + H
Sbjct: 57 FGSTANGFSLRNSDMDLC-------CLIDSPERLNPADLVTILGDLLERETRFHVKPLPH 109
Query: 113 ARVPILKFET-----IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWA 166
AR+PI+K + I+CDI +N ++ L ++ID R R +VL +K W+
Sbjct: 110 ARIPIVKLSLDPSPGLPSGIACDIGFENRLAIENTRLLLTYAKIDPTRVRTLVLFLKIWS 169
Query: 167 KAHDINNPKTGTFNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLVD------------ 213
K IN+P GT +SY LLV+ F P +LP L+ + P ++
Sbjct: 170 KRRKINSPYKGTLSSYGYVLLVIYFLVHVKNPPVLPNLQQMPPLRPINKDDTTLNGYNVW 229
Query: 214 -----DLKGVRANAE--RQIAEICAFNIARFSSDKYRKINR-----SSLAHLFVSFLEKF 261
D+ R +E +AE+ F + F +R +R + +A + L+K
Sbjct: 230 FFDDTDILCQRWQSENTESVAELHVFTLIDF----FRYFSRDFSYNTGVASIRAGLLKK- 284
Query: 262 SGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKIS 321
K + + P +G+++ R R+ IEDPFE N AR V++ L I
Sbjct: 285 ---DAKGWQNDLSPTSGRYDPARERNRFC-------IEDPFETDYNVARCVTKDGLYTIR 334
Query: 322 NAFEMTHFRLTSTNQTRYAL-LSSLARPF----ILQFFGESPVRYANYNNGHRRARPQSH 376
F M R+ S R + L+ L ++ P+ Y N PQ+
Sbjct: 335 GEF-MRASRILSIRPERAIIALAELCEERKEEDLVSAPSHHPISYVNPTPAKLSIPPQTP 393
Query: 377 KSVNSPLQAQHQSHNAKKENRPNRSMSQQSVQQH 410
S+ + + + + P+ + Q V +H
Sbjct: 394 YSIGTQSRRMAPGLSPRMTMSPHPTYVLQVVDEH 427
>gi|74144005|dbj|BAE22125.1| unnamed protein product [Mus musculus]
Length = 892
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 215 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 273
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 274 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 330
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 331 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 390
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 391 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 441
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 442 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 489
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 490 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 520
>gi|380810174|gb|AFE76962.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|383416229|gb|AFH31328.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
gi|384945588|gb|AFI36399.1| poly(A) RNA polymerase GLD2 [Macaca mulatta]
Length = 480
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 111/226 (49%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNVVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP +
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYP-------ES 355
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + + N+ + S N S+L L + FL+ ++ +S++ +
Sbjct: 356 FSSAIQLHLVHQAPCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDGIEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|190407236|gb|EDV10503.1| DNA polymerase sigma [Saccharomyces cerevisiae RM11-1a]
gi|259149371|emb|CAY86175.1| Pap2p [Saccharomyces cerevisiae EC1118]
Length = 584
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R K IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 183 IKDFVAYISPSREEIEIRNKTISTIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 240
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++K ++ VA ARVPI+KF H I D
Sbjct: 241 V-------TSKLGGKESRNNLYSLASHLKKKNLATEVEVVAKARVPIIKFVEPHSGIHID 293
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 294 VSFERTNGIEAAKLIREWLDDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 351
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 352 FSFLHMHPRII 362
>gi|341890319|gb|EGT46254.1| hypothetical protein CAEBREN_10930 [Caenorhabditis brenneri]
Length = 443
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 141/308 (45%), Gaps = 30/308 (9%)
Query: 28 RMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKV 86
R+K+ + L E+ ++V L A + GSF +N+ D+D+++E+ S G+
Sbjct: 78 RLKLKTVLAELRKTVSRLFPDAKIWATGSFPANVDLPTSDIDVTMEIP-----SLDGEPR 132
Query: 87 KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
K S++ A+ +GG +++ + RVP+L + D+++DN + ++ L
Sbjct: 133 KLSVI---RAAMEGQGGPFQVKKIVGGRVPVLALMHKATKVPVDVTMDNGAPKRNTQLLI 189
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKD 205
W Q+D RF + +K WA + N G NS S+ L+V+ + Q V PA+LP L+
Sbjct: 190 WYGQVDRRFVPLCRAIKSWASQTGVENSMKGRLNSCSICLMVIHYLQCGVTPAVLPSLQA 249
Query: 206 IYP---GNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
I+P G + D + + R + E + + N+ SL L++ F F+
Sbjct: 250 IFPELNGEIEIDCE---ESKRRDLGE-------ELRASGWAPTNQESLGALYLGFFRYFA 299
Query: 263 GLSLKASELGI---CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE-NSARAVSEKNL- 317
+ + C ++ + + IEDPF P N AR V++ ++
Sbjct: 300 KFDFINQMISVKNGCSMPKPKKNDEKDDTYALRY--TVIEDPFMNPLFNCARTVNQGDIF 357
Query: 318 AKISNAFE 325
++ + FE
Sbjct: 358 ERLVSEFE 365
>gi|151945519|gb|EDN63760.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
Length = 584
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R K IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 183 IKDFVAYISPSREEIEIRNKTISTIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 240
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++K ++ VA ARVPI+KF H I D
Sbjct: 241 V-------TSKLGGKESRNNLYSLASHLKKKNLATEVEVVAKARVPIIKFVEPHSGIHID 293
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 294 VSFERTNGIEAAKLIREWLDDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 351
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 352 FSFLHMHPRII 362
>gi|348533165|ref|XP_003454076.1| PREDICTED: terminal uridylyltransferase 4 [Oreochromis niloticus]
Length = 1503
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 144/332 (43%), Gaps = 39/332 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E A + FGS + R
Sbjct: 804 ILDGLCKLCYYELSPTHAEQQRREQILASLERFIRK-EYNEKAQLCLFGSSKNGFGFRDS 862
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G R + + A+VPI+KFE
Sbjct: 863 DLDICMTLEGHE---TAEKLNCKEIIEGLAKVLKKHTGLRNILPITTAKVPIVKFEHKQS 919
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + +D R + + +K +AK DI + G+ +SY+
Sbjct: 920 GLEGDISLYNTLAQHNTRMLATYAALDPRVQFLGYTMKVFAKRCDIGDASRGSLSSYAYI 979
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF------NIARFSS 239
L+VL+ Q P ++P L++I+ GN V +R + AF ++ R S
Sbjct: 980 LMVLYFLQQRQPPVIPVLQEIFDGNTV---------PQRLVDGWNAFFFDDLEDLRRHHS 1030
Query: 240 DKYRKINRSSLAHLFVSFLEKFS-GLSLKASELGI------CPFTGQWEHIRSNTRWLPN 292
+ + N S+ L++ L ++ K + I F QW
Sbjct: 1031 ENQQ--NTESVGELWLGLLRFYTEEFDFKEHVISIRQRKRLTTFEKQW-----------T 1077
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 1078 SKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1109
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 82/179 (45%), Gaps = 10/179 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED+E R V+S + +++ L ++ +GS ++ + D++I + +
Sbjct: 289 EDFEIRKTVVSRMEIIIQ--RHLTACSLRLYGSCLTRFAFKTSDINIDV--------TYP 338
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L +L L+ + ++ HA+VPI+ + + C +S N + +
Sbjct: 339 PSMTQPEVLIQVLEILKNSPEFSEVESDFHAKVPIVFCRDVSSGLMCKVSAGNDVACLTT 398
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
L +++++ R +VL + WA+ I+ G SYS +L+V+F Q ILP
Sbjct: 399 NHLAALAKLEPRLISLVLAFRYWARLCHIDCQAEGGIPSYSFALMVIFFLQQRKDPILP 457
>gi|317149559|ref|XP_001823493.2| PAP/25A associated domain family [Aspergillus oryzae RIB40]
Length = 1068
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 132/303 (43%), Gaps = 38/303 (12%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
++ L P E + R +++ L + R V FGS + L S D+DI
Sbjct: 122 EVYDRLLPSAESDDRRRQLVRKLERLFNEQWPGRDIKVHVFGSSGNKLCSSDSDVDI--- 178
Query: 73 LSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
CI++ K+++Q LL ++L + G R+ V+HA+VPI+K ++CD+
Sbjct: 179 -----CITTTYKELEQVCLLAEVL----ARHGMERVVCVSHAKVPIVKIWDPELQLACDM 229
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLF 190
+++N ++ + +ID R R + +++K W K + + GT +SY+ L++
Sbjct: 230 NVNNTLALDNTRMVRTYVEIDERVRPLAMIIKHWTKRRILCDAGLGGTLSSYTWICLIIN 289
Query: 191 HFQTCVPAILPPL------KDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
QT P ILP L K I P LV C+F+ + Y +
Sbjct: 290 FLQTRNPPILPSLQARPHEKKISPEGLV-----------------CSFDDDLGNLTGYGR 332
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
N+ SL LF F K+ G L + + G+ + L N+ L +E+PF
Sbjct: 333 KNKQSLGDLFFQFF-KYYGHELDYEKYVVSVREGKLISKEAKGWHLLQNNRLCVEEPFNT 391
Query: 305 PEN 307
N
Sbjct: 392 SRN 394
>gi|194763565|ref|XP_001963903.1| GF21267 [Drosophila ananassae]
gi|190618828|gb|EDV34352.1| GF21267 [Drosophila ananassae]
Length = 611
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 77/327 (23%), Positives = 145/327 (44%), Gaps = 55/327 (16%)
Query: 24 DWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWGDLDISIELSN------G 76
D R++ ++ L +V +++ + A +PFGS V+ DLD+ + ++ G
Sbjct: 190 DLGIRLRFLAAL-QVQQAISGMFPTAQAQPFGSSVNGFGKMGCDLDLILRFNDDTGSQKG 248
Query: 77 SCISSAGKKV---KQSL-------------LGDLLRALRQKGGYRRLQFVAHARVPILKF 120
+S + V K++L GD+L G ++ + ARVPI+K+
Sbjct: 249 LAVSEPSRLVFHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIKY 306
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF- 179
H ++ D+S+ NL G S+ L+ ++D R R + ++ WA++ + NP G +
Sbjct: 307 HHEHLDLEVDLSMSNLTGFYMSELLYMFGEVDPRVRPLTFSIRRWAQSCGLTNPSPGRWI 366
Query: 180 NSYSLSLLVLFHFQTCVPAILPPL----KDIYPGNLVDDLKGVRANAERQIAEICAFNIA 235
+++SL+ LV+F Q ILP + K PG+ G+ R + +
Sbjct: 367 SNFSLTCLVMFFLQQLRQPILPTIGALAKAAEPGDSRVTEDGINCTFARDMDRL------ 420
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL--PNN 293
+R N+SSL+ L + F E +S + + + R L P++
Sbjct: 421 -----GFRSRNQSSLSELLLQFFEFYSQFDFHNRAISL-----------NEGRALSKPDH 464
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKI 320
++I +P EQ N ++ VS + ++
Sbjct: 465 SAMYIVNPLEQLLNVSKNVSMEECERL 491
>gi|444724868|gb|ELW65455.1| Terminal uridylyltransferase 4 [Tupaia chinensis]
Length = 1618
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 928 ILDLVCKRCFDELSPPYSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 986
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 987 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1043
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1044 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1103
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1104 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1154
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1155 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1202
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1203 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1233
>gi|56758428|gb|AAW27354.1| SJCHGC06948 protein [Schistosoma japonicum]
Length = 273
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 127/257 (49%), Gaps = 26/257 (10%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
+L LL +L++ R + + A+ PI+KF H + CDI+++N+ G + L +
Sbjct: 21 ILSQLLPSLKKCRFLRDFRLI-RAKTPIIKFHDTHSTVDCDINVNNVIGIYNTHLLAMYA 79
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYP 208
++D R R + + +K WA+ DI++ + G ++YSL L+++ + Q C P +LP L++ +P
Sbjct: 80 KVDWRVRPLGIFIKHWAQCLDIHDAQRGRLSTYSLLLMLIHYLQVGCSPPVLPNLQEKFP 139
Query: 209 GNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS------ 262
+ + + Q+ ++ + N ++L+ LF+ F++ ++
Sbjct: 140 KLFNHSIPPYKLDMCLQLPW-----------NELQSNNSANLSELFIGFIDYYANRFDFN 188
Query: 263 --GLSLKASELGICPF-TGQWEH--IRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ + T +H + N + +FIE+PF + N+AR++ N+
Sbjct: 189 KWAISIRHTSSSSSLLKTVAMKHSSMNENAVMPIRDCKIFIEEPFSR-TNTARSIHSDNI 247
Query: 318 -AKISNAFEMTHFRLTS 333
+ I AF T+ L S
Sbjct: 248 VSSIKEAFNKTNTVLCS 264
>gi|28839666|gb|AAH47581.1| PAPD4 protein [Homo sapiens]
gi|119616243|gb|EAW95837.1| PAP associated domain containing 4, isoform CRA_a [Homo sapiens]
Length = 480
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 111/226 (49%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP + +
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-- 360
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + N+ + S N S+L L + FL+ ++ +S++ +
Sbjct: 361 -----QLHLVHQAPCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDGIEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|348535678|ref|XP_003455326.1| PREDICTED: terminal uridylyltransferase 7-like [Oreochromis
niloticus]
Length = 1317
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 144/340 (42%), Gaps = 48/340 (14%)
Query: 7 LEPILKDILGMLNPLRE-----------DWETRMKVISDLREVVESVESLRGATVEPFGS 55
L P+ + L +LN + E + R ++ DL V GA ++ FGS
Sbjct: 813 LPPVTPEFLSVLNKVCEQCYSDFAPDELEVGVREYILQDLEVFVR--RQFPGAQLQLFGS 870
Query: 56 FVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARV 115
+ R DLDI + L I +++ L R+L++ G R + + A+V
Sbjct: 871 SKNGFGFRQSDLDICMVLEGQETIDDINCI---NVIESLARSLKKHPGLRNILPITTAKV 927
Query: 116 PILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK 175
PI+KF + + DIS+ N + L + ID R + + ++K +AK DI +
Sbjct: 928 PIVKFYHVRTGLEGDISLYNTLALHNTHLLATYAAIDRRVKILCYVMKVFAKMCDIGDAS 987
Query: 176 TGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIA 235
G+ +SY+ +L+VLF Q P ++P L++IY G + L + +N+
Sbjct: 988 RGSLSSYAYTLMVLFFLQQRNPPVIPVLQEIYDGKKPEVL-------------VDGWNVY 1034
Query: 236 RFSSDK--------YRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
F K Y K N ++ L++ L +F E +C +H R T
Sbjct: 1035 FFGDLKALPSHWPHYGK-NTETVGELWLGLL-RFYTEDFDFREHVVC----IRQHARLTT 1088
Query: 288 ---RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+W + + IEDPF+ N +S K I AF
Sbjct: 1089 FNKQW--TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1126
Score = 45.1 bits (105), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 40/178 (22%), Positives = 78/178 (43%), Gaps = 10/178 (5%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D E R V+S +++++ SV L + +GS + + D++I I+
Sbjct: 265 DVEKRQCVVSTMQDLLLSV--LPEVRLRLYGSSCTKFGFKDSDVNIDIQYPTHMHQPDVL 322
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
VK+ L L + ++ HARVP++ + + + C +S N +
Sbjct: 323 MLVKECLSVSSL--------FVEMEADFHARVPVVICKERNSGLICKVSAGNENAFQTTT 374
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+L ++ + +VL + WA+ +I+ + G Y +LLV+F+ Q +LP
Sbjct: 375 YLSALATQEPLLMPLVLGFRRWARICEIDRAEEGGLPPYLFALLVIFYLQKRKEPLLP 432
>gi|195162231|ref|XP_002021959.1| GL14387 [Drosophila persimilis]
gi|194103857|gb|EDW25900.1| GL14387 [Drosophila persimilis]
Length = 628
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 141/321 (43%), Gaps = 57/321 (17%)
Query: 24 DWETRMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWG-DLDISIELS--NGSCI 79
D RM+ ++ L +V +++ + A PFGS V N F + G DLD+ + G
Sbjct: 194 DLGVRMRFLAAL-QVQQAISGMFPAAQAHPFGSSV-NGFGKMGCDLDLILRFDGETGGRK 251
Query: 80 SSAGK-------KVKQSL-------------LGDLLRALRQKGGYRRLQFVAHARVPILK 119
S+G+ K++L GD+L G ++ + ARVPI+K
Sbjct: 252 QSSGEPPSRLIYHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIK 309
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
+ H N+ D+S+ NL G S+ L+ +ID R R + ++ WA+A + NP G +
Sbjct: 310 YHHEHLNLEVDLSMSNLSGFYMSELLYMFGEIDPRVRPLTFSIRRWAQACGLTNPSPGRW 369
Query: 180 -NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLK----GVRANAERQIAEICAFNI 234
+++SL+ LV++ Q ILP + + D++ G+ R + +
Sbjct: 370 ISNFSLTCLVMYFLQQLRQPILPTIGALVKAAEAKDVRVTEDGINCTFGRDLERV----- 424
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL--PN 292
++ N SSL+ L + F E +S + + + R L P+
Sbjct: 425 ------GFQSRNTSSLSELLLQFFEFYSQFDFHNRAISL-----------NEGRQLSKPD 467
Query: 293 NHPLFIEDPFEQPENSARAVS 313
+ ++I +P EQ N ++ VS
Sbjct: 468 HSAMYIVNPLEQLLNVSKNVS 488
>gi|390465952|ref|XP_002750876.2| PREDICTED: terminal uridylyltransferase 4 [Callithrix jacchus]
Length = 1640
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 940 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 998
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 999 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1055
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1056 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1115
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1116 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1166
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1167 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1214
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1215 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1245
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 361 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 410
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 411 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 470
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 471 DLLTALGKMEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 530
Query: 203 L 203
L
Sbjct: 531 L 531
>gi|60360306|dbj|BAD90397.1| mKIAA0191 protein [Mus musculus]
Length = 1556
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 879 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 937
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 938 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 994
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 995 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1054
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1055 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1105
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1106 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1153
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1154 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1184
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 303 DDLRIRQDIVEEMSKVIMTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 352
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 353 PKMNHPDLLIQVLGILKKSALYIDVESDFHAKVPVVVCKDRKSALLCRVSAGNDMACLTT 412
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 413 DLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 472
Query: 203 L 203
L
Sbjct: 473 L 473
>gi|332240560|ref|XP_003269454.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase,
mitochondrial [Nomascus leucogenys]
Length = 583
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 129/295 (43%), Gaps = 46/295 (15%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQSLL 91
V PFGS V N F + G DLD+ ++L + + + + Q +L
Sbjct: 227 VRPFGSSV-NTFGKLGCDLDMFLDLDETRNLGTHKTSGNFLMEFQVKNVPSERIATQKIL 285
Query: 92 GDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 SVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGA 345
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 LDSRVRALVFSIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL------ 399
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYR----KINRSSLAHLFVSFLEKFSGLS 265
D LK + +++ + E N F SD R + + + L F E F +
Sbjct: 400 ---DSLKTLAGSSDSCVIE---GNNCTFCSDLNRIKPSQDTETXVKLLLKEFFEYFGNFA 453
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I Q + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 454 FNKNSINIRQGKEQNK---------PDSSPLYIQNPFETSLNISKNVSQSQLQKF 499
>gi|302828142|ref|XP_002945638.1| hypothetical protein VOLCADRAFT_85811 [Volvox carteri f.
nagariensis]
gi|300268453|gb|EFJ52633.1| hypothetical protein VOLCADRAFT_85811 [Volvox carteri f.
nagariensis]
Length = 691
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 150/356 (42%), Gaps = 80/356 (22%)
Query: 26 ETRMKVISDLREVVESVESLRGATVE---------------------PFGSFVSNLFSRW 64
E +K+++ RE E ++R AT+E P+GSF+S+ +SR
Sbjct: 57 EQSLKMLAKSREPGEHDGAVRQATIERLQALLRDPQIFPAPSRLQLVPYGSFLSSCYSRS 116
Query: 65 GDLDISI--ELS------NGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH---- 112
DLD+++ E++ G + ++Q + + L + Q + H
Sbjct: 117 SDLDLALTGEVAPVVVGRRGGIPAGTAVPLEQLSREECVMLLVRLAYTLEAQQLTHGSVD 176
Query: 113 -----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
ARVPI+KFE + I CD+ + KS + + ++ ++ LVK WAK
Sbjct: 177 RNPLGARVPIIKFEAVGSGIECDVCVTTRGCDFKSSIMRSLYKLQPSLAPLIQLVKLWAK 236
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPA--ILPPLKDIY----------PGNLVDDL 215
HDIN+ T NS+SL+L+V+F Q+ P +LPPL I+ G + D
Sbjct: 237 HHDINSAHCSTLNSWSLALMVVFSLQS-YPGGHLLPPLWRIFHDEEPTGPAGKGRPLQD- 294
Query: 216 KGVRANAERQIAEI-CAFNIARFSSDKYRKINRS--SLAHLFVSFLEKFSGLSLKASELG 272
K ++ N +AEI C + ++ + + L + FL FS +
Sbjct: 295 KSLQLNDMLVVAEIRCTEEADHLLAPRWSQSSSEPPGLLDQLLWFLSCFSTI-------- 346
Query: 273 ICPFTGQWEHIRSNTRW-------------LPNNHPLFIEDPFEQPENSARAVSEK 315
+C QW + W P + +E+PF+ +N+AR++ +
Sbjct: 347 MC----QWRDNSAQRNWRVSTWLGRGYTARFPKAYVAAVEEPFDCNDNTARSLGTR 398
>gi|268570020|ref|XP_002640673.1| Hypothetical protein CBG19735 [Caenorhabditis briggsae]
Length = 802
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 113/242 (46%), Gaps = 35/242 (14%)
Query: 90 LLGDLLRALRQ-KGGYRRLQFVAH-----ARVPI--LKFETIHQNISCDISIDNLCGQIK 141
+L L +A+R+ K G+ Q + + A+VPI LK + + ++ DI+++N+ G
Sbjct: 550 VLRKLDKAIRRSKPGHPLRQHIRYCEMVPAKVPIIKLKMQGAYPDMEVDINVNNIAGIYN 609
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAI 199
S + S ID RF + L +K WA +NN + G NSY++ LLV+ HF C PA+
Sbjct: 610 SHLTHYYSLIDARFPVLALAIKHWASRQGVNNAQAGYLNSYTIILLVV-HFLQCGVSPAV 668
Query: 200 LPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLE 259
LP L+ ++P L IAE R + N S+ LFV F
Sbjct: 669 LPNLQYLFPEKFDKKLPISALQLYGDIAE-------RLPTS---APNTWSIGELFVGFFH 718
Query: 260 -----KFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
FS ++ + P + H+ N+P+F+E+PF+ N+AR+V
Sbjct: 719 YYAHFDFSTQAISVRSAQVVPRSSLPHHMA--------NYPIFVEEPFDA-INTARSVRT 769
Query: 315 KN 316
N
Sbjct: 770 PN 771
>gi|332230575|ref|XP_003264469.1| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Nomascus
leucogenys]
Length = 1635
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 82/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQDIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L 203
L
Sbjct: 541 L 541
>gi|441624570|ref|XP_004089001.1| PREDICTED: terminal uridylyltransferase 4 [Nomascus leucogenys]
Length = 1636
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 82/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQDIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L 203
L
Sbjct: 541 L 541
>gi|403258058|ref|XP_003921600.1| PREDICTED: terminal uridylyltransferase 4 [Saimiri boliviensis
boliviensis]
Length = 1643
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKMEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L 203
L
Sbjct: 541 L 541
>gi|238495318|ref|XP_002378895.1| zinc finger protein, cchc domain containing protein, putative
[Aspergillus flavus NRRL3357]
gi|220695545|gb|EED51888.1| zinc finger protein, cchc domain containing protein, putative
[Aspergillus flavus NRRL3357]
Length = 1096
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 132/303 (43%), Gaps = 38/303 (12%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
++ L P E + R +++ L + R V FGS + L S D+DI
Sbjct: 150 EVYDRLLPSAESDDRRRQLVRKLERLFNEQWPGRDIKVHVFGSSGNKLCSSDSDVDI--- 206
Query: 73 LSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
CI++ K+++Q LL ++L + G R+ V+HA+VPI+K ++CD+
Sbjct: 207 -----CITTTYKELEQVCLLAEVL----ARHGMERVVCVSHAKVPIVKIWDPELQLACDM 257
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLF 190
+++N ++ + +ID R R + +++K W K + + GT +SY+ L++
Sbjct: 258 NVNNTLALDNTRMVRTYVEIDERVRPLAMIIKHWTKRRILCDAGLGGTLSSYTWICLIIN 317
Query: 191 HFQTCVPAILPPL------KDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
QT P ILP L K I P LV C+F+ + Y +
Sbjct: 318 FLQTRNPPILPSLQARPHEKKISPEGLV-----------------CSFDDDLGNLTGYGR 360
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
N+ SL LF F K+ G L + + G+ + L N+ L +E+PF
Sbjct: 361 KNKQSLGDLFFQFF-KYYGHELDYEKYVVSVREGKLISKEAKGWHLLQNNRLCVEEPFNT 419
Query: 305 PEN 307
N
Sbjct: 420 SRN 422
>gi|403294992|ref|XP_003938441.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Saimiri
boliviensis boliviensis]
Length = 595
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 134/296 (45%), Gaps = 51/296 (17%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELSNGSCISS---------------------AGKKVK 87
V PFGS V N F + G DLD+ ++L+ +S+ A +K+
Sbjct: 240 VRPFGSSV-NTFGKLGCDLDMFLDLNETRNLSTHKTSGNFLMEFQVKNVPSERIATQKI- 297
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
S+LG+ L G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 298 LSVLGECLDNF--SPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYI 355
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDI 206
+D R R +V V+ WA+AH + + G + ++SL+++V+F QT P +LP L
Sbjct: 356 YGALDSRVRALVFTVRCWARAHLLTSSIPGAWITNFSLTMMVIFFLQTRSPPVLPTL--- 412
Query: 207 YPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSG 263
D L+ + A+AE + I N F D R N +L L F E F
Sbjct: 413 ------DSLQTL-ADAEDKC--IIEGNNCTFVRDLNRIKPSENTETLEILLKEFFEYFGN 463
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ V++ L K
Sbjct: 464 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVNQSQLQK 510
>gi|256271045|gb|EEU06149.1| Pap2p [Saccharomyces cerevisiae JAY291]
Length = 584
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R K IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 183 IKDFVAYISPSREEIEIRNKTISTIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 240
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++K ++ VA ARVPI+KF H I D
Sbjct: 241 V-------TSKLGGKESRNNLYSLASHLKKKKLATEVEVVAKARVPIIKFVEPHSGIHID 293
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 294 VSFERTNGIEAAKLIREWLDDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 351
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 352 FSFLHMHPRII 362
>gi|1228035|dbj|BAA12105.1| KIAA0191 [Homo sapiens]
Length = 1516
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 822 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 880
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 881 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 937
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 938 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 997
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 998 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1048
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1049 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1096
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1097 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1127
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 243 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 292
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 293 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 352
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 353 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 412
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 413 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 472
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 473 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 530
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 531 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 560
>gi|343425896|emb|CBQ69429.1| related to caffeine-induced death protein 1 Cid1 [Sporisorium
reilianum SRZ2]
Length = 1181
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 87/335 (25%), Positives = 140/335 (41%), Gaps = 47/335 (14%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L PIL P E++ + L + V GA + FGS + R D
Sbjct: 346 LSPIL--------PTEEEYRIKEATRRQLERLANRVSP--GAKLLAFGSMANGFALRNSD 395
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE----- 121
+D+ + G + L+ L + +R++ + + + AR+PI+K
Sbjct: 396 MDLCCLIGKGPDGQPTTQHTASELVEILGQLIREETDFTVMP-LPKARIPIIKINRSPTA 454
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDG-RFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ I+CDI +N ++ L + +D R R +VL +K WAK +N+P GT +
Sbjct: 455 DLPYEIACDIGFENRLALENTRLLLSYAMVDPPRLRTLVLFLKVWAKRRKLNSPYMGTLS 514
Query: 181 SYSLSLLVLFHFQTCV--PAILPPLKDIYPG-NLVDDLKGVRANAERQIAEICAFNIARF 237
SY +L+VLF F V PA+LP L+ + P + D + N ++ A A
Sbjct: 515 SYGYTLMVLF-FLAYVKKPAVLPNLQRVPPTRTMKPDEMELNGNNIYFYDDVAALRKA-- 571
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRW 289
+ N ++ L + F FS +SLK SE G+ + + W
Sbjct: 572 ----WTSHNTDNVGELLIDFFRYFSKEFSYARDVISLK-SETGLL--------SKDSKSW 618
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
N L IEDPF+ N +R V++ L I F
Sbjct: 619 ---NAELCIEDPFQMGYNVSRTVTKDGLYTIRGEF 650
>gi|397503431|ref|XP_003822327.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 1 [Pan paniscus]
gi|410352391|gb|JAA42799.1| PAP associated domain containing 4 [Pan troglodytes]
Length = 480
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 111/226 (49%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP + +
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-- 360
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + N+ + S N S+L L + FL+ ++ +S++ +
Sbjct: 361 -----QLHLVHQAPSNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDGIEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|395730499|ref|XP_002810865.2| PREDICTED: terminal uridylyltransferase 4 [Pongo abelii]
Length = 1644
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|380811048|gb|AFE77399.1| terminal uridylyltransferase 4 isoform a [Macaca mulatta]
gi|383416971|gb|AFH31699.1| terminal uridylyltransferase 4 isoform a [Macaca mulatta]
Length = 1644
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 949 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1007
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1008 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1064
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1065 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1124
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1125 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1175
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1176 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1223
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1224 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1254
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 132/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 370 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 419
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 420 PKINHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 479
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 480 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 539
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 540 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 599
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC
Sbjct: 600 TETDNQSNAMKEKHGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIRDI- 657
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 658 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 687
>gi|397488036|ref|XP_003815081.1| PREDICTED: terminal uridylyltransferase 4 [Pan paniscus]
Length = 1644
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQQIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|410032969|ref|XP_003949471.1| PREDICTED: terminal uridylyltransferase 4 [Pan troglodytes]
gi|410224346|gb|JAA09392.1| zinc finger, CCHC domain containing 11 [Pan troglodytes]
gi|410251850|gb|JAA13892.1| zinc finger, CCHC domain containing 11 [Pan troglodytes]
gi|410353207|gb|JAA43207.1| zinc finger, CCHC domain containing 11 [Pan troglodytes]
Length = 1645
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQQIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|354468198|ref|XP_003496554.1| PREDICTED: terminal uridylyltransferase 4 [Cricetulus griseus]
Length = 1648
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 970 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1028
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1029 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1085
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1086 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1145
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1146 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1196
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1197 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1244
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1245 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1275
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 394 DDLRVRQNIVEEMSKVIMTY--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 443
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 444 PKMNHPDLLIQVLGILKKSTLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 503
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 504 DLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 563
Query: 203 L 203
L
Sbjct: 564 L 564
>gi|332808996|ref|XP_001146430.2| PREDICTED: terminal uridylyltransferase 4 isoform 8 [Pan troglodytes]
gi|410297382|gb|JAA27291.1| zinc finger, CCHC domain containing 11 [Pan troglodytes]
Length = 1644
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQQIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|348565805|ref|XP_003468693.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Cavia
porcellus]
Length = 571
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 146/335 (43%), Gaps = 48/335 (14%)
Query: 12 KDILGMLNPLREDWE-----TRMKVISDLREVVESVES--LRGATVEPFGSFVSNLFSRW 64
K I LN L ++++ TR++ ++ ++E + + V+PFGS V N F +
Sbjct: 187 KSIDDQLNTLLKEFQLTEENTRLRYLTS--SLIEDIAAAYFPDCRVKPFGSSV-NTFGKL 243
Query: 65 G-DLDISIELSNGSCIS-----------------SAGKKVKQSLLGDLLRALRQKG-GYR 105
G DLD+ ++L + ++ + Q +L L L G G
Sbjct: 244 GCDLDMFLDLDETKKLDIQKNKGNFLIEFQVKNVASERMATQKILSVLGECLDHFGPGCV 303
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+Q + HAR P+++F CD++ +N S+ L+ +D R R +V V+ W
Sbjct: 304 SVQKILHARCPLVRFSHQASGFQCDLTTNNRIAMKSSELLYIYGTLDARVRALVCSVRYW 363
Query: 166 AKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAER 224
A+AH + + G + ++SL+++V+F Q P ILP L + +D + N
Sbjct: 364 ARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTLDTLMTLADEEDECVIEGNN-- 421
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR 284
C F I + K + N SL L F E F + + + I R
Sbjct: 422 -----CTF-IRDLNKIKPSE-NTESLEVLLKEFFEYFGNFAFSKNSINI-------RQGR 467
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ P++ PL I++PFE N ++ V++ L +
Sbjct: 468 EQNK--PDSSPLHIQNPFETSLNISKNVNQSQLQR 500
>gi|291398882|ref|XP_002715137.1| PREDICTED: zinc finger, CCHC domain containing 11 isoform 1
[Oryctolagus cuniculus]
Length = 1652
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 962 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1020
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1021 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1077
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1078 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1137
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1138 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1188
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1189 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1236
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1237 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1267
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 137/332 (41%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++++++ +V+ +V L ++ +GS ++ + D++I I+
Sbjct: 387 DDLRVRKEIVAEMSKVITTV--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 436
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 437 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 496
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F+ Q P +LP
Sbjct: 497 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFYLQQRKPPLLPC 556
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + +P + D LKG+ A + IAE
Sbjct: 557 LLGNWIEGFHPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 616
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC
Sbjct: 617 TETDNQSNAMKEKHGKSPLTLETTNQVSLGQLWLELL-KFYTLDFALEEYVICVRIHDI- 674
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 675 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 704
>gi|291398884|ref|XP_002715138.1| PREDICTED: zinc finger, CCHC domain containing 11 isoform 2
[Oryctolagus cuniculus]
Length = 1631
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 942 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1000
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1001 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1057
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1058 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1117
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1118 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1168
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1169 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1216
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1217 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1247
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 137/332 (41%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++++++ +V+ +V L ++ +GS ++ + D++I I+
Sbjct: 367 DDLRVRKEIVAEMSKVITTV--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 416
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 417 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 476
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F+ Q P +LP
Sbjct: 477 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFYLQQRKPPLLPC 536
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + +P + D LKG+ A + IAE
Sbjct: 537 LLGNWIEGFHPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 596
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC
Sbjct: 597 TETDNQSNAMKEKHGKSPLTLETTNQVSLGQLWLELL-KFYTLDFALEEYVICVRIHDI- 654
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 655 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 684
>gi|402871959|ref|XP_003899912.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 1 [Papio anubis]
Length = 480
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 111/226 (49%), Gaps = 35/226 (15%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNVVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP + +
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-- 360
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKAS 269
+ + N+ + S N S+L L + FL+ ++ +S++ +
Sbjct: 361 -----QLHLVHQAPCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREA 410
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+ P +W N + +E+PF+ N+ARAV EK
Sbjct: 411 KAIPRPDGIEWR-----------NKYICVEEPFDG-TNTARAVHEK 444
>gi|297278716|ref|XP_001111993.2| PREDICTED: terminal uridylyltransferase 4-like isoform 5 [Macaca
mulatta]
Length = 1639
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 949 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1007
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1008 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1064
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1065 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1124
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1125 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1175
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1176 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1223
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1224 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1254
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 132/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 370 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 419
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 420 PKINHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 479
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 480 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 539
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 540 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 599
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC
Sbjct: 600 TETDNQSNAMKEKHGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIRDI- 657
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 658 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 687
>gi|187956551|gb|AAI50792.1| Zinc finger, CCHC domain containing 11 [Mus musculus]
Length = 1644
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 967 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1025
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1026 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1082
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1083 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1142
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1143 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1193
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1194 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1241
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1242 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1272
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 391 DDLRIRQDIVEEMSKVIMTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 440
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 441 PKMNHPDLLIQVLGILKKSALYIDVESDFHAKVPVVVCKDRKSALLCRVSAGNDMACLTT 500
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 501 DLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 560
Query: 203 L 203
L
Sbjct: 561 L 561
>gi|83977461|ref|NP_780681.2| terminal uridylyltransferase 4 [Mus musculus]
gi|259554115|sp|B2RX14.2|TUT4_MOUSE RecName: Full=Terminal uridylyltransferase 4; Short=TUTase 4;
AltName: Full=Zinc finger CCHC domain-containing protein
11
Length = 1644
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 967 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1025
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1026 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1082
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1083 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1142
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1143 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1193
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1194 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1241
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1242 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1272
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 391 DDLRIRQDIVEEMSKVIMTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 440
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 441 PKMNHPDLLIQVLGILKKSALYIDVESDFHAKVPVVVCKDRKSALLCRVSAGNDMACLTT 500
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 501 DLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 560
Query: 203 L 203
L
Sbjct: 561 L 561
>gi|57863248|ref|NP_001009881.1| terminal uridylyltransferase 4 isoform a [Homo sapiens]
gi|124297125|gb|AAI31735.1| Zinc finger, CCHC domain containing 11 [Homo sapiens]
Length = 1645
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|410353209|gb|JAA43208.1| zinc finger, CCHC domain containing 11 [Pan troglodytes]
Length = 1640
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQQIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|50294195|ref|XP_449509.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528823|emb|CAG62485.1| unnamed protein product [Candida glabrata]
Length = 626
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 103/199 (51%), Gaps = 14/199 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D + ++P RE+ ETR K I+ +R V+ + + A ++ FGS+ ++++ D+D
Sbjct: 191 IRDFVAYISPSREEIETRNKTIAKIRRSVKRLWT--DADLQVFGSYATDMYLPGSDIDCV 248
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S +G K + L +L R L+ G R++ +A +RVPI+KF +I D
Sbjct: 249 VN-------SKSGDKENRQYLYELARHLKNDGLATRVEVIAKSRVPIIKFVEPESDIHID 301
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + WI G R++ L+VK++ A +N+ TG +S+ LV
Sbjct: 302 VSFERSNGLEAAKLIREWIGDTPG-LRELTLVVKQFLHARRLNDVHTGGLGGFSIICLV- 359
Query: 190 FHFQTCVPAILPPLKDIYP 208
F F P I+ DI P
Sbjct: 360 FSFLRLHPRIITG--DIDP 376
>gi|17554128|ref|NP_498099.1| Protein CID-1 [Caenorhabditis elegans]
gi|351064473|emb|CCD72858.1| Protein CID-1 [Caenorhabditis elegans]
Length = 1425
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/336 (22%), Positives = 143/336 (42%), Gaps = 49/336 (14%)
Query: 11 LKDILGMLNPLRED---WETRMKVISDLREVVESVESL------RGATVEPFGSFVSNLF 61
LKDI M++ + E R+K+ L ++ ++S T+ FGS ++ L
Sbjct: 1006 LKDIDDMIDKYYHENILDERRLKM---LDHKIDELQSFLRKNYREDVTLTTFGSVMTGLS 1062
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
D+DI + +G + ++ LR+ +R+Q + A+VPI+KF+
Sbjct: 1063 VNCSDIDICLRFGDGDV--PPKDLTAKEVIQKTESVLRKCHLVKRVQAIVTAKVPIVKFQ 1120
Query: 122 TIHQN---ISCDISIDNLCGQIKSKFL--FWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
N I DIS N+ + L + + D RF + L VK WAK +I +
Sbjct: 1121 VKLSNGAIIDVDISYYNILAIYNTALLKEYSLWTPDKRFAKLALFVKTWAKNCEIGDASR 1180
Query: 177 GTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
G+ +SY ++++ + Q C P +LP L++ + + N ER++ + + A+
Sbjct: 1181 GSLSSYCHVIMLISYLQNCDPPVLPRLQEDFRSD----------NRERRLVDNWDTSFAQ 1230
Query: 237 FSSDKYRK--INRSSLAHLFVSFLEKFSGLSLK------ASELGICPFTGQWEHIRSNTR 288
+ ++ N+ S A L + + + +S + E+ + +W
Sbjct: 1231 VETSLLQRWPKNKESCAQLLIGYFDYYSRFDFRNFVVQCRREMILSKMEKEWP------- 1283
Query: 289 WLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
PL +EDPF+ N + V++K I F
Sbjct: 1284 -----RPLCVEDPFDLSHNLSSGVNKKMFVFIMKVF 1314
>gi|149039756|gb|EDL93872.1| rCG24089, isoform CRA_a [Rattus norvegicus]
Length = 1539
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 144/338 (42%), Gaps = 50/338 (14%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 1003 NILDQVCIQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1060
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1061 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1112
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ LF S ID R + + +K + K DI + G+
Sbjct: 1113 FSHLRSGLEVDISLYNTLALHNTRLLFAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1172
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGN-----LVDDLKGVRANAERQIAEICAFNI 234
+SY+ +L+VL+ Q P ++P L++IY G LVD G QI E+
Sbjct: 1173 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKGEKKPEILVD---GWNIYFFDQINELPT--- 1226
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSN 286
+Y K N S+ L++ L ++ +S++ L + F QW
Sbjct: 1227 ---CWPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL-LTTFKKQW------ 1275
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N +S K I AF
Sbjct: 1276 -----TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1308
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 87/193 (45%), Gaps = 14/193 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
E+ + R+++ + V + L ++ +GS S L R D D++I++ + +S
Sbjct: 311 ENLDQRLEIKCAMENVFQ--HKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS-- 364
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + L+ + + HARVP++ + C +S N + +
Sbjct: 365 ----QPDVLLLVQECLKNSDCFIDVDADFHARVPVVVCRHKQSGLLCKVSAGNENAWLTT 420
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
K L + +++ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP
Sbjct: 421 KHLTALGKLEPRLLPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP- 479
Query: 203 LKDIYPGNLVDDL 215
+Y G+ +++
Sbjct: 480 ---VYLGSWIEEF 489
>gi|351697778|gb|EHB00697.1| Terminal uridylyltransferase 4 [Heterocephalus glaber]
Length = 1668
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 973 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1031
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1032 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1088
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1089 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1148
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1149 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1199
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1200 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1247
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1248 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1278
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 74/332 (22%), Positives = 135/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D + R++++ ++ +V+ + L ++ +GS ++ D++I I+ +
Sbjct: 396 DDLKARLEIVEEMSKVITAF--LPECSLRLYGSSLTKFALTSSDVNIDIKFPS------- 446
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
LL +L ++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 447 -TMNHPDLLIQVLGIFKKNVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDSACLTT 505
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L +S+++ F +VL + WAK I++ G SY +L+ +F Q P ILP
Sbjct: 506 DLLAALSKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMAMFFLQQRKPPILPC 565
Query: 203 L-----KDIYPGNLVD-DLKGV----------RANAERQIAEICAFNIARFS--SDKYRK 244
L + P + D LKG+ ++++ + I N A+ D +K
Sbjct: 566 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYKSSSATEKNSIAEENKAKADQPKDDTKK 625
Query: 245 I-----------------------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N+ SL L++ L KF L E IC Q
Sbjct: 626 TETDNQSNAMKEKHGKSPLTLETPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 683
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 684 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 713
>gi|353238304|emb|CCA70254.1| related to caffeine-induced death protein 1 Cid1 [Piriformospora
indica DSM 11827]
Length = 714
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 152/359 (42%), Gaps = 69/359 (19%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRW 64
L D + L P +E+ + V D+R+++E + +R T+EP FGS + +
Sbjct: 36 LLDFVVQLLPTKEE----VSVKEDVRKLLERL--IR--TIEPSSQLLSFGSTANGFELKN 87
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRA--LRQKGGYRRLQFVAHARVPILKFE- 121
D+D+ C+ + + +LRA L ++ ++ + +AR+PI+K
Sbjct: 88 SDMDLC-------CVLDVRPETPPNASQFVLRAAQLLERETKFAVKPLPNARIPIIKLSL 140
Query: 122 ----TIHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKT 176
+I I+CDI +N ++ LF + ID R R +VL +K WAK IN+P
Sbjct: 141 QPSPSIPFGIACDIGFENRLALENTRLLFTYAAIDPTRVRTLVLFLKLWAKRRKINSPYH 200
Query: 177 GTFNSYSLSLLVLFHF-----QTCVPAI--LPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
GT +SY +LLV+F +P + +PP++ I P D + V
Sbjct: 201 GTLSSYGYALLVIFFLVHVKDPPVLPNLQQMPPMRPISPSETHIDGRNV----------- 249
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS-----GLSLKASELGICPFTGQWEHIR 284
F+ K++ N ++ L + F FS G S+ + G H+
Sbjct: 250 WFFDDIELLRRKWQSPNTETIGELLLDFFRYFSRDFSYGTSVASIRAG---------HLS 300
Query: 285 SNTRWLPNNHP-------LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQ 336
T+ N P L+IEDPF N R V+ L I F M R+ +T
Sbjct: 301 KETKVDANKPPDPREGTSLWIEDPFATDFNVGRCVTRDGLYTIRGEF-MRALRILNTKH 358
>gi|57863246|ref|NP_056084.1| terminal uridylyltransferase 4 isoform b [Homo sapiens]
gi|116242850|sp|Q5TAX3.3|TUT4_HUMAN RecName: Full=Terminal uridylyltransferase 4; Short=TUTase 4;
AltName: Full=Zinc finger CCHC domain-containing protein
11
gi|119627183|gb|EAX06778.1| zinc finger, CCHC domain containing 11, isoform CRA_a [Homo sapiens]
gi|119627185|gb|EAX06780.1| zinc finger, CCHC domain containing 11, isoform CRA_a [Homo sapiens]
Length = 1644
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 950 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1008
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1009 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1065
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1066 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1125
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1126 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1176
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1177 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1224
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1225 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1255
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC Q
Sbjct: 601 TETDNQSNAMKEKHGKSPLALETPNRVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 688
>gi|432921901|ref|XP_004080278.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Oryzias latipes]
Length = 794
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 140/322 (43%), Gaps = 59/322 (18%)
Query: 42 VESLRGATVEPFGSFVSNLFSRWGDLDISIELSN---------------GSCISSAGKKV 86
VE + + PFGS V+ DLD+ ++L N G +S G+
Sbjct: 192 VEFFPDSEILPFGSSVNTFGIHSCDLDLFLDLENTKTFQARAKSTTEQVGEGVSDDGRS- 250
Query: 87 KQSLLGDL------------LRAL---RQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
+ S+L D+ L A R ++ V AR+P++KF+ N+ DI
Sbjct: 251 EDSILSDIDLTTASPAEVLDLVATILKRCVPNVHKVHVVGTARLPVVKFQHHKLNLQGDI 310
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGT---FNSYSLSLLV 188
+I+N G ++FL S ++ R R +V ++ WA+ + +G N+Y+L+LLV
Sbjct: 311 TINNRLGVRNTRFLQLCSGMEERLRPLVYTIRFWARQKKLAGNPSGAGPLLNNYALTLLV 370
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI--CAF---NIARFSSDKYR 243
+F+ Q C P +LP V+ LK + E + E C F IA S +
Sbjct: 371 IFYLQNCEPPVLP---------TVEQLKDMACEEEECVIEGWNCTFPSQPIAVLPSKNTQ 421
Query: 244 KINRSSLAHLFVSFLEKF----SGLSLKASE-LGICPFTGQWEHIRSNTRWLPNNHP--- 295
+ SSL F SF KF + +SL+ L + F G+ + N HP
Sbjct: 422 DL--SSLLAGFFSFYAKFDFASNVVSLREGRALPVVDFLGKGKEEEENPPKGSRQHPKLG 479
Query: 296 -LFIEDPFEQPENSARAVSEKN 316
+ + DPFE N A ++E++
Sbjct: 480 SMTLLDPFELSHNVAGNLNERS 501
>gi|392333761|ref|XP_001066334.3| PREDICTED: terminal uridylyltransferase 7-like [Rattus norvegicus]
gi|392354131|ref|XP_001060307.3| PREDICTED: terminal uridylyltransferase 7-like [Rattus norvegicus]
Length = 1479
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 144/338 (42%), Gaps = 50/338 (14%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 1003 NILDQVCIQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1060
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1061 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1112
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ LF S ID R + + +K + K DI + G+
Sbjct: 1113 FSHLRSGLEVDISLYNTLALHNTRLLFAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1172
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGN-----LVDDLKGVRANAERQIAEICAFNI 234
+SY+ +L+VL+ Q P ++P L++IY G LVD G QI E+
Sbjct: 1173 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKGEKKPEILVD---GWNIYFFDQINELPT--- 1226
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSN 286
+Y K N S+ L++ L ++ +S++ L + F QW
Sbjct: 1227 ---CWPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL-LTTFKKQW------ 1275
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N +S K I AF
Sbjct: 1276 -----TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1308
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 87/193 (45%), Gaps = 14/193 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
E+ + R+++ + V + L ++ +GS S L R D D++I++ + +S
Sbjct: 311 ENLDQRLEIKCAMENVFQ--HKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS-- 364
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + L+ + + HARVP++ + C +S N + +
Sbjct: 365 ----QPDVLLLVQECLKNSDCFIDVDADFHARVPVVVCRHKQSGLLCKVSAGNENAWLTT 420
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
K L + +++ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP
Sbjct: 421 KHLTALGKLEPRLLPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP- 479
Query: 203 LKDIYPGNLVDDL 215
+Y G+ +++
Sbjct: 480 ---VYLGSWIEEF 489
>gi|149039758|gb|EDL93874.1| rCG24089, isoform CRA_c [Rattus norvegicus]
Length = 1217
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 144/338 (42%), Gaps = 50/338 (14%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 681 NILDQVCIQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 738
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 739 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 790
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ LF S ID R + + +K + K DI + G+
Sbjct: 791 FSHLRSGLEVDISLYNTLALHNTRLLFAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 850
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGN-----LVDDLKGVRANAERQIAEICAFNI 234
+SY+ +L+VL+ Q P ++P L++IY G LVD G QI E+
Sbjct: 851 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKGEKKPEILVD---GWNIYFFDQINELPT--- 904
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSN 286
+Y K N S+ L++ L ++ +S++ L + F QW
Sbjct: 905 ---CWPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL-LTTFKKQW------ 953
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N +S K I AF
Sbjct: 954 -----TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 986
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 4/104 (3%)
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
HARVP++ + C +S N + +K L + +++ R +V+ + WAK I
Sbjct: 68 HARVPVVVCRHKQSGLLCKVSAGNENAWLTTKHLTALGKLEPRLLPLVIAFRYWAKLCSI 127
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL 215
+ P+ G Y +L+ +F Q +LP +Y G+ +++
Sbjct: 128 DRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSWIEEF 167
>gi|380811046|gb|AFE77398.1| terminal uridylyltransferase 4 isoform b [Macaca mulatta]
gi|383416969|gb|AFH31698.1| terminal uridylyltransferase 4 isoform b [Macaca mulatta]
Length = 1639
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 949 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1007
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1008 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1064
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1065 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1124
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1125 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1175
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1176 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1223
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1224 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1254
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 132/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 370 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 419
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 420 PKINHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 479
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 480 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 539
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 540 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 599
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC
Sbjct: 600 TETDNQSNAMKEKHGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIRDI- 657
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 658 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 687
>gi|148232888|ref|NP_001087892.1| poly(A) RNA polymerase GLD2-A [Xenopus laevis]
gi|82180930|sp|Q641A1.1|GLD2A_XENLA RecName: Full=Poly(A) RNA polymerase GLD2-A; AltName:
Full=PAP-associated domain-containing protein 4-A
gi|51950239|gb|AAH82438.1| MGC83633 protein [Xenopus laevis]
Length = 509
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 137/297 (46%), Gaps = 27/297 (9%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY-RRLQFVAH 112
GS ++ +R D D+ + L +LG L + + Y RLQF+
Sbjct: 228 GSSLNGFGTRISDADLCLVLKEEPMNQHTEAT---QILGLLHKLFYTRLSYIERLQFI-R 283
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDI 171
A+VPI+KF D++++N+ G I++ FL + ++ R R +VL++K+WA H I
Sbjct: 284 AKVPIVKFRDKVSGAEFDLNVNNVVG-IRNTFLLRTYAYLESRVRPLVLVIKKWANHHGI 342
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG--NLVDDLKGVRANAERQIAEI 229
N+ GT +SY+L L+VL + QT ILP L+ YP +L L V +A R I
Sbjct: 343 NDASRGTLSSYTLVLMVLHYLQTLPEPILPSLQKKYPECFDLSMQLNLVH-HAPRNIP-- 399
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW 289
Y N + L L + FL+ F+ + S+ I G+ + W
Sbjct: 400 -----------PYLSKNETPLGDLLLGFLKYFA-VEFDWSKDIISVREGKALPRSDDYLW 447
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLA 346
N + +E+PF+ N+ARAV E+ + A + + ++ Y+LL A
Sbjct: 448 --RNKYICVEEPFDG-TNTARAVYERQKFDMIRAEFLKAWGALRDDRDLYSLLPVTA 501
>gi|268567892|ref|XP_002640105.1| C. briggsae CBR-MUT-2 protein [Caenorhabditis briggsae]
Length = 446
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 121/265 (45%), Gaps = 34/265 (12%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIEL--------- 73
E+++ +M + L+ ++ S + P GS V+ L ++ DLD++I +
Sbjct: 59 EEFDRKMDLCYQLKNIISKHNSTWLFNIVPTGSTVTGLATKNSDLDVAIHIPQAAKLLEE 118
Query: 74 --SNGSCISSAGKKVKQSLLGDLLRALR----------QKGGYRRLQFVAHARVPILKFE 121
S+ I ++ + + ++L+ +R + + + + A++ ILK E
Sbjct: 119 MHSDIYHIEEERNRLWRGMQLEILQIVRLLLENDEQIKSRIDWNKGVQLVQAQIQILKIE 178
Query: 122 TIHQNISCDISI--DNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGT 178
T+ I CD+S+ D + + F+ + ID RF + +VK+WA + + NPK G
Sbjct: 179 TV-DGIDCDVSVVMDPFLSSMHNSFMIRHFANIDARFAPLCAVVKQWAASSGVKNPKEGG 237
Query: 179 FNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
FNSY+L +LV+ HF C P ILP L +Y + A +RQ F
Sbjct: 238 FNSYALVILVI-HFLQCGAYPPILPHLSKLYKDD------NFIAQNDRQYPLRLDFGAPL 290
Query: 237 FSSDKYRKINRSSLAHLFVSFLEKF 261
+ N SSLA LF+ FL +
Sbjct: 291 PRALPTVSANHSSLAQLFLEFLHYY 315
>gi|326475011|gb|EGD99020.1| PAP/25A associated domain-containing protein [Trichophyton
tonsurans CBS 112818]
Length = 1074
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 98/196 (50%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E E R+K + L +++++ V FGS + L + D+DI
Sbjct: 50 IKELYQKLLPSPESEERRVKFVRKLEKLLDTQWPGNEIKVNVFGSSGNKLCTSDSDVDI- 108
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K + +L D L K G R+ V+HA+VPI+K ++C
Sbjct: 109 -------CITTPSKCFEPVCVLADFL----AKSGMERVVCVSHAKVPIVKIWDPELQVAC 157
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + +LVK W K +N+ GT +SY+ L+
Sbjct: 158 DMNVNNTLALENTRMIKTYVELDDRIRPLAMLVKHWTKRRILNDAALGGTLSSYTWICLI 217
Query: 189 LFHFQTCVPAILPPLK 204
+ QT +P I+P L+
Sbjct: 218 INFLQTRIPPIVPSLQ 233
>gi|332635009|ref|NP_001193859.1| terminal uridylyltransferase 4 [Bos taurus]
gi|296489128|tpg|DAA31241.1| TPA: Caffeine Induced Death homolog family member (cid-1)-like [Bos
taurus]
Length = 1639
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 146/332 (43%), Gaps = 37/332 (11%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
++L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 948 DILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRD 1006
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1007 SDLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRR 1063
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1064 SGLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAY 1123
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1124 ILMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRL 1174
Query: 244 ---KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPN 292
N +L L++ L ++ +S++ +L + F QW
Sbjct: 1175 PSLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------T 1222
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 1223 SKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1254
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D + R +++ ++ +VV + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLKVRQEIVEEMSKVVTTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 421 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATERNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +KY K NR SL L++ L KF L E IC
Sbjct: 601 TETTNQSNARKEKYGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIKDI- 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSIKRNVARSLN 688
>gi|440906885|gb|ELR57101.1| Terminal uridylyltransferase 4 [Bos grunniens mutus]
Length = 1665
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 146/332 (43%), Gaps = 37/332 (11%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
++L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 969 DILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRD 1027
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1028 SDLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRR 1084
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1085 SGLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAY 1144
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1145 ILMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRL 1195
Query: 244 ---KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPN 292
N +L L++ L ++ +S++ +L + F QW
Sbjct: 1196 PSLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------T 1243
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 1244 SKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1275
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D + R +++ ++ +VV + L ++ +GS ++ + D++I I+
Sbjct: 392 DDLKVRQEIVEEMSKVVTTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 441
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 442 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 501
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 502 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 561
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A IAE
Sbjct: 562 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATERNSIAEENKAKADQPKDDTKK 621
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +KY K NR SL L++ L KF L E IC
Sbjct: 622 TETTNQSNARKEKYGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIKDI- 679
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 680 LTRENKNW-PKRR-IAIEDPFSIKRNVARSLN 709
>gi|326483183|gb|EGE07193.1| PAP/25A associated domain-containing protein [Trichophyton equinum
CBS 127.97]
Length = 1146
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 98/196 (50%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E E R+K + L +++++ V FGS + L + D+DI
Sbjct: 122 IKELYQKLLPSPESEERRVKFVRKLEKLLDTQWPGNEIKVNVFGSSGNKLCTSDSDVDI- 180
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K + +L D L K G R+ V+HA+VPI+K ++C
Sbjct: 181 -------CITTPSKCFEPVCVLADFL----AKSGMERVVCVSHAKVPIVKIWDPELQVAC 229
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + +LVK W K +N+ GT +SY+ L+
Sbjct: 230 DMNVNNTLALENTRMIKTYVELDDRIRPLAMLVKHWTKRRILNDAALGGTLSSYTWICLI 289
Query: 189 LFHFQTCVPAILPPLK 204
+ QT +P I+P L+
Sbjct: 290 INFLQTRIPPIVPSLQ 305
>gi|344241837|gb|EGV97940.1| Terminal uridylyltransferase 4 [Cricetulus griseus]
Length = 1451
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 826 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 884
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 885 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 941
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 942 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1001
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1002 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1052
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1053 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1100
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1101 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1131
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R ++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 250 DDLRVRQNIVEEMSKVIMTY--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 299
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 300 PKMNHPDLLIQVLGILKKSTLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 359
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 360 DLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 419
Query: 203 L 203
L
Sbjct: 420 L 420
>gi|291401945|ref|XP_002717334.1| PREDICTED: PAP associated domain containing 1-like [Oryctolagus
cuniculus]
Length = 582
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 124/299 (41%), Gaps = 51/299 (17%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG---- 102
TV PFGS V N F + G DLD+ ++L GK+ +G+ L + K
Sbjct: 225 CTVRPFGSSV-NTFGKLGCDLDMFLDLDE------IGKRSTPKTVGNFLLEFQVKNVPSE 277
Query: 103 --------------------GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
G +Q + +AR P+++F CD++ +N S
Sbjct: 278 RIATQKILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTANNRIALKSS 337
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILP 201
+ L+ +D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP
Sbjct: 338 ELLYLYGALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILP 397
Query: 202 PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
L+ + +D + N + ++ +R N L L F E F
Sbjct: 398 TLESLKALASAEDKCVIEGNNCTFVRDLNKIQPSR---------NTEPLELLLKEFFEYF 448
Query: 262 SGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P++ PL I++PFE N +R VS+ L K
Sbjct: 449 GNFAFNKNSINI-------RQGREQNK--PDSSPLHIQNPFETSLNISRNVSQSQLQKF 498
>gi|350586191|ref|XP_003128036.3| PREDICTED: terminal uridylyltransferase 4 [Sus scrofa]
Length = 1376
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 681 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 739
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 740 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 796
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 797 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 856
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 857 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 907
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 908 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 955
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 956 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 986
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 76/334 (22%), Positives = 134/334 (40%), Gaps = 55/334 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +VV + L ++ +GS ++ + D++I I+
Sbjct: 102 DDLRVRQEIVEEMSKVVTAF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 151
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 152 PKMNHPDLLIQVLGILKKSVLYLDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 211
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ----TCVPA 198
L + +++ F +VL + WAK I++ G SY +L+V+F Q +P
Sbjct: 212 DLLAALGKMEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKHPLLPC 271
Query: 199 ILPPLKDIYPGNLVDD--LKGV-------------RANAERQIAE--------------- 228
+L + + +DD LKG+ A + IAE
Sbjct: 272 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 331
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC Q
Sbjct: 332 TETDNQSNAMKEKHGKSPLTLETPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 389
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
R N W P + IEDPF N AR+++ +
Sbjct: 390 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLNSQ 421
>gi|350580023|ref|XP_003480737.1| PREDICTED: LOW QUALITY PROTEIN: speckle targeted PIP5K1A-regulated
poly(A) polymerase-like [Sus scrofa]
Length = 926
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 117/271 (43%), Gaps = 36/271 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 372 GDLGKAVELAQALKGEKAEGGAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 429
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V V+ WA+ ++ ++Y+L
Sbjct: 430 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTVRCWAQGRGLSG-SGPLLSNYAL 488
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 489 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 536
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N+ L+ L F S L+ S L + P G WE +R
Sbjct: 537 EPSTNKEPLSSLLAQFFSCISCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 591
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
P+ ++DPF+ N A V+ + ++ N
Sbjct: 592 ---PMNLQDPFDLSHNVAANVTSRVAGRLQN 619
>gi|195469707|ref|XP_002099778.1| GE16534 [Drosophila yakuba]
gi|194187302|gb|EDX00886.1| GE16534 [Drosophila yakuba]
Length = 612
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/319 (24%), Positives = 135/319 (42%), Gaps = 59/319 (18%)
Query: 27 TRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKK 85
RM+ ++ L+ A +PFGS V N F R G DLD+ + N S+ ++
Sbjct: 191 VRMRFLAALQVQQAIAGMFPAAQAQPFGSSV-NGFGRMGCDLDLILRFDNDMGAKSSLEE 249
Query: 86 VKQSLL----------------------GDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
S L GD+L G ++ + ARVPI+K+
Sbjct: 250 AVPSRLVYHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIKYHHE 307
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSY 182
H ++ D+S+ NL G S+ L+ ++D R R + ++ WA+ + NP G + +++
Sbjct: 308 HLDLEVDLSMSNLTGFYMSELLYMFGEMDPRVRPLTFTIRRWAQTCGLTNPSPGRWISNF 367
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI---CAF--NIARF 237
SL+ LV+F Q ILP + L + + ++ E C F N+ R
Sbjct: 368 SLTCLVMFFLQQLRQPILP---------TIGALTKAAESGDSRVTEDGINCTFTRNVDRL 418
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC---PFTGQWEHIRSNTRWLPNNH 294
+R N+SSL+ L + F E +S + + P + P++
Sbjct: 419 G---FRSRNQSSLSELLLQFFEFYSQFDFHNRAISLNEGKPLSK------------PDHS 463
Query: 295 PLFIEDPFEQPENSARAVS 313
++I +P EQ N ++ VS
Sbjct: 464 AMYIVNPLEQLLNVSKNVS 482
>gi|6324457|ref|NP_014526.1| non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
S288c]
gi|1717744|sp|P53632.1|PAP2_YEAST RecName: Full=Poly(A) RNA polymerase protein 2; AltName: Full=DNA
polymerase kappa; AltName: Full=DNA polymerase sigma;
AltName: Full=Topoisomerase 1-related protein TRF4
gi|663237|emb|CAA88145.1| ORF [Saccharomyces cerevisiae]
gi|950226|gb|AAC49091.1| Trf4p [Saccharomyces cerevisiae]
gi|1419987|emb|CAA99134.1| TRF4 [Saccharomyces cerevisiae]
gi|51830518|gb|AAU09782.1| YOL115W [Saccharomyces cerevisiae]
gi|285814775|tpg|DAA10668.1| TPA: non-canonical poly(A) polymerase PAP2 [Saccharomyces
cerevisiae S288c]
gi|392296670|gb|EIW07772.1| Pap2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 584
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R + IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 183 IKDFVAYISPSREEIEIRNQTISTIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 240
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++K ++ VA ARVPI+KF H I D
Sbjct: 241 V-------TSELGGKESRNNLYSLASHLKKKNLATEVEVVAKARVPIIKFVEPHSGIHID 293
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 294 VSFERTNGIEAAKLIREWLDDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 351
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 352 FSFLHMHPRII 362
>gi|195130965|ref|XP_002009921.1| GI15633 [Drosophila mojavensis]
gi|193908371|gb|EDW07238.1| GI15633 [Drosophila mojavensis]
Length = 613
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/360 (23%), Positives = 153/360 (42%), Gaps = 71/360 (19%)
Query: 3 SYNVLEPILKDILGM---LNPLRE-----DWETRMKVISDLREVVESVESL-RGATVEPF 53
S+ L+ +L+ G+ LN L E + RM+ I+ L +V +++ + A +PF
Sbjct: 163 SHEALKELLRGAAGIDQQLNLLYEQTRLNELGVRMRFIAAL-QVEQAISGMFPDALAQPF 221
Query: 54 GSFVSNLFSRWGDLDISIE--------------------------LSNGSCISSAGKKVK 87
GS V+ DLD+ + LSNG + + +
Sbjct: 222 GSSVNGFGKMGCDLDLILRFDGKTPGTDQDSQREASRLIYHTKENLSNGR----SQTQRQ 277
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+GD+L G ++ + ARVPI+K+ H ++ D+S+ NL G S+ L+
Sbjct: 278 MECIGDMLHLFLP--GVCHVRRILQARVPIIKYHHEHLDLEIDLSMSNLTGFFMSELLYM 335
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDI 206
++D R R + V+ WA++ + NP G + ++SL+ LV+F Q ILP + +
Sbjct: 336 FGEMDPRVRPLTFCVRRWAQSCGLTNPSPGRWITNFSLTCLVMFFLQQMRQPILPSIGAM 395
Query: 207 YPGNLVDDLK----GVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
D++ G+ R + + ++ N SSL+ L + F E +S
Sbjct: 396 VKAANTADIRVTEDGINCTFARDMERV-----------GFQSRNTSSLSELLLQFFEFYS 444
Query: 263 GLSLKASELGICPFTGQWEHIRSNTRWL--PNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + R L P++ ++I +P EQ N ++ VS + ++
Sbjct: 445 QFDFHNRAISL-----------NEGRALAKPDHSAMYIVNPLEQLLNVSKNVSLEECERL 493
>gi|66472546|ref|NP_001018436.1| poly(A) RNA polymerase GLD2 [Danio rerio]
gi|82192766|sp|Q503I9.1|GLD2_DANRE RecName: Full=Poly(A) RNA polymerase GLD2; AltName:
Full=PAP-associated domain-containing protein 4
gi|63100692|gb|AAH95312.1| Zgc:110560 [Danio rerio]
Length = 489
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 150/332 (45%), Gaps = 31/332 (9%)
Query: 21 LREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
L + R + +D++++ + G GS ++ SR D D+ + + G
Sbjct: 180 LEKKESCRAALQTDIQKIFPCAKVFLG------GSSLNGFGSRSSDADLCLVIEEGP--- 230
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ + L+R L K Y + A+VPI+KF + D++ +N G I
Sbjct: 231 -VNHRKDAVYVLSLVRKLLYKLSYIEKPQLIRAKVPIVKFRDRISGVEFDLNFNNTVG-I 288
Query: 141 KSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAI 199
++ FL + ++ R R +VL++K+WA H IN+ GT +SY+L L+VL + QT +
Sbjct: 289 RNTFLLRTYAFVEKRVRPLVLVIKKWANHHCINDASRGTLSSYTLVLMVLHYLQTLPEPV 348
Query: 200 LPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLE 259
+P L+ YP D K ++I AF ++R N+SSL LF+ FL
Sbjct: 349 IPCLQRDYPTCF--DPKMDIHLVPSGPSDIPAF-VSR---------NQSSLGDLFLGFLR 396
Query: 260 KFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLA 318
++ + K + I + + W + + +E+PF + N+ARAV E+
Sbjct: 397 YYATV-FKWDKQVISVRMARTLPKSNCKEW--KDKFICVEEPFNR-TNTARAVHERMKFE 452
Query: 319 KISNAFEMTHFRLTSTNQTRYALLSS--LARP 348
I AF +H L + L S +ARP
Sbjct: 453 AIKAAFIESHRLLQLRKDLNFILPKSKQMARP 484
>gi|347965367|ref|XP_322031.4| AGAP001130-PA [Anopheles gambiae str. PEST]
gi|333470543|gb|EAA01442.4| AGAP001130-PA [Anopheles gambiae str. PEST]
Length = 1187
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/212 (31%), Positives = 103/212 (48%), Gaps = 33/212 (15%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPIL+F+ I D++ +N G + L SQ+D R R +VL+VK WA+ H+IN
Sbjct: 951 AKVPILRFQDSKHGIEVDLNFNNCVGIRNTHLLHCYSQMDWRVRPLVLVVKLWARHHNIN 1010
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA 231
+ K T +SYSL L+V+ Q P +LP L ++P E+ + I
Sbjct: 1011 DAKNMTISSYSLVLMVIHFLQYGTSPPVLPCLHALHP--------------EKFMKIIDI 1056
Query: 232 FNIARFSS-DKYRKINRSSLAHLFVSFLEKFS-------GLSLKASELGICPFTGQWEHI 283
NI + Y N+ SL L +SFL+ ++ +S++ S I P E
Sbjct: 1057 HNIEMIERIEPYHTDNKESLGELLLSFLDYYTKFDYEHYAISVRTST--IIPI----EEC 1110
Query: 284 RSNTRWLPNNH---PLFIEDPFEQPENSARAV 312
R + + H L IE+PF+ N+AR+V
Sbjct: 1111 RLARSYKNDPHHWKHLCIEEPFD-FTNTARSV 1141
>gi|334321492|ref|XP_003340115.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 4-like
[Monodelphis domestica]
Length = 1597
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 146/332 (43%), Gaps = 37/332 (11%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
++L+ + K L+P + ++R ++++ L ++ E A + FGS + R
Sbjct: 901 DILDLVCKKCFDELSPPFSEQQSREQILASLERFIQK-EYNEKARLCLFGSSKNGFGFRD 959
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + L +A K + ++ L + L++ G + + + A+VPI+KFE
Sbjct: 960 SDLDICMTLDGHE---NAEKLNCKEIIEGLAKILKRHPGLKNILPITTAKVPIVKFEHRR 1016
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1017 SGLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAY 1076
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1077 ILMVLYFLQQRDPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDDMEELKKRL 1127
Query: 244 ---KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPN 292
N +L L++ L ++ +S++ +L + F QW
Sbjct: 1128 PSLGQNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------T 1175
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 1176 SKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1207
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/178 (21%), Positives = 84/178 (47%), Gaps = 10/178 (5%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D+ R +++++ +V++ L ++ +GS ++ + D++I ++ S +S
Sbjct: 316 DFGFRQDIVTEMEKVIQL--RLPDCSLRLYGSSMTRFAFKSSDVNIDVKFP--STMSHP- 370
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
+L +L L+ Y ++ HA+VP++ + + + C +S N + +
Sbjct: 371 -----DVLIQVLDILKNCALYSEVESDFHAKVPVVFCKDVKSGLICKVSAGNDVACLTTD 425
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
L + +++ +VL + WA+ I+ G SYS +L+V+F Q P +LP
Sbjct: 426 LLAALGKLEPVLTPLVLAFRYWARLCHIDCQAEGGIPSYSFALMVMFFLQQRKPPLLP 483
>gi|349581056|dbj|GAA26214.1| K7_Pap2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 584
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R + IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 183 IKDFVAYISPSREEIEIRNQTISTIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 240
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++K ++ VA ARVPI+KF H I D
Sbjct: 241 V-------TSELGGKESRNNLYSLASHLKKKNLATEVEVVAKARVPIIKFVEPHSGIHID 293
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 294 VSFERTNGIEAAKLIREWLDDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 351
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 352 FSFLHMHPRII 362
>gi|395530214|ref|XP_003767192.1| PREDICTED: terminal uridylyltransferase 4 [Sarcophilus harrisii]
Length = 1588
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 146/332 (43%), Gaps = 37/332 (11%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
++L+ + K L+P + ++R ++++ L ++ E A + FGS + R
Sbjct: 915 DILDLVCKKCFDELSPPFSEQQSREQILASLERFIQK-EYNEKARLCLFGSSKNGFGFRD 973
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + L +A K + ++ L + L++ G + + + A+VPI+KFE
Sbjct: 974 SDLDICMTLDGHE---NAEKLNCKEIIEGLAKILKRHPGLKNILPITTAKVPIVKFEHRR 1030
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1031 SGLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAY 1090
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1091 ILMVLYFLQQRDPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDDMEELKKRL 1141
Query: 244 ---KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPN 292
N +L L++ L ++ +S++ +L + F QW
Sbjct: 1142 PSLGQNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------T 1189
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 1190 SKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1221
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/178 (20%), Positives = 85/178 (47%), Gaps = 10/178 (5%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D+ R ++++++ +V++ + L ++ +GS ++ + D++I ++ +
Sbjct: 322 DFGFRQEIVTEMEKVIQ--QRLPDCSLRLYGSSLTRFAFKSSDVNIDVKFPS-------- 371
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
K +L +L L+ Y ++ HA+VP++ + + + C +S N + +
Sbjct: 372 KMSHPDVLIQVLDILKNCALYSEVESDFHAKVPVVFCKDVKSGLICKVSAGNDVACLTTD 431
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
L + +++ +VL + WA+ I+ G SYS +L+V+F Q P +LP
Sbjct: 432 LLAALGKLEPVLTPLVLAFRYWARLCHIDCQAEGGIPSYSFALMVMFFLQQRKPPLLP 489
>gi|195396577|ref|XP_002056907.1| GJ16636 [Drosophila virilis]
gi|194146674|gb|EDW62393.1| GJ16636 [Drosophila virilis]
Length = 1399
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 66/98 (67%), Gaps = 3/98 (3%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F+ I D++ +N G IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 1113 ARVPILRFKDRINGIEVDLNYNNCVG-IKNTYLLQLYAQLDWRTRPLVVIVKLWAQYHDI 1171
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ-TCVPAILPPLKDIYP 208
N+ K T +SYSL L+VL + Q CVP +LP L+ +YP
Sbjct: 1172 NDAKRMTVSSYSLVLMVLHYLQYGCVPHVLPCLQALYP 1209
>gi|395855062|ref|XP_003799990.1| PREDICTED: terminal uridylyltransferase 4 [Otolemur garnettii]
Length = 1620
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 951 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1009
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1010 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1066
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1067 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1126
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1127 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PKRMVDGWNAFFFDKTEELKKRLP 1177
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1178 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1225
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1226 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1256
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/181 (23%), Positives = 84/181 (46%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS +S + D++I I+
Sbjct: 372 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLSKFALKSSDVNIDIKF--------P 421
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 422 PKMNHPDLLIQVLGILKKTASYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 481
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 482 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 541
Query: 203 L 203
L
Sbjct: 542 L 542
>gi|320041109|gb|EFW23042.1| hypothetical protein CPSG_00941 [Coccidioides posadasii str.
Silveira]
Length = 1069
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E + R+K + L ++ + V FGS + L S D+DI
Sbjct: 151 MKELYKKLLPSSESEQRRIKFVKKLENLLNTQWPGNDIKVHVFGSSGNKLCSSDSDVDI- 209
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ LL D L K G R+ V+HA+VPI+K ++C
Sbjct: 210 -------CITTPFKELEHVCLLADFL----AKNGMERVVCVSHAKVPIVKIWDPELQVAC 258
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 259 DMNVNNTMALENTRMIRTYVEVDERVRPLAMIVKHWTKQRILNDAALGGTLSSYTWICLI 318
Query: 189 LFHFQTCVPAILPPLK 204
+ QT P I+P L+
Sbjct: 319 INFLQTRSPPIVPSLQ 334
>gi|327271119|ref|XP_003220335.1| PREDICTED: terminal uridylyltransferase 4-like [Anolis carolinensis]
Length = 1606
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/322 (24%), Positives = 141/322 (43%), Gaps = 37/322 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E A + FGS + R
Sbjct: 898 ILDLVCKRCFDELSPPFSEQQNREQILASLERFIRK-EYNDKARLCLFGSSKNGFGFRDS 956
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 957 DLDICMTLEGHE---NAEKLNCKEIIENLAKVLKKHPGLRNILPITTAKVPIVKFEHRRS 1013
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1014 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1073
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1074 LMVLYFLQQREPPVIPVLQEIFDGQQI---------PQRMVDGWNAFFFDDTEELKKRLP 1124
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1125 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1172
Query: 294 HPLFIEDPFEQPENSARAVSEK 315
+ IEDPF+ N VS K
Sbjct: 1173 KCIAIEDPFDLNHNLGAGVSRK 1194
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/199 (22%), Positives = 93/199 (46%), Gaps = 14/199 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D++ R ++ D+ ++++ + L T+ +GS ++ + D++I I+ S +S
Sbjct: 313 DDFKIRQDIVRDMEKIIQ--QHLPECTLRMYGSCLTRFAFKTSDVNIDIKFP--STMSHP 368
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+L L L+ Y ++ HA+VP++ + ++C +S N + +
Sbjct: 369 ------DVLIQALEILKNIACYSDVESDFHAKVPVIFCKDNKSGLTCKVSAGNDVACLTT 422
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ +VL + WA+ I+ G SYS +L+V+F Q P ILP
Sbjct: 423 DLLAALGKLEPVLVPLVLAFRYWARLCHIDCQAEGGIPSYSFALMVIFFLQQREPRILPS 482
Query: 203 LKDIYPGNLVDDLKGVRAN 221
Y G+ ++ +A+
Sbjct: 483 ----YLGSWIEGFDSKKAD 497
>gi|303319029|ref|XP_003069514.1| PAP/25A associated domain containing protein [Coccidioides
posadasii C735 delta SOWgp]
gi|240109200|gb|EER27369.1| PAP/25A associated domain containing protein [Coccidioides
posadasii C735 delta SOWgp]
Length = 1109
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E + R+K + L ++ + V FGS + L S D+DI
Sbjct: 151 MKELYKKLLPSSESEQRRIKFVKKLENLLNTQWPGNDIKVHVFGSSGNKLCSSDSDVDI- 209
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ LL D L K G R+ V+HA+VPI+K ++C
Sbjct: 210 -------CITTPFKELEHVCLLADFL----AKNGMERVVCVSHAKVPIVKIWDPELQVAC 258
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 259 DMNVNNTMALENTRMIRTYVEVDERVRPLAMIVKHWTKQRILNDAALGGTLSSYTWICLI 318
Query: 189 LFHFQTCVPAILPPLK 204
+ QT P I+P L+
Sbjct: 319 INFLQTRSPPIVPSLQ 334
>gi|432095581|gb|ELK26719.1| Terminal uridylyltransferase 4 [Myotis davidii]
Length = 1660
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 970 ILDLVCKRCFDELSPPFSEQHNREQILMGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1028
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1029 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1085
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1086 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1145
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1146 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1196
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1197 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1244
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1245 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1275
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 76/332 (22%), Positives = 130/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +V+ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 393 DDLRVRQEVVEEMSKVITAF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 442
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 443 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 502
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L I +++ F + L + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 503 DLLAAIGKMEPVFIPLALAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 562
Query: 203 LK----DIYPGNLVDD--LKGV-------------RANAERQIAEICAFNIARFSSDKYR 243
L + + +DD LKG+ A + IAE + D R
Sbjct: 563 LLGSWIEGFDSKRMDDFQLKGIIEEKFVKWEHNSSSATEKNSIAEENKAKADQLKVDTRR 622
Query: 244 ----------------------KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N+ SL L++ L KF L E IC
Sbjct: 623 TEKDNQSNAIKGKHGKSPLTLETPNQVSLGQLWLELL-KFYTLDFALEEYVICVRIKDI- 680
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 681 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 710
>gi|393908275|gb|EJD74988.1| PAP/25A associated domain-containing protein [Loa loa]
Length = 1344
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 103/224 (45%), Gaps = 24/224 (10%)
Query: 110 VAHARVPILKFETI-HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ HA+VPI+KF H ++ D+S+ N+ ++ L S++D R + ++ K WAK
Sbjct: 1072 IPHAKVPIVKFRCRNHYHLEADVSLYNVLALENTRLLRTYSKLDRRIHQLGIMTKMWAKN 1131
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
+I N G+ +SYS ++++ + Q P + P L+++ P E I +
Sbjct: 1132 CEIGNASKGSLSSYSYIIMLIHYLQRTNPPVAPFLQELVPPGRY---------REPVIID 1182
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
C F K+ NRS++ L++ FL+ F G FT + IR
Sbjct: 1183 DCDVYFCSFEDLKWTIHNRSTVGELWIGFLDYF-GTKFD--------FTREVIQIRQTLP 1233
Query: 289 WLP-----NNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
L + P+ IEDPF+ N + V K +A I +F ++
Sbjct: 1234 LLKLDKGWQSRPIAIEDPFDLTHNLSSGVHSKTMAYIQKSFILS 1277
>gi|348554637|ref|XP_003463132.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 4-like
[Cavia porcellus]
Length = 1620
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 934 ILDLVCKRCFDELSPPFSEQYNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 992
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 993 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1049
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1050 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1109
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1110 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1160
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1161 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1208
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1209 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1239
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 75/332 (22%), Positives = 135/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED + R +++ ++ +V+ + L ++ +GS ++ + D++I I+ +
Sbjct: 358 EDLKARQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKFPS------- 408
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 409 -TMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVICKDRKSGLLCRVSAGNDTACLTT 467
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY L+ +F Q P ILP
Sbjct: 468 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFVLMTMFFLQQRKPPILPC 527
Query: 203 L-----KDIYPGNLVD-DLKGV----------RANAERQIAEICAFNIARF-----SSDK 241
L + P + D LKG+ ++++ + I N A+ + K
Sbjct: 528 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYKSSSATEKNSIAEENKAKADQPKDDTKK 587
Query: 242 YRKINRS--------------------SLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
+N+S SL L++ L KF L E IC Q
Sbjct: 588 TETVNQSNAMKEKHGKSPLTLETPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 645
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 646 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 675
>gi|344278690|ref|XP_003411126.1| PREDICTED: terminal uridylyltransferase 4 [Loxodonta africana]
Length = 1643
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 949 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1007
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1008 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1064
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1065 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1124
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1125 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1175
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1176 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1223
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1224 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1254
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/181 (22%), Positives = 83/181 (45%), Gaps = 10/181 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
++ R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 374 DELRVRQEIVEEMSKVITTC--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 423
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 424 PKMSHPDLLIQVLGILKKNVSYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 483
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + + + F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 484 DLLAALGKREPVFTPLVLAFRYWAKLCHIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 543
Query: 203 L 203
L
Sbjct: 544 L 544
>gi|134026254|gb|AAI36216.1| papd4 protein [Xenopus (Silurana) tropicalis]
Length = 523
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 134/301 (44%), Gaps = 43/301 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH- 112
GS ++ +R D D+ C+ + + Q + +L K Y RL ++
Sbjct: 242 GSSLNGFGTRSSDADL--------CLVLKDEPMNQHTEARHILSLLHKHFYTRLSYIERP 293
Query: 113 ----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF D++++N+ G I++ FL + I+ R R +VL++K WA
Sbjct: 294 QFIKAKVPIVKFRDKVSGAEFDLNVNNVVG-IRNTFLLRTYAYIENRVRPLVLVIKMWAN 352
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H +N+ GT +SY+L L+ L + QT I+P L+ YP + + + +
Sbjct: 353 YHGLNDASRGTLSSYTLVLMALHYLQTLPEPIIPSLQKKYP-------ECFDSTMQLHLV 405
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF------SGLSLKASELGICPFTGQWE 281
NI ++ S N + L L + FL+ F S + E P + +E
Sbjct: 406 HHAPRNIPKYLSK-----NETPLGDLLLGFLKYFAIEFDWSKDIISVREAKALPRSDDYE 460
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL 341
W N + +E+P+++ N+ARAV E+ + A + + N+ Y+L
Sbjct: 461 -------W--RNKFICVEEPYDR-TNTARAVYERQKFDMIRAEFLRAWVALRDNRDLYSL 510
Query: 342 L 342
L
Sbjct: 511 L 511
>gi|119182218|ref|XP_001242254.1| hypothetical protein CIMG_06150 [Coccidioides immitis RS]
Length = 1069
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E + R+K + L ++ + V FGS + L S D+DI
Sbjct: 151 MKELYKKLLPSAESEQRRIKFVKKLENLLNTQWPGNDIKVHVFGSSGNKLCSSDSDVDI- 209
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ LL D L K G R+ V+HA+VPI+K ++C
Sbjct: 210 -------CITTPFKELEHVCLLADFL----AKNGMERVVCVSHAKVPIVKIWDPELQVAC 258
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 259 DMNVNNTMALENTRMIRTYVEVDERVRPLAMIVKHWTKQRILNDAALGGTLSSYTWICLI 318
Query: 189 LFHFQTCVPAILPPLK 204
+ QT P I+P L+
Sbjct: 319 INFLQTRSPPIVPSLQ 334
>gi|345780735|ref|XP_532573.3| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Canis lupus
familiaris]
Length = 1625
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 931 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 989
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 990 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1046
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1047 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1106
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1107 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1157
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1158 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1205
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1206 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1236
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 134/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 354 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKNSDVNIDIKF--------P 403
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 404 PRMNHPDLLIQVLGILKKSVLYIDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 463
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+VLF Q P +LP
Sbjct: 464 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVLFFLQQRKPPLLPC 523
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 524 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 583
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC Q
Sbjct: 584 TETDNQSNAMKEKHGKSPLTLGTPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 641
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 642 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 671
>gi|119479751|ref|XP_001259904.1| PAP/25A associated domain family [Neosartorya fischeri NRRL 181]
gi|119408058|gb|EAW18007.1| PAP/25A associated domain family [Neosartorya fischeri NRRL 181]
Length = 1008
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 136/305 (44%), Gaps = 33/305 (10%)
Query: 10 ILKDILGMLN---PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
+ KD+L + + P E + R +++ L ++ V FGS + L S D
Sbjct: 36 LTKDMLEVYDRLLPSAESDDRRRQLVRKLEKLFNDQWPGHDIKVHVFGSSGNKLCSSDSD 95
Query: 67 LDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
+DI CI++ K+++ LL ++L K G R+ V+HA+VPI+K
Sbjct: 96 VDI--------CITTTYKELEHVCLLAEVL----AKHGMERVVCVSHAKVPIVKIWDPEL 143
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSL 184
++CD++++N ++ + +ID R R + +++K W K +N+ GT +SY+
Sbjct: 144 RLACDMNVNNTLALENTRMVRTYVEIDERVRPLAMIIKYWTKRRILNDAGLGGTLSSYTW 203
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKY 242
L++ QT P +LP L+ R + +R + +C+F+ S Y
Sbjct: 204 ICLIINFLQTREPPVLPSLQ-------------ARPHKKRVTTDGLVCSFDDDLSSLVGY 250
Query: 243 RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
+ N+ +L L F ++ G L + I G+ L N+ L +E+PF
Sbjct: 251 GRKNKQTLGELLFQFF-RYYGHELDYEKYVISVREGKLISKEEKGWHLLQNNRLCVEEPF 309
Query: 303 EQPEN 307
N
Sbjct: 310 NTSRN 314
>gi|348525522|ref|XP_003450271.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Oreochromis
niloticus]
Length = 538
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 145/320 (45%), Gaps = 49/320 (15%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG--------DLDIS---I 71
E+ R V S L+++ + T++PFGS V N F + G IS +
Sbjct: 201 ENSRLRFLVCSLLKDIATAY--FPECTIKPFGSSV-NGFGKLGCDLDMLLDLDSISGRNV 257
Query: 72 ELSNGSCI-----SSAGKKVKQSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQ 125
+LS S +++ + V QS+L + + + Q G G +Q + +AR P+++F
Sbjct: 258 KLSGLSLEYQMKRANSERAVTQSILSVIGKCVDQFGPGCVGVQKILNARCPLVRFAHQPS 317
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSL 184
CD++ +N ++ L+ ++D R R +V V+ WA+AH + + G + ++SL
Sbjct: 318 GFQCDLTANNRVAMKSTELLYLYGELDPRVRSLVFTVRCWARAHGVTSSIPGAWITNFSL 377
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+++VLF Q P I+P L D L+ + A++ + E N F SD + K
Sbjct: 378 TVMVLFFLQKRSPPIIPTL---------DHLRDLAGPADKSVIE---GNDCTFVSD-FNK 424
Query: 245 I----NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW-LPNNHPLFIE 299
I N +L L F E ++ PF+ +IR P PL I+
Sbjct: 425 IQLQSNTETLEQLLGEFFEFYATF----------PFSRMSLNIRKGKEQNKPEVAPLHIQ 474
Query: 300 DPFEQPENSARAVSEKNLAK 319
+PFE N ++ V+ L +
Sbjct: 475 NPFETSLNVSKNVNASQLDR 494
>gi|291242203|ref|XP_002740998.1| PREDICTED: mKIAA0191 protein-like [Saccoglossus kowalevskii]
Length = 1544
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/381 (22%), Positives = 161/381 (42%), Gaps = 49/381 (12%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
VL+ +L +I+ ++ ETR ++ +L V+ + ++G FGS + R
Sbjct: 269 VLDALLLNIIEQQGLTAQEIETRYNIVKNLNAVISA--DIKGCQFHLFGSSSNGFALRHS 326
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
D++I IE+ G S +L LL L++ Y ++ ++P + F
Sbjct: 327 DVNIDIEIEKGIQTSK--------VLLQLLDILKKSYSYSKVVSHFTVKIPSIHFVDKKS 378
Query: 126 NISCDIS-IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ C I+ ++ + S+ L +ID R R + ++++ W K I+ G+ S++
Sbjct: 379 GLRCIITYVETDASRQTSRLLSLYCEIDPRVRTLGIVLRYWGKLCHIDKQDMGSLPSHAF 438
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
L+V+++ Q C P +LP L +L+ + R+ + F+ S+ ++
Sbjct: 439 PLMVIYYLQQCQPPVLPVLH-----SLISKGETERSKLLGNDSMYSYFDDLSQLSNVWKC 493
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
N SS+ L+V +F L +++ +C + +W + + IEDPF
Sbjct: 494 KNESSVGVLWVGLF-RFYALEFNMNDIVVC-IKQSTPMAKELKKW--STKKITIEDPFMP 549
Query: 305 PENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYANY 364
N AR V+ + + FE RL L+FF P +N
Sbjct: 550 KRNLARCVNSQLV------FEYIQDRLRDA----------------LRFFSLPPTTTSNK 587
Query: 365 NNG--HRRARPQ-----SHKS 378
+ +R+RPQ SHK+
Sbjct: 588 QSSAKAKRSRPQPASNASHKT 608
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 78/339 (23%), Positives = 137/339 (40%), Gaps = 30/339 (8%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L + ++ G P + + R V+ DL + + + A + FGS ++ + D
Sbjct: 913 LNKLCLEMPGTHAPSDREVQNRNNVLRDLERYIRT-QFDDDAQLCLFGSSINCFGFKQSD 971
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
LDI + V + ++ L L++ + + A+VPI+KF
Sbjct: 972 LDICMTFRGVDTTEDLETPVPE-IIESLAAKLKRYNAVYNVIPIPTAKVPIVKFVHRRTQ 1030
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH---------DINNPKTG 177
+ DIS+ N Q ++ L + ID R + + +K +AK + DI + G
Sbjct: 1031 LEADISLYNTLAQHNTRMLAAYANIDVRVQQLGYTIKVFAKVNIFFLIFQRCDIGDASRG 1090
Query: 178 TFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLKGVRANAERQIAEICAF 232
+ +SY+ L++L+ Q P ++P L+++Y P LVD + + + ++
Sbjct: 1091 SLSSYAYILMMLYFLQQRKPPVIPVLQELYKGETKPETLVDGCNAWFYDDIKNLNKVWP- 1149
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPN 292
+Y N S+ L++ L +F K E I Q R W
Sbjct: 1150 --------EY-GTNTESVGELWIGLL-RFYTEEFKFKEHVISIRQKQL-LTRFEKMW--T 1196
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRL 331
+ L IEDPF+ N VS K I +AF+ R
Sbjct: 1197 SKCLAIEDPFDLSHNLGAGVSRKMNTYIMSAFQRGRTRF 1235
>gi|148691104|gb|EDL23051.1| PAP associated domain containing 1, isoform CRA_b [Mus musculus]
Length = 595
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 47/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS-----------------SAGKKVKQS 89
+ PFGS V N F + G DLD+ ++L + + + Q
Sbjct: 238 CVIRPFGSSV-NTFGKLGCDLDMFLDLDETGKLDVHKNTGNFFMEFQVKNVPSERIATQK 296
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 297 ILSVIGECLDNFGPGCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIY 356
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 357 GSLDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTL---- 412
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + +R I E N F D KI N +L L F E F
Sbjct: 413 -----DSLKSIADAEDRCILE---GNNCTFVQD-VNKIQPSGNTETLELLIKEFFEYFGN 463
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 464 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 510
>gi|55731420|emb|CAH92424.1| hypothetical protein [Pongo abelii]
Length = 1244
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 879 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 937
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 938 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 994
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 995 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1054
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1055 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1105
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1106 SLGKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1153
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1154 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1184
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 117/291 (40%), Gaps = 44/291 (15%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTRFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 421 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 LKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
L + +S NR SL L++ L KF
Sbjct: 541 L------------------------------LGSWSPLTLETPNRVSLGQLWLELL-KFY 569
Query: 263 GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
L E IC Q R N W P + IEDPF N AR+++
Sbjct: 570 TLDFALEEYVIC-VRIQDILTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 617
>gi|392865146|gb|EAS30906.2| PAP/25A associated domain family protein [Coccidioides immitis RS]
Length = 1109
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E + R+K + L ++ + V FGS + L S D+DI
Sbjct: 151 MKELYKKLLPSAESEQRRIKFVKKLENLLNTQWPGNDIKVHVFGSSGNKLCSSDSDVDI- 209
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ LL D L K G R+ V+HA+VPI+K ++C
Sbjct: 210 -------CITTPFKELEHVCLLADFL----AKNGMERVVCVSHAKVPIVKIWDPELQVAC 258
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 259 DMNVNNTMALENTRMIRTYVEVDERVRPLAMIVKHWTKQRILNDAALGGTLSSYTWICLI 318
Query: 189 LFHFQTCVPAILPPLK 204
+ QT P I+P L+
Sbjct: 319 INFLQTRSPPIVPSLQ 334
>gi|345314193|ref|XP_001508655.2| PREDICTED: terminal uridylyltransferase 4, partial [Ornithorhynchus
anatinus]
Length = 1528
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 143/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + + R ++++ L + E A + FGS + R
Sbjct: 915 ILDLVCKKCFDELSPPLSEHQNREQILASLERFIRK-EYNDKARLCLFGSSKNGFGFRDS 973
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + LR+ G R + + A+VPI+KFE
Sbjct: 974 DLDICMTLEGHE---NAEKLNCKDIIESLAKILRKHPGLRNILPITTAKVPIVKFEHRRS 1030
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + +D R + + +K +AK DI + G+ +SY+
Sbjct: 1031 GLEGDISLYNTLAQHNTRMLATYAALDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1090
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF K R
Sbjct: 1091 LMVLYFLQQRNPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDDAEELKKRLP 1141
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N SL L++ L ++ +S++ +L + F QW +
Sbjct: 1142 ALAKNTESLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1189
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1190 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1220
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 40/179 (22%), Positives = 87/179 (48%), Gaps = 10/179 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D+ R +++++ ++V+ SL ++ +GS ++ + D++I ++ +
Sbjct: 313 DDFRVREDIVNEMEKIVQ--RSLPDCSLRMYGSSLTKFAFQNSDVNIDVKFPS------- 363
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K +L +L L+ Y ++ HA+VP++ + + ++C +S N + +
Sbjct: 364 -KMSHPDVLIQVLDILKHSALYSDVESDFHAKVPVVFCKDVKSGLTCKVSAGNDVACLTA 422
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
L + +++ R +VL + WA+ I+ G SYS +L+V+F Q P ILP
Sbjct: 423 DLLAALGKLEPVLRPLVLAFRYWARMCHIDCQAEGGIPSYSFALMVIFFLQQRKPPILP 481
>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
Length = 1487
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 98/193 (50%), Gaps = 13/193 (6%)
Query: 5 NVLEPILKDILGMLN---PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + +D+ +N P E+ E R V++ + V ++ A V PFGS+ + L+
Sbjct: 151 NVAEMLHRDVEAFVNYISPTPEENEVRSLVVALITRAV--TQAFPDAEVHPFGSYDTKLY 208
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ + S K+++L + +++ G R++ ++ A+VPI+KF
Sbjct: 209 LPVGDIDLVVHSQ------SMAYSKKEAVLHSIANTMKRAGITDRVRIISKAKVPIVKFV 262
Query: 122 TIHQNISCDISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
T+H NI DISI+ G + +++++ R +VL+VK + +N TG
Sbjct: 263 TLHGNIPVDISINQGNGVTAGTMIKHFLAELPA-LRSLVLIVKSFLSQRSMNEVYTGGLG 321
Query: 181 SYSLSLLVLFHFQ 193
SYS+ LV+ Q
Sbjct: 322 SYSIVCLVISFLQ 334
>gi|26345490|dbj|BAC36396.1| unnamed protein product [Mus musculus]
Length = 585
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 47/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS-----------------SAGKKVKQS 89
+ PFGS V N F + G DLD+ ++L + + + Q
Sbjct: 228 CVIRPFGSSV-NTFGKLGCDLDMFLDLDETGKLDVHKNTGNFFMEFQVKNVPSERIATQK 286
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 287 ILSVIGECLDNFGPGCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIY 346
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 347 GSLDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTL---- 402
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + +R I E N F D KI N +L L F E F
Sbjct: 403 -----DSLKSIADAEDRCILE---GNNCTFVQD-VNKIQPSGNTETLELLIKEFFEYFGN 453
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 454 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 500
>gi|308499953|ref|XP_003112162.1| hypothetical protein CRE_29500 [Caenorhabditis remanei]
gi|308268643|gb|EFP12596.1| hypothetical protein CRE_29500 [Caenorhabditis remanei]
Length = 477
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 110/248 (44%), Gaps = 40/248 (16%)
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+V +P+L+ +S D++IDN + ++ L W Q+D +F + VK WA
Sbjct: 157 YVQKGMIPVLQMVHAETGVSIDVTIDNDTAKRNTQLLCWYGQLDAKFPLLCKAVKAWASK 216
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYP---GNL-VDDLKGVRANAE 223
+ G NS+SL ++VL + Q PA+LP L++++P G + V+ + N
Sbjct: 217 VGVEGASRGRLNSFSLCMMVLSYLQVGTTPAVLPNLQEMFPELNGEINVESDNYTKRNLR 276
Query: 224 RQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHI 283
+I E +F D+ N+SSLA LF+ L ++ F+ +W +
Sbjct: 277 EEIQE-----QGKFKFDE----NKSSLAALFLGCLRYYADFD----------FSTKWISV 317
Query: 284 RSN----TRWLPNNHPL----------FIEDPF-EQPENSARAVSEKN-LAKISNAFEMT 327
++ +W PL +EDPF P N A V + + + +I F
Sbjct: 318 KNGKVLEKQWSEEGEPLNGLPQKCWYIVVEDPFLPTPHNCAGTVQQSDYVERIQMEFREE 377
Query: 328 HFRLTSTN 335
+ R+ TN
Sbjct: 378 YHRILETN 385
>gi|21312970|ref|NP_080433.1| poly(A) RNA polymerase, mitochondrial precursor [Mus musculus]
gi|81916921|sp|Q9D0D3.1|PAPD1_MOUSE RecName: Full=Poly(A) RNA polymerase, mitochondrial; Short=PAP;
AltName: Full=PAP-associated domain-containing protein
1; AltName: Full=Polynucleotide adenylyltransferase;
Flags: Precursor
gi|12847740|dbj|BAB27689.1| unnamed protein product [Mus musculus]
gi|35505240|gb|AAH57643.1| Mitochondrial poly(A) polymerase [Mus musculus]
Length = 585
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 47/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS-----------------SAGKKVKQS 89
+ PFGS V N F + G DLD+ ++L + + + Q
Sbjct: 228 CVIRPFGSSV-NTFGKLGCDLDMFLDLDETGKLDVHKNTGNFFMEFQVKNVPSERIATQK 286
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 287 ILSVIGECLDNFGPGCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIY 346
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 347 GSLDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTL---- 402
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + +R I E N F D KI N +L L F E F
Sbjct: 403 -----DSLKSIADAEDRCILE---GNNCTFVQD-VNKIQPSGNTETLELLIKEFFEYFGN 453
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 454 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 500
>gi|426215544|ref|XP_004002031.1| PREDICTED: terminal uridylyltransferase 4 isoform 2 [Ovis aries]
Length = 1643
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 948 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1006
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1007 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1063
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1064 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1123
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1124 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1174
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1175 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1222
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1223 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1253
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D + R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLKVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 421 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATERNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +KY K NR SL L++ L KF L E IC
Sbjct: 601 TETTNQSNARKEKYGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIKDI- 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSIKRNVARSLN 688
>gi|148691103|gb|EDL23050.1| PAP associated domain containing 1, isoform CRA_a [Mus musculus]
Length = 468
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 47/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS-----------------SAGKKVKQS 89
+ PFGS V N F + G DLD+ ++L + + + Q
Sbjct: 111 CVIRPFGSSV-NTFGKLGCDLDMFLDLDETGKLDVHKNTGNFFMEFQVKNVPSERIATQK 169
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 170 ILSVIGECLDNFGPGCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIY 229
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 230 GSLDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTL---- 285
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + +R I E N F D KI N +L L F E F
Sbjct: 286 -----DSLKSIADAEDRCILE---GNNCTFVQD-VNKIQPSGNTETLELLIKEFFEYFGN 336
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 337 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 383
>gi|426215542|ref|XP_004002030.1| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Ovis aries]
Length = 1643
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 948 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1006
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1007 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1063
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1064 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1123
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1124 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1174
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1175 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1222
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1223 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1253
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D + R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 371 DDLKVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 420
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 421 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 480
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 481 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 540
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A IAE
Sbjct: 541 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATERNSIAEENKAKADQPKDDTKK 600
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +KY K NR SL L++ L KF L E IC
Sbjct: 601 TETTNQSNARKEKYGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIKDI- 658
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 659 LTRENKNW-PKRR-IAIEDPFSIKRNVARSLN 688
>gi|301759929|ref|XP_002915778.1| PREDICTED: terminal uridylyltransferase 4-like [Ailuropoda
melanoleuca]
Length = 1650
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 960 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1018
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1019 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1075
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1076 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1135
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1136 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1186
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1187 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1234
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1235 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1265
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 134/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ EV+ + L ++ +GS ++ + D++I I+
Sbjct: 384 DDLRVRQEIVEEMSEVITTF--LPECSLRLYGSSLTKFALKNSDVNIDIKF--------P 433
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 434 PRMNHPDLLIQVLGILKKSVLYIDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 493
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 494 DLLAALGKLEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 553
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 554 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 613
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC Q
Sbjct: 614 TETDNQSNAMKEKHGKSPLTLGTPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRMQDI 671
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 672 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 701
>gi|73948918|ref|XP_535150.2| PREDICTED: poly(A) RNA polymerase, mitochondrial [Canis lupus
familiaris]
Length = 584
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 141/333 (42%), Gaps = 49/333 (14%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDI 69
L +L ED + R S + ++ + TV PFGS V N F + G DLD+
Sbjct: 192 LNTLLKEFQLTEEDIKLRYLTCSLIEDIAAAY--FLDCTVRPFGSSV-NSFGKLGCDLDM 248
Query: 70 SIELSNGSCISS-----------------AGKKVKQSLLGDLLRALRQKG-GYRRLQFVA 111
++L +++ + + Q +L + L G G +Q +
Sbjct: 249 FLDLDEIGKLNTNKTSGNFLMEFQVKSVPSERVATQKVLSVIGECLDHFGPGCVGVQKIL 308
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+AR P+++F CD++ +N S+ L+ +D R R MV ++ WA+AH +
Sbjct: 309 NARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYIYGALDSRVRAMVFSIRCWARAHSL 368
Query: 172 NNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI- 229
+ G++ ++SL+++V+F Q P ILP L D LK + ++ I E
Sbjct: 369 TSSIPGSWITNFSLTMMVIFFLQRRSPPILPTL---------DYLKTLADAEDKCIIEGH 419
Query: 230 -CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN 286
C F ++ R N +L L F E F + + + I R
Sbjct: 420 NCTFIRDLNRIKPSG----NTETLESLLKEFFEYFGNFAFNKNSINI-------RQGREQ 468
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ P + PL I++PFE N ++ VS+ L K
Sbjct: 469 NK--PESSPLHIQNPFETSLNISKNVSQSQLQK 499
>gi|281353554|gb|EFB29138.1| hypothetical protein PANDA_003791 [Ailuropoda melanoleuca]
Length = 1639
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 960 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1018
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1019 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1075
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1076 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1135
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1136 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1186
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1187 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1234
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1235 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1265
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 134/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ EV+ + L ++ +GS ++ + D++I I+
Sbjct: 384 DDLRVRQEIVEEMSEVITTF--LPECSLRLYGSSLTKFALKNSDVNIDIKF--------P 433
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 434 PRMNHPDLLIQVLGILKKSVLYIDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 493
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 494 DLLAALGKLEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 553
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 554 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 613
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC Q
Sbjct: 614 TETDNQSNAMKEKHGKSPLTLGTPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRMQDI 671
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 672 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 701
>gi|390332645|ref|XP_781520.3| PREDICTED: uncharacterized protein LOC576082 [Strongylocentrotus
purpuratus]
Length = 1331
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 140/324 (43%), Gaps = 47/324 (14%)
Query: 10 ILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDI 69
+ ++I+ P R++ R I++L + +LR A + FGS + R DLDI
Sbjct: 257 VCEEIMLRNQPSRQEMRARDNCIAELEYYIRQ-NALRDARLSLFGSSGNGFGFRNSDLDI 315
Query: 70 SIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
C++ K Q + + L AL++ + + A+VPI+KF
Sbjct: 316 --------CLTFQDMKTGQDIDVGFVIEKLAAALKRNHTLYNIVPIPTAKVPIVKFIHRP 367
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N Q ++ L SQID R R + +K AK DI + G+ +SY+
Sbjct: 368 TRLEGDISLYNTLAQCNTRLLCMYSQIDERVRVLGYSMKLLAKYCDIGDASRGSLSSYAY 427
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFN---------IA 235
+LL ++ Q P ILP L+++Y G+ ++ + E+ +N ++
Sbjct: 428 TLLTIYFLQQRKPPILPVLQELYTGD------------KQPVHEVDGWNAWFFGNLNQLS 475
Query: 236 RFSSDKYRKINRSSLAHLFVSFL----EKFSGLSLKASELGICPFTGQWEHIRSNTRWLP 291
R +++ N+ S+ L++ L E+F S P T R W+
Sbjct: 476 RVWKGQFK--NKESIGSLWLGMLRFYTEEFDFTKYVVSIRQHKPLT------RFEKLWMT 527
Query: 292 NNHPLFIEDPFEQPENSARAVSEK 315
+ IEDPF+ N A S+K
Sbjct: 528 PQKGVAIEDPFDLEHNLGGAASKK 551
>gi|20128857|ref|NP_569904.1| CG11418, isoform A [Drosophila melanogaster]
gi|442614746|ref|NP_001259129.1| CG11418, isoform C [Drosophila melanogaster]
gi|4688672|emb|CAA17688.2| EG:8D8.8 [Drosophila melanogaster]
gi|7290143|gb|AAF45607.1| CG11418, isoform A [Drosophila melanogaster]
gi|364503012|gb|AEW48257.1| FI17515p1 [Drosophila melanogaster]
gi|440216307|gb|AGB94975.1| CG11418, isoform C [Drosophila melanogaster]
Length = 612
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 143/325 (44%), Gaps = 59/325 (18%)
Query: 28 RMKVISDLREVVESVESL-RGATVEPFGSFVSNLFSRWG-DLDISIELSN--GSCIS--- 80
RM+ ++ L +V +++ + A +PFGS V N F R G DLD+ + + G+ I
Sbjct: 192 RMRFLAAL-QVQQAIAGMFPAAQAQPFGSSV-NGFGRMGCDLDLILRFDSDMGAKIPLEA 249
Query: 81 --------------SAGKKVKQ---SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
S G+ Q GD+L G ++ + ARVPI+K+
Sbjct: 250 AVPSRLVYHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIKYHHE 307
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSY 182
H ++ D+S+ NL G S+ L+ ++D R R + ++ WA+ + NP G + +++
Sbjct: 308 HLDLEVDLSMSNLTGFYMSELLYMFGEMDPRVRPLTFTIRRWAQTCGLTNPSPGRWISNF 367
Query: 183 SLSLLVLFHFQTCVPAILPPL----KDIYPGNLVDDLKGVRANAERQIAEICAFNIARFS 238
SL+ LV+F Q ILP + K PG+ G+ R N+ R
Sbjct: 368 SLTCLVMFFLQQLRQPILPTIGALAKAAEPGDSRVTEDGINCTFTR--------NVDRLG 419
Query: 239 SDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC---PFTGQWEHIRSNTRWLPNNHP 295
+R N+SSL+ L + F E +S + + P + P++
Sbjct: 420 ---FRSRNQSSLSELLLQFFEFYSQFDFHNRAISLNEGKPLSK------------PDHSA 464
Query: 296 LFIEDPFEQPENSARAVSEKNLAKI 320
++I +P EQ N ++ VS + ++
Sbjct: 465 MYIVNPLEQLLNVSKNVSLEECERL 489
>gi|312068112|ref|XP_003137061.1| hypothetical protein LOAG_01474 [Loa loa]
Length = 361
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 103/224 (45%), Gaps = 24/224 (10%)
Query: 110 VAHARVPILKFETI-HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ HA+VPI+KF H ++ D+S+ N+ ++ L S++D R + ++ K WAK
Sbjct: 89 IPHAKVPIVKFRCRNHYHLEADVSLYNVLALENTRLLRTYSKLDRRIHQLGIMTKMWAKN 148
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
+I N G+ +SYS ++++ + Q P + P L+++ P E I +
Sbjct: 149 CEIGNASKGSLSSYSYIIMLIHYLQRTNPPVAPFLQELVPPG---------RYREPVIID 199
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
C F K+ NRS++ L++ FL+ F G FT + IR
Sbjct: 200 DCDVYFCSFEDLKWTIHNRSTVGELWIGFLDYF-GTKFD--------FTREVIQIRQTLP 250
Query: 289 WLP-----NNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
L + P+ IEDPF+ N + V K +A I +F ++
Sbjct: 251 LLKLDKGWQSRPIAIEDPFDLTHNLSSGVHSKTMAYIQKSFILS 294
>gi|118404514|ref|NP_001072915.1| poly(A) RNA polymerase GLD2 [Xenopus (Silurana) tropicalis]
gi|123906238|sp|Q0VFA3.1|GLD2_XENTR RecName: Full=Poly(A) RNA polymerase GLD2; AltName:
Full=PAP-associated domain-containing protein 4
gi|110645459|gb|AAI18910.1| PAP associated domain containing 4 [Xenopus (Silurana) tropicalis]
Length = 528
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 134/301 (44%), Gaps = 43/301 (14%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH- 112
GS ++ +R D D+ C+ + + Q + +L K Y RL ++
Sbjct: 247 GSSLNGFGTRSSDADL--------CLVLKDEPMNQHTEARHILSLLHKHFYTRLSYIERP 298
Query: 113 ----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
A+VPI+KF D++++N+ G I++ FL + I+ R R +VL++K WA
Sbjct: 299 QFIKAKVPIVKFRDKVSGAEFDLNVNNVVG-IRNTFLLRTYAYIENRVRPLVLVIKMWAN 357
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H +N+ GT +SY+L L+ L + QT I+P L+ YP + + + +
Sbjct: 358 YHGLNDASRGTLSSYTLVLMALHYLQTLPEPIIPSLQKKYP-------ECFDSTMQLHLV 410
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF------SGLSLKASELGICPFTGQWE 281
NI ++ S N + L L + FL+ F S + E P + +E
Sbjct: 411 HHAPRNIPKYLSK-----NETPLGDLLLGFLKYFAIEFDWSKDIISVREAKALPRSDDYE 465
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL 341
W N + +E+P+++ N+ARAV E+ + A + + N+ Y+L
Sbjct: 466 -------W--RNKFICVEEPYDR-TNTARAVYERQKFDMIRAEFLRAWVALRDNRDLYSL 515
Query: 342 L 342
L
Sbjct: 516 L 516
>gi|21430928|gb|AAM51142.1| SD27341p [Drosophila melanogaster]
Length = 612
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 138/324 (42%), Gaps = 57/324 (17%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDISIELSN--GSCIS---- 80
RM+ ++ L+ A +PFGS V N F R G DLD+ + + G+ I
Sbjct: 192 RMRFLAALQVQQAIAGMFPAAQAQPFGSSV-NGFGRMGCDLDLILRFDSDMGAKIPLEAA 250
Query: 81 -------------SAGKKVKQ---SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
S G+ Q GD+L G ++ + ARVPI+K+ H
Sbjct: 251 VPSRLVYHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIKYHHEH 308
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYS 183
++ D+S+ NL G S+ L+ ++D R R + ++ WA+ + NP G + +++S
Sbjct: 309 LDLEVDLSMSNLTGFYMSELLYMFGEMDPRVRPLTFTIRRWAQTCGLTNPSPGRWISNFS 368
Query: 184 LSLLVLFHFQTCVPAILPPL----KDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS 239
L+ LV+F Q ILP + K PG+ G+ R N+ R
Sbjct: 369 LTCLVMFFLQQLRQPILPTIGALAKAAEPGDSRVTEDGINCTFTR--------NVDRLG- 419
Query: 240 DKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC---PFTGQWEHIRSNTRWLPNNHPL 296
+R N+SSL+ L + F E +S + + P + P++ +
Sbjct: 420 --FRSRNQSSLSELLLQFFEFYSQFDFHNRAISLNEGKPLSK------------PDHSAM 465
Query: 297 FIEDPFEQPENSARAVSEKNLAKI 320
+I +P EQ N ++ VS + ++
Sbjct: 466 YIVNPLEQLLNVSKNVSLEECERL 489
>gi|428671662|gb|EKX72580.1| conserved hypothetical protein [Babesia equi]
Length = 556
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 120/266 (45%), Gaps = 36/266 (13%)
Query: 18 LNPLREDWETRMKVISDLREVVE-SVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNG 76
L P E +E +I++L+ ++E SV G T+ FGS + L+ R D+D
Sbjct: 225 LVPPGEQFERMNFLIANLKPLLERSV----GGTMHTFGSCSNGLWVRGSDIDF------- 273
Query: 77 SCISSAGKKVKQSLLGDLL---RALRQKGGYRRLQFVAHARVPILK-FETIHQNISCDIS 132
C+ K K+ L L+ +L ++Q + ARVPI K F+ N+ CD+S
Sbjct: 274 -CLVIPDCKTKRQWLSKLMLVKSSLLNTDYISKIQII-QARVPIAKLFDNNGVNV-CDVS 330
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
I+N S ++ ++ +D R + +K WAK INN GT +SY+LSL + +
Sbjct: 331 INNTVALNNSLYVTTMTSLDARVAKLGRFIKYWAKCRQINNRAEGTMSSYTLSLQLFYFL 390
Query: 193 QTCVPAILPPLKDIYPG-----NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINR 247
P ILP KDI +L + L + AE I E C KY N+
Sbjct: 391 ANRNPPILPLFKDITRNYSPFEDLDNQLCFISDTAE--IMERC----------KYLGKNQ 438
Query: 248 SSLAHLFVSFLEKFSGLSLKASELGI 273
SL+ L +F + K + GI
Sbjct: 439 ESLSELVFAFFNYYGSEKFKGGDSGI 464
>gi|148691105|gb|EDL23052.1| PAP associated domain containing 1, isoform CRA_c [Mus musculus]
Length = 397
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 127/296 (42%), Gaps = 47/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS-----------------SAGKKVKQS 89
+ PFGS V N F + G DLD+ ++L + + + Q
Sbjct: 40 CVIRPFGSSV-NTFGKLGCDLDMFLDLDETGKLDVHKNTGNFFMEFQVKNVPSERIATQK 98
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 99 ILSVIGECLDNFGPGCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIY 158
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 159 GSLDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTL---- 214
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + +R I E N F D KI N +L L F E F
Sbjct: 215 -----DSLKSIADAEDRCILE---GNNCTFVQD-VNKIQPSGNTETLELLIKEFFEYFGN 265
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 266 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 312
>gi|194912376|ref|XP_001982492.1| GG12706 [Drosophila erecta]
gi|190648168|gb|EDV45461.1| GG12706 [Drosophila erecta]
Length = 613
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 136/325 (41%), Gaps = 59/325 (18%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKKV 86
RM+ ++ L+ A +PFGS V N F R G DLD+ + N S +
Sbjct: 192 RMRFLAALQVQQAIAGMFPAAQAQPFGSSV-NGFGRMGCDLDLILRFDNDMGAKSPVEAA 250
Query: 87 KQSLL----------------------GDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
S L GD+L G ++ + ARVPI+K+ H
Sbjct: 251 VPSRLVYHTKENLSNGRSQTQRHMECFGDMLHLFLP--GVCHVRRILQARVPIIKYHHEH 308
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYS 183
++ D+S+ NL G S+ L+ ++D R R + ++ WA+ + NP G + +++S
Sbjct: 309 LDLEVDLSMSNLTGFYMSELLYMFGEMDPRVRPLTFTIRRWAQTCGLTNPSPGRWISNFS 368
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI---CAF--NIARFS 238
L+ LV+F Q ILP + L + + ++ E C F N+ R
Sbjct: 369 LTCLVMFFLQHLRQPILP---------TIGALAKAAESGDSRVTEDGINCTFTRNVDRLG 419
Query: 239 SDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC---PFTGQWEHIRSNTRWLPNNHP 295
+R N+SSL+ L + F E +S + + P + P++
Sbjct: 420 ---FRSRNQSSLSELLLQFFEFYSQFDFHNRAISLNEGKPLSK------------PDHSA 464
Query: 296 LFIEDPFEQPENSARAVSEKNLAKI 320
++I +P EQ N ++ VS + ++
Sbjct: 465 MYIVNPLEQLLNVSKNVSLEECERL 489
>gi|330946981|ref|XP_003306828.1| hypothetical protein PTT_20081 [Pyrenophora teres f. teres 0-1]
gi|311315491|gb|EFQ85083.1| hypothetical protein PTT_20081 [Pyrenophora teres f. teres 0-1]
Length = 1266
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/334 (23%), Positives = 143/334 (42%), Gaps = 64/334 (19%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P +ED +TR + + ++ ++E+ V FGS + L++ D+DI
Sbjct: 267 MRELYDRLEPKQEDTDTRERFVRKVQRILETEFPGTKIMVHVFGSSGNMLWTSESDVDI- 325
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI + K++++ + L AL K G +R+ + A+V I+K +SCD
Sbjct: 326 -------CIQTPMKRLEE--MHPLAEAL-DKHGMQRVVCIPAAKVRIVKVWDPELQLSCD 375
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLVL 189
I+++N+ ++ + Q+D R R + +++K W K +N+ GT +SY+ L+L
Sbjct: 376 INVNNVAAIENTRLIKTYIQLDDRVRPLAMIIKHWTKRRILNDAGIGGTISSYTWICLIL 435
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD-----KYRK 244
QT P +LP L + P D+ G +++ F+ D Y +
Sbjct: 436 NFLQTRDPPVLPNLHKL-PDRARDETTG-------------QPSLSSFADDVGKLRGYGQ 481
Query: 245 INRSSLAHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWL 290
N+ SL L F + G + + G P GQ E +
Sbjct: 482 DNKESLGQLLFHFFRLYGHEIDYEKEAISVRQGKRIPREDKGWHPGGGQKEGVNR----- 536
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L +E+PF +E+NL ++ +
Sbjct: 537 -----LCVEEPFN---------TERNLGNSADDY 556
>gi|338721674|ref|XP_003364417.1| PREDICTED: terminal uridylyltransferase 4 [Equus caballus]
Length = 1642
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 144/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 952 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1010
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G R + + A+VPI+KFE
Sbjct: 1011 DLDICMTLEGHE---NAEKLNCKEIIESLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1067
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1068 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1127
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1128 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1178
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1179 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1226
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1227 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1257
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/332 (22%), Positives = 132/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +VV + L ++ +G+ ++ + D++I I+
Sbjct: 374 DDLRIRQEIVEEMSKVVTTC--LPECSLRLYGTSLTKFALKSSDVNIDIKF--------P 423
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 424 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 483
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 484 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPS 543
Query: 203 L-----KDIYPGNLVD-DLKGV----------RANAERQIAEICAFNIARFS--SDKYRK 244
L + P + D LKG+ +++ + I N A+ D +K
Sbjct: 544 LLGDWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKTSIAEENKAKADQPKDDTKK 603
Query: 245 INRS-----------------------SLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
R SL L++ L KF L E IC Q
Sbjct: 604 TERDNQSNAMKEKHGKSPLTLEATNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 661
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 662 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 691
>gi|149693612|ref|XP_001490500.1| PREDICTED: terminal uridylyltransferase 4 isoform 2 [Equus caballus]
Length = 1647
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 144/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 952 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1010
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ L + L++ G R + + A+VPI+KFE
Sbjct: 1011 DLDICMTLEGHE---NAEKLNCKEIIESLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1067
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1068 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1127
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1128 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1178
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1179 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1226
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1227 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1257
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/332 (22%), Positives = 132/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +VV + L ++ +G+ ++ + D++I I+
Sbjct: 374 DDLRIRQEIVEEMSKVVTTC--LPECSLRLYGTSLTKFALKSSDVNIDIKF--------P 423
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 424 PKMNHPDLLIQVLGILKKSVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 483
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 484 DLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPS 543
Query: 203 L-----KDIYPGNLVD-DLKGV----------RANAERQIAEICAFNIARFS--SDKYRK 244
L + P + D LKG+ +++ + I N A+ D +K
Sbjct: 544 LLGDWIEGFDPKRMDDFQLKGIVEEKFVKWEYNSSSATEKTSIAEENKAKADQPKDDTKK 603
Query: 245 INRS-----------------------SLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
R SL L++ L KF L E IC Q
Sbjct: 604 TERDNQSNAMKEKHGKSPLTLEATNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 661
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 662 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 691
>gi|330845454|ref|XP_003294600.1| hypothetical protein DICPUDRAFT_85060 [Dictyostelium purpureum]
gi|325074905|gb|EGC28871.1| hypothetical protein DICPUDRAFT_85060 [Dictyostelium purpureum]
Length = 1700
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/233 (27%), Positives = 107/233 (45%), Gaps = 20/233 (8%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L GA + +GSF S + ++D+ + L KK ++LL + L+ Y
Sbjct: 512 LAGAKIRQYGSFFSGISLNESEIDVCLFL----------KKNDKTLLSQVKYILKDTKNY 561
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
++ A+VP L+F NI D+ + KS + +D R RD++LLVK
Sbjct: 562 TIVEISKRAKVPTLRFNEKTTNIHFDMCFNKRLEIYKSLLIKEYVDLDPRCRDLILLVKH 621
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPG-NLVDDLKGVRAN- 221
WA +I + GTF+S+ L+L+V+ QT V P ILP L+ P ++++ ++ N
Sbjct: 622 WATQKNIKDASRGTFSSFCLTLMVINFLQTGVSPPILPNLES--PNKSILEPTSNLKTNF 679
Query: 222 -AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI 273
E + + +F S N+ S+ LF F + + G K + I
Sbjct: 680 IIEEYLVQYYDHTTLKFKSSD----NKLSIDQLFYQFFKYYLGFDFKNLSINI 728
>gi|74183307|dbj|BAE22572.1| unnamed protein product [Mus musculus]
Length = 584
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 126/296 (42%), Gaps = 47/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS-----------------SAGKKVKQS 89
+ PFGS V N F + G DLD ++L + + + Q
Sbjct: 227 CVIRPFGSSV-NTFGKLGCDLDTFLDLDETGKLDVHKNTGNFFMEFQVKNVPSERIATQK 285
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 ILSVIGECLDNFGPGCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIY 345
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 GSLDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTVMVIFFLQRRSPPILPTL---- 401
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
D LK + +R I E N F D KI N +L L F E F
Sbjct: 402 -----DSLKSIADAEDRCILE---GNNCTFVQD-VNKIQPSGNTETLELLIKEFFEYFGN 452
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 453 FAFNKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 499
>gi|402581938|gb|EJW75885.1| hypothetical protein WUBG_13205 [Wuchereria bancrofti]
Length = 228
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 102/212 (48%), Gaps = 34/212 (16%)
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
++ + DI+ +N+ G S L + S++D RF + LLVK WA INN GT NSYS
Sbjct: 5 YEELEIDINCNNVAGIYNSHLLHYYSRVDDRFPALCLLVKHWAINAGINNAMMGTLNSYS 64
Query: 184 LSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD- 240
L L+VL HF C +P +LP L+ +YP NA C+ + D
Sbjct: 65 LILMVL-HFLQCGALPPVLPNLQFLYPSLF---------NAT------CSLDSLELFRDL 108
Query: 241 ----KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---CPFTGQWEHIRSNTRWLPNN 293
R+ N ++ L ++F + F+ K + I C ++ + + NT
Sbjct: 109 PQPLPPREFNTETIGELLIAFFDYFAHFDFKNKAISIRNGCVYSR--DLLADNTM----R 162
Query: 294 HPLFIEDPFEQPENSARAVSE-KNLAKISNAF 324
+FIE+P++Q N+AR V+ +NL I AF
Sbjct: 163 FKIFIEEPYDQ-RNTARCVTSIENLQLIREAF 193
>gi|328873192|gb|EGG21559.1| hypothetical protein DFA_01445 [Dictyostelium fasciculatum]
Length = 946
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 83/313 (26%), Positives = 148/313 (47%), Gaps = 43/313 (13%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS ++ L + DLDI++ +++ S + K + ++L++ ++ + +
Sbjct: 645 FGSSLNGLAFKNSDLDIAL-VTDRPLHSLSNYTFK---VSNVLKS----NNFKNVLAITR 696
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
RVPI++F I ++SCD+SI+N SK ++ QID R R + +++K+WAK IN
Sbjct: 697 TRVPIIRFNDIFSSLSCDLSINNPLAIFNSKMIYDYMQIDIRVRTIAIIIKQWAKVRGIN 756
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGV-----RANAERQIA 227
+ T +SYS +++ Q ILP L+ + G +KG R + I+
Sbjct: 757 DASNNTLHSYSFVNMIIHFMQREEVLILPSLQRMANGQYY-YIKGRRYGDGRVKEDHMIS 815
Query: 228 EI-CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKF--------SGLSLKASELGICPF 276
+ C + N+ + + + K N ++ L +F + + S +S+++ I P
Sbjct: 816 DKNCKYYNNLGQL-REVFGKHNTMTVPELLFAFFQYYALHFDYQNSVISIRSG--VILPA 872
Query: 277 -TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE-KNLAKISNAFEMTHFRLTST 334
T W+ R + IEDPF+ N AR++ + +LA I F M + L S
Sbjct: 873 KTKTWDDKRE--------YFFMIEDPFDTTFNIARSIRKPHHLAAIVKEF-MRAYELLSN 923
Query: 335 NQTRYALLSSLAR 347
N A LS L +
Sbjct: 924 N----APLSELVK 932
>gi|324500041|gb|ADY40033.1| Poly(A) RNA polymerase gld-2 [Ascaris suum]
Length = 1815
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 104/213 (48%), Gaps = 15/213 (7%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
GS ++ + D+D+ + +S V ++ + L L+ R Q +
Sbjct: 1397 VGSSLNGFGTNSSDMDLCLMISRDDLDQRTDALVILKMVAEALVNLKSI----REQVLIP 1452
Query: 113 ARVPIL--KFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
A+VPIL KF ++ D++++N + L++ S D R R +V +VKEWAK D
Sbjct: 1453 AKVPILRLKFMEPFAELAVDLNVNNSVAIRNTHLLYYYSLFDWRVRPIVTVVKEWAKRRD 1512
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+N+ TF SYSL L+V+ +FQ V P +LP L+ +YP ++ R + R++
Sbjct: 1513 MNDANRSTFTSYSLVLMVIHYFQCGVDPPLLPSLQRLYP------VRFDRHSDVRKLDMS 1566
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
N A YR+ N +L L + FL+ ++
Sbjct: 1567 VPLNPAPSVMWPYRETN--TLGELLLGFLDYYA 1597
>gi|431896895|gb|ELK06159.1| Terminal uridylyltransferase 4 [Pteropus alecto]
Length = 1522
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 145/331 (43%), Gaps = 37/331 (11%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 900 ILDLVCKRCFDELSPPFSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 958
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 959 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1015
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1016 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1075
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR-- 243
L+VL+ Q P ++P L++I+ G + +R + AF + K R
Sbjct: 1076 LMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAFFFDKTEELKKRLP 1126
Query: 244 --KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
N +L L++ L ++ +S++ +L + F QW +
Sbjct: 1127 SLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW-----------TS 1174
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N VS K I AF
Sbjct: 1175 KCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1205
>gi|307175913|gb|EFN65726.1| Poly(A) RNA polymerase, mitochondrial [Camponotus floridanus]
Length = 558
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 150/349 (42%), Gaps = 39/349 (11%)
Query: 12 KDILGMLNPLRE-----DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
K I G + L E D ETR++ + + V PFGS ++ + D
Sbjct: 165 KSISGQIIDLYEALKLNDLETRLRFHTAYHLEQYFSRLFQNTKVLPFGSSLNGFGRKRCD 224
Query: 67 LDI-----SIELSNGSC-ISSAGKKVKQSL------LGDLLRALRQK--GGYRRLQFVAH 112
LD+ +IE +N + + K +K S ++L + Q G ++ +
Sbjct: 225 LDLVLLPDNIEENNAASRLVFHTKPMKLSERHETREFMEILASTMQHFIPGVCNVRKILE 284
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+KF + NI CD+S N+ + L+ +ID R R +V +++WAK +I
Sbjct: 285 ARVPIIKFLYEYTNIECDLSTTNMAAVYMCELLYLYGEIDWRVRPLVTAIRKWAKNQEIT 344
Query: 173 NPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA 231
+ G + ++SLSLLVLF+ Q ILP L+ + DD++ + C
Sbjct: 345 SDVPGPWITNFSLSLLVLFYLQQ--KNILPSLRVLKTYATSDDIRCTENGID------CT 396
Query: 232 F--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW 289
F N+ + + K N+ +L L F + S GIC G IR
Sbjct: 397 FLRNLEKLPPEYKYKSNQDNLESLLHGFFDYISTFDFHTK--GICIREGV--PIRK---- 448
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTR 338
P+ L I +P E N + V+ L +I+ + L +T+ +R
Sbjct: 449 -PSRSALHITNPLETTLNVCKNVNIYELNRITEKAHDAIYILETTDNSR 496
>gi|354494619|ref|XP_003509434.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Cricetulus
griseus]
Length = 1494
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 137/310 (44%), Gaps = 50/310 (16%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIKQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGHETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L RALR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARALRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRSPPVIPVLQEIY 1195
Query: 208 PGN-----LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
G LVD G QI E+ A+ +Y + N S+ L++ L ++
Sbjct: 1196 RGEKKPEILVD---GWNIYFFDQIDELPAY------WPEYGR-NTESVGQLWLGLLRFYT 1245
Query: 263 G--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
+S++ L + F QW + + IEDPF+ N +S
Sbjct: 1246 EEFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSR 1293
Query: 315 KNLAKISNAF 324
K I AF
Sbjct: 1294 KMTNFIMKAF 1303
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 318 EIKHVMESVFQHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAIMS------QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDAFTDVDADFHARVPVVVCRDKQSGLLCKVSAGNENACLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQREEPLLP----VYLGSW 485
Query: 212 VDDL 215
+++
Sbjct: 486 IEEF 489
>gi|324500027|gb|ADY40027.1| Poly(A) RNA polymerase gld-2 [Ascaris suum]
Length = 1249
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 104/213 (48%), Gaps = 15/213 (7%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
GS ++ + D+D+ + +S V ++ + L L+ R Q +
Sbjct: 831 VGSSLNGFGTNSSDMDLCLMISRDDLDQRTDALVILKMVAEALVNLKSI----REQVLIP 886
Query: 113 ARVPIL--KFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
A+VPIL KF ++ D++++N + L++ S D R R +V +VKEWAK D
Sbjct: 887 AKVPILRLKFMEPFAELAVDLNVNNSVAIRNTHLLYYYSLFDWRVRPIVTVVKEWAKRRD 946
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
+N+ TF SYSL L+V+ +FQ V P +LP L+ +YP ++ R + R++
Sbjct: 947 MNDANRSTFTSYSLVLMVIHYFQCGVDPPLLPSLQRLYP------VRFDRHSDVRKLDMS 1000
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
N A YR+ N +L L + FL+ ++
Sbjct: 1001 VPLNPAPSVMWPYRETN--TLGELLLGFLDYYA 1031
>gi|326672376|ref|XP_692256.3| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Danio rerio]
Length = 582
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 130/293 (44%), Gaps = 42/293 (14%)
Query: 50 VEPFGSFVSNLFSRWG-DLDISIELS----------NGSCIS--------SAGKKVKQSL 90
+ PFGS V N F + G D+D+ ++L +GS +S + + V QS+
Sbjct: 223 IRPFGSTV-NSFGKLGCDVDMILDLDGIYARSQKKVSGSGLSLEYQVKTGPSERAVTQSI 281
Query: 91 LGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L + + + Q G G +Q + AR PI++F CD++ +N S+ LF
Sbjct: 282 LSVVGKCVDQFGPGCVGVQNILQARCPIVRFAHQPSGFQCDLTANNKVAMKSSELLFLYG 341
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYP 208
+D R R +V V+ WA+AH I + G + ++SL+++V+F Q PA+LP L
Sbjct: 342 HLDPRVRHLVFSVRCWARAHSITSSIPGAWITNFSLTVMVVFFLQQRSPAMLPTL----- 396
Query: 209 GNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKA 268
D LK + +++ + E I S + N +L L F E +
Sbjct: 397 ----DRLKELAGPSDKCVIEGNDCTIVSDLSKIALQKNTDTLEKLLQEFFEFY------- 445
Query: 269 SELGICPFTGQWEHIRSNT-RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
G PF +IR + P L I++PFE N ++ V+ L +
Sbjct: 446 ---GNFPFNKASINIRKGKEQSKPEAAALHIQNPFEATLNVSKNVNAAQLERF 495
>gi|124802317|ref|XP_001347437.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|23495017|gb|AAN35350.1| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 615
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 134/282 (47%), Gaps = 40/282 (14%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-KGGY 104
+ + V PFGS ++ + R D+DI I++ +K + + L + L G
Sbjct: 316 KNSYVTPFGSVINGFWMRNSDIDICIQIP-----ILLNRKDQITFLKKICLLLNNFNNGV 370
Query: 105 RRLQFVAHARVPILKFETIHQN----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+F A+VPI+ F ++ +SCDIS++N+ I SK + ID R + M +
Sbjct: 371 IEQRF--SAKVPIIHFYCNNREKSFELSCDISVNNILAVINSKLIQKYVAIDKRLQTMGI 428
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVR 219
++K W+K +IN+ G +S+SL L+++ Q P ILP L+DI ++
Sbjct: 429 VLKYWSKIRNINDRSKGFLSSFSLILMIIHFLQNVAEPKILPSLQDI----------SIK 478
Query: 220 ANAE-----RQIAEICAFNIARFSSDKYRKINRS-------SLAHLFVSFLEKFSGLSLK 267
N + + C +I D+ +++N S ++ L + F KF G K
Sbjct: 479 RNEKPFYIMGVDCKFCQDDIV--IQDELKRLNNSIHNNLYVDISTLLIEFF-KFYGYKYK 535
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSA 309
+ + I G +++ ++ ++ ++ LF+++PFE +N A
Sbjct: 536 SGIIAIRDINGYYQNFQTLKKF--ESYFLFVDNPFEVGKNVA 575
>gi|354494617|ref|XP_003509433.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Cricetulus
griseus]
Length = 1477
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 137/310 (44%), Gaps = 50/310 (16%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIKQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGHETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L RALR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARALRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRSPPVIPVLQEIY 1195
Query: 208 PGN-----LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
G LVD G QI E+ A+ +Y + N S+ L++ L ++
Sbjct: 1196 RGEKKPEILVD---GWNIYFFDQIDELPAY------WPEYGR-NTESVGQLWLGLLRFYT 1245
Query: 263 G--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
+S++ L + F QW + + IEDPF+ N +S
Sbjct: 1246 EEFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSR 1293
Query: 315 KNLAKISNAF 324
K I AF
Sbjct: 1294 KMTNFIMKAF 1303
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 318 EIKHVMESVFQHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAIMS------QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDAFTDVDADFHARVPVVVCRDKQSGLLCKVSAGNENACLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQREEPLLP----VYLGSW 485
Query: 212 VDDL 215
+++
Sbjct: 486 IEEF 489
>gi|409043231|gb|EKM52714.1| hypothetical protein PHACADRAFT_261315 [Phanerochaete carnosa
HHB-10118-sp]
Length = 865
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 139/328 (42%), Gaps = 62/328 (18%)
Query: 29 MKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+ + D+R+++E + +R T+EP FGS + R D+D+ C+ +
Sbjct: 61 LSIKEDVRKLLERL--IR--TIEPDSRLLSFGSTANGFSLRNSDMDLC-------CLIDS 109
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ-----NISCDISIDNLC 137
++ + L ++L L + ++ + HAR+PI+K Q I+CDI +N
Sbjct: 110 EDRLPATDLVNMLGDLFARETKFHIKPLPHARIPIVKLSLDPQPGLPYGIACDIGFENRL 169
Query: 138 GQIKSKFLFWISQIDG-RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL-FHFQTC 195
++ L + ID R R +VL +K W K IN+P GT +SY LLV+ F
Sbjct: 170 ALENTRLLMCYAMIDPMRVRTLVLFLKVWCKRRKINSPYKGTLSSYGYVLLVIYFLVHVK 229
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS-----DKYRKINRSSL 250
P +LP L+ I P L+ + ++ I +N+ F ++ N S+
Sbjct: 230 NPPVLPNLQQIPP------LRPI----SQEETHINGYNVWFFDDVNLLRQRWHSQNTESV 279
Query: 251 AHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPL 296
A L + F + +S GL K S+ W++ N L
Sbjct: 280 AELLIDFFKFYSRDFAYNTGVASIRAGLLTKESK--------GWDNDPDKGTARERNR-L 330
Query: 297 FIEDPFEQPENSARAVSEKNLAKISNAF 324
IEDPFE N AR V+ L I F
Sbjct: 331 CIEDPFETNFNVARCVTRDGLYTIRGEF 358
>gi|358414990|ref|XP_588743.5| PREDICTED: poly(A) RNA polymerase, mitochondrial [Bos taurus]
gi|359071446|ref|XP_002692192.2| PREDICTED: poly(A) RNA polymerase, mitochondrial [Bos taurus]
Length = 583
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 131/297 (44%), Gaps = 47/297 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIEL----------SNGSCISS-------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L ++G+ + + + Q
Sbjct: 227 CAVRPFGSSV-NSFGKLGCDLDMFLDLDEIGKFTAQKTSGNFLMEFQVKNVPSERVATQK 285
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L Q G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 ILSVIGECLDQFGPGCVGVQRILNARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYMY 345
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 GALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 401
Query: 208 PGNLVDDLKGVRANAERQIAEI--CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
D LK + ++ I E C F ++ R + N +L L F E F
Sbjct: 402 -----DYLKTLADAEDKCIIEGHNCTFVGDLNRIKPSR----NTETLELLLKEFFEYFGN 452
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P + PL I++PFE N ++ VS+ L K
Sbjct: 453 FAFNKNSINI-------RQGREQNK--PESSPLHIQNPFETSLNVSKNVSQSQLQKF 500
>gi|440911284|gb|ELR60972.1| Poly(A) RNA polymerase, mitochondrial, partial [Bos grunniens
mutus]
Length = 580
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 128/297 (43%), Gaps = 47/297 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L ++ + + Q
Sbjct: 224 CAVRPFGSSV-NSFGKLGCDLDMFLDLDEIGKFTAQKTSGNFLMEFQVKNVPSERVATQK 282
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L Q G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 283 ILSVIGECLDQFGPGCVGVQRILNARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYMY 342
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 343 GALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 398
Query: 208 PGNLVDDLKGVRANAERQIAEI--CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
D LK + ++ I E C F ++ R + N +L L F E F
Sbjct: 399 -----DYLKTLADAEDKCIIEGHNCTFVGDLNRIKPSR----NTETLELLLKEFFEYFGN 449
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + I R + P + PL I++PFE N ++ VS+ L K
Sbjct: 450 FAFNKNSINI-------RQGREQNK--PESSPLHIQNPFETSLNVSKNVSQSQLQKF 497
>gi|410963394|ref|XP_003988250.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Felis catus]
Length = 584
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 138/329 (41%), Gaps = 41/329 (12%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDI 69
L +L L E+ + R S + ++ + TV PFGS V N F + G DLD+
Sbjct: 192 LNTLLKELQLTEENTKLRYLTCSLIEDIAAAY--FLDCTVRPFGSSV-NSFGKLGCDLDM 248
Query: 70 SIELSNGSCISSAG-----------------KKVKQSLLGDLLRALRQKG-GYRRLQFVA 111
++L S++ + Q +L + L G G +Q +
Sbjct: 249 FLDLDEIGKFSASKTSGNFLMEFQVKNVPSERIATQKILSVIGECLDHFGPGCVGVQKIL 308
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+AR P+++F CD++ +N S+ L+ +D R R +V ++ WA+AH +
Sbjct: 309 NARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYIYGALDSRVRALVFSIRCWARAHSL 368
Query: 172 NNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
+ G++ ++SL+++V+F Q P ILP L D LK + ++ + E
Sbjct: 369 TSSIPGSWITNFSLTMMVIFFLQRRSPPILPTL---------DSLKTLADAEDKCVIEGH 419
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL 290
R + N +L L F E F + + + I R +
Sbjct: 420 NCTFVRDLNKIKPSGNTETLELLLKEFFEYFGNFAFNKNSINI-------RQGREQNK-- 470
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAK 319
P + PL I++PFE N ++ VS+ L K
Sbjct: 471 PESSPLHIQNPFETSLNISKNVSQSQLQK 499
>gi|378730228|gb|EHY56687.1| poly(A) polymerase, variant [Exophiala dermatitidis NIH/UT8656]
gi|378730229|gb|EHY56688.1| poly(A) polymerase [Exophiala dermatitidis NIH/UT8656]
Length = 1091
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 137/315 (43%), Gaps = 32/315 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++ + L P E + R + I+ L +++ V FGS +NL + D+D+
Sbjct: 151 MQKLYETLQPTAESEQRRSQFINKLDKILRERWPTSAINVNVFGSTGNNLGTSDSDVDV- 209
Query: 71 IELSNGSCISSAGKKVKQSL-LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ + DLL K G R+ V+ A+VPI+K ++C
Sbjct: 210 -------CITTDCKEMEHVCSIADLL----AKHGMERVVCVSSAKVPIVKIWDPELQVAC 258
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
DI+++N ++ + ID R R + +++K WAK +N+ GT +SY+ L
Sbjct: 259 DINVNNPLALENTELVRTYVSIDSRVRPLAMIIKYWAKRRILNDAALGGTLSSYTWICLA 318
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI--- 245
L QT P ILP L+ P L GV + +R + D YR
Sbjct: 319 LNFLQTRDPPILPTLQQ-QPHLEPKFLAGVNVSFDRDV-------------DAYRGFGAR 364
Query: 246 NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQP 305
N+SSL L F ++ G L + + G+ + L ++ L +E+PF
Sbjct: 365 NKSSLGELLFHFF-RYYGHELDFEQSVVSVRLGRVTTKVEKSWHLLQDNRLCVEEPFNIS 423
Query: 306 ENSARAVSEKNLAKI 320
N A + ++ I
Sbjct: 424 RNLANTADDTSMRGI 438
>gi|290991229|ref|XP_002678238.1| caffeine-induced death protein 1 [Naegleria gruberi]
gi|284091849|gb|EFC45494.1| caffeine-induced death protein 1 [Naegleria gruberi]
Length = 662
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 113/254 (44%), Gaps = 40/254 (15%)
Query: 87 KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
K++ + L + L K Y ++ + AR+PI+ F + I+CDI ++N+ ++ +
Sbjct: 407 KKNYVSQLKQFLESKLNYTDVKGIFTARIPIVTFTEQNLKINCDIGVNNILAVYNTRLIG 466
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
ID R + ++ L+K W+K IN+P GT +SY L ++V+ Q C +LP L+D
Sbjct: 467 LYCNIDIRCKQLIFLIKYWSKQRCINDPFGGTLSSYCLVIMVIHLLQQC--DVLPFLQD- 523
Query: 207 YPGNLVDDLKGVRANAERQIAE-------ICAFNIARFSSDKYRKINRSSLAHLFVSFLE 259
K V N + +I + C+F + +K SL L + F +
Sbjct: 524 ---------KTVFTNMKTKIVDGLDGNNYDCSFEESLDEINKKITKKDDSLGSLLLKFFK 574
Query: 260 KFS--------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF-IEDPFEQPENSAR 310
++ +S++ S G + + W LF +EDPFE N+AR
Sbjct: 575 YYAFEFDYENNAISIRNS--------GNRIFSKEDKSW----KALFAVEDPFETEFNTAR 622
Query: 311 AVSEKNLAKISNAF 324
VS L I F
Sbjct: 623 NVSITGLDAIRYEF 636
>gi|240273092|gb|EER36615.1| PAP/25A associated domain family [Ajellomyces capsulatus H143]
Length = 839
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 96/197 (48%), Gaps = 14/197 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E R+K + L ++ V FGS + L S D+DI
Sbjct: 437 MRELYDRLLPSEESESRRLKFVDKLENLLNKQWPGNNIRVHVFGSSGNKLCSSDSDVDI- 495
Query: 71 IELSNGSCISSAGKKV-KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K++ K +L D L K G R+ V+HARVPI+K ++C
Sbjct: 496 -------CITTTYKELEKVCILADFL----AKSGMERVVCVSHARVPIVKIWDPELRLAC 544
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 545 DMNVNNTLALENTRMIRTYVEIDERVRQLAMIVKYWTKRRILNDAALGGTLSSYTWICLI 604
Query: 189 LFHFQTCVPAILPPLKD 205
+ QT P ILP L++
Sbjct: 605 INFLQTRNPPILPSLQE 621
>gi|413934364|gb|AFW68915.1| hypothetical protein ZEAMMB73_981239 [Zea mays]
Length = 780
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 113/257 (43%), Gaps = 29/257 (11%)
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
C+S K++ + + L + Q G + +Q + ARVPI+K +SCDI ++NL
Sbjct: 503 CLSIDNKEMSKVDIILKLADIFQAGNLQNIQPLTRARVPIVKLMDPKTGLSCDICVNNLL 562
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+ +K L QID R + + +VK WAK +N GT +SY+ ++ + Q +
Sbjct: 563 AVVNTKLLRDYGQIDKRLQQLAFIVKHWAKTRRVNETYQGTLSSYAYVIMCIHLLQ--LR 620
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEI-CAFNIARFSSDKYRKINRSSLAHLFVS 256
ILP L+++ A ++ EI CA+ + Y NR +++ L S
Sbjct: 621 RILPCLQEM------------EATYYVKVEEINCAYFDQVDKLNNYGAHNRDTVSRLLWS 668
Query: 257 FLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENS 308
F ++ +S++ + I W N R H + IEDPFE +
Sbjct: 669 FFHYWAYEHDYTRDVISIRTGRI-ISKERKDWTRRVGNDR-----HLICIEDPFEISHDL 722
Query: 309 ARAVSEKNLAKISNAFE 325
R V + + + FE
Sbjct: 723 GRVVDKFTIKILREEFE 739
>gi|341876924|gb|EGT32859.1| hypothetical protein CAEBREN_29455 [Caenorhabditis brenneri]
Length = 473
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 129/284 (45%), Gaps = 19/284 (6%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GSF + + DLD + + + G +Q L ++ + LR+ FV
Sbjct: 104 GSFAAGVDIERSDLDFVVNVPS----LKEGNPFQQ--LMEMKKELRKFNNIFEKVFVQKG 157
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
VP+LK + +S D+S+DN + +K L Q+D RF + +K WA +
Sbjct: 158 HVPVLKLTDRDRKVSIDVSMDNGTSKRNTKLLSLYGQVDARFPLLCKAMKAWASKVGVEG 217
Query: 174 PKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFN 233
K NS+SL L+++ + Q + +LP L++I+P L + N E + +
Sbjct: 218 AKRARLNSFSLCLMLIQYLQ--MQKVLPNLQEIFP-ELNGEFLVENDNYEEKDLKEKIIK 274
Query: 234 IARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP---FTGQWEHIRSNTRWL 290
+F D+ N+SSLA LF+ FL+ ++ K + + +W+ +
Sbjct: 275 EGKFKLDE----NKSSLAALFLGFLKYYADFDFKKYWISVRNGRIMEKRWDEEGKPLDGM 330
Query: 291 PN-NHPLFIEDPF-EQPENSARAVSEKN-LAKISNAFEMTHFRL 331
P+ N + +EDPF + P N A V + + + KI F+ + R+
Sbjct: 331 PDKNRYIVVEDPFLKVPRNCAGTVRQTDYVEKIQFEFQQEYDRI 374
>gi|145235221|ref|XP_001390259.1| PAP/25A associated domain family [Aspergillus niger CBS 513.88]
gi|134057940|emb|CAK47817.1| unnamed protein product [Aspergillus niger]
Length = 1076
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 133/299 (44%), Gaps = 30/299 (10%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
++ L P E + R +++ L ++ V FGS + L S D+DI
Sbjct: 122 EVYDQLLPSAESDDRRRQLVRKLEKLFNDQWPGCNIKVHVFGSSGNKLCSSDSDVDI--- 178
Query: 73 LSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
CI++ K+++ LL ++L + G R+ V+HA+VPI+K ++CD+
Sbjct: 179 -----CITTTYKELEHVCLLAEVL----ARHGMERVVCVSHAKVPIVKIWDPELRLACDM 229
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLF 190
+++N ++ + ++D R R + +++K W K +N+ GT +SY+ L++
Sbjct: 230 NVNNTMALENTRMVRTYVELDERVRPLAMIIKHWTKRRILNDAGLGGTLSSYTWICLIIN 289
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKINRS 248
QT P ILP L+ R + ++ A+ +C+F+ S Y + N+
Sbjct: 290 FLQTRDPPILPSLQ-------------ARPHKKKLTADGIVCSFDDDLDSLIGYGRKNKQ 336
Query: 249 SLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
SL L F K+ G L I G+ + L N+ L +E+PF N
Sbjct: 337 SLGELLFQFF-KYYGHELDYERHVISVREGKLLSKEAKGWHLLQNNRLCVEEPFNTSRN 394
>gi|449454502|ref|XP_004144993.1| PREDICTED: uncharacterized protein LOC101204551 [Cucumis sativus]
gi|449521808|ref|XP_004167921.1| PREDICTED: uncharacterized LOC101204551 [Cucumis sativus]
Length = 763
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 155/373 (41%), Gaps = 57/373 (15%)
Query: 2 GSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
G ++L L I L P E+ E + +++ L ++V V A + FGS ++
Sbjct: 422 GDIDMLTIPLLRIYESLIPPEEEKEKQRQLLISLEKLV--VNEWPHAHLFLFGSCANSFG 479
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFV----------- 110
D+D+ C+ + +S + L + Q ++ +Q +
Sbjct: 480 VSNSDVDV--------CLVLRDADIDKSEILLKLAEILQSANFQNVQVMKWLYASTWDIM 531
Query: 111 --AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
ARVPI+K + +SCDI I+N+ + +K L +QID R + +VK WAK+
Sbjct: 532 ALTRARVPIIKLKDPVTGVSCDICINNVLAVVNTKLLRDYAQIDVRLPQLAFIVKHWAKS 591
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD---IYPGNLVDDLKGVRANAERQ 225
+N GT +SY+ L+ + Q P ILP L++ + +VD+++
Sbjct: 592 RGVNETYQGTLSSYAYVLMCIHFLQHRDPPILPCLQETKIVTYHKIVDNIE--------- 642
Query: 226 IAEICAF-----NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW 280
CA+ + F SD N+ S+A L F ++ A+ + + T
Sbjct: 643 ----CAYFDQVEKLKTFGSD-----NKESVARLVWGFFHYWAYCHDYANTV-VSVRTKNT 692
Query: 281 EHIRSNT---RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT 337
R+ R + H + IEDPFE + R V + ++ + FE R + QT
Sbjct: 693 VSKRAKDWTRRIGKDRHLICIEDPFETSHDLGRVVDKYSIKVLREEFE----RAATILQT 748
Query: 338 RYALLSSLARPFI 350
L PF+
Sbjct: 749 YPNPCEKLFEPFV 761
>gi|322967050|sp|D2HS90.1|STPAP_AILME RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|281352583|gb|EFB28167.1| hypothetical protein PANDA_014931 [Ailuropoda melanoleuca]
Length = 869
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 119/275 (43%), Gaps = 38/275 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 317 GDLGKALELAEALSGEKTEGVAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 374
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 375 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 433
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSD 240
+LLV++ QT P +LP + + + E + E+ C+F R +S
Sbjct: 434 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCSF--PRDASG 480
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----QWEHIRSNTRWL 290
N+ L+ L F S L+ S L + P G +WE +R
Sbjct: 481 LEPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGDLPSNRWEGLRLG---- 536
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
P+ ++DPF+ N A V+ + ++ N+ +
Sbjct: 537 ----PMNLQDPFDLSHNVAANVTSRVAGRLQNSCQ 567
>gi|410929987|ref|XP_003978380.1| PREDICTED: terminal uridylyltransferase 7-like [Takifugu rubripes]
Length = 1238
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 96/205 (46%), Gaps = 5/205 (2%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
+VL + + P + R ++ DL ++ A ++ FGS + R
Sbjct: 787 SVLNKVCEQCYTDFAPDELEMGVRELILKDLETFIK--RQFPAAQLQLFGSSKNGFGFRQ 844
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + L I+ SL+ L R LR+ G + + + A+VPI+KF +
Sbjct: 845 SDLDICMVLEGQETINDVDCI---SLIESLARLLRKHSGVKNVLPITTAKVPIVKFYHVQ 901
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N + L + ID R + + ++K +AK DI + G+ +SY+
Sbjct: 902 TGLEGDISLYNTLALHNTHLLASYAAIDRRVKILCYIMKVFAKMCDIGDASRGSLSSYAY 961
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPG 209
+L+ LF Q P ++P L++IY G
Sbjct: 962 TLMALFFLQQRNPPVIPVLQEIYDG 986
>gi|366992111|ref|XP_003675821.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
gi|342301686|emb|CCC69457.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
Length = 586
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 99/191 (51%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ D + ++P RE+ E+R + IS +R V+ + A + FGS+ ++L+ D+D
Sbjct: 181 IADFVSYISPSREEIESRNQTISKVRNAVKQL--WPDADLHVFGSYATDLYLPGSDIDCV 238
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S AG K ++ L L L+Q+G +++ +A RVPI+KF NI D
Sbjct: 239 IN-------SKAGDKENRNSLYSLASFLKQQGLATQIEVIAKTRVPIIKFVEPESNIHID 291
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +S+ + ++
Sbjct: 292 VSFERTNGLEAAKLIREWLQDTPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFSI-ICIV 349
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 350 FSFLQMHPRII 360
>gi|301780024|ref|XP_002925431.1| PREDICTED: u6 snRNA-specific terminal uridylyltransferase 1-like
[Ailuropoda melanoleuca]
Length = 903
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 119/275 (43%), Gaps = 38/275 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 349 GDLGKALELAEALSGEKTEGVAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 406
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 407 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 465
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSD 240
+LLV++ QT P +LP + + + E + E+ C+F R +S
Sbjct: 466 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCSF--PRDASG 512
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----QWEHIRSNTRWL 290
N+ L+ L F S L+ S L + P G +WE +R
Sbjct: 513 LEPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGDLPSNRWEGLRLG---- 568
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
P+ ++DPF+ N A V+ + ++ N+ +
Sbjct: 569 ----PMNLQDPFDLSHNVAANVTSRVAGRLQNSCQ 599
>gi|292627234|ref|XP_001335519.3| PREDICTED: terminal uridylyltransferase 4-like, partial [Danio
rerio]
Length = 653
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 120/283 (42%), Gaps = 34/283 (12%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS + R DLDI + L +A K + ++ L + L++ G R + +
Sbjct: 7 FGSSKNGFGFRDSDLDICMTLEGHD---TAEKLNCKEIIEGLAKVLKKHTGLRNILPITT 63
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+KFE + DIS+ N Q ++ L + ID R + + +K +AK DI
Sbjct: 64 AKVPIVKFEHRQSGLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIG 123
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
+ G+ +SY+ L+VL+ Q P ++P L++I+ GN +R + AF
Sbjct: 124 DASRGSLSSYAYILMVLYFLQQRQPPVIPVLQEIFDGN---------TTPQRMVDGWNAF 174
Query: 233 NIARFSSDKYR----KINRSSLAHLFVSFLEKFS-GLSLKASELGI------CPFTGQWE 281
+ R NR ++ L++ L ++ K + I F QW
Sbjct: 175 FFDDLDELRRRLPELHQNRETVGELWLGLLRFYTEEFDFKEHVISIRQRKRLTTFEKQW- 233
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 234 ----------TSKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 266
>gi|402589923|gb|EJW83854.1| hypothetical protein WUBG_05236 [Wuchereria bancrofti]
Length = 441
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 72/282 (25%), Positives = 124/282 (43%), Gaps = 35/282 (12%)
Query: 53 FGSFVSNLFSRWG-DLDISIELSNG---SCISSAGKKVKQSLLGDLLRALRQKGGYRRLQ 108
FGS N F G D DI ++ G I SA ++ + +R+ +
Sbjct: 114 FGS-AGNGFGLLGSDADICLQFGAGVRHEDIDSA------EVICKIAEVIRKMPNVVYVC 166
Query: 109 FVAHARVPILKFETI-HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+ HA+VPI+KF H N+ D+S+ N+ ++ L S++D R + ++ K WAK
Sbjct: 167 EIPHAKVPIVKFRCRNHYNLEADVSLYNVLALENTRLLRTYSKLDRRIHQLGIMTKMWAK 226
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
+I N G+ +SYS ++++ + Q P + P L+++ P E I
Sbjct: 227 NCEIGNASKGSLSSYSYIIMLIHYLQRTDPPVAPFLQEVAPPG---------RYREPIII 277
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
+ C F ++ NR ++ L++ FL+ F A++ FT + IR
Sbjct: 278 DSCDVYFCSFKDLEWTVHNRLTVGELWIGFLDYF------ATKFD---FTREVVQIRQTP 328
Query: 288 RWLP-----NNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + P+ IEDPF+ N + V K +A I +F
Sbjct: 329 PLMKLDKGWQSRPIAIEDPFDLSHNLSSGVHSKTMAYIQKSF 370
>gi|414884674|tpg|DAA60688.1| TPA: hypothetical protein ZEAMMB73_903036 [Zea mays]
gi|414884675|tpg|DAA60689.1| TPA: hypothetical protein ZEAMMB73_903036 [Zea mays]
Length = 153
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 79/144 (54%), Gaps = 3/144 (2%)
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVKQSL--LGDLLRALRQKGGYRRLQFVAHARVP 116
+LF+ DLD+SI S + K++ + +L + ++ G + + + ARVP
Sbjct: 2 DLFTPHSDLDLSINFSANTDEQYTRKQMISIIKKFSKVLFSYQRSGIFCGVLPIVSARVP 61
Query: 117 ILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
ILK + CDIS++N G +S + ++S +D RF+ + LVK WAK HD+N P+
Sbjct: 62 ILKVIDRGTGVECDISVENKDGMTRSMIIKFVSSLDERFQILSYLVKFWAKVHDLNTPRQ 121
Query: 177 GTFNSYSLSLLVLFHFQTCVPAIL 200
T +S S+ LV FH Q C P IL
Sbjct: 122 LTMSSMSIISLVAFHLQ-CRPGIL 144
>gi|402220189|gb|EJU00261.1| hypothetical protein DACRYDRAFT_81189 [Dacryopinax sp. DJM-731 SS1]
Length = 753
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 143/331 (43%), Gaps = 33/331 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
L D + L P E+ + V L ++ +VE + + FGS + R D+D+
Sbjct: 43 LFDFVVQLLPTPEELAIKEDVRKLLERLIRNVEP--DSRLLSFGSTANGFALRNSDMDLC 100
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF-----ETIHQ 125
+ + +A + V ++ DLL ++ +++ + AR+PI+K + +
Sbjct: 101 CLIDSDRLPPTASEMV--VMVADLL----ERETKFQVKPLPKARIPIIKLTLAPTQGLPY 154
Query: 126 NISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
I+CDI +N ++ L + +D R R MVL +K W+K IN+P GT +SY
Sbjct: 155 GIACDIGFENRLALENTRLLLSYATLDPTRVRTMVLFLKLWSKRRKINSPYLGTLSSYGY 214
Query: 185 SLLVLFHF-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDK 241
+L+V+F P +LP L+ I P L+ + + I E I F+
Sbjct: 215 ALMVIFFLVHVKHPPVLPNLQVIPP------LRPI-TKEDMHIGEHNIWFFDDTELLKQT 267
Query: 242 YRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNN 293
++ N + L V F + FS +S++ L + G +E +
Sbjct: 268 WQSANTDGVGELLVDFFKYFSHDFPYNTHVISIRGGLLEKT-YKGWYEPDPRAHDLSRDR 326
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ L IEDPFE N R V++ L I F
Sbjct: 327 NRLCIEDPFEISYNVGRTVTKDGLYTIRGEF 357
>gi|62913984|gb|AAH05013.2| TUT1 protein [Homo sapiens]
Length = 578
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 116/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 22 GDLGKASELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 79
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 80 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSGSGP-LLSNYAL 138
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 139 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 186
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
IN L+ L F S L+ S L + P G WE +R
Sbjct: 187 EPSINVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 241
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 242 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 270
>gi|170591628|ref|XP_001900572.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158592184|gb|EDP30786.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 1395
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 58/221 (26%), Positives = 102/221 (46%), Gaps = 24/221 (10%)
Query: 110 VAHARVPILKFETI-HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ HA+VPI+KF H N+ D+S+ N+ ++ L S++D R + ++ K WAK
Sbjct: 1122 IPHAKVPIVKFRCRNHYNLEADVSLYNVLALENTRLLRTYSKLDRRIHQLGIMTKIWAKN 1181
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
+I N G+ +SYS ++++ + Q P + P L+++ P E I +
Sbjct: 1182 CEIGNASKGSLSSYSYIIMLIHYLQRTDPPVAPFLQEVAPPGRC---------REPIIID 1232
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
C F ++ NR ++ L++ FL+ F A++ FT + IR
Sbjct: 1233 NCDVYFCNFEDLEWTVHNRLTVGELWIGFLDYF------ATKFD---FTREVVQIRQTPP 1283
Query: 289 WLP-----NNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + P+ IEDPF+ N + V K +A I +F
Sbjct: 1284 LMKLDKGWQSRPIAIEDPFDLSHNLSSGVHSKTMAYIQKSF 1324
>gi|341896648|gb|EGT52583.1| hypothetical protein CAEBREN_17557 [Caenorhabditis brenneri]
Length = 448
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 129/284 (45%), Gaps = 29/284 (10%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GSF + + DLD S+E+ N SA K + + LR Y R+ F
Sbjct: 100 GSFAAGIALSSSDLDFSLEIPNMMGHESA----KLEAIWNKLRDYYDHPYYDRVLF---T 152
Query: 114 RVPILKFETIHQN-----ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ P+LK + + + D+++DN + ++ L W QID RF + VK WA
Sbjct: 153 KFPVLKMTLKYSDKRISDVDVDLTLDNHPPKRNTQLLVWYGQIDPRFNTLCRAVKIWASR 212
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
+ N + G NS+S+ +LV+F Q +LP +++++ ++L G E Q +
Sbjct: 213 TGVKNSRNGFLNSFSVCILVIFFLQQV--KVLPNIQEVF-----EELNG---ELEIQDDD 262
Query: 229 ICAFNIARFSSDKYRKI--NRSSLAHLFVSFLEKFSGLSLKASELGI--CPFTGQWEHIR 284
++ DK + N SSL LF F++ +S L +A + I + +
Sbjct: 263 YYKRDLLEELHDKGIVVGQNGSSLGALFFGFMKFYSELDFEAHWISIKRGKLLKKIDEDG 322
Query: 285 SNTRWLPNNH-PLFIEDPF-EQPENSARAVSE-KNLAKISNAFE 325
+ LP+N + +EDPF E P N AR V + KI N F+
Sbjct: 323 NPVDGLPHNSLYIVLEDPFLEHPFNCARTVKDLARFKKIQNEFQ 366
>gi|291383480|ref|XP_002708297.1| PREDICTED: Caffeine Induced Death (S. pombe Cid) homolog family
member (cid-1)-like [Oryctolagus cuniculus]
Length = 1505
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 135/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1037 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGHETAEGLDC 1088
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1089 VRTIEELARVLRKHAGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1148
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1149 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1208
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ A+ +Y K N S+ L++ L ++
Sbjct: 1209 KGEKKPEIFVDGWNIYFFDQIDELPAY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1261
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1262 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1309
Query: 318 AKISNAF 324
I AF
Sbjct: 1310 NFIMKAF 1316
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S + R D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRMGFRNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIEVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + + + + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTVLGKAEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|344295651|ref|XP_003419525.1| PREDICTED: LOW QUALITY PROTEIN: speckle targeted PIP5K1A-regulated
poly(A) polymerase-like [Loxodonta africana]
Length = 921
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 73/269 (27%), Positives = 117/269 (43%), Gaps = 30/269 (11%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL S+EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 370 GDLGKSLELAEALKGEKAEGGAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 427
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL S++DGR R +V ++ WA+ ++ N+Y+L
Sbjct: 428 SGLHGDISLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSGSGP-LLNNYAL 486
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 487 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 534
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN-------H 294
N L+ L F S L+ S L + GQ + LP+N
Sbjct: 535 EPSTNVEPLSSLLAQFFSCVSCWDLRGSLLSL--REGQALPVAGG---LPSNLWGGLRLG 589
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 590 PLNLQDPFDLSHNVAANVTCRVAGRLQNC 618
>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
Length = 531
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D + ++P R++ E R K I LR V + GA ++ FGS+ ++L+ D+D
Sbjct: 105 IRDFVAYISPSRQEIELRNKTIRTLRHAVRKL--WPGADLQVFGSYATDLYLPGSDIDCV 162
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S G K +S L +L L+ + +++ +A ARVPI+KF I D
Sbjct: 163 IN-------SKTGDKENRSSLYELAHFLKNRKLATQVEVIAKARVPIIKFVEPTSQIHVD 215
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ Q G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 216 VSFERTNGLEAAKLIRSWLQQTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 273
Query: 190 FHFQTCVPAIL 200
+ F P I+
Sbjct: 274 YAFLNLHPRIV 284
>gi|297720833|ref|NP_001172779.1| Os02g0122100 [Oryza sativa Japonica Group]
gi|41052754|dbj|BAD07610.1| putative caffeine-induced death protein 1 [Oryza sativa Japonica
Group]
gi|125537868|gb|EAY84263.1| hypothetical protein OsI_05643 [Oryza sativa Indica Group]
gi|125580616|gb|EAZ21547.1| hypothetical protein OsJ_05175 [Oryza sativa Japonica Group]
gi|255670556|dbj|BAH91508.1| Os02g0122100 [Oryza sativa Japonica Group]
Length = 597
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 139/329 (42%), Gaps = 34/329 (10%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
+ P L + LNP E + ++I L V + A + +GS ++ +
Sbjct: 271 DAFTPGLLSLYESLNPSEEHKAKQRQLIESLTNSVS--KEWPNAQLHLYGSCANSFGNSH 328
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
D+D+ +++ ++A + + + LL L +K + ++ + ARVPI+K
Sbjct: 329 SDVDVCLQID-----TAAEENIAELLL--ALAETLRKDDFDNVEAITSARVPIVKIADPG 381
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+SCDI ++NL +K L +QID R + +VK WAK +N GT +SY+
Sbjct: 382 SGLSCDICVNNLFAVANTKLLKDYAQIDERLLQLAFIVKHWAKLRGVNETYRGTLSSYAY 441
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
L+ + Q P ILP L+ + P V + G Q+ ++ F
Sbjct: 442 VLMCISFLQQREPKILPCLQAMEPTYTV-VVDGTECAYFDQVDQLKDFGAE--------- 491
Query: 245 INRSSLAHLFVSFLEKFS--------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPL 296
N+ S+A L +F ++ +S++ I W TR + H +
Sbjct: 492 -NKESIAELLWAFFHYWAFHHDYRNDVISVRMGNT-ISKQEKNW-----TTRVGNDRHLI 544
Query: 297 FIEDPFEQPENSARAVSEKNLAKISNAFE 325
IEDPFE + R V + + + FE
Sbjct: 545 CIEDPFETSHDLGRVVDRQTIRVLREEFE 573
>gi|242053379|ref|XP_002455835.1| hypothetical protein SORBIDRAFT_03g025985 [Sorghum bicolor]
gi|241927810|gb|EES00955.1| hypothetical protein SORBIDRAFT_03g025985 [Sorghum bicolor]
Length = 55
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 36/54 (66%), Positives = 44/54 (81%)
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL 215
+KEWAKA +IN+PK+G+ NSYSL LLVLFHFQT P ILPPL +IY GN+ D+
Sbjct: 1 IKEWAKAQNINDPKSGSLNSYSLCLLVLFHFQTSDPPILPPLNEIYEGNIAGDV 54
>gi|292614134|ref|XP_002662153.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 2 [Danio rerio]
Length = 489
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 150/332 (45%), Gaps = 31/332 (9%)
Query: 21 LREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
L + R + +D++++ + G GS ++ SR D D+ + + G
Sbjct: 180 LEKKESCRAALQTDIQKIFPCAKVFLG------GSSLNGFGSRSSDADLCLVIEEGP--- 230
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ + L+R L K Y + A+VPI+KF + D++ +N G I
Sbjct: 231 -VNHRKDAVYVLSLVRKLLYKLSYIEKPQLIRAKVPIVKFRDRISGVEFDLNFNNTVG-I 288
Query: 141 KSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAI 199
++ FL + ++ R R +VL++K+WA H IN+ GT +SY+L L+VL + QT +
Sbjct: 289 RNTFLLRTYAFVEKRVRPLVLVIKKWANHHCINDASRGTLSSYTLVLMVLHYLQTLPEPV 348
Query: 200 LPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLE 259
+P L+ YP D K ++I AF ++R N+SSL LF+ FL
Sbjct: 349 IPCLQRDYPTCF--DPKMDIHLVPSGPSDIPAF-VSR---------NQSSLGDLFLGFLR 396
Query: 260 KFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK-NLA 318
++ + K + I + + W + + +E+PF + N+ARAV E+
Sbjct: 397 YYATV-FKWDKQVISVRMARTLPKSNCKEW--KDKFICVEEPFNR-TNTARAVHERMKFE 452
Query: 319 KISNAFEMTHFRLTSTNQTRYALLSS--LARP 348
I AF ++ L + L S +ARP
Sbjct: 453 AIKAAFIESYRLLQLRKDLNFILPKSKQMARP 484
>gi|195442374|ref|XP_002068933.1| GK18036 [Drosophila willistoni]
gi|194165018|gb|EDW79919.1| GK18036 [Drosophila willistoni]
Length = 582
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 141/312 (45%), Gaps = 33/312 (10%)
Query: 30 KVISDLREVV-ESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
+ I D+R +V E + SL T PFGS V+ L ++ D+D+ ++ +K +
Sbjct: 46 ECIHDMRILVQEKMGSLLEVT--PFGSIVTGLSLKYSDVDLYMQWDGKK------RKSRT 97
Query: 89 SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
L + LR + + + ARVPI++ + + +S DI++ N S+F+ +
Sbjct: 98 VLYNQINGFLRTATLFGDVVAIRSARVPIIRCKHMTTGLSTDINVSNPKSIYNSRFVTEL 157
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP 208
D R + + L +K WAK IN P T SY L +L++++ Q +LP +K++
Sbjct: 158 ISRDLRLKQLNLFLKIWAKKSKINGPACMT--SYCLCVLIVYYLQQ--RGLLPSIKNLQS 213
Query: 209 GNLVDDLKGVR-ANAERQIAEIC----AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
L ++ GV A + + EI +F++ R Y ++ S+ + + G
Sbjct: 214 TRLPSNVWGVNYAYDLKNVPEISNDVNSFDLIRGFFKFYSTLDFESI------LISPYLG 267
Query: 264 LSLKASELGICP-----FTGQWEHIRSNTRWLPN----NHPLFIEDPFEQPENSARAVSE 314
+L + + P + Q + I T P L I+DPFE N A+ VS+
Sbjct: 268 RALNRTLAFLGPHAFPDYYAQMDAIERFTGTRPERFQMQRTLCIQDPFELDHNVAKGVSK 327
Query: 315 KNLAKISNAFEM 326
NL I F +
Sbjct: 328 ANLIYIQQCFTL 339
>gi|338712343|ref|XP_001502659.2| PREDICTED: LOW QUALITY PROTEIN: speckle targeted PIP5K1A-regulated
poly(A) polymerase [Equus caballus]
Length = 982
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 115/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GD S+EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 426 GDPGKSLELAEAPNGEKTEGVAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 483
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ N+Y+L
Sbjct: 484 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLNNYAL 542
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 543 TLLVIYFLQTRDPPVLPTVAQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 590
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N+ L+ L F S L+ S L + P G WE +R
Sbjct: 591 EPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 645
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 646 ---PMNLQDPFDLSHNVAANVTSRVAGRLQNC 674
>gi|321469036|gb|EFX80018.1| hypothetical protein DAPPUDRAFT_319093 [Daphnia pulex]
Length = 736
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 135/281 (48%), Gaps = 26/281 (9%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL-FSRWGDLDISIE 72
++ ++ E+ T+ ++IS L E + S+E G + FGS V+ L F DLDI +E
Sbjct: 172 LVKIVETTEEEKSTKSQIISSLEEWL-SLE-FPGCCLHLFGSSVTGLAFRNDSDLDIFLE 229
Query: 73 L---SNGSCISSAG-------KKVKQSLLGDLLRA---LRQKGGYRRLQFVAHARVPILK 119
+ G ++ A +K ++ +L L RA +R L V++AR+P+ K
Sbjct: 230 IPKYDEGLAVADASLSDEKLTEKKREYMLKTLRRASNIIRSHPDITDLFVVSNARIPVSK 289
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + CD++ +N+ SK L+ + +D R R + +K WAK+H + + T
Sbjct: 290 FVYSPIGVKCDLTCNNIIAVQNSKLLYSLQSLDVRIRPYLYALKFWAKSHRLISSPESTL 349
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS 239
+SY+L+L+ +F+ Q P ++P ++ + V K + N +F + +
Sbjct: 350 SSYALTLMAVFYLQQTDPPLVPSIESLQSE--VPQEKKIYCNGWN-----ISFQVPLDTG 402
Query: 240 DK-YRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ 279
+ + S+ +L + F + L+ A+E+ +CP G+
Sbjct: 403 KSPTQATSEMSIIYLLIGFFRFYQKLN--ANEVVVCPRIGK 441
>gi|429328192|gb|AFZ79952.1| hypothetical protein BEWA_028010 [Babesia equi]
Length = 450
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 134/291 (46%), Gaps = 54/291 (18%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
+V FGS ++ L++ DLD+ +++ N ++S K++ L +++ L R+
Sbjct: 142 CSVAVFGSAITGLWTHGSDLDLCVQIPN---VNSRSAKIRN--LRCIVKVLSPLAPTRKF 196
Query: 108 QFVAHARVPILKFE------------------TIHQNISCDISIDNLCGQIKSKFLFWIS 149
+ + +A++PI+ ++ + S DI+I+N + S +
Sbjct: 197 EQIFNAKIPIVHWKHTGGKSLDLPHNYSEFALDAYDGASIDIAINNNLAVVNSSLIGVYV 256
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R R +++ +K WA+ ++N+ GT S+++SL+V+ Q C P ILP L+D+
Sbjct: 257 SIDIRVRSLIIFLKMWARNKNLNDRSKGTMGSFAISLMVIHFLQNCSPPILPSLQDL--- 313
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI-------------NRSSLAHLFVS 256
A + +I + RF++D +KI N S L +
Sbjct: 314 ----------AFSTNEIPNFVSGFDCRFTTDT-KKIEAELRYLRNNGPENTLSSRELLMQ 362
Query: 257 FLEKFSGLSLKASELGICPFT---GQWEHIRSNTRWLPNNHP-LFIEDPFE 303
F + F L +S+ IC + ++ + ++ + P+N P L +++PFE
Sbjct: 363 FFKYFGWFHLHSSKKPICIRSVDFSVFDDLFNDFKKNPSNEPFLHVDNPFE 413
>gi|332019938|gb|EGI60398.1| U6 snRNA-specific terminal uridylyltransferase 1 [Acromyrmex
echinatior]
Length = 668
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 152/366 (41%), Gaps = 52/366 (14%)
Query: 11 LKDILGMLNPLREDWETRMKVI-SDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDI 69
L +L ++ + TR VI S L E+ S+ FGS V+ L + DLDI
Sbjct: 103 LAALLNVVQLTEVELTTRYNVICSHLNEIFRSI--FPECRTYRFGSTVAQLSFKESDLDI 160
Query: 70 SI--------------ELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARV 115
+ ++ + + KKV++ + K + + + A+
Sbjct: 161 YMYVGKIGLPPNYYKPDMPSHIWTTMVFKKVRRVMYN-------LKSVFTNIISIPKAKT 213
Query: 116 PILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK 175
PI+KF I N+SCDIS N G KS FL + + D R R ++LL+K WA+ I+
Sbjct: 214 PIIKFRYIPTNVSCDISFKNGLGIYKSDFLRYCALHDPRLRPLMLLIKYWARHFGISG-- 271
Query: 176 TGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIA 235
+G +SY L L++F+ Q +LPPL + + L G + N N +
Sbjct: 272 SGRISSYGLVCLIIFYLQQESIGLLPPLLHLQRNCIPQILNGWQVNFNENTVLPAITNSS 331
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFS----GLSLKASELGICPFTGQWEHI-------- 283
N ++L H F +F +F+ L L + P Q + +
Sbjct: 332 ----------NIATLLHNFFTFYGEFNFTACVLCLLDGKTYSLPDFAQLDKLPNYMDRYK 381
Query: 284 ----RSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRY 339
+NT+ L P+ ++DP E +N+A S++ L + + + ++ ++ Y
Sbjct: 382 SYIMDNNTKKLDIYKPVCVQDPIELNQNTAANTSDRALLAFQHCCKFSADICSTASENNY 441
Query: 340 ALLSSL 345
L +
Sbjct: 442 NYLIKM 447
>gi|358374739|dbj|GAA91329.1| zinc finger protein, cchc domain containing protein [Aspergillus
kawachii IFO 4308]
Length = 1076
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 133/299 (44%), Gaps = 30/299 (10%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
++ L P E + R +++ L ++ V FGS + L S D+DI
Sbjct: 122 EVYDQLLPSAESDDRRRQLVRKLEKLFNDQWPGCNIKVHVFGSSGNKLCSSDSDVDI--- 178
Query: 73 LSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
CI++ K+++ LL ++L + G R+ V+HA+VPI+K ++CD+
Sbjct: 179 -----CITTTYKELEHVCLLAEVL----ARHGMERVVCVSHAKVPIVKIWDPELRLACDM 229
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLF 190
+++N ++ + ++D R R + +++K W K +N+ GT +SY+ L++
Sbjct: 230 NVNNTMALENTRMVRTYVELDERVRPLAMIIKHWTKRRILNDAGLGGTLSSYTWICLIIN 289
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKINRS 248
QT P +LP L+ R + ++ A+ +C+F+ S Y + N+
Sbjct: 290 FLQTRDPPVLPSLQ-------------ARPHKKKLTADGIVCSFDDDLDSLVGYGRKNKQ 336
Query: 249 SLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
SL L F K+ G L I G+ + L N+ L +E+PF N
Sbjct: 337 SLGELLFQFF-KYYGHELDYERHVISVREGKLLSKEAKGWHLLQNNRLCVEEPFNTSRN 394
>gi|302829348|ref|XP_002946241.1| hypothetical protein VOLCADRAFT_102845 [Volvox carteri f.
nagariensis]
gi|300269056|gb|EFJ53236.1| hypothetical protein VOLCADRAFT_102845 [Volvox carteri f.
nagariensis]
Length = 1387
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 159/363 (43%), Gaps = 83/363 (22%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVE----PFGSFVSNL 60
+LE ++KD + P +D R++VI++L++ ++ G T++ PFGSFV++L
Sbjct: 43 ELLEALVKDSI----PSNDDRNKRLRVINELQQSLQKAN--LGPTIDLRILPFGSFVNSL 96
Query: 61 FSRWGDLDISI------------ELSNGSCISSAGKKVKQSLLGDLLR---ALRQKGGY- 104
+ R DLD++I E ++ K+ + +LLR A K G
Sbjct: 97 YERSSDLDVAIVGNIVATDDLSPEDKRFPFVTLQLAKLPRDYTVELLRTAVARFVKDGLV 156
Query: 105 --RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
+ L V HAR+PI+KF IS D+++ + K+ + + + + +V
Sbjct: 157 EEKSLVPVLHARIPIIKFVHPATGISVDVTVGSDGDAFKTWSMGQLMALHPAADKLFRVV 216
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K WA+AH IN+P++ TFN++ L+LL + + + V K++ P
Sbjct: 217 KVWARAHCINDPRSFTFNTWCLTLLRVENLMSMVS------KNVVPAR------------ 258
Query: 223 ERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS---------ELGI 273
F K + ++ LF+ FL+ S L LKA+ I
Sbjct: 259 ------------ESFGCRKAKPLD-----QLFLQFLDSNSKL-LKAALCNKEPLKKGTAI 300
Query: 274 CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV-----SEKNLAKISNAFEMTH 328
F G + H S+ H LF+EDP +N+AR + S + L ++ F +
Sbjct: 301 SVFYGDF-HRDSHF----AEHRLFVEDPLNSLDNTARTLRAPKNSGRTLDYVTKTFTQSA 355
Query: 329 FRL 331
RL
Sbjct: 356 MRL 358
>gi|85000367|ref|XP_954902.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65303048|emb|CAI75426.1| hypothetical protein, conserved [Theileria annulata]
Length = 501
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 100/197 (50%), Gaps = 12/197 (6%)
Query: 17 MLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNG 76
+L P + +E + ++ L+ ++ES S+ G T+ FGS + L++R D+D+ + + N
Sbjct: 169 VLCPTADQFEKKRSLMDHLKPLIES--SING-TLHTFGSCDNGLWTRGSDIDMCLVIPN- 224
Query: 77 SCISSAGKKVKQSLL----GDLLRALRQ---KGGYRRLQFVAHARVPILKFETIHQNISC 129
C S K +L+ LL ++ + ARVPI+K + +N C
Sbjct: 225 -CDSKRYMLSKLNLVSLSANQLLVQIKSCLSNSDIISKISIISARVPIVKLFDMEENSIC 283
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
DISI+N S+++ + ++D R + +K WA + INN GT +SY+L L +
Sbjct: 284 DISINNTIALSNSEYVKTMCKLDERVVLLGRFIKYWATSRKINNRAQGTMSSYTLILQLF 343
Query: 190 FHFQTCVPAILPPLKDI 206
+ Q P I+PP KDI
Sbjct: 344 YFLQNTTPPIIPPFKDI 360
>gi|189234246|ref|XP_973715.2| PREDICTED: similar to AGAP001130-PA [Tribolium castaneum]
Length = 510
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 123/272 (45%), Gaps = 37/272 (13%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S D+DI + ISS + L L AL + G + + A
Sbjct: 238 GSTMSGFALEGSDIDICLLTKP---ISSEPRIDSLHHLDYLQHALLENGLASEAELIM-A 293
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
+VPILKF+ D++ +N+ G ++ L+ +Q+D R R +V++VK WA+ + IN+
Sbjct: 294 KVPILKFKNKETGFEIDLNCNNIVGIQNTRLLYCYAQLDWRVRPLVVMVKIWAQRNHIND 353
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
K T +SYS +L+V+ + Q V PA+LP L +YP + L+ + + +
Sbjct: 354 AKNMTISSYSWTLMVIHYLQCGVFPAVLPCLHSLYPEEF-NTLENRSLDVQGGVE----- 407
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPN 292
+ F S+ N L L + F +S + + + + P
Sbjct: 408 GLKDFESE-----NTRCLGDLLIGFFHYYSYFNYQHYAISVKS---------------PK 447
Query: 293 NHP-----LFIEDPFEQPENSARAVSEKNLAK 319
N P L IE+PF+ N+AR+V + + K
Sbjct: 448 NDPHQWKFLCIEEPFDL-SNTARSVFDLEIFK 478
>gi|432109014|gb|ELK33484.1| Terminal uridylyltransferase 7 [Myotis davidii]
Length = 1481
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 91/182 (50%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1014 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1065
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ LF
Sbjct: 1066 IRTIEELARVLKKHSGLRNILPITTAKVPIVKFYHLRSGLEVDISLYNTLALHNTRLLFA 1125
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1126 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRKPPVIPVLQEIY 1185
Query: 208 PG 209
G
Sbjct: 1186 KG 1187
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 44/201 (21%), Positives = 91/201 (45%), Gaps = 21/201 (10%)
Query: 15 LGMLNPLREDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
G+LN E+ E R+++ + ++E+V L ++ +GS S L + D++I I+
Sbjct: 306 FGLLN---ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSQLGFKNSDVNIDIQ 358
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I S + +L + L+ + + HARVP++ + + C +S
Sbjct: 359 FP---AIMS-----QPDVLLLVQECLKNSDSFIDVDADFHARVPMVVCKEKKSGLLCKVS 410
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
N + + L + +++ + +V+ + WAK I+ P+ G Y +L+ +F
Sbjct: 411 AGNEHACLTTNHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMTVFFL 470
Query: 193 QTCVPAILPPLKDIYPGNLVD 213
Q +LP +Y G+ ++
Sbjct: 471 QQRKEPLLP----VYLGSWIE 487
>gi|281202391|gb|EFA76596.1| Putative caffeine-induced death protein 1 [Polysphondylium pallidum
PN500]
Length = 803
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 89/172 (51%), Gaps = 11/172 (6%)
Query: 31 VISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL 90
+++ L+ +V + + FGS + + + GD+DI + + SS G ++
Sbjct: 542 LLTKLQNMVSKTFTKSQVKLHLFGSSANGMSLKGGDIDICMLVD-----SSEGDS--DTI 594
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ L L+Q + L + ARVPI+KF QN++CDI I+N +K + S
Sbjct: 595 ISKLATMLKQNNLTKVLA-IPSARVPIVKFRDPIQNLACDICINNKLAIYNTKLVSDYSA 653
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS---LSLLVLFHFQTCVPAI 199
ID R R +V +VK WAK IN P TGT +SY+ L+ +F++ T V +I
Sbjct: 654 IDERMRPLVYVVKRWAKRRKINEPFTGTLSSYAYINLTYSRVFNYATEVISI 705
>gi|270002400|gb|EEZ98847.1| hypothetical protein TcasGA2_TC004457 [Tribolium castaneum]
Length = 524
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 128/278 (46%), Gaps = 35/278 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS +S D+DI + ISS + L L AL + G + + A
Sbjct: 238 GSTMSGFALEGSDIDICLLTKP---ISSEPRIDSLHHLDYLQHALLENGLASEAELIM-A 293
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
+VPILKF+ D++ +N+ G ++ L+ +Q+D R R +V++VK WA+ + IN+
Sbjct: 294 KVPILKFKNKETGFEIDLNCNNIVGIQNTRLLYCYAQLDWRVRPLVVMVKIWAQRNHIND 353
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
K T +SYS +L+V+ + Q V PA+LP L +YP + L+ + + +
Sbjct: 354 AKNMTISSYSWTLMVIHYLQCGVFPAVLPCLHSLYPEEF-NTLENRSLDVQGGVE----- 407
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ------WEHIRSN 286
+ F S+ N L L + F +S + + I TG + ++S
Sbjct: 408 GLKDFESE-----NTRCLGDLLIGFFHYYSYFNYQ--HYAISVRTGSRIPIEICKQVKS- 459
Query: 287 TRWLPNNHP-----LFIEDPFEQPENSARAVSEKNLAK 319
P N P L IE+PF+ N+AR+V + + K
Sbjct: 460 ----PKNDPHQWKFLCIEEPFDL-SNTARSVFDLEIFK 492
>gi|93003164|tpd|FAA00165.1| TPA: zinc finger protein [Ciona intestinalis]
Length = 1410
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 63/231 (27%), Positives = 108/231 (46%), Gaps = 25/231 (10%)
Query: 7 LEPILKDILGML----NPLREDWETRMKVISDLREVVESVESLR------GATVEPFGSF 56
L PI K +L L N + E++ R K + + ++ E++ + + FGS
Sbjct: 766 LPPITKSLLNALDRVCNYMYENFALRQKEVQERNKICEALMNYIQRKYNFKCQMNLFGSS 825
Query: 57 VSNLFSRWGDLDISIELSNGSCISSAGKKVKQ-----SLLGDLLRALRQKGGYRRLQFVA 111
+ R DLDI C++ G + S++ D+ + LR+ + +
Sbjct: 826 RNGFGFRRSDLDI--------CMTFYGNATGEDLDFVSIITDVAKCLRRNSDLCNILPIT 877
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
A+VPI+KFE + DIS+ NL Q + L S ID R + + +K + K I
Sbjct: 878 TAKVPIVKFEHKMSGLEGDISLYNLLAQKNTAMLSCYSSIDCRCKVLGYAMKVFVKRCQI 937
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN--LVDDLKGVRA 220
+ G+ +SY+ +L+V+F+ Q P +LP L+ +Y G+ VD + G A
Sbjct: 938 GDASRGSLSSYAYTLMVIFYLQQRKPPVLPVLQQLYEGSEQPVDTIDGWNA 988
>gi|390339031|ref|XP_003724909.1| PREDICTED: poly(A) RNA polymerase GLD2-B-like [Strongylocentrotus
purpuratus]
Length = 568
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 136/297 (45%), Gaps = 35/297 (11%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ S+ D D+ + S+ + V +L G L+Q+ + + V A
Sbjct: 281 GSSLNGFGSKGCDADMCLYFSDAPISQKDARDVLLTLRG----YLQQRCSFIKNMKVIFA 336
Query: 114 RVPILKFETIH-QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
+VPILKF+ +++ CD++I++ G + L S++D R +V+LVK+WAK H IN
Sbjct: 337 KVPILKFQHKRFRDLECDLNINHHTGVRNTALLQTYSELDWRVAPLVMLVKQWAKNHGIN 396
Query: 173 NPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYPGNLV-DDLKGVRANAERQIAEIC 230
+ GT +SYS L+++ + Q C P ++ ++ G V D + + + R
Sbjct: 397 DASQGTLSSYSYVLMIINYLQVGCKPPVVNSVQAQEWGRSVFSDRQSILYSWNRLYDHPK 456
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS----ELGIC-PFTGQWEHIRS 285
+ + + + N L L F E ++ + + LG C P T
Sbjct: 457 NY----MNDPRNQSKNSQDLYSLLKGFFEYYTNFDFENTVISIRLGSCFPCT-------- 504
Query: 286 NTRWLPNNHP------LFIEDPFEQPENSARAVSE-KNLAKISNAFEMTHFRLTSTN 335
LP NH +FIE+PF++ N+AR+V + +I E T+ L TN
Sbjct: 505 ---LLPPNHQGWRQKYIFIEEPFDRI-NTARSVFDWGRYRQILETLEETYESLCKTN 557
>gi|198429697|ref|XP_002127607.1| PREDICTED: zinc finger (CCHC/C2H2)-1 [Ciona intestinalis]
Length = 1408
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 63/231 (27%), Positives = 108/231 (46%), Gaps = 25/231 (10%)
Query: 7 LEPILKDILGML----NPLREDWETRMKVISDLREVVESVESLR------GATVEPFGSF 56
L PI K +L L N + E++ R K + + ++ E++ + + FGS
Sbjct: 764 LPPITKSLLNALDRVCNYMYENFALRQKEVQERNKICEALMNYIQRKYNFKCQMNLFGSS 823
Query: 57 VSNLFSRWGDLDISIELSNGSCISSAGKKVKQ-----SLLGDLLRALRQKGGYRRLQFVA 111
+ R DLDI C++ G + S++ D+ + LR+ + +
Sbjct: 824 RNGFGFRRSDLDI--------CMTFYGNATGEDLDFVSIITDVAKCLRRNSDLCNILPIT 875
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
A+VPI+KFE + DIS+ NL Q + L S ID R + + +K + K I
Sbjct: 876 TAKVPIVKFEHKMSGLEGDISLYNLLAQKNTAMLSCYSSIDCRCKVLGYAMKVFVKRCQI 935
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN--LVDDLKGVRA 220
+ G+ +SY+ +L+V+F+ Q P +LP L+ +Y G+ VD + G A
Sbjct: 936 GDASRGSLSSYAYTLMVIFYLQQRKPPVLPVLQQLYEGSEQPVDTIDGWNA 986
>gi|401837753|gb|EJT41641.1| PAP2-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 592
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R + IS +RE ++ + A + FGS+ ++L+ D+D
Sbjct: 185 IKDFVAYISPSREEIEIRNQTISTIREALKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 242
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++ ++ VA ARVPI+KF H I D
Sbjct: 243 VN-------SELGGKESRNNLYSLASHLKKNNLATEIEVVAKARVPIIKFVEPHSRIHID 295
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W++ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 296 VSFERTNGLEAAKLIREWLNDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 353
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 354 FSFLHMHPRII 364
>gi|348502152|ref|XP_003438633.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Oreochromis niloticus]
Length = 798
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/325 (24%), Positives = 131/325 (40%), Gaps = 63/325 (19%)
Query: 42 VESLRGATVEPFGSFVSNLFSRWGDLDISIELSN---------------GSCISSAGKKV 86
VE + + PFGS V+ DLD+ ++L N G +S G+
Sbjct: 192 VEFFPDSQILPFGSSVNTFGIHSCDLDLFLDLENTKVFQAHAKSTTEQPGEGVSDDGRS- 250
Query: 87 KQSLLGDL------------LRAL---RQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
+ S+L D+ L A+ R ++ V+ AR+P++KF N+ DI
Sbjct: 251 EDSILSDIDLSTATPAEVLDLVAMILKRCVPSVHKVHVVSSARLPVVKFHHRELNLQGDI 310
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGT---FNSYSLSLLV 188
+I+N ++FL S ID R R +V ++ WAK + +G+ N+Y+L+LL+
Sbjct: 311 TINNRLAVRNTRFLQLCSGIDERLRPLVYTIRYWAKQKQLAGNPSGSGPLLNNYALTLLI 370
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI--CAFNIARFSSDKYRKIN 246
+F Q C P +LP VD LK + E + E C F + + N
Sbjct: 371 IFFLQNCEPPVLP---------TVDQLKDLACEEEECVIEGWNCTFPSQPIAVPPSK--N 419
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----------QWEHIRSNTRWL 290
L L F ++ +S + + P T Q + +
Sbjct: 420 TQQLCTLLAGFFSFYANFDFASSVISVREGRALPITDFLSQNKEDDALQQDQSTKTHQQR 479
Query: 291 PNNHPLFIEDPFEQPENSARAVSEK 315
P PL + DPFE N A ++E+
Sbjct: 480 PKLGPLNLLDPFELSHNVAGNLNER 504
>gi|432855630|ref|XP_004068280.1| PREDICTED: terminal uridylyltransferase 4-like [Oryzias latipes]
Length = 1408
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 138/316 (43%), Gaps = 31/316 (9%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L+P + + R ++++ L + E A + FGS + R DLDI + L
Sbjct: 711 LSPSPVEQQKREQILAGLERFIRK-EFNEKAQLCLFGSSKNGFGFRDSDLDICMTLEGHD 769
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
SA K + ++ L + L++ G R + + A+VPI+KFE + DIS+ N
Sbjct: 770 ---SAEKLNCKEIIEGLAKVLKKHTGLRNILPITTAKVPIVKFEHRQSGLEGDISLYNTL 826
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
Q ++ L + +D R + + +K +AK DI + G+ +SY+ L+VL+ Q P
Sbjct: 827 AQHNTRMLATYAALDPRVQFLGYTMKVFAKRCDIGDASRGSLSSYAYILMVLYFLQQRQP 886
Query: 198 AILPPLKDIYPGNLVDD--LKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFV 255
++P L++I+ G V + G A IA++ R + R+ N S+ L++
Sbjct: 887 PVIPVLQEIFDGTTVPQRMVDGWNAFFFDDIADL----RQRLAG---RQPNMESVGELWL 939
Query: 256 SFLEKFS-GLSLKASELGI------CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENS 308
L ++ K + I F QW + + IEDPF+ N
Sbjct: 940 GLLRFYTEEFDFKEHVISIRQRKRLTTFEKQW-----------TSKCIAIEDPFDLNHNL 988
Query: 309 ARAVSEKNLAKISNAF 324
VS K I AF
Sbjct: 989 GAGVSRKMTNFIMKAF 1004
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/199 (20%), Positives = 88/199 (44%), Gaps = 14/199 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED++ R V++ + EV++ L ++ +GS ++ + D++I + +
Sbjct: 209 EDFKVRETVVTRMEEVIK--RHLAACSLRLYGSCLTRFAFKSSDINIDVTFPS------- 259
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L +L L+ + ++ HA+VP + + C +S N + +
Sbjct: 260 -TMTQPEVLIKVLEILKNSVEFSDVESDFHAKVPAVFCRDKSSGLLCKVSAGNDVACLTT 318
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ R +VL + WA+ ++ G SYS +L+V+F Q ILP
Sbjct: 319 NHLAALVKLEPRLVPLVLAFRYWARLCHVDCQAEGGIPSYSFALMVIFFLQQRKEPILP- 377
Query: 203 LKDIYPGNLVDDLKGVRAN 221
+Y G ++ + R +
Sbjct: 378 ---VYLGRWIEGFEVKRVD 393
>gi|126302611|sp|Q9H6E5.2|STPAP_HUMAN RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|84708631|gb|AAI10911.1| Terminal uridylyl transferase 1, U6 snRNA-specific [Homo sapiens]
gi|118763610|gb|AAI28264.1| Terminal uridylyl transferase 1, U6 snRNA-specific [Homo sapiens]
gi|119594435|gb|EAW74029.1| RNA binding motif protein 21 [Homo sapiens]
Length = 874
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 116/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKASELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 376 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 435 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 482
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
IN L+ L F S L+ S L + P G WE +R
Sbjct: 483 EPSINVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 537
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 538 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 566
>gi|158258657|dbj|BAF85299.1| unnamed protein product [Homo sapiens]
Length = 874
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 116/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKASELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 376 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 435 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 482
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
IN L+ L F S L+ S L + P G WE +R
Sbjct: 483 EPSINVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 537
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 538 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 566
>gi|330803837|ref|XP_003289908.1| hypothetical protein DICPUDRAFT_98523 [Dictyostelium purpureum]
gi|325079984|gb|EGC33559.1| hypothetical protein DICPUDRAFT_98523 [Dictyostelium purpureum]
Length = 3376
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/308 (21%), Positives = 134/308 (43%), Gaps = 20/308 (6%)
Query: 42 VESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQK 101
+ + + GS++ +L DL+++ ++ K+V L +
Sbjct: 3078 ISGFSNSNISILGSYLYDLALTDSDLNVNFTITEKESDIRIYKQVS--------VFLEKN 3129
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G Y+ ++P+++F I + I ++S ++ G KS + D R + + LL
Sbjct: 3130 GNYKVQSKRFEDQIPVIRFIDIQKRIQFEMSFNSQMGYHKSLLIKEYVMSDSRVKSLTLL 3189
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
VK WA DIN+ + TF+S+ L +V+F Q ILP L + +L+ + ++AN
Sbjct: 3190 VKHWASQKDINDYEKDTFSSFCLVNMVIFFLQKT--NILPNLSEPSQESLIPNKSQLKAN 3247
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
+ + ++I D +K++ SSL + F F + K + + I
Sbjct: 3248 CRVENNLVKYYDITTLIIDT-KKLSVSSLLYKFFHF---YCTFDFKNNHISITNIPFDRS 3303
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYAL 341
+++ + P+ + DPF + +N A+++++ I F + ++L N+ YA
Sbjct: 3304 QLKN------QDSPIIVLDPFIEGKNLAQSITQNTFENIMTEFAIMEYKLKQFNKPNYAK 3357
Query: 342 LSSLARPF 349
+ F
Sbjct: 3358 FNDTKEIF 3365
>gi|10438696|dbj|BAB15314.1| unnamed protein product [Homo sapiens]
Length = 874
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 116/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKASELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 376 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 435 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 482
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
IN L+ L F S L+ S L + P G WE +R
Sbjct: 483 EPSINVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 537
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 538 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 566
>gi|348683816|gb|EGZ23631.1| hypothetical protein PHYSODRAFT_556286 [Phytophthora sojae]
Length = 562
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 95/200 (47%), Gaps = 11/200 (5%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLR-----GATVEPFGS 55
MG+ N + + D + +L E E ++ R V V+ L V PFGS
Sbjct: 1 MGATNSVRKLTIDSIALL----EQLEPNAAELAAKRAVRRRVQQLLKQQWPTCRVLPFGS 56
Query: 56 FVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL-RQKGGYRRLQFVAHAR 114
S L D+D+ I + + + G+ Q + L A R G ++ +FV +AR
Sbjct: 57 SESGLGFGGCDVDLGIYFEDVD-VDAQGQFSPQERVELLATACERLAGAFQVQEFVRNAR 115
Query: 115 VPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNP 174
VP++K + ++CD+ + + + + L + Q+D R R +V VK W K IN+
Sbjct: 116 VPVIKLWDSKRQVACDVCVGGVNALLNTALLKYYGQVDPRVRPLVFAVKYWTKQRGINDS 175
Query: 175 KTGTFNSYSLSLLVLFHFQT 194
GT +SY +LL++F+ Q+
Sbjct: 176 VNGTLSSYGYTLLLVFYLQS 195
>gi|226371750|ref|NP_073741.2| speckle targeted PIP5K1A-regulated poly(A) polymerase [Homo
sapiens]
Length = 912
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 116/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 356 GDLGKASELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 413
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 414 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 472
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 473 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 520
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
IN L+ L F S L+ S L + P G WE +R
Sbjct: 521 EPSINVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 575
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 576 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 604
>gi|225679449|gb|EEH17733.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1102
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E + R+K ++ L +++ V FGS + L S D+DI
Sbjct: 120 MRELYDRLLPSEESEQRRLKFVNKLEKLLNKQWPGNNIRVHVFGSSGNKLCSSDSDVDI- 178
Query: 71 IELSNGSCISSAGKKV-KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K++ K +L D L K G R+ V+HARVPI+K ++C
Sbjct: 179 -------CITTTYKELEKVCMLADFL----AKSGMERVVCVSHARVPIVKIWDPELLLAC 227
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ID R R + +++K W K +N+ GT +SY+ L+
Sbjct: 228 DMNVNNTMALENTRMIRTYVDIDERVRPLAMILKYWTKRRILNDAALGGTLSSYTWICLI 287
Query: 189 LFHFQTCVPAILPPLK 204
+ QT P +LP L+
Sbjct: 288 ISFLQTRNPPVLPSLQ 303
>gi|260799419|ref|XP_002594694.1| hypothetical protein BRAFLDRAFT_285462 [Branchiostoma floridae]
gi|229279930|gb|EEN50705.1| hypothetical protein BRAFLDRAFT_285462 [Branchiostoma floridae]
Length = 333
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 102/220 (46%), Gaps = 25/220 (11%)
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
R+LQ++ HAR P++KF I CD++ +N S+ L S+ID R R +V V+
Sbjct: 34 RQLQYILHARCPLVKFMHEASGIQCDLTSNNSIALKSSELLNLYSRIDPRVRPLVYAVRH 93
Query: 165 WAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAE 223
WA+ H I + G + ++SL+ LV+F Q +LP +D LK + A+
Sbjct: 94 WARMHHITSSMPGGWITNFSLTALVIFFLQYTDRPVLPT---------IDALKVLADKAD 144
Query: 224 RQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW 280
+ E N SD + N S L + F E +S + K L + TG+
Sbjct: 145 TCVLE---GNDCTLVSDLTKVPLSENTDSTDELLLEFFEFYSNFNFKNCGLNL--RTGEM 199
Query: 281 EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + N L+I++PFEQ N ++ VS L K
Sbjct: 200 QEKK-------NFDALYIQNPFEQQLNLSKNVSMHQLEKF 232
>gi|327263074|ref|XP_003216346.1| PREDICTED: poly(A) RNA polymerase GLD2-like [Anolis carolinensis]
Length = 488
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 132/285 (46%), Gaps = 53/285 (18%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGY 104
L G+++ FG+ S+ GDL C+ + + Q + L QK
Sbjct: 207 LVGSSLNGFGTRSSD-----GDL----------CLVVKEEPINQKTEARYILGLLQKHFC 251
Query: 105 RRLQ-FVAH-----ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R+L F+ A+VPI+KF + D++++N+ G I++ FL + I+ R R
Sbjct: 252 RKLSNFIERPQLIRAKVPIVKFRDKVSCVEFDLNVNNVVG-IRNTFLLRTYAYIESRVRP 310
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKG 217
+VL+VK+WA IN+ GT +SYSL L+VL + QT ILP L+ YP + D
Sbjct: 311 LVLVVKKWASFRGINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKNYPESF-DPTMQ 369
Query: 218 VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFT 277
+R + A + Y N ++L L + FL + A+E ++
Sbjct: 370 LRLVHQ-----------APCTIPPYVSKNEATLGDLLLGFLRYY------ATEFD---WS 409
Query: 278 GQWEHIRSNTRWLP-------NNHPLFIEDPFEQPENSARAVSEK 315
Q +R + LP N + +E+PF++ N+ARAV EK
Sbjct: 410 SQMISVR-EAKALPRSDGIEWRNKFICVEEPFDR-TNTARAVHEK 452
>gi|354491815|ref|XP_003508049.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Cricetulus
griseus]
Length = 543
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 127/305 (41%), Gaps = 61/305 (20%)
Query: 47 GATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG--- 102
G + PF S V N F + G DLD+ ++L GK GD L + K
Sbjct: 186 GCVIWPFSSSV-NTFGKLGCDLDMFLDLD------EIGKLDVHKNAGDFLMEFQMKTVPS 238
Query: 103 ---------------------GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIK 141
+Q + +AR P+L+F CD++++N
Sbjct: 239 ERIATQKILSVIGECIDNFGPSCVGVQKILNARCPLLRFSHQASGFQCDLTVNNSIALKS 298
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAIL 200
S+ L+ +D R R +V V+ WA+AH + + GT+ ++SL+++V+F Q P IL
Sbjct: 299 SELLYIYGSLDSRVRALVFSVRCWARAHSLTSSIPGTWITNFSLTVMVIFFLQRRSPPIL 358
Query: 201 PPLKDIYPGNLVDDLKGVRANAERQIAEICAFNI--ARFSSDKYR---KINRSSLAHLFV 255
P L D LK + A+AE + C N F D Y+ N +L L
Sbjct: 359 PTL---------DSLKSL-ADAEDR----CILNGHNCTFVRDLYKIKPSGNTETLELLLK 404
Query: 256 SFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
F E F + + + I Q + P++ PL+I++PFE N ++ S+
Sbjct: 405 EFFEYFGNFAFNKNSINIRQGKEQNK---------PDSSPLYIQNPFETSLNISKNSSQS 455
Query: 316 NLAKI 320
L K
Sbjct: 456 QLQKF 460
>gi|348578471|ref|XP_003475006.1| PREDICTED: terminal uridylyltransferase 7-like [Cavia porcellus]
Length = 1492
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 127/293 (43%), Gaps = 42/293 (14%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + +L R LR+
Sbjct: 1011 GTKLSLFGSSKNGFGFKQSDLDI--------CMTINGHETAEGLDCVRTIEELARVLRKH 1062
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L S ID R + +
Sbjct: 1063 SGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDLRVKYLCYT 1122
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL--KGVR 219
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G ++ G
Sbjct: 1123 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRSPPVIPVLQEIYKGEKKPEIFVDGWN 1182
Query: 220 ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASEL 271
QI E+ + +Y K N S+ L++ L ++ +S++ L
Sbjct: 1183 IYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL 1235
Query: 272 GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ F QW + + IEDPF+ N +S K I AF
Sbjct: 1236 -LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1276
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 70/161 (43%), Gaps = 12/161 (7%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS S L + D++I I+ I S + +L + LR + + H
Sbjct: 340 YGSSCSRLGFKNSDVNIDIQFP---AIMS-----QPDVLLLVQECLRNSDSFTDVDADFH 391
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVP++ + C +S N + +K L + +++ R +V+ + WAK I+
Sbjct: 392 ARVPVVVCREKQSGLLCKVSAGNENACLTTKHLSILGKLEPRLVPLVIAFRYWAKLCAID 451
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
P+ G Y L+ +F Q +LP +Y G+ ++
Sbjct: 452 RPEEGGLPPYVFCLMAIFFLQQRKEPLLP----VYLGSWIE 488
>gi|334332807|ref|XP_001366240.2| PREDICTED: terminal uridylyltransferase 7 [Monodelphis domestica]
Length = 1469
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 124/298 (41%), Gaps = 52/298 (17%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + DL R L++
Sbjct: 1029 GTKLSLFGSSKNGFGFKQSDLDI--------CMTIDGLETAEGLDCIRMIEDLSRVLKKH 1080
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N +K L S ID R + +
Sbjct: 1081 SGLRNVLPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTKLLAAYSAIDPRVKYLCYT 1140
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLK 216
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P +VD
Sbjct: 1141 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRDPPVIPVLQEIYEEEKRPEIIVD--- 1197
Query: 217 GVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC 274
G +I ++ AF R N S+ L++ L +F E +C
Sbjct: 1198 GWNTYFFDRICDLPAFWPEYGR---------NTESVGELWLGLL-RFYTEEFDFKEHVVC 1247
Query: 275 --------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
F QW + + IEDPF+ N +S K I AF
Sbjct: 1248 IRRKGLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1294
Score = 43.1 bits (100), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 36/191 (18%), Positives = 80/191 (41%), Gaps = 14/191 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R+++ + ++ + L ++ +GS S + DL+I ++
Sbjct: 306 EDLEQRLEIKQTMEKLFH--QKLPDCSLRLYGSSYSRFGFKNSDLNIDVQF--------P 355
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + L+ + + HARVP++ + C +S N + +
Sbjct: 356 VTMTQPDVLLLIQENLKNSESFIDVDADFHARVPVVVCREKQSGLICKVSAGNENACLTT 415
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ + +V+ + WAK ++P G Y +L+ +F Q +LP
Sbjct: 416 NHLAALGKLEPKLVPLVIAFRYWAKLCCTDHPDEGGLPPYVFALMAIFFLQQRKEPVLP- 474
Query: 203 LKDIYPGNLVD 213
+Y G+ ++
Sbjct: 475 ---VYLGSWIE 482
>gi|296420314|ref|XP_002839720.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295635914|emb|CAZ83911.1| unnamed protein product [Tuber melanosporum]
Length = 699
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 151/378 (39%), Gaps = 57/378 (15%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L+ ++I+ P E+ + + + R + + G V PFGS V+ D
Sbjct: 311 LQVFAREIIATATPTPEELAAQNQHLKKCRAICRRI-CPEGELV-PFGSLVTGFAITKSD 368
Query: 67 LDISIELSNGSCISSAGKKVKQS--LLGDLLRALRQKGGYRRLQFVAHARVPILKF---- 120
LD + + S K+ +S L +L + + +G L + RVPILK
Sbjct: 369 LDAVLTSPYPEDLFSTPNKIDESNSLPQNLAKEFQSEGFEATL--LLKTRVPILKLALKA 426
Query: 121 -ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
+ +++CDI +N G ++ L S+ D R R+MVL +K WAK IN+P GT
Sbjct: 427 TDESSFDLNCDIGFNNDLGVHNTRMLQTYSRCDPRVREMVLFIKWWAKRRHINSPYRGTL 486
Query: 180 NSYSLSLLVL-FHFQTCVPAIL------PPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
+SY L+++ F P +L P +D+ P + D+ E QI
Sbjct: 487 SSYGYVLMIIHFLINVVDPPVLINLQNTPIPEDVPPDQIFDE----GGEGEHQIW----- 537
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFS---------------GLSLKASELG----- 272
A+ + + N+ + L SF E +S G E G
Sbjct: 538 -YAKDIENLPKTANQMHVGQLLHSFFEYYSYKFQWGREVISIRTQGGIFSKQEKGWVAAV 596
Query: 273 ICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE--MTHFR 330
I P G+ H + R+L IEDPFE N R + + +I F+ ++ R
Sbjct: 597 IRP--GRSGHTQIKDRYL-----FTIEDPFETSHNVGRTCNPPGVDRIRAEFKRAVSIIR 649
Query: 331 LTSTNQTRYALLSSLARP 348
++ LL A P
Sbjct: 650 FRDGGKSMRELLCQEAPP 667
>gi|402893122|ref|XP_003919690.1| PREDICTED: LOW QUALITY PROTEIN: speckle targeted PIP5K1A-regulated
poly(A) polymerase, partial [Papio anubis]
Length = 580
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 112/259 (43%), Gaps = 38/259 (14%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 344 GDLGKALELAEAPKREKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 401
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL +S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 402 SGLHGDISLSNRLALHNSRFLSLVSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 460
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSD 240
+LLV++ QT P +LP + + + E + E+ C+F R +S
Sbjct: 461 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCSF--PRDASR 507
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWL 290
R N L+ L F S L+ S L + P G WE +R
Sbjct: 508 LERSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG---- 563
Query: 291 PNNHPLFIEDPFEQPENSA 309
P+ ++DPF+ N A
Sbjct: 564 ----PMNLQDPFDLSHNVA 578
>gi|426240881|ref|XP_004014322.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Ovis aries]
Length = 583
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 124/293 (42%), Gaps = 39/293 (13%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
V PFGS V N F + G DLD+ ++L ++ + + Q
Sbjct: 227 CAVRPFGSSV-NSFGKLGCDLDMFLDLDEIGKFTAQKTSGNFLMEFQVKNVPSERVATQK 285
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L Q G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 286 ILSVIGECLDQFGPGCVGVQRILNARCPLVRFSHQASGFQCDLTANNRIALKSSELLYMY 345
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 346 GALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---- 401
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLK 267
D LK + ++ I E R + N +L L F E F +
Sbjct: 402 -----DYLKTLADAEDKCIIEGHNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNFAFN 456
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I R + P + PL I++PFE N ++ VS+ L K
Sbjct: 457 KNSINI-------RQGREQNK--PESSPLHIQNPFETSLNISKNVSQSQLQKF 500
>gi|449690275|ref|XP_002168277.2| PREDICTED: terminal uridylyltransferase 4-like, partial [Hydra
magnipapillata]
Length = 426
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/344 (26%), Positives = 145/344 (42%), Gaps = 61/344 (17%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP---------------FGSFVS 58
+ ML +R W T+ +I + VV + L ++ FGS V+
Sbjct: 58 FMEMLGEIRSSWTTKEFLIKAAKAVVIAETGLNVNWMQKDKFEQAARYSCRLLLFGSCVN 117
Query: 59 NLFSRWGDLDISIELSNGS---------CISSAGKKVKQSLLGDLLRALRQKGGYRRLQF 109
+ DLDIS+ + IS KK+++SL + + ++
Sbjct: 118 GFGFQNSDLDISLCFETDTPPKDFDYQRTISQIEKKLRKSLKSSI---------FYKVDS 168
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
V A+VPI+KF + NI DIS+ N SK L + ID R + M +K +AK
Sbjct: 169 VKSAKVPIVKFCVRNSNIQGDISLYNCLAIANSKLLKTYAMIDTRVKIMGYCIKYFAKIC 228
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
DI + G+ +SY+ LL+L++ Q C P ++P L+++ VD K + +
Sbjct: 229 DIGDASHGSLSSYAYILLMLYYLQHCEPPVIPVLQEL----AVDKKKTFLIDGKDTWFFD 284
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSF----LEKFSGLSL-----KASELGICPFTGQW 280
N+ D Y K N+ +LA L++ F +EKF + + L C +W
Sbjct: 285 DIQNLDTVWKD-YGK-NKQTLAELWIGFFNFYVEKFFFKRFVIAIRQKNSLSKCQ--KEW 340
Query: 281 EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
W N + IEDPF+ N A V + KI + F
Sbjct: 341 --------W---NCSMAIEDPFDLDHNLAAGVKDDMFDKIMSCF 373
>gi|295657484|ref|XP_002789310.1| PAP/25A associated domain family [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226283940|gb|EEH39506.1| PAP/25A associated domain family [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 1104
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 97/196 (49%), Gaps = 14/196 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E + R+K ++ L +++ V FGS + L S D+DI
Sbjct: 120 MRELYDRLLPSEESEQRRLKFVNKLEKLLNKQWPGNNIRVRVFGSSGNKLCSSDSDVDI- 178
Query: 71 IELSNGSCISSAGKKV-KQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K++ K +L D L K G R+ V+HARVPI+K ++C
Sbjct: 179 -------CITTTYKELEKVCMLADFL----AKSGMERVVCVSHARVPIVKIWDPELLLAC 227
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ID R R + +++K W K +N+ GT +SY+ L+
Sbjct: 228 DMNVNNTMALENTRMIRTYVDIDERVRPLAMILKYWTKRRILNDAALGGTLSSYTWICLI 287
Query: 189 LFHFQTCVPAILPPLK 204
+ QT P +LP L+
Sbjct: 288 ISFLQTRNPPVLPSLQ 303
>gi|449020088|dbj|BAM83490.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 666
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 95/191 (49%), Gaps = 12/191 (6%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS-------IELSNG 76
++E R ++ LR V S RG V+ +GS + + + GDLD++ +E+
Sbjct: 169 EYERRSRLARHLRNVASS--RFRGCRVDVYGSTATGVLLKGGDLDVNFVAPMAPLEVLRA 226
Query: 77 SCISS--AGKKVKQSLLGDL-LRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISI 133
+ ++ ++GDL R++ + +Q + RVP++KF + +I D+ +
Sbjct: 227 QYQDEEYSIDDFRRDVVGDLGRLLRRRRHEFVNVQIITQTRVPLVKFHDLRSDIEVDVQV 286
Query: 134 DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
+N + L ++D R R + + +K WA A D+N P GT +SY+ +L++ + Q
Sbjct: 287 NNDFVVRNTALLRAYVRLDPRVRPLAIFIKRWAVARDLNEPFAGTLSSYAYLMLLIQYLQ 346
Query: 194 TCVPAILPPLK 204
P +LP L+
Sbjct: 347 IVNPPVLPCLQ 357
>gi|395514280|ref|XP_003761347.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Sarcophilus
harrisii]
Length = 1531
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 124/296 (41%), Gaps = 48/296 (16%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + +L R L++
Sbjct: 1029 GTKLSLFGSSKNGFGFKQSDLDI--------CMTIDGLETAEGLDCIRMIEELSRVLKKH 1080
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N +K L S ID R + +
Sbjct: 1081 SGLRNVLPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTKLLAAYSAIDPRVKYLCYT 1140
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLK 216
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P +VD
Sbjct: 1141 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYEEEKRPEIIVD--- 1197
Query: 217 GVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC-- 274
G +I+E+ AF N S+ L++ L +F E IC
Sbjct: 1198 GWNTYFFDRISELPAFWPEHGK-------NTESVGELWLGLL-RFYTEEFDFKEHVICIR 1249
Query: 275 ------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
F QW + + IEDPF+ N +S K I AF
Sbjct: 1250 RRSLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1294
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/191 (19%), Positives = 82/191 (42%), Gaps = 14/191 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R+++ + ++ + L ++ +GS S + DL+I ++
Sbjct: 307 EDLEQRLEIKQTMEKLFH--QKLPDCSLRLYGSSYSRFGFKNSDLNIDVQF--------P 356
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + L+ + + HARVP++ + C +S N + +
Sbjct: 357 ATMTQPDVLLLIQEILKSSESFIDIDADFHARVPVVVCREKQSGLICKVSAGNENACLTT 416
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ + +V+ + WAK ++P+ G SY +L+ +F Q +LP
Sbjct: 417 NHLAALGKLEPKLVPLVIAFRYWAKLCCADHPEEGGLPSYVFALMAIFFLQQRKEPVLP- 475
Query: 203 LKDIYPGNLVD 213
+Y G+ +D
Sbjct: 476 ---VYLGSWID 483
>gi|366988339|ref|XP_003673936.1| hypothetical protein NCAS_0A09970 [Naumovozyma castellii CBS 4309]
gi|342299799|emb|CCC67555.1| hypothetical protein NCAS_0A09970 [Naumovozyma castellii CBS 4309]
Length = 683
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 102/199 (51%), Gaps = 16/199 (8%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D + ++P +++ TR + ++ LR+ V E + A++ FGS+ ++L+ D+D ++
Sbjct: 204 DFVSYISPSKDEIHTRNRTLARLRKAVS--EQWKDASLHVFGSYATDLYLPGSDIDCAV- 260
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
IS K ++ L DL ++L+QKG L+ +A ARVPI+KF I D+S
Sbjct: 261 ------ISRNRDKDRRQCLYDLAKSLKQKGLATHLEVIAKARVPIIKFVEPRSKIHIDVS 314
Query: 133 IDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
+ G +K + WI G R++VL++K++ +N G +S+ LV +
Sbjct: 315 FEKTNGAEAAKLIREWIKDTPG-LRELVLVLKQFLAVKKLNEVVNGGLGGFSIICLV-YA 372
Query: 192 FQTCVPAI----LPPLKDI 206
F P I + P+K++
Sbjct: 373 FLRMHPRIKAGEIDPMKNL 391
>gi|410967458|ref|XP_003990236.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 4 [Felis
catus]
Length = 1629
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 125/284 (44%), Gaps = 36/284 (12%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS + R DLDI + L +A K + ++ +L + L++ G R + +
Sbjct: 981 FGSSKNGFGFRDSDLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITT 1037
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+KFE + DIS+ N Q ++ L + ID R + + +K +AK DI
Sbjct: 1038 AKVPIVKFEHRRSGLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIG 1097
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF 232
+ G+ +SY+ L+VL+ Q P ++P L++I+ G + +R + AF
Sbjct: 1098 DASRGSLSSYAYILMVLYFLQQRKPPVIPVLQEIFDGKQI---------PQRMVDGWNAF 1148
Query: 233 NIARFSSDKYR----KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQW 280
+ K R N +L L++ L ++ +S++ +L + F QW
Sbjct: 1149 FFDKTEELKKRLPSLGKNTETLGELWLGLLRFYTEEFDFKEYVISIRQKKL-LTTFEKQW 1207
Query: 281 EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N VS K I AF
Sbjct: 1208 -----------TSKCIAIEDPFDLNHNLGAGVSRKMTNFIMKAF 1240
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/332 (22%), Positives = 133/332 (40%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 364 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 413
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ LL +L L++ Y ++ HA+VP++ + + C +S N + +
Sbjct: 414 PRMNHPDLLIQVLGILKKSVLYIDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACLTT 473
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + + + F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 474 DLLAALGKTEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 533
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 534 LLGSWIEGFDPXRMDDFQLKGIVEEKFVKWEYNSSSATEKNSIAEENKAKADQPKDDTKK 593
Query: 229 ICAFNIARFSSDKYRKI-------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K N+ SL L++ L KF L E IC Q
Sbjct: 594 TETDNQSNAMKEKHGKSPLTLGTPNQVSLGQLWLELL-KFYTLDFALEEYVIC-VRIQDI 651
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 652 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 681
>gi|426384275|ref|XP_004058696.1| PREDICTED: poly(A) RNA polymerase GLD2 [Gorilla gorilla gorilla]
Length = 491
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 107/224 (47%), Gaps = 20/224 (8%)
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRD 157
R G R Q + A+VPI+KF + D++++N+ G I++ FL + ++ R R
Sbjct: 245 RLSGYIERPQLI-RAKVPIVKFRDKVSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRP 302
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDD--L 215
+VL++K+WA H IN+ GT +SYSL L+VL + QT ILP L+ IYP + L
Sbjct: 303 LVLVIKKWASHHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPVKRFSNRRL 362
Query: 216 KGVRANAERQIAEICAFNIARFSSDKYR----KINRSSLAHLFVSFLEKFSGLSLKASEL 271
+ + +R I I F R K ++ +F+ E ++ + E
Sbjct: 363 AILTYSRKRLIVRISDFEAVRNVRGKSHVEEYQLYEFGWNRIFI-LRESWNSQMISVREA 421
Query: 272 GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
P E W N + +E+PF+ N+ARAV EK
Sbjct: 422 KAIPRPDGIE-------W--RNKYICVEEPFDG-TNTARAVHEK 455
>gi|426364354|ref|XP_004049282.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Gorilla gorilla
gorilla]
Length = 469
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 109/238 (45%), Gaps = 28/238 (11%)
Query: 88 QSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
Q +L L L G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 169 QKILSVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLY 228
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKD 205
+D R R +V ++ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 229 IYGALDSRVRALVFSIRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL-- 286
Query: 206 IYPGNLVDDLKGVRANAERQIAEI--CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKF 261
D LK + ++ + E C F +++R + N +L L F E F
Sbjct: 287 -------DSLKTLADAEDKCVIEGNNCTFVRDLSRIKPSQ----NTETLELLLKEFFEYF 335
Query: 262 SGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 336 GNFAFDKNSINI-------RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQSQLQK 384
>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora
B]
Length = 547
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 97/196 (49%), Gaps = 13/196 (6%)
Query: 2 GSYNVLEPILKDILGM---LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVS 58
G NV E + +D+ G ++P ++ E R V+ +R + A V PFGS+ +
Sbjct: 164 GCTNVSEMLHRDVEGFVRYISPTPQEDEVRSLVVELIRRAI--TRQFPDAQVLPFGSYET 221
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPIL 118
L+ GD+D+ I SN S K+++L L LR+ G ++ +A A+VPI+
Sbjct: 222 KLYLPLGDIDLVIH-SNTMAYSD-----KENVLRALANTLRRAGITDNVKIIAKAKVPIV 275
Query: 119 KFETIHQNISCDISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
KF TIH S DISI+ G K + ++S++ R +V +VK + +N TG
Sbjct: 276 KFVTIHGRFSVDISINQGNGVAAGKMINHFLSELPA-LRALVFVVKSFLSQRSMNEVFTG 334
Query: 178 TFNSYSLSLLVLFHFQ 193
SYS+ L + Q
Sbjct: 335 GLGSYSIVCLAISFLQ 350
>gi|395514282|ref|XP_003761348.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Sarcophilus
harrisii]
Length = 1485
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 124/296 (41%), Gaps = 48/296 (16%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + +L R L++
Sbjct: 1029 GTKLSLFGSSKNGFGFKQSDLDI--------CMTIDGLETAEGLDCIRMIEELSRVLKKH 1080
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N +K L S ID R + +
Sbjct: 1081 SGLRNVLPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTKLLAAYSAIDPRVKYLCYT 1140
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLK 216
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P +VD
Sbjct: 1141 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYEEEKRPEIIVD--- 1197
Query: 217 GVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC-- 274
G +I+E+ AF N S+ L++ L +F E IC
Sbjct: 1198 GWNTYFFDRISELPAFWPEHGK-------NTESVGELWLGLL-RFYTEEFDFKEHVICIR 1249
Query: 275 ------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
F QW + + IEDPF+ N +S K I AF
Sbjct: 1250 RRSLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1294
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/191 (19%), Positives = 82/191 (42%), Gaps = 14/191 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R+++ + ++ + L ++ +GS S + DL+I ++
Sbjct: 307 EDLEQRLEIKQTMEKLFH--QKLPDCSLRLYGSSYSRFGFKNSDLNIDVQF--------P 356
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + L+ + + HARVP++ + C +S N + +
Sbjct: 357 ATMTQPDVLLLIQEILKSSESFIDIDADFHARVPVVVCREKQSGLICKVSAGNENACLTT 416
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ + +V+ + WAK ++P+ G SY +L+ +F Q +LP
Sbjct: 417 NHLAALGKLEPKLVPLVIAFRYWAKLCCADHPEEGGLPSYVFALMAIFFLQQRKEPVLP- 475
Query: 203 LKDIYPGNLVD 213
+Y G+ +D
Sbjct: 476 ---VYLGSWID 483
>gi|296218501|ref|XP_002755537.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Callithrix jacchus]
Length = 967
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 113/269 (42%), Gaps = 30/269 (11%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ G L+G +LR G R+Q V AR P++KF
Sbjct: 411 GDLGKALELAEAPKGEKTGGTAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 468
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++D R R +V ++ WA+ ++ ++Y+L
Sbjct: 469 SGLHGDVSLSNRLALHNSRFLSLCSELDDRVRPLVYTLRCWAQGRGLSGSGP-LLSNYAL 527
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+LLV++ QT P +LP V L + E+ + + R +S
Sbjct: 528 TLLVIYFLQTRDPPVLP---------TVSQLTQKAGDGEQVKVDGWDCSFPRDASRLEPS 578
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLPNNH 294
N L++L F S L+ S L + P G WE +R
Sbjct: 579 TNVEPLSYLLAQFFSCVSCWDLRGSLLSLRDGQALPVAGGLPSNFWEGLRLG-------- 630
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 631 PMNLQDPFDLSHNVAANVTSRVAGRLQNC 659
>gi|322800046|gb|EFZ21152.1| hypothetical protein SINV_03493 [Solenopsis invicta]
Length = 642
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 124/293 (42%), Gaps = 45/293 (15%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGK-----KVKQSLLGDLLRALR--QKGGYR 105
FGS ++ L + DLDI +++ K V ++ +R + K +
Sbjct: 113 FGSTLAQLSFKESDLDIYMDVGRIGLHPYYNKPDIPSHVWTPMIFKRVRRVMYSMKTVFS 172
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+ + A+ PI+KF I N+SCD+S N G KS FL++ + D R R ++L++K W
Sbjct: 173 NIISIPKAKTPIIKFRYIPTNVSCDLSFKNSLGIYKSNFLYYCASRDPRLRPLMLIIKYW 232
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
AK I+ G +SY L LL++F+ Q +LPPL D+ + G + N
Sbjct: 233 AKHFGISG--IGRISSYGLILLIIFYLQQESVGLLPPLLDLQRTCEPQIMNGWQINFNEH 290
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW----- 280
A N N ++L H F SF +F + IC G+
Sbjct: 291 TALPPITN----------NCNIATLLHKFFSFYGEFD-----FNSCVICLLDGKTYSTPD 335
Query: 281 ---------------EHIRSN-TRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
++ N T+ L P+ ++DP E +N+A S++ L
Sbjct: 336 FLQLDKLPNYMDLYKNYVTDNSTKKLDVQKPICLQDPIELNQNTAANTSDRAL 388
>gi|365758533|gb|EHN00370.1| Pap2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 514
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R + IS +RE ++ + A + FGS+ ++L+ D+D
Sbjct: 185 IKDFVAYISPSREEIEIRNQTISTIREALKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 242
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++ ++ VA ARVPI+KF H I D
Sbjct: 243 VN-------SELGGKESRNNLYSLASHLKKNNLATEIEVVAKARVPIIKFVEPHSRIHID 295
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W++ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 296 VSFERTNGLEAAKLIREWLNDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 353
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 354 FSFLHMHPRII 364
>gi|355727116|gb|AES09087.1| terminal uridylyl transferase 1, U6 snRNA-specific [Mustela
putorius furo]
Length = 877
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 118/272 (43%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 325 GDLGKALELAEALRGEKTEGVAMLELVGSILRGCVP--GVYRVQTVPTARRPVVKFCHRP 382
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 383 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSGSGP-LLSNYAL 441
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 442 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 489
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----QWEHIRSNTRWLP 291
N+ L+ L F S L+ S L + P G +WE +R
Sbjct: 490 EPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSDRWEGLRLG----- 544
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N+
Sbjct: 545 ---PMNLQDPFDLSHNVAANVTSRVAGRLQNS 573
>gi|340382691|ref|XP_003389852.1| PREDICTED: terminal uridylyltransferase 7-like [Amphimedon
queenslandica]
Length = 913
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 152/345 (44%), Gaps = 36/345 (10%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
ET+ + SD+R++ + +++E FGS + DLD+ + + + +
Sbjct: 591 ETKELIQSDIRKLYAN------SSLELFGSSANGFGHSKSDLDLCLIMEDDE------QT 638
Query: 86 VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
K ++ DL+ +L+ YRR+ + ARVPI+K NI DIS+ N + L
Sbjct: 639 DKVQIIEDLVESLKADVKYRRVVGIKTARVPIVKLTISRCNIDADISLLNSLALHNTNML 698
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK- 204
+ ID R + + ++K +AK D+ + +G+ +SY+ ++++ + Q +LP L+
Sbjct: 699 AAYNDIDERLQTLGFILKYFAKVCDMCDASSGSISSYAFIIMMIHYLQQLPIPVLPVLQQ 758
Query: 205 --DIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
D G +V+ R + E+ K + NR S+A L++ FL+ +
Sbjct: 759 LGDRSVGPVVNGWNCYYFKDIRNLYEVW----------KPVERNRMSVAELWIGFLKYY- 807
Query: 263 GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
++ L Q + + +W + H + IEDPF N +AVS + +
Sbjct: 808 --AMDFDWLTDVVTIKQLDKLTKFKKWWTSKH-VAIEDPFNLEHNLGQAVSGRMRTYMLM 864
Query: 323 AFEMTHFRLTSTNQTRYALLSSLARPFIL-----QFFGESPVRYA 362
F+ + T ++ L L F+L Q PVR A
Sbjct: 865 RFQRAYKHHTKPKHWKH--LDELINEFVLCEDTEQSLVVPPVRTA 907
>gi|344239136|gb|EGV95239.1| Poly(A) RNA polymerase, mitochondrial [Cricetulus griseus]
Length = 315
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 103/219 (47%), Gaps = 29/219 (13%)
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+Q + +AR P+L+F CD++++N S+ L+ +D R R +V V+ WA
Sbjct: 36 VQKILNARCPLLRFSHQASGFQCDLTVNNSIALKSSELLYIYGSLDSRVRALVFSVRCWA 95
Query: 167 KAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
+AH + + GT+ ++SL+++V+F Q P ILP L D LK + A+AE +
Sbjct: 96 RAHSLTSSIPGTWITNFSLTVMVIFFLQRRSPPILPTL---------DSLKSL-ADAEDR 145
Query: 226 IAEICAFNI--ARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW 280
C N F D Y+ N +L L F E F + + + I Q
Sbjct: 146 ----CILNGHNCTFVRDLYKIKPSGNTETLELLLNEFFEYFGNFAFNKNSINIRQGKEQN 201
Query: 281 EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ P++ PL+I++PFE N ++ S+ L K
Sbjct: 202 K---------PDSSPLYIQNPFETSLNISKNSSQSQLQK 231
>gi|449278689|gb|EMC86480.1| Poly(A) RNA polymerase GLD2 [Columba livia]
Length = 505
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/231 (29%), Positives = 112/231 (48%), Gaps = 41/231 (17%)
Query: 110 VAHARVPILKFETIHQNI---SC---DISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLV 162
+ A+VPI+KF + SC D++++N+ G I++ FL + I+ R R +VL+V
Sbjct: 274 LIQAKVPIVKFRDKFSFLFPNSCVDFDLNVNNVVG-IRNTFLLRTYAYIESRVRPLVLVV 332
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
K+WA+ HDIN+ GT +SYSL L+VL + QT +LP L+ YP + + +
Sbjct: 333 KKWARFHDINDASRGTLSSYSLVLMVLHYLQTLPEPVLPSLQKNYPESFDPTM---HLHL 389
Query: 223 ERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGIC 274
QI ++ Y N SSL L + F + ++ +S++ ++
Sbjct: 390 VHQIP---------YTIPPYLSRNGSSLGDLLIGFFKYYATEFDWSRQMISVREAKAVPR 440
Query: 275 PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE-KNLAKISNAF 324
P +W N + +E+PF+ N+ARAV E K I N F
Sbjct: 441 PDGIEWR-----------NKFICVEEPFDG-TNTARAVHEKKKFDTIKNEF 479
>gi|357629675|gb|EHJ78294.1| putative terminal uridylyl transferase 1, U6 snRNA-specific-like
protein [Danaus plexippus]
Length = 684
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 133/322 (41%), Gaps = 62/322 (19%)
Query: 31 VISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL 90
+ SDL++ + ++ G PFGS + L I+ S+ C S G +
Sbjct: 155 LYSDLQDALRTL--WPGCVATPFGSITTGL---------GIKSSDADCFVSLGTERITDA 203
Query: 91 LGDLLRAL-RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
+G RAL R+ + + + A PI+KF + +CD++ G S+ + ++
Sbjct: 204 VGRAKRALLREPRLFAEVLAIPQAHTPIVKFFHVPTGTNCDVTFKTPLGTYNSRLVSFML 263
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
D R + +LVK WAK H + +G +Y+L++++LF+ Q ++LP ++ + G
Sbjct: 264 HADPRLVPLAVLVKYWAKVHGFSG--SGRLTNYALTVMILFYLQQPPVSVLPSVRSLQEG 321
Query: 210 --NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSG 263
+VD +N+A D+ ++ N SS+ L F + +S
Sbjct: 322 FDQIVD-----------------GWNVA--FDDRLDRLPASTNTSSIPELLGGFFQYYST 362
Query: 264 LSLKASELGICPFTGQWEHIRSNTRW--LPNNHPLF-------------------IEDPF 302
L ICP+ G+ S R LP L+ ++DPF
Sbjct: 363 FDF--DRLVICPYLGRPITKESFKRLSSLPPEMSLYRRNLESGAAGAMRFTTSICVQDPF 420
Query: 303 EQPENSARAVSEKNLAKISNAF 324
E N A VS + ++ F
Sbjct: 421 ELCHNVASCVSSRLYEEVQAYF 442
>gi|308505938|ref|XP_003115152.1| hypothetical protein CRE_28469 [Caenorhabditis remanei]
gi|308259334|gb|EFP03287.1| hypothetical protein CRE_28469 [Caenorhabditis remanei]
Length = 549
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 134/279 (48%), Gaps = 34/279 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGK--KVKQSLLGD-------LLRA-LRQKGG 103
GSF + + + DLD +I+ S S SS K K+K +G+ ++RA +R K
Sbjct: 164 GSFAAGVDTFKSDLDFTIKTSRWSEESSFQKLMKIKGFFIGNSLFKTGRVVRARVRTKVN 223
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
+ + V+ + P+LK + ++ D+++DN + ++ L W SQ+D RF + +K
Sbjct: 224 HMEINHVSF-QTPVLKLVHLETDVEIDVTMDNEDSKRNTQLLSWYSQMDNRFSKLCRAIK 282
Query: 164 EWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG-----NLVDDLKGV 218
WA I K G NS+S+ L+++ + QT ILP +++ +P + DD G
Sbjct: 283 GWASESGIEGAKNGRLNSFSICLMLIQYLQTL--NILPNIQEFFPELNGPIEIEDDNYG- 339
Query: 219 RANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICP--- 275
R + +++I E K+ + N SL+ L+ FL+ ++ + S + +
Sbjct: 340 RRDMKKEIQE---------RGYKFEE-NEKSLSDLYFGFLKFYAEFNFDKSWISVKNGKI 389
Query: 276 FTGQWEHIRSNTRWLPNNHP-LFIEDPF-EQPENSARAV 312
+++ LP++H + +EDPF P N +V
Sbjct: 390 MEKRFDETEKPLDGLPDSHHFIVVEDPFLTTPRNCGGSV 428
>gi|122692425|ref|NP_001073791.1| speckle targeted PIP5K1A-regulated poly(A) polymerase [Bos taurus]
gi|118595568|sp|Q1JPD6.1|STPAP_BOVIN RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|95768664|gb|ABF57373.1| RNA binding motif protein 21 [Bos taurus]
Length = 871
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 117/269 (43%), Gaps = 32/269 (11%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GD ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDQGKAVELAEALKGEKAEGGAMLELVGSILRGCVP--GVYRVQTVPSARCPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL S++DGR R +V ++ WA+ ++ N+Y+L
Sbjct: 376 SGLHGDISLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLNNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + A Q+ E+ ++ + F D R
Sbjct: 435 TLLVIYFLQTRDPPVLPTVSQLT------------QKAGEQV-EVDGWDCS-FPRDASRL 480
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN-------H 294
N+ L+ L F S L+ S L + GQ + LP+N
Sbjct: 481 EPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSL--REGQALSVAGG---LPSNLSEGLRLG 535
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 536 PMNLQDPFDLSHNVAANVTSRVAGRLQNC 564
>gi|91085789|ref|XP_974515.1| PREDICTED: similar to CG11418 CG11418-PA [Tribolium castaneum]
Length = 581
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 137/327 (41%), Gaps = 58/327 (17%)
Query: 24 DWETRMKVISDLREVVESVESLR-GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
D TR++ ++ R+V +V + A PFGS V+ DLD+ + L C
Sbjct: 188 DLGTRLRFLTA-RQVENAVRGMFPKAKAYPFGSSVNGYGKMGCDLDLVLRL----CDDKV 242
Query: 83 GKKVK-----------------------QSLLGDLLRALRQKGGYRRLQFVAHARVPILK 119
GK VK +GDLL+ G +++ + ARVPI+K
Sbjct: 243 GKLVKNDARLMFHCKGLVGSERTASQRNMEAIGDLLQLFLP--GCSQVRRILQARVPIIK 300
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
+ ++ CD+S+ N+ G S FL+ + +D R R +V +++WA + N G +
Sbjct: 301 YYQQLTDVECDLSMANMSGVHMSDFLYIMGSLDARIRPLVFTIRKWASEIGLTNSSPGRW 360
Query: 180 -NSYSLSLLVLFHFQTCVPA--ILPPLKDIY----PGNLVDDLKGVRANAERQIAEICAF 232
++SL+LLVL Q + + ILP L + P + G+ R I ++
Sbjct: 361 ITNFSLTLLVLAFLQKPINSKPILPSLNTLVKLAEPKDSYMTEDGINCTFLRDITKL--- 417
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPN 292
K N+ SL L V F E +S + L + + P
Sbjct: 418 --------KTPTENKESLETLLVEFFEFYSQFDFASKALCLNESVAITK---------PE 460
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAK 319
+ L+I +P E+ N ++ VS + L +
Sbjct: 461 HCALYIVNPLERGLNVSKNVSMEELDR 487
>gi|149743481|ref|XP_001493802.1| PREDICTED: poly(A) RNA polymerase, mitochondrial [Equus caballus]
Length = 584
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 129/306 (42%), Gaps = 48/306 (15%)
Query: 42 VESLRGA-----TVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-------------- 81
+E + GA V PFGS V N F + G DLD+ ++L S+
Sbjct: 216 IEDVAGAYFPDCAVRPFGSSV-NSFGKLGCDLDMFLDLDEIGKFSAHKTSGNFLMEFQVK 274
Query: 82 ---AGKKVKQSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
+ + Q +L + L Q G G +Q + +AR P+++F CD++ +N
Sbjct: 275 NVPSERIATQKILSVIGECLDQFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRI 334
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCV 196
S+ L+ +D R R +V V+ WA+AH + + G + ++SL+++V+F Q
Sbjct: 335 ALKSSELLYIYGALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRS 394
Query: 197 PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLF 254
P ILP L Y NL D C F ++ R + N +L L
Sbjct: 395 PPILPTLD--YLENLADAEDKCVIEGHN-----CTFIRDLNRIKPSE----NTETLELLL 443
Query: 255 VSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
F E F + + + I R + P + PL I++PFE N ++ V++
Sbjct: 444 KEFFEYFGNFAFNKNSINI-------RQGREQNK--PESSPLHIQNPFETSLNISKNVTQ 494
Query: 315 KNLAKI 320
L K
Sbjct: 495 SQLQKF 500
>gi|86196877|gb|EAQ71515.1| hypothetical protein MGCH7_ch7g922 [Magnaporthe oryzae 70-15]
gi|440472437|gb|ELQ41297.1| caffeine-induced death protein 1 [Magnaporthe oryzae Y34]
gi|440484284|gb|ELQ64373.1| caffeine-induced death protein 1 [Magnaporthe oryzae P131]
Length = 1474
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 100/221 (45%), Gaps = 22/221 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P + R K+I L ++ V FGS + L S D+DI
Sbjct: 432 MRELFDRLKPTEKVKANRDKLIKKLEKMFNDQWPGHSIKVHLFGSSGNKLCSDDSDVDI- 490
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI++ K+++ + + L QK G ++ V+ A+VPI+K ++CD
Sbjct: 491 -------CITTDWKELENVCM---IAQLLQKRGMEKVVCVSSAKVPIVKIWDPELGLACD 540
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + +ID R R + ++VK W + IN+ GT +SY+ +++
Sbjct: 541 MNVNNTLALENTRMVLTYVEIDERVRTLAMIVKHWTRRRTINDAAFGGTLSSYTWICMII 600
Query: 190 FHFQTCVPAILPPL----------KDIYPGNLVDDLKGVRA 220
Q P ILP L KD P DDL +R
Sbjct: 601 AFLQLRDPPILPALHQNPHKKQTSKDGQPSEFADDLTKLRG 641
>gi|23272242|gb|AAH23880.1| Zcchc6 protein [Mus musculus]
Length = 1027
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 531 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 588
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 589 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 640
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 641 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 700
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 701 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 730
>gi|398016899|ref|XP_003861637.1| DNA polymerase sigma-like protein [Leishmania donovani]
gi|322499864|emb|CBZ34937.1| DNA polymerase sigma-like protein [Leishmania donovani]
Length = 599
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/394 (24%), Positives = 157/394 (39%), Gaps = 88/394 (22%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL---------- 60
L ++L L+P ED ET+++VI D+R ++ G ++ +GS + L
Sbjct: 248 LIELLYCLSPTSEDRETKLRVIDDIRTTMQRA----GMDIQIYGSLCTGLVIPASDVDCV 303
Query: 61 FSRWGDLDISIELS-NGSC----ISSA--GKKVKQSLLGDLLRA-------LRQKGGYRR 106
R GD I+ +S N SC I+SA G +SL G L A +R+ +
Sbjct: 304 LMRSGDEQIASAMSANLSCAMLTIASAATGSVPPKSLKGPLSTAVRIVAERMRKSQSFIH 363
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR--FRDMVLLVKE 164
+ +AHA+VPI+K ++ D+S + G + S +L + G R +++LVK
Sbjct: 364 VTSIAHAKVPIVKCRHRRDDVKVDLSFEQ-SGCVSSNYLCELLCAPGNEMARPLIVLVKA 422
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAER 224
++ P G S+ +SLLVL++ Q CV
Sbjct: 423 LVNNCGLDEPSMGGLGSFPISLLVLWYLQQCV---------------------------- 454
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR 284
RFS++ R S+ L FL K+ G LGI ++++
Sbjct: 455 ---------RTRFSAELQR-----SIGALLAGFL-KYYGTEFDFRRLGI-------DYVQ 492
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVS--EKNLAKISNAFEMTHFRLTSTNQTRYALL 342
T P L+I +P N A+A + + + T L N + +
Sbjct: 493 QKTFTKPPADELYIVNPIRPETNCAKAATLFATRVMPLFQRASATFVGLLDANASPATME 552
Query: 343 SSLARPFILQFFGESPVRYANYNNGHRRARPQSH 376
S L L +F ++ N+ + RRA + H
Sbjct: 553 SQL-----LHYFAKATSDVRNWRDVSRRAAREPH 581
>gi|410974248|ref|XP_003993559.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Felis catus]
Length = 873
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 118/272 (43%), Gaps = 37/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
GDL ++EL+ V L+G +LR G R+Q V AR P++KF
Sbjct: 317 GDLGKALELAEALKEGEKTDGVAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHR 374
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+
Sbjct: 375 PSGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYA 433
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR 243
L+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 434 LTLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASR 481
Query: 244 ---KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----QWEHIRSNTRWL 290
N+ L+ L F S L+ S L + P G +WE +R
Sbjct: 482 LEPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNRWEGLRLG---- 537
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
P+ ++DPF+ N A V+ + ++ N
Sbjct: 538 ----PMNLQDPFDLSHNVAANVTSRVAGRLQN 565
>gi|315614514|gb|ADU33129.1| mitochondrial poly(A) RNA polymerase [Sus scrofa]
Length = 581
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 129/295 (43%), Gaps = 41/295 (13%)
Query: 47 GATVEPFGSFVSNLFSRWG-DLDISI---ELSNGSCISSAG--------------KKVKQ 88
G V PFGS V N F + G DLD+ + E+ N S ++G + V Q
Sbjct: 226 GCAVRPFGSSV-NSFGKLGCDLDMFLDLDEIGNFSAQKASGNFLMEFQVKNVPSERIVTQ 284
Query: 89 SLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDN-LCGQIKSKFLF 146
+L + L G G +Q + +AR P+++F CD++ +N + ++ S F+
Sbjct: 285 KILSVIGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALKVLSCFIL 344
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKD 205
+ +D R R +V ++ WA+ H + + G + ++SL+++V+F Q P ILP L
Sbjct: 345 Y-GALDSRVRALVFSIRCWARVHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL-- 401
Query: 206 IYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
D LK + ++ I E R + N SL L F E F +
Sbjct: 402 -------DSLKSLADAEDKCIIEGHNCTFVRDLNKIKPSGNTESLELLLKEFFEYFGNFA 454
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + I + + P + PL I++PFE N ++ VS+ L K
Sbjct: 455 FNKNSINI---------RQGGEQNKPESSPLHIQNPFETSLNISKNVSQSQLQKF 500
>gi|405976720|gb|EKC41216.1| Terminal uridylyltransferase 4 [Crassostrea gigas]
Length = 1168
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 140/332 (42%), Gaps = 40/332 (12%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L +LK + P E+ R + +L + ++ E A +E FGS + R
Sbjct: 605 ILTEVLKQVPKDFAPSGEEIRDRENIRWELEQFIQ--ELYPTARLEMFGSSNNGFGFRHS 662
Query: 66 DLDISIELSN---------GSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVP 116
DLD+ + S+ CI KK+K KG Y + A+VP
Sbjct: 663 DLDLCMTFSDLPVPENLDYVDCIEKITKKLKT-----------HKGLYNVFP-ITTAKVP 710
Query: 117 ILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
I+KF+ + DIS+ NL ++ + S++DGR + + K +AK +I +
Sbjct: 711 IIKFKHRRSQLEGDISLYNLLALHNTRMINLYSELDGRVKVLGYAFKVFAKICEIGDASR 770
Query: 177 GTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
G+ +SY+ L+++++ Q C P +LP L++++P + ER + A+ +
Sbjct: 771 GSLSSYAYILMLIYYLQQCNPPVLPVLQELHPES---------EKPERIVEGWNAWYMDN 821
Query: 237 FSSD----KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPN 292
++ + N +S+ L+ KF K E +C Q R W N
Sbjct: 822 TAALPKLWPHCGKNSASVGELWTGLF-KFYTEEFKIDEYVVC-IRQQEPLTRFEKLW--N 877
Query: 293 NHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ IEDPF+ N +S K I AF
Sbjct: 878 GKCVAIEDPFDLNHNLGGGLSRKMHQYIIKAF 909
Score = 41.2 bits (95), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/149 (21%), Positives = 71/149 (47%), Gaps = 12/149 (8%)
Query: 26 ETRMKVISDLREVVESV----ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
E ++ SDL + +E E L+ + +GS +S + D++I + + + + +
Sbjct: 100 EEEVRYRSDLTKALEDKLTKDEHLKDIKLVLYGSSLSATGIKDSDVNIDLVVPHKA---N 156
Query: 82 AGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIK 141
K + Q+ +++ L + Y+ +Q ++VP + F + C ++I + Q
Sbjct: 157 HAKALMQAF--KIMKTLEE---YKDVQSQFSSKVPCVLFTDQVHGLRCQLTIGSDLAQET 211
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
S L S+ D RF+ + ++ + WAK ++
Sbjct: 212 SHLLLMYSRCDPRFKKLAVIFRYWAKTYE 240
>gi|351696760|gb|EHA99678.1| Terminal uridylyltransferase 7, partial [Heterocephalus glaber]
Length = 1481
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1032 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGHETAEGLDC 1083
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1084 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLCA 1143
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1144 YSAIDLRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRSPPVIPVLQEIY 1203
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 1204 RGEKKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1256
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1257 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1304
Query: 318 AKISNAF 324
I AF
Sbjct: 1305 NFIMKAF 1311
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 86/191 (45%), Gaps = 14/191 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
E+ E R+++ + +V + LR ++ +GS S L + D++I I+ I S
Sbjct: 311 ENLEQRLEIKRIMEDVFQ--HKLRDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIMS- 364
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + L+ + + HARVP++ + C +S N + +
Sbjct: 365 ----QPDVLLLVQECLKNSDAFTDVDADFHARVPVVVCREKQSGLLCKVSAGNESACLTT 420
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +LP
Sbjct: 421 KHLSILGKLEPKLVPLVIAFRYWAKLCSIDCPEEGGLPPYVFALMAVFFLQQRKEPLLP- 479
Query: 203 LKDIYPGNLVD 213
+Y G+ VD
Sbjct: 480 ---VYLGSWVD 487
>gi|259016375|sp|Q5BLK4.3|TUT7_MOUSE RecName: Full=Terminal uridylyltransferase 7; Short=TUTase 7;
AltName: Full=Zinc finger CCHC domain-containing protein
6
Length = 1491
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 995 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1052
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1053 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1104
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1105 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1164
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1165 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 1194
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 318 EIKRVMESVFRHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS------QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDSFIDVDADFHARVPVVVCRDKQSGLLCKVSAGNENAWLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSW 485
Query: 212 VDDL 215
+++
Sbjct: 486 IEEF 489
>gi|296471659|tpg|DAA13774.1| TPA: U6 snRNA-specific terminal uridylyltransferase 1 [Bos taurus]
Length = 871
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 117/269 (43%), Gaps = 32/269 (11%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GD ++EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDQGKAVELAEALKGEKAEGGAMLELVGSILRGCVP--GVYRVQTVPSARCPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL S++DGR R +V ++ WA+ ++ N+Y+L
Sbjct: 376 SGLHGDISLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLNNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + A Q+ E+ ++ + F D R
Sbjct: 435 TLLVIYFLQTRDPPVLPTVSQLT------------QKAGEQV-EVDGWDCS-FPRDASRL 480
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN-------H 294
N+ L+ L F S L+ S L + GQ + LP+N
Sbjct: 481 EPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSL--REGQALSVAGG---LPSNLSEGLRLG 535
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 536 PMNLQDPFDLSHNVAANVTSRVAGRLQNC 564
>gi|50302781|ref|XP_451327.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640458|emb|CAH02915.1| KLLA0A07359p [Kluyveromyces lactis]
Length = 684
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 104/199 (52%), Gaps = 14/199 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P R++ E R + I+ L+E V VE +++ FGS+ ++L+ D+D
Sbjct: 195 IKDFVSYISPNRQEIEQRNQAIAKLKEAV--VELWPDSSLNCFGSYATDLYLPGSDIDCV 252
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S++G K ++ L L L++K +++ +A ARVPI+KF I D
Sbjct: 253 VR-------SASGDKENRNALYSLASFLKRKQLATQVEVIAKARVPIIKFVEPESKIHID 305
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G ++ + W+ + G R++VL+VK++ A +NN TG YS+ LV
Sbjct: 306 VSFERTNGLEAARVIRGWLEEQPG-LRELVLIVKQFLHARRLNNVHTGGLGGYSIICLV- 363
Query: 190 FHFQTCVPAILPPLKDIYP 208
+ F P +L DI P
Sbjct: 364 YTFLKLHPRVL--TGDIDP 380
>gi|410896061|ref|XP_003961518.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Takifugu rubripes]
Length = 796
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/318 (27%), Positives = 130/318 (40%), Gaps = 67/318 (21%)
Query: 52 PFGSFVSNLFSRWGDLDISIELSN---------------GSCISSAGKKVKQSLLGDLLR 96
PFGS V+ DLD+ ++L N G +S G+ + S+L D+
Sbjct: 201 PFGSSVNTFGIHSCDLDLFLDLENTKVFQAHAKSTTGQTGEGMSDDGRS-EDSMLSDI-- 257
Query: 97 ALRQKGGYRRLQFVA-----------------HARVPILKFETIHQNISCDISIDNLCGQ 139
L L VA AR+P++KF N+ DI+ +N
Sbjct: 258 DLSTATPAEVLDLVAAILKRCVPSVHKVHVVSVARLPVVKFHHRELNLQGDITTNNRLAV 317
Query: 140 IKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGT---FNSYSLSLLVLFHFQTCV 196
++FL S+ID R R +V ++ WAK + +GT N+Y+L+LLV+F Q C
Sbjct: 318 RNTRFLQLCSEIDERLRPLVYTIRCWAKQKQLAGNPSGTGPLLNNYALTLLVIFFLQNCD 377
Query: 197 PAILPPLKDIYPGNLVDDLKGVRANAERQIAEI--CAFNIARFSSDKYRKINRSSLAHL- 253
P +LP VD LK + E + E C F + + NR L L
Sbjct: 378 PPVLP---------TVDQLKAMACEEEECVIEGWNCTFPSQAIAVPPSK--NRQDLCTLL 426
Query: 254 --FVSFLEKF----SGLSLKASE-LGICPFTGQWEHIRSNTRWLPNN--------HPLFI 298
F +F KF S +SL+ L I F Q + + PN PL +
Sbjct: 427 AGFFNFYAKFDFASSVISLREGRALPITDFLKQNKDEEAMGEETPNTGMHHGPKLGPLNL 486
Query: 299 EDPFEQPENSARAVSEKN 316
DPFE N A ++E++
Sbjct: 487 LDPFELSHNVAGNLNERS 504
>gi|148709347|gb|EDL41293.1| zinc finger, CCHC domain containing 6, isoform CRA_b [Mus musculus]
Length = 1484
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 1005 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1062
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1063 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1114
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1115 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1174
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1175 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 1204
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 328 EIKRVMESVFRHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS------QPDVL 379
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 380 LLVQECLKNSDSFIDVDADFHARVPVVVCRDKQSGLLCKVSAGNENAWLTTKHLTALGKL 439
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 440 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSW 495
Query: 212 VDDL 215
+++
Sbjct: 496 IEEF 499
>gi|148709346|gb|EDL41292.1| zinc finger, CCHC domain containing 6, isoform CRA_a [Mus musculus]
Length = 1534
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 995 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1052
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1053 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1104
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1105 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1164
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1165 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 1194
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 318 EIKRVMESVFRHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS------QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDSFIDVDADFHARVPVVVCRDKQSGLLCKVSAGNENAWLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSW 485
Query: 212 VDDL 215
+++
Sbjct: 486 IEEF 489
>gi|195131881|ref|XP_002010373.1| GI15889 [Drosophila mojavensis]
gi|193908823|gb|EDW07690.1| GI15889 [Drosophila mojavensis]
Length = 550
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/97 (43%), Positives = 61/97 (62%), Gaps = 1/97 (1%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPIL+F+ I D++ +N G + L +Q+D R R +V++VK WA+ HDIN
Sbjct: 239 ARVPILRFKDRINGIEVDLNYNNSVGIKNTYLLQLYAQLDWRTRPLVVIVKLWAQYHDIN 298
Query: 173 NPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDIYP 208
+ K T +SYSL L+VL + Q C P +LP L+ +YP
Sbjct: 299 DAKRMTVSSYSLVLMVLHYLQYGCTPHVLPCLQALYP 335
>gi|395859961|ref|XP_003802291.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Otolemur
garnettii]
gi|395859963|ref|XP_003802292.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Otolemur
garnettii]
Length = 1496
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 142/335 (42%), Gaps = 44/335 (13%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L + + G + FGS + +
Sbjct: 1000 NILDQVCIQCYKDFSPTISEDQAREHIRQNLESFIR--QDFPGTKLSLFGSSKNGFGFKQ 1057
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1058 SDLDV--------CMTINGLETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1109
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1110 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1169
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL--KGVRANAERQIAEICAFNIARF 237
+SY+ +L+VL+ Q P ++P L++IY G ++ G QI E+ +
Sbjct: 1170 SSYAYTLMVLYFLQQRNPPVIPVLQEIYKGEKKPEIFVDGWNIYFFDQIDELPTY----- 1224
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRW 289
+Y K N S+ L++ L ++ +S++ L + F QW
Sbjct: 1225 -WPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL-LTTFKKQW--------- 1272
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N +S K I AF
Sbjct: 1273 --TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1305
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/182 (21%), Positives = 82/182 (45%), Gaps = 14/182 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ ++E V L ++ +GS S+L R D++I I+ I S + +L
Sbjct: 318 EIKCIMEKVFQHKLPDCSLRLYGSSCSSLGFRNSDVNIDIQFP---AIMS-----QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ + +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLLP----VYLGSW 485
Query: 212 VD 213
++
Sbjct: 486 IE 487
>gi|301610981|ref|XP_002935025.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 7
[Xenopus (Silurana) tropicalis]
Length = 1437
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 99/209 (47%), Gaps = 15/209 (7%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + +P + + R + DL + ++ GA++ FGS + +
Sbjct: 944 ILDAVCIQCYEDFSPTALEDKAREHIRQDLEDFIK--RDFSGASLTLFGSSKNGFGFKQS 1001
Query: 66 DLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILKF 120
DLDI C++ G + + L + DL R LR+ G R + + A+VPI+KF
Sbjct: 1002 DLDI--------CMTIDGLETAEELDSIRTIEDLARLLRKHQGLRNILPITTAKVPIVKF 1053
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ + DIS+ N ++ L + ID R + ++K + K DI + G+ +
Sbjct: 1054 YHVRSGLEGDISLYNTLALHNTRLLASFAAIDPRVTYLCYIMKVFTKMCDIGDASRGSLS 1113
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
SY+ +L+VL+ Q P ++P L++I G
Sbjct: 1114 SYAYTLMVLYFLQQRNPPVIPVLQEICKG 1142
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/183 (22%), Positives = 81/183 (44%), Gaps = 17/183 (9%)
Query: 38 VVESVESL-----RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLG 92
+VE++E+L G ++ +GS + + DL+I I+ V++SL
Sbjct: 263 IVEAMENLIQKKLPGCSLRLYGSSWTRFGFKNSDLNIDIQFPINMNQPDVLLLVQESL-- 320
Query: 93 DLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQID 152
+Q + + HARVP++ ++ C +S N + S + + +++
Sbjct: 321 ------KQSDLFTDFEADFHARVPVVVCREKQSSLLCKVSAGNENACLTSNLMAALGKLE 374
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLV 212
R +V+ + WAK I+ P+ G Y L+L+ +F Q +LP +Y G +
Sbjct: 375 PRLLSLVVAFRYWAKLCCIDKPEEGGLPPYVLALMAIFFLQQRKQPVLP----VYLGAWI 430
Query: 213 DDL 215
+D
Sbjct: 431 EDF 433
>gi|432089506|gb|ELK23447.1| Speckle targeted PIP5K1A-regulated poly(A) polymerase [Myotis
davidii]
Length = 1000
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 121/275 (44%), Gaps = 44/275 (16%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQS----LLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
G+L ++EL+ + G+K +Q L+G +LR G R+Q V AR P++KF
Sbjct: 487 GELGKALELAE----APKGEKTEQGAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKF 540
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ D+S+ N S+FL S++D R R +V ++ WA+ + N
Sbjct: 541 CHRPSGLHGDVSLSNRLALHNSRFLSLCSELDERVRPLVYTLRCWAQGRGLTG-SGPLLN 599
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE---ICAFNIARF 237
+Y+L+LLV++ QT P +LP + + +A Q+A C+F R
Sbjct: 600 NYALTLLVIYFLQTRDPPVLPTVSQLT----------QKAGEGEQVAVDGWDCSF--PRD 647
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----QWEHIRSNT 287
+S N+ L+ L F S L+ S L + P G +WE +R
Sbjct: 648 ASRLEPSANKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGDLPSNRWEGLRLG- 706
Query: 288 RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
P+ ++DPF+ N A V+ + ++ N
Sbjct: 707 -------PMNLQDPFDLSHNVAANVTSRVAGRLQN 734
>gi|254588108|ref|NP_705766.3| terminal uridylyltransferase 7 [Mus musculus]
Length = 1474
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 995 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1052
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1053 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1104
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1105 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1164
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1165 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 1194
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 318 EIKRVMESVFRHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS------QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDSFIDVDADFHARVPVVVCRDKQSGLLCKVSAGNENAWLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSW 485
Query: 212 VDDL 215
+++
Sbjct: 486 IEEF 489
>gi|212530714|ref|XP_002145514.1| PAP/25A associated domain family [Talaromyces marneffei ATCC 18224]
gi|210074912|gb|EEA28999.1| PAP/25A associated domain family [Talaromyces marneffei ATCC 18224]
Length = 1059
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 129/298 (43%), Gaps = 24/298 (8%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D+ L P E E R +++ L ++ V FGS + L + D+DI
Sbjct: 122 MQDLYEQLLPSAESDERRRQLVQKLEKLFNEQWPGNNIDVHVFGSSGNKLCTSDSDVDI- 180
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI+++ K+++ L L L Q G R+ V+HARVPI+K ++CD
Sbjct: 181 -------CITTSFKQLENVCL--LAEVLAQHG-MERVVCVSHARVPIVKIWDPQLKMACD 230
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + ID R R + +++K W K +N+ GT +SY+ L++
Sbjct: 231 MNVNNTLALENTRMIRTYVDIDERVRPLAMIIKHWTKRRVLNDAALGGTLSSYTWICLII 290
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
QT P ILP L+ P + GV+ + + + + + A N S
Sbjct: 291 NFLQTRDPPILPSLQQ-RPHKAQKVIDGVQVSFDDDLESLRGYGHA----------NTQS 339
Query: 250 LAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
L L F ++ G + + + G+ L N+ L +E+PF N
Sbjct: 340 LGELLFHFF-RYYGHEVNYEKHVVSVREGKLISKEGKGWHLLQNNRLCVEEPFNTTRN 396
>gi|213405635|ref|XP_002173589.1| Poly(A) RNA polymerase [Schizosaccharomyces japonicus yFS275]
gi|212001636|gb|EEB07296.1| Poly(A) RNA polymerase [Schizosaccharomyces japonicus yFS275]
Length = 683
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 85/159 (53%), Gaps = 12/159 (7%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYR 105
R V PFGS ++ L + D+D+ I+ N + +S + + LR+K Y
Sbjct: 400 RKIKVYPFGSSLTGLMTESSDIDVVIKCKNKNLLSR---------IYPIADHLRRK--YT 448
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+++ VA A VP++KF T + CD+S + L S+ L +QID R + ++++VK W
Sbjct: 449 QVRVVARAHVPLIKFRT-NSGFCCDMSFNGLLAVYNSELLCLYTQIDERVKYLLIMVKFW 507
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK 204
AK ++ + +SY+ +LV+++ Q P +LP L+
Sbjct: 508 AKTRLLHKVQLQALSSYTWCILVIYYCQRRNPPLLPNLQ 546
>gi|74183024|dbj|BAE20473.1| unnamed protein product [Mus musculus]
Length = 1214
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 831 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 888
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 889 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 940
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 941 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1000
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1001 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 1030
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 154 EIKRVMESVFRHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS------QPDVL 205
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 206 LLVQECLKNSDSFIDVDADFHARVPVVVCRDKQSGLLCKVSAGNENAWLTTKHLTALGKL 265
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 266 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSW 321
Query: 212 VDDL 215
+++
Sbjct: 322 IEEF 325
>gi|344277922|ref|XP_003410746.1| PREDICTED: LOW QUALITY PROTEIN: poly(A) RNA polymerase,
mitochondrial-like [Loxodonta africana]
Length = 582
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 140/330 (42%), Gaps = 48/330 (14%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISS-----------------AGKKVKQS 89
TV PFGS V N F + G DLD+ ++L + + + + Q
Sbjct: 226 CTVRPFGSSV-NSFGKLGCDLDMFLDLDETGRLHAQKTSGNFLMEFQVKNVPSERIATQK 284
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L L L G G +Q + +AR P+++F CD+S +N S+ L+
Sbjct: 285 ILSVLGECLDHFGPGCVGVQKILNARCPLVRFSHQASGFQCDLSTNNRIALKSSELLYIY 344
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V ++ WA+A + + G + ++SL+++V+F Q P +LP L
Sbjct: 345 GALDSRVRALVFSIRCWARARSLTSSIPGAWITNFSLTMMVIFFLQRRSPPVLPTL---- 400
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGL 264
D LK + ++ I E N F SD R N +L L F E F
Sbjct: 401 -----DYLKTLAGTEDKCIIE--GHNCT-FISDLNRIKPSGNTETLELLLKEFFEYFGNF 452
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + + I R + P++ PL I++PFE N ++ V++ L K
Sbjct: 453 AFNKNSINI-------RQGREQNK--PDSSPLHIQNPFETSLNISKNVNQSQLEKFVELA 503
Query: 325 EMTHFRLTSTNQTRYALLSSLARPFILQFF 354
+ + L +Q R SS +P+ L F
Sbjct: 504 RESAWVLQQEDQDRP---SSSYQPWGLAFL 530
>gi|332020693|gb|EGI61098.1| Poly(A) RNA polymerase, mitochondrial [Acromyrmex echinatior]
Length = 575
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 112/237 (47%), Gaps = 20/237 (8%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G ++ + ARVPI+KF + CD+S N+ ++ L ++D R R +V+ +
Sbjct: 289 GILNVRRILEARVPIIKFYYNYTQTECDLSATNMTAIYMTELLNLYGEMDWRIRPLVITI 348
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+ WAK ++ + G + ++ L+LLVLF Q ILP LK + + +D++
Sbjct: 349 RAWAKNQELTSDVPGQWITNFPLTLLVLFFLQQ--KKILPSLKILKLYSTDNDVRC---- 402
Query: 222 AERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ 279
AE I C F +I + D KIN+ SL L F E +S + + GIC G
Sbjct: 403 AENGID--CTFLRDINKLPPDYKYKINQDSLETLLYDFFEFYSIFDFQ--KYGICIREGV 458
Query: 280 WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQ 336
IR P+ L+I +P E N ++ VS L +I + + L +T++
Sbjct: 459 --QIRK-----PSRSALYITNPLETTLNVSKNVSLYELNRIISKTHDAIYTLETTDK 508
>gi|60688060|gb|AAH43111.1| Zinc finger, CCHC domain containing 6 [Mus musculus]
Length = 1474
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 995 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 1052
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 1053 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 1104
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1105 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1164
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1165 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 1194
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 85/184 (46%), Gaps = 14/184 (7%)
Query: 34 DLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
+++ V+ESV L ++ +GS S L R D D++I++ + +S + +L
Sbjct: 318 EIKRVMESVFRHKLPDCSLRLYGSSCSRLGFR--DSDVNIDVQFPAVMS------QPDVL 369
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ L+ + + HARVP++ + C +S N + +K L + ++
Sbjct: 370 LLVQECLKNSDSFIDVDADFHARVPVVVCRDKQSGLLCKVSAGNENAWLTTKHLTALGKL 429
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+ R +V+ + WAK I+ P+ G Y +L+ +F Q +LP +Y G+
Sbjct: 430 EPRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLLP----VYLGSW 485
Query: 212 VDDL 215
+++
Sbjct: 486 IEEF 489
>gi|196001319|ref|XP_002110527.1| hypothetical protein TRIADDRAFT_54639 [Trichoplax adhaerens]
gi|190586478|gb|EDV26531.1| hypothetical protein TRIADDRAFT_54639 [Trichoplax adhaerens]
Length = 484
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 136/307 (44%), Gaps = 27/307 (8%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
+ +M++ S L +++V + A++ GS + S D+D ++N +
Sbjct: 177 DQKMQLRSALLHAIKTV--YKDASLHIVGSSTNGFGSEDSDIDFCAVVNNNREFTRRKTL 234
Query: 86 VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
S L L LR R + V VPIL+F+ +CDISI+N G + L
Sbjct: 235 YALSNLRAKLATLRYLKDVRLIPAV----VPILEFQDCVSGFNCDISINNDTGIRNTHLL 290
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD 205
+ S D R +V +K W + IN + GT +SY++ LV+ + Q C P +LP L++
Sbjct: 291 YAYSLCDDRVAPLVKFIKMWGHYYGINKSQYGTLSSYAVVNLVINYLQECDPPVLPFLQE 350
Query: 206 IYPGNLVDDLKGVRANAER-QIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
+P N+ + + +R + ++ N+++ N+ ++ L + F ++
Sbjct: 351 DFP-NIFRKKSSLNSIPKRSKSVDLSGIPQNLSK---------NQKTIGELLIGFYRHYA 400
Query: 263 GLSLKASELGI--CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV-SEKNLAK 319
+ I F W H + H + IE+PFE +N AR++ S K K
Sbjct: 401 VFKWSNYIISIKKGKFPLDWRHKFMTAK----AHYINIEEPFED-KNVARSIRSRKKYEK 455
Query: 320 ISNAFEM 326
I AF M
Sbjct: 456 IKIAFNM 462
>gi|332836702|ref|XP_508491.3| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
isoform 4 [Pan troglodytes]
Length = 912
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 115/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 356 GDLGKAPELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 413
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 414 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 472
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 473 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 520
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N L+ L F S L+ S L + P G WE +R
Sbjct: 521 EPSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 575
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 576 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 604
>gi|67968953|dbj|BAE00833.1| unnamed protein product [Macaca fascicularis]
gi|67971788|dbj|BAE02236.1| unnamed protein product [Macaca fascicularis]
Length = 337
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 102/221 (46%), Gaps = 25/221 (11%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q + +AR P+++F CD++ +N S+ L+ +D R R +V +
Sbjct: 53 GCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALTSSELLYIYGTLDSRVRALVFGI 112
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+ WA+AH + + G + ++SL+++V+F Q P ILP L D LK + A+
Sbjct: 113 RCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---------DSLKTL-AD 162
Query: 222 AERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTG 278
AE + I N F D R N +L L F E F + + + I
Sbjct: 163 AEDKC--IIEGNNCTFVRDLNRIKPSGNTETLELLLKEFFEYFGNFAFNKNSINI----- 215
Query: 279 QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 216 --RQGREQNK--PDSSPLYIQNPFETALNISKNVSQSQLQK 252
>gi|270009992|gb|EFA06440.1| hypothetical protein TcasGA2_TC009322 [Tribolium castaneum]
Length = 577
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 137/319 (42%), Gaps = 46/319 (14%)
Query: 24 DWETRMKVISDLREVVESVESLR-GATVEPFGSFVSNLFSRWGDLDISIELSNGS----- 77
D TR++ ++ R+V +V + A PFGS V+ DLD+ + L +
Sbjct: 188 DLGTRLRFLTA-RQVENAVRGMFPKAKAYPFGSSVNGYGKMGCDLDLVLRLCDDKVKNDA 246
Query: 78 -----CISSAGKKVKQS-----LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNI 127
C G + S +GDLL+ G +++ + ARVPI+K+ ++
Sbjct: 247 RLMFHCKGLVGSERTASQRNMEAIGDLLQLFLP--GCSQVRRILQARVPIIKYYQQLTDV 304
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSL 186
CD+S+ N+ G S FL+ + +D R R +V +++WA + N G + ++SL+L
Sbjct: 305 ECDLSMANMSGVHMSDFLYIMGSLDARIRPLVFTIRKWASEIGLTNSSPGRWITNFSLTL 364
Query: 187 LVLFHFQTCVPA--ILPPLKDIY----PGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
LVL Q + + ILP L + P + G+ R I ++
Sbjct: 365 LVLAFLQKPINSKPILPSLNTLVKLAEPKDSYMTEDGINCTFLRDITKL----------- 413
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
K N+ SL L V F E +S + L + + P + L+I +
Sbjct: 414 KTPTENKESLETLLVEFFEFYSQFDFASKALCLNESVAITK---------PEHCALYIVN 464
Query: 301 PFEQPENSARAVSEKNLAK 319
P E+ N ++ VS + L +
Sbjct: 465 PLERGLNVSKNVSMEELDR 483
>gi|354493334|ref|XP_003508797.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Cricetulus griseus]
Length = 884
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 109/247 (44%), Gaps = 36/247 (14%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G R+Q V AR P++KF + DIS+ N S+FL S
Sbjct: 362 LVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRPSGLHGDISLSNRLALYNSRFLNLCS 419
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
++DGR R +V V+ WA+ + ++ N+Y+L+LLV++ QT P +LPP+ +
Sbjct: 420 EMDGRVRPLVYTVRCWAQHNGLSG-GGPLLNNYALTLLVIYFLQTRDPPVLPPVAQL--- 475
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSL 266
+ + E + E+ ++ + F D R N L+ L F S L
Sbjct: 476 --------TQRSGEGEQVEVDGWDCS-FPKDASRLEPSTNLEPLSSLLAQFFSCVSCWDL 526
Query: 267 KASELGI---CPF-------TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
S L + C + WE +R P+ ++DPF+ N A V+ +
Sbjct: 527 SGSLLSLREGCALLVSGGLPSDLWEGLRLG--------PMNLQDPFDLSHNVAANVTSRV 578
Query: 317 LAKISNA 323
++ N
Sbjct: 579 ARRLQNC 585
>gi|346473397|gb|AEO36543.1| hypothetical protein [Amblyomma maculatum]
Length = 536
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 135/313 (43%), Gaps = 52/313 (16%)
Query: 24 DWETRMKVISDLREVVESVESLRG-ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS- 81
D ETR+ + R+V E + L V PFGS V+ D+D+ + + S
Sbjct: 183 DLETRLGFMV-CRQVEEFIVGLYSEGQVLPFGSLVNGFGRHHCDIDMVYCIPEAAEFSGN 241
Query: 82 ----------AGKKVKQSLL---GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNIS 128
+ + Q LL GDLL + G +Q + ARVPI+KF+
Sbjct: 242 LYFQEKNQAITDRTLVQRLLETLGDLLHYV--VPGVSEVQRILRARVPIVKFQHHIVGRE 299
Query: 129 CDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLL 187
CD+++ N+ G S+ L +Q+ ++ + WA A + + GT+ ++ L+LL
Sbjct: 300 CDLTLSNMSGVHMSRLLHTCTQLAPALCPLLFTARSWAMAQGVTSKVPGTWITNFQLTLL 359
Query: 188 VLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINR 247
+FH Q C +LP L+D+ ++ K ++A + + R
Sbjct: 360 AIFHLQQC--GLLPSLRDL------EEKKRLKA----------------WEKSRSRDGKA 395
Query: 248 SSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
+ L F E ++ + K+ GI P++GQ P ++I++P ++ N
Sbjct: 396 EAFEDLLRGFYEFYASFNFKSK--GIAPYSGQILEK-------PEYTAMYIQNPLDRQLN 446
Query: 308 SARAVSEKNLAKI 320
++R + +L K+
Sbjct: 447 ASRNIGLSDLKKL 459
>gi|255949412|ref|XP_002565473.1| Pc22g15560 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592490|emb|CAP98844.1| Pc22g15560 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1063
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 134/301 (44%), Gaps = 30/301 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ D+ L P E + R +++ L ++ FGS + L S D+DI
Sbjct: 124 IMDLYDRLLPSAESDDRRRQLVRKLEKLFNDQWPGHNIKANIFGSSGNKLCSSDSDVDI- 182
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ LL ++L K G +R+ V+HA+VPI+K ++C
Sbjct: 183 -------CITTNYKELEHVCLLAEVL----AKHGMQRVVCVSHAKVPIVKIWDPELRLAC 231
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + + +K W K +N+ GT +SY+ L+
Sbjct: 232 DMNVNNTLALENTRMIRTYVEVDERVRPLAMAIKHWTKQRILNDAALGGTLSSYTWICLI 291
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKIN 246
+ QT P ILP L+ R + +R + +C+F+ + ++ + N
Sbjct: 292 INFLQTRNPPILPSLQ-------------ARPHKKRMTHDGLVCSFDDDLKTLSQFGRKN 338
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
+ S+ L F ++ G L + I G + + L N+ L +E+PF
Sbjct: 339 KQSVGELLFQFF-RYYGYELDYEKNVISVRDGTLINKEAKGWHLMLNNRLCVEEPFNTSR 397
Query: 307 N 307
N
Sbjct: 398 N 398
>gi|343960290|dbj|BAK63999.1| RNA binding motif protein 21 [Pan troglodytes]
Length = 874
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 115/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKAPELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 376 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 435 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 482
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N L+ L F S L+ S L + P G WE +R
Sbjct: 483 EPSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 537
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 538 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 566
>gi|397516649|ref|XP_003828536.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Pan paniscus]
Length = 912
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 115/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 356 GDLGKAPELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 413
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 414 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 472
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 473 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 520
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N L+ L F S L+ S L + P G WE +R
Sbjct: 521 EPSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 575
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 576 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 604
>gi|302507638|ref|XP_003015780.1| PAP/25A associated domain family protein [Arthroderma benhamiae CBS
112371]
gi|291179348|gb|EFE35135.1| PAP/25A associated domain family protein [Arthroderma benhamiae CBS
112371]
Length = 1179
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/258 (23%), Positives = 110/258 (42%), Gaps = 18/258 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E E R+K + L +++++ V FGS + L + D D
Sbjct: 150 IKELYQKLLPSPESEERRVKFVRKLEKLLDTQWPGNEIKVNVFGSSGNKLCTSDSDADFL 209
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S S+ K+ GG R+ V+HA+VPI+K ++CD
Sbjct: 210 AKSERSSLFYSSSKR-------PFFANSSFTGGMERVVCVSHAKVPIVKIWDPELQVACD 262
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + ++D R R + +LVK W K +N+ GT +SY+ L++
Sbjct: 263 MNVNNTLALENTRMIKTYVELDDRIRPLAMLVKHWTKRRILNDAALGGTLSSYTWICLII 322
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD-----KYRK 244
QT +P I+P L+ V +G + C + F D +
Sbjct: 323 NFLQTRIPPIVPSLQ-----KRVAQSEGSTDGSSITSTTSCNSTYSSFDDDVEKLGGFGD 377
Query: 245 INRSSLAHLFVSFLEKFS 262
N+S+L L F ++
Sbjct: 378 DNKSTLGELLFQFFRYYA 395
>gi|344236659|gb|EGV92762.1| U6 snRNA-specific terminal uridylyltransferase 1 [Cricetulus
griseus]
Length = 861
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 109/247 (44%), Gaps = 36/247 (14%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G R+Q V AR P++KF + DIS+ N S+FL S
Sbjct: 339 LVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRPSGLHGDISLSNRLALYNSRFLNLCS 396
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
++DGR R +V V+ WA+ + ++ N+Y+L+LLV++ QT P +LPP+ +
Sbjct: 397 EMDGRVRPLVYTVRCWAQHNGLSG-GGPLLNNYALTLLVIYFLQTRDPPVLPPVAQL--- 452
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSL 266
+ + E + E+ ++ + F D R N L+ L F S L
Sbjct: 453 --------TQRSGEGEQVEVDGWDCS-FPKDASRLEPSTNLEPLSSLLAQFFSCVSCWDL 503
Query: 267 KASELGI---CPF-------TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
S L + C + WE +R P+ ++DPF+ N A V+ +
Sbjct: 504 SGSLLSLREGCALLVSGGLPSDLWEGLRLG--------PMNLQDPFDLSHNVAANVTSRV 555
Query: 317 LAKISNA 323
++ N
Sbjct: 556 ARRLQNC 562
>gi|47226593|emb|CAG08609.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1066
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 96/205 (46%), Gaps = 5/205 (2%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
+VL + + P + R ++ DL + L A ++ FGS + +
Sbjct: 611 SVLNKVCEQCYTDFAPDELEMGVRELILKDLETFIR--RQLPAARLQLFGSSKNGFGFKQ 668
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
DLDI + L I+ +L+ L R LR+ G + + + A+VPI+KF
Sbjct: 669 SDLDICMVLEGQESINDVDCI---ALIESLARLLRKHSGVKNVLPITTAKVPIVKFYHAQ 725
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N ++ L + ID R + + ++K +AK DI + G+ +SY+
Sbjct: 726 TGLEGDISLYNTLALHNTRLLASYAAIDRRVKVLCYVMKVFAKMCDIGDASRGSLSSYAY 785
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPG 209
+L+ LF Q P ++P L++IY G
Sbjct: 786 TLMALFFLQQRNPPVIPVLQEIYDG 810
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 79/179 (44%), Gaps = 10/179 (5%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R V++ +++++ SV L + +GS + + D++I I+
Sbjct: 78 EDVEKRRSVVAVMQDLLLSV--LPEIRLRLYGSSCTKFGFKDSDVNIDIQFPQHMHQPDV 135
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
VK++L L + ++ HARVP++ + + C +S N +
Sbjct: 136 LLLVKETLSVCPL--------FVDVEADFHARVPVVLCKDKTCGLICKVSAGNENAYQTT 187
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+L +S + +VL ++ WA+ I+ + G Y +L+V++ Q ++LP
Sbjct: 188 AYLAALSSREPLVLALVLGLRRWARLCTIDRAEEGGLPPYVFALMVIYFLQQRKESLLP 246
>gi|157822467|ref|NP_001100829.1| poly(A) RNA polymerase, mitochondrial [Rattus norvegicus]
gi|149032575|gb|EDL87453.1| PAP associated domain containing 1 (predicted) [Rattus norvegicus]
Length = 336
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 101/222 (45%), Gaps = 27/222 (12%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q + +AR P+++F CD++ +N S+ L+ +D R R +V V
Sbjct: 53 GCVGVQKILNARCPLVRFSHQGSGFQCDLTANNSIALKSSELLYIYGSLDSRVRALVFGV 112
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+ WA+AH + + G + ++SL+++V+F Q P ILP L D LK +
Sbjct: 113 RCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTL---------DSLKSMADA 163
Query: 222 AERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASELGICPFT 277
+R I E N F D KI N +L L F E F + + + I
Sbjct: 164 EDRCILE---GNNCTFIQD-INKIKPSGNTETLELLLKEFFEYFGNFAFNKNSINI---- 215
Query: 278 GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
R + P++ PL+I++PFE N ++ VS+ L K
Sbjct: 216 ---RQGREQNK--PDSSPLYIQNPFETSLNISKNVSQNQLQK 252
>gi|242053947|ref|XP_002456119.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
gi|241928094|gb|EES01239.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
Length = 568
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 143/336 (42%), Gaps = 58/336 (17%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D ++P E+ +R + D+ +VV+ + VE FGSF + L+ D+D+
Sbjct: 147 DFCDFISPSTEEQSSRTAAVQDVSDVVKHI--WPQCKVEVFGSFRTGLYLPTSDIDV--- 201
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ + K Q L L +AL QKG +++Q +A ARVPI+KF I+ DIS
Sbjct: 202 -----VVFESRVKTPQVGLYALAKALSQKGVAKKIQVIAKARVPIVKFVERKSGIAFDIS 256
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G + F+ + R + +++K + ++N TG SY+L +++ H
Sbjct: 257 FDMDGGPQAADFIKDAVKKLPALRPLCMILKVFLHQRELNEVYTGGIGSYALLTMLITHL 316
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q ++ G D+ G Y + +L
Sbjct: 317 QL-----------VWGGK---DILG------------------------YHQSKEHNLGI 338
Query: 253 LFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSAR 310
L V F + F G L ++GI + + ++S+ ++ ++ P L I+DP PEN
Sbjct: 339 LLVRFFD-FYGRKLNHWDVGISCNSSRTFFLKSDKDFMNHDRPHLLAIQDPM-VPENDI- 395
Query: 311 AVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLA 346
+ N K+ +AF + LT N LL+SL
Sbjct: 396 GKNSFNYFKVKSAFSKAYSMLTDAN-----LLTSLG 426
>gi|291409555|ref|XP_002721091.1| PREDICTED: terminal uridylyl transferase 1, U6 snRNA-specific
[Oryctolagus cuniculus]
Length = 911
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 121/276 (43%), Gaps = 44/276 (15%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQS----LLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
GDL ++EL+ S G+K + + L+G +LR G R+Q V AR P++KF
Sbjct: 357 GDLGKALELAE----SLKGEKTEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKF 410
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ D+S+ N S+FL S++D R R +V ++ WA+ ++ N
Sbjct: 411 CHRPSGLHGDVSLSNRLALHNSRFLSLCSELDERVRPLVYTIRCWAQGRGLSGSGP-LLN 469
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
+Y+L+LLV++ QT P +LP + + +A E Q+ E+ ++ + F D
Sbjct: 470 NYALTLLVIYFLQTRDPPVLPTVSQLT----------QKAGEEEQV-EVDGWDCS-FPRD 517
Query: 241 KYR---KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNT 287
R N + L F S L+ S L + P G WE +R
Sbjct: 518 ASRLEPSANVEPVGSLLAQFFSCVSSWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG- 576
Query: 288 RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 577 -------PMNLQDPFDLSHNVAANVTSRVAGRLQNC 605
>gi|403300965|ref|XP_003941182.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403300967|ref|XP_003941183.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 1493
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1023 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1074
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1075 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1134
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1135 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1194
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 1195 KGEKKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1247
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1248 DFREHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1295
Query: 318 AKISNAF 324
I AF
Sbjct: 1296 NFIMKAF 1302
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFVDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
++ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTEHLTALGKLEPKLIPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|149634744|ref|XP_001507658.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like
[Ornithorhynchus anatinus]
Length = 579
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 141/323 (43%), Gaps = 51/323 (15%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDISIELSNGSCIS- 80
E+ + R V S + ++ + T++ FGS V N F + G D+D+ ++L N IS
Sbjct: 202 ENTQLRYLVCSFIEDIAAAY--FPSCTIKLFGSSV-NTFGKLGCDVDMFLDLDNLGKIST 258
Query: 81 ----------------SAGKKVKQSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETI 123
S+ + Q +L + L G G +Q + +A P+++F
Sbjct: 259 KKAADPYFMEFQMKNVSSERVATQKILSVIGECLDNFGPGCVGVQRILNANCPLVRFSHQ 318
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGT-FNSY 182
CD++ +N S+ L+ +D R R +V V+ WA H + + G+ ++
Sbjct: 319 PSGFQCDLTANNRIALKSSELLYLYGTLDPRVRALVFSVRCWAHVHALTSSIPGSWLTNF 378
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQI---AEICAF--NIARF 237
SL+++VLF Q P ++P L + LK + A+AE + C F N+ +
Sbjct: 379 SLTMMVLFFLQKRSPPVIPTL---------NHLKTL-ADAEDKCIMQGHDCTFVSNLNKI 428
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF 297
+ N SL L F E F S + + I + + P++ PL+
Sbjct: 429 EPSE----NTESLDVLLSQFFEYFGNFSFNKNSISI---------RKGKEQNKPDSSPLY 475
Query: 298 IEDPFEQPENSARAVSEKNLAKI 320
I++PFEQ N ++ V++ L +
Sbjct: 476 IQNPFEQTLNISKNVNQSQLQRF 498
>gi|330796515|ref|XP_003286312.1| hypothetical protein DICPUDRAFT_150267 [Dictyostelium purpureum]
gi|325083739|gb|EGC37184.1| hypothetical protein DICPUDRAFT_150267 [Dictyostelium purpureum]
Length = 2271
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 110/234 (47%), Gaps = 31/234 (13%)
Query: 35 LREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDL 94
L++++E + A ++ +GSF+ L + GDLD+ L ++L +
Sbjct: 1969 LKKLLE--KEFTTADIQLYGSFLYGLSLKGGDLDVCFTLKQMG---------DRALFLQV 2017
Query: 95 LRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR 154
L Y+ + A +PI++F ++ D+ ++ G KS + S +D R
Sbjct: 2018 KDFLNNSKKYKIIDLRLSATIPIIRFLELNTGTQFDMCFNHEIGIYKSNLIKEYSDLDPR 2077
Query: 155 FRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPL------KDIY 207
++++LLVK WA+ DIN+ GTF+S+ L L+V+ Q + P ILP L KD
Sbjct: 2078 CKELILLVKYWAQQKDINDASKGTFSSFCLVLMVIHFLQYGIYPPILPNLEAGSNKKDHL 2137
Query: 208 PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
N++ + VR + I +FN K+N+S+ A LF F + +
Sbjct: 2138 KENIIIEDHHVRYINSKLI----SFN---------PKLNKSTTAQLFYQFFKYY 2178
>gi|426368830|ref|XP_004051405.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Gorilla gorilla gorilla]
Length = 912
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/272 (25%), Positives = 115/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 356 GDLGKAPELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 413
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 414 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 472
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 473 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 520
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N L+ L F S L+ S L + P G WE +R
Sbjct: 521 EPSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG----- 575
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 576 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 604
>gi|241826838|ref|XP_002416632.1| poly(A) polymerase, putative [Ixodes scapularis]
gi|215511096|gb|EEC20549.1| poly(A) polymerase, putative [Ixodes scapularis]
Length = 483
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 87/334 (26%), Positives = 143/334 (42%), Gaps = 84/334 (25%)
Query: 22 REDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
+E + +M + + + +++ + L G V GS V+ L S D+DI + S
Sbjct: 196 KEMYSQKMALRNKVYRILQRIFPLCGLYV--VGSSVNGLGSNSSDMDILVTKSE------ 247
Query: 82 AGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIK 141
+ +A+VPILKF + D++I+N G
Sbjct: 248 ----------------------------IIYAKVPILKFSDRGSGVEIDLNINNSVGIRN 279
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAIL 200
++ L S++D R + L VK WA+ H IN K T +SYSL L+++ + Q C P +L
Sbjct: 280 TQLLNCYSRLDWRVAPLALAVKAWAEHHGINQAKFMTLSSYSLVLMLIHYLQCGCRPVVL 339
Query: 201 PPLKDIYPG----------NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSL 250
P L+ + P NL ++L +++ E+ + E L
Sbjct: 340 PCLQKMLPKKFQEEDVRSLNLYEELPAFKSHNEQSLGE---------------------L 378
Query: 251 AHLFVSFLEK---FSGLSLKASELGIC-PFTGQWEHIRS--NTR--WLPNNHPLFIEDPF 302
H F+ + + FS S + LG C P H RS NTR W + IE+PF
Sbjct: 379 LHGFLCYYARQFSFSD-SCISVRLGDCIPRAAAMAH-RSPKNTRQQW----KFICIEEPF 432
Query: 303 EQPENSARAVSE-KNLAKISNAFEMTHFRLTSTN 335
++ N+AR+V + I N F+ + RL +T+
Sbjct: 433 DR-TNTARSVYDYAAFQLIMNTFKTSLARLEATH 465
>gi|388583188|gb|EIM23490.1| hypothetical protein WALSEDRAFT_59229 [Wallemia sebi CBS 633.66]
Length = 934
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 109/446 (24%), Positives = 188/446 (42%), Gaps = 59/446 (13%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIEL 73
IL +L P E++ + + L ++++ V GA + FGS + R D+D+ L
Sbjct: 17 ILPLL-PTAEEYAIKEQTRLLLEKLIDRVSP--GARLIAFGSMANGFALRNSDMDLQCIL 73
Query: 74 SNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE-----TIHQNIS 128
S S+ + +++G+L+R + + ++ + AR+PI+K + I+
Sbjct: 74 DPASEPLSSSELT--TIVGELIR---HETNFH-VKPLPKARIPIIKLTLAPTPALPYGIA 127
Query: 129 CDISIDNLCGQIKSKFLFWISQIDG-RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
CDI ++ L + ID R R +VL +K W+K IN+ GT +SY +LL
Sbjct: 128 CDIGFGGQLALENTRLLLGYASIDPPRLRTLVLFIKVWSKRRKINSAYRGTLSSYGFTLL 187
Query: 188 VLFHF-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKIN 246
V+F P +LP L+ I P V +A + I F+ ++ N
Sbjct: 188 VIFFLAHVKRPPVLPNLQRIPPLRPVSP-----ESASYEGRNIYFFDDVALLRQEWSSAN 242
Query: 247 RSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQ-WEH---IRSNTRWLPNNH 294
S+ L F ++ +S++ +E GI + W + + + + +
Sbjct: 243 TQSVGELLWEFFRFYAKDFNYTHDVISIR-TEGGILSKDAKGWVQDLEVDGASEFARDRN 301
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFF 354
L IEDPF+ N AR V+ L I F L + +T +L S + +
Sbjct: 302 RLCIEDPFDTTYNVARTVTADGLYTIRGEFMRASRMLQTVGKTD--ILPSQVLVDLCEER 359
Query: 355 GESPVRYANYNNGHRRARPQSHKSVNSPLQ--------------AQHQSHNAKKENRPNR 400
E+ V A ++ G+R RP ++ S+ L Q SHN +
Sbjct: 360 EEALVP-APFSAGNRTPRPLTNPSLGPTLGPSLHPPPSLQPPPIVQTHSHNTHM----TQ 414
Query: 401 SMSQQSVQQHQSQ----PVRQINGQV 422
+ SQQS QSQ P R+++ Q+
Sbjct: 415 TASQQSYSPTQSQSSIFPSRKVSDQI 440
>gi|10438584|dbj|BAB15282.1| unnamed protein product [Homo sapiens]
Length = 535
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 108/247 (43%), Gaps = 36/247 (14%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G R+Q V AR P++KF + D+S+ N S+FL S
Sbjct: 4 LVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRPSGLHGDVSLSNRLALHNSRFLSLCS 61
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
++DGR R +V ++ WA+ ++ ++Y+L+LLV++ QT P +LP + +
Sbjct: 62 ELDGRVRPLVYTLRCWAQGRGLSGSGP-LLSNYALTLLVIYFLQTRDPPVLPTVSQL--- 117
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSL 266
+ E + E+ ++ + F D R IN L+ L F S L
Sbjct: 118 --------TQKAGEGEQVEVDGWDCS-FPRDASRLEPSINVEPLSSLLAQFFSCVSCWDL 168
Query: 267 KASELGI-----CPFTGQ-----WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
+ S L + P G WE +R PL ++DPF+ N A V+ +
Sbjct: 169 RGSLLSLREGQALPVAGGLPSNLWEGLRLG--------PLNLQDPFDLSHNVAANVTSRV 220
Query: 317 LAKISNA 323
++ N
Sbjct: 221 AGRLQNC 227
>gi|308454363|ref|XP_003089817.1| hypothetical protein CRE_20119 [Caenorhabditis remanei]
gi|308268217|gb|EFP12170.1| hypothetical protein CRE_20119 [Caenorhabditis remanei]
Length = 301
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/219 (33%), Positives = 106/219 (48%), Gaps = 18/219 (8%)
Query: 110 VAHARVPILKFETIHQNISCDISI--DNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWA 166
+ A++ ILK +T+ I DIS+ D+ + + FL + ID RF + +VKEWA
Sbjct: 31 LVQAQIQILKLKTV-DGIEFDISVVMDSFLSSMHNSFLIKQMVLIDHRFGPLCAVVKEWA 89
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAER 224
+ + NPK G FNSY+L LLV+ HF C P +LP L+ +Y K A +E+
Sbjct: 90 ASTKVKNPKDGGFNSYALVLLVI-HFLQCGTFPPVLPNLQFLYRD------KNFIAMSEK 142
Query: 225 QIAEICAFNIAR-FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGIC-PFTGQWEH 282
F F K +K N + +A LF+ FL +S + + I T E
Sbjct: 143 DFPVRLDFGAPLPFPLPKIQK-NEAPIARLFLEFLNYYSEFNFDKFYISIKHGKTKIRER 201
Query: 283 IRSNTRWLPNNHPLFIEDPFEQPENSARAV-SEKNLAKI 320
S T N ++IEDPF+ N R V S KN+ KI
Sbjct: 202 SASETVQNENRKQVYIEDPFDS-HNPGRTVRSLKNIQKI 239
>gi|390457912|ref|XP_002742930.2| PREDICTED: terminal uridylyltransferase 7 [Callithrix jacchus]
Length = 1536
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1020 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1071
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1072 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1131
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1132 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPILQEIY 1191
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 1192 KGEKKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1244
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1245 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1292
Query: 318 AKISNAF 324
I AF
Sbjct: 1293 NFIMKAF 1299
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFVDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
++ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTEHLTALGKLEPKLIPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMTIFFLQQRKEPVL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|403373157|gb|EJY86493.1| hypothetical protein OXYTRI_13606 [Oxytricha trifallax]
Length = 1023
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 16/207 (7%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVES------LRGAT-VEPFGSFVSN 59
+E I+K I + RED R +VI++++ ++ L G + FGS +
Sbjct: 669 VEGIIKQIYKEQSVSREDLAIRDRVINNIKNAFKNTNDREYPALLTGQLRISGFGSCQNG 728
Query: 60 LFS-RWGDLDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPI 117
L++ D+D++ CI S + Q LL +++ L FV +RVPI
Sbjct: 729 LWNVEKSDIDVT-------CIISEKIEFNQHQLLRACTTIIKKVAKQGTLIFVPASRVPI 781
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
LKF+ + D +++N+ G S +F QID RF + L +K W+K +I G
Sbjct: 782 LKFQENQTGLEVDFNVNNILGIHNSDLIFTYCQIDQRFHILSLFLKYWSKKVEIIGAAYG 841
Query: 178 TFNSYSLSLLVLFHFQTCVPAILPPLK 204
+SY+L+L+++ Q+ P +LP L+
Sbjct: 842 LLSSYALTLMLIAFLQSTSPPVLPCLQ 868
>gi|402897789|ref|XP_003911927.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 7 [Papio
anubis]
Length = 1536
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 133/309 (43%), Gaps = 48/309 (15%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1020 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1071
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1072 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1131
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1132 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1191
Query: 208 PGNLVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
G ++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 1192 KGEKKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTE 1242
Query: 264 --------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1243 EFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRK 1290
Query: 316 NLAKISNAF 324
I AF
Sbjct: 1291 MTNFIMKAF 1299
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQDCLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
Length = 573
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 98/191 (51%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R + IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 175 IKDFVAYISPSREEIEVRNQTISMIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 232
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S G K ++ L L L++K ++ VA ARVPI+KF + I D
Sbjct: 233 I-------TSELGGKESRNNLFSLASHLKKKNLATEIEVVAKARVPIIKFVEPNSGIHID 285
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W++ G R++VL+VK++ + +NN TG +S+ LV
Sbjct: 286 VSFERTNGLEAAKLIREWLNDTPG-LRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLV- 343
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 344 FSFLHMHPRII 354
>gi|297684701|ref|XP_002819963.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Pongo abelii]
gi|297684703|ref|XP_002819964.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Pongo abelii]
Length = 1494
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 133/309 (43%), Gaps = 48/309 (15%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PGNLVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
G ++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 1196 KGEKKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTE 1246
Query: 264 --------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1247 EFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRK 1294
Query: 316 NLAKISNAF 324
I AF
Sbjct: 1295 MTNFIMKAF 1303
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|392580130|gb|EIW73257.1| hypothetical protein TREMEDRAFT_22292, partial [Tremella
mesenterica DSM 1558]
Length = 303
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 89/176 (50%), Gaps = 8/176 (4%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P RE++E R+ +I + V+ ATV PFGS+ + L+ GD+D+ +
Sbjct: 32 MSPTREEYEVRLLIIESITRAVKY--KWPEATVTPFGSWQTQLYLPQGDIDLVV------ 83
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
+ + K++LL DL R +R + ++ ARVPI+KF T H ++ DIS++ +
Sbjct: 84 THPTLTEHNKKNLLNDLARTMRYAMITDNVVVISKARVPIIKFVTKHGKLNVDISLNQVN 143
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G K + + R ++L+VK + +N TG SYS+ LV+ Q
Sbjct: 144 GISAGKIINQYLDVIPGARQLILVVKAFLSQRSMNEVYTGGLGSYSVICLVISFLQ 199
>gi|156086386|ref|XP_001610602.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797855|gb|EDO07034.1| conserved hypothetical protein [Babesia bovis]
Length = 481
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 144/312 (46%), Gaps = 56/312 (17%)
Query: 28 RMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRWGDLDISIELSNGSCISS 81
++K+IS+ ++V+ VES + P FGS ++ L++ DLDI +++ N + S+
Sbjct: 151 QLKIISE--QIVKLVESQLKERLNPKCSVCVFGSAINGLWTDASDLDICVQIPNVTSRSA 208
Query: 82 AGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN------------ISC 129
+ +++ + LL L G+ +F A ++P+L ++ N S
Sbjct: 209 TIRNLRR--IAFLLEPLAPARGFEN-RFTA--KIPLLHWKNERPNKRAGHPIIQALKTSI 263
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
DIS++N+ S + D R R+++L +K WA+A D+N+ GT S++LS++ +
Sbjct: 264 DISVNNVLAISNSALIGTYVACDHRVRNLILAIKLWARARDLNDRSKGTLGSFALSIMAI 323
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR------ 243
Q C P IL ++D+ A A+ +I + RF++D R
Sbjct: 324 HFLQRCNPPILVSIQDL-------------AIADNEIPRYVSGIDVRFTTDLNRINEELQ 370
Query: 244 -----KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRS-------NTRWLP 291
+ N+S++ L F F K ++ IC + ++++ + T +
Sbjct: 371 WLTKGERNKSNVIQLLQEFFYYFGWTFTKNTQTPICIRSVDFQYMDTQLTFPHRTTGFDM 430
Query: 292 NNHPLFIEDPFE 303
+ + +++PFE
Sbjct: 431 DEKFMHVDNPFE 442
>gi|355753446|gb|EHH57492.1| Terminal uridylyltransferase 7 [Macaca fascicularis]
Length = 1490
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 133/309 (43%), Gaps = 48/309 (15%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1020 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1071
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1072 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1131
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1132 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1191
Query: 208 PGNLVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
G ++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 1192 KGEKKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTE 1242
Query: 264 --------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1243 EFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRK 1290
Query: 316 NLAKISNAF 324
I AF
Sbjct: 1291 MTNFIMKAF 1299
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQDCLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|196008079|ref|XP_002113905.1| hypothetical protein TRIADDRAFT_57804 [Trichoplax adhaerens]
gi|190582924|gb|EDV22995.1| hypothetical protein TRIADDRAFT_57804 [Trichoplax adhaerens]
Length = 1063
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 92/174 (52%), Gaps = 11/174 (6%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG-GYRR 106
AT+ FGS + ++ D+D+ + + + S K Q + + + LR+K + +
Sbjct: 487 ATLHLFGSSKNGFGTKQSDVDMCMMIPDDSLNCLDEKLRGQEAIRRIAKQLRKKSRDFAK 546
Query: 107 LQFVAHARVPILKFE----------TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFR 156
+Q ++ A VPI+KF ++++ +SCDIS N + L +D R
Sbjct: 547 VQDISRATVPIVKFYDVRRYVNPTCSLNRKLSCDISYQNALAVHNTNLLASYGSLDDRIP 606
Query: 157 DMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN 210
+VL++K AKA DI + G+ +SY+ +L++++ Q C P +LP L++++ G+
Sbjct: 607 ILVLVLKLIAKACDIGDASRGSLSSYAHTLMMIYFLQHCDPPVLPVLQELHDGD 660
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 99/234 (42%), Gaps = 21/234 (8%)
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+ID R R + +++++WA+ I+ P G + +L ++++++ Q C P +LP L +
Sbjct: 220 KIDSRTRQLGVVLRKWARVCGIDRPNEGGLHPGALIIMLIYYLQRCTPPVLPVLHE---- 275
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKAS 269
L D + N E FN D ++ N S+ L++ F F L S
Sbjct: 276 -LASDDQSKNFNYEIDGVPFIYFNDIDTLDDIWQSDNEKSIGQLWLGFF-TFYSLDYGIS 333
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEM--T 327
+C T + R +W N+ IE PF N + V+ + + + FE
Sbjct: 334 RNVVC-ITSKSTITRRMRKWEANS--FAIESPFGNRHNCGKTVTTRQVIECPKQFEWLSD 390
Query: 328 HFRLTSTNQTRYALLSSLARPFILQFFGESPVRYANYNNGHRRAR-PQSHKSVN 380
++T T Q +S E P+R + + HR+ P+S ++N
Sbjct: 391 SSKITDTFQLEKIFVSGTH---------EMPLRCPSCDGNHRKDNCPESISTLN 435
>gi|109112036|ref|XP_001083489.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Macaca mulatta]
gi|109112038|ref|XP_001083813.1| PREDICTED: terminal uridylyltransferase 7 isoform 3 [Macaca mulatta]
Length = 1490
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 133/309 (43%), Gaps = 48/309 (15%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1020 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1071
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1072 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1131
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1132 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1191
Query: 208 PGNLVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
G ++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 1192 KGEKKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTE 1242
Query: 264 --------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1243 EFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRK 1290
Query: 316 NLAKISNAF 324
I AF
Sbjct: 1291 MTNFIMKAF 1299
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQDCLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|380816560|gb|AFE80154.1| terminal uridylyltransferase 7 isoform 1 [Macaca mulatta]
gi|383421619|gb|AFH34023.1| terminal uridylyltransferase 7 isoform 1 [Macaca mulatta]
Length = 1490
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 133/309 (43%), Gaps = 48/309 (15%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1020 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1071
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1072 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1131
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1132 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1191
Query: 208 PGNLVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
G ++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 1192 KGEKKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTE 1242
Query: 264 --------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1243 EFDFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRK 1290
Query: 316 NLAKISNAF 324
I AF
Sbjct: 1291 MTNFIMKAF 1299
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQDCLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|410926597|ref|XP_003976764.1| PREDICTED: terminal uridylyltransferase 4-like, partial [Takifugu
rubripes]
Length = 1518
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 94/195 (48%), Gaps = 4/195 (2%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L+P + + R ++++ L + E A + FGS + R DLDI + L
Sbjct: 765 LSPTHVEQQKREQILASLERFIRK-EYNEKAQLCLFGSSKNGFGFRDSDLDICMTLEGHE 823
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
K ++ L + L++ G R + + A+VPI+KFE + DIS+ N
Sbjct: 824 TAEMLNCK---EIIEGLAKVLKKHTGLRNILPITTAKVPIVKFEHKQSGLEGDISLYNTL 880
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
Q ++ L + +D R + + +K +AK DI + G+ +SY+ L+VL+ Q P
Sbjct: 881 AQHNTRMLATYAALDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYILMVLYFLQQRQP 940
Query: 198 AILPPLKDIYPGNLV 212
++P L++I+ G V
Sbjct: 941 PVIPVLQEIFDGTTV 955
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/194 (19%), Positives = 87/194 (44%), Gaps = 14/194 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D+ R V++ + E++ L ++ +GS ++ + D++I + +
Sbjct: 253 DDFGARKAVVTTMEEIIR--RHLPACSLRLYGSTLTQFAFKTSDINIDV--------THP 302
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L +L L+ + ++ HA+VP + + + C ++ N + +
Sbjct: 303 SSMTQPEVLIQVLEILKNNSDFSEVESDFHAKVPAVFCRDVSSGLLCKVTAGNDVACLTT 362
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L +++++ R +VL + WA+ I+ G SYS +L+V+F Q +LP
Sbjct: 363 NHLAALAKLEPRLVPLVLAFRHWARLCHIDCQAEGGIPSYSFALMVIFFLQQRKEPVLP- 421
Query: 203 LKDIYPGNLVDDLK 216
+Y G+ ++ +
Sbjct: 422 ---VYLGHWIEGFE 432
>gi|417406539|gb|JAA49923.1| Putative s-m checkpoint control protein cid1 [Desmodus rotundus]
Length = 1496
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLDI C++ G + + L
Sbjct: 1026 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDI--------CMTINGLETAEGLDC 1077
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1078 VRTIEELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLAA 1137
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1138 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRSPPVIPVLQEIY 1197
Query: 208 PG 209
G
Sbjct: 1198 KG 1199
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/194 (21%), Positives = 87/194 (44%), Gaps = 18/194 (9%)
Query: 22 REDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCI 79
+E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 310 KENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AI 362
Query: 80 SSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
S + +L + L+ + + HARVP++ + C +S N
Sbjct: 363 MS-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVMCREKQSGLFCKVSAGNENAC 417
Query: 140 IKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAI 199
+ + L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +
Sbjct: 418 LTTNHLTALGKLESKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMTVFFLQQRKEPL 477
Query: 200 LPPLKDIYPGNLVD 213
LP +Y G+ ++
Sbjct: 478 LP----VYLGSWIE 487
>gi|397470231|ref|XP_003806732.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Pan paniscus]
gi|397470235|ref|XP_003806734.1| PREDICTED: terminal uridylyltransferase 7 isoform 3 [Pan paniscus]
Length = 1494
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PG 209
G
Sbjct: 1196 KG 1197
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 88/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I++P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDHPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|426219879|ref|XP_004004145.1| PREDICTED: terminal uridylyltransferase 7 [Ovis aries]
Length = 1498
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 1002 NILDQVCIQCYKDFSPTVAEDQAREHIRQNLENFIK--QEFPGTKLSLFGSSKNGFGFKQ 1059
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R L++ G R + + A+VPI+K
Sbjct: 1060 SDLDV--------CMTINGLETAEGLDCVRTIEELARVLKKHSGLRNILPITTAKVPIVK 1111
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L + ID R + + +K + K DI + G+
Sbjct: 1112 FFHLRSGLEVDISLYNTLALHNTRLLSAYAAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1171
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1172 SSYAYTLMVLYFLQQRTPPVIPVLQEIYKG 1201
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDINIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNNDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +++ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLESKLVPLIIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ +D
Sbjct: 479 P----VYLGSWID 487
>gi|57999471|emb|CAI45944.1| hypothetical protein [Homo sapiens]
Length = 1494
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PG 209
G
Sbjct: 1196 KG 1197
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|340724519|ref|XP_003400629.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Bombus
terrestris]
Length = 558
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 106/226 (46%), Gaps = 24/226 (10%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G ++ + ARVPI++F ++ N+ CD+S N S+ L+ Q+D R + ++ +
Sbjct: 290 GISNIKKILEARVPIIRFSNVYTNMICDLSSTNTVALHMSELLYIYGQLDWRIKPLIFTI 349
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
++WA+ ++ G + ++SL+LL++F+ QT ILP + + K + +
Sbjct: 350 RKWARDMNLTKIFPGQWITNFSLTLLIIFYLQT--KNILPSISTLN--------KFIELD 399
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
+ + A FN + +N SL L F E +S K IC G+++
Sbjct: 400 KKTKKATNSNFNWFVSWQGSIKHVNDESLLSLLYHFFEYYSTFDFKTQ--AICIKDGKFK 457
Query: 282 HIRSNTRWLPNNH--PLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
P N PL+I +PF+ N ++ V+ L ++ + F+
Sbjct: 458 ---------PKNDFSPLYIHNPFDTTLNVSKNVNSTELIRLIDHFQ 494
>gi|307191764|gb|EFN75206.1| U6 snRNA-specific terminal uridylyltransferase 1 [Harpegnathos
saltator]
Length = 720
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 87/333 (26%), Positives = 145/333 (43%), Gaps = 43/333 (12%)
Query: 12 KDILGMLNPLR---EDWETRMKVIS-DLREVVESVESLRGATVEPFGSFVSNLFSRWGDL 67
+++ +LN ++ + TR KVI L ++ + + T+ FGS V+ L + DL
Sbjct: 151 RELAALLNEIQLSDAELMTRYKVICPHLTDIFKL--TFPECTIFSFGSTVAGLSFKECDL 208
Query: 68 DISIELSNGSCIS-----SAGKKVKQSLLGDLLRALRQKGGYRRLQFVA--HARVPILKF 120
DI + L S + +++ +++ +R + + +A A+ PI+KF
Sbjct: 209 DIYMYLGKIGLPSFFNQPNLSQQIITTVIFKRVRKIMYSMKFIFADIIAIPKAKTPIIKF 268
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ N+SCDIS N G KS FL + + D R + ++LL+K WAK I G +
Sbjct: 269 RYLPTNVSCDISFKNGLGVYKSNFLRYCTLRDVRLKPLMLLIKYWAKHLGITG--GGRIS 326
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
+Y L LV+F+FQ +LPPL ++ + + G + N + N S
Sbjct: 327 NYGLVCLVIFYFQQV--DLLPPLLELQRNCMPLIINGWQVNFDE--------NTPLPPSS 376
Query: 241 KYRKINRSSLAHLFVSFLEKF----------SGLSLKASELGICPFTGQWEH------IR 284
R I L H FVSF +F G AS+ + H
Sbjct: 377 NTRSI--PQLFHDFVSFYAEFIFSSRVLCLLDGKIYAASDFVNFFKLPDYMHRYKSYVTM 434
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+NT+ L + ++DPFE +N+ E+ L
Sbjct: 435 NNTKKLDIERAMCVQDPFELDQNTTAITPERVL 467
>gi|281352129|gb|EFB27713.1| hypothetical protein PANDA_008350 [Ailuropoda melanoleuca]
Length = 532
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 100/421 (23%), Positives = 177/421 (42%), Gaps = 55/421 (13%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDI 69
L +L ED + R S + ++ + + TV PFGS V N F + G DLD+
Sbjct: 141 LNTLLKEFQLTEEDIKLRYLTCSLIEDLAAAY--FQDCTVRPFGSSV-NSFGKLGCDLDM 197
Query: 70 SIELS-----NGSCISS------------AGKKVKQSLLGDLLRALRQKG-GYRRLQFVA 111
++L N S S + + Q +L + L G +Q +
Sbjct: 198 FLDLDEIGKFNTSKTSGNFLMEFQVKNVPSERIATQKILSVIGECLDHFGPSCVGVQKIL 257
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+AR P+++F CD++ +N S+ L+ +D R R +V ++ WA+AH +
Sbjct: 258 NARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYIYGALDSRVRALVFSIRCWARAHSL 317
Query: 172 NNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE-- 228
+ G++ ++SL+++V+F Q P ILP L D LK + ++ I E
Sbjct: 318 TSSIPGSWITNFSLTMMVIFFLQRRSPPILPTL---------DYLKTLADAEDKCIIEGH 368
Query: 229 ICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN 286
C F ++ R N +L L F E F + + + I R
Sbjct: 369 NCTFIRDLNRIKPSG----NTETLESLLKEFFEYFGNFAFNKNSINI-------RQGREQ 417
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLA 346
+ P PL I++PFE N ++ V++ L K + + + L+ ++ R + S
Sbjct: 418 NK--PECSPLHIQNPFETSLNISKNVNQSQLQKFVDLARESAWILSQEDKDRPS--PSSN 473
Query: 347 RPFILQFFGESPVRYANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKKENRPN--RSMSQ 404
+P+ L + V + +R +P S + N L +SH+ + + N R++S
Sbjct: 474 QPWGLAMLLQPSVVSSVSLAKKKRKKPASERIKN--LLESIKSHSPENDTNTNGKRTVST 531
Query: 405 Q 405
Q
Sbjct: 532 Q 532
>gi|73946401|ref|XP_533505.2| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Canis lupus
familiaris]
Length = 1495
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 78/335 (23%), Positives = 142/335 (42%), Gaps = 44/335 (13%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L + + G + FGS + +
Sbjct: 999 NILDQVCVQCYKDFSPTISEDQAREHIRQNLESFIR--QEFPGTKLSLFGSSKNGFGFKQ 1056
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R L++ G R + + A+VPI+K
Sbjct: 1057 SDLDV--------CMTINGLETAEGLDCVRTIEELARVLKKHSGLRNILPITTAKVPIVK 1108
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 1109 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 1168
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL--KGVRANAERQIAEICAFNIARF 237
+SY+ +L+VL+ Q P ++P L++IY G ++ G QI E+ +
Sbjct: 1169 SSYAYTLMVLYFLQQRNPPVIPVLQEIYKGEKKPEIFVDGWNIYFFDQIDELPTY----- 1223
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRW 289
+Y K N S+ L++ L ++ +S++ L + F QW
Sbjct: 1224 -WPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL-LTTFKKQW--------- 1271
Query: 290 LPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + IEDPF+ N +S K I AF
Sbjct: 1272 --TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1304
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 88/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSESFIDVDADFHARVPVVVCKEKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I++P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLESKLVPLVIAFRYWAKLCSIDHPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|58331272|ref|NP_078893.2| terminal uridylyltransferase 7 isoform 1 [Homo sapiens]
gi|297307111|ref|NP_001171988.1| terminal uridylyltransferase 7 isoform 1 [Homo sapiens]
gi|67462100|sp|Q5VYS8.1|TUT7_HUMAN RecName: Full=Terminal uridylyltransferase 7; Short=TUTase 7;
AltName: Full=Zinc finger CCHC domain-containing protein
6
Length = 1495
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1025 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1076
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1077 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1136
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1137 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1196
Query: 208 PG 209
G
Sbjct: 1197 KG 1198
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|302148910|pdb|3NYB|A Chain A, Structure And Function Of The Polymerase Core Of Tramp, A
Rna Surveillance Complex
Length = 323
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 96/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ E R + IS +RE V+ + A + FGS+ ++L+ D+D
Sbjct: 25 IKDFVAYISPSREEIEIRNQTISTIREAVKQL--WPDADLHVFGSYSTDLYLPGSDIDCV 82
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L L++K ++ VA ARVPI+KF H I
Sbjct: 83 VT-------SELGGKESRNNLYSLASHLKKKNLATEVEVVAKARVPIIKFVEPHSGIHIA 135
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +NN TG +S+ LV
Sbjct: 136 VSFERTNGIEAAKLIREWLDDTPG-LRELVLIVKQFLHARRLNNVHTGGLGGFSIICLV- 193
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 194 FSFLHMHPRII 204
>gi|350425037|ref|XP_003493993.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Bombus
impatiens]
Length = 507
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 105/226 (46%), Gaps = 33/226 (14%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G ++ + A+VPI+KF ++ N+ CD+S NL S+ L+ Q+D R + +V +
Sbjct: 259 GISDVKKILGAQVPIIKFYNVYTNMKCDLSSTNLIALHMSELLYTYGQLDWRIKPLVYTI 318
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
++WA+ ++ + G + ++SL+LL++F+ Q +KDI P V+ +K A+
Sbjct: 319 RKWARVMNLTKEQPGHWITNFSLTLLIIFYLQ---------VKDILPS--VNTIKCFVAD 367
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
FN ++ N L +L +F E +S K IC G
Sbjct: 368 PN--------FNWFESWKKSIKRTNNEDLHNLLFNFFEYYSIFDFKTQ--AICIRDG--- 414
Query: 282 HIRSNTRWLPNNH--PLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
R+ P N PL+I +PF N ++ V+ L ++ + +
Sbjct: 415 ------RYKPKNDFSPLYIYNPFNTTLNVSKNVTSCELIRLVDCLQ 454
>gi|114625363|ref|XP_001138296.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Pan troglodytes]
gi|114625365|ref|XP_001138539.1| PREDICTED: terminal uridylyltransferase 7 isoform 4 [Pan troglodytes]
gi|410261322|gb|JAA18627.1| zinc finger, CCHC domain containing 6 [Pan troglodytes]
gi|410354383|gb|JAA43795.1| zinc finger, CCHC domain containing 6 [Pan troglodytes]
Length = 1494
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PG 209
G
Sbjct: 1196 KG 1197
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 88/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I++P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDHPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|426362154|ref|XP_004048245.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Gorilla gorilla
gorilla]
gi|426362156|ref|XP_004048246.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Gorilla gorilla
gorilla]
Length = 1494
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PG 209
G
Sbjct: 1196 KG 1197
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+K L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTKHLTALGKLEPKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|118600935|gb|AAH23438.1| Zcchc6 protein [Mus musculus]
Length = 536
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 99/210 (47%), Gaps = 15/210 (7%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
N+L+ + +P + + R + +L ++ + G + FGS + +
Sbjct: 145 NILDQVCVQCYKDFSPTIVEDQAREHIRQNLESFIK--QDFPGTKLSLFGSSKNGFGFKQ 202
Query: 65 GDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQKGGYRRLQFVAHARVPILK 119
DLD+ C++ G + + L + +L R LR+ G R + + A+VPI+K
Sbjct: 203 SDLDV--------CMTINGHETAEGLDCVRTIEELARVLRKHSGLRNILPITTAKVPIVK 254
Query: 120 FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
F + + DIS+ N ++ L S ID R + + +K + K DI + G+
Sbjct: 255 FFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTMKVFTKMCDIGDASRGSL 314
Query: 180 NSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 315 SSYAYTLMVLYFLQQRSPPVIPVLQEIYKG 344
>gi|328867853|gb|EGG16234.1| hypothetical protein DFA_09264 [Dictyostelium fasciculatum]
Length = 918
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 150/333 (45%), Gaps = 39/333 (11%)
Query: 51 EPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK--VKQSLLGDLLRALRQKGGYRRLQ 108
E +GSFV+ + D+D+ + ++ + G+K +K+ L + ++ K Y +
Sbjct: 612 ESYGSFVNGIQLESSDIDVCFK-TDFNTSDPVGRKDLMKRIALCLNKKKVKGKPKYHVER 670
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ +VPI+KF + +S D+ +N S + S+ID R + ++LL+K WA
Sbjct: 671 ILDSIKVPIIKFRDLKHKVSYDMCFNNRLAIGNSLLVKAYSEIDERAKQLMLLIKYWASR 730
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK---DIYPGNLV---DDLKGVRANA 222
IN+ GT +SY +V+F+ QT P +LP L D +P + + DD K +
Sbjct: 731 KYINDASEGTLSSYGWLNMVIFYLQTVQPPVLPSLHSNIDSFPDDQLQQKDDWKFIDPRH 790
Query: 223 ERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEH 282
I++ N+ +L LF F +S A++L IC G+ +
Sbjct: 791 TGFISQ-----------------NKMTLFQLFYGFFNFYSKFDY-ANQL-ICIRLGKPTN 831
Query: 283 IRSNTRWLPNNH---PLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTST--NQT 337
I T+ +N+ P+ I+DPF+ N +V + +++F + F S N
Sbjct: 832 ITLATQSYKDNNKECPISIQDPFDSSSNPGASVKD------TSSFGIIIFEFMSMQLNLF 885
Query: 338 RYALLSSLARPFILQFFGESPVRYANYNNGHRR 370
+ + + + + F F +S ++ N +R
Sbjct: 886 KLSYKNDIIQDFDGLLFAKSKLKLNELYNKMKR 918
>gi|268567526|ref|XP_002640018.1| C. briggsae CBR-PUP-3 protein [Caenorhabditis briggsae]
Length = 468
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 128/280 (45%), Gaps = 35/280 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS + + DLD SI++ + + S +K+K+ + D L+++ Y+
Sbjct: 104 GSLAAGVDIHTSDLDFSIKIPSMT-QGSTFQKLKE--ISDRLKSV----SYKIKDEPVFY 156
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
+VP+L+ + I + D++IDN + ++ L W ++ID RF + VK WA +
Sbjct: 157 KVPVLQMKHIKTGVIIDVTIDNDTSKRNTQLLRWYAKIDKRFPLLCKAVKAWASKVGVEG 216
Query: 174 PKTGTFNSYSLSLLVLFHFQTCV-PAILPPL--------KDIYPGNLVDDLKGVRANAER 224
G NS+S+ L+++ + Q V PA+LP + K+ G+ +D R E+
Sbjct: 217 SSKGRLNSFSICLMLINYLQAGVTPAVLPSIQRFSRNFNKNFAVGDKYNDFDW-REKIEK 275
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---CPFTGQWE 281
+F D N+SSLA L++ FL+ +S K + + + +W+
Sbjct: 276 D---------GKFVLDA----NKSSLAALYLGFLKYYSEFDFKKNWISVKRGIVMEKRWD 322
Query: 282 HIRSNTRWLPNNH-PLFIEDPF-EQPENSARAVSEKNLAK 319
+ LP + + +EDPF P N A V + N K
Sbjct: 323 EQENRLGGLPKDSLYIVVEDPFLTVPRNCAGTVRQSNTMK 362
>gi|328871485|gb|EGG19855.1| hypothetical protein DFA_06958 [Dictyostelium fasciculatum]
Length = 1406
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 65/268 (24%), Positives = 120/268 (44%), Gaps = 18/268 (6%)
Query: 51 EPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG--YRRLQ 108
E +GSFV+ + D+D+ + S + + +S+ LL +G Y+ ++
Sbjct: 1096 EAYGSFVNGIQLESSDIDVCFKTSFDTSDPVRRVDLMKSVARCLLAKRDDQGNRDYQLVR 1155
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ +VPI+KF + +S D+ +N S + ++ID R + ++LLVK WA
Sbjct: 1156 LLDSIKVPIIKFTDLKHRVSYDMCFNNRLAIGNSLLVKSYAEIDERAKQLMLLVKYWASR 1215
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL-KDIYPGNLVDDLKGVRANAERQIA 227
DIN+ GT +SY+ +V+F+ QT P +LP L ++Y +++ + +
Sbjct: 1216 KDINDASGGTLSSYAWLNMVIFYLQTVQPPVLPSLHSNVYS----------KSDGQLVQS 1265
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR-SN 286
++ + + N +L LF F + K + IC G+ R ++
Sbjct: 1266 KVDGWKFVDHRHTGFVSQNNKTLFQLFYGFFNFYCKFDFK--DQLICIRLGKPTSNRMAS 1323
Query: 287 TRWLPNNH--PLFIEDPFEQPENSARAV 312
++ N + IEDPF N +V
Sbjct: 1324 QSYMEQNDQSKICIEDPFNTSSNPGSSV 1351
>gi|261857460|dbj|BAI45252.1| zinc finger, CCHC domain containing 6 [synthetic construct]
Length = 1031
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 561 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 612
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 613 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 672
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 673 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 732
Query: 208 PG 209
G
Sbjct: 733 KG 734
>gi|301768567|ref|XP_002919704.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Ailuropoda
melanoleuca]
Length = 584
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 100/421 (23%), Positives = 178/421 (42%), Gaps = 55/421 (13%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDI 69
L +L ED + R S + ++ + + TV PFGS V N F + G DLD+
Sbjct: 192 LNTLLKEFQLTEEDIKLRYLTCSLIEDLAAAY--FQDCTVRPFGSSV-NSFGKLGCDLDM 248
Query: 70 SIELS-----NGSCISS------------AGKKVKQSLLGDLLRALRQKG-GYRRLQFVA 111
++L N S S + + Q +L + L G +Q +
Sbjct: 249 FLDLDEIGKFNTSKTSGNFLMEFQVKNVPSERIATQKILSVIGECLDHFGPSCVGVQKIL 308
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+AR P+++F CD++ +N S+ L+ +D R R +V ++ WA+AH +
Sbjct: 309 NARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYIYGALDSRVRALVFSIRCWARAHSL 368
Query: 172 NNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI- 229
+ G++ ++SL+++V+F Q P ILP L D LK + ++ I E
Sbjct: 369 TSSIPGSWITNFSLTMMVIFFLQRRSPPILPTL---------DYLKTLADAEDKCIIEGH 419
Query: 230 -CAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN 286
C F ++ R N +L L F E F + + + I R
Sbjct: 420 NCTFIRDLNRIKPSG----NTETLESLLKEFFEYFGNFAFNKNSINI-------RQGREQ 468
Query: 287 TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLA 346
+ P PL I++PFE N ++ V++ L K + + + L+ ++ R + S+
Sbjct: 469 NK--PECSPLHIQNPFETSLNISKNVNQSQLQKFVDLARESAWILSQEDKDRPSPSSN-- 524
Query: 347 RPFILQFFGESPVRYANYNNGHRRARPQSHKSVNSPLQAQHQSHNAKKENRPN--RSMSQ 404
+P+ L + V + +R +P S + N L +SH+ + + N R++S
Sbjct: 525 QPWGLAMLLQPSVVSSVSLAKKKRKKPASERIKN--LLESIKSHSPENDTNTNGKRTVST 582
Query: 405 Q 405
Q
Sbjct: 583 Q 583
>gi|429903875|ref|NP_001258876.1| terminal uridylyltransferase 7 [Gallus gallus]
Length = 1538
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 125/305 (40%), Gaps = 66/305 (21%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + DL + L+++
Sbjct: 1037 GTKLNLFGSSKNGFGFKQSDLDI--------CMTMDGLETAEGLDCIRIIEDLAKVLKKQ 1088
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L + ID R + +
Sbjct: 1089 SGLRNVLPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTRLLSSYAAIDPRVKYLCYT 1148
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLK 216
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P LVD
Sbjct: 1149 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKEPKKPEILVD--- 1205
Query: 217 GVRANAERQIAEICAFNIARFSSDKYRKI---------NRSSLAHLFVSFLEKFSGLSLK 267
+N+ F DK ++ N S+ L++ L +F
Sbjct: 1206 --------------GWNVYFF--DKIEELPAVWPDSGKNTESVGQLWLGLL-RFYTEEFD 1248
Query: 268 ASELGIC--------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
E IC F QW + + IEDPF+ N +S K
Sbjct: 1249 FKEHVICIRRKNLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNF 1297
Query: 320 ISNAF 324
I AF
Sbjct: 1298 IMKAF 1302
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 35/190 (18%), Positives = 83/190 (43%), Gaps = 14/190 (7%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D + R+K+ + + +++ + L ++ +GS +S + D++I I+
Sbjct: 303 DLQERLKIRTIMEDLLH--QKLPECSLRLYGSSLSGFGFKTSDINIDIQF--------PA 352
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
+ +L + +L+ + + H R+P++ + C +S N + +
Sbjct: 353 SMSQPDVLLLVQESLQNSESFIGVDADFHTRIPVVVCREKQSGLICKVSAGNENAYLTTN 412
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL 203
L I +++ +V+ + WAK ++ P+ G + Y +L+V+F Q LP
Sbjct: 413 HLATIGKLEPTVTSLVIAFRYWAKLCCVDRPEEGGLSPYVFALMVIFFLQQRKEPFLP-- 470
Query: 204 KDIYPGNLVD 213
+Y G+ ++
Sbjct: 471 --VYLGSWIE 478
>gi|328866781|gb|EGG15164.1| Regulator of nonsense transcripts 1 like protein [Dictyostelium
fasciculatum]
Length = 1358
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 68/268 (25%), Positives = 121/268 (45%), Gaps = 18/268 (6%)
Query: 51 EPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG--YRRLQ 108
E +GSFV+ + D+D+ + S + + +S+ LL +G Y+ ++
Sbjct: 1048 EAYGSFVNGIQLESSDIDVCFKTSFDTSDPVRRVDLMKSVARCLLAKRDDQGNRDYQLVR 1107
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ +VPI+KF + +S D+ +N S + ++ID R + ++LLVK WA
Sbjct: 1108 LLDSIKVPIIKFTDLKHRVSYDMCFNNRLAIGNSLLVKSYAEIDERAKQLMLLVKYWASR 1167
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL-KDIYPGNLVDDLKGVRANAERQIA 227
DIN+ GT +SY+ +V+F+ QT P +LP L ++Y + D + V++ +R
Sbjct: 1168 KDINDASGGTLSSYAWLNMVIFYLQTVQPPVLPSLHSNVYSKS---DGQLVQSKVDR--- 1221
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR-SN 286
+ + N +L LF F + K + IC G+ R ++
Sbjct: 1222 ----WKFVDHRHTGFVSQNNKTLFQLFYGFFNFYCKFDFK--DQLICIRLGKPTSNRMAS 1275
Query: 287 TRWLPNNH--PLFIEDPFEQPENSARAV 312
++ N + IEDPF N +V
Sbjct: 1276 QSYMEQNDQSKICIEDPFNTSSNPGSSV 1303
>gi|307207584|gb|EFN85249.1| Poly(A) RNA polymerase gld-2-like protein A [Harpegnathos saltator]
Length = 346
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 126/265 (47%), Gaps = 38/265 (14%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L LR L++ L+ + A+VPI+ F QN++ DI+ ++ + + L+ S+
Sbjct: 73 LNQALRCLQRYKSAENLEII-QAKVPIINFHDSRQNLNIDINCNSSVAILNTHLLYCYSR 131
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLK----- 204
ID R + +VL+VK WA+ H IN+ + T +SYSL+L+V+ Q + P ILP L+
Sbjct: 132 IDWRVKPLVLIVKLWAQFHKINSARNNTLSSYSLTLMVISFLQCGINPPILPNLQNHTSQ 191
Query: 205 -------DIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSF 257
DI P +++D+ ++ + I S Y+ N SL L F
Sbjct: 192 FRSFYHEDIQP--IIEDIH------KKDLGPI------YIGSSLYQSRNTQSLGELLHEF 237
Query: 258 LEKFSGLSLKASELGI-CPFTGQWEHIRSNTRWLPNNH-----PLFIEDPFEQPENSARA 311
+ + + + I + + E R + PNN+ + IE+PF++ N+A+A
Sbjct: 238 FKYYISFEFEHHAVSIEAGYKIKKETCRLAS--YPNNNRGHWKYIGIEEPFDRT-NTAKA 294
Query: 312 V-SEKNLAKISNAFEMTHFRLTSTN 335
V EK +I + ++ +L N
Sbjct: 295 VFDEKIFYRIQSVIGQSYKQLAGNN 319
>gi|119583118|gb|EAW62714.1| zinc finger, CCHC domain containing 6, isoform CRA_a [Homo sapiens]
Length = 1133
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 663 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 714
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 715 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 774
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 775 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 834
Query: 208 PG 209
G
Sbjct: 835 KG 836
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/102 (24%), Positives = 49/102 (48%), Gaps = 4/102 (3%)
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
HARVP++ + C +S N + +K L + +++ + +V+ + WAK I
Sbjct: 28 HARVPVVVCREKQSGLLCKVSAGNENACLTTKHLTALGKLEPKLVPLVIAFRYWAKLCSI 87
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
+ P+ G Y +L+ +F Q +LP +Y G+ ++
Sbjct: 88 DRPEEGGLPPYVFALMAIFFLQQRKEPLLP----VYLGSWIE 125
>gi|12697967|dbj|BAB21802.1| KIAA1711 protein [Homo sapiens]
Length = 1090
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 620 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 671
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 672 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 731
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 732 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 791
Query: 208 PG 209
G
Sbjct: 792 KG 793
>gi|449513843|ref|XP_002192253.2| PREDICTED: terminal uridylyltransferase 7 [Taeniopygia guttata]
Length = 1531
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 125/305 (40%), Gaps = 66/305 (21%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G ++ FGS + + DLDI C++ G + + L + DL + L+++
Sbjct: 1032 GTKLDLFGSSKNGFGFKQSDLDI--------CMTIDGLETAEGLDCIRIIEDLAKVLKKQ 1083
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L + ID R + +
Sbjct: 1084 SGLRNVLPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTRLLSSYAAIDPRVKYLCYT 1143
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLK 216
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P LVD
Sbjct: 1144 MKVFTKICDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKEPKKPEILVD--- 1200
Query: 217 GVRANAERQIAEICAFNIARFSSDKYRKI---------NRSSLAHLFVSFLEKFSGLSLK 267
+N+ F DK ++ N S L++ L +F
Sbjct: 1201 --------------GWNVYFF--DKIEELPVVWPDYGKNTESAGQLWLGLL-RFYTEEFD 1243
Query: 268 ASELGIC--------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
E IC F QW + + IEDPF+ N +S K
Sbjct: 1244 FKEHVICIRRKNLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNF 1292
Query: 320 ISNAF 324
I AF
Sbjct: 1293 IMKAF 1297
Score = 46.6 bits (109), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 38/190 (20%), Positives = 80/190 (42%), Gaps = 14/190 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R+ + + + ++ + L ++ +GS S + DL+I +
Sbjct: 301 EDLEERLSIKTMMESLLR--QKLPECSLRLYGSSYSRFGFKTSDLNIDTQF--------P 350
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + +L+ + + HARVP++ + C +S N + +
Sbjct: 351 ANMAQPDVLLLVQESLQNSESFTEVDADFHARVPVVVCREKKSGLICKVSAGNENACLTA 410
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ +V+ + WAK ++ P+ G + Y +L+V+F Q LP
Sbjct: 411 NHLATLGKLEPTIVPLVIAFRYWAKLCCVDRPEEGGLSPYVFALMVIFFLQQRKEPFLP- 469
Query: 203 LKDIYPGNLV 212
+Y G+ V
Sbjct: 470 ---VYLGSWV 476
>gi|410978241|ref|XP_003995504.1| PREDICTED: terminal uridylyltransferase 7 [Felis catus]
Length = 1492
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1022 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1073
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1074 VRTIEELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1133
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1134 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1193
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 1194 KGEKKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1246
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1247 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1294
Query: 318 AKISNAF 324
I AF
Sbjct: 1295 NFIMKAF 1301
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L R D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFRNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSESFVDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I++P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLESKLVPLVIAFRYWAKLCSIDHPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|156837261|ref|XP_001642660.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
70294]
gi|156113216|gb|EDO14802.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
70294]
Length = 524
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D + ++P R++ E R + I LR+ V+ A + FGS+ ++L+ D+D
Sbjct: 119 IRDFVSYISPNRKEIELRNQTIGKLRDAVQ--HHWPDANLHVFGSYATDLYLPGSDIDCV 176
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S AG K ++ L L L+++G ++ +A ARVPI+KF I D
Sbjct: 177 VN-------SKAGDKQSRNCLYSLASHLKKEGLAEDIEIIAKARVPIIKFVEPLSKIHVD 229
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ +G R++VL+VK++ +A +N TG +S+ LV
Sbjct: 230 VSFERTNGLEAAKLIRGWLDSTNG-LRELVLIVKQFLQARRLNKVHTGGLGGFSIICLV- 287
Query: 190 FHFQTCVPAIL 200
+ F P IL
Sbjct: 288 YSFLHLHPRIL 298
>gi|146089481|ref|XP_001470395.1| DNA polymerase sigma-like protein [Leishmania infantum JPCM5]
gi|134070428|emb|CAM68768.1| DNA polymerase sigma-like protein [Leishmania infantum JPCM5]
Length = 599
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 94/394 (23%), Positives = 156/394 (39%), Gaps = 88/394 (22%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL---------- 60
L ++L L+P ED ET+++V D+R ++ G ++ +GS + L
Sbjct: 248 LIELLYCLSPTSEDRETKLRVFDDIRTTMQRA----GMDIQIYGSLCTGLVIPASDVDCV 303
Query: 61 FSRWGDLDISIELS-NGSC----ISSA--GKKVKQSLLGDLLRA-------LRQKGGYRR 106
R GD I+ +S N SC I+SA G +SL G L A +R+ +
Sbjct: 304 LMRSGDEQIASAMSANLSCAMLTIASAATGSVPPKSLKGPLSTAVRIVAERMRKSQSFIH 363
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR--FRDMVLLVKE 164
+ +AHA+VPI+K ++ D+S + G + S +L + G R +++LVK
Sbjct: 364 VTSIAHAKVPIVKCRHRRDDVKVDLSFEQ-SGCVSSNYLCELLCAPGNEMARPLIVLVKA 422
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAER 224
++ P G S+ +SLLVL++ Q CV
Sbjct: 423 LVNNCGLDEPSMGGLGSFPISLLVLWYLQQCV---------------------------- 454
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR 284
RFS++ R S+ L FL K+ G LGI ++++
Sbjct: 455 ---------RTRFSAELQR-----SIGALLAGFL-KYYGTEFDFRRLGI-------DYVQ 492
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVS--EKNLAKISNAFEMTHFRLTSTNQTRYALL 342
T P L+I +P N A+A + + + T L N + +
Sbjct: 493 QKTFTKPPADELYIVNPIRPETNCAKAATLFATRVMPLFQRASATFVGLLDANASPATME 552
Query: 343 SSLARPFILQFFGESPVRYANYNNGHRRARPQSH 376
S L L +F ++ N+ + RRA + H
Sbjct: 553 SQL-----LHYFAKATSDVRNWRDVSRRAAREPH 581
>gi|66827407|ref|XP_647058.1| hypothetical protein DDB_G0268914 [Dictyostelium discoideum AX4]
gi|60475110|gb|EAL73046.1| hypothetical protein DDB_G0268914 [Dictyostelium discoideum AX4]
Length = 4540
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 65/292 (22%), Positives = 129/292 (44%), Gaps = 32/292 (10%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGSF+S L D+D++ + IS+ +KQ + + K Y ++
Sbjct: 4256 FGSFLSGLSLGESDIDVNFTTTQKEDIST----IKQ------VSSFLHKKNYELIETRLE 4305
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI++F N+ D+ ++ GQ S + + +D R + +++L+K WA +N
Sbjct: 4306 ARVPIIRFIDTDVNVRFDMCFNSFMGQHNSLLIKDYTMVDSRVKPLIILIKWWASTKCLN 4365
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN--AERQIAEIC 230
+ +F+SY L L++ Q+ P +LP L++ P + + ++AN E + +
Sbjct: 4366 DASQESFSSYCLINLIIHFLQSLSPPVLPNLQEPSPFHFDETKIKLKANCRVENNVVKYY 4425
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWL 290
+ F+S NR ++ L F + + C F + I +
Sbjct: 4426 DWTSLDFTSAD----NRLNIGQLLFKFFQYY------------CTFNYNEDVISISRGIY 4469
Query: 291 PNNH----PLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTR 338
P + L++ DPF + +N A +++ ++ + F + +L + + R
Sbjct: 4470 PRDDYCKGVLYVADPFIEGKNIAASLTPESFSSALVEFALMEHQLKNITEER 4521
>gi|396479778|ref|XP_003840837.1| similar to Poly(A) RNA polymerase cid13 [Leptosphaeria maculans
JN3]
gi|312217410|emb|CBX97358.1| similar to Poly(A) RNA polymerase cid13 [Leptosphaeria maculans
JN3]
Length = 1342
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 134/312 (42%), Gaps = 55/312 (17%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ + P + D + R K + + ++E V FGS + L++ D+DI
Sbjct: 279 MRELYDRIQPTKHDEDIRDKFVKKVERILELEFPGAEMKVLVFGSSGNMLWTAESDVDI- 337
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI + K++++ + L AL K G R+ + A+V I+K +I+CD
Sbjct: 338 -------CIQTPMKRLEE--VHPLAEAL-DKHGMERVVCIPAAKVRIVKVWDPELHIACD 387
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLVL 189
I+++N+ ++ + Q+D R R + +++K W K +N+ GT +SY+ L+L
Sbjct: 388 INVNNVAAIENTRMIKTYIQLDERVRPLAMIIKHWTKRRILNDAGIGGTISSYTWICLIL 447
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD--KYRKI-- 245
QT P ILP L ++ P +D G +++ F+ D K R
Sbjct: 448 NFLQTQDPPILPVLHEL-PHRQIDKSTG-------------QPSLSSFADDVEKLRGFGA 493
Query: 246 -NRSSLAHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWL 290
N+ SL +L F + G +K E G P GQ E +
Sbjct: 494 KNKQSLGNLLFHFFRAYGHEVDYEKEAISVRQGKRIKREEKGWHPGGGQKEGVNR----- 548
Query: 291 PNNHPLFIEDPF 302
L +E+PF
Sbjct: 549 -----LCVEEPF 555
>gi|389593094|ref|XP_001684056.2| DNA polymerase sigma-like protein [Leishmania major strain
Friedlin]
gi|321399773|emb|CAJ04754.2| DNA polymerase sigma-like protein [Leishmania major strain
Friedlin]
Length = 479
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 90/394 (22%), Positives = 154/394 (39%), Gaps = 88/394 (22%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
L ++L L+P ED ET+++VI D+R ++ G ++ +GS + L D+D
Sbjct: 128 LIELLYCLSPTSEDRETKLRVIDDIRTTMQRA----GMDIQIYGSLCTGLVIPASDVDCV 183
Query: 71 IELSNGSCISSA-----------------GKKVKQSLLGDLLRA-------LRQKGGYRR 106
+ LS+ I+SA G +SL L A +R+ +
Sbjct: 184 LMLSSDEHIASAMSESLSCAMLTIASAAAGSVPPKSLKRPLSTAVRIVAERMRKSQSFTH 243
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR--FRDMVLLVKE 164
+ +AHA+VPI+K ++ D+S + G + S +L + G R +++LVK
Sbjct: 244 VTSIAHAKVPIVKCRHRRDDVKVDLSFEQ-SGCVSSNYLCKLLCEPGNEMARPLIVLVKA 302
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAER 224
+ ++ P G S+ +SLLVL++ Q CV
Sbjct: 303 LMNSCGLDEPSMGGLGSFPISLLVLWYLQQCVR--------------------------- 335
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR 284
RFS++ R S+ L FL K+ G LGI ++++
Sbjct: 336 ----------TRFSAELQR-----SIGALLAGFL-KYYGTEFDFRRLGI-------DYVQ 372
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVS--EKNLAKISNAFEMTHFRLTSTNQTRYALL 342
T P L+I +P N A+A + + + T L N + +
Sbjct: 373 QKTFTKPPADDLYIVNPIRPETNCAKAATLFATRVVPLFQRASATFVGLLDANASPSTME 432
Query: 343 SSLARPFILQFFGESPVRYANYNNGHRRARPQSH 376
S L L +F ++ N+ + RRA + H
Sbjct: 433 SQL-----LHYFAKATSDVRNWRDVSRRAAREPH 461
>gi|16550803|dbj|BAB71052.1| unnamed protein product [Homo sapiens]
Length = 671
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 314 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 365
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 366 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 425
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 426 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 485
Query: 208 PG 209
G
Sbjct: 486 KG 487
>gi|149755241|ref|XP_001495972.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Equus caballus]
Length = 1501
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 135/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1031 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1082
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1083 VRTIEELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1142
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1143 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1202
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ ++ +Y K N S+ L++ L ++
Sbjct: 1203 RGEKKPEIFVDGWNIYFFDQIDELPSY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1255
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1256 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1303
Query: 318 AKISNAF 324
I AF
Sbjct: 1304 NFIMKAF 1310
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ H + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFLDVDADFHARVPVVVCREKHSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ R +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLESRLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|335296330|ref|XP_003130685.2| PREDICTED: terminal uridylyltransferase 7 [Sus scrofa]
Length = 1497
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 127/293 (43%), Gaps = 42/293 (14%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLD+ C++ G + + L + +L R L++
Sbjct: 1042 GTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDCVRTIEELARVLKKH 1093
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L S ID R + +
Sbjct: 1094 SGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYT 1153
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL--KGVR 219
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G ++ G
Sbjct: 1154 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGEKKPEIFVDGWN 1213
Query: 220 ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASEL 271
QI E+ + +Y K N S+ L++ L ++ +S++ L
Sbjct: 1214 IYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL 1266
Query: 272 GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ F QW + + IEDPF+ N +S K I AF
Sbjct: 1267 -LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1307
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 310 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKTSDVNIDIQFP---AIM 362
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 363 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 417
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 418 TTNHLTALGKLESKLVPLVIAFRHWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 477
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 478 P----VYLGSWIE 486
>gi|410075647|ref|XP_003955406.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
gi|372461988|emb|CCF56271.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
Length = 630
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 93/179 (51%), Gaps = 11/179 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ + R K +S L ++ + S V FGS+ ++L+ D+D
Sbjct: 173 IKDFVAYISPSREEIKLRNKAVSKLGRAIKELWSDSELLV--FGSYATDLYLPGSDIDCV 230
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S++G K +S L +L R L++K ++ +A ARVPI+KF + D
Sbjct: 231 VN-------SASGNKEHRSYLYELARFLKKKNLATSIEVIARARVPIIKFIEPESGVHID 283
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV 188
IS + G +K + W+ G R++VL+VK++ A +N+ TG +S+ LV
Sbjct: 284 ISFERTNGVEAAKLIREWLDMTPG-LRELVLIVKQFLTARRLNDVHTGGLGGFSIICLV 341
>gi|312380089|gb|EFR26181.1| hypothetical protein AND_07916 [Anopheles darlingi]
Length = 637
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 137/296 (46%), Gaps = 45/296 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELS----------------NGSCISSAGKKVKQSL 90
A PFGS V N + R G DLD+ ++L + + +V++ L
Sbjct: 230 AVAHPFGSSV-NGYGRMGCDLDVILDLDCRSGEPPDRDARLVYHTKATNPNERTQVQRQL 288
Query: 91 --LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+GD+L+ G ++ + ARVPI+K+ H ++ D++++N G S+ L+
Sbjct: 289 ESIGDVLQLFLP--GVNSVRRILKARVPIVKYHHEHLDLEIDLTMNNKAGVYMSELLYLF 346
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPL-KDI 206
++D R R + ++ WA+A + N G + ++SL++LV++ Q ILP + K I
Sbjct: 347 GELDHRVRPLTFAIRRWAQAVGLTNQAPGYWITNFSLTMLVMYFLQQLQAPILPSINKLI 406
Query: 207 YPGNLVDDLKGV-----RANAERQIAE-ICAFNIARFSSDKYRKINRSSLA---HLFVSF 257
+ GV R + + AE +C+F + +R N+S+L H F +F
Sbjct: 407 QLSAAAKESNGVVPPLARLGGDGEDAEWVCSFLKNPSIYECFRSTNQSTLEELLHQFFTF 466
Query: 258 LEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
+F + +A L I + T P++ P++I +P E N ++ V+
Sbjct: 467 YAQFD-FNQRAISLNI-----------AGTILKPDHCPMYIVNPLETVLNVSKNVN 510
>gi|301758438|ref|XP_002915061.1| PREDICTED: terminal uridylyltransferase 7-like [Ailuropoda
melanoleuca]
Length = 1541
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 1196 KGEKKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1248
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1249 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1296
Query: 318 AKISNAF 324
I AF
Sbjct: 1297 NFIMKAF 1303
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLENKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|281337739|gb|EFB13323.1| hypothetical protein PANDA_003017 [Ailuropoda melanoleuca]
Length = 1473
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 134/307 (43%), Gaps = 44/307 (14%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 1024 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 1075
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 1076 VRTIEELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1135
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 1136 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 1195
Query: 208 PGNLVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
G ++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 1196 KGEKKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEF 1248
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1249 DFKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMT 1296
Query: 318 AKISNAF 324
I AF
Sbjct: 1297 NFIMKAF 1303
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDVNIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNSDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLENKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAVFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|317419221|emb|CBN81258.1| U6 snRNA-specific terminal uridylyltransferase 1 [Dicentrarchus
labrax]
Length = 801
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 137/339 (40%), Gaps = 59/339 (17%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
+ R ++ L+EV VE + + PFGS V+ DLD+ ++L N + K
Sbjct: 178 KARGLLVQLLQEVF--VEFFPDSQILPFGSSVNTFGIHSCDLDLFLDLENTKVFQARAKS 235
Query: 86 VKQ--------------SLLGD-------------LLRAL--RQKGGYRRLQFVAHARVP 116
+ S+L D L+ A+ R ++ V AR+P
Sbjct: 236 TAEQTGEGTSDDGHSEDSILSDIDLSTASPAEVLDLVAAILRRCVPSVHKVHVVGSARLP 295
Query: 117 ILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
++KF N+ DI+I+N ++FL S ++ R R +V ++ WAK + +
Sbjct: 296 VVKFHHRELNLQGDITINNRLAVRNTRFLQICSGMEDRLRPLVYTIRYWAKQKQLAGDPS 355
Query: 177 GT---FNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICA 231
G N+Y+L+LL++F Q C P +LP VD LK + E + E C
Sbjct: 356 GAGPLLNNYALTLLIIFFLQNCEPPVLP---------TVDKLKDMACEEEECVIEGWDCT 406
Query: 232 FNIARFSSDKYR-KINRSSLAHLFVSFLEKFSGLS--LKASELGICPFTG---------- 278
F + + K + +L F SF KF S + + + P T
Sbjct: 407 FPSQPIAVPPSKNKQDLCTLLAGFFSFYAKFDFASGVISVRDGRVLPITDFLSQNKKEEA 466
Query: 279 -QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
Q E P PL + DPFE N A ++E++
Sbjct: 467 MQEEKPTKAHHRGPKLGPLNLLDPFELSHNVAGNLNERS 505
>gi|84996071|ref|XP_952757.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65303754|emb|CAI76131.1| hypothetical protein, conserved [Theileria annulata]
Length = 475
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 156/349 (44%), Gaps = 61/349 (17%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D ++R + IS+ E + + R +V FGS ++ L++ DLD+ +++ N + S+
Sbjct: 142 DLKSRNERISEFLEKILREKVNRKCSVSFFGSAINGLWTDGSDLDVCVQIPNVNSRSATI 201
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE--------TIHQNI-------- 127
+ +++ + ++L L R Q A++PIL ++ T++ ++
Sbjct: 202 RNLRR--ISNVLTPL---SPSRIFQNRFTAKIPILHWKRDYIKTPNTLYDSLNTQEKMYF 256
Query: 128 ------SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181
S DIS++N I S + + R RD+VL +K WA+ +INN GT +S
Sbjct: 257 ECDDIPSIDISVNNDLAIINSILIGNYVSFEPRVRDLVLFLKLWARNRNINNRSEGTLSS 316
Query: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAE-RQIAEICAFNIARFSSD 240
+++SL+++ Q C P +LP L+D+ N E + I+ + RFS+D
Sbjct: 317 FAISLMLIHFLQNCDPPLLPSLQDL----------AFSTNEEPKYISGVD----CRFSTD 362
Query: 241 KYRKI------------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
+ KI N S L F + F +L A I + +
Sbjct: 363 -FNKIKSELNYITKSKRNNSDNKTLLTQFFKYFGWYNLYAQNKPILIRSVDLSEFNTENS 421
Query: 289 WLPNNHP-LFIEDPFEQPENSAR-AVSEKNLAKISNAFEMTHFRLTSTN 335
+ N P L +++PFE + A A+ ++ KI+N F + L S N
Sbjct: 422 II--NEPYLHVDNPFEVGVDVANIAIHQR--TKITNEFRKAYHSLKSGN 466
>gi|321462173|gb|EFX73198.1| hypothetical protein DAPPUDRAFT_325455 [Daphnia pulex]
Length = 619
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 104/208 (50%), Gaps = 17/208 (8%)
Query: 14 ILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL-FSRWGDLDISIE 72
++ ++ E+ + +IS L E + S+E G + FGS V+ L F DLDI +E
Sbjct: 181 LVKIVETTEEEKSRKSHIISSLEEWL-SLE-FPGCCLHLFGSSVTGLAFRNDSDLDIFLE 238
Query: 73 L-SNGSCISSAGKKVKQSLLGD------LLRALRQKGGYRR-------LQFVAHARVPIL 118
+ +N + A + L D +L+ LR+ R L V++AR+P+
Sbjct: 239 IPANDEGHAEADASLSNDELTDEKKREYMLKTLRRASNIIRSHPDITDLVVVSNARIPVS 298
Query: 119 KFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGT 178
KF + CD++ +N+ SK L+ + +D R R + +K WAK+H + + T
Sbjct: 299 KFVYSPIGVKCDLTCNNIIAVQNSKLLYSLQSLDVRIRPYLYALKFWAKSHRLISSPEST 358
Query: 179 FNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+SY+L+L+ +F+ Q P ++P ++ +
Sbjct: 359 LSSYALTLMAVFYLQQTDPPLVPSIESL 386
>gi|158296263|ref|XP_316693.3| AGAP006659-PA [Anopheles gambiae str. PEST]
gi|157016427|gb|EAA11489.4| AGAP006659-PA [Anopheles gambiae str. PEST]
Length = 703
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/344 (23%), Positives = 142/344 (41%), Gaps = 50/344 (14%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ ++ L P D E + ++ D V ++ + AT+ FGS S L + DLD
Sbjct: 79 MSALIAALQPAPADVEMVLNMVKDDLNRVLNLPN-NQATIYEFGSIKSGLLLKDSDLDFY 137
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I + K+ + + R G + + A+VP+L+ + N+ CD
Sbjct: 138 IHYAREKTEREEQIKLIHVVCSRMDRDTSFTGKVK----ILGAKVPLLRAVHVRTNLQCD 193
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLF 190
I+ N G SKF+ I + D R + ++VK WA+ I NSY L ++V+F
Sbjct: 194 INFSNARGCYNSKFIHAIMKFDERIHQLTVMVKFWAQCAHILTAH-HQMNSYCLIMMVIF 252
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC--AFNIARFSSDKYRKINRS 248
+ QT ++P ++D+ G IA I +N+ +Y+ N S
Sbjct: 253 YLQTRKLPVIPSVEDLQQG----------------IARITFGPWNLGYPQQIQYKTWNVS 296
Query: 249 SLAHLFVSFLEKFSGLSLKASELGICPFTGQW----EHIRSNTRWLP--------NNHP- 295
++ L V F + +S + I P+ G+ E + + R L N+P
Sbjct: 297 TVRELLVGFFKYYSEFDFAGN--IISPYVGRLCSTVELEKKSIRELAPYYRAVERENYPE 354
Query: 296 ------LFIEDPFEQPENSARAVSEKNLAK-----ISNAFEMTH 328
+ I+DPFE N + + LA+ + +A+E+
Sbjct: 355 LSLGPFITIQDPFELNVNVGKVLRMNVLAEQMKHSLKHAYELCQ 398
>gi|157109447|ref|XP_001650674.1| hypothetical protein AaeL_AAEL000750 [Aedes aegypti]
gi|108883995|gb|EAT48220.1| AAEL000750-PA [Aedes aegypti]
Length = 652
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/307 (25%), Positives = 134/307 (43%), Gaps = 49/307 (15%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS S L R DLD + KV + G + R +G + L +
Sbjct: 122 FGSIKSGLAFRDSDLDFYVHYEKNCESKQEQTKVIHIIHGRMAR----EGTFHGLVKILG 177
Query: 113 ARVPILKFETIH--QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK-AH 169
A+VP+L+ IH N++CDI+ N G SKF+F +++ D R + +++K WAK A
Sbjct: 178 AKVPLLR--AIHGPTNLTCDINFSNARGCYNSKFIFAVTRFDPRIHKLAIIIKFWAKCAF 235
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGV-RANAERQIAE 228
+ N + N+Y + ++++F+ QT +LP ++D+ KG+ R N
Sbjct: 236 LLTNHR--QMNTYCIIMMLIFYLQTKKLPLLPAVQDLQ--------KGIPRVN------- 278
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW----EHIR 284
+N+ K+ +N+ S+ L ++F + +S + + I PF G+ E +
Sbjct: 279 FGPWNLGYPKDIKFTTMNKESIRLLLLNFFKYYSTFEFEKN--LISPFVGRLCAVEEMKQ 336
Query: 285 SNTRWLP--------NNHPLF-------IEDPFEQPENSARAV-SEKNLAKISNAFEMTH 328
R L N P F I+DPFE N + S + + +F+ H
Sbjct: 337 KKVRELQPYYRAVEHQNFPEFNYGTQISIQDPFELNMNIGGVLNSAAHFEQFKLSFKTAH 396
Query: 329 FRLTSTN 335
+ TN
Sbjct: 397 EVMCETN 403
>gi|452988962|gb|EME88717.1| hypothetical protein MYCFIDRAFT_87561 [Pseudocercospora fijiensis
CIRAD86]
Length = 848
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 102/245 (41%), Gaps = 43/245 (17%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ N G ++ L S+ D R R MVL VK WAK IN+ +G
Sbjct: 445 LDFPKDGVGIQCDINFFNPLGLHNTQMLRCYSKCDPRVRPMVLFVKSWAKGRKINSSYSG 504
Query: 178 TFNSYSLSLLVLFHFQTCV-PAILPPLKDIY--------PGNLVDDLKGVRANAERQIAE 228
T +SY L+VL + V P +LP L+ + PG ++ G + R E
Sbjct: 505 TLSSYGYVLMVLHYLMNVVQPPVLPNLQMPWRPHAACTPPGATRAEVDGWVVDFWRNEDE 564
Query: 229 I-CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS---------------------GLSL 266
I A + + SS N+ SL L FL+ +S G+
Sbjct: 565 IEQALQMGQMSS------NKESLGSLLAGFLQYYSSMGNGPQFRWTQQVLSLRTPGGILT 618
Query: 267 KASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEM 326
K ++ + T + E + R+L IEDPFE N AR V+ + I + F
Sbjct: 619 KDAKGWVKATTEEGEGKKIQHRYL-----FCIEDPFELDHNVARTVTHNGIVAIRDEFRR 673
Query: 327 THFRL 331
FR+
Sbjct: 674 A-FRI 677
>gi|326488529|dbj|BAJ93933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 684
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 78/152 (51%), Gaps = 10/152 (6%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ C+S K++ + + L + Q G + +Q +
Sbjct: 533 YGSCANSFGFSNSDIDL--------CLSIDDKEMSKVDIILKLADILQAGNLQNIQALTR 584
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K + +SCDI ++NL + +K L +QID R R + +VK WAK+ +N
Sbjct: 585 ARVPIVKLMDLDTGLSCDICVNNLLAVVNTKLLRDYAQIDQRLRQLAFIVKHWAKSRRVN 644
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK 204
GT +SYS ++ + Q + ILP L+
Sbjct: 645 ETYQGTLSSYSYVIMCIHLLQ--LRRILPCLQ 674
>gi|451850481|gb|EMD63783.1| hypothetical protein COCSADRAFT_331634 [Cochliobolus sativus
ND90Pr]
Length = 1294
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 134/312 (42%), Gaps = 55/312 (17%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P ++D +TR + + ++ ++E+ V FGS + L++ D+DI
Sbjct: 275 MRELYDRLLPKQQDNDTRERFVKKVQRILETEFPGTQMMVHVFGSSGNMLWTSESDVDI- 333
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI + K++++ + L AL K G R+ + A+V I+K ++CD
Sbjct: 334 -------CIQTPMKRLEE--MHPLAEAL-DKHGMERVVCIPAAKVRIVKVWDPELQLACD 383
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLVL 189
I+++N+ ++ + Q+D R R + +++K W K +N+ GT +SY+ ++L
Sbjct: 384 INVNNVAAIENTRMIKTYIQLDDRVRPLAMIIKYWTKRRILNDAGIGGTISSYTWICMIL 443
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD-----KYRK 244
QT P +LP L ++ P D G + ++ F+ D + +
Sbjct: 444 NFLQTRDPPVLPNLHEL-PQRARDGTTGQPS-------------LSSFADDVGKLRGFGE 489
Query: 245 INRSSLAHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWL 290
NR SL L F + G + E G P GQ E +
Sbjct: 490 KNRESLGQLLFHFFRLYGHEVDYEKETISVRQGKRIPREEKGWHPGGGQKEGVNR----- 544
Query: 291 PNNHPLFIEDPF 302
L +E+PF
Sbjct: 545 -----LCVEEPF 551
>gi|326512464|dbj|BAJ99587.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 678
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 78/152 (51%), Gaps = 10/152 (6%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS ++ D+D+ C+S K++ + + L + Q G + +Q +
Sbjct: 533 YGSCANSFGFSNSDIDL--------CLSIDDKEMSKVDIILKLADILQAGNLQNIQALTR 584
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
ARVPI+K + +SCDI ++NL + +K L +QID R R + +VK WAK+ +N
Sbjct: 585 ARVPIVKLMDLDTGLSCDICVNNLLAVVNTKLLRDYAQIDQRLRQLAFIVKHWAKSRRVN 644
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK 204
GT +SYS ++ + Q + ILP L+
Sbjct: 645 ETYQGTLSSYSYVIMCIHLLQ--LRRILPCLQ 674
>gi|329664700|ref|NP_001192681.1| terminal uridylyltransferase 7 [Bos taurus]
gi|296484509|tpg|DAA26624.1| TPA: Caffeine Induced Death homolog family member (cid-1)-like [Bos
taurus]
Length = 1498
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 127/293 (43%), Gaps = 42/293 (14%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLD+ C++ G + + L + +L R L++
Sbjct: 1042 GTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEELDCVRTIEELARVLKKH 1093
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L S ID R + +
Sbjct: 1094 SGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYT 1153
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDL--KGVR 219
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G ++ G
Sbjct: 1154 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGEKKPEIFVDGWN 1213
Query: 220 ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASEL 271
QI E+ + +Y K N S+ L++ L ++ +S++ L
Sbjct: 1214 IYFFDQIDELPNY------WPEYGK-NTESVGQLWLGLLRFYTEEFDFKEHVISIRRKSL 1266
Query: 272 GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ F QW + + IEDPF+ N +S K I AF
Sbjct: 1267 -LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1307
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDINIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNNDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLESKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|402075506|gb|EJT70977.1| hypothetical protein GGTG_11999 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1354
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 103/226 (45%), Gaps = 24/226 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P + E R K++ L + V FGS + L S D+DI
Sbjct: 293 MRELYDRLKPTDKVKENRAKLVKKLDRIFNEGWPGHNIKVHLFGSSGNKLCSDDSDVDI- 351
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ ++ DLLR K G ++ V+ A+VPI+K ++C
Sbjct: 352 -------CITTDWKELENVCMIADLLR----KRGMDKVVCVSSAKVPIVKVWDPELQLAC 400
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ID R R + +++K W + IN+ GT +SY+ ++
Sbjct: 401 DMNVNNTLALENTRMVLTYVDIDERVRPLAMVIKHWTRRRIINDAAFGGTLSSYTWICMI 460
Query: 189 LFHFQTCVPAILP-----PLKDIYP-----GNLVDDLKGVRANAER 224
+ Q P ILP P K I P DD+ +R E+
Sbjct: 461 IAFLQLRSPPILPSLHLSPHKKILPETGRKSEFADDMSKLRGYGEK 506
>gi|444314869|ref|XP_004178092.1| hypothetical protein TBLA_0A07840 [Tetrapisispora blattae CBS 6284]
gi|387511131|emb|CCH58573.1| hypothetical protein TBLA_0A07840 [Tetrapisispora blattae CBS 6284]
Length = 711
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 106/205 (51%), Gaps = 14/205 (6%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
++L +KD + ++P RE+ E R + I +R+ V+ + + + + FGS+ ++L+
Sbjct: 203 DLLTEEIKDFVAYISPSREEIELRNQTIGKIRKAVKKL--WKNSDLYVFGSYATDLYLPG 260
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
D+D I ISSAG K + L L L+Q +++ +A ARVPI+KF
Sbjct: 261 SDIDCVI-------ISSAGDKENRHSLYQLSSWLKQNKLATKVEVIAKARVPIIKFVEPS 313
Query: 125 QNISCDISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
NI D+S + G +K + W++ G R++VL++K++ + +N+ G S
Sbjct: 314 SNIHIDVSFERSNGLEAAKTIRDWLNSTPG-LRELVLIIKQFLNSRRLNDVHLGGLGGLS 372
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYP 208
+ + +++ F + P IL DI P
Sbjct: 373 I-ICMVYSFLSLHPRILT--NDIDP 394
>gi|440902049|gb|ELR52894.1| Terminal uridylyltransferase 7, partial [Bos grunniens mutus]
Length = 1477
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 83/168 (49%), Gaps = 13/168 (7%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLD+ C++ G + + L + +L R L++
Sbjct: 1042 GTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEELDCVRTIEELARVLKKH 1093
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L S ID R + +
Sbjct: 1094 SGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYT 1153
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1154 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 1201
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 41/193 (21%), Positives = 86/193 (44%), Gaps = 18/193 (9%)
Query: 23 EDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
E+ E R+++ + ++E+V L ++ +GS S L + D++I I+ I
Sbjct: 311 ENLEQRLEI----KRIMENVFQHKLPDCSLRLYGSSCSRLGFKNSDINIDIQFP---AIM 363
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
S + +L + L+ + + HARVP++ + C +S N +
Sbjct: 364 S-----QPDVLLLVQECLKNNDSFIDVDADFHARVPVVVCREKQSGLLCKVSAGNENACL 418
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
+ L + +++ + +V+ + WAK I+ P+ G Y +L+ +F Q +L
Sbjct: 419 TTNHLTALGKLESKLVPLVIAFRYWAKLCSIDRPEEGGLPPYVFALMAIFFLQQRKEPLL 478
Query: 201 PPLKDIYPGNLVD 213
P +Y G+ ++
Sbjct: 479 P----VYLGSWIE 487
>gi|425774063|gb|EKV12382.1| hypothetical protein PDIP_52500 [Penicillium digitatum Pd1]
gi|425776189|gb|EKV14418.1| hypothetical protein PDIG_32940 [Penicillium digitatum PHI26]
Length = 1091
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/301 (23%), Positives = 133/301 (44%), Gaps = 30/301 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ D+ L P E + R +++ L ++ FGS + L S D+DI
Sbjct: 151 IMDLYDRLLPSAESDDRRRQLVRKLEKLFNDQWPGHDIKANIFGSSGNKLCSSDSDVDI- 209
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ LL ++L K G +R+ V+HA+VPI+K ++C
Sbjct: 210 -------CITTNYKELEHVCLLAEVL----AKYGMQRVVCVSHAKVPIVKIWDPELRLAC 258
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ++D R R + + +K W K +N+ GT +SY+ L+
Sbjct: 259 DMNVNNTLALENTRMIRTYVEVDERVRPLAMAIKHWTKQRILNDAALGGTLSSYTWICLI 318
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKIN 246
+ QT P ILP L+ R + +R + +C+F+ + ++ + N
Sbjct: 319 INFLQTRNPPILPSLQ-------------ARPHKKRMTPDGLVCSFDDDLKTLSQFGRKN 365
Query: 247 RSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPE 306
+ S+ L F ++ G + I G + + L N+ L +E+PF
Sbjct: 366 KQSVGGLLFHFF-RYYGYEFDYEKNVISVRDGTLINKEAKGWHLMLNNRLCVEEPFNTSR 424
Query: 307 N 307
N
Sbjct: 425 N 425
>gi|195016169|ref|XP_001984355.1| GH16410 [Drosophila grimshawi]
gi|193897837|gb|EDV96703.1| GH16410 [Drosophila grimshawi]
Length = 679
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 152/369 (41%), Gaps = 64/369 (17%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A V PFGS V+ L + D+DI +E ++ S + + +++ +LG LR+ + +
Sbjct: 114 ARVYPFGSLVTGLVLKDSDIDIYLEHTDTSSNAMSHRQLFDRILG----YLRRNDCFDDV 169
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
HARVPI++F + +S DI++ + S F+ + ++D R R++ L +K WAK
Sbjct: 170 VARRHARVPIIRFMHVVSGLSIDINMTSPKSTYNSCFIAALLRLDVRIRELFLFLKLWAK 229
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
I N +G+ SY L++L+++ Q P ++ + V ++ G+
Sbjct: 230 KLKIINC-SGSMTSYCLAVLIIYGMQQ--RKYFPSIRQMQACCPVQEVMGINYG------ 280
Query: 228 EICAFNIARFSSDKYRKINRS-SLAHLFVSFLEKFSGLSLKASELGICPFTG-------- 278
FS + ++ S + L SF + +S + L P+ G
Sbjct: 281 ---------FSLQQVPQLPDSLTTLDLITSFFQLYSRMDFDKKLLS--PYLGYALDLTTP 329
Query: 279 --------QWEHIRSNTRWLPNNHP--------LFIEDPFEQPENSARAVSEKNLAKISN 322
++E + + P + ++DPFE N +++S NLA +
Sbjct: 330 CSLSQNFPEYEKQLKTIQKVTGEQPEPFQSERCICVQDPFELEHNVGQSISPTNLAYLRQ 389
Query: 323 AFEMTHFRLTSTNQTRYALLSSLARPFILQFFG----------ESPVRYANYNNGHRRAR 372
+ + T LLSS A+ + FG E V A + R
Sbjct: 390 CLKFGYKACTDAK-----LLSSPAQLYDYLLFGLADELLQAQREKAVHPAKMSRQMREEM 444
Query: 373 PQSHKSVNS 381
P + K+ +
Sbjct: 445 PTAAKTTRT 453
>gi|297819100|ref|XP_002877433.1| hypothetical protein ARALYDRAFT_323241 [Arabidopsis lyrata subsp.
lyrata]
gi|297323271|gb|EFH53692.1| hypothetical protein ARALYDRAFT_323241 [Arabidopsis lyrata subsp.
lyrata]
Length = 406
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 80/145 (55%), Gaps = 1/145 (0%)
Query: 20 PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCI 79
P+ D++TR +++ +L + + +E +GSFV + FS DLD+SI +G+
Sbjct: 56 PVSADYDTRKELVKNLNAMAIDIFEESRPVLEAYGSFVMDTFSPQRDLDVSINFGSGTSE 115
Query: 80 SSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCG 138
S KK++ LR+L + R + + +ARVPI+KF I CD+++++ G
Sbjct: 116 LSRVKKLEILERFATKLRSLEGQVFVRNVVPIFNARVPIVKFCDQRTGIECDLAVESKDG 175
Query: 139 QIKSKFLFWISQIDGRFRDMVLLVK 163
+ SK + ISQID RF+ + LL +
Sbjct: 176 ILVSKIIRIISQIDDRFQKLCLLTQ 200
>gi|401406257|ref|XP_003882578.1| Novel protein (Zgc:110560), related [Neospora caninum Liverpool]
gi|325116993|emb|CBZ52546.1| Novel protein (Zgc:110560), related [Neospora caninum Liverpool]
Length = 1027
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 110/257 (42%), Gaps = 55/257 (21%)
Query: 3 SYNVLEPILKDIL---GMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSN 59
S +V E + D+ ++ P ED + +S L++++ V L V PFGS V+
Sbjct: 547 SPSVFEALTNDMRRLESLMLPGTEDQAGMRRFLSQLQDLLNGV--LDACVVTPFGSAVNG 604
Query: 60 LFSRWGDLDISIELSNGSCISSAGKKVKQ------------------------------- 88
L++ DLD+ +++ S +S K ++Q
Sbjct: 605 LWTPQSDLDVCVQVREASTRASQIKVLRQVAHALHPVHTHLVEPRFQARVPIIHWSPRFS 664
Query: 89 ------SLLGDLLR-----ALRQKGGYRRLQFVAHARVPIL-------KFETIHQNISCD 130
+LLG LR AL +K G R + E Q +SCD
Sbjct: 665 HSASGPALLGRFLRDPVARALHEKPGDARSRGREDEETGRRNDSYGEGDGERNTQMVSCD 724
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLF 190
IS++NL + SK L ID R R + VK WAK +IN+ GT +S+SL L+++
Sbjct: 725 ISVNNLLAVVNSKLLGAYVGIDPRLRTLGYAVKFWAKGRNINDRSRGTVSSFSLVLMLIH 784
Query: 191 HFQTCV-PAILPPLKDI 206
Q V P ILP L+D+
Sbjct: 785 FLQNHVQPRILPSLQDM 801
>gi|71985071|ref|NP_001021433.1| Protein F31C3.2, isoform a [Caenorhabditis elegans]
gi|3876503|emb|CAB07197.1| Protein F31C3.2, isoform a [Caenorhabditis elegans]
Length = 805
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 144/328 (43%), Gaps = 60/328 (18%)
Query: 26 ETRMKVISDLREVVESVE---SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
E R K+ S+++++ E G+TV GSF S D+D+ + C +
Sbjct: 483 EARQKLFSEIKKLFPDTEIKLQTTGSTVNGCGSFNS-------DMDLCL------CFPTN 529
Query: 83 GKK------------VKQSLLGDLLRALRQ-------KGGYRRLQFVAHARVPILK--FE 121
G K +L + +A R+ K + +Q V A+VPI+K
Sbjct: 530 GYKGQVCDDFHCDRNYSTKILRKIDKAFRRSHWSHPLKKIIKTMQLVP-AKVPIVKMILN 588
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181
I DI+++N+ G S + + S D R + LLVK WA INN + G NS
Sbjct: 589 GEFDGIEVDINVNNIAGIYNSHLIHYYSLTDARLPALALLVKHWAMVTGINNAQDGFLNS 648
Query: 182 YSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF-NIARFSS 239
Y+ LLV+ + Q V PA++P L+ ++P L + E+ F +IA
Sbjct: 649 YTTILLVVHYLQCGVTPAVIPNLQYLFPHKFDRKLP---------LNELLLFGDIA---- 695
Query: 240 DKY--RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF 297
DK N SL L + F + ++ + G +GQ + R L N+ P+
Sbjct: 696 DKLPTSPPNTWSLGELLIGFFQYYNEFDF--TNFGFSIRSGQVIPRENLPRDLINS-PIV 752
Query: 298 IEDPFEQPENSARAVSE-KNLAKISNAF 324
+E+PF+ N+AR V + ++ I +AF
Sbjct: 753 VEEPFDAI-NTARTVRDVSHMKSIKSAF 779
>gi|308499385|ref|XP_003111878.1| CRE-MUT-2 protein [Caenorhabditis remanei]
gi|308268359|gb|EFP12312.1| CRE-MUT-2 protein [Caenorhabditis remanei]
Length = 444
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 145/332 (43%), Gaps = 49/332 (14%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS----- 77
E++ +M L+ ++ + P GS V+ L ++ DLD++I + +
Sbjct: 58 EEFNRKMNWCYQLKNIISKHNPTWLFNIVPTGSTVTGLATKNSDLDVAIHIPQAARLLDE 117
Query: 78 -----CISSAGKKVK----QSLLGDLLRALRQKG-------GYRRLQFVAHARVPILKFE 121
+S + K Q + +R + +K + + + A++ IL+ E
Sbjct: 118 LYPQIALSEEERFCKWRGMQLEILQTVRLILEKDEQIKPLVNWEKGIHLVQAQIQILQIE 177
Query: 122 TIHQNISCDISI--DNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGT 178
T I CDIS+ + + + F+ ID RF + +VK+WA + + NPK G
Sbjct: 178 TA-DGIECDISVVMEPFLSSMHNSFMIRHYVHIDHRFATLCAVVKKWAASTGVKNPKDGG 236
Query: 179 FNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
FNSY+L +LV+ HF C P ILP L +Y + A +++ E+ F
Sbjct: 237 FNSYALVILVI-HFLQCGAYPPILPNLSKLYKDD------NFIATNDKKYPELLDFGAPL 289
Query: 237 FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN--- 293
++N++S A LF+ F+ + + + + + + ++S TR PN
Sbjct: 290 PRDLPKIQMNQASTAQLFLEFVHYYFEFDFQETYISM-----RDSIVKSRTR-CPNETVK 343
Query: 294 ----HPLFIEDPFEQPENSARAV-SEKNLAKI 320
++IEDPF+ N R V S N+ +I
Sbjct: 344 NEKQKDVYIEDPFDA-HNPGRTVRSLTNIKRI 374
>gi|32563609|ref|NP_491842.2| Protein GLD-2, isoform a [Caenorhabditis elegans]
gi|74957307|sp|O17087.2|GLD2_CAEEL RecName: Full=Poly(A) RNA polymerase gld-2; AltName: Full=Defective
in germ line development protein 2
gi|23306648|gb|AAM94369.1| regulatory cytoplasmic polyA polymerase [Caenorhabditis elegans]
gi|351064303|emb|CCD72658.1| Protein GLD-2, isoform a [Caenorhabditis elegans]
Length = 1113
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/261 (25%), Positives = 122/261 (46%), Gaps = 17/261 (6%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
+VL + D ++ E + ++ + L + V L G V GS ++ +
Sbjct: 547 DVLSEKIWDYHNKVSQTDEMLQRKLHLRDMLYTAISPVFPLSGLYV--VGSSLNGFGNNS 604
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILK--FET 122
D+D+ + ++N +K ++ +L+ + Q + Q + A+VPIL+ F
Sbjct: 605 SDMDLCLMITNKDL----DQKNDAVVVLNLILSTLQYEKFVESQKLILAKVPILRINFAA 660
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
+I+ D++ +N + L + S D R R +V +VKEWAK IN+ +F SY
Sbjct: 661 PFDDITVDLNANNSVAIRNTHLLCYYSSYDWRVRPLVSVVKEWAKRKGINDANKSSFTSY 720
Query: 183 SLSLLVLFHFQTCVPA-ILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDK 241
SL L+V+ HF C P +LP L+ YP + + N + E+ A +I + S+K
Sbjct: 721 SLVLMVI-HFLQCGPTKVLPNLQQSYPNRFSNKVDVRTLNVTMALEEV-ADDIDQSLSEK 778
Query: 242 YRKINRSSLAHLFVSFLEKFS 262
++L L + FL+ ++
Sbjct: 779 ------TTLGELLIGFLDYYA 793
>gi|71985077|ref|NP_001021434.1| Protein F31C3.2, isoform b [Caenorhabditis elegans]
gi|38422311|emb|CAE54895.1| Protein F31C3.2, isoform b [Caenorhabditis elegans]
Length = 808
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/328 (27%), Positives = 144/328 (43%), Gaps = 60/328 (18%)
Query: 26 ETRMKVISDLREVVESVE---SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
E R K+ S+++++ E G+TV GSF S D+D+ + C +
Sbjct: 486 EARQKLFSEIKKLFPDTEIKLQTTGSTVNGCGSFNS-------DMDLCL------CFPTN 532
Query: 83 GKK------------VKQSLLGDLLRALRQ-------KGGYRRLQFVAHARVPILK--FE 121
G K +L + +A R+ K + +Q V A+VPI+K
Sbjct: 533 GYKGQVCDDFHCDRNYSTKILRKIDKAFRRSHWSHPLKKIIKTMQLVP-AKVPIVKMILN 591
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181
I DI+++N+ G S + + S D R + LLVK WA INN + G NS
Sbjct: 592 GEFDGIEVDINVNNIAGIYNSHLIHYYSLTDARLPALALLVKHWAMVTGINNAQDGFLNS 651
Query: 182 YSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF-NIARFSS 239
Y+ LLV+ + Q V PA++P L+ ++P L + E+ F +IA
Sbjct: 652 YTTILLVVHYLQCGVTPAVIPNLQYLFPHKFDRKLP---------LNELLLFGDIA---- 698
Query: 240 DKY--RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF 297
DK N SL L + F + ++ + G +GQ + R L N+ P+
Sbjct: 699 DKLPTSPPNTWSLGELLIGFFQYYNEFDF--TNFGFSIRSGQVIPRENLPRDLINS-PIV 755
Query: 298 IEDPFEQPENSARAVSE-KNLAKISNAF 324
+E+PF+ N+AR V + ++ I +AF
Sbjct: 756 VEEPFDAI-NTARTVRDVSHMKSIKSAF 782
>gi|347441079|emb|CCD34000.1| similar to zinc finger protein [Botryotinia fuckeliana]
Length = 1243
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/333 (21%), Positives = 142/333 (42%), Gaps = 56/333 (16%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E E R +++S L ++ V FGS + L + D+DI
Sbjct: 275 MRELYDGLLPTAETDERRRRLVSKLEDMFNKEWPGHDIRVYVFGSSGNLLCTDASDVDI- 333
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K ++ ++ +LL K G +++ V+ A+VPI+K + C
Sbjct: 334 -------CITTDWKVMEGVCMIAELL----AKNGMQKVICVSTAKVPIVKIFDPELKLLC 382
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
D++++N ++ + +ID R R + +++K W K+ IN+ GT +SY+ +++
Sbjct: 383 DMNVNNTQALENTRMIKTYIEIDPRVRPLAMIIKHWTKSRAINDAVGGTLSSYTWICMII 442
Query: 190 FHFQTCVPAILP----------PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSS 239
Q+ P +LP P K+ + DD+ +R ++
Sbjct: 443 NFLQSREPPVLPSLHQRPHLKLPTKEGGESSFADDIDALRGFGQK--------------- 487
Query: 240 DKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW-LPNNHPLFI 298
N+S+L L F +F G + + G+ + + +W + N+ L +
Sbjct: 488 ------NKSTLGELLFQFF-RFYGHEFDYDKQVVSVRMGR-QISKEEKKWAIATNNMLCV 539
Query: 299 EDPFEQPENSARAVSEKNLAKISNAFEMTHFRL 331
E+PF +E+NL ++ F L
Sbjct: 540 EEPFN---------TERNLGNTADDFSFRGLHL 563
>gi|341896630|gb|EGT52565.1| CBN-PUP-3 protein [Caenorhabditis brenneri]
Length = 465
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/301 (25%), Positives = 138/301 (45%), Gaps = 27/301 (8%)
Query: 44 SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG 103
S A V GS+ + + R DLD ++ + C++ + K S L +RA Q+
Sbjct: 103 SYPDAKVWTVGSYPAGVDIRDSDLDFTLTIP---CLNES----KFSKLM-AIRAQFQRTE 154
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
+V P+LK + +I D++I+N + + L SQ+D RF + +K
Sbjct: 155 EFVNPWVVKGWNPVLKMKHKESDIWLDVTINNDAPKRNTMLLARYSQVDERFAKLCRAIK 214
Query: 164 EWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP---GNLVDDLKGVRA 220
+WA + N + G NS S+ LLV+F+ Q +LP L++++P GN+ D +
Sbjct: 215 KWAAETGVENSRNGRLNSCSICLLVIFYLQKV--GVLPNLQNVFPELNGNIEVDSDDYQQ 272
Query: 221 NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI--CPFTG 278
+ + + + N+SSL LF FL+ +S + + I
Sbjct: 273 ENILDDLKKAGWVVGK---------NKSSLGALFCGFLKFYSKFDFSSKWISIKRGRALD 323
Query: 279 QWEHIRSNTRWLPNN-HPLFIEDPF-EQPENSARAVSEKN-LAKISNAFEMTHFRLTSTN 335
+++ + LP++ + +EDPF + P N R VS+ + L +I F + R+ +T+
Sbjct: 324 KFDENGNKNEGLPDDVRFIVLEDPFMDTPFNCGRTVSQADILERIQLEFRLAVKRIKATH 383
Query: 336 Q 336
Q
Sbjct: 384 Q 384
>gi|452000520|gb|EMD92981.1| hypothetical protein COCHEDRAFT_1202862 [Cochliobolus
heterostrophus C5]
Length = 1472
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 134/312 (42%), Gaps = 55/312 (17%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P ++D +TR + + ++ ++E+ V FGS + L++ D+DI
Sbjct: 448 MRELYDRLLPKQQDNDTRERFVKKVQRILETEFPGTQMMVHVFGSSGNMLWTSESDVDI- 506
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI + K++++ + L AL K G R+ + A+V I+K ++CD
Sbjct: 507 -------CIQTPMKRLEE--MHPLAEAL-DKHGMERVVCIPAAKVRIVKVWDPELQLACD 556
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLVL 189
I+++N+ ++ + Q+D R R + +++K W K +N+ GT +SY+ ++L
Sbjct: 557 INVNNVAAIENTRMIKTYIQLDDRVRPLAMIIKYWTKRRILNDAGIGGTISSYTWICMIL 616
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD-----KYRK 244
QT P +LP L ++ P D G +++ F+ D + +
Sbjct: 617 NFLQTRDPPVLPNLHEL-PQRARDGTTG-------------QPSLSSFADDVGKLRGFGE 662
Query: 245 INRSSLAHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWL 290
NR SL L F + G + E G P GQ E +
Sbjct: 663 KNRESLGQLLFHFFRLYGHEVDYEKETISVRQGKRIPREEKGWHPGGGQKEGVNR----- 717
Query: 291 PNNHPLFIEDPF 302
L +E+PF
Sbjct: 718 -----LCVEEPF 724
>gi|403416514|emb|CCM03214.1| predicted protein [Fibroporia radiculosa]
Length = 1522
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 115/249 (46%), Gaps = 38/249 (15%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEP------FGSFVSNLFSRW 64
L D + L P E+ M V D+R+++E + +R T+EP FGS + R
Sbjct: 696 LFDFVIQLLPTSEE----MTVKEDVRKLLERL--IR--TIEPDSRLLSFGSSANGFSLRN 747
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET-- 122
D+D+ C+ + +++ + L ++L L ++ ++ + AR+PI+K
Sbjct: 748 SDMDLC-------CLIDSEERLSATDLVNMLGDLLERETKFHVKPLPRARIPIVKLSLDP 800
Query: 123 ---IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWAKAHDINNPKTGT 178
+ I+CDI +N ++ L + ID R R MVL +K W+K IN+P GT
Sbjct: 801 APGLPFGIACDIGFENRLALENTRLLMCYAMIDPARVRTMVLFLKVWSKRRKINSPYKGT 860
Query: 179 FNSYSLSLLVL-FHFQTCVPAILPPLKDIYPGNLVDD---LKGVRANAERQIAEICA--- 231
+SY LLVL F P +LP L+ + P + + L + R+ +C
Sbjct: 861 LSSYGYVLLVLYFLIHVKNPPVLPNLQQMPPLRPISEVIQLLDAEKGSPRERNRLCIEDP 920
Query: 232 ----FNIAR 236
FN+AR
Sbjct: 921 FETDFNVAR 929
>gi|327263393|ref|XP_003216504.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 7-like
[Anolis carolinensis]
Length = 1477
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 124/300 (41%), Gaps = 56/300 (18%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRR 106
G + FGS + + DLDI + + + +A + ++ +L R LR+ G R
Sbjct: 1021 GTKLNLFGSSKNGFGFKQSDLDICMTIDG---LETAEELDCIKIIEELARVLRKHSGLRN 1077
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+ + A+VPI+KF + + DIS+ N ++ L + +D R + + +K +
Sbjct: 1078 ILPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTRLLSCYAAVDPRVKYLCYTMKVFT 1137
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLKGVRAN 221
K DI + G+ +SY+ +L+VL+ Q P ++P L++I P LVD
Sbjct: 1138 KMCDIGDASRGSLSSYAYTLMVLYFLQQRSPPVIPVLQEICKEPKKPEILVD-------- 1189
Query: 222 AERQIAEICAFNIARFSSDKYRKI---------NRSSLAHLFVSFLEKFSGLSLKASELG 272
+N+ F DK ++ N+ S+ L++ L +F E
Sbjct: 1190 ---------GWNVYFF--DKMDELPTVWPDFGKNKESVGELWLGLL-RFYTEEFDFKEHV 1237
Query: 273 IC--------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
IC F QW + + IEDPF+ N +S K I AF
Sbjct: 1238 ICIRRKNLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1286
Score = 42.4 bits (98), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 37/190 (19%), Positives = 80/190 (42%), Gaps = 14/190 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R+ + + + ++ + L ++ +GS S + D+++ I+
Sbjct: 302 EDLEQRLGIKATMENILH--KKLPECSLRLYGSSSSRFGFKNSDVNLDIQF--------P 351
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + +L+ + + HARVP++ + + C +S N + +
Sbjct: 352 ASMSQPDVLLLVQESLKNSESFIDVDADFHARVPVVVCKEKQSGLVCLVSAGNENAFLTT 411
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ +V+ + WAK + P+ G Y +L+V+F Q LP
Sbjct: 412 NHLATLGKLEPHLVSLVIAFRYWAKLCCADRPEEGGLPPYVFALMVIFFLQQRKEPFLP- 470
Query: 203 LKDIYPGNLV 212
+Y G+ V
Sbjct: 471 ---VYLGSWV 477
>gi|307180713|gb|EFN68604.1| U6 snRNA-specific terminal uridylyltransferase 1 [Camponotus
floridanus]
Length = 722
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/336 (23%), Positives = 138/336 (41%), Gaps = 48/336 (14%)
Query: 11 LKDILGMLNPLREDWETRMKVI-SDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDI 69
L +L + + +TR VI + L E+ + FGS V+ L + DLDI
Sbjct: 158 LTALLNTIQLTEFELKTRYDVICTHLNEIFRPI--FPECQTYKFGSTVAGLSFKESDLDI 215
Query: 70 SIELSNGSCISSAGK------KVKQSLLGDLLRAL-RQKGGYRRLQFVAHARVPILKFET 122
+ + + K + ++ + R + K + + + A+ PI+KF
Sbjct: 216 YMYVGEIGLPPACHKPDIPPYMLTLTIFKRVRRIMYSMKSVFSNIISIPKAKTPIIKFRY 275
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
I N+SCDIS N G KS FL + + D R R ++LL+K WA+ ++ G +SY
Sbjct: 276 IPTNVSCDISFKNSLGIYKSNFLHYCASHDPRLRPLMLLIKYWARHFGVSG--IGRISSY 333
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
L L++F+ Q +LP L D+ + + G + N + S
Sbjct: 334 GLICLIIFYLQQESVGLLPSLLDLQKTCVPHIMYGWQVNFNENTV------LPPIS---- 383
Query: 243 RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEH-------------------- 282
N SS+A LF +F ++ + IC G+
Sbjct: 384 ---NSSSIAELFHNFFSFYATFHFNSC--VICLLDGKTYLAADFTQIDKLPDYMDRYKTC 438
Query: 283 -IRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+ ++ + L + PL ++DP E +N+A S++ L
Sbjct: 439 IMENSAKKLDVHKPLCLQDPIELNQNTAAITSDRAL 474
>gi|312382886|gb|EFR28176.1| hypothetical protein AND_04206 [Anopheles darlingi]
Length = 670
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 157/373 (42%), Gaps = 57/373 (15%)
Query: 7 LEPILKDI------------LGMLNPLREDWETRMKVIS-DLREVVESVESLRGATVEPF 53
LEP++K + L L P + E + ++ DL+ V+ + A + F
Sbjct: 66 LEPLVKALRECEPGNEMQVFLDELQPSEANIELGLALVKRDLQRVLSFASN--AAQIFEF 123
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS S L R DLD I + ++ + ++ + + + G + ++ + A
Sbjct: 124 GSVKSGLAMRDSDLDFYIHYQH----EQREREDQIKMIHVVASRMDKTGLFGQIVKITGA 179
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
+VP+L+ + N CDI+ N G SKF+ + Q D R + +++K W+K + +
Sbjct: 180 KVPLLRAIHLSTNCCCDINFSNARGCYNSKFIKAVMQFDPRILHLAMIIKFWSKCAYVLD 239
Query: 174 PKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFN 233
K FNSY L ++++F+ QT ++P ++D+ G E +N
Sbjct: 240 EKR-QFNSYCLVMMLIFYLQTRKLPVIPSVEDMQQGI--------------PRIEYGPWN 284
Query: 234 IARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQW----EHIRSNTRW 289
+ + Y+ N +SL L + F +S S I PF G+ E + N R
Sbjct: 285 LGYPQAITYKTWNENSLRDLLIGFFRYYSEFEF--SRNLISPFVGRLCSLQELEKKNIRE 342
Query: 290 LPNNH---------PLF------IEDPFEQPENSARA-VSEKNLAKISNAFEMTHFRLTS 333
L + PL I+DPFE N AR V++K + + + + T
Sbjct: 343 LAKYYRACEVDGYTPLMTGPWITIQDPFELNLNVARVMVTDKRFEQFRLSLQHGYDVCTK 402
Query: 334 TNQTRYA-LLSSL 345
+ +A LL +L
Sbjct: 403 HREASFAKLLEAL 415
>gi|68072113|ref|XP_677970.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56498279|emb|CAH96569.1| conserved hypothetical protein [Plasmodium berghei]
Length = 408
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 133/285 (46%), Gaps = 31/285 (10%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-KGGY 104
+ V PFGS ++ + + D+DI I++ +K + S L + L G
Sbjct: 100 KNCHVTPFGSVINGFWMKNSDIDICIQIP-----ILLNRKDQISFLKKICLILNNYHNGI 154
Query: 105 RRLQFVAHARVPILKFETI-HQN---ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+F A+VPI+ F H+N +SCDIS++N+ I SK + ID R + M +
Sbjct: 155 IEQRF--SAKVPIIHFYCDDHKNTFQLSCDISVNNILAVINSKLIQKYVSIDKRLQLMGI 212
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDI---------YPGN 210
+K W+K +IN+ G +S+SL L+V+ Q + P IL L+DI Y
Sbjct: 213 ALKYWSKKRNINDRSKGFLSSFSLILMVIHFLQYVMEPKILTSLQDISIRRNEKSFYVMG 272
Query: 211 LVDDLKGVRANAERQIAEICAFNIAR-FSSD--KYRKINRSSLAHLFVSFLEKFSGLSLK 267
+ D K + + + E+ NI SSD Y ++ ++ L + F KF G K
Sbjct: 273 V--DCKYCQDDVIIR-EELKRMNIQNGISSDNKNYDHASQVDISTLMLEFF-KFYGYKYK 328
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV 312
+ + I +E+ S + ++ LF+++PFE +N A +
Sbjct: 329 SGIIAIRDINNYYENFASLKSY--ESYYLFVDNPFEIGKNVANIL 371
>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
Length = 604
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P R + TR VI+ L+ V S G FGS ++L+ D+D+
Sbjct: 169 MKDFVSYISPSRAEIVTRNNVINTLKREVSSF--WPGTEAHVFGSCATDLYLPGSDIDMV 226
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ ISS G +S L L LR K + ++ +A A+VPI+KF N+ D
Sbjct: 227 V-------ISSTGDYENRSRLYQLSSFLRAKNLAKNVEVIASAKVPIIKFVDPESNLPID 279
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + W+ G R++VL+VK++ ++ +NN G Y+ ++++
Sbjct: 280 ISFERTNGLDAARRIRRWLLATPG-LRELVLVVKQFLRSRKLNNVHVGGLGGYA-TIIMC 337
Query: 190 FHFQTCVPAI 199
+HF P I
Sbjct: 338 YHFMQLHPKI 347
>gi|302667945|ref|XP_003025551.1| PAP/25A associated domain family [Trichophyton verrucosum HKI 0517]
gi|291189665|gb|EFE44940.1| PAP/25A associated domain family [Trichophyton verrucosum HKI 0517]
Length = 1178
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 92/195 (47%), Gaps = 8/195 (4%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E E R+K + L +++++ V FGS + L + D D
Sbjct: 150 IKELYQKLLPSPESEERRVKFVRKLEKLLDTQWPGNQIKVNVFGSSGNKLCTSDSDADFL 209
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ + S S+ + GG R+ V+HA+VPI+K ++CD
Sbjct: 210 AKSEHSSLFYSSSTR-------PFFANSSYTGGMERVVCVSHAKVPIVKIWDPELQVACD 262
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + ++D R R + +LVK W K +N+ GT +SY+ L++
Sbjct: 263 MNVNNTLALENTRMIKTYVELDDRIRPLAMLVKHWTKRRILNDAALGGTLSSYTWICLII 322
Query: 190 FHFQTCVPAILPPLK 204
QT +P I+P L+
Sbjct: 323 NFLQTRIPPIVPSLQ 337
>gi|444711081|gb|ELW52035.1| Elongation factor 1-gamma [Tupaia chinensis]
Length = 1212
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 131/330 (39%), Gaps = 59/330 (17%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW---------------------GD 66
R V++ ++EV E G V PFGS V++ GD
Sbjct: 180 RSLVVALMQEVF--TEFFPGCVVHPFGSSVNSFDVHGCDLDLFLDLGDLEEPQEDRGGGD 237
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
L ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 238 LGKALELAEALKGEKPEGVAMLDLVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRPSG 295
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
+ D+S+ N S+FL S++D R R +V ++ WA+ ++ N+Y+L+L
Sbjct: 296 LHGDVSLSNRLALHNSRFLSLCSELDERVRPLVYTIRCWAQGRGLSGSGP-HLNNYALTL 354
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR--- 243
LV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 355 LVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPKDASRLEP 402
Query: 244 KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLPNN 293
N L+ L F S L S L + P G WE +R
Sbjct: 403 STNVEPLSSLLAQFFSCVSSWDLHGSLLSLREGQALPVAGGLPSHLWEGLRLG------- 455
Query: 294 HPLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 456 -PMNLQDPFDLSHNVAANVTSRVAGRLQNC 484
>gi|71996960|ref|NP_001021845.1| Protein GLD-2, isoform d [Caenorhabditis elegans]
gi|351064306|emb|CCD72661.1| Protein GLD-2, isoform d [Caenorhabditis elegans]
Length = 807
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/231 (27%), Positives = 110/231 (47%), Gaps = 17/231 (7%)
Query: 35 LREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDL 94
L + V L G V GS ++ + D+D+ + ++N +K ++ +L
Sbjct: 271 LYTAISPVFPLSGLYV--VGSSLNGFGNNSSDMDLCLMITNKDL----DQKNDAVVVLNL 324
Query: 95 LRALRQKGGYRRLQFVAHARVPILK--FETIHQNISCDISIDNLCGQIKSKFLFWISQID 152
+ + Q + Q + A+VPIL+ F +I+ D++ +N + L + S D
Sbjct: 325 ILSTLQYEKFVESQKLILAKVPILRINFAAPFDDITVDLNANNSVAIRNTHLLCYYSSYD 384
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYPGNL 211
R R +V +VKEWAK IN+ +F SYSL L+V+ HF C P +LP L+ YP
Sbjct: 385 WRVRPLVSVVKEWAKRKGINDANKSSFTSYSLVLMVI-HFLQCGPTKVLPNLQQSYPNRF 443
Query: 212 VDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
+ + N + E+ A +I + S+K ++L L + FL+ ++
Sbjct: 444 SNKVDVRTLNVTMALEEV-ADDIDQSLSEK------TTLGELLIGFLDYYA 487
>gi|432873614|ref|XP_004072304.1| PREDICTED: terminal uridylyltransferase 7-like [Oryzias latipes]
Length = 1265
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 122/297 (41%), Gaps = 50/297 (16%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRR 106
GA ++ FGS + R DLDI + L I ++ L R L++ +
Sbjct: 851 GARLQLFGSSKNGFGFRQSDLDICMVLEGKENIDDVDCI---RIIESLARCLKKNPDLKN 907
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+ + A+VPI+KF I+ ++ DIS+ N + L + ID R + + ++K +A
Sbjct: 908 ILPITTAKVPIVKFYHINTSLEGDISLYNTLALHNTHLLASYAAIDRRVKILCYVMKVFA 967
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN-------------LVD 213
K DI + G+ +SY+ +L+VLF Q P ++P L++IY G D
Sbjct: 968 KMCDIGDASRGSLSSYAYTLMVLFFLQQRNPPVIPVLQEIYFGKHKPEVLVDGWNVYFFD 1027
Query: 214 DLKGV------RANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLK 267
DLK + I E+ + RF ++ + F E + +
Sbjct: 1028 DLKTLPSHWPQHGKNTESIGEL-WLGLLRFYTEDF-------------DFKEHVVSIRQR 1073
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
A + F+ QW + + IEDPF+ N +S K I AF
Sbjct: 1074 AR---LTTFSKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMKAF 1116
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 39/179 (21%), Positives = 82/179 (45%), Gaps = 11/179 (6%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D E R V+S ++E+++SV L + +GS + + D++I IE +
Sbjct: 267 QDVEKRRCVVSVMQELLQSV--LPDIKLRLYGSSCTKFGFKDSDVNIDIECPHMH----- 319
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + +L + L+ HARVP + + + C +S N +
Sbjct: 320 ----QPDVLLVVKESLSTCPLFINLEADFHARVPAVICKEKKSGLVCKVSAGNENAHQTT 375
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+L +S + +V+ ++ WA+ +++N + G Y+ +L+V++ Q +LP
Sbjct: 376 CYLSALSSREPVLLPLVMGLRRWARICEVDNAEVGGLPPYAFALMVIYFLQKRKEPLLP 434
>gi|332249971|ref|XP_003274127.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Nomascus leucogenys]
Length = 912
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 114/272 (41%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
G L + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 356 GGLGKAPELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 413
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 414 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 472
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 473 TLLVIYFLQTRDPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 520
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLP 291
N L+ L F S L+ S L + P G WE +R
Sbjct: 521 EPSTNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLSSNLWEGLRLG----- 575
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
PL ++DPF+ N A V+ + ++ N
Sbjct: 576 ---PLNLQDPFDLSHNVAANVTSRVAGRLQNC 604
>gi|355729958|gb|AES10041.1| zinc finger, CCHC domain containing 6 [Mustela putorius furo]
Length = 904
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 90/182 (49%), Gaps = 15/182 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 455 IRQNLESFIRQEFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 506
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R L++ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 507 VRTIEELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 566
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY
Sbjct: 567 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIY 626
Query: 208 PG 209
G
Sbjct: 627 KG 628
>gi|395852618|ref|XP_003798832.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Otolemur garnettii]
Length = 861
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 111/271 (40%), Gaps = 30/271 (11%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKALELAEAPKGEKTDGGAMLELVGSILRGCVP--GVYRVQTVPSARCPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++D R R +V ++ WA+ ++ N+Y+L
Sbjct: 376 SGLHGDVSLSNRLALHNSRFLSLCSELDERVRPLVYTLRCWAQGRGLSG-SGPLLNNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+LLV++ QT P +LP V L + E+ + + R +S
Sbjct: 435 TLLVIYFLQTRNPPVLP---------TVSQLTQKAGDGEQVEVDGWDCSFPRNASGLEPS 485
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTG-----QWEHIRSNTRWLPNNH 294
N L+ L F S L+ S L + P WE +R
Sbjct: 486 TNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAAVLSSNSWEGLRLG-------- 537
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
P+ ++DPF+ N A V+ + ++ N +
Sbjct: 538 PMNLQDPFDLSHNVAANVTSRIAGRLQNCCQ 568
>gi|150951520|ref|XP_001387852.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
gi|149388662|gb|EAZ63829.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
Length = 605
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 97/190 (51%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P E+ TR ++I+ L+ + S V FGS ++L+ D+DI
Sbjct: 182 MKDFVNYISPSSEEIRTRNRLINKLKSSISSYWPETETHV--FGSSATDLYLPGSDIDIV 239
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I +S G +S L L LR KG + ++ +A A+VPI+KF N++ D
Sbjct: 240 I-------VSRTGDYENRSRLYQLSSYLRHKGLAKNMEVIAKAKVPIIKFVDPESNVNID 292
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G + K W++ G R++VL++K++ + +NN +G Y+ ++++
Sbjct: 293 VSFERRNGIEAAKKIRRWMTTTPG-LRELVLIIKQFLSSRRLNNVHSGGLGGYA-TIILC 350
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 351 YHFLMMHPRV 360
>gi|171684135|ref|XP_001907009.1| hypothetical protein [Podospora anserina S mat+]
gi|170942028|emb|CAP67680.1| unnamed protein product [Podospora anserina S mat+]
Length = 1251
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 130/299 (43%), Gaps = 28/299 (9%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
LE + D+ +L P +E R K+++ L ++ V FGS + L S D
Sbjct: 277 LETEMNDLFRVLLPTQEVETKRQKLVNKLEKLFNDEWPGHDIKVHLFGSSGNLLCSDDSD 336
Query: 67 LDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
+DI CI++ K ++ L+ DLL + G + + ++ A+VPI+K
Sbjct: 337 VDI--------CITTPWKGLEHVCLIADLL----DRHGMQDVVCISAAKVPIVKIWDPEL 384
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSL 184
++CD++++N ++ + ID R R + +++K W + IN+ GT +SY+
Sbjct: 385 KLACDMNVNNTLALENTRMVRTYVSIDERVRPLAMIIKHWTRRRIINDAAFGGTLSSYTW 444
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+++ Q P +LP L L+ G R+ + ++ F
Sbjct: 445 ICMIIAFLQLRDPPVLPALHQRQKEKLLKS-DGTRSEFADDVPKLTGFGAK--------- 494
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW-LPNNHPLFIEDPF 302
N+ SLA L F +F + + G+ ++ +W + N+ L IE+PF
Sbjct: 495 -NKESLAALLFQFF-RFYAYEFDYDKFALSIRVGKLL-TKTEKKWHIGTNNTLCIEEPF 550
>gi|17507815|ref|NP_491621.1| Protein PUP-3 [Caenorhabditis elegans]
gi|351049859|emb|CCD63902.1| Protein PUP-3 [Caenorhabditis elegans]
Length = 482
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 107/217 (49%), Gaps = 20/217 (9%)
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+V +P+L+ +S D++IDN + ++ L + SQ+D RF + +K WA +
Sbjct: 155 YVQKGNIPVLQLMHAETKVSIDVTIDNDTSKRNTQLLAFYSQLDTRFPLLCKAMKAWAAS 214
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP---GNL-VDDLKGVRANAER 224
+ G NS+SL L+++ + QT +L +++I+P G++ V+D ++ + +
Sbjct: 215 CGVEGASRGRLNSFSLCLMLIHYLQTV--QVLLNIQEIFPELNGDIVVEDDNYMKRDLKI 272
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI---CPFTGQWE 281
+I E AF+ + N SSLA LF+ F++ +S + K + + I +W
Sbjct: 273 EILEKGAFDFNQ---------NTSSLAVLFIGFMKYYSEFNFKWNWISIKHGNVLKKKWS 323
Query: 282 HIRSNTRWLPNN-HPLFIEDP-FEQPENSARAVSEKN 316
R +P + + + DP E P N A V ++N
Sbjct: 324 KTRVPKNGMPKDCRFIVVADPLLEIPRNCAGTVRQQN 360
>gi|410730487|ref|XP_003671423.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
gi|401780241|emb|CCD26180.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
Length = 582
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 95/191 (49%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D + ++P +E+ + R I +R+ V A + FGS+ ++L+ D+D
Sbjct: 173 IRDFVSYISPSKEEIKERNDTIGRIRDAVNHF--WNDANLHVFGSYATDLYLPGSDIDCV 230
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I IS G K +S L L L+++G ++ +A ARVPI+KF I D
Sbjct: 231 I-------ISEKGDKDSRSSLYALANFLKKRGLATDIEVIAKARVPIIKFIDPRSKIHID 283
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + + G +K + W++ G R++ L+VK++ A +NN TG +S+ LV
Sbjct: 284 VSFERINGLEAAKLIREWLNDTPG-LREITLIVKQFLHARRLNNVHTGGLGGFSIICLV- 341
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 342 FSFLQMHPRII 352
>gi|341877205|gb|EGT33140.1| hypothetical protein CAEBREN_32021 [Caenorhabditis brenneri]
Length = 447
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 70/275 (25%), Positives = 125/275 (45%), Gaps = 34/275 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GSF + + DLD +I + + + +S K L ++ L+++ ++ +
Sbjct: 115 GSFAAGIDLPTSDLDFTISIPSLTAETSFEK------LKMIMERLQRQSVFKVV------ 162
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
++P+L I + D++IDN + ++ L W QID RF + VK WA I
Sbjct: 163 KIPVLMLVHIATGVEVDVTIDNDTPKRNTQLLRWYGQIDNRFTTICRAVKYWASESQIEC 222
Query: 174 PKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP---GNL-VDDLKGVRANAERQIAEI 229
K G NS+S+ L+V+ + Q ++LP L+ +P G + ++ + R N +R++
Sbjct: 223 SKQGRLNSFSICLMVIHYLQQV--SVLPNLQAKFPELNGEIKIEADESGRRNLKREL--- 277
Query: 230 CAFNIARFSSDKYR-KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
S YR N SLA L+ F + F K + + T + I +
Sbjct: 278 --------KSRGYRLNKNEDSLAALYWCFFKYFKEFDFKTHWISVKRGTLVEKEIVEEGQ 329
Query: 289 WLP---NNHPLFIEDPF-EQPENSARAVSEKNLAK 319
+ N + +E+PF EQP N AR V + ++ +
Sbjct: 330 EMTHVCNGAFIGVENPFLEQPWNCARTVRQGDITE 364
>gi|154308271|ref|XP_001553472.1| hypothetical protein BC1G_07881 [Botryotinia fuckeliana B05.10]
Length = 1246
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 140/332 (42%), Gaps = 54/332 (16%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E E R +++S L ++ V FGS + L + D+DI
Sbjct: 278 MRELYDGLLPTAETDERRRRLVSKLEDMFNKEWPGHDIRVYVFGSSGNLLCTDASDVDI- 336
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI++ K ++ + + L K G +++ V+ A+VPI+K + CD
Sbjct: 337 -------CITTDWKVMEGVCM---IAELLAKNGMQKVICVSTAKVPIVKIFDPELKLLCD 386
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLF 190
++++N ++ + +ID R R + +++K W K+ IN+ GT +SY+ +++
Sbjct: 387 MNVNNTQALENTRMIKTYIEIDPRVRPLAMIIKHWTKSRAINDAVGGTLSSYTWICMIIN 446
Query: 191 HFQTCVPAILP----------PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
Q+ P +LP P K+ + DD+ +R ++
Sbjct: 447 FLQSREPPVLPSLHQRPHLKLPTKEGGESSFADDIDALRGFGQK---------------- 490
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW-LPNNHPLFIE 299
N+S+L L F +F G + + G+ + + +W + N+ L +E
Sbjct: 491 -----NKSTLGELLFQFF-RFYGHEFDYDKQVVSVRMGR-QISKEEKKWAIATNNMLCVE 543
Query: 300 DPFEQPENSARAVSEKNLAKISNAFEMTHFRL 331
+PF +E+NL ++ F L
Sbjct: 544 EPFN---------TERNLGNTADDFSFRGLHL 566
>gi|345783764|ref|XP_533266.3| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Canis lupus familiaris]
Length = 952
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/272 (24%), Positives = 116/272 (42%), Gaps = 36/272 (13%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
G+L ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 396 GNLGKALELAEALKGEKPEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 453
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++D R R +V ++ WA+ ++ ++Y+L
Sbjct: 454 SGLHGDVSLSNRLALHNSRFLSLCSELDERVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 512
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + + E + E+ ++ + F D R
Sbjct: 513 TLLVIYFLQTREPPVLPTVSQL-----------TQKAGEGEQVEVDGWDCS-FPRDASRL 560
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELG-----ICPFTG-----QWEHIRSNTRWLP 291
N+ L+ L F S L+ S L + P G +WE +R
Sbjct: 561 EPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSLREGQVLPVAGGLPSNRWEGLRLG----- 615
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 616 ---PMNLQDPFDLSHNVAANVTSRVAGRLQNC 644
>gi|406604992|emb|CCH43591.1| Poly(A) RNA polymerase protein 1 [Wickerhamomyces ciferrii]
Length = 624
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 92/179 (51%), Gaps = 11/179 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ E R + LRE + +E V FGS+ ++L+ D+D+
Sbjct: 216 IKDFIAYISPSKEEIELRNNTVRKLREAI--MELWPDCEVHVFGSYATDLYLPGSDIDMV 273
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I +S G ++ L L L++K + ++ +A A+VPI+KF NI D
Sbjct: 274 I-------VSEHGGYESRNSLYSLSSFLKRKNLAKNVEVIAKAKVPIIKFTESTSNIHID 326
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV 188
+S + G +K + WI++ G R++VL+VK++ + +NN G YS+ LV
Sbjct: 327 VSFERTNGIDAAKTIRSWITETPG-LREIVLIVKQFLSSRKLNNVHVGGLGGYSIICLV 384
>gi|241709482|ref|XP_002413373.1| zinc finger protein, putative [Ixodes scapularis]
gi|215507187|gb|EEC16681.1| zinc finger protein, putative [Ixodes scapularis]
Length = 349
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 100/206 (48%), Gaps = 7/206 (3%)
Query: 2 GSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
G +L + D++ +P + + R ++ L ++ + + A + +GS +
Sbjct: 51 GHIMILNDVCLDVMRQCSPRPHEEKDRSTLLHGLERLIRELYT--DARLTLYGSSCNGFG 108
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQS-LLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
DLD+ + + S GK++ S + +L R LR R+ + A+VPI+KF
Sbjct: 109 LARSDLDLCLTFDS----SKDGKELCLSQTIPELARKLRAHPDLARIVPITTAKVPIVKF 164
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ + DIS+ N ++ L S ID R R + +K +AK DI + G+ +
Sbjct: 165 YHLPSRLEGDISLYNTLALHNTRLLKVYSAIDERVRVLGYTLKHFAKTCDIGDASRGSLS 224
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDI 206
SY+ L+VL++ Q C P ++P L+++
Sbjct: 225 SYAYILMVLYYLQQCQPPVIPVLQEV 250
>gi|449275511|gb|EMC84353.1| Terminal uridylyltransferase 7 [Columba livia]
Length = 1485
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 125/305 (40%), Gaps = 66/305 (21%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + DL + L+++
Sbjct: 1034 GTKLNLFGSSKNGFGFKQSDLDI--------CMTMDGLETAEGLDCIKIIEDLAKVLKKQ 1085
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G + + + A+VPI+KF I + DIS+ N ++ L + ID R + +
Sbjct: 1086 SGLKNVLPITTAKVPIVKFFHIRSGLEVDISLYNTLALHNTRLLSSYAAIDPRVKYLCYT 1145
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVDDLK 216
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P LVD
Sbjct: 1146 MKVFTKICDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKEPKKPEILVD--- 1202
Query: 217 GVRANAERQIAEICAFNIARFSSDKYRKI---------NRSSLAHLFVSFLEKFSGLSLK 267
+N+ F DK ++ N S+ L++ L +F
Sbjct: 1203 --------------GWNVYFF--DKIEELSVVWPDCGKNTESVGQLWLGLL-RFYTEEFD 1245
Query: 268 ASELGIC--------PFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ IC F QW + + IEDPF+ N +S K
Sbjct: 1246 FKDHVICIRRKNLLTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNF 1294
Query: 320 ISNAF 324
I AF
Sbjct: 1295 IMKAF 1299
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 36/190 (18%), Positives = 82/190 (43%), Gaps = 14/190 (7%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
ED E R+ + + + ++ + L + +GS S + D++I I+
Sbjct: 300 EDLEERLNIKTMMENLLR--QKLPECSFRLYGSSYSRFGFKTSDVNIDIQF--------P 349
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ +L + +L+ + + HAR+P++ + + C +S N + +
Sbjct: 350 ASVTQPDVLLLVQESLQNSESFVEVDADFHARIPVVVCKEKQSGLICKVSAGNENACLTT 409
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +++ +V+ + WAK +++P+ G + Y +L+V+F Q LP
Sbjct: 410 NHLAALGKLEPTVVPLVIAFRYWAKLCCVDHPEEGGLSPYVFALMVIFFLQQRKEPFLP- 468
Query: 203 LKDIYPGNLV 212
+Y G+ +
Sbjct: 469 ---VYLGSWI 475
>gi|417403016|gb|JAA48333.1| Putative polya rna polymerase mitochondrial [Desmodus rotundus]
Length = 584
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 103/222 (46%), Gaps = 25/222 (11%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q + +AR P+++F CD++ +N S+ L+ S +D R R +V +
Sbjct: 300 GCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALKSSELLYIYSALDSRVRALVFSI 359
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+ WA+AH + + G + ++SL+++V+F Q P +LP L D LK + A+
Sbjct: 360 RCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPVLPTL---------DYLKTL-AD 409
Query: 222 AERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTG 278
AE + I + F+ D R N +L L F E F + + + I
Sbjct: 410 AEDKC--IIEGHNCTFTCDLNRIKPSENTETLELLLKEFFEYFGNFAFNKNSINI----- 462
Query: 279 QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ R + P + PL I++PFE N ++ VS+ L K
Sbjct: 463 --QQGREQNK--PESSPLHIQNPFETSLNISKNVSQSQLQKF 500
>gi|149641714|ref|XP_001505968.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Ornithorhynchus anatinus]
Length = 648
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 115/266 (43%), Gaps = 42/266 (15%)
Query: 76 GSCISSAGK----KVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
GS ++ GK K L+G +LR G R++ V AR P++KF + D+
Sbjct: 319 GSAEAAQGKAGEEKATLELVGSVLRGCVP--GVHRVRPVPSARRPVVKFCHRPSGLHGDV 376
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
S+ N S++L SQ+DGR R +V ++ WA+ + ++Y+LSLL L+
Sbjct: 377 SLSNRLALYNSQYLRLCSQLDGRVRPLVYSLRCWAQGRGLTGSGP-LLSNYALSLLALYF 435
Query: 192 FQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRS 248
QT P +LPPL + + + G E + E+ ++ + F D R N
Sbjct: 436 LQTRSPPVLPPLTQL------NQMAG-----EGEQVEVDGWDCS-FPQDASRLEPSTNVE 483
Query: 249 SLAHLFVSFLEKFSGLSLKASEL-----------GICPFTGQWEHIRSNTRWLPNNHPLF 297
+++ L F S L+ S L G P G W +R PL
Sbjct: 484 AVSSLLSQFFSCVSAWDLRGSVLSLREGRALAVAGGLP-RGAWGGLRLG--------PLN 534
Query: 298 IEDPFEQPENSARAVSEKNLAKISNA 323
I+DPF+ N A V+ + ++ +
Sbjct: 535 IQDPFDLSHNVAANVTGRVAGRLQSC 560
>gi|32562829|ref|NP_491841.2| Protein GLD-2, isoform c [Caenorhabditis elegans]
gi|351064305|emb|CCD72660.1| Protein GLD-2, isoform c [Caenorhabditis elegans]
Length = 1036
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 63/231 (27%), Positives = 110/231 (47%), Gaps = 17/231 (7%)
Query: 35 LREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDL 94
L + V L G V GS ++ + D+D+ + ++N +K ++ +L
Sbjct: 500 LYTAISPVFPLSGLYV--VGSSLNGFGNNSSDMDLCLMITN----KDLDQKNDAVVVLNL 553
Query: 95 LRALRQKGGYRRLQFVAHARVPILK--FETIHQNISCDISIDNLCGQIKSKFLFWISQID 152
+ + Q + Q + A+VPIL+ F +I+ D++ +N + L + S D
Sbjct: 554 ILSTLQYEKFVESQKLILAKVPILRINFAAPFDDITVDLNANNSVAIRNTHLLCYYSSYD 613
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYPGNL 211
R R +V +VKEWAK IN+ +F SYSL L+V+ HF C P +LP L+ YP
Sbjct: 614 WRVRPLVSVVKEWAKRKGINDANKSSFTSYSLVLMVI-HFLQCGPTKVLPNLQQSYPNRF 672
Query: 212 VDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
+ + N + E+ A +I + S+K ++L L + FL+ ++
Sbjct: 673 SNKVDVRTLNVTMALEEV-ADDIDQSLSEK------TTLGELLIGFLDYYA 716
>gi|302831043|ref|XP_002947087.1| hypothetical protein VOLCADRAFT_87356 [Volvox carteri f.
nagariensis]
gi|300267494|gb|EFJ51677.1| hypothetical protein VOLCADRAFT_87356 [Volvox carteri f.
nagariensis]
Length = 609
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 100/196 (51%), Gaps = 8/196 (4%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLF 61
Y VL ++D++ L+P ++ + +V+ +R+ ++ E L VEPFGS++ L
Sbjct: 231 YTVLGRNIQDLVVQLSPTAQEVRAKEQVLQLVRQAAQAAFPEYLGVMRVEPFGSYMCGLG 290
Query: 62 SRWGDLDISI----ELSNGSCISSAGKKVKQS-LLGDLLRALRQKGGYRRLQFVAHARVP 116
++ D+D+ + E S+ S G++ + + LL L LR + +L V HARVP
Sbjct: 291 TKSSDIDVVLVGLAEPSSSLGFYSKGERPRVARLLDRLTPHLRSRLRVTKLIAVRHARVP 350
Query: 117 ILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
ILK T +S DISI L G ++++ R +VL++K + + D+
Sbjct: 351 ILKI-TTSAGVSVDISIAGLSGPQAAEYIRQQVSSFPALRPLVLVLKSYMRELDLAEVAK 409
Query: 177 GTFNSYSLSLLVLFHF 192
G +SY L+ +V+ H
Sbjct: 410 GGISSYGLTYMVMAHL 425
>gi|402854572|ref|XP_003891939.1| PREDICTED: terminal uridylyltransferase 4 [Papio anubis]
Length = 1569
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 104/208 (50%), Gaps = 6/208 (2%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L+ + K L+P + R +++ L + ++ E A + FGS + R
Sbjct: 948 ILDLVCKRCFDELSPPCSEQHNREQILIGLEKFIQK-EYDEKARLCLFGSSKNGFGFRDS 1006
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
DLDI + L +A K + ++ +L + L++ G R + + A+VPI+KFE
Sbjct: 1007 DLDICMTLEGHE---NAEKLNCKEIIENLAKILKRHPGLRNILPITTAKVPIVKFEHRRS 1063
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ DIS+ N Q ++ L + ID R + + +K +AK DI + G+ +SY+
Sbjct: 1064 GLEGDISLYNTLAQHNTRMLATYAAIDPRVQYLGYTMKVFAKRCDIGDASRGSLSSYAYI 1123
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVD 213
L+VL+ Q P ++P L++ P +++D
Sbjct: 1124 LMVLYFLQQRKPPVIPVLQE--PVDVMD 1149
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 132/332 (39%), Gaps = 55/332 (16%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D R +++ ++ +V+ + L ++ +GS ++ + D++I I+
Sbjct: 370 DDLRVRQEIVEEMSKVITTF--LPECSLRLYGSSLTKFALKSSDVNIDIKF--------P 419
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
K LL +L L++ Y ++ HA+VP++ + C +S N + +
Sbjct: 420 PKMNHPDLLIKVLGILKKNVLYVDVESDFHAKVPVVVCRDRKSGLLCRVSAGNDMACLTT 479
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
L + +I+ F +VL + WAK I++ G SY +L+V+F Q P +LP
Sbjct: 480 DLLTALGKIEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKPPLLPC 539
Query: 203 L-----KDIYPGNLVD-DLKGV-------------RANAERQIAE--------------- 228
L + P + D LKG+ A + IAE
Sbjct: 540 LLGSWIEGFDPKRMDDFQLKGIVEEKFVKWECNSSSATEKNSIAEENKAKADQPKDDTKK 599
Query: 229 ICAFNIARFSSDKYRK-------INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
N + +K+ K NR SL L++ L KF L E IC
Sbjct: 600 TETDNQSNAMKEKHGKSPLTLETPNRVSLGQLWLELL-KFYTLDFALEEYVICVRIRDI- 657
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
R N W P + IEDPF N AR+++
Sbjct: 658 LTRENKNW-PKRR-IAIEDPFSVKRNVARSLN 687
>gi|328785134|ref|XP_623956.3| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Apis
mellifera]
Length = 561
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 134/302 (44%), Gaps = 54/302 (17%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKV-------------KQSLLGD 93
+V PFGS V+ DLD+ + + I+S+ +K +Q D
Sbjct: 211 NVSVIPFGSSVNGFGQIGCDLDLLCKTDISNIINSSTRKFFYMSQPLHLADRNEQKEFLD 270
Query: 94 LLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ + + G ++ + ARVPI+KF N+ CD+S N+ S+ L+ ++
Sbjct: 271 AISTIMKICIPGIDNIKKILEARVPIIKFFNQTTNMHCDLSSTNVIALHMSELLYTYGEL 330
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGN 210
D R + ++ +++WA+ +I +G + ++SL+LL+LF+ Q + ILP LK I
Sbjct: 331 DWRVKPLICTIRKWARNTNITKDISGQWITNFSLTLLILFYLQ--IKNILPSLKVI---- 384
Query: 211 LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI--NRSSLAHLFVSFLEKFSGLSLKA 268
K N+ R +S+D I N SL +L F E +S K
Sbjct: 385 -----KYHIKNSRRS-----------WSNDWKNSINYNNDSLHNLLYGFFEYYSIFDFKM 428
Query: 269 SELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTH 328
IC G+ R ++ PL+I +PF++ N ++ ++ L ++ + H
Sbjct: 429 Q--AICIKEGK-------PRVKKDSSPLYIYNPFDESLNVSKNITIFELTRL-----IDH 474
Query: 329 FR 330
FR
Sbjct: 475 FR 476
>gi|326935115|ref|XP_003213624.1| PREDICTED: terminal uridylyltransferase 7-like, partial [Meleagris
gallopavo]
Length = 920
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 87/177 (49%), Gaps = 18/177 (10%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLDI C++ G + + L + DL + L+++
Sbjct: 739 GTKLNLFGSSKNGFGFKQSDLDI--------CMTMDGLETAEGLDCIKIIEDLAKVLKKQ 790
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L + ID R + +
Sbjct: 791 SGLRNVLPITTAKVPIVKFFHVRSGLEVDISLYNTLALHNTRLLSSYAAIDPRVKYLCYT 850
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY-----PGNLVD 213
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY P LVD
Sbjct: 851 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKEPKKPEILVD 907
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 39/192 (20%), Positives = 83/192 (43%), Gaps = 18/192 (9%)
Query: 24 DWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
D + R+K+ R V+E V + L ++ +GS S + D++I I+
Sbjct: 39 DLQERLKI----RTVMEDVLHQKLPECSLRLYGSSFSGFGFKTSDINIDIQF-------- 86
Query: 82 AGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIK 141
+ +L + +L+ + + H R+P++ + C +S N +
Sbjct: 87 PASMSQPDVLLLVQESLQNSESFIGVDADFHTRIPVVVCREKQSGLICKVSAGNENAYLT 146
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+ L I +++ +V+ + WAK +++P+ G + Y +L+V+F Q LP
Sbjct: 147 TNHLATIGKLEPTVASLVIAFRYWAKLCCVDHPEEGGLSPYVFALMVIFFLQQRKEPFLP 206
Query: 202 PLKDIYPGNLVD 213
+Y G+ ++
Sbjct: 207 ----VYLGSWIE 214
>gi|334348804|ref|XP_001375646.2| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Monodelphis
domestica]
Length = 577
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 124/297 (41%), Gaps = 47/297 (15%)
Query: 47 GATVEPFGSFVSNLFSRWG----DLDISIELSNGSCISSAGK-----KVK---------Q 88
TV+PFGS V N F + G ++ S +AG +VK Q
Sbjct: 222 ACTVKPFGSSV-NSFGKLGCDLDMFLDLDDIGKNSAAKTAGSFSMEYQVKNVSSERWATQ 280
Query: 89 SLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+L + L G G +Q + HAR P+++F CD++ +N S+ L+
Sbjct: 281 KILSVIGECLDNFGPGCVSVQKILHARCPLVRFSHQASGFQCDLTANNRIALKSSELLYI 340
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGT-FNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+D R R + V+ WA+ + + G ++SL+++V+F Q P I+P
Sbjct: 341 YGTLDSRVRALAFSVRYWARQQALTSSIPGAWLTNFSLTIMVIFFLQKRSPPIIPS---- 396
Query: 207 YPGNLVDDLKGVRANAERQIAE--ICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
+D LK + ++ I E C N+ R +N +L L F E F
Sbjct: 397 -----IDYLKNLAGAEDKCIIEGNDCTLVSNLDRIKPS----LNTETLDILLCQFFEYFG 447
Query: 263 GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I + + P++ P++I++PFE N +R V+E L +
Sbjct: 448 NFAFNKNSINI---------RKGKEQNKPDSSPIYIQNPFEPTLNVSRNVNENQLER 495
>gi|322697766|gb|EFY89542.1| hypothetical protein MAC_04397 [Metarhizium acridum CQMa 102]
Length = 1303
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 118/260 (45%), Gaps = 37/260 (14%)
Query: 13 DILGMLNPLREDWETRMKVISDLREV------VESVESLRGATVE----------PFGSF 56
D+ + N L ED E K+ +D+RE+ E VE R V+ P
Sbjct: 274 DLRTVKNKLSEDEE--RKLAADMREIYNHLLPTEEVEEKRKKLVQKLEKIFNDEWPGHDI 331
Query: 57 VSNLFSRWGDLDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARV 115
NLF G+L S + CI+++ ++++ ++ DLL + G ++ ++ A+V
Sbjct: 332 RVNLFGSSGNLLCSDDSDVDICITTSWQELEGVCMIADLL----ARRGMEKVVCISAAKV 387
Query: 116 PILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK 175
PI+K ++CD++++N ++ + + D R R + +++K W + IN+
Sbjct: 388 PIVKIWDPELGLACDMNVNNTLALENTRMVRIYVEADPRVRQLAMIIKYWTRRRIINDAA 447
Query: 176 -TGTFNSYSLSLLVLFHFQTCVPAILP---------PLKDIYPGNLVDDLKGVRANAER- 224
GT +SY+ L++ Q P +LP P D P D+LK ++ +
Sbjct: 448 FGGTLSSYTWICLIIAFLQLRSPPVLPALHQLPYKMPRSDGTPSEFADNLKKIKGYGNKN 507
Query: 225 --QIAEICAFNIARFSSDKY 242
+AE+ F RF + ++
Sbjct: 508 KSSVAELL-FQFFRFYAHEF 526
>gi|71996950|ref|NP_001021844.1| Protein GLD-2, isoform b [Caenorhabditis elegans]
gi|351064304|emb|CCD72659.1| Protein GLD-2, isoform b [Caenorhabditis elegans]
Length = 871
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 104/212 (49%), Gaps = 15/212 (7%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ + D+D+ + ++N +K ++ +L+ + Q + Q + A
Sbjct: 352 GSSLNGFGNNSSDMDLCLMITN----KDLDQKNDAVVVLNLILSTLQYEKFVESQKLILA 407
Query: 114 RVPILK--FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+VPIL+ F +I+ D++ +N + L + S D R R +V +VKEWAK I
Sbjct: 408 KVPILRINFAAPFDDITVDLNANNSVAIRNTHLLCYYSSYDWRVRPLVSVVKEWAKRKGI 467
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
N+ +F SYSL L+V+ HF C P +LP L+ YP + + N + E+
Sbjct: 468 NDANKSSFTSYSLVLMVI-HFLQCGPTKVLPNLQQSYPNRFSNKVDVRTLNVTMALEEV- 525
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
A +I + S+K ++L L + FL+ ++
Sbjct: 526 ADDIDQSLSEK------TTLGELLIGFLDYYA 551
>gi|393222417|gb|EJD07901.1| hypothetical protein FOMMEDRAFT_138017 [Fomitiporia mediterranea
MF3/22]
Length = 863
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 88/168 (52%), Gaps = 14/168 (8%)
Query: 49 TVEPFGSFVSNLFSRWGDLDISI---ELSNGSCISSAGKKVKQSL-LGDLLRALRQKGGY 104
TVE FGS + S DLD+ I + NG + + + + +L R+L + G+
Sbjct: 299 TVECFGSTQYGVDSPTSDLDLVILDHDRENGFSPDVSTRSLPVVYNVQNLARSL-EYNGF 357
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
R Q + A VPI+KFE N++CDI++++ G ++ + ++ R ++ L+K+
Sbjct: 358 RVFQTIPTASVPIVKFEDPRTNLNCDINVNDRLGLCNTRLIAQYCKLSPLLRPLLGLIKK 417
Query: 165 WAKAHDINNPK----TGTFNSYSLSLLVLFHFQT-----CVPAILPPL 203
WAK +N+P T TF+SYSL+L+ + Q + A LPPL
Sbjct: 418 WAKTTGLNDPSGDKGTATFSSYSLTLMTIGFLQAHEQLPNLQAGLPPL 465
>gi|348564218|ref|XP_003467902.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Cavia porcellus]
Length = 852
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 109/248 (43%), Gaps = 38/248 (15%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR R G R+ V AR P++KF + DIS+ N S+FL S
Sbjct: 342 LVGSILR--RCVPGVYRVHSVPSARRPVVKFCHRPSGLHGDISLGNRLALHNSRFLSLCS 399
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
++DGR R +V V+ WA+ + ++Y+L+LLV++ QT P +LP + +
Sbjct: 400 ELDGRVRPLVYTVRCWAQGRGLTG-SGPLLSNYALTLLVIYFLQTRDPPVLPTVAQLT-- 456
Query: 210 NLVDDLKGVRANAERQIAEI----CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
+A E Q+ E+ C+F R +S+ N L L F S +
Sbjct: 457 --------QKAGEEEQV-EVDGWDCSF--PRDTSNLESSTNVEPLGSLLAQFFSCVSCWN 505
Query: 266 LKASEL------GICPFTGQ----WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
L+ S L + GQ WE +R P+ ++DPF+ N A V+
Sbjct: 506 LRGSLLSLREGQALLVAGGQLADLWEGLRLG--------PMNLQDPFDLSHNVAANVTSC 557
Query: 316 NLAKISNA 323
A++ N
Sbjct: 558 VAARLQNC 565
>gi|221481442|gb|EEE19832.1| poly(A) polymerase cid, putative [Toxoplasma gondii GT1]
Length = 1032
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 117/262 (44%), Gaps = 60/262 (22%)
Query: 3 SYNVLEPILKDIL---GMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSN 59
S +V E + +D+ ++ P ED + +S L++++ SV L V PFGS V+
Sbjct: 545 SPSVFEALNRDMQRLESLMMPGTEDQAGMRRFLSQLQDLLNSV--LDACIVTPFGSAVNG 602
Query: 60 LFSRWGDLDISIELSNGSCISSAGKKVKQ------------------------------- 88
L++ DLD+ +++ + S ++ K ++Q
Sbjct: 603 LWTPQSDLDVCVQVRDASTRANQIKVLRQVAHALHPVHTHLVEPRFQARVPIIHWAPRFS 662
Query: 89 -------SLLGDLLR-----ALRQKGGYRRLQFVAH---ARVPI----LKFETIH----Q 125
+L G LR AL ++ G RL+ AR L E + Q
Sbjct: 663 HSTSGSVALSGRFLRDPVARALYERPGNERLRERGDNESARRTTAGRNLSEEGMEERNTQ 722
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+SCDIS++NL + SK L +D R R + VK WAK +IN+ GT +S+SL
Sbjct: 723 MVSCDISVNNLLAVVNSKLLGAYVGVDPRLRTLGYAVKFWAKGRNINDRSRGTVSSFSLV 782
Query: 186 LLVLFHFQTCV-PAILPPLKDI 206
L+++ Q V P ILP L+D+
Sbjct: 783 LMLIHFLQNHVQPRILPSLQDM 804
>gi|221501958|gb|EEE27708.1| RNA binding motif protein, putative [Toxoplasma gondii VEG]
Length = 1032
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 117/262 (44%), Gaps = 60/262 (22%)
Query: 3 SYNVLEPILKDIL---GMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSN 59
S +V E + +D+ ++ P ED + +S L++++ SV L V PFGS V+
Sbjct: 545 SPSVFEALNRDMQRLESLMMPGTEDQAGMRRFLSQLQDLLNSV--LDACIVTPFGSAVNG 602
Query: 60 LFSRWGDLDISIELSNGSCISSAGKKVKQ------------------------------- 88
L++ DLD+ +++ + S ++ K ++Q
Sbjct: 603 LWTPQSDLDVCVQVRDASTRANQIKVLRQVAHALHPVHTHLVEPRFQARVPIIHWAPRFS 662
Query: 89 -------SLLGDLLR-----ALRQKGGYRRLQFVAH---ARVPI----LKFETIH----Q 125
+L G LR AL ++ G RL+ AR L E + Q
Sbjct: 663 HSTSGSVALSGRFLRDPVARALYERPGNERLRERGDNESARRTTAGRNLSEEGMEERNTQ 722
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+SCDIS++NL + SK L +D R R + VK WAK +IN+ GT +S+SL
Sbjct: 723 MVSCDISVNNLLAVVNSKLLGAYVGVDPRLRTLGYAVKFWAKGRNINDRSRGTVSSFSLV 782
Query: 186 LLVLFHFQTCV-PAILPPLKDI 206
L+++ Q V P ILP L+D+
Sbjct: 783 LMLIHFLQNHVQPRILPSLQDM 804
>gi|403255086|ref|XP_003920278.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Saimiri boliviensis boliviensis]
Length = 874
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 111/269 (41%), Gaps = 30/269 (11%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ L+G +LR G R+Q V AR P++KF
Sbjct: 318 GDLGKALELTEAPKWEKTEGTAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 375
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++D R R +V ++ WA+ ++ ++Y+L
Sbjct: 376 SGLHGDVSLSNRLALHNSRFLSLCSELDDRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 434
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+LLV++ QT P +LP V L + E+ + + R +S
Sbjct: 435 TLLVIYFLQTRDPPVLP---------TVSQLTQKAGDGEQVEVDGWDCSFPRDASRLEPS 485
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLPNNH 294
N L+ L F S L+ S L + P G WE +R
Sbjct: 486 TNVEPLSSLLAQFFSCVSCWDLRGSLLSLREGQALPVAGGLPSNLWEGLRLG-------- 537
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNA 323
P+ ++DPF+ N A V+ + ++ N
Sbjct: 538 PMNLQDPFDLSHNVAANVTSRVAGRLQNC 566
>gi|237844137|ref|XP_002371366.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211969030|gb|EEB04226.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 1032
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 73/262 (27%), Positives = 117/262 (44%), Gaps = 60/262 (22%)
Query: 3 SYNVLEPILKDIL---GMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSN 59
S +V E + +D+ ++ P ED + +S L++++ SV L V PFGS V+
Sbjct: 545 SPSVFEALNRDMQRLESLMMPGTEDQAGMRRFLSQLQDLLNSV--LDACIVTPFGSAVNG 602
Query: 60 LFSRWGDLDISIELSNGSCISSAGKKVKQ------------------------------- 88
L++ DLD+ +++ + S ++ K ++Q
Sbjct: 603 LWTPQSDLDVCVQVRDASTRANQIKVLRQVAHALHPVHTHLVEPRFQARVPIIHWAPRFS 662
Query: 89 -------SLLGDLLR-----ALRQKGGYRRLQFVAH---ARVPI----LKFETIH----Q 125
+L G LR AL ++ G RL+ AR L E + Q
Sbjct: 663 HSTSGSVALSGRFLRDPVARALYERPGNERLRERGDNESARRTTAGRNLSEEGMEERNTQ 722
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+SCDIS++NL + SK L +D R R + VK WAK +IN+ GT +S+SL
Sbjct: 723 MVSCDISVNNLLAVVNSKLLGAYVGVDPRLRTLGYAVKFWAKGRNINDRSRGTVSSFSLV 782
Query: 186 LLVLFHFQTCV-PAILPPLKDI 206
L+++ Q V P ILP L+D+
Sbjct: 783 LMLIHFLQNHVQPRILPSLQDM 804
>gi|403218109|emb|CCK72601.1| hypothetical protein KNAG_0K02380 [Kazachstania naganishii CBS
8797]
Length = 632
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 97/190 (51%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + +R + + +R+ V S+ R + + FGS+ ++L+ D+D
Sbjct: 169 IKDFVAYISPTGSEIISRNRAVQKIRKAVRSL--WRDSDLHVFGSYATDLYMPGSDIDCV 226
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S++ K L +L R LR + +++ ++ RVPI+KF H N+ D
Sbjct: 227 VN-------STSMDKENTQYLYELARHLRDENLAVQIEVISRTRVPIIKFIEPHSNLHID 279
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + L G ++ + W+ + G R++VL++K++ A +N+ TG +S+ LV
Sbjct: 280 VSFERLNGIEAARLIRGWLRETPG-LRELVLIIKQFLAARRLNDVHTGGLGGFSIICLV- 337
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 338 YSFMNLHPKI 347
>gi|354544020|emb|CCE40742.1| hypothetical protein CPAR2_107770 [Candida parapsilosis]
Length = 608
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 99/202 (49%), Gaps = 13/202 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + TR VI+ L++ + S G FGS ++L+ D+D+
Sbjct: 165 MKDFVRYISPSKAEIITRNNVINTLKKEISSF--WPGTEAHVFGSCATDLYLPGSDIDMV 222
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ ISS G +S L L LR K + ++ +A+A+VPI+KF N+ D
Sbjct: 223 V-------ISSTGDYENRSRLYQLSSFLRVKNLAKNVEVIANAKVPIIKFVDPDSNLPVD 275
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + W+ G R++VL+VK++ ++ +NN G Y+ ++++
Sbjct: 276 ISFERTNGLDAARRIRKWLLATPG-LRELVLVVKQFLRSRKLNNVHVGGLGGYA-TIIMC 333
Query: 190 FHFQTCVPAILPPLKDIYPGNL 211
+HF P I D P NL
Sbjct: 334 YHFMQLHPKISTNTMDA-PDNL 354
>gi|407843928|gb|EKG01701.1| hypothetical protein TCSYLVIO_007293 [Trypanosoma cruzi]
Length = 406
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 97/197 (49%), Gaps = 11/197 (5%)
Query: 24 DWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
D ET ++ DL+E V + S+ A VE FGS VS D+D+S+ N S
Sbjct: 24 DHET---ILKDLQERVLDIGLRSVNKAHVELFGSHVSGFCKPTSDVDLSLTYRNFSPWLQ 80
Query: 82 AGKKVKQSLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
++V + ++R ++ G ++++ AR+P+++F + CD+SI N+ G
Sbjct: 81 GMERVDEQNNKRMMRFSKEAAEMGMEDVRYI-RARIPVVQFIDSLSGLHCDLSIGNVGGV 139
Query: 140 IKSKFLFWISQIDGRF-RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CV 196
SK L I ++ F R + LVKEW KA ++ P+ TFNS++++ + L Q +
Sbjct: 140 ENSKILAAIREVFPDFYRAYIHLVKEWGKAREVIAPERSTFNSFTVTTMALMVLQELGLL 199
Query: 197 PAILPPLKDIYPGNLVD 213
P P D L D
Sbjct: 200 PVFSKPTGDFGELTLPD 216
>gi|399218174|emb|CCF75061.1| unnamed protein product [Babesia microti strain RI]
Length = 551
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/214 (30%), Positives = 99/214 (46%), Gaps = 24/214 (11%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSR 63
+++L+ ++ + L+P +E + +I + +V +E PFGS S + R
Sbjct: 156 FSMLDSEIRGLHFQLSPTLAQYENKNNLI---KALVPILEGKTKGKFYPFGSCESGFWVR 212
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLL---RALRQKGGYRRLQFVAHARVPILKF 120
D+D +C+ G + L L RAL G ++ + HA VPI K
Sbjct: 213 GSDVD--------ACLVIPGCDTRSQWLHKLRLIKRALSSVPGISFIRII-HANVPIAKV 263
Query: 121 ETIHQNI-------SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
I I CDISI+N + F+ +++ID R + ++K WA INN
Sbjct: 264 GKILHEIFNEENANVCDISINNTVALENTLFVKVLNKIDYRTSQLGRIIKYWASCRKINN 323
Query: 174 PKTGTFNSYSLSLLVLFHF-QTCVPAILPPLKDI 206
GT +SY+L LL+LFHF Q P ILP DI
Sbjct: 324 RAQGTMSSYTL-LLMLFHFLQNRKPPILPKYMDI 356
>gi|349605433|gb|AEQ00672.1| Poly(A) RNA polymerase, mitochondrial-like protein, partial [Equus
caballus]
Length = 304
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 103/236 (43%), Gaps = 24/236 (10%)
Query: 88 QSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
Q +L + L Q G G +Q + +AR P+++F CD++ +N S+ L+
Sbjct: 4 QKILSVIGECLDQFGPGCVGVQKILNARCPLVRFSHQASGFQCDLTTNNRIALKSSELLY 63
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKD 205
+D R R +V V+ WA+AH + + G + ++SL+++V+F Q P ILP L
Sbjct: 64 IYGALDSRVRALVFSVRCWARAHSLTSSIPGAWITNFSLTMMVIFFLQRRSPPILPTLD- 122
Query: 206 IYPGNLVDDLKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG 263
Y NL D C F ++ R + N +L L F E F
Sbjct: 123 -YLENLADAEDKCVIEGHN-----CTFIRDLNRIKPSE----NTETLELLLKEFFEYFGN 172
Query: 264 LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + + I R + P + PL I++PFE N ++ V++ L K
Sbjct: 173 FAFNKNSINI-------RQGREQNK--PESSPLHIQNPFETSLNISKNVTQSQLQK 219
>gi|71412381|ref|XP_808378.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70872571|gb|EAN86527.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 406
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 97/197 (49%), Gaps = 11/197 (5%)
Query: 24 DWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
D ET ++ DL+E V + S+ A VE FGS VS D+D+S+ N S
Sbjct: 24 DHET---ILKDLQERVLDIGLRSVNKAHVELFGSHVSGFCKPTSDVDLSLTYRNFSPWLQ 80
Query: 82 AGKKVKQSLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
++V + ++R ++ G ++++ AR+P+++F + CD+SI N+ G
Sbjct: 81 GMERVDEQNNKRMMRFSKEAAEMGMEDVRYI-RARIPVVQFIDSLSGLHCDLSIGNVGGV 139
Query: 140 IKSKFLFWISQIDGRF-RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CV 196
SK L I ++ F R + LVKEW KA ++ P+ TFNS++++ + L Q +
Sbjct: 140 ENSKILAAIREVFPDFYRAYIHLVKEWGKAREVIAPERSTFNSFTVTTMALMVLQELGLL 199
Query: 197 PAILPPLKDIYPGNLVD 213
P P D L D
Sbjct: 200 PVFSKPTGDFGELTLPD 216
>gi|242817783|ref|XP_002487018.1| PAP/25A associated domain family [Talaromyces stipitatus ATCC
10500]
gi|218713483|gb|EED12907.1| PAP/25A associated domain family [Talaromyces stipitatus ATCC
10500]
Length = 1073
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 106/233 (45%), Gaps = 24/233 (10%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E + R +++ L ++ V FGS + L S D+DI
Sbjct: 124 LLPSAESDDRRRQLVQKLEKLFNEQWPGNNIDVHVFGSSGNKLCSSDSDVDI-------- 175
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
CI+++ K+++ L L L Q G R+ V+HARVPI+K ++CD++++N
Sbjct: 176 CITTSFKQLENVCL--LAEVLAQHG-MERVVCVSHARVPIVKIWDPQLKMACDMNVNNTL 232
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLFHFQTCV 196
++ + ID R R + +++K W K +N+ GT +SY+ L++ QT
Sbjct: 233 ALENTRMIRTYVDIDERVRPLAMIIKHWTKRRVLNDAALGGTLSSYTWICLIINFLQTRD 292
Query: 197 PAILPPLK----------DIYPGNLVDDLKGVR--ANAERQIAEICAFNIARF 237
P ILP L+ D + DDL+ +R ++ RQ F R+
Sbjct: 293 PPILPSLQQQAHKAHKVIDGVQVSFDDDLESLRGYGHSNRQTLGELLFQFFRY 345
>gi|71650833|ref|XP_814106.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70879051|gb|EAN92255.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 406
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 97/197 (49%), Gaps = 11/197 (5%)
Query: 24 DWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
D ET ++ DL+E V + S+ A VE FGS VS D+D+S+ N S
Sbjct: 24 DHET---ILKDLQERVLDIGLRSVNKAHVELFGSHVSGFCKPTSDVDLSLTYRNFSPWLQ 80
Query: 82 AGKKVKQSLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
++V + ++R ++ G ++++ AR+P+++F + CD+SI N+ G
Sbjct: 81 GMERVDEQNNKRMMRFSKEAAEMGMEDVRYI-RARIPVVQFIDSLSGLHCDLSIGNVGGV 139
Query: 140 IKSKFLFWISQIDGRF-RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CV 196
SK L I ++ F R + LVKEW KA ++ P+ TFNS++++ + L Q +
Sbjct: 140 ENSKILAAIREVFPDFYRAYIHLVKEWGKAREVIAPERSTFNSFTVTTMALMVLQELGLL 199
Query: 197 PAILPPLKDIYPGNLVD 213
P P D L D
Sbjct: 200 PVFSKPTGDFGELTLPD 216
>gi|224128147|ref|XP_002329093.1| predicted protein [Populus trichocarpa]
gi|222869762|gb|EEF06893.1| predicted protein [Populus trichocarpa]
Length = 543
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 146/344 (42%), Gaps = 66/344 (19%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L+P +E+ +R + + + +V++ + VE FGSF + L+ D+D+
Sbjct: 131 DFCDFLSPTQEEQASRAEAVRCVFDVIKYI--WPNCKVEVFGSFRTGLYLPTSDIDV--- 185
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I +G K Q L L RAL QKG +++Q +A ARVPI+KF +S DIS
Sbjct: 186 -----VILGSGLKSPQIGLNALSRALSQKGVAKKIQVIARARVPIVKFVEKRSGVSFDIS 240
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G I ++F+ R + L++K + + ++N +G +SY+L +++
Sbjct: 241 FDVNGGPIAAEFIKNAISKWPELRPLCLILKVFLQQRELNEVYSGGISSYALLAMLMAML 300
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q + + +A+ ER + + L H
Sbjct: 301 Q--------------------NHRECQASLERNLGLL--------------------LIH 320
Query: 253 LFVSFLEKFSGLSLKASELGI-CPFTGQWEHIRSNTRWLPNNHPLF--IEDPFEQPENSA 309
F F G L + +G+ C TG + R+ ++ N P IEDP + PEN
Sbjct: 321 FF-----DFYGRKLNTTNVGVSCKGTGTFFSKRTKG-FMNNGRPFLIAIEDP-QAPENDI 373
Query: 310 RAVSEKNLAKISNAFEMTHFRLTSTNQT-----RYALLSSLARP 348
+ N +I +AF M LT+ ++L ++ RP
Sbjct: 374 -GKNSFNYFQIRSAFAMAFTTLTNPKTILSLGPNRSILGTIIRP 416
>gi|363746116|ref|XP_428151.3| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase,
partial [Gallus gallus]
Length = 852
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 121/299 (40%), Gaps = 54/299 (18%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDL------------ 94
G V PFGS V+ S DLD+ ++L + S G S+L D+
Sbjct: 240 GCAVLPFGSSVNGFDSHCCDLDLLLDLEATKSLPSDGSDAADSILSDVHPGSAPPEELLD 299
Query: 95 -----LRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
LR R G R++ V AR P++KF + DISIDN ++FL +
Sbjct: 300 LVASVLR--RCVPGVTRVRPVPTARRPVVKFCHKQSGLLGDISIDNRLALHNTRFLRLCA 357
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGT---FNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+ D R R +V V+ WAK + G +Y+L+LLVLF QTC P +LP
Sbjct: 358 EADARVRPLVYAVRLWAKRQGLAGNAAGGGPLLTNYALTLLVLFFLQTCSPPVLP----- 412
Query: 207 YPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGL 264
V++L+ + R + C+F ++ N S L F F
Sbjct: 413 ----TVEELRELAGPGCRVVESGWDCSFPCD--AAALQHSTNGRSAGSLLPEFFHLF--- 463
Query: 265 SLKASELGICPFTGQWEHIRSNTRWLP--------NNHPLFIEDPFEQPENSARAVSEK 315
G PFT + +R + R LP PL + DPFE N V+ K
Sbjct: 464 -------GSFPFTTHFPSLR-HGRPLPLSAADPLLKRTPLTLPDPFELSHNVTSNVTSK 514
>gi|298714639|emb|CBJ33963.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 651
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 99/211 (46%), Gaps = 8/211 (3%)
Query: 11 LKDILGMLNPLREDWETRMKVISDL-REVVESVESL--RGATVEPFGSFVSNLFSRWGDL 67
+ +L L P + E R KV + L R +++ + + +G+T+ FGS + + DL
Sbjct: 289 MSALLPTLLPSPDFGEKREKVRASLERTLMKQLPKMIPKGSTLRVFGSSSNGFGNDGADL 348
Query: 68 DISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNI 127
D+ IE + G + +S+ L + G ++ AR+PI+ F +
Sbjct: 349 DMCIEYARGVQHPDDAGALIESIAEKL-----KAAGMTKVDSRPTARIPIVIFNDGASGL 403
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
CDIS+ N ++ + S D R +++ ++K WAK +NN GT +SY L
Sbjct: 404 DCDISVMNPLAVRNTRLMKAYSVADPRVKELAYVLKRWAKRRWVNNASEGTLSSYGYLLC 463
Query: 188 VLFHFQTCVPAILPPLKDIYPGNLVDDLKGV 218
+L QT P ++P L+ + P + L GV
Sbjct: 464 LLHFLQTRNPPVVPNLQALPPDWAGEPLHGV 494
>gi|449282623|gb|EMC89445.1| Poly(A) RNA polymerase, mitochondrial, partial [Columba livia]
Length = 518
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 147/330 (44%), Gaps = 49/330 (14%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGSCISSAGKK-----------------VKQS 89
+TV+PFGS V N F + G D+D+ ++ + ++ KK Q
Sbjct: 169 STVKPFGSSV-NTFGKLGCDVDMFLDFYDTQKHATKMKKGPFEMEYQMKRLPSERLATQK 227
Query: 90 LLGDLLRALRQKG-GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+L + L G G +Q + +AR P++KF CD+S+ N S+ L+
Sbjct: 228 ILSVIGDCLDNFGPGCIGIQKILNARCPLVKFSHQATGFQCDLSVSNSIAIKSSELLYIY 287
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+D R R +V V+ WA+ H + N GT+ ++SL+++V+F Q P I+P L
Sbjct: 288 GCLDPRVRALVFSVRCWARVHGLTNSVPGTWITNFSLTMMVMFFLQRRSPPIIPTL---- 343
Query: 208 PGNLVDDLKGVRANAERQI--AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
D LK + ++ + C+F + S K K N +L L F E F
Sbjct: 344 -----DQLKELADEKDKLVIGGYDCSF-VTDLSKIKPTK-NTETLDVLLGDFFEFFGNFD 396
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
+ + L + + + + P + PL+I +PFEQ N ++ V++ L K
Sbjct: 397 FRRNSLNL----RKGKEVNK-----PESSPLYIWNPFEQDLNISKNVNQPQLEKFVAVAR 447
Query: 326 MTHFRLTSTNQTRYAL------LSSLARPF 349
+ + L + ++T+ + L++L PF
Sbjct: 448 ESAWILQNEDKTQQTIKKEPWGLAALLIPF 477
>gi|158299396|ref|XP_319519.4| AGAP003293-PA [Anopheles gambiae str. PEST]
gi|157013847|gb|EAA14654.4| AGAP003293-PA [Anopheles gambiae str. PEST]
Length = 568
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 133/295 (45%), Gaps = 47/295 (15%)
Query: 48 ATVEPFGSFVSNLFSRWG-DLDISIELSNGS----------------CISSAGKKVKQSL 90
A PFGS V N + R G DLD+ ++L + S + +V++ L
Sbjct: 171 AVAHPFGSSV-NGYGRMGCDLDVIMDLDSRSGEPPDRTSRLVYHTKATNPNERTQVQRQL 229
Query: 91 --LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI 148
+GD+L+ G ++ + ARVPI+K+ H ++ D++++N G S+ L+
Sbjct: 230 ESIGDVLQLFLP--GVNSVRRILKARVPIVKYHHEHLDLEIDLTMNNTAGVYMSELLYLF 287
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY 207
Q+D R R + V+ WA++ + N G + ++SL++LV++ Q +LP + +
Sbjct: 288 GQLDARVRPLTFCVRRWAQSVGLTNQTPGYWITNFSLTMLVMYFLQQLARPVLPSINRLI 347
Query: 208 -------PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEK 260
P + + E + A N + + S +R N ++L L V F E
Sbjct: 348 QLSASCPPQSSAPVTRF--GEGETEWAYTFLKNPSIYGS--FRSENEATLEQLLVQFFEF 403
Query: 261 FS--GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
+S S +A L + +T P++ P++I +P E N ++ V+
Sbjct: 404 YSQFDFSQRAISLNL-----------GSTILKPDHSPMYIVNPLETVLNVSKNVN 447
>gi|241722590|ref|XP_002413684.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507500|gb|EEC16992.1| conserved hypothetical protein [Ixodes scapularis]
Length = 345
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 130/316 (41%), Gaps = 79/316 (25%)
Query: 24 DWETRMKVISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
D E R+ + R++ E + L RG +V PFGS V N F R + DI + + +
Sbjct: 12 DLELRLGFLV-CRQIEEFISGLYPRG-SVLPFGSLV-NGFGRH-NCDIDMVYCVPENVDA 67
Query: 82 AGKKVKQS----------------LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
G+ Q LGDLL + G + + ARVPI+KF+
Sbjct: 68 KGQLYFQDKHQMINDRTLVQRILETLGDLLHYVVP--GVSEVHRILRARVPIVKFQHDIV 125
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSL 184
CD++++N+ G S+ L + SQ+ +V ++ WA A + N GT+ ++ L
Sbjct: 126 GRECDLTLNNMSGVDMSRVLHFCSQLAPSLGPLVFTLRGWASAQGLTNKVPGTWITNFQL 185
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+LLV+FH Q +LPPLK +
Sbjct: 186 TLLVIFHLQR--RGLLPPLKALE------------------------------------- 206
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
L VSF E +S + K+ I P+TGQ P+ + I++P ++
Sbjct: 207 ------EELLVSFFEYYSSVDFKSKS--ISPYTGQLLEK-------PDYSAIHIQNPLDR 251
Query: 305 PENSARAVSEKNLAKI 320
N++R V +L K+
Sbjct: 252 QLNASRNVGAPDLRKL 267
>gi|407404929|gb|EKF30185.1| hypothetical protein MOQ_006008 [Trypanosoma cruzi marinkellei]
Length = 406
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 98/197 (49%), Gaps = 11/197 (5%)
Query: 24 DWETRMKVISDLREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISS 81
D ET ++ DL+E V + S+ A VE FGS VS D+D+S+ N S
Sbjct: 24 DHET---ILKDLQERVLDIGMRSVNKAHVELFGSHVSGFCKPTSDIDLSLTYRNFSPWLQ 80
Query: 82 AGKKVKQSLLGDLLRALRQ--KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
++V + ++R ++ + G ++++ AR+P+++F + CD+SI N+ G
Sbjct: 81 GMERVDEQNNKRMMRFSKEATEMGMEDVRYI-RARIPVVQFIDSLTGLHCDLSIGNVGGV 139
Query: 140 IKSKFLFWISQIDGRF-RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CV 196
SK L I ++ F R + LVKEW KA ++ P+ TFNS++++ + L Q +
Sbjct: 140 ENSKILAAIREVFPDFYRAYIHLVKEWGKAREVIAPERSTFNSFTVTTMALMVLQELGLL 199
Query: 197 PAILPPLKDIYPGNLVD 213
P P D L D
Sbjct: 200 PVFSKPTGDFGELTLPD 216
>gi|301622102|ref|XP_002940378.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Xenopus
(Silurana) tropicalis]
Length = 581
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 100/222 (45%), Gaps = 27/222 (12%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q + +AR P+++F + CD++ DN S+ L+ D R R +V +
Sbjct: 292 GCTGVQKILNARCPLVRFSHQPAGLQCDLTSDNRIALRSSELLYIYGCFDHRLRALVFTL 351
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+ WA+ H I + G + ++SL++++LF Q P ++P L D LKG+
Sbjct: 352 RCWARVHGITSAIPGAWITNFSLTMMILFFLQKRSPPVIPTL---------DHLKGLAGK 402
Query: 222 AERQIAE--ICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFT 277
++ I + C+F N+ R + N +L L F E + + + I
Sbjct: 403 EDKHIIDGHDCSFVSNLNRIKPSQ----NSEALDVLLGEFFEFYGNFDFSKNCIDI---- 454
Query: 278 GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + P PL+I +PFEQ N ++ V++ L +
Sbjct: 455 -----RKGKEQNKPEVCPLYIRNPFEQTLNVSKNVNQSQLDR 491
>gi|405976062|gb|EKC40583.1| Poly(A) RNA polymerase, mitochondrial [Crassostrea gigas]
Length = 940
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 74/316 (23%), Positives = 128/316 (40%), Gaps = 54/316 (17%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG--- 103
G TV FGS V+ + D+D+ I+L+ I + + DL ++ G
Sbjct: 259 GMTVNQFGSSVNGFGIKGCDMDVYIDLTKLG-IPCRTSNIVLPYIKDLYTLKKKNSGPLS 317
Query: 104 ---------YRRLQFV-----AHA------------RVPILKFETIHQNISCDISIDNLC 137
+L+ + HA R PIL+F + I CD+SI+N
Sbjct: 318 QQEVDNMRPMDKLKLIQRIITEHAPSCMDTRIIPSQRCPILRFTDYNSQIKCDLSINNKL 377
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI-NNPKTGT-FNSYSLSLLVLFHFQTC 195
++ L S D R + +V ++ WAK I NP+ +SY+L++LV+++
Sbjct: 378 ALQNTRLLQTFSLFDARIKPLVYSIRYWAKLKGIAGNPQACNRLSSYALTMLVIYYLMNT 437
Query: 196 VPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKINRSSLAHL 253
P ILPP++++ +R I + C+F A+F N ++ L
Sbjct: 438 TPPILPPVEEL----------SRMCGRDRTIVDQWDCSFVSAQFMPP---TPNIQTIEEL 484
Query: 254 FVSFLEKFSGLSLKASELGI---CPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSAR 310
F + FS A+ + + P T + + + ++DPF N +
Sbjct: 485 LYGFFQYFSHFDFLANPMSVRTGKPITLDLSVLEKTLKV----GVIILQDPFVLNHNITQ 540
Query: 311 AVSEKNLAKISNAFEM 326
V+EK L KI ++
Sbjct: 541 NVNEKMLTKIVKEMQL 556
>gi|367014043|ref|XP_003681521.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
gi|359749182|emb|CCE92310.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
Length = 663
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 99/192 (51%), Gaps = 14/192 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P R++ E R K IS +R V E A ++ FGS+ ++L+ D+D
Sbjct: 211 IKDFVAYISPNRQEIEIRNKTISKIRAAVR--ELWPDADLQVFGSYATDLYLPGSDIDC- 267
Query: 71 IELSNGSCISSAGK-KVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
++S G+ K ++ L L L+ K R++ +A ARVPI+KF I
Sbjct: 268 -------VVNSKGRDKENRNSLYSLASFLKSKELATRVEVIAKARVPIIKFVEPQSQIHI 320
Query: 130 DISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV 188
D+S + + G ++ + W+ + G R++VL+VK++ + +NN TG +S+ LV
Sbjct: 321 DVSFERINGLEAARLIREWLEETPG-LRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLV 379
Query: 189 LFHFQTCVPAIL 200
+ F P ++
Sbjct: 380 -YSFLHLHPRVV 390
>gi|50286703|ref|XP_445781.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525087|emb|CAG58700.1| unnamed protein product [Candida glabrata]
Length = 485
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 95/189 (50%), Gaps = 12/189 (6%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D + ++P +E+ ETR + I +R V+ E A + FGS+ ++L+ D+D +
Sbjct: 109 DFVAYISPSKEEIETRNRTIGSIRSAVK--ELWPDADLHVFGSYATDLYLPGSDIDCVVN 166
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
S G K ++ L L L++K ++ VA ARVPI+KF + DIS
Sbjct: 167 -------SKQGDKQSRNNLYKLANFLKKKEIATEIEVVAKARVPIIKFVEVESRTHMDIS 219
Query: 133 IDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
+ L G +K + W++ G R++VL+VK++ + +NN +G +S+ LV +
Sbjct: 220 FERLNGLEAAKLIRDWLASTPG-LRELVLVVKQFLHSRRLNNVHSGGLGGFSIICLV-YS 277
Query: 192 FQTCVPAIL 200
F P I+
Sbjct: 278 FLRMHPRII 286
>gi|395735940|ref|XP_003776668.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 2 [Pongo abelii]
Length = 482
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 125/276 (45%), Gaps = 44/276 (15%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRA--LRQKGGYRRLQFVA 111
GS ++ +R D D+ + + C +K + + L+ + Y +
Sbjct: 201 GSSLNGFGTRSSDGDLCLVVKEEPCFFQVNQKTEARHILTLVHKHFCTRLSSYCKQPSRH 260
Query: 112 HARVPILKFETIHQNISC---DISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAK 167
AR+ + K + SC D++++N+ G I++ FL + ++ R R +VL +K+WA
Sbjct: 261 RARLHLFK-----EKKSCVEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLAIKKWAS 314
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
H IN+ GT +SYSL L+VL + QT ILP L+ IYP + + + +
Sbjct: 315 HHQINDASRGTLSSYSLVLMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLV 367
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQ 279
N+ + S N S+L L + FL+ ++ +S++ ++ P +
Sbjct: 368 HQAPCNVPPYLSK-----NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIE 422
Query: 280 WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
W N + +E+PF+ N+ARAV EK
Sbjct: 423 WR-----------NKYICVEEPFDG-TNTARAVHEK 446
>gi|82541397|ref|XP_724941.1| caffeine-induced death protein 1 [Plasmodium yoelii yoelii 17XNL]
gi|23479769|gb|EAA16506.1| caffeine-induced death protein 1 [Plasmodium yoelii yoelii]
Length = 534
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 132/286 (46%), Gaps = 30/286 (10%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-KGG 103
+ V PFGS ++ + + D+DI I++ +K + + L + L G
Sbjct: 224 FKNCHVTPFGSVINGFWMKNSDIDICIQIP-----ILLNRKDQINFLKKICLILNNYHNG 278
Query: 104 YRRLQFVAHARVPILKFETI-HQN---ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMV 159
+F A+VPI+ F H+N +SCDIS++N+ I SK + ID R + M
Sbjct: 279 IIEQRF--SAKVPIIHFYCDDHKNSFQLSCDISVNNILAVINSKLIQKYVSIDKRLQLMG 336
Query: 160 LLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDI---------YPG 209
+ +K W+K +IN+ G +S+SL L+ + Q + P IL L+DI Y
Sbjct: 337 IALKYWSKKRNINDRSKGFLSSFSLILMAIHFLQYVMEPKILISLQDISIRRNEKSFYVM 396
Query: 210 NLVDDLKGVRANA--ERQIAEICAFNIARFSSDK-YRKINRSSLAHLFVSFLEKFSGLSL 266
+ D K + +A ++ ++ N S DK Y + ++ L + F KF G
Sbjct: 397 GV--DCKYCQDDAIIRDELKKMNIQNGVVSSDDKNYDHASHVDISTLMLEFF-KFYGYKY 453
Query: 267 KASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV 312
K+ + I +E+ S + ++ LF+++PFE +N A +
Sbjct: 454 KSGIIAIRDINNYYENFTSLKSY--ESYYLFVDNPFEIGKNVANIL 497
>gi|403213331|emb|CCK67833.1| hypothetical protein KNAG_0A01440 [Kazachstania naganishii CBS
8797]
Length = 526
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 93/191 (48%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P R + + R I +R V E A + FGS+ ++L+ D+D
Sbjct: 144 VKDFISYISPNRVEIKQRNTTIGKIRAAVS--ELWPDADLHVFGSYATDLYLPGSDIDCV 201
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K QS L L L++ G ++ +A ARVPI+KF I D
Sbjct: 202 VN-------SKGGDKENQSSLYKLATHLKKNGLATEIEIIAKARVPIIKFVEPESRIHID 254
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + + G +K + W+ G R++VL++K++ + +NN TG +S+ LV
Sbjct: 255 VSFERINGLEAAKLIREWLESTPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFSIICLV- 312
Query: 190 FHFQTCVPAIL 200
+ F + P ++
Sbjct: 313 YSFLSMHPRVI 323
>gi|341876510|gb|EGT32445.1| hypothetical protein CAEBREN_23525 [Caenorhabditis brenneri]
Length = 845
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 94/192 (48%), Gaps = 27/192 (14%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIE----LSNGSCISS--AGKKVKQSLLGDLLRAL 98
+ G+TV GSF S D+D+ + + +G +K Q +L + RA+
Sbjct: 543 ITGSTVNGCGSFNS-------DMDMCLCYPTFVYHGKTFDDFYCDRKESQKILRKVDRAV 595
Query: 99 RQ-KGGYRRLQFVAH-----ARVPILKFETI--HQNISCDISIDNLCGQIKSKFLFWISQ 150
R+ K G + A+VPI+K E ++ I DI+++N+ G S + S
Sbjct: 596 RRCKIGANIRSIIGKCSVIPAKVPIVKCELTRAYRFIDVDINVNNIAGIYNSHLTHYYSL 655
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPG 209
ID RF + L+VK WA + N G NSYSL L+V+ + Q V PA+LP L+ ++P
Sbjct: 656 IDARFPALALVVKHWACVCGVGNAPDGYLNSYSLILMVIHYLQCGVTPAVLPNLQYLFPD 715
Query: 210 NL-----VDDLK 216
+DDL+
Sbjct: 716 VFDRKIPIDDLR 727
>gi|395859965|ref|XP_003802293.1| PREDICTED: terminal uridylyltransferase 7 isoform 3 [Otolemur
garnettii]
Length = 1260
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 109/244 (44%), Gaps = 29/244 (11%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 845 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 904
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN 210
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 905 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGE 964
Query: 211 LVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG----- 263
++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 965 KKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEFDFK 1017
Query: 264 ---LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+S++ L + F QW + + IEDPF+ N +S K I
Sbjct: 1018 EHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFI 1065
Query: 321 SNAF 324
AF
Sbjct: 1066 MKAF 1069
>gi|448097882|ref|XP_004198786.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359380208|emb|CCE82449.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 650
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 98/190 (51%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ TR +V+ DL+ + ++ A V FGS ++L+ D+D+
Sbjct: 193 MKDFVNYISPSKEEILTRNRVVKDLKREINNLWPDTEAHV--FGSSATDLYLPGSDIDMV 250
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S+ G +S L L LR + + ++ +A A+VPI+KF NI D
Sbjct: 251 V-------TSNTGDYENRSKLYQLSSYLRNRKLAKDIEVIAKAKVPIVKFVDPSSNIHID 303
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + + W+ + G R++VL+VK++ ++ +NN G YS ++++
Sbjct: 304 ISFERRNGIEAAKRIRRWLDRTPG-LRELVLIVKQFLRSRRLNNVHVGGLGGYS-TIILC 361
Query: 190 FHFQTCVPAI 199
+HF P I
Sbjct: 362 YHFLRLHPRI 371
>gi|291244425|ref|XP_002742100.1| PREDICTED: terminal uridylyl transferase 1, U6 snRNA-specific-like
[Saccoglossus kowalevskii]
Length = 747
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 144/330 (43%), Gaps = 46/330 (13%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
+D + R + L+E+ + E V PFGS V+ S+ DLD+ ++L +GS
Sbjct: 178 QDVKLRYLICDLLQEIFK--EFFPKCLVFPFGSSVNGFGSKGCDLDLHLDL-HGSNYKYI 234
Query: 83 GKKVKQSL-----------------LGDLLRALRQKG--GYRRLQFVAHARVPILKFETI 123
K+ + + DL+ + +K G + +Q + AR P++KF
Sbjct: 235 FCKIPKEFSDEKVSVFDVDNAEPDEIMDLIAKIIKKCAPGCKHVQAITTARCPVVKFIHS 294
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI--NNPKTG-TFN 180
+SCDIS++N ++ L + ID R + +V +++WAK ++ N G
Sbjct: 295 ESGLSCDISVNNSLAMQNTELLHLYASIDERVQSLVYSLRQWAKYKELAGNASNAGPRLT 354
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAF--NIAR 236
+Y+L+L+V+F+ Q ++P V++LK V ++E I + C F +I +
Sbjct: 355 NYTLTLMVMFYLQQEEFKLIPT---------VEELKAVTDDSEVTIIDNWDCTFTRHIDK 405
Query: 237 FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRS-NTRWLP-NNH 294
K K LA F S ICP +G+ I S N +
Sbjct: 406 LCYKKTSKTAEKLLAEFFS------FYSSFDFEHCMICPRSGKKIPIDSINKEEMKIKVT 459
Query: 295 PLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ I DPFE N+A VS K K++ F
Sbjct: 460 SICIRDPFELNHNTAMNVSNKLEHKLAEEF 489
>gi|357629676|gb|EHJ78295.1| hypothetical protein KGM_22716 [Danaus plexippus]
Length = 406
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 130/286 (45%), Gaps = 21/286 (7%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQK-GGYR 105
G V FGS V+ L + DLD +EL S +S K S + +Q+ ++
Sbjct: 56 GIKVHAFGSIVTGLGIKVSDLDCYVELP--SWLSPPEK----SFVFKAKNIFKQEPWKFQ 109
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+L +++A+VPILKF +CD+S N G SK + + +D R + +L+K W
Sbjct: 110 QLLAISYAKVPILKFYHTPTQCNCDLSFSNPTGIQNSKLISYFLNLDVRVLKLAVLIKYW 169
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN---LVDDLKGVRANA 222
+K HD+ T SY L+L+++F+ Q ++PP+ + + L+++
Sbjct: 170 SKIHDLTG--TNLMPSYCLTLMLIFYLQQI--GLVPPVITLQQNSAELLINNWNLAFNEL 225
Query: 223 ERQIA-EICAFNIAR--FSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ 279
E QI+ + F + F K ++ ++ +E+ + +K L F+
Sbjct: 226 EHQISTDQTLFQLLEGFFKFYHTFKFDKYVISLYLGCAIERELFVDVKTVPL---EFSFY 282
Query: 280 WEHIRSN-TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+I N + L + + ++DPFEQ N A V K + N F
Sbjct: 283 HRNISQNLCQQLRLDTAMCVQDPFEQSRNCAVRVHPKLFQHVMNKF 328
>gi|430811692|emb|CCJ30889.1| unnamed protein product, partial [Pneumocystis jirovecii]
Length = 665
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 134/316 (42%), Gaps = 30/316 (9%)
Query: 10 ILKDILGM---LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
I DIL + L P E+ K + L ++E + TV+ FGS V+ L + D
Sbjct: 131 ICGDILQLYETLLPSSENNNRWTKFLKKLTTILEKEWPDKKITVQAFGSTVNQLCTSESD 190
Query: 67 LDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
+D+ CI++ K + + L + G ++ V A+VPI+K +
Sbjct: 191 VDV--------CITTVEKGLADTCK---LAKVLANYGMEKVVCVPRAKVPIVKVWDPELS 239
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLS 185
++CD++I+N ++ + +ID R R + +++K WAK +N+ GT +SY+
Sbjct: 240 VACDMNINNTLALENTRMIKTYVEIDPRVRPLAMIIKYWAKKRILNDAAGGGTLSSYTWI 299
Query: 186 LLVLFHFQTCVPAILPPLKDI-YPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+++ Q P ILP L + + N + G+ + I + +F +
Sbjct: 300 CMIINFLQMRKPPILPSLHQLPHEQNENSIIGGIDVSFFDDIDALKSFG----------E 349
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQ 304
N SL L +F KF I G + + L N+ L +E+PF
Sbjct: 350 KNTESLGGLLFAFFRKF-AYEFDYDHCVISVRHGHYLSKLAKGWHLTQNNRLCVEEPFNT 408
Query: 305 PE---NSARAVSEKNL 317
N+A V+ K L
Sbjct: 409 KRNLGNTADDVTVKGL 424
>gi|195125549|ref|XP_002007240.1| GI12491 [Drosophila mojavensis]
gi|193918849|gb|EDW17716.1| GI12491 [Drosophila mojavensis]
Length = 665
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 143/309 (46%), Gaps = 23/309 (7%)
Query: 31 VISDLREVVESVESLRG-ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQS 89
S +R+ ++ + L+G V PFGS V+ L + D+D+ +E ++ +SS +Q
Sbjct: 100 CFSQVRDTLQ--KQLQGRVKVYPFGSLVTGLALKDSDIDLFLEQTD---VSSNAISHRQ- 153
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L + LR+ ++ + + HARVPI++ + ++ +S DI++ + S+F+ +
Sbjct: 154 LFNKIYNFLRRSECFQDVFAIRHARVPIIRCKHVYSGLSLDINMSSPNSTYNSRFVAELL 213
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
D R R++ L +K WAK I +G+ SY L L++F Q +LP +K +
Sbjct: 214 GRDVRMRELFLFLKLWAKKLKIIG--SGSMTSYCLITLIIFGLQQ--QRLLPSIKQLQAR 269
Query: 210 NLVDDLKGVR-ANAERQI----AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGL 264
V ++ GV A + +Q+ A + + ++ Y K+N L +L +
Sbjct: 270 CPVVEVMGVNYAYSFQQVRPIPAGVTSLDLISDFFALYHKMNFER--KLLSPYLGYALDI 327
Query: 265 SLKASELGICP-FTGQWEHIRSNTRWLP----NNHPLFIEDPFEQPENSARAVSEKNLAK 319
S G P + Q + + T P + + ++DPFE N +++S NL
Sbjct: 328 DTAFSVPGTFPEYEQQLQAMAKATGEQPEPFQSQRCVCVQDPFEMQHNVGQSISITNLCY 387
Query: 320 ISNAFEMTH 328
+ + H
Sbjct: 388 LRECLVLAH 396
>gi|296411237|ref|XP_002835340.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295629118|emb|CAZ79497.1| unnamed protein product [Tuber melanosporum]
Length = 1007
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 131/320 (40%), Gaps = 26/320 (8%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K + L P + R + + L ++ V FGS ++L D+D+
Sbjct: 129 IKSLFETLKPSEASGDRRRRFLEKLERLLNREWPGHDIQVHAFGSTENHLCMIDSDIDV- 187
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI ++ +K + L A K G R+ V A+VPI+K ++CD
Sbjct: 188 -------CIKTSWDGLKSTCY---LAARLAKCGMERVVCVPGAKVPIVKIWDPEYQVACD 237
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
+++++ +K + +ID R R + +++K W K +N+ GT +SY+ ++L
Sbjct: 238 MNVNSTLALDNTKMIKTYVEIDERVRPLAMIIKHWTKKRVLNDAAGGGTLSSYTWICMIL 297
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
QT P ILP L P + GV + + I + F N+ +
Sbjct: 298 NFLQTRDPPILPALHQ-RPHKKRPPINGVDISFDDDIETLKGFG----------HNNKET 346
Query: 250 LAHLFVSFLEKFSGLSLKASELGICPFTGQ-WEHIRSNTRWLPNNHPLFIEDPFEQPENS 308
L L +F +K+ G L + I G+ N ++L NN L +E+PF N
Sbjct: 347 LGELLFAFFKKY-GHELDYEKRVISVRHGKLLSKEEKNWQYLQNNR-LCVEEPFNFTRNL 404
Query: 309 ARAVSEKNLAKISNAFEMTH 328
+ ++ + F H
Sbjct: 405 GNTADDSSVRGLHLEFRRAH 424
>gi|383854864|ref|XP_003702940.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Megachile
rotundata]
Length = 539
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 124/298 (41%), Gaps = 61/298 (20%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIE--------------LSNGSCISSAGKKVKQSLLGDLL 95
V PFGS V+ R DLD+ L+N C S KVK +++
Sbjct: 220 VLPFGSSVNGFGQRGCDLDLVCSVSGTKNESAQKLHYLTNNICFDS---KVKHQQFLEMV 276
Query: 96 RALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDG 153
+ + + +ARVPILKF N+ CD+S N SK L+ S+ID
Sbjct: 277 YTILNTCVPTISNAKRILNARVPILKFSIPSSNMQCDLSGPNEVALYMSKLLYIFSEIDC 336
Query: 154 RFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLV 212
R + +V +++WAK H I G + ++SL+LL++F+ Q ILPP+ + G L
Sbjct: 337 RVKPLVCTIRKWAKNHRITREIPGQWITNFSLTLLIIFYLQRI--EILPPIAVLTSGRLQ 394
Query: 213 DDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELG 272
+ ++ + SL + F + +S +
Sbjct: 395 N-------------------------KSGWKVSSSLSLQSILRGFFQFYSNFDFSTQAIS 429
Query: 273 ICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFR 330
I WE T+ + P++I++PF + N A+ +++ L ++ + HFR
Sbjct: 430 I------WE---GKTKIKLDVSPIYIQNPFNESLNVAKNINDIQLERL-----IHHFR 473
>gi|164661083|ref|XP_001731664.1| hypothetical protein MGL_0932 [Malassezia globosa CBS 7966]
gi|159105565|gb|EDP44450.1| hypothetical protein MGL_0932 [Malassezia globosa CBS 7966]
Length = 657
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 120/285 (42%), Gaps = 38/285 (13%)
Query: 82 AGKKVKQSLLGDLLRAL----RQKGGYRRLQFVAHARVPILKFE-----TIHQNISCDIS 132
KV+Q +L+ L R++ ++ L + AR+PI+K I +I+CDI
Sbjct: 213 TATKVRQPSPSELVEQLSDLIRKQTDFQVLP-LPKARIPIIKVSRAASSDIPCDIACDIG 271
Query: 133 IDNLCGQIKSKFLFWISQIDG-RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
+N ++ L + +D R R +VL +K W K +N+P GT +SY +LLVLF
Sbjct: 272 FNNQLALENTRLLLSYAMLDPPRLRALVLFIKVWTKRRKLNSPYMGTLSSYGYTLLVLFF 331
Query: 192 F-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSL 250
+P +LP L+ I G + L+ + + I ++ + N SL
Sbjct: 332 LIHVKLPPVLPNLQRIPAGRDL-PLEDIMLDGH----NIYFYDDMDALRQHWHSDNTESL 386
Query: 251 AHLFVSFLEKFS--------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
L + F FS +S++ + + W H L IEDPF
Sbjct: 387 GELLLDFFRYFSRDFNYTKDAISMRTEGGLVTKESRGWTHDL-----------LCIEDPF 435
Query: 303 EQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYA-LLSSLA 346
+ N AR V++ L I F M RL + R LLS L
Sbjct: 436 QLGYNVARTVTKDGLYTIRGEF-MRASRLLANRSIRAPQLLSELC 479
>gi|47226027|emb|CAG04401.1| unnamed protein product [Tetraodon nigroviridis]
Length = 475
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 98/232 (42%), Gaps = 32/232 (13%)
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
++ V+ AR+P++KF N+ DI+ +N ++FL + +D R R +V ++ W
Sbjct: 214 KVHVVSSARLPVVKFHHRELNLQGDITTNNRLAVRNTRFLQLCAGLDERLRPLVYTIRHW 273
Query: 166 AKAHDINNPKTGT---FNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANA 222
AK + +GT N+Y+L+LLV+F Q C P +LP VD LK +
Sbjct: 274 AKQKQLAGNPSGTGPLLNNYALTLLVIFFLQNCDPPVLP---------TVDQLKDMACEE 324
Query: 223 ERQIAEI--CAFNIARFSSDKYRKINRSSLAHLFVSFLE-----KFSGLSLKASELGICP 275
E + E C F + + N L L F F+G L E P
Sbjct: 325 EECVIEGWNCTFPSQPIAVPPSK--NTQDLCTLLAGFFHFYAKFDFAGSVLSLREGRALP 382
Query: 276 FTG-----------QWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
T + E+ + P PL + DPFE N A ++E++
Sbjct: 383 ITDFLKQNKDEEAMEEENPSKVKQHGPKLGPLNLLDPFELSHNVAGNLNERS 434
>gi|315041471|ref|XP_003170112.1| Poly(A) RNA polymerase cid13 [Arthroderma gypseum CBS 118893]
gi|311345146|gb|EFR04349.1| Poly(A) RNA polymerase cid13 [Arthroderma gypseum CBS 118893]
Length = 1090
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 92/195 (47%), Gaps = 26/195 (13%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P +E E R++ + L ++++ T P N+F G
Sbjct: 122 IKELYQKLLPSQESEERRVRFVRKLEKLLD--------TQWPGNEIKVNVFGSSG----- 168
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
N C SS+ V L D L K G R+ V+HA+VPI+K ++CD
Sbjct: 169 ----NKLCTSSSDVCV----LADFL----AKSGMERVVCVSHAKVPIVKIWDPELQVACD 216
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + ++D R R + +LVK W K +N+ GT +SY+ L++
Sbjct: 217 MNVNNTLALENTRMIKTYVELDDRIRPLAMLVKHWTKRRILNDAALGGTLSSYTWICLII 276
Query: 190 FHFQTCVPAILPPLK 204
QT +P I+P L+
Sbjct: 277 NFLQTRIPPIVPSLQ 291
>gi|344230457|gb|EGV62342.1| hypothetical protein CANTEDRAFT_126141 [Candida tenuis ATCC 10573]
Length = 615
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ R V+ L++ ++ A FGSF ++L+ D+D+
Sbjct: 184 IKDFVSYISPSKEEIMARNSVVKTLKQQIKVC--WPDAEAHVFGSFATDLYLPGSDIDMV 241
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ NG C + + L L LR K + ++ +A A+VPI+KF NI D
Sbjct: 242 VVSKNGDCEN-------RHKLYQLSSFLRSKKLAKDIEVIAGAKVPIIKFVDPKTNIHLD 294
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + W+ G R++VL+VK++ ++ +NN G Y+ ++++
Sbjct: 295 ISFERTNGLDAARRIRKWLETTAG-LRELVLVVKQFLRSRKLNNVHVGGLGGYA-TIILC 352
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 353 YHFIKMHPRV 362
>gi|403300969|ref|XP_003941184.1| PREDICTED: terminal uridylyltransferase 7 isoform 3 [Saimiri
boliviensis boliviensis]
Length = 1257
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 109/244 (44%), Gaps = 29/244 (11%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 842 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 901
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN 210
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 902 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGE 961
Query: 211 LVDDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG----- 263
++ G QI E+ + +Y K N S+ L++ L ++
Sbjct: 962 KKPEIFVDGWNIYFFDQIDELPTY------WPEYGK-NTESVGQLWLGLLRFYTEEFDFR 1014
Query: 264 ---LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+S++ L + F QW + + IEDPF+ N +S K I
Sbjct: 1015 EHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFI 1062
Query: 321 SNAF 324
AF
Sbjct: 1063 MKAF 1066
>gi|399217978|emb|CCF74865.1| unnamed protein product [Babesia microti strain RI]
Length = 431
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 101/204 (49%), Gaps = 12/204 (5%)
Query: 14 ILGMLNPLREDWETRMKVISDLRE-VVESVESLRG--ATVEPFGSFVSNLFSRWGDLDIS 70
IL + + D+ +++L + E ++ L G A+ +GS + L++ D+D+S
Sbjct: 111 ILSVTQKITPDYSIFTNQLAELSSYLCERLKPLLGNDASYHFYGSTATRLYTYNSDIDLS 170
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET---IHQNI 127
I L + ++ + LL + L++ R + ARVP+L + N
Sbjct: 171 INLP-----CTKPRQAQLKLLRRIGEYLKKIYPQRITEERFTARVPLLHWSNGANSGNNC 225
Query: 128 SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLL 187
+ DI I+N G S + ID R +++ +K+WAK+ DINN G+ +S++L L+
Sbjct: 226 AVDICINNHLGIANSALVSKYVGIDDRVASLIIAIKKWAKSRDINNKSRGSLSSFALVLM 285
Query: 188 VLFHFQTCV-PAILPPLKDIYPGN 210
V+ + Q V P ILP L+DI N
Sbjct: 286 VIHYLQKVVTPPILPFLQDIAISN 309
>gi|195480694|ref|XP_002101355.1| GE15676 [Drosophila yakuba]
gi|194188879|gb|EDX02463.1| GE15676 [Drosophila yakuba]
Length = 1334
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 56/82 (68%), Gaps = 2/82 (2%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWI-SQIDGRFRDMVLLVKEWAKAHDI 171
ARVPIL+F+ I I D++ +N G IK+ +L + +Q+D R R +V++VK WA+ HDI
Sbjct: 1112 ARVPILRFKDITNGIEVDLNFNNCVG-IKNTYLLQLYAQMDWRTRPLVVIVKLWAQYHDI 1170
Query: 172 NNPKTGTFNSYSLSLLVLFHFQ 193
N+ K T +SYSL L+VL + Q
Sbjct: 1171 NDAKRMTISSYSLVLMVLHYLQ 1192
>gi|71834520|ref|NP_001025359.1| speckle targeted PIP5K1A-regulated poly(A) polymerase [Danio rerio]
gi|123908106|sp|Q4KMD7.1|STPAP_DANRE RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|68534706|gb|AAH98614.1| Zgc:112254 [Danio rerio]
Length = 797
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 129/320 (40%), Gaps = 67/320 (20%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ-------------SLLGD--- 93
+ PFGS V+ DLD+ ++L N + K +Q S+L D
Sbjct: 200 IVPFGSSVNTFGLHSCDLDLFLDLENTKVFQARAKSSEQTGENQSEDCRSEDSILSDIDL 259
Query: 94 ----------LLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIK 141
L+ + +K G ++Q ++ AR+P++KF N+ DI+I+N
Sbjct: 260 STASPAEILELVAVILRKCVPGVHKVQALSTARLPVVKFSHKELNLQGDITINNRLAVRN 319
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG---TFNSYSLSLLVLFHFQTCVPA 198
+KFL S ID R R +V ++ WAK + +G N+Y+L+LLV+F Q P
Sbjct: 320 TKFLQLCSGIDSRLRPLVYTIRLWAKQKQLAGNLSGPGPLLNNYALTLLVIFFLQNRDPP 379
Query: 199 ILPPLKDIYPGNLVDDLKGVRANAERQIAEI--CAFNIARFSSDKYRKINRSSLAHLFVS 256
+LP V+ LK + E E C F FS + N L L
Sbjct: 380 VLPS---------VNQLKNMACEEEECAIEEWDCTFPSQPFSVPPSK--NTEDLCTLLFG 428
Query: 257 FLEKFSGLSLKASELG-----ICPFTGQWEHIRSNTRWL---------------PNNHPL 296
F +S AS + + P T + ++S+ L P P+
Sbjct: 429 FFTFYSKFDFPASVVSLRDGHVLPIT---DFLKSDMEALKTADASSPKPKRSSAPRLGPM 485
Query: 297 FIEDPFEQPENSARAVSEKN 316
+ DPFE N A ++E+
Sbjct: 486 NVLDPFELNHNVAGNLNERT 505
>gi|448101749|ref|XP_004199636.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359381058|emb|CCE81517.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 646
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 95/190 (50%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ TR +V+ DL+ + S+ FGS ++L+ D+D+
Sbjct: 192 MKDFVNYISPSKEEILTRNRVVKDLKREINSL--WPDTETHVFGSSATDLYLPGSDIDMV 249
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G +S L L LR + + ++ +A A+VPI+KF NI D
Sbjct: 250 V-------TSKTGDYENRSKLYQLSSYLRNRKLAKDIEVIAKAKVPIIKFVDPSSNIHID 302
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + + W+ + G R++VL++K++ ++ +NN G YS ++++
Sbjct: 303 ISFERRNGIEAAKRIRKWLDKTPG-LRELVLIIKQFLRSRRLNNVHVGGLGGYS-TIILC 360
Query: 190 FHFQTCVPAI 199
+HF P I
Sbjct: 361 YHFLRLHPRI 370
>gi|384248025|gb|EIE21510.1| Nucleotidyltransferase, partial [Coccomyxa subellipsoidea C-169]
Length = 291
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 86/171 (50%), Gaps = 10/171 (5%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P + +R ++ + + V+S+ A+V+ FGSFV+ L+ D+DI + S
Sbjct: 9 LAPSSSELASRQAALARVTDAVQSIWP--SASVQVFGSFVTGLYLPSSDMDIVVMDSQCG 66
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
I SA K V SL+ +K + +Q +A A+VPI+KFE I I DIS D
Sbjct: 67 DIRSALKAVANSLV--------RKNMAKNIQIIAKAKVPIIKFEDIESGIKFDISFDAAN 118
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV 188
G + F+ + Q R +VL++K + ++N G SY+L ++V
Sbjct: 119 GPEAADFVKGLMQRLPPMRPLVLILKVFLHQRELNEVYQGGIGSYALLVMV 169
>gi|383853738|ref|XP_003702379.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Megachile rotundata]
Length = 704
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/293 (23%), Positives = 119/293 (40%), Gaps = 58/293 (19%)
Query: 53 FGSFVSNLFSRWGDLDISIELS-----------NGSCISSAGKKVKQSLLGDLLRALRQK 101
FGS + L + DLDI +++ N + K+VK+ + G
Sbjct: 197 FGSTQTRLGFKECDLDIYMDIGEPIYETESAPPNSWTMQKIFKEVKKIMYG-------MN 249
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
+ + + A+ PI+KF I N+SCDIS N G KS + + +D R + +++L
Sbjct: 250 CTFSDIIAIPKAKTPIIKFCYIRTNVSCDISFKNSLGIYKSHLIKYYISLDDRLKPLMML 309
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+K W K I +G ++Y+L LL++F+ Q I+P L ++ + G + N
Sbjct: 310 IKYWGKHFKI--AGSGKISNYALVLLIIFYLQQPTVNIVPSLMELQKTCQPQIINGWQVN 367
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
+ + + N++S+ L F +S ++ K++ ICP G
Sbjct: 368 FNENTV------LPKVT-------NKNSITQLLQGFFLFYSSINFKSN--VICPIDGMI- 411
Query: 282 HIRSNTRWLPN----------------------NHPLFIEDPFEQPENSARAV 312
H S + + N N P+ I+DP E N +
Sbjct: 412 HTESEFKDVENLPSCMNGYKAYVNENENLKFNANKPMCIQDPIELSHNVTMGI 464
>gi|196002225|ref|XP_002110980.1| hypothetical protein TRIADDRAFT_54458 [Trichoplax adhaerens]
gi|190586931|gb|EDV26984.1| hypothetical protein TRIADDRAFT_54458 [Trichoplax adhaerens]
Length = 820
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 99/196 (50%), Gaps = 21/196 (10%)
Query: 30 KVISDLRE--VVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELS------NGSCISS 81
+++ DL + VE ES++ + PFGS V+ GDLDI++ + G C +S
Sbjct: 197 RLVCDLLQQCFVELDESVK---IVPFGSAVNGFGQASGDLDIAMIMDENAITDKGFCETS 253
Query: 82 AGKKVKQ---------SLLGDLLRALRQK-GGYRRLQFVAHARVPILKFETIHQNISCDI 131
++ S + + +++ G + +++AR P++KF+ ++ CD+
Sbjct: 254 TDINIEDEKVTIRRPWSTFSVVAKFIKECIPGCLDVIALSNAREPVIKFKYNECSLCCDL 313
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
+I+N G S+ L S++D R + +V ++ WA I G F SYSL L++++
Sbjct: 314 TINNRLGIANSQLLQEYSKLDPRVKPLVFTIRTWAYCRGITLNSGGQFTSYSLILMIIYF 373
Query: 192 FQTCVPAILPPLKDIY 207
Q P++LP L+ ++
Sbjct: 374 LQCTKPSVLPSLQTLF 389
>gi|390340688|ref|XP_792619.2| PREDICTED: poly(A) RNA polymerase, mitochondrial-like
[Strongylocentrotus purpuratus]
Length = 646
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/312 (24%), Positives = 130/312 (41%), Gaps = 48/312 (15%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCI-------- 79
R S + E +S+ L AT+ PFGS ++ R D+D ++ +
Sbjct: 234 RFLACSLMEEAFQSI--LPDATLHPFGSSINGFGRRSCDVDTYLDRGTAHGVIPLKQGRN 291
Query: 80 ----------SSAGKKVKQSLLGDLLRAL-RQKGGYRRLQFVAHARVPILKFETIHQNIS 128
+++ + QS L L L R + + +AR P++KF +S
Sbjct: 292 KYKLGYDRQSANSERVATQSTLFTLAEFLERHVPQCSSVNRILNARCPLVKFRHQATGLS 351
Query: 129 CDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLL 187
CD++ DN S+ L+ ++D R R +V +V+ WA+ + I N G + +Y L+LL
Sbjct: 352 CDLTGDNRIAIKSSEMLYIFGRLDPRVRPLVFMVRHWARLNGITNNNPGYWITNYPLTLL 411
Query: 188 VLFHFQTCVPAILPPLKDIY---PGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
V+F QT +LP L I P + + ++ + + + ++R
Sbjct: 412 VIFFLQTRPEPVLPALNKIAMFEPSS--EGMEEEEKDVDLVFTDEACIKVSR-------- 461
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN---HPLFIEDP 301
N+ + L F K L + H +T +PNN P++IE+P
Sbjct: 462 -NKETPTELLQEFFHFCITFDFKKHALSV-------HH--GSTYPVPNNGKIFPMYIENP 511
Query: 302 FEQPENSARAVS 313
E NS++ VS
Sbjct: 512 LEPDLNSSKNVS 523
>gi|297684705|ref|XP_002819965.1| PREDICTED: terminal uridylyltransferase 7 isoform 3 [Pongo abelii]
Length = 1258
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 108/246 (43%), Gaps = 33/246 (13%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 843 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 902
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN 210
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 903 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGE 962
Query: 211 LVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG--- 263
++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 963 KKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTEEFD 1013
Query: 264 -----LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLA 318
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1014 FKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTN 1061
Query: 319 KISNAF 324
I AF
Sbjct: 1062 FIMKAF 1067
>gi|392595411|gb|EIW84734.1| hypothetical protein CONPUDRAFT_47123 [Coniophora puteana
RWD-64-598 SS2]
Length = 663
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 97/199 (48%), Gaps = 15/199 (7%)
Query: 2 GSYNVLEPILKDI---LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVS 58
G +NV E +++ + ++P + E R V+ + + V S + A V PFGS+ +
Sbjct: 143 GCHNVAEMFHREVEAFVDYMSPTSIEDEIRGLVVKLVGKAVTS--AFPDAKVLPFGSYGT 200
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPIL 118
L+ GD+D+ IE + + K S+L L L++ G ++ +A A+VPI+
Sbjct: 201 KLYLPSGDIDLVIESDSMQYVP------KNSVLHSLANVLKRAGIADKVTIIAKAKVPIV 254
Query: 119 KFETIHQNISCDISIDN----LCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNP 174
KF T H ++ DISI+ + GQI + FL + R +V++ K + +N
Sbjct: 255 KFITRHGRLNVDISINQSNGLVAGQIVNGFLADMRGCGRALRALVMVAKAFLGQRGMNEV 314
Query: 175 KTGTFNSYSLSLLVLFHFQ 193
TG SYS+ + + Q
Sbjct: 315 YTGGLGSYSIVCMAISFLQ 333
>gi|297271198|ref|XP_002800212.1| PREDICTED: terminal uridylyltransferase 7 [Macaca mulatta]
Length = 1254
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 108/246 (43%), Gaps = 33/246 (13%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 839 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 898
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGN 210
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 899 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKGE 958
Query: 211 LVDDL--KGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG--- 263
++ G QI E+ A+ + N S+ L++ L ++
Sbjct: 959 KKPEIFVDGWNIYFFDQIDELPAYWPECGK---------NTESVGQLWLGLLRFYTEEFD 1009
Query: 264 -----LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLA 318
+S++ L + F QW + + IEDPF+ N +S K
Sbjct: 1010 FKEHVISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTN 1057
Query: 319 KISNAF 324
I AF
Sbjct: 1058 FIMKAF 1063
>gi|270005633|gb|EFA02081.1| hypothetical protein TcasGA2_TC007716 [Tribolium castaneum]
Length = 1373
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/316 (22%), Positives = 140/316 (44%), Gaps = 47/316 (14%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS ++ L + DLD+ I+ N ++ V +++ + ++ + + + ++
Sbjct: 179 FGSSITGLDVQGSDLDVYID--NVRPVTKPEVAVLKTIRFLIFKSRK----FCDVLLISG 232
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+ PI+K I CDI++ N S+ + + +D + + +++ VK WA + +
Sbjct: 233 AKTPIIKCIHTKTQICCDINVKNRLSVRNSELIKYYLTLDAKIKPLMIFVKFWADLYGLK 292
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI--C 230
K F+SY+L ++V+++ Q PP V + ++ NA +I +I C
Sbjct: 293 --KVNFFSSYALYMMVIYYLQQ------PPYS-------VPTVLTLQRNAPPEIVDIWNC 337
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ----------- 279
F+ F+S + ++++ L V F +F G S + I PF G
Sbjct: 338 GFDEIDFTSP---ALEKTTILDLLVGFF-RFYGHFDYVSNV-IAPFYGAIIDKASFLKPH 392
Query: 280 -----WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTST 334
+ S + L N + I+DPFE N + +V + + + +N M +LT
Sbjct: 393 DLPRCYHTYMSQSVALAVNSGVCIQDPFEHSRNVSASVPLQCIGRFTNMCRMAE-KLTQN 451
Query: 335 NQTR--YALLSSLARP 348
+T Y LL++ P
Sbjct: 452 GETELLYKLLTTRTEP 467
>gi|189236075|ref|XP_972162.2| PREDICTED: similar to Dual specificity
tyrosine-phosphorylation-regulated kinase [Tribolium
castaneum]
Length = 2981
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/316 (22%), Positives = 140/316 (44%), Gaps = 47/316 (14%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS ++ L + DLD+ I+ N ++ V +++ + ++ + + + ++
Sbjct: 179 FGSSITGLDVQGSDLDVYID--NVRPVTKPEVAVLKTIRFLIFKSRK----FCDVLLISG 232
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+ PI+K I CDI++ N S+ + + +D + + +++ VK WA + +
Sbjct: 233 AKTPIIKCIHTKTQICCDINVKNRLSVRNSELIKYYLTLDAKIKPLMIFVKFWADLYGLK 292
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI--C 230
K F+SY+L ++V+++ Q PP V + ++ NA +I +I C
Sbjct: 293 --KVNFFSSYALYMMVIYYLQQ------PPYS-------VPTVLTLQRNAPPEIVDIWNC 337
Query: 231 AFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ----------- 279
F+ F+S + ++++ L V F +F G S + I PF G
Sbjct: 338 GFDEIDFTSP---ALEKTTILDLLVGFF-RFYGHFDYVSNV-IAPFYGAIIDKASFLKPH 392
Query: 280 -----WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTST 334
+ S + L N + I+DPFE N + +V + + + +N M +LT
Sbjct: 393 DLPRCYHTYMSQSVALAVNSGVCIQDPFEHSRNVSASVPLQCIGRFTNMCRMAE-KLTQN 451
Query: 335 NQTR--YALLSSLARP 348
+T Y LL++ P
Sbjct: 452 GETELLYKLLTTRTEP 467
>gi|297675543|ref|XP_002815733.1| PREDICTED: poly(A) RNA polymerase GLD2 isoform 1 [Pongo abelii]
Length = 478
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 97/198 (48%), Gaps = 34/198 (17%)
Query: 127 ISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
+ D++++N+ G I++ FL + ++ R R +VL +K+WA H IN+ GT +SYSL
Sbjct: 270 VEFDLNVNNIVG-IRNTFLLRTYAYLENRVRPLVLAIKKWASHHQINDASRGTLSSYSLV 328
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI 245
L+VL + QT ILP L+ IYP + + + + N+ + S
Sbjct: 329 LMVLHYLQTLPEPILPSLQKIYPESFSPAI-------QLHLVHQAPCNVPPYLSK----- 376
Query: 246 NRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF 297
N S+L L + FL+ ++ +S++ ++ P +W N +
Sbjct: 377 NESNLGDLLLGFLKYYATEFDWNSQMISVREAKAIPRPDGIEWR-----------NKYIC 425
Query: 298 IEDPFEQPENSARAVSEK 315
+E+PF+ N+ARAV EK
Sbjct: 426 VEEPFDG-TNTARAVHEK 442
>gi|332832216|ref|XP_003312195.1| PREDICTED: terminal uridylyltransferase 7 [Pan troglodytes]
gi|397470233|ref|XP_003806733.1| PREDICTED: terminal uridylyltransferase 7 isoform 2 [Pan paniscus]
Length = 1258
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 843 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 902
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 903 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 961
>gi|297374764|ref|NP_001172003.1| terminal uridylyltransferase 7 isoform 2 [Homo sapiens]
Length = 1259
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 844 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 903
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 904 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 962
>gi|426362158|ref|XP_004048247.1| PREDICTED: terminal uridylyltransferase 7 isoform 3 [Gorilla
gorilla gorilla]
Length = 1258
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 843 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 902
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 903 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 961
>gi|332260005|ref|XP_003279076.1| PREDICTED: terminal uridylyltransferase 7 isoform 1 [Nomascus
leucogenys]
Length = 1257
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L S
Sbjct: 842 IEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 901
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 902 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 960
>gi|426252414|ref|XP_004019909.1| PREDICTED: LOW QUALITY PROTEIN: speckle targeted PIP5K1A-regulated
poly(A) polymerase [Ovis aries]
Length = 885
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 95/212 (44%), Gaps = 20/212 (9%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL ++EL+ A L+G +LR G R+Q V AR P+++F
Sbjct: 366 GDLGKAVELAEALKGEKAEGGAMLELVGSILRGCVP--GVYRVQTVPSARRPVVRFCHRP 423
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS+ N S+FL S++DGR R +V ++ WA+ ++ N+Y+L
Sbjct: 424 SGLHGDISLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLNNYAL 482
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR- 243
+LLV++ QT P +LP + + A Q+ E+ ++ + F D R
Sbjct: 483 TLLVIYFLQTRDPPVLPTVSQLT------------QKAGEQV-EVDGWDCS-FPRDASRL 528
Query: 244 --KINRSSLAHLFVSFLEKFSGLSLKASELGI 273
N+ L+ L F S L+ S L +
Sbjct: 529 EPSTNKEPLSSLLAQFFSCVSCWDLRGSLLSL 560
>gi|70945312|ref|XP_742489.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521502|emb|CAH77577.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 340
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 72/281 (25%), Positives = 130/281 (46%), Gaps = 29/281 (10%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-KGGY 104
+ V PFGS ++ + + D+DI I++ +K + + L + L G
Sbjct: 71 KNCHVTPFGSVINGFWMKNSDIDICIQIP-----ILLNRKDQINFLKKICLILNNYHNGI 125
Query: 105 RRLQFVAHARVPILKFETI-HQN---ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+F A+VPI+ F H+N +SCDIS++N+ I SK + ID R + M +
Sbjct: 126 IEQRF--SAKVPIIHFYCDDHKNSFQLSCDISVNNILAVINSKLIQKYVSIDKRLQLMGI 183
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDI---------YPGN 210
+K W+K +IN+ G +S+SL L+++ Q + P IL L+DI Y
Sbjct: 184 ALKYWSKNRNINDRSKGFLSSFSLILMIIHFLQYVMEPKILVSLQDISIRRNEKSFYVMG 243
Query: 211 LVDDLKGVRANA--ERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKA 268
+ D K + +A ++ ++ N + Y + ++ L + F KF G K+
Sbjct: 244 V--DCKYCQDDAIIREELKKMNIQNGVNSDNKNYDHASHIDISTLMLEFF-KFYGYKYKS 300
Query: 269 SELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSA 309
+ I +++ S + ++ LF+++PFE +N A
Sbjct: 301 GIIAIRDINNYYDNFTSLKSY--ESYYLFVDNPFEIGKNVA 339
>gi|357623592|gb|EHJ74680.1| hypothetical protein KGM_08898 [Danaus plexippus]
Length = 443
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/239 (26%), Positives = 108/239 (45%), Gaps = 46/239 (19%)
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
A+VPILKF + D++ +N+ G + L+ S++D R R +V + K WA+AH I
Sbjct: 237 QAKVPILKFRDERNGLQVDLNCNNVVGIRNTNLLYCYSRMDWRVRPLVAITKLWARAHRI 296
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
N+ + T +SY+L+L+V+ HF C PA+L RA R A+
Sbjct: 297 NDARRRTLSSYALTLMVI-HFLQCGTSPAVL-----------------CRAGEARSRAQ- 337
Query: 230 CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI--CPFTGQWEHIRSNT 287
NR SL LF++ L+ ++ + + + WE
Sbjct: 338 ----------------NRCSLGELFLNLLKYYAEFPYEQMAVSVRAARRVPVWECRARAA 381
Query: 288 RWLPNNHP-----LFIEDPFEQPENSARAVSE-KNLAKISNAFEMTHFRLTSTNQTRYA 340
P++ P L +E+PF+ N+AR+V + + +I + F ++ RL + R A
Sbjct: 382 AAPPHHSPAHWKLLCVEEPFDL-TNTARSVYDPETFEQIVSTFRSSYTRLARGLRLRDA 439
>gi|291244423|ref|XP_002742099.1| PREDICTED: PAP associated domain containing 1-like [Saccoglossus
kowalevskii]
Length = 332
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/243 (22%), Positives = 108/243 (44%), Gaps = 20/243 (8%)
Query: 80 SSAGKKVKQSLLGDLLRALRQKGGY-RRLQFVAHARVPILKFETIHQNISCDISIDNLCG 138
+S+ + Q LG + L++ + +Q + AR PI+KF N+ CD+S +N
Sbjct: 8 ASSERAATQQTLGTVATFLQENVPHCVSVQRILKARCPIVKFHHKAANLQCDLSSNNSIA 67
Query: 139 QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVP 197
++ L+ D R R +V + WA+ + I G + ++ ++L+V++ QT P
Sbjct: 68 TKTTELLYLYGNYDSRVRPLVFAFRHWARYNGITTSCPGPWITNFGITLMVIYFLQTRSP 127
Query: 198 AILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSF 257
+++P L D L + ++++ I + + +N SL L F
Sbjct: 128 SVVPTL---------DYLCAMADSSDQCIVDDVNCTFLSDINSIPTSLNTQSLGQLMYEF 178
Query: 258 LEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+ ++ + + + + R + P ++PL+IE+PFE N R V L
Sbjct: 179 FDFYARFNFQKFAISL---------RRGDKFPKPEDYPLYIENPFEVDLNVTRNVHPDQL 229
Query: 318 AKI 320
++I
Sbjct: 230 SRI 232
>gi|336465270|gb|EGO53510.1| hypothetical protein NEUTE1DRAFT_126796 [Neurospora tetrasperma
FGSC 2508]
Length = 1285
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 113/255 (44%), Gaps = 31/255 (12%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
L L+++ L P E R K++ L +++ V FGS + L S D
Sbjct: 281 LTTTLRELYDSLIPTPEVERKRKKLVQKLEKILNDEWPGHDIQVNLFGSSGNLLCSDDSD 340
Query: 67 LDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
+DI CI++ K+++ ++ +LL K G ++ V+ A+VPI+K
Sbjct: 341 VDI--------CITTPWKELESVCMIAELL----HKHGMEKVVCVSSAKVPIVKIWDPEL 388
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSL 184
++CD++++N ++ + +ID R R + +++K W + IN+ GT +SY+
Sbjct: 389 QLACDMNVNNTLALENTRMVRTYVEIDERVRPLAMIIKYWTRRRIINDAAFGGTLSSYTW 448
Query: 185 SLLVLFHFQTCVPAILPPL----------KDIYPGNLVDDLKGVRANAERQIAEICA--F 232
L + Q P +LP L D + DD+ +R ++ + A F
Sbjct: 449 ICLTIAFLQLRDPPVLPALHQENSLKLLRPDGTKSDFADDIDKLRGFGDKNKDSLAALLF 508
Query: 233 NIARFSS-----DKY 242
N RF + DKY
Sbjct: 509 NFFRFYAHEFDYDKY 523
>gi|428175459|gb|EKX44349.1| hypothetical protein GUITHDRAFT_139887 [Guillardia theta CCMP2712]
Length = 308
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 119/251 (47%), Gaps = 20/251 (7%)
Query: 80 SSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
+S +K L+G L L ++ G ++ +AR+PI+KF+ CD+S++N+
Sbjct: 4 NSKKEKFTSFLIG--LARLLERQGMLNVEARPNARLPIIKFKGFA--FDCDLSVNNVLAC 59
Query: 140 IKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAI 199
I + LF + +D R R +++ +K W K I+N G +SY+ +L+V+ + Q +
Sbjct: 60 INTDLLFTYTMLDKRVRPLIMCIKHWVKQRQIHNTFRGYLSSYTYTLMVIQYLQ--YERV 117
Query: 200 LPPLKDI--YPGNLVDDLKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFV 255
LP L+ + L +D +++ + + C F N+ +S R RSSL L V
Sbjct: 118 LPCLQSLRRVQAKLNND-PSFAVSSDGDLYD-CYFYRNVETLASFGERN-KRSSLGLLLV 174
Query: 256 SFLEKFSG--LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
F +S +S+++ L G W+ N H L IEDPF+ + R V+
Sbjct: 175 GFFHFYSNGVVSVRSGRLLRKRAKG-WDTPED----FRNRHILCIEDPFDINLDLGRYVN 229
Query: 314 EKNLAKISNAF 324
+ + I F
Sbjct: 230 DYTVQDILEEF 240
>gi|154339183|ref|XP_001562283.1| DNA polymerase sigma-like protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134062866|emb|CAM39313.1| DNA polymerase sigma-like protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 519
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 99/217 (45%), Gaps = 31/217 (14%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
L+ L ++L L+P E+ +T+++VI D+R ++ G +E +GS + L
Sbjct: 163 ALDAKLIELLYCLSPTSEERQTKLRVIDDVRATIQQ----SGMDIEIYGSLYTGLTIPAS 218
Query: 66 DLDISIELSNGSCISSA-----------------GKKVKQSLLGDLLRALRQKGGYRR-- 106
D+D + S I+SA G ++ + G L ALR R
Sbjct: 219 DVDCVLMRSGNEQIASAMREDLLCAMSSIASAATGLASQRQVRGSLSVALRTVADRMRRS 278
Query: 107 -----LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL--FWISQIDGRFRDMV 159
+ ++ HARVPI+K ++ D+S + G + S +L + + R ++
Sbjct: 279 QKFTHITWIGHARVPIVKCRHRRDDVKVDMSFEK-GGCVSSNYLCNLFCKPGNEMARPLI 337
Query: 160 LLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV 196
+LVK +++P G S+ +SLLVL++ Q CV
Sbjct: 338 VLVKALVNNCGLDDPSIGGLGSFPISLLVLWYLQHCV 374
>gi|410077415|ref|XP_003956289.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
gi|372462873|emb|CCF57154.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
Length = 537
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 93/191 (48%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + E R IS +R+ V+ E A + FGS+ ++L+ D+D
Sbjct: 149 MKDFVSYISPSSTEIEDRNITISRIRDAVK--ELWPDADLHVFGSYSTDLYLPGSDIDCV 206
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S G K ++ L L + L K ++ V+ ARVPI+KF H I D
Sbjct: 207 VN-------SERGNKDSKNCLYQLAKFLTTKKLATDVEVVSKARVPIIKFVEPHTGIHID 259
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ A +NN TG +S+ LV
Sbjct: 260 VSFERTNGLEAAKLIRSWLDSTAG-LRELVLVIKQFLHARRLNNVHTGGLGGFSIICLV- 317
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 318 FTFLHMHPRII 328
>gi|71028114|ref|XP_763700.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350654|gb|EAN31417.1| hypothetical protein, conserved [Theileria parva]
Length = 487
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/204 (26%), Positives = 101/204 (49%), Gaps = 26/204 (12%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
D + R I++ E + + R +V FGS ++ L++ DLD+ +++ N + ++
Sbjct: 155 DLKMRSDRITEFLEKILREKVNRKCSVSFFGSAINGLWTDGSDLDVCVQIPNVTSRNAII 214
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE------------TIHQNI---- 127
+ +++ + +L L R Q A++PIL ++ TI Q+
Sbjct: 215 RNLRR--ISSVLTPLSPS---RIFQNRFTAKIPILHWKRDCIKAPNKLINTISQDKMYFE 269
Query: 128 -----SCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
S DIS++N + S + + R RD+VL +K WA+ +INN GT +S+
Sbjct: 270 CDDIPSIDISVNNDLAIVNSILVGSYVSFEPRVRDLVLYLKLWARNRNINNRSEGTLSSF 329
Query: 183 SLSLLVLFHFQTCVPAILPPLKDI 206
++SL+++ Q C P ILP L+D+
Sbjct: 330 AISLMLIHFLQNCNPPILPSLQDL 353
>gi|428180266|gb|EKX49134.1| hypothetical protein GUITHDRAFT_105213 [Guillardia theta CCMP2712]
Length = 362
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/228 (27%), Positives = 105/228 (46%), Gaps = 18/228 (7%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
T+ +GS V + DLDI+ + + G + K+ LL L + LRQ+ + L
Sbjct: 120 TTLSLYGSTVYGCATVDSDLDITFCIGD----QDMGLETKRKLLKRLSKVLRQRLQCQCL 175
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+ RVP++K E + NI D+S N +++ L S +D R + +LVK W++
Sbjct: 176 A-ILRCRVPLIKLEDKNTNIKADLSTGNAAPIPQARLLQRYSNMDSRISKLAILVKHWSR 234
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP--GNLVDDLKGVRANAERQ 225
IN+ NSY LLVL QT P ILP L P GN+ ++ ++ Q
Sbjct: 235 TRGIND-GANLMNSYCYCLLVLHFCQTIQPPILPILDCNKPIHGNV------LKLSSRDQ 287
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGI 273
+ + F R ++ N +L+ L F + ++ + + + G+
Sbjct: 288 LLQDSKFQGRR----EWVSENVQTLSELLGKFFKYYAEVDMNVRDFGL 331
>gi|407918735|gb|EKG12001.1| PAP/25A-associated [Macrophomina phaseolina MS6]
Length = 1265
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 104/233 (44%), Gaps = 35/233 (15%)
Query: 18 LNPLREDWETRMKVISDLREVV------ESVESLRGATVEPFG----------SFVSNLF 61
LNP E+ K+ D+RE+ E + R VE G F ++F
Sbjct: 250 LNPDEEE-----KLSGDMRELYDRLLPSEESQKRRKLLVEKLGRILRTEWPGNEFKVHVF 304
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
G+L + E CI + KK++ + L AL K G ++ VA A+VPI+K
Sbjct: 305 GSSGNLLCTAESDVDVCIQTPMKKLESVHM--LAEAL-AKHGMSKVVCVASAKVPIVKVW 361
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFN 180
++CD++++N ++ + QID R R + +++K W K +N+ GT +
Sbjct: 362 DPELELACDMNVNNTLALENTRMIKTYVQIDERVRPLTMIIKYWTKQRILNDAAMGGTLS 421
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDI----YP------GNLVDDLKGVRANAE 223
SY+ +VL QT P +LP L + +P + DDL VR E
Sbjct: 422 SYTWICMVLNFLQTRNPPVLPSLHQMPFEKHPTETGEESSFFDDLDKVRGFGE 474
>gi|170028053|ref|XP_001841911.1| monkey king protein [Culex quinquefasciatus]
gi|167868381|gb|EDS31764.1| monkey king protein [Culex quinquefasciatus]
Length = 646
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 80/352 (22%), Positives = 154/352 (43%), Gaps = 47/352 (13%)
Query: 11 LKDILGMLNPLREDWETRM-KVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDI 69
++ ++ L P + + E + KV DL V+ + V FGS S L R DLD
Sbjct: 80 MRTLINTLQPSQHEIEMALNKVKKDLDRVLAFPNN--SYCVYDFGSIKSGLAFRDSDLDF 137
Query: 70 SIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
+ S + K+ + R +R K + ++ + A+VP+L+ N++C
Sbjct: 138 YVHYERNSENRNDQTKLIHVIHS---RMMRDKTFHTLVKIIG-AKVPLLRAVHGPTNLTC 193
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK-AHDINNPKTGTFNSYSLSLLV 188
DI+ N G SKF++ +++ D R + +++K WAK A + N + N+Y + +++
Sbjct: 194 DINFSNARGCYNSKFIYALTKFDSRIHKLAIIIKFWAKCAFLLTNHR--QMNTYCIIMML 251
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGV-RANAERQIAEICAFNIARFSSDKYRKINR 247
+F+ QT +LP ++D+ KG+ R N +N+ ++ +NR
Sbjct: 252 IFYLQTKKLPLLPSVQDLQ--------KGIPRVN-------YGPWNLGYPREIIFQSMNR 296
Query: 248 SSLAHLFVSFLEKFSGLSLKASELGICPFTGQ---------------WEHIRSNTRWLPN 292
S+ L +F + ++ + + + I P+ G+ + R+ + P
Sbjct: 297 ESIRQLLTAFFKYYA--TFEYDKYLISPYVGRRVTVDEMKQQKVRELQPYYRAEQQQFPQ 354
Query: 293 ---NHPLFIEDPFEQPENSARAV-SEKNLAKISNAFEMTHFRLTSTNQTRYA 340
L I+DPFE N + S ++ + +F+ H +T Q +A
Sbjct: 355 FNYGTLLHIQDPFELNMNVGGVLNSAQHFEQFKLSFKTAHEVCLATIQEPFA 406
>gi|393245685|gb|EJD53195.1| Nucleotidyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 584
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 120/299 (40%), Gaps = 52/299 (17%)
Query: 5 NVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRW 64
++L +K L+P E+ E R +I + V A+V+ FGSF + L+
Sbjct: 112 DMLHEEVKAFSEYLSPTPEEHEVRQLIIKLIENCVR--RQWPEASVKAFGSFETRLYHPL 169
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GD+D+ I C ++ +L L AL+++G +Q +A ARVPI+KF T H
Sbjct: 170 GDIDLVI------CSERLEMMERKHVLYQLSHALKREGLADNVQVIAKARVPIIKFRTTH 223
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS++ G + + Q R + + VK + + ++N G SYS
Sbjct: 224 GRFAVDISVNQDNGIASGRIVNGFLQELPALRPLAMTVKAFLRERNMNEVYNGGLGSYS- 282
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
++ +L F P I ++ EI A
Sbjct: 283 TVCLLVSFLQMHPKI-------------------------RLGEIRA------------- 304
Query: 245 INRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDP 301
+L + + FLE + L E+G+ G +S+ W N P L IEDP
Sbjct: 305 --EDNLGTMLIEFLELYGHL-FNVEEVGVSLRDGGSYFRKSHRGWQDPNKPFLLSIEDP 360
>gi|345566395|gb|EGX49338.1| hypothetical protein AOL_s00078g371 [Arthrobotrys oligospora ATCC
24927]
Length = 1300
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 137/334 (41%), Gaps = 62/334 (18%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
L+++ L P E E R K + L +++ V PFGS + L S D+D+
Sbjct: 290 LENLYSELLPSEESNERRRKFLEKLEKLLNDEWPGHEIKVRPFGSTENRLCSTDSDVDV- 348
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI + K ++ LL +L R + R+ V +A+VPI++ + C
Sbjct: 349 -------CIVTDFKDLENVCLLAKVLGKHRME----RIVCVQNAKVPIVRIWDPEYKVQC 397
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ID R + + +++K WAK +N+ GT +SY+ ++
Sbjct: 398 DMNVNNTLALENTRMVKTYVDIDPRVQRLAMIIKYWAKQRILNDAAGGGTLSSYTWICMI 457
Query: 189 LFHFQTCVPAILPPL-----KDIYPGNLV-----DDLKGVRANAERQIAEICAFNIARFS 238
+ QT P ILP L K P N V DD++ +R
Sbjct: 458 VSFLQTREPPILPSLHQREHKKRPPQNGVDVSFDDDIEALR------------------- 498
Query: 239 SDKYRKINRSSLAHLFVSFLEKF--------SGLSLKASELGICPFTGQWEHIRSNTRWL 290
+ K N SL L +F +++ S +S++ L I +W+ + N
Sbjct: 499 --DFGKANTESLGSLLFNFFKRYGYEIDFEKSVISIRMGRL-ISKTEKKWDALLGNR--- 552
Query: 291 PNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L +E+PF N + + + I F
Sbjct: 553 -----LCVEEPFSIARNLSNGADDNAVRGIHEEF 581
>gi|338719630|ref|XP_003364033.1| PREDICTED: terminal uridylyltransferase 7 [Equus caballus]
Length = 1265
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 109/242 (45%), Gaps = 29/242 (11%)
Query: 93 DLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQID 152
+L R L++ G R + + A+VPI+KF + + DIS+ N ++ L S ID
Sbjct: 852 ELARVLKKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAID 911
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLV 212
R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 912 PRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYRGEKK 971
Query: 213 DDL--KGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG------- 263
++ G QI E+ ++ +Y K N S+ L++ L ++
Sbjct: 972 PEIFVDGWNIYFFDQIDELPSY------WPEYGK-NTESVGQLWLGLLRFYTEEFDFKEH 1024
Query: 264 -LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISN 322
+S++ L + F QW + + IEDPF+ N +S K I
Sbjct: 1025 VISIRRKSL-LTTFKKQW-----------TSKYIVIEDPFDLNHNLGAGLSRKMTNFIMK 1072
Query: 323 AF 324
AF
Sbjct: 1073 AF 1074
>gi|351713844|gb|EHB16763.1| Poly(A) RNA polymerase, mitochondrial [Heterocephalus glaber]
Length = 544
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/213 (25%), Positives = 99/213 (46%), Gaps = 23/213 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG-DLDI 69
L +L L E+ R + S + ++ + G T+ PFGS V N F + G DLD+
Sbjct: 193 LSTLLKELQLTEENTRLRYLICSLIEDIATAY--FPGCTIRPFGSSV-NTFGKLGCDLDM 249
Query: 70 SIELSNGSCISS-----------------AGKKVKQSLLGDLLRALRQKG-GYRRLQFVA 111
I+L + + + + Q +L + +L G G +Q +
Sbjct: 250 FIDLHEIRKLRTHKRIGNFLMEFQVKNVPSERIATQKILTVIGESLDHFGPGCVGIQKIL 309
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+AR P+++F CD++ +N S+ L+ +D R R +V ++ WA+AH +
Sbjct: 310 NARCPLVRFSHQASGFQCDLTTNNRVALKSSELLYIYGSMDSRVRALVFSIRCWARAHSL 369
Query: 172 NNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPL 203
+ G++ ++SL+++V+F Q P ILP L
Sbjct: 370 TSNIPGSWITNFSLTMMVIFFLQRRSPPILPTL 402
>gi|339239329|ref|XP_003381219.1| putative poly(A) RNA polymerase Cid13 [Trichinella spiralis]
gi|316975766|gb|EFV59165.1| putative poly(A) RNA polymerase Cid13 [Trichinella spiralis]
Length = 397
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 133/298 (44%), Gaps = 46/298 (15%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK-- 85
+MK++ DL E ++S +LR + GS V+ L D+D+ + I+ ++
Sbjct: 89 KMKMVKDLSEYLKSFCALRAILLT--GSSVTGLGLNDCDMDLCL-------ITPTPRREY 139
Query: 86 -VKQSLLGDLLRALRQKGGYRRLQFVAH----ARVPILKFETIHQ-NISCDISIDNLCGQ 139
+++ L L+A H A+VPIL+ + ++ S DI+ +++ G
Sbjct: 140 YIERHLALQTLQACYNAFCNPNSPICQHQIITAKVPILRGKFVNPWGYSVDINCNHVLGI 199
Query: 140 IKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PA 198
S L +ID RF +V+ +K WAK + + + G NSYS +LLVL + Q V P
Sbjct: 200 YNSYLLRSYVKIDDRFAPLVICIKHWAKLKGLCDAQNGYLNSYSWTLLVLNYLQCGVRPP 259
Query: 199 ILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFL 258
+LP L+ +YP + NA + +I N ++R N + LF F
Sbjct: 260 VLPSLQSLYPNHF---------NANIDVLDI---NFNTPFPFEFRSENVQPIEQLFAGFF 307
Query: 259 EKFSGLSLKASELGICPFTGQWEHI--RSNTRWLPNNH--PLFIEDPFEQPENSARAV 312
+ C + E I R R N P++IE+PF +N+A+++
Sbjct: 308 RHYG-----------CRVNYEMEMISVRLGCRVPRENQISPIWIEEPFNF-QNTAQSL 353
>gi|296818185|ref|XP_002849429.1| PAP/25A associated domain family protein [Arthroderma otae CBS
113480]
gi|238839882|gb|EEQ29544.1| PAP/25A associated domain family protein [Arthroderma otae CBS
113480]
Length = 986
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 90/195 (46%), Gaps = 21/195 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E + R + + L +++ + V FGS + L + D+DI
Sbjct: 121 IKELYRKLLPSAESEQRRAQFVRKLEKLLNTRWPGNEIKVNVFGSSGNKLCTSVSDVDI- 179
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI++ K G R+ V+HA+VPI+K ++CD
Sbjct: 180 -------CITTPSK------------CFEPVCGMERVVCVSHAKVPIVKIWDPELQVACD 220
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + ++D R R + +L+K W K +N+ GT +SY+ L++
Sbjct: 221 MNVNNTLALENTRMVKTYVELDDRIRPLAMLIKHWTKRRILNDAALGGTLSSYTWICLII 280
Query: 190 FHFQTCVPAILPPLK 204
QT +P I+P L+
Sbjct: 281 NFLQTRIPPIVPSLQ 295
>gi|297597347|ref|NP_001043830.2| Os01g0672700 [Oryza sativa Japonica Group]
gi|56201854|dbj|BAD73304.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
gi|56201907|dbj|BAD73357.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
gi|255673541|dbj|BAF05744.2| Os01g0672700 [Oryza sativa Japonica Group]
Length = 578
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 85/181 (46%), Gaps = 10/181 (5%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D ++P E+ +R + + V++ + VE FGSF + LF D+D+
Sbjct: 148 DFCDFISPSAEEQSSRTAAVKAVSNVIKHI--WPQCKVEVFGSFRTGLFLPTSDIDV--- 202
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I + K Q L L +AL QKG +++Q +A ARVPI+KF I+ DIS
Sbjct: 203 -----VIFDSRVKTPQVGLYALAKALSQKGVAKKIQVIAKARVPIVKFVERKSEIAFDIS 257
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G + F+ + R + +++K + ++N TG SY+L +++ H
Sbjct: 258 FDMDGGPQAADFIKDYVKKFPALRHLCMILKVFLHQRELNEVYTGGIGSYALLTMLITHL 317
Query: 193 Q 193
Q
Sbjct: 318 Q 318
>gi|47226704|emb|CAG07863.1| unnamed protein product [Tetraodon nigroviridis]
Length = 317
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 60/97 (61%), Gaps = 2/97 (2%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDI 171
A VPIL+F N+ D++++N G I++ FL + D R + M+L+VK+WA+ + I
Sbjct: 122 ATVPILRFREKGSNLEFDLNVNNTVG-IRNTFLLRGYANADHRIKPMILVVKKWARHNQI 180
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP 208
N+ GT +SY+L L+VL + QT ++P L+ YP
Sbjct: 181 NDASKGTLSSYTLVLMVLHYLQTLQEPVVPSLQLDYP 217
>gi|350413850|ref|XP_003490134.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Bombus impatiens]
Length = 691
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 90/182 (49%), Gaps = 7/182 (3%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELS---NGSCISSA 82
ETR K + + + V R T FGS + L + DLDI +++ N S +S
Sbjct: 163 ETRYKSVCIQMDKIFKVIFPRCKTYR-FGSTQTGLGFKECDLDIYMDIGEPINESKSTST 221
Query: 83 GKKVKQSLLGDLLRAL-RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIK 141
+ ++ + + R + + + A+ PI+KF + N+SCDIS N G K
Sbjct: 222 DAWTMHKIFKEVKKVMYRMNCVFSNIILIPKAKTPIIKFYYVRTNVSCDISFKNSLGVYK 281
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
S + +D R + ++LL+K WA+ I++ + ++Y+L LL++F+ Q I+P
Sbjct: 282 SYLIKHCISLDNRLKPLMLLIKYWARHFKISSGQ--KISNYALVLLIIFYLQQPSVNIIP 339
Query: 202 PL 203
PL
Sbjct: 340 PL 341
>gi|116235017|dbj|BAF34948.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 578
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 85/181 (46%), Gaps = 10/181 (5%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D ++P E+ +R + + V++ + VE FGSF + LF D+D+
Sbjct: 148 DFCDFISPSAEEQSSRTAAVKAVSNVIKHI--WPQCKVEVFGSFRTGLFLPTSDIDV--- 202
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I + K Q L L +AL QKG +++Q +A ARVPI+KF I+ DIS
Sbjct: 203 -----VIFDSRVKTPQVGLYALAKALSQKGVAKKIQVIAKARVPIVKFVERKSEIAFDIS 257
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G + F+ + R + +++K + ++N TG SY+L +++ H
Sbjct: 258 FDMDGGPQAADFIKDYVKKFPALRHLCMILKVFLHQRELNEVYTGGIGSYALLTMLITHL 317
Query: 193 Q 193
Q
Sbjct: 318 Q 318
>gi|409081996|gb|EKM82354.1| hypothetical protein AGABI1DRAFT_52475, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 559
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 123/307 (40%), Gaps = 56/307 (18%)
Query: 44 SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG 103
+ G+ V PFGS+ + L+ GD+D+ I +S+ S+ K S+L L LR+ G
Sbjct: 177 AFSGSKVFPFGSYETKLYLPSGDIDLVI-VSDSMAYSN-----KSSVLHSLASVLRRAGI 230
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDN----LCGQIKSKFLFWISQIDGRFRDMV 159
+ +A A+VPI+KF TIH + DISI+ + GQ+ FL + R +V
Sbjct: 231 ASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGIVGGQVIKGFLQNLVTGGLALRSLV 290
Query: 160 LLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVR 219
L+ K + +N TG SYS+ L + Q ++P
Sbjct: 291 LITKLFLSQRSMNEVFTGGLGSYSIVCLAISFLQ------------MHP----------- 327
Query: 220 ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ 279
I R D + +L L + F E + G E+GI G
Sbjct: 328 -------------KIRRGEIDPEK-----NLGVLVMEFFELY-GCHFNYDEVGISVRDGG 368
Query: 280 WEHIRSNTRWL-PNNHP-LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT 337
+ W P+ L IEDP + P N + S AK+ F H LTST
Sbjct: 369 TYFNKRMRGWYNPDKRAGLCIEDPVD-PTNDISSGS-FGFAKVRTTFAGAHGILTSTAYN 426
Query: 338 RYALLSS 344
R L +
Sbjct: 427 RATYLDA 433
>gi|117645866|emb|CAL38400.1| hypothetical protein [synthetic construct]
Length = 1258
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A++PI+KF + + DIS+ N ++ L S
Sbjct: 843 IEELARVLRKHSGLRNILPITTAKMPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 902
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 903 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 961
>gi|117644866|emb|CAL37899.1| hypothetical protein [synthetic construct]
gi|306921257|dbj|BAJ17708.1| zinc finger, CCHC domain containing 6 [synthetic construct]
Length = 1258
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A++PI+KF + + DIS+ N ++ L S
Sbjct: 843 IEELARVLRKHSGLRNILPITTAKMPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 902
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 903 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 961
>gi|52545695|emb|CAH56219.1| hypothetical protein [Homo sapiens]
Length = 1258
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 65/119 (54%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
+ +L R LR+ G R + + A++PI+KF + + DIS+ N ++ L S
Sbjct: 843 IEELARVLRKHSGLRNILPITTAKMPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSA 902
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 903 IDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 961
>gi|388518551|gb|AFK47337.1| unknown [Lotus japonicus]
Length = 159
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/79 (49%), Positives = 53/79 (67%), Gaps = 2/79 (2%)
Query: 289 WLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARP 348
WLP + +F+EDPFEQP+N+AR+VS L+KIS AF T+ LTS NQ + +LL+ LA P
Sbjct: 3 WLPKTYAIFVEDPFEQPQNTARSVSAGQLSKISEAFLRTYSVLTSKNQNQNSLLTFLAPP 62
Query: 349 FILQFFGESPVRYANYNNG 367
+ + + PV NYN G
Sbjct: 63 EVSRLIIK-PV-IPNYNGG 79
>gi|355567541|gb|EHH23882.1| Terminal uridylyltransferase 7 [Macaca mulatta]
Length = 1348
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 87/178 (48%), Gaps = 15/178 (8%)
Query: 35 LREVVESV--ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-- 90
+R+ +ES + G + FGS + + DLD+ C++ G + + L
Sbjct: 922 IRQNLESFIRQDFPGTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDC 973
Query: 91 ---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ +L R LR+ G R + + A+VPI+KF + + DIS+ N ++ L
Sbjct: 974 VRTIEELARVLRKHSGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSA 1033
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD 205
S ID R + + +K + K DI + G+ +SY+ +L+VL+ Q P ++P L++
Sbjct: 1034 YSAIDPRVKYLCYTMKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQE 1091
>gi|325181242|emb|CCA15656.1| Poly(A) polymerase putative [Albugo laibachii Nc14]
Length = 493
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 26/215 (12%)
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
++V ARVPILK I+CD+ + + + + + ++D R R + VK WAK
Sbjct: 129 EYVQSARVPILKLWNSRHQIACDLCVGGFHVVLNTAMMRYYGELDRRVRPLAFAVKYWAK 188
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL--VDDLKGVRANAER 224
+ IN+ GT +SY LLV+F+ Q+ P LP K I+ L +D E
Sbjct: 189 SRGINDSSNGTLSSYGYCLLVIFYLQSRFGPTKLPCSKGIFGDTLEHCEDFASFSKKVEN 248
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF-SGLSLKASELGI------CPFT 277
+ S +N S+ L F + S ++ + + + C
Sbjct: 249 -------YPFPESSLHGSTALNVDSVGSLLKGFFNFYASEFDMERNVVNVREGIQTCK-E 300
Query: 278 GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV 312
+WE+ P L IEDPFE + AR +
Sbjct: 301 AKWEY--------PVAWRLSIEDPFESGHDVARVI 327
>gi|403224340|dbj|BAM42470.1| uncharacterized protein TOT_040000837 [Theileria orientalis strain
Shintoku]
Length = 474
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/182 (25%), Positives = 90/182 (49%), Gaps = 28/182 (15%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
+V FGS ++ L++ DLD+ +E+ N + S+ + +++ + L R +
Sbjct: 151 CSVSLFGSAINGLWTEGSDLDVCVEIPNVNSRSAVIRNLRR-----IATVLSPLSPTRVI 205
Query: 108 QFVAHARVPILKF---------ETIHQNI--------------SCDISIDNLCGQIKSKF 144
Q A++PIL + + + +++ S DIS++N+ S
Sbjct: 206 QNRFTAKIPILNWRRDSKKRPVKIVEESLNKQEILDFECESIPSIDISVNNVLAVANSIL 265
Query: 145 LFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK 204
+ + R R ++LL+K WAK+ IN+ GT +S+++SL+V+ Q C P +LP L+
Sbjct: 266 VGSYVSFEPRVRGLILLLKMWAKSKGINDRSRGTLSSFAISLMVIHFLQNCSPPLLPSLQ 325
Query: 205 DI 206
D+
Sbjct: 326 DL 327
>gi|260829841|ref|XP_002609870.1| hypothetical protein BRAFLDRAFT_90748 [Branchiostoma floridae]
gi|229295232|gb|EEN65880.1| hypothetical protein BRAFLDRAFT_90748 [Branchiostoma floridae]
Length = 344
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 56/99 (56%)
Query: 96 RALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRF 155
RA+ + + + A+VPILKF+ + CD++I+NL G + L S++D R
Sbjct: 62 RAISSTSAFIQRPQLIRAKVPILKFKDSVSGVECDVNINNLTGVRNTFLLQAYSRLDWRI 121
Query: 156 RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT 194
R +V LVK WA A IN+ T +SYSL+L+ L + Q
Sbjct: 122 RPLVFLVKLWAGAQGINDASQSTLSSYSLTLMTLHYLQV 160
>gi|427798163|gb|JAA64533.1| Putative terminal uridylyltransferase 4, partial [Rhipicephalus
pulchellus]
Length = 710
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 76/309 (24%), Positives = 135/309 (43%), Gaps = 59/309 (19%)
Query: 23 EDWETRMKVISDLREVVESVE-----SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
E+ E R +V+SDL +++ SL G++ FG SN ++I+L+
Sbjct: 188 EEVELRKRVVSDLETFIKATLPDVKLSLHGSSGNGFGLKTSN---------VNIDLT--- 235
Query: 78 CISSAGKKVKQSLL---GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISID 134
GK L GDLL+ + Y ++ ++VP ++F+ + +SC+IS++
Sbjct: 236 ---PLGKADCAQLFVGTGDLLQECPK---YAQVTKDYLSKVPRIRFKEVDSKLSCEISLN 289
Query: 135 NLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT 194
N Q SK L + +D R + + + + WAK ++ GT ++ +++ +F Q
Sbjct: 290 NSNSQKTSKLLDDYASLDRRVKILGVAFRLWAKHCGLDQQDRGTLPPHAFAIMTVFFLQQ 349
Query: 195 CVPAILPPLKDIYPGNLVD------DLKGVRANAERQIAEICAFNIARFSSDKYRKINRS 248
C PA+LP L ++ G + DL+G R+S N
Sbjct: 350 CKPAVLPVLHEMKDGKESESYLKPKDLEG------------------RWSCK-----NDR 386
Query: 249 SLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENS 308
S+ L+V L +F K ++ +C Q I +W N + IEDP+ N
Sbjct: 387 SIGQLWVELL-RFYATEFKLNKRVVCIRRSQPMLI-VEKKW--NKRYIAIEDPYSCKRNL 442
Query: 309 ARAVSEKNL 317
AR++ + +
Sbjct: 443 ARSIPSERM 451
>gi|357130698|ref|XP_003566984.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Brachypodium distachyon]
Length = 619
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 142/343 (41%), Gaps = 58/343 (16%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D ++P E+ +R + + +VV+ + VE FGSF + L+ D+D+
Sbjct: 182 DFCDFISPSAEEQSSRTAAVQAVSDVVKHI--WPHCKVEVFGSFRTGLYLPTSDIDV--- 236
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I + K Q L L +AL QKG +++Q +A ARVPI+KF I DIS
Sbjct: 237 -----VIFESRVKTPQVGLYALAKALSQKGVAKKIQVIAKARVPIVKFVERVSGIPFDIS 291
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G + F+ + R + +++K + ++N TG SY+L +++ H
Sbjct: 292 FDIDGGPQAADFIKDAIRKMPALRPLCMILKVFLHQRELNEVYTGGVGSYALLTMLITHL 351
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q V D+ G YR+ +L
Sbjct: 352 QLIWG--------------VKDMLG------------------------YRQSKEHNLGI 373
Query: 253 LFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRW--LPNNHPLFIEDPFEQPENSAR 310
L V F + F G L ++GI + + ++S+ + L H + I+DP P+N
Sbjct: 374 LLVKFFD-FYGRKLNNWDVGISCNSARTFFLKSDKDFVNLDRPHLIAIQDPM-VPDNDI- 430
Query: 311 AVSEKNLAKISNAFE-----MTHFRLTSTNQTRYALLSSLARP 348
+ N K+ +AF +T +L ++ ++L ++ RP
Sbjct: 431 GKNSFNYFKVKSAFSKAYSVLTDAKLITSLGPNRSILGAIVRP 473
>gi|19114069|ref|NP_593157.1| poly(A) polymerase Cid13 [Schizosaccharomyces pombe 972h-]
gi|26392335|sp|Q9UT49.1|CID13_SCHPO RecName: Full=Poly(A) RNA polymerase cid13; Short=PAP; AltName:
Full=Caffeine-induced death protein 13; AltName:
Full=Polynucleotide adenylyltransferase cid13
gi|6014438|emb|CAB57438.1| poly(A) polymerase Cid13 [Schizosaccharomyces pombe]
Length = 578
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 71/315 (22%), Positives = 127/315 (40%), Gaps = 30/315 (9%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
E R + L ++++ + FGS S L S D+D+ I C + +
Sbjct: 70 ERRYAFVQKLEQILKKEFPYKNIKTSLFGSTQSLLASNASDIDLCIITDPPQCAPTTCE- 128
Query: 86 VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
+ A + G +++ ++ A+VPI+K +SCD +I+ + ++ +
Sbjct: 129 ---------VSAAFARNGLKKVVCISTAKVPIVKVWDSELQLSCDCNINKTISTLNTRLM 179
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINN-PKTGTFNSYSLSLLVLFHFQTCVPAILPPLK 204
D R R +++++K WAK +N+ + GT SY++S +V+ Q P ILP L+
Sbjct: 180 RSYVLCDPRVRPLIVMIKYWAKRRCLNDAAEGGTLTSYTISCMVINFLQKRDPPILPSLQ 239
Query: 205 DIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDK----YRKINRSSLAHLFVSFLEK 260
++ L+ + +++ F + N SL LFV F +
Sbjct: 240 ------MLPHLQDSSTMTD-------GLDVSFFDDPDLVHGFGDKNEESLGILFVEFF-R 285
Query: 261 FSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
F G + G + R+ N+ L +E+PF N A E + I
Sbjct: 286 FFGYLFDYEHFVLSIRHGTFLSKRAKGWQFQLNNFLCVEEPFHTSRNLANTADEITMKGI 345
Query: 321 SNAFEMTHFRLTSTN 335
F FRL + N
Sbjct: 346 QLEFRRV-FRLLAYN 359
>gi|167535384|ref|XP_001749366.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772232|gb|EDQ85887.1| predicted protein [Monosiga brevicollis MX1]
Length = 369
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 92/228 (40%), Gaps = 50/228 (21%)
Query: 106 RLQFVAHARVPILKFETIHQN-ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
R Q + ARVP+ KF H++ + CD+S+ N ++ L +D R+R + +K+
Sbjct: 98 RFQLITRARVPLFKFR--HKDGLDCDVSVSNRLALCNTRLLEAYCLLDERYRPLGYFLKK 155
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK--------------DIYPGN 210
W KA +++ G F+SY++++++L Q P +LP L+ D Y
Sbjct: 156 WCKAVGLHDASQGGFSSYAMTMMLLASLQQASPPVLPYLQQLASPACPKQQRLVDGYDAY 215
Query: 211 LVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASE 270
DL V+ R ++N +LA L F + +
Sbjct: 216 FCTDLPYVQQTWRRT------------------EVNTQTLAELVAGFFDFCATF------ 251
Query: 271 LGICPFTGQWEHIRSNTRWLPNNHP-----LFIEDPFEQPENSARAVS 313
PF + +R T + + +EDPF+ N R VS
Sbjct: 252 ----PFEKRVMQVREGTVLFKADKDWDADIIAVEDPFDPTHNLTRTVS 295
>gi|255730627|ref|XP_002550238.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240132195|gb|EER31753.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 603
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D + ++P ++ TR KVI+ L++ + G TV FGS ++L+ D+D+
Sbjct: 171 IRDFVNYISPSSDEIITRNKVIAALKKSISDF--WPGTTVHVFGSCATDLYLPGSDIDMV 228
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ +S G S L L LR ++ +AHA+VPI+KF + D
Sbjct: 229 V-------VSDTGSYENASRLYQLSTFLRTNKLATEVEVIAHAKVPIIKFVDPKSRLHID 281
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ + +NN G Y+ ++++
Sbjct: 282 VSFERTNGIDAAKRIRRWLVSTPG-LRELVLVVKQFLRTRRLNNVHVGGLGGYA-TIIMC 339
Query: 190 FHFQTCVPAI 199
+HF P I
Sbjct: 340 YHFLRLHPKI 349
>gi|325181825|emb|CCA16280.1| U3 small nucleolar ribonucleoprotein protein IMP4 p [Albugo
laibachii Nc14]
Length = 784
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 90/215 (41%), Gaps = 26/215 (12%)
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
++V ARVPILK I+CD+ + + + + + ++D R R + VK WAK
Sbjct: 129 EYVQSARVPILKLWNSRHQIACDLCVGGFHVVLNTAMMRYYGELDRRVRPLAFAVKYWAK 188
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNL--VDDLKGVRANAER 224
+ IN+ GT +SY LLV+F+ Q+ P LP K I+ L +D E
Sbjct: 189 SRGINDSSNGTLSSYGYCLLVIFYLQSRFGPTKLPCSKGIFGDTLEHCEDFASFSKKVEN 248
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF-SGLSLKASELGI------CPFT 277
+ S +N S+ L F + S ++ + + + C
Sbjct: 249 -------YPFPESSLHGSTALNVDSVGSLLKGFFNFYASEFDMERNVVNVREGIQTCK-E 300
Query: 278 GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV 312
+WE+ P L IEDPFE + AR +
Sbjct: 301 AKWEY--------PVAWRLSIEDPFESGHDVARVI 327
>gi|294654384|ref|XP_456434.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
gi|199428840|emb|CAG84386.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
Length = 600
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + TR +V++ L++ + + FGS ++L+ D+D+
Sbjct: 176 IKDFVNYISPSEAEIMTRNRVVNQLKQQIGQF--WPATELHVFGSCATDLYLPGSDIDMV 233
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ +S G +S L L LR K + ++ +A A+VPI+KF NI D
Sbjct: 234 V-------VSETGDYEHRSRLYQLSSFLRNKKLAKNIEVIAKAKVPIIKFVDPTSNIHID 286
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G K W+S G R++VL+VK++ ++ +NN G YS ++++
Sbjct: 287 ISFERTNGIDAAKKIRRWLSSTPG-LRELVLIVKQFLRSRKLNNVHVGGLGGYS-TIILC 344
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 345 YHFLKLHPRL 354
>gi|146085154|ref|XP_001465192.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398014433|ref|XP_003860407.1| hypothetical protein, conserved [Leishmania donovani]
gi|134069289|emb|CAM67439.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322498628|emb|CBZ33700.1| hypothetical protein, conserved [Leishmania donovani]
Length = 406
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 87/179 (48%), Gaps = 8/179 (4%)
Query: 30 KVISDLREVVESVES--LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
+V+ DLR V + S + ++ FGS + + D D+S+ N S S + V
Sbjct: 27 EVVDDLRWRVVDLCSRCVNKVELQLFGSLATGFCTTGADADLSLTFRNFSPWLSGIEVVD 86
Query: 88 QSLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
L R R+ G G ++ + +AR+P+L+F+ I CD++I NL G SK L
Sbjct: 87 AQNFKRLARVGREAGEMGMENVRLI-NARIPVLQFQDAISGIRCDLTIGNLGGVANSKIL 145
Query: 146 FWISQIDGRFRD-MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CVPAILP 201
I ++ F V LVKEWAK ++ P FNS++++ + L Q +P +P
Sbjct: 146 AEIHRVLPDFYGAYVYLVKEWAKKCEVVAPDKSMFNSFTMTTMSLMVLQELGLLPIFVP 204
>gi|428186051|gb|EKX54902.1| hypothetical protein GUITHDRAFT_99551 [Guillardia theta CCMP2712]
Length = 489
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 91/194 (46%), Gaps = 29/194 (14%)
Query: 38 VVESVESLRGATVE--PFGSFVSNLFSRWGDLDISIELSNGS-----------------C 78
+ +V+ + GA E +GS + S DLD+ + LS+
Sbjct: 158 LTRAVKCILGADAELRAYGSAAAGFGSVDSDLDLQLSLSSKRKQLRTPMAVRGTYLPRVL 217
Query: 79 ISSAGKKVKQ-------SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
+ GKK +L L ALR + G + + + ARVP++ ++ +S DI
Sbjct: 218 VRMQGKKTVMMRRRENIRMLKVLSHALRSRFGLKAVA-ILRARVPLVTVQSEDATLSFDI 276
Query: 132 SI--DNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
S+ + G+ FL + ++D R +++VL VK W+K +IN GT NS+SL ++VL
Sbjct: 277 SVHEEENFGRFAVNFLEVMRRVDERVKEVVLAVKTWSKRREINEAFRGTLNSFSLIIMVL 336
Query: 190 FHFQTCVPAILPPL 203
F Q P ILP L
Sbjct: 337 FVLQRLDPPILPNL 350
>gi|401423734|ref|XP_003876353.1| DNA polymerase sigma-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322492595|emb|CBZ27872.1| DNA polymerase sigma-like protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 479
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 101/212 (47%), Gaps = 31/212 (14%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL---------- 60
L ++L L+P ED E ++VI D+R ++ G ++ +GS + L
Sbjct: 128 LIELLYCLSPTSEDRERMLRVIDDIRATMQRT----GMDIQIYGSLCTGLVIPASDVDCV 183
Query: 61 FSRWGDLDISIELS-NGSC----ISSA--GKKVKQSLLGDLLRA-------LRQKGGYRR 106
R GD I+ +S N SC I+SA G ++SL L A +R+ +
Sbjct: 184 LMRSGDQQIASAMSENLSCAMLTIASAATGSVSQKSLKISLSTAVRVVAERMRKSQKFAH 243
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL--FWISQIDGRFRDMVLLVKE 164
+ +AHA+VPI+K H ++ D+S + G + S +L + + R +++LVK
Sbjct: 244 VTSIAHAKVPIVKCRHRHDDVKVDLSFEQ-SGCVSSNYLCELFCEPGNEMARPLIVLVKA 302
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV 196
++ P G S+ +SLLVL++ Q CV
Sbjct: 303 LVNNCGLDEPSMGGLGSFPISLLVLWYLQQCV 334
>gi|195375624|ref|XP_002046600.1| GJ12395 [Drosophila virilis]
gi|194153758|gb|EDW68942.1| GJ12395 [Drosophila virilis]
Length = 726
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 145/343 (42%), Gaps = 58/343 (16%)
Query: 31 VISDLREVVESVESLRG-ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQS 89
+ +RE ++ + L+G V PFGS V+ L + D+D+ +E ++ S S + ++
Sbjct: 100 CFAQVRETLQ--KQLQGRVKVYPFGSLVTGLALKDSDIDLFLEQTDTSSNSMSHRQ---- 153
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L + LR+ ++ + + HARVPI++ + ++ +S DI++ + S+F+ +
Sbjct: 154 LFNKIYNFLRRTDCFQDVFAIRHARVPIIRCKHVYSGLSLDINMSSPNSTYNSRFVAELL 213
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
D R R++ L +K WAK I +G+ SY L L++F Q LP +K +
Sbjct: 214 GRDVRMRELFLFLKLWAKKLKIIG--SGSMTSYCLITLIIFGMQQ--RRQLPSIKQL--- 266
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS-SLAHLFVSFLEKFSGLSLKA 268
A + E+ N A +S + R I S + L SF E +S + +
Sbjct: 267 -----------QARCPVLEVMGVNYA-YSFQQVRPIPASLTTLDLISSFFELYSQMEFEK 314
Query: 269 SELGICPFTG--------------------QWEHIRSNTRWLPN----NHPLFIEDPFEQ 304
L P+ G Q + + T P + ++DPFE
Sbjct: 315 KLLS--PYLGCALDLETAFSTPGKFPEYEEQLKAMHKATGEQPEPFQYQRCVCVQDPFEL 372
Query: 305 PENSARAVSEKNLAKISNAFEM-----THFRLTSTNQTRYALL 342
N +++S NL + + + RL ST Y L
Sbjct: 373 QHNVGQSISITNLCYLRECLALATKACSDKRLVSTPAKLYDYL 415
>gi|345480249|ref|XP_001607530.2| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Nasonia
vitripennis]
Length = 589
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 131/311 (42%), Gaps = 41/311 (13%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL---------LGDLLRALRQ 100
V PFGS V+ + DLD+S+ + + V Q+ + L+ +
Sbjct: 242 VLPFGSSVNGFGKQGCDLDLSVIFEEDKMEKNTSRLVFQTKSILTHEKYQMKRLMETVAD 301
Query: 101 KG-----GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRF 155
G ++ + ARVPI+KF+ + CD+++ N+ S+ L+ ++D R
Sbjct: 302 TMNIFVPGISNVRKILEARVPIIKFDHSLTRVECDLAMTNMSAYYMSELLYMYGEMDRRV 361
Query: 156 RDMVLLVKEWAKAHDIN--NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
R ++ V++WA+ + N ++SLSL+VLF Q ILP L NL
Sbjct: 362 RPLIFTVRKWAQCLKLTTKNIPGPWITNFSLSLMVLFFLQE--KKILPSL------NL-- 411
Query: 214 DLKGVRANAERQIAEI---CAFNIARFSSDKYRKI-NRSSLAHLFVSFLEKFSGLSLKAS 269
LK + +IA+I C F K KI N SL L + F F +
Sbjct: 412 -LKSCATREDIRIADIYVDCTFQRDITKIPKNNKIPNTESLEALLLEFFTYFGNFDFETK 470
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHF 329
L + + + I P L+I +P E N ++ V + L +I A +
Sbjct: 471 ALSL----REGKPISK-----PEYTALYICNPLETNLNVSKNVRFEELERIRVAMRSASW 521
Query: 330 RLTST-NQTRY 339
+L +T N+ R+
Sbjct: 522 QLEATDNEKRF 532
>gi|255710469|ref|XP_002551518.1| KLTH0A01276p [Lachancea thermotolerans]
gi|238932895|emb|CAR21076.1| KLTH0A01276p [Lachancea thermotolerans CBS 6340]
Length = 609
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 93/184 (50%), Gaps = 11/184 (5%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ R IS +R V+ + A + FGSF ++L+ D+D
Sbjct: 173 IKDFVAYISPSQEEIRVRNSTISKIRNAVKDL--WPDADLHVFGSFATDLYLPGSDIDCV 230
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I SS+G K ++ L L L+++ +++ ++ ARVPI+KF I D
Sbjct: 231 IN-------SSSGDKENRNCLYSLASFLKRRKLATQVEVISKARVPIIKFVEPISQIHID 283
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + + G ++ + W+ + G R++VL+VK++ A +N TG +S+ LV
Sbjct: 284 ISFERVNGLEAARVIRGWLKETPG-LRELVLIVKQFLAARRLNMVHTGGLGGFSIICLVY 342
Query: 190 FHFQ 193
Q
Sbjct: 343 AFLQ 346
>gi|366997671|ref|XP_003683572.1| hypothetical protein TPHA_0A00530 [Tetrapisispora phaffii CBS 4417]
gi|357521867|emb|CCE61138.1| hypothetical protein TPHA_0A00530 [Tetrapisispora phaffii CBS 4417]
Length = 678
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 94/191 (49%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD ++P RE+ + R ++I+ +++ V + S A + FGS+ ++L+ D+D
Sbjct: 180 IKDFTSYISPSREEIKLRNRIIAAIKQAVRDLWS--DADLLVFGSYATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S G K + L L LRQ +++ +A ARVPI+KF I D
Sbjct: 238 IN-------SEKGDKESRYNLYILASHLRQLNLATQVEVIAKARVPIIKFVEPKSQIHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ +NN TG +S+ LV
Sbjct: 291 VSFERTNGVEAAKLIREWLDDTPG-LRELVLVIKQFLATRRLNNVHTGGLGGFSIICLV- 348
Query: 190 FHFQTCVPAIL 200
F F P I+
Sbjct: 349 FCFLKMHPKII 359
>gi|324529009|gb|ADY48977.1| Poly(A) RNA polymerase gld-2 A, partial [Ascaris suum]
Length = 190
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 59/101 (58%), Gaps = 5/101 (4%)
Query: 110 VAHARVPILKFET--IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+ HA VPILK + +I DI+ +NL G S L ++ID RF + + +K WA+
Sbjct: 90 IVHATVPILKCRVTDVLGDIYVDINCNNLAGVYNSYLLHHYARIDSRFPALCMTIKRWAE 149
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCV--PAILPPLKDI 206
A +IN P G+ NSY++ L+++ HF C P ILP L+ +
Sbjct: 150 AANINMPMNGSLNSYTIKLMIV-HFLQCAIWPPILPNLRKL 189
>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
6260]
Length = 588
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 94/190 (49%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + TR VI L+ + + V FGS ++L+ D+D+
Sbjct: 172 IKDFVNYISPSKLEITTRNNVIGRLKSTI--TKFWPDTEVHVFGSSATDLYLPGSDIDMV 229
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ IS G + ++S L L LR K + ++ +A A+VPI+KF NI D
Sbjct: 230 V-------ISRDGDREQRSRLYQLSTHLRSKKLAKNIEVIAKAKVPIVKFVDPDSNIHID 282
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G K W++ G R++VL+VK++ ++ +NN G YS ++++
Sbjct: 283 VSFERSNGIDAAIKIREWLASTPG-LRELVLVVKQFLRSRRLNNVHVGGLGGYS-TIILC 340
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 341 YHFLKLHPRV 350
>gi|343470399|emb|CCD16892.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 406
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 87/182 (47%), Gaps = 8/182 (4%)
Query: 31 VISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
VI +L++ V + L A VE FGS VS + D DIS+ N S ++V
Sbjct: 28 VIGELQKRVLDIGMLAVNKAHVELFGSHVSGFCTPHSDADISLTYRNFSPWLQGMERVDD 87
Query: 89 SLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
+ R ++ G ++++ AR+P+++F I CD+SI N+ G SK L
Sbjct: 88 QNNKRMTRFAKEASNMGMEDVRYI-RARIPVVQFTDSVTGIHCDVSIGNVGGVENSKILC 146
Query: 147 WISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CVPAILPPL 203
I + F + LVKEW KA ++ P+ TFNS++L+ + L Q +P P
Sbjct: 147 AIRAVFPDFYGAYIHLVKEWGKAREVVAPERSTFNSFTLTTMALMVLQELGLLPVFANPT 206
Query: 204 KD 205
D
Sbjct: 207 GD 208
>gi|428161452|gb|EKX30845.1| hypothetical protein GUITHDRAFT_51872, partial [Guillardia theta
CCMP2712]
Length = 89
Score = 71.2 bits (173), Expect = 1e-09, Method: Composition-based stats.
Identities = 35/89 (39%), Positives = 48/89 (53%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
+RVPI+K + CDIS+ N K L +ID RF+ +V LVK WAKA IN
Sbjct: 1 SRVPIVKISDQTSGVHCDISMQNDLSLYKDALLRSYVKIDSRFQKLVALVKTWAKARAIN 60
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+ T NS+ +LL++ Q C P + P
Sbjct: 61 DAAAHTLNSFGYTLLIIQFLQVCSPPVFP 89
>gi|302828782|ref|XP_002945958.1| hypothetical protein VOLCADRAFT_102915 [Volvox carteri f.
nagariensis]
gi|300268773|gb|EFJ52953.1| hypothetical protein VOLCADRAFT_102915 [Volvox carteri f.
nagariensis]
Length = 992
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/208 (25%), Positives = 91/208 (43%), Gaps = 56/208 (26%)
Query: 10 ILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDI 69
+L+ ++ P R+D+ R+ +I+ + ++ V +++ V P+GSFVS ++ DLD+
Sbjct: 18 VLESLVQACTPTRDDYNKRLALITRIEAALQRVGNVQNLRVIPYGSFVSQFYNSTSDLDL 77
Query: 70 SI---------------ELSNGSC------ISSAGKKVKQSLLGDLLRALRQKGGYRR-- 106
++ E+ G+ + ++ K+ LL D+ L G RR
Sbjct: 78 AVCGTIPVDCLKPGSRAEIFGGAAEEEDVSVHKLDQRTKRQLLRDVGLRLLAAGLARRGC 137
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
++F+ HARVPI+KF I VK WA
Sbjct: 138 VEFILHARVPIIKFADPSSGIE---------------------------------VKLWA 164
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQT 194
KAH IN+ + NS+ L+L+V+ QT
Sbjct: 165 KAHCINDGASHMLNSWCLTLVVISFLQT 192
>gi|154283983|ref|XP_001542787.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150410967|gb|EDN06355.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 974
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 97/226 (42%), Gaps = 40/226 (17%)
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
G+ L + GG R+ V+HARVPI+K ++CD++++N ++ + +I
Sbjct: 116 GNKLCSSDSDGGMERVVCVSHARVPIVKIWDPELRLACDMNVNNTLALENTRMIRTYVEI 175
Query: 152 DGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLFHFQTCVPAILP--------- 201
D R R + ++VK W K +N+ GT +SY+ L++ QT P ILP
Sbjct: 176 DERVRQLAMIVKYWTKRRILNDAALGGTLSSYTWICLIINFLQTRNPPILPSLQERRAKQ 235
Query: 202 PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
P K PG+ DD + ++ F + N+SSL L F ++
Sbjct: 236 PKKADDPGSSFDD----------DLEKLTGFG----------QENKSSLGELLFQFF-RY 274
Query: 262 SGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPEN 307
G + E + W H+ N R L +E+PF N
Sbjct: 275 YGHEVDY-ETKVMSEGKGW-HLLQNNR-------LCVEEPFNTSRN 311
>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC
6260]
Length = 588
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 94/190 (49%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + TR VI L+ + + V FGS ++L+ D+D+
Sbjct: 172 IKDFVNYISPSKLEITTRNNVIGRLKSTI--TKFWPDTEVHVFGSSATDLYLPGSDIDMV 229
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ IS G + ++S L L LR K + ++ +A A+VPI+KF NI D
Sbjct: 230 V-------ISRDGDREQRSRLYQLSTHLRSKKLAKNIEVIAKAKVPIVKFVDPDSNIHID 282
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G K W++ G R++VL+VK++ ++ +NN G YS ++++
Sbjct: 283 VSFERSNGIDAAIKIREWLASTPG-LRELVLVVKQFLRSRRLNNVHVGGLGGYS-TIILC 340
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 341 YHFLKLHPRV 350
>gi|449296924|gb|EMC92943.1| hypothetical protein BAUCODRAFT_77162 [Baudoinia compniacensis UAMH
10762]
Length = 618
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 96/239 (40%), Gaps = 42/239 (17%)
Query: 117 ILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
+L F I DI+ N G ++ L S D R R MVL VK WAK IN+ +
Sbjct: 231 LLDFPKSGIGIQSDINFFNPLGLHNTQLLRCYSLCDQRVRPMVLFVKSWAKRRKINSSYS 290
Query: 177 GTFNSYSLSLLVLFHF-QTCVPAILPPLKDIY--------PGNLVDDLKGVRANAERQIA 227
GT +SY ++VL + P +LP L+ + PG ++ G + R
Sbjct: 291 GTLSSYGYVMMVLHYLVNVAQPPVLPNLQAPWRPNRSCTPPGASTTEVDGWIVDFWRNED 350
Query: 228 EIC-AFNIARFSSDKYRKINRSSLAHLFVSFLEKFS---------------------GLS 265
EI A + S NR SL L + F E +S GL
Sbjct: 351 EILGAVRNGQLSQ------NRESLGSLLLGFFEYYSSQGYGPRFQWMQDVLSLRSPGGLL 404
Query: 266 LKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
KA + + T + E + R+L IEDPFE N AR V+ + + I + F
Sbjct: 405 GKAEKGWVKAITEEGEGKKVQHRYL-----FCIEDPFELSHNVARTVTHQGIVAIRDEF 458
>gi|342179839|emb|CCC89313.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 406
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 87/182 (47%), Gaps = 8/182 (4%)
Query: 31 VISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
VI +L++ V + L A VE FGS VS + D DIS+ N S ++V
Sbjct: 28 VIGELQKRVLDIGMLAVNKAHVELFGSHVSGFCTPHSDADISLTYRNFSPWLQGMERVDD 87
Query: 89 SLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
+ R ++ G ++++ AR+P+++F I CD+SI N+ G SK L
Sbjct: 88 QNNKRMTRFAKEASNMGMEDVRYI-RARIPVVQFTDSVTGIHCDVSIGNVGGIENSKILC 146
Query: 147 WISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CVPAILPPL 203
I + F + LVKEW KA ++ P+ TFNS++L+ + L Q +P P
Sbjct: 147 AIRAVFPDFYGAYIHLVKEWGKAREVVAPERSTFNSFTLTTMALMVLQELGLLPVFANPT 206
Query: 204 KD 205
D
Sbjct: 207 GD 208
>gi|426199822|gb|EKV49746.1| hypothetical protein AGABI2DRAFT_63272 [Agaricus bisporus var.
bisporus H97]
Length = 481
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/307 (27%), Positives = 123/307 (40%), Gaps = 56/307 (18%)
Query: 44 SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG 103
+ G+ V PFGS+ + L+ GD+D+ I +S+ S+ K S+L L LR+ G
Sbjct: 178 AFSGSKVFPFGSYETKLYLPSGDIDLVI-VSDSMAYSN-----KSSVLHSLASVLRRAGI 231
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDN----LCGQIKSKFLFWISQIDGRFRDMV 159
+ +A A+VPI+KF TIH + DISI+ + GQ+ FL + R +V
Sbjct: 232 ASNVTVIAKAKVPIVKFVTIHGRFNVDISINQTNGIVGGQVIKGFLQNLVTGGLALRSLV 291
Query: 160 LLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVR 219
L+ K + +N TG SYS+ L + Q ++P
Sbjct: 292 LITKLFLSQRSMNEVFTGGLGSYSIVCLAISFLQ------------MHP----------- 328
Query: 220 ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ 279
I R D + +L L + F E + G E+GI G
Sbjct: 329 -------------KIRRGEIDPEK-----NLGVLVMEFFELY-GCHFNYDEVGISVRDGG 369
Query: 280 WEHIRSNTRWL-PNNHP-LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQT 337
+ W P+ L IEDP + P N + S AK+ F H LTST
Sbjct: 370 TYFNKRMRGWYNPDKRAGLCIEDPVD-PTNDISSGS-FGFAKVRTTFAGAHGILTSTAYN 427
Query: 338 RYALLSS 344
R L +
Sbjct: 428 RATYLDA 434
>gi|324506764|gb|ADY42880.1| Terminal uridylyltransferase 4 [Ascaris suum]
Length = 611
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 135/323 (41%), Gaps = 39/323 (12%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS V+ D+DIS + S ++ L L Q G + + +
Sbjct: 315 FGSIVNGFGVIGSDVDISFRFGSDK---SPEDFDADDVIMKLAEVLSQIAGIVDVYAIPN 371
Query: 113 ARVPILKF--ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
A+VPI+KF E + D+S+ N ++ L S+ID R R + ++K+W+
Sbjct: 372 AKVPIVKFKYEDTLYHFESDLSLYNALALENTRLLREYSEIDKRVRPLGTMLKKWSSYCG 431
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEIC 230
I G +SY+L ++++ Q P +LP L+ + R ++ +
Sbjct: 432 IRGASCGKLSSYALIVMLIHFLQRTTPPVLPFLQ-----------QAQRYGRPKECRIVD 480
Query: 231 AFNIARFSSDK--YRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQW 280
+++ ++ + ++ N S + L++ FL ++ + ++ SE P
Sbjct: 481 GWDVYFCNAAEVGWKVENAESTSQLWLGFLGYYAKHFDFESMVVQIRMSE----PVN--- 533
Query: 281 EHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYA 340
+ RWL P+ IEDPF+ N + V A I ++F +H R ++
Sbjct: 534 ---KLQKRWLW--RPMAIEDPFDLDHNLSNGVHWDTFAYIKDSFLRSHKRFALLKYSKRQ 588
Query: 341 LLSSLARPFILQFF-GESPVRYA 362
+LSS F+ F G PV A
Sbjct: 589 VLSSELHDFLADVFSGCRPVDDA 611
>gi|156096867|ref|XP_001614467.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803341|gb|EDL44740.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 567
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 139/300 (46%), Gaps = 41/300 (13%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-KGGY 104
+ V PFGS ++ ++R D+DI I++ +K + + L + L G
Sbjct: 272 KNCHVTPFGSIINGFWTRNSDIDICIQIP-----ILLSRKDQITFLKKICLILNSFNDGI 326
Query: 105 RRLQFVAHARVPILKF--ETIHQN--ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+F A+VPI+ F +++ + +SCDIS++N+ + SK + ID R + M +
Sbjct: 327 IEQRF--SAKVPIIHFYCKSLRHSFELSCDISVNNILAVVNSKLIQKYVSIDKRLQLMGI 384
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVR 219
+K W+K +IN+ G +S+SL L+++ Q P IL L+DI +
Sbjct: 385 ALKYWSKNRNINDRSKGFLSSFSLILMIIHFLQYVTEPKILTSLQDI----------SFK 434
Query: 220 ANAE--RQIAEICAF----NIARFSSDKYRKINRSSLAH-----LFVSFLEKFSGLSLKA 268
N + + C F N+ R ++ R+IN + + L + F KF G K+
Sbjct: 435 RNEKPFYVMGVDCKFCQDENVIR---EELRRINNYNDVYVDTSTLLIEFF-KFFGYKYKS 490
Query: 269 SELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTH 328
+ I +++ ++ + ++ LF+++PFE +N A + + N I N + +
Sbjct: 491 GIIAIRDINDYYQNFQAVRSY--ESYFLFVDNPFEVGKNVANVLPQ-NYKTIVNEMKRAY 547
>gi|340052136|emb|CCC46407.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 406
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 81/164 (49%), Gaps = 6/164 (3%)
Query: 41 SVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ 100
V ++ A VE FGS VS + D DIS+ N S ++V + + R ++
Sbjct: 40 GVLAVNKAHVELFGSHVSGFCTPTSDADISLTYRNFSPWLQGMERVDEQNNKRMTRFGKE 99
Query: 101 KG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
G ++++ AR+P+++F I CD+SI N+ G SK L I QI F
Sbjct: 100 AAAMGMENVRYI-RARIPVVQFTDSVTGIHCDVSIGNVGGVENSKILAAIRQIYPDFYGA 158
Query: 159 VL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
+ LVK W KA ++ P+ TFNS++++ + L Q +LP
Sbjct: 159 YIHLVKAWGKAREVIAPERSTFNSFTVTTMALMVLQEL--GLLP 200
>gi|225562120|gb|EEH10400.1| PAP/25A associated domain family [Ajellomyces capsulatus G186AR]
Length = 1079
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 96/215 (44%), Gaps = 35/215 (16%)
Query: 18 LNPLRE--DWETRMKVISDLREVV------ESVESLRGATVEPFGSFVSNLFSRWGDLDI 69
L PL++ D K+ D+RE+ E ES R V+ + L +W +I
Sbjct: 101 LGPLKDQLDSADEKKLSGDMRELYDRLLPSEESESRRLKFVDKLENL---LNKQWPGNNI 157
Query: 70 SIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
+ + S+G K+ S GG R+ V+HARVPI+K ++C
Sbjct: 158 RVHV-----FGSSGNKLCSS---------DSDGGMERVVCVSHARVPIVKIWDPELRLAC 203
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W K +N+ GT +SY+ L+
Sbjct: 204 DMNVNNTLALENTRMIRTYVEIDERVRQLAMIVKYWTKRRILNDAALGGTLSSYTWICLI 263
Query: 189 LFHFQTCVPAILP---------PLKDIYPGNLVDD 214
+ QT P ILP P K PG+ DD
Sbjct: 264 INFLQTRNPPILPSLQERRAKQPKKADDPGSSFDD 298
>gi|242212981|ref|XP_002472321.1| predicted protein [Postia placenta Mad-698-R]
gi|220728598|gb|EED82489.1| predicted protein [Postia placenta Mad-698-R]
Length = 1512
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/346 (22%), Positives = 138/346 (39%), Gaps = 73/346 (21%)
Query: 5 NVLEPILKDI---LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + +D+ + ++P E+ E R V++ + V + A V PFGS+ + L+
Sbjct: 155 NVAEMLHRDVEAFVKYISPTPEEDEVRSLVVTLISRAV--TRAFPDAQVLPFGSYETKLY 212
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
G+ K+S+L L +++ G R++ +A A+VPI+KF
Sbjct: 213 LPIGN--------------------KESVLHALANTVKRAGITDRVKIIAKAKVPIVKFV 252
Query: 122 TIHQNISCDISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
T H + S DIS++ G K + +++++ R ++L++K + +N TG
Sbjct: 253 TTHGHFSVDISVNQGNGVTAGKMIKHYLAELPA-LRSLILVIKSFLSQRSMNEVYTGGLG 311
Query: 181 SYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
SYS+ L + Q ++P I R D
Sbjct: 312 SYSIVCLAISFLQ------------MHP------------------------KIRRGEID 335
Query: 241 KYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFI 298
R +L L + F E + G E+GI G ++ WL P L I
Sbjct: 336 PSR-----NLGVLVMEFFELY-GCYFNYHEVGISLLDGGTYFNKAERGWLDYGQPKLLSI 389
Query: 299 EDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSS 344
EDP + + +R + K+ H +T+ + ++SS
Sbjct: 390 EDPGDPTNDISRG--SYGIVKVRTTLAGAHGIMTAAAYMQAGIMSS 433
>gi|221055315|ref|XP_002258796.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193808866|emb|CAQ39569.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 548
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/300 (24%), Positives = 140/300 (46%), Gaps = 41/300 (13%)
Query: 46 RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-KGGY 104
+ V PFGS ++ ++R D+DI I++ +K + + L + L G
Sbjct: 253 KNCHVTPFGSIINGFWTRNSDIDICIQIP-----ILLSRKDQITFLKKICLILNNFNDGI 307
Query: 105 RRLQFVAHARVPILKF--ETIHQN--ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+F A+VPI+ F +++ + +SCDIS++N+ I SK + ID R + M +
Sbjct: 308 IEQRF--SAKVPIIHFYCKSLRHSFELSCDISVNNILAVINSKLIQKYVSIDRRLQLMGI 365
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVR 219
+K W+K+ +IN+ G +S+SL L+++ Q P IL ++DI +
Sbjct: 366 ALKYWSKSRNINDRSKGFLSSFSLILMIIHFLQYVAEPKILTSIQDI----------SFK 415
Query: 220 ANAE--RQIAEICAF----NIARFSSDKYRKINRSSLAH-----LFVSFLEKFSGLSLKA 268
N + + C F N+ R ++ R+IN + + L + F KF G K+
Sbjct: 416 RNEKPFYVMGVDCKFCQDENVIR---EELRRINNYNDVYVDTSTLLIEFF-KFFGYKYKS 471
Query: 269 SELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTH 328
+ I +++ ++ + ++ LF+++PFE +N A + + N I N + +
Sbjct: 472 GIIAIRDINDYYQNFQAVRSY--ESYFLFVDNPFEVGKNVANVLPQ-NYKTIVNEMKRAY 528
>gi|367007982|ref|XP_003688720.1| hypothetical protein TPHA_0P01280 [Tetrapisispora phaffii CBS 4417]
gi|357527030|emb|CCE66286.1| hypothetical protein TPHA_0P01280 [Tetrapisispora phaffii CBS 4417]
Length = 504
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 98/191 (51%), Gaps = 12/191 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++D + ++P R + E R + I+++R ++ E A + FGS+ ++L+ D+D
Sbjct: 130 IRDFVSYISPNRTEIEMRNQTINNIRNSIK--EHWPDADLHVFGSYATDLYLPGSDIDCV 187
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S+ G K +SL+ L L++KG + +A+ARVPI+KF I D
Sbjct: 188 IN-------SNKGDKGSRSLMYSLASFLKRKGLATDITIIANARVPIIKFVEPVSGIHID 240
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G + + W++ R++VL+VK++ + +N+ TG +S+ LV
Sbjct: 241 VSFERDNGLDAANIIRSWLTSTPS-LRELVLIVKQFLNSRRLNDVHTGGLGGFSIICLV- 298
Query: 190 FHFQTCVPAIL 200
+ F + P I+
Sbjct: 299 YSFLSLHPRII 309
>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
HHB-10118-sp]
Length = 478
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 94/196 (47%), Gaps = 19/196 (9%)
Query: 5 NVLEPILKDI---LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + K++ + ++P +E+ E R ++ + V ++ A V PFGS+ + L+
Sbjct: 148 NVAEMLHKEVEAFVKYISPTQEEDEIRSLIVESISRAV--TKAFPDARVLPFGSYETKLY 205
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ IE S K ++L L +++ G ++ +A A+VPI+KF
Sbjct: 206 LPLGDIDLVIESD------SMAYNNKVNVLQALATTMKRAGITDKVTIIAKAKVPIIKFV 259
Query: 122 TIHQNISCDISIDNL----CGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
T H S DIS++ + G + +FL I + +VL+ K + +N TG
Sbjct: 260 TRHGRFSVDISLNQMNGVKAGTMIKRFLDHIPALQA----LVLITKSFLSQRSMNEVFTG 315
Query: 178 TFNSYSLSLLVLFHFQ 193
SYS+ L + Q
Sbjct: 316 GLGSYSIVCLAISFLQ 331
>gi|156843407|ref|XP_001644771.1| hypothetical protein Kpol_1020p21 [Vanderwaltozyma polyspora DSM
70294]
gi|156115421|gb|EDO16913.1| hypothetical protein Kpol_1020p21 [Vanderwaltozyma polyspora DSM
70294]
Length = 640
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 99/199 (49%), Gaps = 14/199 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ E R IS +R ++ + S + + FGS+ ++L+ D+D
Sbjct: 154 IKDFVAYISPSKEEIEIRNVTISKIRNSLKELWS--DSDLHVFGSYATDLYLPSSDIDCV 211
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ NG K ++ L L ++ G +++ +A ARVPI+KF + + D
Sbjct: 212 VNSENGD-------KENRNDLYSLATHFKRNGLAIQVEVIAKARVPIIKFVEPNSKLHID 264
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + L G +K + W+ G R++VL+VK++ A +NN TG S+ LV
Sbjct: 265 ISFERLNGLEVAKIIREWLDDTPG-LRELVLIVKQFLAARRLNNVHTGGLGGLSIICLV- 322
Query: 190 FHFQTCVPAILPPLKDIYP 208
+ F P I+ DI P
Sbjct: 323 YSFLKLHPRIIT--NDIDP 339
>gi|327274653|ref|XP_003222091.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Anolis
carolinensis]
Length = 574
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 95/220 (43%), Gaps = 27/220 (12%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q + +AR P++KF CD++ +N ++ L+ +D R R +V V
Sbjct: 296 GCVSVQKILNARCPLVKFSHQPSGFQCDLTANNRIAMRSTELLYIYGSLDPRVRALVFGV 355
Query: 163 KEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
+ WA+ H I + G + +++L+ +VLF Q P I+P L D LKG+
Sbjct: 356 RCWARTHGITSSIPGPWITNFALTTMVLFFLQKRQPPIVPTL---------DHLKGLADA 406
Query: 222 AERQIAE--ICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFT 277
++ I E C F N+ + + N +L L F E F + L I
Sbjct: 407 EDKHIVEGYDCTFVSNLNKIKPTE----NTETLDVLLGEFFEYFGNFAFNKHSLNI---- 458
Query: 278 GQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+ + P L I++PFEQ N ++ V+ L
Sbjct: 459 -----RKGKEQNKPEASALHIQNPFEQSLNISKNVNATQL 493
>gi|341881648|gb|EGT37583.1| CBN-MUT-2 protein [Caenorhabditis brenneri]
Length = 397
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 126/327 (38%), Gaps = 85/327 (25%)
Query: 24 DWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
+ + ++ DL+ ++ + + P GS V+ L +R DLD++I I
Sbjct: 59 ELDRKIAFCEDLQSTIQRINPTWNFRIVPTGSSVTGLATRNSDLDVAIH------IPQVA 112
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
+ V++ G L+ A + K
Sbjct: 113 RIVEEMCSGRLVTA-------------------------------------------EEK 129
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC--VPAILP 201
+ W + D RF + +VK+WA + + NPK G FNSY+L LLV+ HF C P ILP
Sbjct: 130 LVMW-RECDDRFAPLCFVVKKWADSTGVKNPKDGGFNSYALVLLVI-HFLQCGTSPPILP 187
Query: 202 PLKDIYPG-NLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEK 260
L IY G N + A +E E N +S+AHLF FL
Sbjct: 188 NLIKIYKGMNFI-------AQSEHDFPERLDLEAPFPKPLPTFSPNDASIAHLFFEFLNY 240
Query: 261 FSGLSLKASELGI---CPFTGQWEHIRS----------NTRWLP------NNHPLFIEDP 301
+SG + + I C ++ + + + LP ++IEDP
Sbjct: 241 YSGFKFDENYISIKDACVYSRSLIFVGNCSVCELFCFRKSSILPEAVRNQKQKQVYIEDP 300
Query: 302 FEQPENSARAV-SEKNLAKISNAFEMT 327
F+ N R V S + L KI F+MT
Sbjct: 301 FDS-HNPGRTVRSIRTLKKI---FKMT 323
>gi|431902879|gb|ELK09094.1| Terminal uridylyltransferase 7 [Pteropus alecto]
Length = 1332
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 80/164 (48%), Gaps = 13/164 (7%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSL-----LGDLLRALRQK 101
G + FGS + + DLD+ C++ G + + L + +L R L++
Sbjct: 940 GTKLSLFGSSKNGFGFKQSDLDV--------CMTINGLETAEGLDCVRTIEELARVLKKH 991
Query: 102 GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
G R + + A+VPI+KF + + DIS+ N ++ L S ID R + +
Sbjct: 992 SGLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYT 1051
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD 205
+K + K DI + G+ +SY+ +L+VL+ Q P ++P L++
Sbjct: 1052 MKVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQE 1095
>gi|327298301|ref|XP_003233844.1| PAP/25A associated domain-containing protein [Trichophyton rubrum
CBS 118892]
gi|326464022|gb|EGD89475.1| PAP/25A associated domain-containing protein [Trichophyton rubrum
CBS 118892]
Length = 1035
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 82/168 (48%), Gaps = 13/168 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K++ L P E E R+K + L +++++ V FGS + L + D+DI
Sbjct: 122 IKELYQKLLPSPESEERRVKFVRKLEKLLDTQWPGNEIKVNVFGSSGNKLCTSDSDVDI- 180
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K + +L D L K G R+ V+HA+VPI+K ++C
Sbjct: 181 -------CITTPSKCFEPVCVLADFL----AKSGMERVVCVSHAKVPIVKIWDPELQVAC 229
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
D++++N ++ + ++D R R + +LVK W K +N+ + G
Sbjct: 230 DMNVNNTLALENTRMIKTYVELDDRIRPLAMLVKHWTKRRILNDAEKG 277
>gi|242002656|ref|XP_002435971.1| zinc finger protein, putative [Ixodes scapularis]
gi|215499307|gb|EEC08801.1| zinc finger protein, putative [Ixodes scapularis]
Length = 1047
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 93/193 (48%), Gaps = 7/193 (3%)
Query: 2 GSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
G +L + D++ +P + + R ++ +L ++ + S A + +GS +
Sbjct: 601 GHIMILNDVCLDVMRECSPRPHEEKDRNMLLHELESLIRELYS--DARLTLYGSSCNGFG 658
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQS-LLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
DLDI + + S GK++ S ++ +L + LR R+ + A+VPI+KF
Sbjct: 659 LARSDLDICLTFDS----SKDGKELCHSKMIPELAKKLRAHPDLDRIVPITTAKVPIVKF 714
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFN 180
+ DIS+ N Q ++ L S ID R R + +K +AK DI + G+ +
Sbjct: 715 YHRPSRLEGDISLYNTLAQHNTRLLKVYSDIDKRVRVLGYTLKHFAKTCDIGDASRGSLS 774
Query: 181 SYSLSLLVLFHFQ 193
SY+ L+VL++ Q
Sbjct: 775 SYAYILMVLYYLQ 787
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 131/295 (44%), Gaps = 32/295 (10%)
Query: 30 KVISDLREVVESVE-SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
K+ + LRE + ++ +L G++V FG + D +++++LS S GK
Sbjct: 96 KLEASLRETLPDIKLTLHGSSVNGFGLY---------DSEVNLDLS------STGKTEVA 140
Query: 89 SLLGDLLRALRQ-KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
LL +L + Q + + + A+VP +F ++ C+IS++N S+ L
Sbjct: 141 ELLVELSDKITQDEDNFSSPERDFLAKVPRFRFVDGPTDLKCEISLNNSNSIKTSRLLAD 200
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIY 207
+ +D R + + ++ + W ++ + GT ++ ++V++ Q C P ++P L
Sbjct: 201 YASLDPRVQSLGVIFRYWGHVCKLDRQERGTLPPHAYPIMVIYFLQQCKPPVVPRLG--- 257
Query: 208 PGNLVDDLKGVRA-----NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
P +++ L + + N + ++ + K++ N SL L+ L +F
Sbjct: 258 PPPVLEGLGELPSFVLMRNCLLTELLLLMLHVQEW---KWKSTNNRSLGDLWCELL-RFY 313
Query: 263 GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+ + +C + I S +W N + IEDP+ N AR++ + +
Sbjct: 314 AAEFRLEKHVVCIRRLKPVLI-SEKKW--NKRYIAIEDPYSSKRNLARSIPSEEM 365
>gi|124806440|ref|XP_001350723.1| nucleotidyltransferase, putative [Plasmodium falciparum 3D7]
gi|23496850|gb|AAN36403.1| nucleotidyltransferase, putative [Plasmodium falciparum 3D7]
Length = 469
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 81/166 (48%), Gaps = 14/166 (8%)
Query: 44 SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG 103
+L+G + GS +N++ + D+D CI + K S L +L+ ++
Sbjct: 163 NLKGK-IYFIGSCENNIWIKNSDID--------CCIVVENCEDKNSYLY-ILKVIKSAIN 212
Query: 104 --YRRLQF-VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
Y L + A VPI K NI CDISI+N + +KF+ I ID R +
Sbjct: 213 LIYPSLTINIIKASVPIAKIYKEETNI-CDISINNTVAIVNTKFVSSICNIDERVTIINR 271
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
++K WAK +INN GTF+SY+L LL + FQ +LP K I
Sbjct: 272 IIKYWAKQKNINNRSQGTFSSYALFLLTYYFFQNINNPLLPSYKSI 317
>gi|197101725|ref|NP_001124898.1| U6 snRNA-specific terminal uridylyltransferase 1 [Pongo abelii]
gi|55726283|emb|CAH89913.1| hypothetical protein [Pongo abelii]
Length = 519
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 72/142 (50%), Gaps = 3/142 (2%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 344 GDLGKAPELAETPKQEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 401
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 402 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 460
Query: 185 SLLVLFHFQTCVPAILPPLKDI 206
+LLV++ QT P +LP + ++
Sbjct: 461 TLLVIYFLQTRDPPVLPTVSEL 482
>gi|321477215|gb|EFX88174.1| hypothetical protein DAPPUDRAFT_311781 [Daphnia pulex]
Length = 1342
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/307 (24%), Positives = 137/307 (44%), Gaps = 29/307 (9%)
Query: 21 LREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCIS 80
++++ ET ++ E ++V S+ G++ FG + D D+ + L+ S
Sbjct: 889 IQKELETHIRT-----EYADAVLSMFGSSCNGFG---------FADSDVDLCLTFESNED 934
Query: 81 SAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
G +++ L R L++ + + ++ A+VPI+K + DIS+ N G+
Sbjct: 935 GKGLDFV-AIVKKLARTLKRNRFFCDIVPISSAKVPIVKLRHRPTGLESDISLYNQLGRR 993
Query: 141 KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAIL 200
SK L ID R R + + K AK +I + G+ +SY+ +L+V+ + Q P ++
Sbjct: 994 NSKLLATYCAIDSRVRILGYMAKLLAKQCEIGDASRGSLSSYAYTLMVIHYLQQVKPPVI 1053
Query: 201 PPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEK 260
P L++I VD K + + E A ++ R + NR + L++ FL
Sbjct: 1054 PVLQEITVDG-VDRKKYIVEGWDTWFFE-NAQDLGRVWP--HHGANREPPSTLWLGFLLY 1109
Query: 261 FSGL---SLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
++ L L+ + P + E + W + P+ IEDPF+ N VS +
Sbjct: 1110 YAKLFDYRLQVVSIRQLPPLYRLEKM-----W--TDRPIAIEDPFDLSHNLGSGVSLRMG 1162
Query: 318 AKISNAF 324
A I F
Sbjct: 1163 AYIRRVF 1169
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 63/300 (21%), Positives = 127/300 (42%), Gaps = 31/300 (10%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
V+ ++DI +ED E R LR+++E+ + G ++ P+GS V+ L R
Sbjct: 465 VVGQFIQDIFHCSGLTKEDLEKRRVATEHLRKMIET--AFPGFSIRPYGSVVTELGLRAS 522
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVA---HARVPILKFET 122
+ SN + + K LL + + Q ++++ + R PI+ F
Sbjct: 523 N-------SNVGLVEISPDIKKDGLLIIKILSWLQGPASNTFRYISDDLNCRTPIIHFHY 575
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
++ + +++ + L ID RF + + + +A+ ++ P+ G+ S+
Sbjct: 576 EQMGVTFAMVVESEAAHKTTVLLQQYRMIDPRFAVLTVAFRTFARICCLDQPELGSLPSH 635
Query: 183 SLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKY 242
+ +L+VL++ Q +LP L + + DD + F + +
Sbjct: 636 AFTLMVLYYLQQ--DHVLPVLHQLKKSDAEDDY----------LTAAEFFEMKDRGEWEP 683
Query: 243 RKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
R++N L L++ L KF + +E +C + + +RS W N + +EDPF
Sbjct: 684 RELN---LGQLWLGLL-KFYAYTFTHNEHAVCVRSIE-PLLRSTKNW--GNRRIAVEDPF 736
>gi|380494577|emb|CCF33047.1| hypothetical protein CH063_05313 [Colletotrichum higginsianum]
Length = 1068
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 111/246 (45%), Gaps = 26/246 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P + E R K++ L ++ V FGS + L S D+DI
Sbjct: 40 MRELYDRLTPTEKVEENRQKLVVKLEKIFNEEWPGNDIRVHLFGSSGNLLCSDDSDVDI- 98
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ ++ DLL + G +++ ++ A+VPI+K ++C
Sbjct: 99 -------CITTPWKELEGVCVIADLL----ARKGMKKVVCISAAKVPIVKIWDPELGLAC 147
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W + +N+ GT +SY+ L+
Sbjct: 148 DMNVNNTLALENTRMVRTYVEIDPRVRPLAMIVKYWTRQRIVNDAAFGGTLSSYTWICLI 207
Query: 189 LFHFQTCVPAILP----------PLKDIYPGNLVDDLKGVRANAERQIAEICA--FNIAR 236
+ Q P +LP P + DDL +R ++ A + F R
Sbjct: 208 IGFLQLRDPPVLPSLHQRQHQRLPKRGGQESAFADDLDKLRGFGDKNKASLGELLFQFFR 267
Query: 237 FSSDKY 242
F + ++
Sbjct: 268 FYAHEF 273
>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
Length = 464
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 89/192 (46%), Gaps = 11/192 (5%)
Query: 5 NVLEPILKDI---LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + +D+ + ++P + E R ++ + V ++ A V PFGS+ + L+
Sbjct: 136 NVAEMLQRDVEAFIDYISPTPAEDEIRGLIVQLISRAV--TQAFPDAQVLPFGSYETKLY 193
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ I+ S K ++L L +R+ G R+ VA A+VPI+KF
Sbjct: 194 LPLGDIDLVIQSP------SMAYSDKVTVLHALANTMRRAGITDRVTIVAKAKVPIIKFI 247
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS 181
T H + DIS++ G K + + R +V++ K + +N TG S
Sbjct: 248 TTHGRFAVDISLNQTNGVAAGKMINRYLRELPALRGLVMITKAFLSQRSMNEVYTGGLGS 307
Query: 182 YSLSLLVLFHFQ 193
YS+ L + Q
Sbjct: 308 YSIVCLAISFLQ 319
>gi|159478216|ref|XP_001697200.1| trf4 poly(A) polymerase [Chlamydomonas reinhardtii]
gi|158274674|gb|EDP00455.1| trf4 poly(A) polymerase [Chlamydomonas reinhardtii]
Length = 1144
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 98/187 (52%), Gaps = 17/187 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
L ++ ++ P E+ R ++ +REVV S+ A VE FGS+ + L+ D+D+
Sbjct: 312 LVELCELVQPTPEEAAARAGAVAAVREVVGSI--WPSARVEVFGSYATGLYVPTSDVDLV 369
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI-HQNISC 129
I L +G AG L L +AL ++G + +Q + ARVPI+KFET+ + N++
Sbjct: 370 I-LDSGCTNIQAG-------LQALAKALTKRGVGQSIQVIGKARVPIVKFETVDYGNLAF 421
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRF---RDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
D+S D G ++ + ++ GR+ R ++L +K + + ++N TG SY+L
Sbjct: 422 DVSFDVANGPQAAEL---VKEMTGRWPMMRPLILALKLFLQQRELNEVYTGGIGSYALIT 478
Query: 187 LVLFHFQ 193
LV Q
Sbjct: 479 LVSAFLQ 485
>gi|440631915|gb|ELR01834.1| hypothetical protein GMDG_00933 [Geomyces destructans 20631-21]
Length = 1241
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/221 (22%), Positives = 101/221 (45%), Gaps = 24/221 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++ + L P E R + + L ++ + V FGS + L + D+DI
Sbjct: 278 MEKLYAELLPTDESEAKRQRFVQKLEHLLNTEWPGHDIKVHVFGSSGNLLCTDESDVDI- 336
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K++++ +L DLL + G ++ V+ A+VPI+K ++C
Sbjct: 337 -------CITTEWKELERVCMLADLL----YRNGMTKVNCVSTAKVPIVKIWDPELGLAC 385
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + Q+D R R + +++K W K +N+ GT +SY+ ++
Sbjct: 386 DMNVNNTLALENTRMIKTYVQVDPRVRPLAMIIKHWTKRRILNDAAYGGTLSSYTWICMI 445
Query: 189 LFHFQTCVPAILP----------PLKDIYPGNLVDDLKGVR 219
+ Q P +LP P D + DD++ ++
Sbjct: 446 INFLQLQDPPVLPVLHERQHQRLPQADGHESAFADDVEALQ 486
>gi|328793493|ref|XP_003251886.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like [Apis mellifera]
Length = 656
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 137/301 (45%), Gaps = 29/301 (9%)
Query: 53 FGSFVSNLFSRWGDLDISIELS---NGSCISSAGKKVKQSLLGDLLRAL-RQKGGYRRLQ 108
FGS + L + DLDI +++ N S +S + + + + R + +
Sbjct: 153 FGSSQTGLGFKECDLDIYMDIGEPINESKNTSTDSWTMNKIFKSVKKIMYRMNDVFSNII 212
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ A+ PI+KF N+SCDIS N+ G KS + + +D R + +++++K WA+
Sbjct: 213 GIPKAKTPIIKFYYNRTNVSCDISFKNILGIYKSYLIKYCLSLDNRLKPLMMIIKYWARH 272
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
I+N + ++Y+L LL++F+ Q I+PPL + + G +AN +
Sbjct: 273 FKISNGQ--KISNYALVLLIIFYLQQPSVNIIPPLMVLQNTCQPQIINGWQANFDENTV- 329
Query: 229 ICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTR 288
+ + N++S+ L F ++ K+ ICP G H S +
Sbjct: 330 -----LPPIT-------NKNSVPQLLHGFFFFYATFEYKSQ--VICPIDGMI-HTESEFK 374
Query: 289 WLPNNHPLFIE--DPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLA 346
+ N PL + + + +++ + + K + + + E++H T T+++ L+SL
Sbjct: 375 EIE-NLPLCMNRYKTYIKEDDNLKFIVNKPMC-VQDPIELSH---NVTANTKFSTLNSLV 429
Query: 347 R 347
+
Sbjct: 430 K 430
>gi|312070196|ref|XP_003138034.1| PAP/25A associated domain-containing protein [Loa loa]
gi|307766804|gb|EFO26038.1| PAP/25A associated domain-containing protein [Loa loa]
Length = 664
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 80/158 (50%), Gaps = 7/158 (4%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ + D+D+ + ++N V +L + AL + +Q + A
Sbjct: 134 GSSLNGFGTNSSDMDLCLMITNKDLDQRTDAVV---VLNMIQSALAETKWVSHMQLIL-A 189
Query: 114 RVPILK--FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+VPIL+ F +I+ D++ +N + L + S D R R +V +VKEWAK DI
Sbjct: 190 KVPILRIRFYEPFTDITVDLNANNSVAIRNTHLLCYYSSFDWRVRPLVSVVKEWAKRRDI 249
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVP-AILPPLKDIYP 208
N+ +F SYSL L+V+ + Q + ILP L+ +YP
Sbjct: 250 NDANRSSFTSYSLVLMVIHYLQCGLKQPILPSLQVVYP 287
>gi|170592965|ref|XP_001901235.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158591302|gb|EDP29915.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 664
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 80/158 (50%), Gaps = 7/158 (4%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ + D+D+ + ++N V +L + AL + +Q + A
Sbjct: 136 GSSLNGFGTNSSDMDLCLMITNKDLDQRTDAVV---VLNMIQSALAETKWVSHMQLIL-A 191
Query: 114 RVPILK--FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+VPIL+ F +I+ D++ +N + L + S D R R +V +VKEWAK DI
Sbjct: 192 KVPILRIRFYEPFTDITVDLNANNSVAIRNTHLLCYYSSFDWRVRPLVSVVKEWAKRRDI 251
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVP-AILPPLKDIYP 208
N+ +F SYSL L+V+ + Q + ILP L+ +YP
Sbjct: 252 NDANRSSFTSYSLVLMVIHYLQCGLKQPILPSLQVVYP 289
>gi|52139145|gb|AAH82663.1| LOC494678 protein, partial [Xenopus laevis]
Length = 837
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 64/233 (27%), Positives = 101/233 (43%), Gaps = 26/233 (11%)
Query: 96 RALRQK-GGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGR 154
+ LRQ G +Q V AR P++ F+ + D++++N S FL S +D R
Sbjct: 300 KVLRQCVPGVHGVQSVPTARRPVIHFQHKTSGLRGDVTLNNRLALRNSSFLRLCSDLDTR 359
Query: 155 FRDMVLLVKEWAKAHDI-NNPKTGT--FNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
+V V+ WA+ + + NP G N+Y+L+LLV F QT P +LP L +
Sbjct: 360 VPQLVYTVRYWARVNQLAGNPLGGGPLLNNYALTLLVFFFLQTRSPPVLPTLVHL----- 414
Query: 212 VDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSLKA 268
R ++ ++ F SD + N+ SL+ L F ++ L L
Sbjct: 415 -------REETANEVPQVIDGWDCSFPSDTTQVKESGNQQSLSSLLAEFFSFYASLDLHL 467
Query: 269 SELGICPFTGQWEHIRSNTRWLPNNH-----PLFIEDPFEQPENSARAVSEKN 316
L +CP G + ++ +H PL I+DPFE N VS +
Sbjct: 468 --LILCPCNGLTVPLPFSSSPPAWSHGFRLGPLNIQDPFELSHNVGGNVSSRT 518
>gi|398408115|ref|XP_003855523.1| hypothetical protein MYCGRDRAFT_21922, partial [Zymoseptoria
tritici IPO323]
gi|339475407|gb|EGP90499.1| hypothetical protein MYCGRDRAFT_21922 [Zymoseptoria tritici IPO323]
Length = 486
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 98/241 (40%), Gaps = 31/241 (12%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ N G ++ L S D R R MVL VK WAK IN+ +G
Sbjct: 231 LDFPNEGVGIQCDINFFNPLGLHNTQMLRCYSMCDPRVRPMVLFVKSWAKQRKINSSYSG 290
Query: 178 TFNSYSLSLLVLFHFQTCV-PAILPPLKDIY------PGNLV-DDLKGVRANAERQIAEI 229
T +SY L+VL + P +LP L+ + P ++ ++ G + R EI
Sbjct: 291 TLSSYGYVLMVLHYLMNVARPPVLPNLQMAWRPQGCTPSSVTRTEVDGWTVDFWRNEEEI 350
Query: 230 -CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-------------LSLKASELGICP 275
A + SS NR SL L F + +S LSL+ +
Sbjct: 351 QNAVRKGQMSS------NRDSLGSLLADFFQYYSSMGQGPQFRWTQQVLSLRTEHGILTK 404
Query: 276 FTGQWEHIRSNT---RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLT 332
W ++ + + + + IEDPFE N AR V+ + I + F + L
Sbjct: 405 EAKGWVKAQTEEGEGKKIQHRYLFCIEDPFELAHNVARTVTHNGIVAIRDEFRRAYRILQ 464
Query: 333 S 333
S
Sbjct: 465 S 465
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 10/112 (8%)
Query: 36 REVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLL 95
R++ + SL ++E FGSF S S D+D+ I + + + ++S ++ L L
Sbjct: 31 RQICQENRSLPAVSLECFGSFQSGFASAGSDMDLVIVVQDANAMASCFSLLEHDLPRTLE 90
Query: 96 RALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFW 147
+ L + G RL + RVPI+K C D+ G+++ + W
Sbjct: 91 KKLLESGFGARL--LTRTRVPIIKV--------CQSPADDFLGKLREEREKW 132
>gi|353236988|emb|CCA68971.1| hypothetical protein PIIN_02831 [Piriformospora indica DSM 11827]
Length = 963
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 117/250 (46%), Gaps = 37/250 (14%)
Query: 53 FGSFVSNLFSRWGDLDISI---ELSNGSCISSAGKKVKQSLLGDL--LRAL--RQKGGYR 105
FGS + DLDI I L G VK+ L D+ LR L R K +
Sbjct: 492 FGSSRYGVSDSGSDLDIVILDKRLEKG-----FEPHVKKKDLHDIYNLRQLSYRMKSTFD 546
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
++ + A+VPI+K I N++ DI+I++ G ++ L + +++ +VK+W
Sbjct: 547 KMVVIDGAKVPIIKARDIRSNVAVDININDRLGLYNTELLSHYCALWPSLSNLIYVVKKW 606
Query: 166 AKAHDINN----PKTG--TFNSYSLSLLVLFHFQTCVPAILPPLKDI-YPGNLVDDLKG- 217
AK+ +N+ P+ G +F+SY L+L+V+ QT +LP L+D Y +L+G
Sbjct: 607 AKSRGLNDPAGLPRAGGPSFSSYCLTLMVIGFLQTH--GVLPNLQDAKYLIRHSPELRGH 664
Query: 218 ------VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFL------EKFSGLS 265
R+ R + F + ++R +N L +F +++ KFS +
Sbjct: 665 ADVFWIRRSETSRMKTNVDWFPLPLH---EWRPLNSPPLGRVFYAWMMYFAYDHKFSDFA 721
Query: 266 LKASELGICP 275
++ +E G+ P
Sbjct: 722 IRIAEGGVVP 731
>gi|242039829|ref|XP_002467309.1| hypothetical protein SORBIDRAFT_01g024420 [Sorghum bicolor]
gi|241921163|gb|EER94307.1| hypothetical protein SORBIDRAFT_01g024420 [Sorghum bicolor]
Length = 411
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 4/93 (4%)
Query: 91 LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQ 150
L D+L+A G + +Q + ARVPI+K +SCDI ++NL + +K L +Q
Sbjct: 323 LADILKA----GNLQNIQPLTRARVPIVKLMDPETGLSCDICVNNLLAVVNTKLLRDYAQ 378
Query: 151 IDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
ID R R + +VK WAK +N GT +SY+
Sbjct: 379 IDRRLRQLAFIVKHWAKIRRVNETYQGTLSSYA 411
>gi|426193415|gb|EKV43348.1| hypothetical protein AGABI2DRAFT_210004, partial [Agaricus bisporus
var. bisporus H97]
Length = 515
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 47/163 (28%), Positives = 80/163 (49%), Gaps = 14/163 (8%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
FGS + R D+D+ C+ + +++ + L +L L ++ ++ + H
Sbjct: 57 FGSTANGFSLRNSDMDLC-------CLIDSPERLNPADLVTILGDLLERETRFHVKPLPH 109
Query: 113 ARVPILKFET-----IHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLVKEWA 166
AR+PI+K + I+CDI +N ++ L ++ID R R +VL +K W+
Sbjct: 110 ARIPIVKLSLDPSPGLPSGIACDIGFENRLAIENTRLLLTYAKIDPTRVRTLVLFLKIWS 169
Query: 167 KAHDINNPKTGTFNSYSLSLLVL-FHFQTCVPAILPPLKDIYP 208
K IN+P GT +SY LLV+ F P +LP L+ + P
Sbjct: 170 KRRKINSPYKGTLSSYGYVLLVIYFLVHVKNPPVLPNLQQMPP 212
>gi|403163236|ref|XP_003323340.2| hypothetical protein PGTG_04877 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163970|gb|EFP78921.2| hypothetical protein PGTG_04877 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 1452
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 111/262 (42%), Gaps = 37/262 (14%)
Query: 107 LQFVAHARVPILKFE---TIHQ--NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLL 161
++ + AR+PI+K ++ Q +SCDI +N ++ L + +D R R MVL
Sbjct: 403 VKMLPKARIPIIKLSLPPSLGQPFGLSCDIGFENRLALENTRLLLTYAMVDPRMRTMVLF 462
Query: 162 VKEWAKAHDINNPKTGTFNSYSLSLLVLFHF-------------QTCVPAILPPLKDIYP 208
+K W+K IN+P GT +SY L+V+++ Q P + PP +Y
Sbjct: 463 LKVWSKRRRINDPYLGTLSSYGYVLMVIYYLVNGRKAPVLPNLQQLPPPRVTPPEDTVYE 522
Query: 209 GNLV---DDLKG-----VRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEK 260
G+ + DDL V N E + F SS +Y H +S +
Sbjct: 523 GHDIYFFDDLDALPRFWVGMNRENVGELLIEFFRFFSSSFRY--------THDVIS-VRT 573
Query: 261 FSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
GL K S+ + + + R + N L IEDPF+ N AR V+ L I
Sbjct: 574 PGGLLSKESKGWMHDIIEESKDGRGGFLKIDLNR-LCIEDPFQTNYNVARTVTRDGLFTI 632
Query: 321 SNAFEMTHFRLTSTNQTRYALL 342
F M R+ ST R ++
Sbjct: 633 RGEF-MRAVRVLSTRTDRLDVM 653
>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
Length = 567
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P +E+ R VI L+ + A V FGS ++L+ D+D+
Sbjct: 162 IKDFVNYISPSKEEIVVRNTVIRRLKRRIAEFWPQTQAHV--FGSCATDLYLPGSDIDMV 219
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ IS+ G ++ L L LR + ++ +A A+VPI+KF NI D
Sbjct: 220 V-------ISTTGDYEQRGKLYQLSSFLRTNKLAKNIEVIATAKVPIIKFVDPQYNIHVD 272
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + W+ + G R++VL+VK++ ++ +NN G Y+ +++++
Sbjct: 273 ISFERTNGLDAARRIRKWLDSMPG-LRELVLIVKQFLRSRRLNNVHVGGLGGYA-TIILM 330
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 331 YHFLRLHPRV 340
>gi|453083459|gb|EMF11505.1| PAP/OAS1 substrate-binding domain-containing protein
[Mycosphaerella populorum SO2202]
Length = 632
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/250 (26%), Positives = 99/250 (39%), Gaps = 42/250 (16%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ N G ++ L S+ D R R +VL +K WAK IN+ +G
Sbjct: 234 LDFPKDGIGIQCDINFSNPLGLHNTQMLRCYSKCDPRVRPIVLFMKSWAKQRKINSSYSG 293
Query: 178 TFNSYSLSLLVLFHFQTCV-PAILPPLKDIY--------PGNLVDDLKGVRANAERQIAE 228
T +SY L+VL + V P +LP L+ + PG ++ G + R E
Sbjct: 294 TLSSYGYVLMVLHYLINVVRPPVLPNLQMQWRPHTHCTPPGRTRMEVDGWVVDFWRNENE 353
Query: 229 I-CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS---------------------GLSL 266
I A + S+ N S+ L + +S G+
Sbjct: 354 IESALRNGQMSA------NSDSIGSLLAGLFQYYSSMGNGPQFRWTQQVLSLRTPGGVLT 407
Query: 267 KASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEM 326
K ++ + T + E R R+L IEDPFE N AR V+ + I + F
Sbjct: 408 KGAKGWVKATTEEGEGKRIQHRYL-----FCIEDPFEHSHNVARTVTHNGIVAIRDEFRR 462
Query: 327 THFRLTSTNQ 336
+ L + Q
Sbjct: 463 AYRILNAVAQ 472
>gi|427797921|gb|JAA64412.1| Putative terminal uridylyltransferase 4, partial [Rhipicephalus
pulchellus]
Length = 749
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 142/333 (42%), Gaps = 68/333 (20%)
Query: 23 EDWETRMKVISDLREVVESVE-----SLRGATVEPFGSFVSNLF-------------SRW 64
E+ E R +V+SDL +++ SL G++ FG SN+ +
Sbjct: 188 EEVELRKRVVSDLETFIKATLPDVKLSLHGSSGNGFGLKTSNVXXXRVVSDLETFIKATL 247
Query: 65 GDLDISIELSNGSC-----------ISSAGKKVKQSLL---GDLLRALRQKGGYRRLQFV 110
D+ +S+ S+G+ ++ GK L GDLL+ + Y ++
Sbjct: 248 PDVKLSLHGSSGNGFGLKTSNVNIDLTPLGKADCAQLFVGTGDLLQECPK---YAQVTKD 304
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
++VP ++F+ + +SC+IS++N Q SK L + +D R + + + + WAK
Sbjct: 305 YLSKVPRIRFKEVDSKLSCEISLNNSNSQKTSKLLDDYASLDRRVKILGVAFRLWAKHCG 364
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD------DLKGVRANAER 224
++ GT ++ +++ +F Q C PA+LP L ++ G + DL+G
Sbjct: 365 LDQQDRGTLPPHAFAIMTVFFLQQCKPAVLPVLHEMKDGKESESYLKPKDLEG------- 417
Query: 225 QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIR 284
R+S N S+ L+V L +F K ++ +C Q I
Sbjct: 418 -----------RWSCK-----NDRSIGQLWVELL-RFYATEFKLNKRVVCIRRSQPMLI- 459
Query: 285 SNTRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
+W N + IEDP+ N AR++ + +
Sbjct: 460 VEKKW--NKRYIAIEDPYSCKRNLARSIPSERM 490
>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor
FP-101664 SS1]
Length = 660
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 119/299 (39%), Gaps = 52/299 (17%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A V PFGS+ + L+ GD+D+ I S + + S+L L +++ G R+
Sbjct: 207 AQVLPFGSYETKLYLPLGDIDLVI------YSQSMARMDRVSVLHSLANIVKRAGITDRV 260
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+A A+VPI+KF T H S DISI+ G K + + R +VL++K +
Sbjct: 261 TIIAKAKVPIIKFVTTHGRFSVDISINQGNGVTAGKMVKQFLEELPALRSLVLIIKSFLS 320
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIA 227
+N TG SYS+ L + Q ++P
Sbjct: 321 QRSMNEVFTGGLGSYSIVCLAISFLQ------------MHP------------------- 349
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
+ R D + ++ L + F E + G E+GI G ++
Sbjct: 350 -----KVRRGEIDPSK-----NMGVLVMEFFELY-GCYFNYGEVGISLRDGGSYFNKTQR 398
Query: 288 RWLPNNHP--LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSS 344
W+ L IEDP + + +R N+AK+ H +T+ T+ +++S+
Sbjct: 399 GWMDYGQQRLLCIEDPGDPTNDISRG--SYNIAKVRTTLAGAHTIMTAAAYTQASIISA 455
>gi|76559935|ref|NP_001029073.1| speckle targeted PIP5K1A-regulated poly(A) polymerase [Rattus
norvegicus]
gi|118595569|sp|Q3MHT4.1|STPAP_RAT RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|75773232|gb|AAI04696.1| Terminal uridylyl transferase 1, U6 snRNA-specific [Rattus
norvegicus]
gi|149062339|gb|EDM12762.1| similar to RNA binding motif protein 21 [Rattus norvegicus]
Length = 866
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 104/241 (43%), Gaps = 40/241 (16%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G R+Q V AR P++KF + DIS+ N S+FL S
Sbjct: 344 LVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRPSGLHGDISLSNRLALYNSRFLNLCS 401
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
++D R R +V ++ WA+ + ++ N+Y+L+LLV++ QT P +LP + +
Sbjct: 402 EMDSRVRPLVYTLRCWAQHNGLSG-GGPLLNNYALTLLVIYFLQTRDPPVLPTVAQL--- 457
Query: 210 NLVDDLKGVRANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSL 266
+ + E + E+ ++ + F D R N L+ L F S L
Sbjct: 458 --------TQRSGEGEQVEVDGWDCS-FPKDASRLEPSTNVEPLSSLLAQFFSCVSCWDL 508
Query: 267 KASELGICPFTGQ------------WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSE 314
S L + GQ WE +R P+ ++DPF+ N A V+
Sbjct: 509 SGSLLSL--REGQALMVAGGLPSDLWEGLRLG--------PMNLQDPFDLSHNVAANVTS 558
Query: 315 K 315
+
Sbjct: 559 R 559
>gi|452823931|gb|EME30937.1| nucleotidyltransferase family protein [Galdieria sulphuraria]
Length = 876
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 71/132 (53%), Gaps = 1/132 (0%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISS-AGKKVKQSLLGDLLRALRQKGGYRRLQFVA 111
FGS +++ R GDLD+ + + + I G+++++ + + L + ++ ++
Sbjct: 491 FGSSINSFGLRSGDLDMCLTVPSEDAIHRVTGERLEERHIVNRLGVILRQAKMENVECRF 550
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
ARVPI+KF +S D+ I+N + S L + +D R + + LL+K WAK I
Sbjct: 551 RARVPIVKFHDPLTRLSVDVCINNKLARHNSALLRTYASLDPRVQVLGLLIKYWAKCRGI 610
Query: 172 NNPKTGTFNSYS 183
N P TGT +SY+
Sbjct: 611 NQPFTGTLSSYA 622
>gi|348665581|gb|EGZ05410.1| hypothetical protein PHYSODRAFT_342248 [Phytophthora sojae]
Length = 569
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 74/293 (25%), Positives = 122/293 (41%), Gaps = 24/293 (8%)
Query: 56 FVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARV 115
F N R G I+ + + + KK + + L R+ Y L+ ++ RV
Sbjct: 275 FFFNASGRIGGRAIATATAKLATYPYSIKKQRSASPTRFLSNSRRTSNYEVLEVISRTRV 334
Query: 116 PILKFETI--HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINN 173
PI++F + CD+S+ N+ +K L + D R R + + VK WAK I++
Sbjct: 335 PIIRFRSSCSEYEYECDLSVGNVIATCNTKLLRAYASFDIRARQLGIAVKYWAKKRGISD 394
Query: 174 PKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLK-GVRANAERQIAEICAF 232
G +SYS LL +++ Q V +LP L+D +L+D K + IA
Sbjct: 395 ASVGFLSSYSYVLLSIYYVQ--VVHVLPNLQD---PDLLDSAKVPAKYYNGVNIAFCEDA 449
Query: 233 NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT----- 287
+IAR + +SL L V F F A+ F+ + +RS T
Sbjct: 450 DIAREFYQRRGFDTDASLQSLLVGFFNFF------ATHFN---FSHCFVAVRSPTTPKLK 500
Query: 288 -RWLPNNH-PLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTR 338
W H + I+DP E + + N K+ + F +L+S ++R
Sbjct: 501 RHWASCGHRSISIQDPLETTRDLGGVLKRHNQQKVIHEFRRAFGKLSSGERSR 553
>gi|443687927|gb|ELT90760.1| hypothetical protein CAPTEDRAFT_220251 [Capitella teleta]
Length = 492
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 119/295 (40%), Gaps = 55/295 (18%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIE-------------------LSNGSCISSAGKK 85
G T++ FGS S L S D+D+ + LS S GK
Sbjct: 196 FHGTTLDAFGSITSQLGSSSSDIDLILTQPAHKTQERYHNSPLRFYFLSQDFLDSQKGKT 255
Query: 86 VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
S+L LR + + + +A VPI++F+ + I D+S + + SK L
Sbjct: 256 QMMSILASYLRCFHPT--FHSVTKILNANVPIIRFKH-NSEIDIDVSSTDNVAVLMSKIL 312
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD 205
F++S + RF+ +++ +K WAK+ I +G +++L+ L++F QT V I+PP +
Sbjct: 313 FYLSSHEHRFKTLLMSIKFWAKSLGITVMPSGYPTNFTLTSLLIFFLQTKV--IIPPFES 370
Query: 206 IYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
DD+ + D ++ + +S L + F E ++
Sbjct: 371 FTQDKTGDDII-----------------LPVKWQDIFKTDDTTSADDLLIGFFEFYANFD 413
Query: 266 LKASELGICPFTGQWEHIRSN---TRWLPNNHPLFIEDPFEQPENSARAVSEKNL 317
Q IR+ +W + PL +E+P + N R V + L
Sbjct: 414 Y-----------SQTMSIRNGECIEKWQEEDWPLKLENPLQPDRNIPRNVCKNEL 457
>gi|1493831|gb|AAC49397.1| Trf5p [Saccharomyces cerevisiae]
Length = 625
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNIDPRI 358
>gi|449446931|ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cucumis
sativus]
Length = 544
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 85/343 (24%), Positives = 137/343 (39%), Gaps = 64/343 (18%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L+P E+ R + + VV+ + VE FGSF + L+ D+D+
Sbjct: 132 DFCEFLSPTEEERVARDSAVERVFSVVKHI--WPHCKVEVFGSFQTGLYLPTSDIDV--- 186
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I +G Q L L RAL QKG +++Q + ARVPI+KF IS DIS
Sbjct: 187 -----VILGSGIPKPQLGLQALSRALSQKGIAKKIQVIGKARVPIIKFIEKQSGISFDIS 241
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G + F+ R + L++K + + ++N +G SY+L +++
Sbjct: 242 FDVQNGPKAADFIKGAVSKWPPLRPLCLILKVFLQQRELNEVYSGGLGSYALLTMLMAML 301
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q+ ++ P +L +L GV L H
Sbjct: 302 QSI---------NVPPSSLEHNL-GVL------------------------------LVH 321
Query: 253 LFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSAR 310
F F G L S++G+ G +S ++ P L IEDP + P+N
Sbjct: 322 FF-----DFYGRKLNTSDVGVSCNAGGIFFSKSYRGFMTKGRPCLLSIEDP-QAPDNDI- 374
Query: 311 AVSEKNLAKISNAFEMTHFRLTSTNQT-----RYALLSSLARP 348
+ N +I +AF M + LT+ ++L ++ RP
Sbjct: 375 GKNSFNYFQIRSAFAMAYSILTNVKTVLGLGPNRSILGTIIRP 417
>gi|428182086|gb|EKX50948.1| hypothetical protein GUITHDRAFT_66394 [Guillardia theta CCMP2712]
Length = 242
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 105/228 (46%), Gaps = 25/228 (10%)
Query: 112 HARVPILKFET---IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ARVPI+KF+ + CD+S++N+ I + LF + +D R R +++ +K W K
Sbjct: 9 YARVPIIKFKAQDGLDFVFDCDLSVNNVLACINTDLLFTYTMLDKRVRPLIMCIKHWVKQ 68
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI--YPGNLVDDLKGVRANAERQI 226
I+ G +SY+ +L+V+ + Q +LP L+ + L +D + + +
Sbjct: 69 RRIHKTFRGYLSSYTYTLMVIQYLQ--YERVLPCLQSLRRVQATLNND-PSFAVSCDGDV 125
Query: 227 AEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPF 276
+ C F N+ +S R NRSSL L V F +S +S+++ L
Sbjct: 126 YD-CYFYRNVETLASFGERN-NRSSLGLLLVGFFHFYSNVFPIDKGVVSIRSGRLLRKKA 183
Query: 277 TGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
G W+ + N H IEDPF+ + R V++ + I F
Sbjct: 184 KG-WD----TSEDFRNRHIFCIEDPFDINLDLGRYVNDYTVQDILEEF 226
>gi|412993216|emb|CCO16749.1| predicted protein [Bathycoccus prasinos]
Length = 767
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 52/183 (28%), Positives = 82/183 (44%), Gaps = 12/183 (6%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG---- 83
R ++ +RE ++ S A V FGS + L + D+D+ I + G
Sbjct: 288 RDEIEQTVRETIQKY-SFSKAQVHRFGSGATGLALKDADVDLVILGVGPQSVKGGGGGFT 346
Query: 84 KKVKQSL---LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQI 140
+ KQ L L + + LR KG +R + ++ A+VPI K + H I +D G
Sbjct: 347 RSEKQLLVQHLRTIAKTLRNKGVCKRAEIISSAKVPIAKLDAYHAETKTHIKLDLAIGVS 406
Query: 141 KS-KFLFWISQIDGRF---RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV 196
WI + G F + +VL++K + H +NN TG Y L LV+ H + C
Sbjct: 407 NGLAAAQWIREQVGEFPALKPLVLVLKRLLQIHKLNNAATGGCGGYLLISLVVSHLKQCT 466
Query: 197 PAI 199
PA+
Sbjct: 467 PAM 469
>gi|126333639|ref|XP_001366127.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Monodelphis domestica]
Length = 857
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 103/250 (41%), Gaps = 42/250 (16%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G ++ V AR P++KF + DIS+ N S+FL +
Sbjct: 338 LVGSVLRGCVP--GVHSVRTVPSARRPVVKFCHRPSGLHGDISLSNRLALYNSRFLNFCC 395
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
++D R R +V ++ WA+ + ++Y+L+LLV++ QT P +LPPL +
Sbjct: 396 ELDRRVRPLVYTLRRWAQGRGLTG-SGPLLSNYALTLLVIYFLQTRDPPVLPPLTKL--- 451
Query: 210 NLVDDLKGVRANAERQIAEI----CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
+ E + E+ C+F + S N L L F S
Sbjct: 452 --------TQMAGEEEQVEVDGWDCSF--PQEVSCLEPSTNTEPLDALLAQFFACVSSWE 501
Query: 266 LKASEL------------GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
L+ S L G+ P G W +R PL ++DPF+ N A V+
Sbjct: 502 LQGSLLSLREGRPLPIAEGLPP--GLWGGLRVG--------PLNVQDPFDLSHNVAANVT 551
Query: 314 EKNLAKISNA 323
+ ++ N
Sbjct: 552 SRVAGRLQNC 561
>gi|323335976|gb|EGA77253.1| Trf5p [Saccharomyces cerevisiae Vin13]
Length = 576
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|328850784|gb|EGF99944.1| hypothetical protein MELLADRAFT_112216 [Melampsora larici-populina
98AG31]
Length = 956
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 149/387 (38%), Gaps = 79/387 (20%)
Query: 20 PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIEL------ 73
P E+++ + + L + + V GA + PFGS S L R D+D+ +
Sbjct: 139 PTAEEYQIKEQTRQYLETLADKVSP--GARLLPFGSIASGLALRNSDMDLCCLIDHTEEK 196
Query: 74 -------SNGSCISSAG------------------KKVKQS------LLGDLLRALRQKG 102
SN S+ K VKQ+ +LG L+ Q+
Sbjct: 197 PTSPKPSSNAEDQSTQADPSESQESQEPQEPKLTTKPVKQTPAEMVLILGKLI----QEE 252
Query: 103 GYRRLQFVAHARVPILKFETIHQ-----NISCDISIDNLCGQIKSKFLFWISQIDGRFRD 157
++ + AR+PI+K +SCDI +N ++ L + +D R R
Sbjct: 253 TSFMVKMLPKARIPIIKLSLPPSAGQPFGLSCDIGFENRLALENTRLLLTYAMVDPRMRT 312
Query: 158 MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC-VPAILPPLKD----------- 205
+VL +K W K IN+P GT +SY LLV+++ A+LP L+
Sbjct: 313 IVLFLKVWTKRRRINDPYLGTLSSYGYVLLVIYYLVNGRKDAVLPNLQQLPPPRPSPPEE 372
Query: 206 -IYPGNLV---DDLKGVR---ANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFL 258
I+ G+ + DDL + R+ + RF + +R H VS +
Sbjct: 373 LIHDGHSIYFFDDLDALPRYWTGTNRENVGELLIDFFRFFASTFR------YTHDVVS-I 425
Query: 259 EKFSGLSLKASE---LGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
GL K + I G RS + N L IEDPF+ N AR V+
Sbjct: 426 RSIGGLLSKEHKGWMQDIIEEGGAVNGGRSPYAKVDLNR-LCIEDPFQINYNVARTVTRD 484
Query: 316 NLAKISNAFEMTHFRLTSTNQTRYALL 342
L I F M R+ ST R ++
Sbjct: 485 GLFTIRGEF-MRAVRVLSTRTDRLDVM 510
>gi|400598981|gb|EJP66688.1| Poly(A) RNA polymerase cid11 [Beauveria bassiana ARSEF 2860]
Length = 1262
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/221 (23%), Positives = 102/221 (46%), Gaps = 23/221 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+K+I L P + + R +++ L + R V FGS + L S D+DI
Sbjct: 282 MKEIYDKLLPTEQVEKNRKRLVEKLEMLFNDEWPDRDIKVHLFGSSGNLLCSDSSDVDI- 340
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ +++ ++ DLL + G ++ ++ A+VPI+K ++C
Sbjct: 341 -------CITTPWHELEGVCMIADLL----ARRGMEKVVCISAAKVPIVKIWDPELGLAC 389
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + + D R R + +++K W + +N+ GT +SY+ L+
Sbjct: 390 DMNVNNTVALENTRMVRTYVEADPRVRKLAMIIKYWTRRRIVNDAAFGGTLSSYTWICLI 449
Query: 189 LFHFQTCVPAILP---------PLKDIYPGNLVDDLKGVRA 220
+ Q PA+LP P D G+ D++K ++
Sbjct: 450 IAFLQLRNPAVLPALHQLPYKLPKPDGSVGDFADNMKKIKG 490
>gi|323307579|gb|EGA60848.1| Trf5p [Saccharomyces cerevisiae FostersO]
Length = 575
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 113 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 170
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 171 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 223
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 224 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 281
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 282 YSFLNMHPRI 291
>gi|401421278|ref|XP_003875128.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491364|emb|CBZ26633.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 408
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 85/179 (47%), Gaps = 8/179 (4%)
Query: 30 KVISDL--REVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
+V++DL R V + ++ FGS + + D D+S+ N S S + V
Sbjct: 27 EVVADLLLRMVDLCSRCVNKVELQLFGSLATGFCTTDADADLSLTFRNFSPWLSGIEVVD 86
Query: 88 QSLLGDLLRALRQ--KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
L L R R+ + G ++ V HA +P+L+F+ I CD++I NL G SK L
Sbjct: 87 AQNLKRLARVGREAVEMGMENVRLV-HASIPVLQFQDAVSGIRCDLTIGNLGGVANSKIL 145
Query: 146 FWISQIDGRFRD-MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CVPAILP 201
I + F V LVKEWAK ++ P FNS++++ + L Q +P +P
Sbjct: 146 AEIHHVLPDFYGAYVYLVKEWAKRCEVVAPDKAMFNSFTMTTMSLMVLQELGLLPIFVP 204
>gi|218188825|gb|EEC71252.1| hypothetical protein OsI_03224 [Oryza sativa Indica Group]
Length = 571
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 72/147 (48%), Gaps = 8/147 (5%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
+ VE FGSF + LF D+D+ I + K Q L L +AL QKG +++
Sbjct: 171 SNVEVFGSFRTGLFLPTSDIDV--------VIFDSRVKTPQVGLYALAKALSQKGVAKKI 222
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
Q +A ARVPI+KF I+ DIS D G + F+ + R + +++K +
Sbjct: 223 QVIAKARVPIVKFVERKSEIAFDISFDMDGGPQAADFIKDYVKKFPALRHLCMILKVFLH 282
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQT 194
++N TG SY+L +++ H Q
Sbjct: 283 QRELNEVYTGGIGSYALLTMLITHLQV 309
>gi|242016334|ref|XP_002428784.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212513469|gb|EEB16046.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 506
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/348 (24%), Positives = 149/348 (42%), Gaps = 49/348 (14%)
Query: 49 TVEPFGSFVSNLFSRWGDLDISIELSN----------GSCISSAGKKVKQSLLGDLLRAL 98
+V PFGS V+ L DLD+ + + G I+S + SLL +++
Sbjct: 149 SVYPFGSAVNGLGDVTSDLDLVVLRDDSTSFRWHKNPGVDINSVQNDL--SLLTNVMT-- 204
Query: 99 RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
+ G ++ F+ A+VPI+K++ + CD+S+ S+ LF +S D R + +
Sbjct: 205 KTSVGCAKVVFIKQAKVPIIKYKHSFTGVECDVSMHQTEALKMSQILFALSNFDPRIKPL 264
Query: 159 VLLVKEWAKAHDINNPKTG-TFNSYSLSLLVLFHFQ---TCVPAILPPLKDIYP---GNL 211
+ +K WA+ + + G T ++SL LL +F Q + ILPP D++P +
Sbjct: 265 IFFIKIWARELRLTREQPGPTITNFSLILLTIFFLQQKGSSHTEILPPF-DLFPLRDTDA 323
Query: 212 VDDLKGVRANAERQIAEICAFNIARFSSDKYRKI---------NRSSLAHLFVSFLEKFS 262
DD+ + E+ + F SS N S L + F + +S
Sbjct: 324 NDDVMTMENTDEKLECILNNFKCHVTSSSSSSTSSPTSSPSAPNGKSTVELLLLFYKFYS 383
Query: 263 GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI-- 320
+ + G C TGQ IR N N+ ++I++PF N ++ VS + K
Sbjct: 384 VFNFNS--YGCCLNTGQI--IRKNNF---KNYGMYIKNPFCSIYNVSKNVSLSEVNKFQF 436
Query: 321 -----SNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYAN 363
+ E + ++ + TR A++ F+L F + YAN
Sbjct: 437 YCQIAATLLEESETTISRNSTTRTAVVDE--SKFLLNFLNYNS--YAN 480
>gi|365763604|gb|EHN05131.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|259149072|emb|CAY82314.1| Trf5p [Saccharomyces cerevisiae EC1118]
Length = 642
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|295982238|pdb|3HJ1|A Chain A, Minor Editosome-Associated Tutase 1 With Bound Utp
gi|295982239|pdb|3HJ1|B Chain B, Minor Editosome-Associated Tutase 1 With Bound Utp
Length = 387
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 84/168 (50%), Gaps = 6/168 (3%)
Query: 31 VISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
VI +L++ V + L A VE FGS VS + D DIS+ N S ++V +
Sbjct: 30 VIHELQKRVLDIGMLAVNKAHVELFGSHVSGFCTPHSDADISLTYRNFSPWLQGMERVDE 89
Query: 89 SLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
+ R ++ G ++++ AR+P+++F I CD+SI N+ G SK L
Sbjct: 90 QNNKRMTRFGKEASAMGMEDVRYI-RARIPVVQFTDGVTGIHCDVSIGNIGGVENSKILC 148
Query: 147 WISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
I Q+ F + LVK W KA ++ P+ TFNS++++ + L Q
Sbjct: 149 AIRQVFPDFYGAYIHLVKAWGKAREVIAPERSTFNSFTVTTMALMVLQ 196
>gi|1050861|gb|AAC49099.1| Ynl0440p [Saccharomyces cerevisiae]
gi|1302392|emb|CAA96217.1| TRF5 [Saccharomyces cerevisiae]
Length = 625
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|81360384|ref|NP_014100.2| non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
S288c]
gi|148887014|sp|P48561.2|TRF5_YEAST RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
Full=Topoisomerase 1-related protein TRF5
gi|151944249|gb|EDN62528.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
gi|285814367|tpg|DAA10261.1| TPA: non-canonical poly(A) polymerase TRF5 [Saccharomyces
cerevisiae S288c]
Length = 642
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|340515918|gb|EGR46169.1| predicted protein [Trichoderma reesei QM6a]
Length = 1294
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/246 (23%), Positives = 111/246 (45%), Gaps = 27/246 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ +I L P + E R K+++ L + + V FGS + L S D+DI
Sbjct: 290 MNEIYQKLLPTEKVEENRRKLVNKLETIFNTEWPGHDIKVHLFGSSGNLLCSDDSDVDI- 348
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ +++ ++ DLL + G ++ ++ A+VPI+K ++C
Sbjct: 349 -------CITTPWHEMEDVCMIADLL----ARRGMEKVVCISAAKVPIVKIWDPELGLAC 397
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + + D R R + +++K W + +N+ GT +SY+ L+
Sbjct: 398 DMNVNNTLALENTRMVRTYVEADPRVRQLAMILKHWTRRRIVNDAAFGGTLSSYTWICLI 457
Query: 189 LFHFQTCVPAILP-----PLKDIYPGNLVDD-------LKGVRANAERQIAEICAFNIAR 236
+ Q PA+LP P K P V D +KG + + AE+ F R
Sbjct: 458 IAFLQLRNPAVLPALHQLPYKSTRPDGTVSDFADNLKKIKGFGSKNKSSEAELL-FQFFR 516
Query: 237 FSSDKY 242
F + ++
Sbjct: 517 FYAHEF 522
>gi|207341968|gb|EDZ69877.1| YNL299Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 642
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|115504085|ref|XP_001218835.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|83642317|emb|CAJ16088.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261326046|emb|CBH08872.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 406
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 84/168 (50%), Gaps = 6/168 (3%)
Query: 31 VISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
VI +L++ V + L A VE FGS VS + D DIS+ N S ++V +
Sbjct: 28 VIHELQKRVLDIGMLAVNKAHVELFGSHVSGFCTPHSDADISLTYRNFSPWLQGMERVDE 87
Query: 89 SLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
+ R ++ G ++++ AR+P+++F I CD+SI N+ G SK L
Sbjct: 88 QNNKRMTRFGKEASAMGMEDVRYI-RARIPVVQFTDGVTGIHCDVSIGNIGGVENSKILC 146
Query: 147 WISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
I Q+ F + LVK W KA ++ P+ TFNS++++ + L Q
Sbjct: 147 AIRQVFPDFYGAYIHLVKAWGKAREVIAPERSTFNSFTVTTMALMVLQ 194
>gi|323346953|gb|EGA81231.1| Trf5p [Saccharomyces cerevisiae Lalvin QA23]
Length = 616
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|256271295|gb|EEU06367.1| Trf5p [Saccharomyces cerevisiae JAY291]
gi|349580652|dbj|GAA25811.1| K7_Trf5p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 642
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|190409265|gb|EDV12530.1| hypothetical protein SCRG_03424 [Saccharomyces cerevisiae RM11-1a]
Length = 642
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|170050025|ref|XP_001859054.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167871634|gb|EDS35017.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 520
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 124/300 (41%), Gaps = 50/300 (16%)
Query: 52 PFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVA 111
PFGS V+ L + D+DI ++L +G+ + K Q + L+ G + + +
Sbjct: 216 PFGSRVTGLGNDTSDVDIYLDL-DGNQEGNLSKDTIQRYSNQVQALLQASGHWSEFKPIL 274
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+AR PIL+ + Q + CD+S N ++ + + +I + L VKEWA+ +I
Sbjct: 275 NARTPILRTWNLQQKLDCDLSFSNGLSMCNTELIQYFFEIQPVCSAVTLYVKEWARYLNI 334
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICA 231
N Y+++LL +F FQ +LP + + N + E ++ I
Sbjct: 335 E-----ALNGYTVTLLTIFFFQ--AHKLLPSVYQLQTNN----------SEEHKVRRISH 377
Query: 232 FNI--ARFSSDKYRKIN-RSSLAHLFVSFLEKFSGLSLKASELGICPFTG---------- 278
+ + R S ++ + + + L++ F G + K +CPF G
Sbjct: 378 WRVDFERKSLEQLKIVPIPEAQIELYLGGFFAFYGEAFKFETNMVCPFLGTPQLKIHFDP 437
Query: 279 -------QWEHIRSNTRWLPNNH------------PLFIEDPFEQPENSARAVSEKNLAK 319
Q + +R L N P+ ++DPFE N A+ + N++K
Sbjct: 438 LGTRIPAQMKALREYYATLNMNEAHPVNELLQYAKPMVVQDPFELNHNVAKGLMPINVSK 497
>gi|18676470|dbj|BAB84887.1| FLJ00132 protein [Homo sapiens]
Length = 572
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 69/137 (50%), Gaps = 3/137 (2%)
Query: 65 GDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
GDL + EL+ A L+G +LR G R+Q V AR P++KF
Sbjct: 333 GDLGKASELAETPKEEKAEGAAMLELVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRP 390
Query: 125 QNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ D+S+ N S+FL S++DGR R +V ++ WA+ ++ ++Y+L
Sbjct: 391 SGLHGDVSLSNRLALHNSRFLSLCSELDGRVRPLVYTLRCWAQGRGLSG-SGPLLSNYAL 449
Query: 185 SLLVLFHFQTCVPAILP 201
+LLV++ QT P +LP
Sbjct: 450 TLLVIYFLQTRDPPVLP 466
>gi|19114483|ref|NP_593571.1| poly(A) polymerase Cid16 (predicted) [Schizosaccharomyces pombe
972h-]
gi|3219960|sp|O13798.1|CID16_SCHPO RecName: Full=Caffeine-induced protein 16
gi|2330708|emb|CAB11210.1| poly(A) polymerase Cid16 (predicted) [Schizosaccharomyces pombe]
Length = 1202
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 124/280 (44%), Gaps = 27/280 (9%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF 109
+ FGS+ + L ++ DLD+ I S+ + +VK + + + +G
Sbjct: 924 IACFGSYRTGLMTKNSDLDLVI-YSSKEALLPYYDRVKSIIKNEFSNVMPIRG------- 975
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
AR+PI+KF T NI CD+S DNL S + S ID R + +++LVK WA
Sbjct: 976 ---ARIPIIKF-TGQYNIHCDLSFDNLLPIHNSDLILNYSLIDERVKTLLMLVKYWASNR 1031
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI 229
I+ +SY+ ++V+F+ Q ILP L+ + K VR N +
Sbjct: 1032 LIDKTHHAFPSSYTWCIMVIFYLQQIPEPILPNLQKLSTQY----SKIVRDNDYGNVN-- 1085
Query: 230 CAFN----IARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRS 285
C FN R S K RK N + L F + + S I + Q + R
Sbjct: 1086 CWFNRDTECYRGSMQKGRK-NIALLLRGFFCYYGLTTQYSFDWEAYMIDISSSQLK--RK 1142
Query: 286 NTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
+T + + P + DPF + +N +A+++K++ + E
Sbjct: 1143 STEF--KDCPFVVLDPFLKKKNLTKALTQKSVKVVRYELE 1180
>gi|392296994|gb|EIW08095.1| Trf5p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 608
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 146 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 203
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 204 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 256
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 257 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 314
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 315 YSFLNMHPRI 324
>gi|294950329|ref|XP_002786575.1| caffeine-induced death protein 1, putative [Perkinsus marinus ATCC
50983]
gi|239900867|gb|EER18371.1| caffeine-induced death protein 1, putative [Perkinsus marinus ATCC
50983]
Length = 435
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 78/311 (25%), Positives = 134/311 (43%), Gaps = 61/311 (19%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF 109
V FGS ++ ++ D+D+ CI G + + + LLR L F
Sbjct: 132 VHAFGSAINGFWTPHSDVDV--------CIQVPGHQTRAEQI-VLLRKLATSLARVTTHF 182
Query: 110 VA---HARVPILKF--ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
V AR+PI+ + + ++ DIS++N + S+ + +ID R R + + VK
Sbjct: 183 VEPRFSARIPIIHWAPKVPGSMLATDISVNNTLAVVNSRLIGAYMEIDPRLRPLGIAVKY 242
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDI-----YPGNLVDDLKGV 218
W KA IN+ GT +S+SL +L++ HF PA +LP L+D+ P V +
Sbjct: 243 WCKARGINDRSRGTLSSFSL-ILMMIHFLQRRPAPVLPSLQDLALQHNMPPLYVQGVDCR 301
Query: 219 RANAERQIAEI----CAFNIARFSSDKYRKINRSSLAHL------FVSFLEKFSGLSLKA 268
A + IAE C N R N S+ L + ++ KF ++++
Sbjct: 302 FATDPKMIAEDLDYQCKENGGR---------NTESVGFLLHEFFRYYGYMYKFGNIAIR- 351
Query: 269 SELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAV-------------SEK 315
+ TG + S + + LF+++PFE ++ A + +++
Sbjct: 352 ---DVVAATGAQPKVASPSAGV----YLFVDNPFEVGKDVANVLPNQHTRLRQEFRRAQQ 404
Query: 316 NLAKISNAFEM 326
LAK + FEM
Sbjct: 405 MLAKGVSFFEM 415
>gi|268567932|ref|XP_002640115.1| C. briggsae CBR-GLD-2 protein [Caenorhabditis briggsae]
Length = 859
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 80/158 (50%), Gaps = 7/158 (4%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ + D+D+ + ++N +K ++ +L+ + Q + Q + A
Sbjct: 341 GSSLNGFGNNSSDMDLCLMITNKDL----DQKNDAVVVLNLILSTLQYEKFVASQKLILA 396
Query: 114 RVPIL--KFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+VPIL KF +I+ D++ +N + L + S D R R +V +VKEWAK I
Sbjct: 397 KVPILRIKFAAPFDDITVDLNANNSVAIRNTHLLCYYSSYDWRVRPLVSVVKEWAKRKGI 456
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYP 208
N+ +F SYSL L+V+ + Q A +LP L+ YP
Sbjct: 457 NDANKSSFTSYSLVLMVIHYLQCGTEARVLPNLQQSYP 494
>gi|323331833|gb|EGA73245.1| Trf5p [Saccharomyces cerevisiae AWRI796]
Length = 453
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 238 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|342883438|gb|EGU83933.1| hypothetical protein FOXB_05550 [Fusarium oxysporum Fo5176]
Length = 1279
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 131/312 (41%), Gaps = 53/312 (16%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E R K++S L + V FGS + L S D+DI
Sbjct: 286 MREVYDRLLPTAAVEENRKKLVSKLEKTFNDEWPGHDIRVNLFGSSGNLLCSDDSDVDI- 344
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ ++++ ++ +LL K G ++ ++ A+VPI+K ++C
Sbjct: 345 -------CITTTWRELEDVCMIANLL----AKRGMEKVVCISAAKVPIVKIWDPELGLAC 393
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ID R R++ +++K W + +N+ GT +SY+ L+
Sbjct: 394 DMNVNNTLALENTRMVRTYIDIDPRVRELAMIIKYWTRRRIVNDAAFGGTLSSYTWICLI 453
Query: 189 LFHFQTCVPAILP-----PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR 243
+ Q P +LP P K P D A +I + + Y
Sbjct: 454 IAFLQLRSPPVLPALHQSPHKLPKPDGTTPDF---------------ADDIDKLAG--YG 496
Query: 244 KINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHP 295
K N+SS A L F ++ LS++ +L I +W + +N
Sbjct: 497 KKNKSSTAELLFQFFRFYAHEFDYDKHVLSVRHGKL-ITKHEKKWHYAINNQ-------- 547
Query: 296 LFIEDPFEQPEN 307
L +E+PF N
Sbjct: 548 LCVEEPFNTSRN 559
>gi|310801611|gb|EFQ36504.1| hypothetical protein GLRG_11649 [Glomerella graminicola M1.001]
Length = 1322
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 109/246 (44%), Gaps = 26/246 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++ + L P + E R K++ L + V FGS + L S D+DI
Sbjct: 276 MRKLYDRLTPTAKVEENRQKLVVKLERIFNEEWPGNDIRVNLFGSSGNLLCSDDSDVDI- 334
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ ++ DLL + G +++ ++ A+VPI+K ++C
Sbjct: 335 -------CITTPWKELEGVCIIADLL----ARKGMKKVVCISAAKVPIVKIWDPELGLAC 383
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W + +N+ GT +SY+ L+
Sbjct: 384 DMNVNNTLALENTRMVRTYVEIDPRVRPLAMIVKYWTRQRIVNDAAFGGTLSSYTWICLI 443
Query: 189 LFHFQTCVPAILP----------PLKDIYPGNLVDDLKGVRANAERQIAEICA--FNIAR 236
+ Q P +LP P + DDL +R ++ A + F R
Sbjct: 444 IGFLQLRDPPVLPSLHQRQHQRLPKRGGQESAFADDLDKLRGFGDKNKASLGELLFQFFR 503
Query: 237 FSSDKY 242
F + ++
Sbjct: 504 FYAHEF 509
>gi|213410491|ref|XP_002176015.1| Poly(A) RNA polymerase cid13 [Schizosaccharomyces japonicus yFS275]
gi|212004062|gb|EEB09722.1| Poly(A) RNA polymerase cid13 [Schizosaccharomyces japonicus yFS275]
Length = 663
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 90/183 (49%), Gaps = 11/183 (6%)
Query: 23 EDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSA 82
E+ + R + + + ++++ +V+ FGS S L SR D+D+ CI
Sbjct: 68 EELKRRAQFVKKVDDILKKARPDHSLSVKVFGSTSSMLASRDADVDL--------CIVCD 119
Query: 83 GKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
KK + L ++ + G +++ + A+VPI+KF ++ D +I+N+ +
Sbjct: 120 VKKSAPTTCE--LASIFSQNGMQQVVCIPRAKVPIVKFWDPEYKLASDCNINNILSISNT 177
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVLFHFQTCVPAILP 201
+ + D R R +++++K WAK +N+ GT SY+LS +++ Q P I+P
Sbjct: 178 RMMRTYVDADIRVRQLIMIIKHWAKRRCLNDAAGGGTLTSYTLSCMIVNFLQMRRPPIVP 237
Query: 202 PLK 204
L+
Sbjct: 238 SLQ 240
>gi|407036075|gb|EKE37989.1| poly(A) polymerase, putative [Entamoeba nuttalli P19]
Length = 344
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 72/149 (48%), Gaps = 8/149 (5%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS L + GDLDI C S +G++V LL + + + +
Sbjct: 84 YGSTDYGLCLKDGDLDIC-------CTSQSGRQVNAILLESFAECFK-RNNFEIRNVIEK 135
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+K + ++ D+S + QI S+F + + FR + +L+K W K+ ++N
Sbjct: 136 AKVPIIKMVDLGTKVNIDLSFNQPVAQIHSEFFSTMIHCNKHFRIVAVLLKYWLKSRNLN 195
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
P G +S +L ++L +F + P + P
Sbjct: 196 CPFKGGLSSAALCFMILHYFTSFEPPLFP 224
>gi|358398352|gb|EHK47710.1| hypothetical protein TRIATDRAFT_272508 [Trichoderma atroviride IMI
206040]
Length = 1296
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 110/246 (44%), Gaps = 27/246 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ +I L P + E R K+++ L + V FGS + L S D+DI
Sbjct: 282 MNEIYNKLLPTDKIEENRTKLVNKLEMIFNDEWPGHDIKVHLFGSSGNLLCSDDSDVDI- 340
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CIS+ +++ ++ DLL + G ++ ++ A+VPI+K ++C
Sbjct: 341 -------CISTPWHEMEDVCMIADLL----ARRGMEQVVCISAAKVPIVKVWDPELGLAC 389
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + + D R R + +++K W + +N+ GT +SY+ L+
Sbjct: 390 DMNVNNTLALENTRMVRTYVETDPRVRQLAMILKHWTRRRIVNDAAFGGTLSSYTWICLI 449
Query: 189 LFHFQTCVPAILP-----PLKDIYPGNLVDD-------LKGVRANAERQIAEICAFNIAR 236
+ Q PA+LP P K P V D LKG + + AE+ F R
Sbjct: 450 IAFLQLRNPAVLPALHQLPHKTTKPDGAVSDFADNLKKLKGFGSKNKSSEAELL-FQFFR 508
Query: 237 FSSDKY 242
F + ++
Sbjct: 509 FYAHEF 514
>gi|115433668|ref|XP_001216971.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189823|gb|EAU31523.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 998
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 60/254 (23%), Positives = 105/254 (41%), Gaps = 43/254 (16%)
Query: 63 RWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFET 122
+W DI + + S+G K+ S D G R+ V+HA+VPI+K
Sbjct: 152 QWPGHDIKVHV-----FGSSGNKLCSSDSDD---------GMERVVCVSHAKVPIVKIWD 197
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNS 181
++CD++++N ++ + +ID R R + +++K W K + + GT +S
Sbjct: 198 PELRLACDMNVNNTLALENTRMVRTYVEIDERVRPLAMIIKYWTKRRILCDAGLGGTLSS 257
Query: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSS 239
Y+ L++ QT P ILP L+ R + +R E +C+F+ S
Sbjct: 258 YTWICLIINFLQTRDPPILPSLQ-------------ARPHKKRLSPEGFVCSFDDDMNSL 304
Query: 240 DKYRKINRSSLAHLFVSFLE------KFSGLSLKASELGICPFTGQWEHIRSNTRWLPNN 293
Y + N+ +L L F + + E G+ + H+ N R
Sbjct: 305 SGYGRKNKQTLGELLFQFFRYYGHELNYEKYVVSVREGGLISKEDKGWHLLQNNR----- 359
Query: 294 HPLFIEDPFEQPEN 307
L +E+PF N
Sbjct: 360 --LCVEEPFNTSRN 371
>gi|336367333|gb|EGN95678.1| hypothetical protein SERLA73DRAFT_60289 [Serpula lacrymans var.
lacrymans S7.3]
Length = 538
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 142/349 (40%), Gaps = 61/349 (17%)
Query: 5 NVLEPILKDILGMLN---PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + +++ +N P + E R VIS + + V S + A V PFGS+ + L+
Sbjct: 180 NVPEMLHREVEAFVNYMSPSPVEDEIRGLVISLVTKAVSS--AFPDAQVLPFGSYETKLY 237
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ I+ S K ++L L L++ ++ +A A+VPI+KF
Sbjct: 238 LPDGDIDLVIQSE------SMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFV 291
Query: 122 TIHQNISCDISIDN----LCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
T H ++ DISI+ + G+I + FL + R +V++ K + +N TG
Sbjct: 292 TNHGRLNVDISINQGNGVIAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTG 351
Query: 178 TFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARF 237
SYS+ L + Q ++P ++ +AE+
Sbjct: 352 GLGSYSIVCLAISFLQ------------MHP-----KIRSGEIDAEK------------- 381
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLP--NNHP 295
+L L + F E + G E+GI G + W +
Sbjct: 382 -----------NLGVLVMEFFELY-GCYFNYEEVGISVRKGGTYFNKRQRGWYDFYKTNL 429
Query: 296 LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSS 344
L IEDP E P N S +AK+ H +TST+ R +L S
Sbjct: 430 LSIEDPTE-PSNDISKGS-FGIAKVRQTLAGAHGIMTSTSFLRAGILGS 476
>gi|390352572|ref|XP_798256.3| PREDICTED: uncharacterized protein LOC593693 [Strongylocentrotus
purpuratus]
Length = 953
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 105/247 (42%), Gaps = 50/247 (20%)
Query: 5 NVLEPILK--DILGMLNPLRE-------DWETRMKVISDLREVVESVESLRGATVEPFGS 55
NVLE +L+ D+ + L E D + R + L+EV VE V P+GS
Sbjct: 152 NVLEGLLEAEDVCSQMTALVEETCLDQSDLQLRYLICDLLQEVF--VEMFPKCRVFPYGS 209
Query: 56 FVSNLFSRWGDLDISIEL-------------------------------SNGSCISS--- 81
VS + DLD+ I+L S+ SS
Sbjct: 210 SVSGFGVKGCDLDLQIDLGRDSEQYKYKFASMFPDEDDMETNEEMAAGTSDADGTSSEQP 269
Query: 82 -AGKKVKQSLLGDLLRALRQ-KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQ 139
+ +L L R L+Q + ++ + +R P++KF + CD+S+DN
Sbjct: 270 ETSNMTHEEILQILCRLLKQCVPSCQHVRVIPSSRRPVIKFIHKESGLHCDLSLDNRLAL 329
Query: 140 IKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG---TFNSYSLSLLVLFHFQTCV 196
++ L + S +D R R +V +++WAK ++ + G +Y+L+LLV+ + Q
Sbjct: 330 RNTELLHFYSSLDERIRPLVCCLRQWAKHQQLSVNQQGPGPKMTNYALTLLVIHYLQNTQ 389
Query: 197 PAILPPL 203
P +LP +
Sbjct: 390 PTLLPTI 396
>gi|255566595|ref|XP_002524282.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223536473|gb|EEF38121.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 526
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 87/177 (49%), Gaps = 10/177 (5%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L+P E+ + R + + +V++ + VE FGS+ + L+ D+D+
Sbjct: 131 DFCDFLSPTPEEEDARNTAVKCVFDVIKYI--WPNCKVEVFGSYKTGLYLPTSDIDV--- 185
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I +G K Q L L RAL QKG +++Q +A ARVPI+KF +S DIS
Sbjct: 186 -----VIFRSGIKNPQIGLQALSRALSQKGIAKKIQVIAKARVPIVKFVEKRSGVSFDIS 240
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
D G ++F+ + R + L++K + + ++N +G SY+L +++
Sbjct: 241 FDVDNGPKAAEFIKDAVRKWPALRPLSLILKVFLQQRELNEVYSGGIGSYALLTMLM 297
>gi|336380050|gb|EGO21204.1| hypothetical protein SERLADRAFT_476100 [Serpula lacrymans var.
lacrymans S7.9]
Length = 592
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 142/349 (40%), Gaps = 61/349 (17%)
Query: 5 NVLEPILKDILGMLN---PLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + +++ +N P + E R VIS + + V S + A V PFGS+ + L+
Sbjct: 180 NVPEMLHREVEAFVNYMSPSPVEDEIRGLVISLVTKAVSS--AFPDAQVLPFGSYETKLY 237
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ I+ S K ++L L L++ ++ +A A+VPI+KF
Sbjct: 238 LPDGDIDLVIQSE------SMAYSNKVTVLHALANTLKRAKITSKVTIIAKAKVPIVKFV 291
Query: 122 TIHQNISCDISIDN----LCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
T H ++ DISI+ + G+I + FL + R +V++ K + +N TG
Sbjct: 292 TNHGRLNVDISINQGNGVIAGKIVNGFLKDMHGCGFALRSLVMITKAFLNQRGMNEVYTG 351
Query: 178 TFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARF 237
SYS+ L + Q ++P ++ +AE+
Sbjct: 352 GLGSYSIVCLAISFLQ------------MHP-----KIRSGEIDAEK------------- 381
Query: 238 SSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLP--NNHP 295
+L L + F E + G E+GI G + W +
Sbjct: 382 -----------NLGVLVMEFFELY-GCYFNYEEVGISVRKGGTYFNKRQRGWYDFYKTNL 429
Query: 296 LFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSS 344
L IEDP E P N S +AK+ H +TST+ R +L S
Sbjct: 430 LSIEDPTE-PSNDISKGS-FGIAKVRQTLAGAHGIMTSTSFLRAGILGS 476
>gi|323352825|gb|EGA85127.1| Trf5p [Saccharomyces cerevisiae VL3]
Length = 510
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 146 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 203
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 204 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 256
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 257 VSFERTNGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 314
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 315 YSFLNMHPRI 324
>gi|323303298|gb|EGA57094.1| Trf5p [Saccharomyces cerevisiae FostersB]
Length = 419
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR V+ + S A + FGSF ++L+ D+D
Sbjct: 146 IKDFVHYISPSKNEIKCRNRTIDKLRRAVKELWS--DADLHVFGSFATDLYLPGSDIDCV 203
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ N K ++ + +L R L+ KG R++ + RVPI+KF + D
Sbjct: 204 VNSRNRD-------KEDRNYIYELARHLKNKGLAIRMEVIVKTRVPIIKFIEPQSQLHID 256
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 257 VSFERTXGLEAAKLIREWLRDSPG-LRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLV- 314
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 315 YSFLNMHPRI 324
>gi|398405986|ref|XP_003854459.1| hypothetical protein MYCGRDRAFT_38269, partial [Zymoseptoria
tritici IPO323]
gi|339474342|gb|EGP89435.1| hypothetical protein MYCGRDRAFT_38269 [Zymoseptoria tritici IPO323]
Length = 486
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/322 (22%), Positives = 133/322 (41%), Gaps = 35/322 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E E R K++ L ++ V+ FGS + L S D+DI
Sbjct: 123 MRELYDRLLPSEESEENRAKLLVKLERLLNEEWPGNDIRVKVFGSSGNLLSSTDSDVDIC 182
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I I+ + + L LL K G ++ A A+VPI+K ++ D
Sbjct: 183 I-------IAPLPQLISMHTLASLL----AKNGMEKVVCRAAAKVPIVKCWDPELQLAAD 231
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N+ ++ + ++D R R + ++K W K +N+ GT +SY+ +++
Sbjct: 232 LNVNNVQALQNTRMIKTYVELDDRIRPLAKIIKYWTKRRILNDAAYGGTISSYTWICMII 291
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
Q P ILP L+ I PG + G + I E+ + D N+ S
Sbjct: 292 NFLQRRSPPILPSLQKI-PGCRLPSETGKVSPFADDIEELKKSGALKGYGDS----NKES 346
Query: 250 LAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNH-------PLFIEDPF 302
L L F + G + S+ + G+ R W P+N+ L +E+PF
Sbjct: 347 LGELLYQFFRHY-GYDFEYSQYVVSIKEGK-SLSRKEKGWQPSNYLDKEARQRLCVEEPF 404
Query: 303 EQPENSARAVSEKNLAKISNAF 324
E+NL ++ +
Sbjct: 405 ---------TVERNLGNTADDY 417
>gi|406701338|gb|EKD04487.1| hypothetical protein A1Q2_01263 [Trichosporon asahii var. asahii
CBS 8904]
Length = 624
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 87/177 (49%), Gaps = 10/177 (5%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P + ++ R +I + ++ + R ATV PFGS+ + L+ GD+D+ + S
Sbjct: 116 MSPTQHEFHVRKTMIDLITHIIR--KEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLS 173
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
+K K ++L L R +R + + A+VPI+KF T I+ DIS++
Sbjct: 174 ------EKNKVTMLHQLARMMRGNHITETVAVITRAKVPIIKFVTAEGGINVDISLNQTN 227
Query: 138 GQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G K + ++ + G R+++L++K + +N TG SYS+ L L Q
Sbjct: 228 GVSAVKIVNHYLKALPG-ARELILVIKAFLSQRSMNEVYTGGLGSYSVICLALSFLQ 283
>gi|156058866|ref|XP_001595356.1| hypothetical protein SS1G_03445 [Sclerotinia sclerotiorum 1980]
gi|154701232|gb|EDO00971.1| hypothetical protein SS1G_03445 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1017
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/194 (24%), Positives = 93/194 (47%), Gaps = 12/194 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E E R K++ L ++ V FGS + L + D+DI
Sbjct: 46 MRELYDRLLPTAETDERRRKLVLKLEDMFNKEWPGHDIRVHVFGSSGNLLCTDESDVDI- 104
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI++ K ++ + + L K G +++ V+ A+VPI+K + CD
Sbjct: 105 -------CITTDWKAMEGVCM---IAELLAKNGMQKVICVSTAKVPIVKIFDPDLKLFCD 154
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
++++N ++ + +ID R R + +++K W K+ IN+ GT +SY+ +++
Sbjct: 155 MNVNNTLALENTRMIKTYIEIDPRVRPLAMIIKHWTKSRVINDAAFGGTLSSYTWICMII 214
Query: 190 FHFQTCVPAILPPL 203
Q+ P +LP L
Sbjct: 215 NFLQSREPPVLPAL 228
>gi|401882466|gb|EJT46724.1| hypothetical protein A1Q1_04689 [Trichosporon asahii var. asahii
CBS 2479]
Length = 631
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 87/177 (49%), Gaps = 10/177 (5%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P + ++ R +I + ++ + R ATV PFGS+ + L+ GD+D+ + S
Sbjct: 116 MSPTQHEFHVRKTMIDLITHIIR--KEWRDATVTPFGSWETQLYLPTGDIDLVVSTPRLS 173
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
+K K ++L L R +R + + A+VPI+KF T I+ DIS++
Sbjct: 174 ------EKNKVTMLHQLARMMRGNHITETVAVITRAKVPIIKFVTAEGGINVDISLNQTN 227
Query: 138 GQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G K + ++ + G R+++L++K + +N TG SYS+ L L Q
Sbjct: 228 GVSAVKIVNHYLKALPG-ARELILVIKAFLSQRSMNEVYTGGLGSYSVICLALSFLQ 283
>gi|341874753|gb|EGT30688.1| hypothetical protein CAEBREN_25845 [Caenorhabditis brenneri]
Length = 476
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/253 (23%), Positives = 102/253 (40%), Gaps = 41/253 (16%)
Query: 97 ALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFR 156
LR+K FV P+L+ T + D++IDN + L ++D RF
Sbjct: 129 VLREKSTAFEKLFVTKGHTPVLQLVTKVPRVEIDVTIDNETPIRNTHLLANYGKVDARFP 188
Query: 157 DMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDL 215
+ ++K WA + + + NS+S+ LL++ + Q+ V PA+LP L+ I+P
Sbjct: 189 QLCRVIKHWAAETGVEDSRNERLNSFSVCLLLIHYLQSGVTPAVLPNLQAIFP------- 241
Query: 216 KGVRANAERQIAEICAFNIARFSSDKYRKI----NRSSLAHLFVSFLEKFSGLSLKASEL 271
N E ++ + + R + N SS+ L F + ++
Sbjct: 242 ---EYNGEYEVGTGAFQDWDLLKELEGRGVPLGQNTSSVGALLQGFFKFYATFD------ 292
Query: 272 GICPFTGQWEHIRSNT-------------RWLPN-NHPLFIEDPF-EQPENSARAVSEKN 316
F QW I+ T LP+ N L +EDPF E P N R + + +
Sbjct: 293 ----FKNQWISIKRGTALEKKRDDQENPLEGLPDKNWCLVVEDPFLETPWNCGRTLQQMD 348
Query: 317 -LAKISNAFEMTH 328
L ++ F +
Sbjct: 349 TLERVQEEFRLAE 361
>gi|167387955|ref|XP_001738379.1| poly(A) RNA polymerase cid11 [Entamoeba dispar SAW760]
gi|165898475|gb|EDR25323.1| poly(A) RNA polymerase cid11, putative [Entamoeba dispar SAW760]
Length = 344
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 83/178 (46%), Gaps = 16/178 (8%)
Query: 32 ISDLR-EVVESVESLRGAT-------VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
+ D+R EV++ VE + + +GS L + GDLDI C S +G
Sbjct: 55 LVDMRYEVMQRVEQVLNQNYVDFHFRAQVYGSTDYGLCLKDGDLDIC-------CTSQSG 107
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
++V +L + + + + A+VPI+K + +S D+S + QI S+
Sbjct: 108 RQVNAIVLESFAECFK-RNNFEIKNVIEKAKVPIIKMIDLGTKVSIDLSFNQPVAQIHSE 166
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
F + + FR + +L+K W K ++N P G +S +L ++L +F + P + P
Sbjct: 167 FFSTMIHCNKHFRIVAVLLKYWLKTRNLNCPFKGGLSSAALCFMILHYFSSFEPPLFP 224
>gi|157868531|ref|XP_001682818.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126274|emb|CAJ03778.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|254839889|gb|ACT83522.1| mitochondrial editosome-like complex associated TUTase [Leishmania
major]
Length = 409
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 85/181 (46%), Gaps = 12/181 (6%)
Query: 26 ETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
+ R++V+ V VE ++ FGS + + D D+S+ N S S +
Sbjct: 31 DLRLRVVDLCSRCVNKVE------LQLFGSLATGFCTTGADADLSLTFRNFSPWLSGIEA 84
Query: 86 VKQSLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
V L R R+ G G + ++ + +A +P++ F+ I CD+SI N+ G SK
Sbjct: 85 VDAQNFKRLARVGREAGEMGMKNVRLI-NACIPVVHFQDAVSGIRCDLSIGNVNGVANSK 143
Query: 144 FLFWISQIDGRFRD-MVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--CVPAIL 200
L I ++ F V LVKEWAK ++ P FNS++++ + L Q +P +
Sbjct: 144 ILAEIHRVLPDFYGAYVYLVKEWAKKCEVVAPDKSMFNSFTMTTMSLMVLQELGLLPIFV 203
Query: 201 P 201
P
Sbjct: 204 P 204
>gi|255072677|ref|XP_002500013.1| predicted protein [Micromonas sp. RCC299]
gi|226515275|gb|ACO61271.1| predicted protein, partial [Micromonas sp. RCC299]
Length = 292
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 81/181 (44%), Gaps = 10/181 (5%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L P E+ R + +R V S+ GA E GSF + ++ D+D
Sbjct: 10 DFCRYLEPTAEESSARTAAVERVRGAVLSIWP--GARFEVHGSFATGMYLPNSDID---- 63
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ I +G K + L L +L +KG R++Q +A ARVPI+KFE D+S
Sbjct: 64 ----AVILGSGCKSPATCLKALALSLSRKGMARKIQLIAKARVPIVKFEERPSGFQFDVS 119
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G ++ + + R + ++K + + +N TG SY+L +V+ H
Sbjct: 120 FDVANGPASAEIVRANMRRFPALRPLTTVLKAFLQQRALNEVYTGGVGSYALLCMVMAHL 179
Query: 193 Q 193
Q
Sbjct: 180 Q 180
>gi|431891368|gb|ELK02243.1| Poly(A) RNA polymerase, mitochondrial [Pteropus alecto]
Length = 638
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 102/212 (48%), Gaps = 27/212 (12%)
Query: 24 DWETRMKVISDLREVVESVES--LRGATVEPFGSFVSNLFSRWG-DLDISIEL------- 73
D TR++ ++ ++E V + V PFGS V N F + G DLD+ ++L
Sbjct: 275 DENTRLRYLTC--SLIEDVAAAYFPDCAVRPFGSSV-NGFGKLGCDLDMFLDLDEIGKSD 331
Query: 74 ---SNGSCIS-------SAGKKVKQSLLGDLLRALRQKG-GYRRLQFVAHARVPILKFET 122
++G+ + ++ + Q +L + +L G G +Q + +AR P+++F
Sbjct: 332 AHKTSGNFLMEFQVKNVASERIATQKILSVIGESLDHFGPGCVGVQKILNARCPLVRFSH 391
Query: 123 IHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF-NS 181
CD++ +N S+ L+ +D R R +V ++ WA+AH + + G + +
Sbjct: 392 QASGFQCDLTTNNRIALKSSELLYIYGAVDPRVRALVFTIRCWARAHSLTSSIPGAWITN 451
Query: 182 YSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
+SL+++V+F Q P ILP L Y L D
Sbjct: 452 FSLTMMVIFFLQRRSPPILPTLD--YLKTLAD 481
>gi|402592503|gb|EJW86431.1| PAP/25A associated domain-containing protein [Wuchereria bancrofti]
Length = 518
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 66/122 (54%), Gaps = 4/122 (3%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILK--FETIHQNISCDISIDNLCGQIKSKFLFW 147
+L + AL + +Q + A+VPIL+ F +I+ D++ +N + L +
Sbjct: 21 VLNMIQSALAETKWVSHMQLIL-AKVPILRIRFYEPFTDITVDLNANNSVAIRNTHLLCY 79
Query: 148 ISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDI 206
S D R R +V +VKEWAK DIN+ +F SYSL L+V+ + Q + ILP L+ +
Sbjct: 80 YSSFDWRVRPLVSVVKEWAKRRDINDANRSSFTSYSLVLMVIHYLQCGLKQPILPSLQVV 139
Query: 207 YP 208
YP
Sbjct: 140 YP 141
>gi|452843642|gb|EME45577.1| hypothetical protein DOTSEDRAFT_52815 [Dothistroma septosporum
NZE10]
Length = 1085
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/252 (24%), Positives = 107/252 (42%), Gaps = 23/252 (9%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P +E E R K++ L ++ V FGS + L S D+DI
Sbjct: 118 MRELYDRLLPSQESEERRTKLVPKLDRILNDEWPGNDIRVNVFGSSGNMLSSTDSDVDI- 176
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI++ +K++ L AL K G ++ A A+VPI+K ++ D
Sbjct: 177 -------CITTPLRKLESM---HSLAALLHKHGMEKIVCRAAAKVPIVKAWDPDLQLAID 226
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLVL 189
I+++N ++ + Q+D R R + ++K W K +N+ GT +SY+ +++
Sbjct: 227 INVNNPLALQNTRMIRTYVQLDDRVRPLAKIIKYWTKRRILNDAAYGGTISSYTWICMII 286
Query: 190 FHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSS 249
Q P ILP L+ I+ R E A A ++ + N+ S
Sbjct: 287 NFLQRREPPILPSLQKIH---------DRRQKTESGEASTFADDVDALKG--FGDANKES 335
Query: 250 LAHLFVSFLEKF 261
L L F +
Sbjct: 336 LGELLFQFFRHY 347
>gi|37574078|ref|NP_932110.1| speckle targeted PIP5K1A-regulated poly(A) polymerase [Mus
musculus]
gi|81915027|sp|Q8R3F9.1|STPAP_MOUSE RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|19344068|gb|AAH25499.1| Terminal uridylyl transferase 1, U6 snRNA-specific [Mus musculus]
gi|23274106|gb|AAH23900.1| Terminal uridylyl transferase 1, U6 snRNA-specific [Mus musculus]
Length = 869
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 62/112 (55%), Gaps = 3/112 (2%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G R+Q V AR P++KF + D+S+ N S+FL S
Sbjct: 346 LVGSILRGCVP--GVYRVQTVPSARRPVVKFCHRPSGLHGDVSLSNRLALYNSRFLNLCS 403
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
++DGR R +V ++ WA+ + ++ N+Y+L+LLV++ QT P +LP
Sbjct: 404 EMDGRVRPLVYTLRCWAQHNGLSG-GGPLLNNYALTLLVIYFLQTRDPPVLP 454
>gi|149244754|ref|XP_001526920.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449314|gb|EDK43570.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 664
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 91/190 (47%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P E+ R KV++ L+ + G FGS ++L+ D+D+
Sbjct: 227 MKDFVNYISPSSEEIVIRNKVVNTLKTQIALF--WPGTEAHVFGSSATDLYLPGSDIDMV 284
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ +S G +S L L L+ K ++ +A A+VPI+KF NI D
Sbjct: 285 V-------LSDTGDYENRSRLYQLSSFLKAKKLATNVEVIASAKVPIIKFVDPDSNIHVD 337
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
IS + G + W++ G R++VL+VK++ ++ +NN G Y+ ++++
Sbjct: 338 ISFERKNGLDAARRIRRWLASTPG-LRELVLVVKQFLRSRKLNNVHVGGLGGYA-TIIIC 395
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 396 YHFLRLHPKL 405
>gi|145534215|ref|XP_001452852.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420551|emb|CAK85455.1| unnamed protein product [Paramecium tetraurelia]
Length = 357
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 85/172 (49%), Gaps = 11/172 (6%)
Query: 32 ISDLREVVESVESLRG--ATVEPFGSFVSNLFSRWGDLDISI----ELSNGSCISSAGKK 85
ISD + + SV SL + PFGSF + DLD + ELS + + + K
Sbjct: 27 ISDTLKALGSVISLANLKGNLLPFGSFCNGFHGNNSDLDCVLITDSELSTTTILRNLRKA 86
Query: 86 VKQSLLGDLLRALR--QKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
V++ L+ Q Y ++ + +++VPI+K I +I+ D+SI+N+ G + SK
Sbjct: 87 VQEYKYTYQTPQLQFDQLILYAKVNSITYSKVPIIKITDITNDIAIDLSINNINGVLNSK 146
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC 195
L SQI + + + L+K W K + TG SY++ LL L HF C
Sbjct: 147 LLKEYSQIHPKIQQLGQLLKLWGKNQRL--IVTGQLTSYAI-LLTLIHFLQC 195
>gi|389586429|dbj|GAB69158.1| hypothetical protein PCYB_145860 [Plasmodium cynomolgi strain B]
Length = 482
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
+ A VPI K NI CDISI+N + +K + + D R + ++K WAK
Sbjct: 273 IIKASVPIAKIYREQNNI-CDISINNTVAIVNTKLVSSLCNTDERVTIINRVIKYWAKQK 331
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+INN GTF+SY+L LL + Q +LPP DI
Sbjct: 332 NINNRSQGTFSSYALFLLTYYFLQNLETPLLPPYNDI 368
>gi|395544406|ref|XP_003774101.1| PREDICTED: speckle targeted PIP5K1A-regulated poly(A) polymerase
[Sarcophilus harrisii]
Length = 889
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/249 (26%), Positives = 100/249 (40%), Gaps = 42/249 (16%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
L+G +LR G + V AR P++KF + DIS+ N S+FL
Sbjct: 362 LVGSVLRGCVP--GVHSVWTVPSARRPVVKFCHRPSGLHGDISLSNRLALSNSRFLNLCC 419
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
+D R R +V ++ WA+ + N+Y+L+LLV++ QT P +LP L +
Sbjct: 420 ALDRRVRPLVYTLRCWAQGRGLTG-SGPLLNNYALTLLVIYFLQTRDPPVLPSLTRL--- 475
Query: 210 NLVDDLKGVRANAERQIAEI----CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS 265
+ E + E+ C F R +S N SL L F S
Sbjct: 476 --------TQMAGEEERVEVDGWDCTF--PREASHLEPSANTESLPSLLAQFFSCVSSWE 525
Query: 266 LKASEL------------GICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVS 313
L+ S L G+ P G W +R PL ++DPF+ N A V+
Sbjct: 526 LRGSLLSLREGLPLPVADGLPP--GLWGGLRLG--------PLNLQDPFDLSHNVAANVT 575
Query: 314 EKNLAKISN 322
+ ++ N
Sbjct: 576 GRVAGRLQN 584
>gi|82541613|ref|XP_725036.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23479890|gb|EAA16601.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 316
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 77/162 (47%), Gaps = 14/162 (8%)
Query: 44 SLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGG 103
+L+G + GS +N++ + D+D SCI + K S L +L+ ++
Sbjct: 154 NLKGK-IYFIGSCENNIWIKNSDID--------SCIVVENCEDKNSYLY-ILKVIKSAIN 203
Query: 104 YRRLQF---VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVL 160
+ A VPI K NI CDISI+N + + + + ID R +
Sbjct: 204 LIHPSLTVNIIKASVPIAKIYKDQTNI-CDISINNTVAIVNTHLVSCLCNIDERVPIINR 262
Query: 161 LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPP 202
++K WAK +INN GTF+SY+L LL F FQ ILPP
Sbjct: 263 IIKYWAKQKNINNRSQGTFSSYALFLLTYFFFQNLETPILPP 304
>gi|308461806|ref|XP_003093191.1| CRE-GLD-2 protein [Caenorhabditis remanei]
gi|308250668|gb|EFO94620.1| CRE-GLD-2 protein [Caenorhabditis remanei]
Length = 894
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 79/158 (50%), Gaps = 7/158 (4%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ + D+D+ + ++N +K ++ +L+ + Q + Q + A
Sbjct: 374 GSSLNGFGNNSSDMDLCLMITNKDL----DQKNDAVVVLNLILSTLQYEKFVASQKLILA 429
Query: 114 RVPIL--KFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
+VPIL KF +I+ D++ +N + L + S D R R +V +VKEWAK I
Sbjct: 430 KVPILRIKFAAPFDDITVDLNANNSVAIRNTHLLCYYSSYDWRVRPLVSVVKEWAKRKGI 489
Query: 172 NNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYP 208
N+ +F SYSL L+V+ + Q +LP L+ YP
Sbjct: 490 NDANKSSFTSYSLVLMVIHYLQCGTQTKVLPNLQQSYP 527
>gi|413934363|gb|AFW68914.1| hypothetical protein ZEAMMB73_981239 [Zea mays]
Length = 235
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 90/208 (43%), Gaps = 29/208 (13%)
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
+SCDI ++NL + +K L QID R + + +VK WAK +N GT +SY+ +
Sbjct: 7 LSCDICVNNLLAVVNTKLLRDYGQIDKRLQQLAFIVKHWAKTRRVNETYQGTLSSYAYVI 66
Query: 187 LVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEI-CAFNIARFSSDKYRKI 245
+ + Q + ILP L+++ A ++ EI CA+ + Y
Sbjct: 67 MCIHLLQ--LRRILPCLQEM------------EATYYVKVEEINCAYFDQVDKLNNYGAH 112
Query: 246 NRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLF 297
NR +++ L SF ++ +S++ + I W N R H +
Sbjct: 113 NRDTVSRLLWSFFHYWAYEHDYTRDVISIRTGRI-ISKERKDWTRRVGNDR-----HLIC 166
Query: 298 IEDPFEQPENSARAVSEKNLAKISNAFE 325
IEDPFE + R V + + + FE
Sbjct: 167 IEDPFEISHDLGRVVDKFTIKILREEFE 194
>gi|429859729|gb|ELA34498.1| pap 25a associated domain family [Colletotrichum gloeosporioides
Nara gc5]
Length = 1135
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/195 (24%), Positives = 94/195 (48%), Gaps = 14/195 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P + E R K++ L ++ V FGS + L S D+DI
Sbjct: 129 MRELYDRLIPTEKVEENRKKLVVKLEKIFNEEWPGNDIRVNLFGSSGNLLCSDDSDVDI- 187
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ ++ +LL K G ++ ++ A+VPI+K ++C
Sbjct: 188 -------CITTPWKEMEGVCMIANLL----AKKGMEKVVCISAAKVPIVKIWDPELGLAC 236
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + +ID R R + ++VK W + +N+ GT +SY+ L+
Sbjct: 237 DMNVNNTLALENTRMVRTYVEIDPRVRPLAMIVKYWTRKRIVNDAAFGGTLSSYTWICLI 296
Query: 189 LFHFQTCVPAILPPL 203
+ Q P +LP L
Sbjct: 297 IGFLQLRDPPVLPSL 311
>gi|363751202|ref|XP_003645818.1| hypothetical protein Ecym_3523 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889452|gb|AET39001.1| Hypothetical protein Ecym_3523 [Eremothecium cymbalariae
DBVPG#7215]
Length = 683
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 96/199 (48%), Gaps = 14/199 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P RE+ R I+ +R+ V+S + + FGS+ ++L+ D+D
Sbjct: 192 IKDFVSYISPNREEIRKRNDAITKIRKAVKSF--WPDSDLHCFGSYATDLYLPGSDIDCV 249
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S +G K ++ L LR+ G ++ +A ARVPI+KF I D
Sbjct: 250 VN-------SKSGDKDNKNALYSFASYLRKNGLASQVSVIAKARVPIIKFVEPVSQIHID 302
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL +K++ A +N+ G +S+ + +
Sbjct: 303 VSFERTNGVDAAKIIRGWLGDTPG-LRELVLTIKQFLYARRLNDVHIGGLGGFSI-ICLT 360
Query: 190 FHFQTCVPAILPPLKDIYP 208
+ F P I+ +D+ P
Sbjct: 361 YSFLKLHPRII--CQDVDP 377
>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens
LYAD-421 SS1]
Length = 647
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 70/146 (47%), Gaps = 6/146 (4%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRL 107
A V PFGS+ + L+ GD+D+ I S + K S+L L +++ G R+
Sbjct: 197 AKVLPFGSYETKLYLPSGDIDLVI------YSHSMMRMDKVSVLHSLANIMKRAGITDRV 250
Query: 108 QFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+A A+VPI+KF T H S DIS++ G K + + R +VL++K +
Sbjct: 251 TIIAKAKVPIIKFVTAHGRFSVDISVNQGNGVDTGKMVKQFLRELPALRSLVLIIKNFLS 310
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQ 193
+N TG SYS+ L + Q
Sbjct: 311 QRSMNEVFTGGLGSYSIVCLAISFLQ 336
>gi|357629686|gb|EHJ78305.1| hypothetical protein KGM_22719 [Danaus plexippus]
Length = 592
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 125/293 (42%), Gaps = 49/293 (16%)
Query: 50 VEPFGSFVSNLFSRWGDLDISI--ELSNGSCISSAGKKVKQS---------------LLG 92
V PFGS V+ DLD+ + L++G +S + V Q L+G
Sbjct: 210 VLPFGSSVNGFGKMGCDLDLVLTNSLTDGM-MSPTNRLVYQEKRSEGSRGPWQRHMELVG 268
Query: 93 DLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQID 152
LL LR G R+Q + +ARVPI+K+ ++ D+ N+ G S L+ + +D
Sbjct: 269 ALLE-LRVPGA-TRVQRILNARVPIVKYSQELADVDVDLCFKNMSGVHMSALLYSLGALD 326
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQTCVPAILPPLKDIY--PG 209
+ + V+ WA A + P G + ++ L+L+VLF T ILP + + G
Sbjct: 327 PAGPALAVSVRRWAAAVQLTQPHPGRWITNFPLTLMVLFFLMT--QKILPTFRCLLECAG 384
Query: 210 NLVDDLKGVRANAERQIAEICAF--NIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLK 267
L D N C F +++R YR + L L + F E +S +
Sbjct: 385 RLYTD------NIN------CTFVRDLSRLPPHSYRP-SSDDLQTLLLKFFEFYSQFDFQ 431
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
+ + + + IR PN PL+I +P E N +R VS + ++
Sbjct: 432 EHAISVI----EGKPIRK-----PNTLPLYIVNPLEPALNVSRNVSYEECERL 475
>gi|159463436|ref|XP_001689948.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283936|gb|EDP09686.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1846
Score = 67.0 bits (162), Expect = 2e-08, Method: Composition-based stats.
Identities = 60/201 (29%), Positives = 94/201 (46%), Gaps = 14/201 (6%)
Query: 17 MLNPLREDWETRMKVISDLREVV--ESVESLRGATVE-PFGSFVSNLFSRWGDLDISIE- 72
M P ED R+ V++DL +V ++ SL G+ P GSF+ F+ L++++
Sbjct: 1 MEAPTPEDDGARLDVVADLARLVSADTAGSLGGSLAAVPHGSFLMGCFTSASALEVAVTG 60
Query: 73 ----LSNGSCISSAGKKVKQSLLGDLLR--ALRQKGGYRRLQFVA---HARVPILKFETI 123
+ G + + Q L L R L + G VA ARVP + FE
Sbjct: 61 RLPASTQGGEATDVERLPPQERLQLLERVHGLVRDSGMAAQGTVAINRTARVPSVSFEHA 120
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
I + + +++ + + Q++ R R + LV+ WA+A +NNP GTFNS++
Sbjct: 121 GSGIPVRVCVAYPGFALRAHVVRGLMQLEPRLRSLTQLVELWAEARGLNNPAAGTFNSWA 180
Query: 184 LSLLVLFHFQTC-VPAILPPL 203
L LV F QT +LPPL
Sbjct: 181 LMNLVFFAVQTFQSQPLLPPL 201
>gi|46105240|ref|XP_380424.1| hypothetical protein FG00248.1 [Gibberella zeae PH-1]
Length = 1289
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 129/322 (40%), Gaps = 59/322 (18%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E R K++S L ++ V FGS + L S D+DI
Sbjct: 287 MREVFDRLLPTAAVEENRKKLVSKLEKIFNDEWPGHDIRVNLFGSSGNLLCSDDSDVDIC 346
Query: 71 IELS----NGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
I S G C ++ +LL K G ++ ++ A+VPI+K
Sbjct: 347 ITTSWHELEGVC-----------MIANLL----AKRGMEKVVCISAAKVPIVKIWDPELG 391
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLS 185
++CD++++N ++ + ID R R++ +++K W + +N+ GT +SY+
Sbjct: 392 LACDMNVNNTLALENTRMVRTYIDIDPRVRELAMIIKYWTRRRIVNDAAFGGTLSSYTWI 451
Query: 186 LLVLFHFQTCVPAILP-----PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
L++ Q P +LP P K P + D A +I + +
Sbjct: 452 CLIIAFLQLRNPPVLPCLHQSPHKLPKPDGTLPDF---------------ADDIDKLAG- 495
Query: 241 KYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPN 292
Y N+SS A L F ++ LS++ +L I +W + +N
Sbjct: 496 -YGSKNKSSTAELLFQFFRFYAHEFDYDKQVLSVRQGKL-ITKHEKKWHYAINNQ----- 548
Query: 293 NHPLFIEDPFEQPENSARAVSE 314
L +E+PF N E
Sbjct: 549 ---LCVEEPFNTSRNLGNTADE 567
>gi|408395224|gb|EKJ74408.1| hypothetical protein FPSE_05415 [Fusarium pseudograminearum CS3096]
Length = 1288
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 129/322 (40%), Gaps = 59/322 (18%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E R K++S L ++ V FGS + L S D+DI
Sbjct: 287 MREVFDRLLPTAAVEENRKKLVSKLEKIFNDEWPGHDIRVNLFGSSGNLLCSDDSDVDIC 346
Query: 71 IELS----NGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQN 126
I S G C ++ +LL K G ++ ++ A+VPI+K
Sbjct: 347 ITTSWHELEGVC-----------MIANLL----AKRGMEKVVCISAAKVPIVKIWDPELG 391
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLS 185
++CD++++N ++ + ID R R++ +++K W + +N+ GT +SY+
Sbjct: 392 LACDMNVNNTLALENTRMVRTYIDIDPRVRELAMIIKYWTRRRIVNDAAFGGTLSSYTWI 451
Query: 186 LLVLFHFQTCVPAILP-----PLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD 240
L++ Q P +LP P K P + D A +I + +
Sbjct: 452 CLIIAFLQLRNPPVLPCLHQSPHKLPKPDGTLPDF---------------ADDIDKLAG- 495
Query: 241 KYRKINRSSLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPN 292
Y N+SS A L F ++ LS++ +L I +W + +N
Sbjct: 496 -YGSKNKSSTAELLFQFFRFYAHEFDYDKQVLSVRQGKL-ITKHEKKWHYAINNQ----- 548
Query: 293 NHPLFIEDPFEQPENSARAVSE 314
L +E+PF N E
Sbjct: 549 ---LCVEEPFNTSRNLGNTADE 567
>gi|341038737|gb|EGS23729.1| poly(A) RNA polymerase-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1199
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 127/285 (44%), Gaps = 49/285 (17%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
R K+++ L ++ R V FGS +N S D+DI CI++ ++++
Sbjct: 224 RKKLVAKLEKLFNDKWPGRDIKVHLFGSSGNNTCSDDSDVDI--------CITTPWRELE 275
Query: 88 Q-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
++ +LL + G ++ V+ A+VPI+K ++CD++++N ++ +
Sbjct: 276 NVCMIAELL----HQHGMEKVVCVSSAKVPIVKIWDPELKLACDMNVNNTLALENTRMVR 331
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
ID R R + ++VK W P GT +SY+ +V+ Q P +LP L
Sbjct: 332 TYVDIDERVRQLAMIVKYW-------TPFGGTLSSYTWICMVIAFLQLRDPPVLPAL--- 381
Query: 207 YPGNLVDDLKGVRANAER-QIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSG-- 263
+ D L+ R + R + A+ RF DK N+ SLA L +F ++
Sbjct: 382 ---HQCDGLRLPRDDNTRSEFADDVEALQERF-GDK----NKESLASLLFNFFRFYAHEF 433
Query: 264 ------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPF 302
LS++ +L + +W HI SN L +E+PF
Sbjct: 434 DYDKYVLSIRMGKL-LTKTEKKW-HIGSNNM-------LCVEEPF 469
>gi|378734522|gb|EHY60981.1| poly(A) polymerase [Exophiala dermatitidis NIH/UT8656]
Length = 1374
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/229 (27%), Positives = 93/229 (40%), Gaps = 29/229 (12%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L+F T I CDI+ N + L D R ++ L VK WAK DIN P G
Sbjct: 409 LEF-TADCGIQCDINFTNFVALYNTALLKLYHDFDKRVGELGLFVKIWAKMRDINTPYHG 467
Query: 178 TFNSYSLSLLVLFHFQTCV-PAILPPLK-------DIYPGNLVDDLKGVRANAERQIAEI 229
T +SY ++VL + P ++P L+ D +P V ++G I +
Sbjct: 468 TLSSYGYIMMVLHYLMNVASPPVIPNLQHLVTCQDDWFPDLKVKLIEGC------DIRYL 521
Query: 230 C-AFNIARFSSDKYRKINRSSLAHLFVSFLEKFS---GLSLKASELGICPFTG---QWEH 282
C +IA + + NR + L F + ++ G + I G + E
Sbjct: 522 CDPRSIAEVRQEMASRPNRETSGQLLRGFFQYYATREGFHWTRDVISIRRKGGIVSKQEK 581
Query: 283 IRSNTRWL--PNNHP-----LFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ +W+ +NH L IEDPFE N AR V L I + F
Sbjct: 582 GWTEAKWVQKGDNHVRLRYLLAIEDPFEVDHNIARTVGHNGLVAIRDEF 630
>gi|198432244|ref|XP_002119718.1| PREDICTED: similar to rCG24089 [Ciona intestinalis]
Length = 402
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/293 (23%), Positives = 132/293 (45%), Gaps = 33/293 (11%)
Query: 49 TVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQ 108
T++ FGS V+ S+ D+D+ LSN + + + ++ ++R L++ + ++
Sbjct: 94 TLKLFGSSVNGFGSKDSDVDVC--LSN---LPNTKQNKQRKHFEQIVRCLKKCKQFNDVE 148
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
++ H+RVPI+K ++ + S++N SK L S+ID R +V L+K K
Sbjct: 149 YI-HSRVPIIKCIHKKSSLHFEFSLNNEWPIYNSKLLHRYSKIDERCLVLVHLIKYLVKQ 207
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAE 228
++ P G +SY+ +++VLF+ Q +LP L+ +LK A+ Q+ +
Sbjct: 208 CNVVGPFHGYMSSYAYTIMVLFYLQQIDTPVLPVLQ---------ELKANDTKAQVQMVD 258
Query: 229 ICAFNIARFSSDKYRKI------NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEH 282
+ + S+ + + I NR S L++ L ++ + G C T +
Sbjct: 259 GYSTYYFKQSNQELKSIWPKFNQNRMSCGELWMGMLHFYTNTDDFIN--GNCVITIRQHG 316
Query: 283 IRSNTRWLPNNHPLF----------IEDPFEQPENSARAVSEKNLAKISNAFE 325
S+ + +P IEDPFE N +++ K L I F+
Sbjct: 317 FLSSKEKIYQQYPSLSRLWHKGRFRIEDPFELHRNLGSSLNSKTLPLIVKVFK 369
>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
MF3/22]
Length = 732
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 97/196 (49%), Gaps = 19/196 (9%)
Query: 5 NVLEPILKDI---LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + K++ L ++P + E R V+ + ++ V S + V PFGSF + L+
Sbjct: 144 NVAELMHKEVEAYLKYVSPTPVEHEVRWMVVQLISSSIKRVYS--DSEVLPFGSFGTKLY 201
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ ++ + K + L L +++ G ++ ++ ARVPI+KF
Sbjct: 202 LPQGDIDLVVQSRTLASFE------KVTALKSLANIVKRTGLADKVTIISQARVPIIKFT 255
Query: 122 TIHQNISCDISIDN----LCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
T++ + DIS++ G + ++FL + R +VL+VK + K ++N +G
Sbjct: 256 TLYGRFAVDISMNQSNGVKTGDMINRFLNEFPAL----RAIVLIVKSFLKQRNLNEVYSG 311
Query: 178 TFNSYSLSLLVLFHFQ 193
SY++ L + H Q
Sbjct: 312 GLGSYAIVCLAVSHLQ 327
>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
Length = 897
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 127/322 (39%), Gaps = 65/322 (20%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D+ + P + R KV +R+ V + + FGS +NLF D+D+ +E
Sbjct: 88 DLYHWIKPNEIEVRLRTKVYEKVRDSVSQRWQHKPIKISMFGSLRTNLFLPTSDIDVLVE 147
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ + + G LG+ R L + A VPI+K +S DIS
Sbjct: 148 CDD--WVGTPG-----DWLGETARGLENDNIAESVTVFGGAFVPIVKMVDRDTRLSIDIS 200
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
+ + G + ++ + + +VLL+K++ ++N TG +SY L LL++ F
Sbjct: 201 FNTVQGVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSYGLVLLLVNFF 260
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS--SL 250
Q + A N+ ++R I S +L
Sbjct: 261 Q-----------------------------------LYALNM------RHRTIYDSGVNL 279
Query: 251 AHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT--------RWLPNNHPLFIEDPF 302
HL + FLE +S + E+GI P GQ +I + R P N L +EDP
Sbjct: 280 GHLLLRFLEVYS-MEFNYEEIGISP--GQCCYISKSAAGARYGHKRAQPGN--LALEDPL 334
Query: 303 EQPENSARAVSEKNLAKISNAF 324
+ R S N + I+NAF
Sbjct: 335 LTANDVGR--STYNFSSIANAF 354
>gi|428168714|gb|EKX37655.1| hypothetical protein GUITHDRAFT_77870 [Guillardia theta CCMP2712]
Length = 244
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 106/235 (45%), Gaps = 27/235 (11%)
Query: 107 LQFVAHARVPILKFETIHQNIS---CDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
++ ++ ARVPI+KFE S CD+S++N+ + + L+ + +D R R +++ VK
Sbjct: 4 VETLSDARVPIIKFEADDGRGSAFHCDLSVNNVLACVNTDLLYTYTMLDARTRPLIMCVK 63
Query: 164 EWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLV---DDLKGVRA 220
W K I+N +SY+ +L+V+ + Q +LP L++I D V
Sbjct: 64 HWVKQRQIHNAFKRYLSSYTYALMVIQYLQ--YERVLPCLQNIRREEAKWKNDPSFSVLW 121
Query: 221 NAERQIAEICAF--NIARFSSDKYR-KINRSSLAHLFVSFLEKFSG--------LSLKAS 269
N E A C F N + + + + N SSL L V F +S +S+++
Sbjct: 122 NGE---AYDCYFYRNFETLAGNSTKLRNNSSSLGLLLVGFFHFYSNVFEVDQGVVSIRSG 178
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L G W+ + N H L IEDPF+ + R V ++ I F
Sbjct: 179 RLLKKKAKG-WD----SPEGFRNQHILCIEDPFDVDLDLGRYVIGTTVSDIREEF 228
>gi|365758850|gb|EHN00675.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 94/190 (49%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR+ V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKSEIKCRNRTIDKLRQAVKKLWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S K ++ + +L R L+ +G R++ + RVPI+KF + D
Sbjct: 238 IN-------SRHHDKEDRNYIYELARYLKNEGLAIRMEVIVRTRVPIIKFIEPLSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G ++ + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAARLIREWLRDSPG-LRELVLVIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 642
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 94/190 (49%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + I LR+ V+ + S A + FGSF ++L+ D+D
Sbjct: 180 IKDFVHYISPSKSEIKCRNRTIDKLRQAVKQLWS--DADLHVFGSFATDLYLPGSDIDCV 237
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S K ++ + +L R L+ +G R++ + RVPI+KF + D
Sbjct: 238 IN-------SRHHDKEDRNYIYELARYLKNEGLAIRMEVIVRTRVPIIKFIEPLSQLHID 290
Query: 131 ISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G ++ + W+ G R++VL++K++ + +NN TG +++ LV
Sbjct: 291 VSFERTNGLEAARLIREWLRDSPG-LRELVLVIKQFLHSRRLNNVHTGGLGGFTVICLV- 348
Query: 190 FHFQTCVPAI 199
+ F P I
Sbjct: 349 YSFLNMHPRI 358
>gi|302915118|ref|XP_003051370.1| hypothetical protein NECHADRAFT_93859 [Nectria haematococca mpVI
77-13-4]
gi|256732308|gb|EEU45657.1| hypothetical protein NECHADRAFT_93859 [Nectria haematococca mpVI
77-13-4]
Length = 1290
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/246 (23%), Positives = 111/246 (45%), Gaps = 27/246 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P E R K++ L ++ V FGS + L S D+DI
Sbjct: 287 MREVYDRLLPTAAVEENRKKLVLKLEKIFNDEWPGHDIRVHLFGSSGNLLCSDDSDVDI- 345
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI+++ ++++ ++ DLL + G ++ ++ A+VPI+K ++C
Sbjct: 346 -------CITTSWRELEGVCMIADLL----ARRGMEKVVCISAAKVPIVKIWDPELGLAC 394
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + D R R++ ++VK W + +N+ GT +SY+ L+
Sbjct: 395 DMNVNNTLALENTRMVRTYIDTDPRVRELAMIVKYWTRRRIVNDAAFGGTLSSYTWICLI 454
Query: 189 LFHFQTCVPAILP---------PLKDIYPGNLVDDLK---GVRANAERQIAEICAFNIAR 236
+ Q P +LP P D + DDLK G + +AE+ F R
Sbjct: 455 IAFLQLRSPPVLPALHQLSHKLPRPDGTMPDFADDLKKLSGFGNKNKSSVAELL-FQFFR 513
Query: 237 FSSDKY 242
F + ++
Sbjct: 514 FYAHEF 519
>gi|406860522|gb|EKD13580.1| pap 25a associated domain family protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1271
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 93/216 (43%), Gaps = 33/216 (15%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVA 111
FGS + L + D+DI CI++ + ++ DLL K G ++ +
Sbjct: 318 FGSSGNLLCTDESDVDI--------CITTEWDAMPNVCMVADLL----AKNGMEKVLCIG 365
Query: 112 HARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDI 171
A++PI+K ++CD++++N ++ + QID R R + +++K W K +
Sbjct: 366 GAKIPIVKIWDPELKLACDMNVNNPLALENTRMIKTYVQIDPRVRPLAMIIKHWTKERIV 425
Query: 172 NNPKTG-TFNSYSLSLLVLFHFQTCVPAILPPL----KDIYPGNLVDDLKGVRANAERQI 226
N+ G T +SY+ ++++ Q P ILP L +D P R N +
Sbjct: 426 NDAAFGCTLSSYTWICMIIYFLQNRNPPILPALHQRPQDKLP----------RPNGDESA 475
Query: 227 AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
+A F D N+ SL L F +S
Sbjct: 476 FADDLHALAGFGKD-----NQDSLGDLLFQFFRYYS 506
>gi|221061669|ref|XP_002262404.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193811554|emb|CAQ42282.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 508
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
+ A VPI K NI CDISI+N + +K + + D R + ++K WAK
Sbjct: 277 IIKASVPIAKIYKEQNNI-CDISINNTVAIVNTKLVSSLCNTDERVTIINRVIKYWAKQK 335
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+INN GTF+SY+L LL + Q +LPP K I
Sbjct: 336 NINNRSQGTFSSYALFLLTYYFLQNLETPLLPPYKSI 372
>gi|350295566|gb|EGZ76543.1| hypothetical protein NEUTE2DRAFT_98466 [Neurospora tetrasperma FGSC
2509]
Length = 1111
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 110/260 (42%), Gaps = 45/260 (17%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVES------VESLRGATVEPFGSFVSNL 60
L+P+ D + L ED E K+ + LRE+ +S VE R V+ +++
Sbjct: 111 LDPVDPDKIKSR--LSEDHEA--KLTTTLRELYDSLIPTPEVERKRKKLVQKLEKILND- 165
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKF 120
W DI + L S G+LL + G ++ V+ A+VPI+K
Sbjct: 166 --EWPGHDIQVHLFGSS--------------GNLLCSDDSDDGMEKVVCVSSAKVPIVKI 209
Query: 121 ETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTF 179
++CD++++N ++ + +ID R R + +++K W + IN+ GT
Sbjct: 210 WDPELQLACDMNVNNTLALENTRMVRTYVEIDERVRPLAMIIKYWTRRRIINDAAFGGTL 269
Query: 180 NSYSLSLLVLFHFQTCVPAILPPL----------KDIYPGNLVDDLKGVRANAERQIAEI 229
+SY+ L + Q P +LP L D + DD+ +R ++ +
Sbjct: 270 SSYTWICLTIAFLQLRDPPVLPALHQENSLKLLRPDGTKSDFADDIDKLRGFGDKNKDSL 329
Query: 230 CA--FNIARFSS-----DKY 242
A FN RF + DKY
Sbjct: 330 AALLFNFFRFYAHEFDYDKY 349
>gi|396491063|ref|XP_003843481.1| hypothetical protein LEMA_P075910.1 [Leptosphaeria maculans JN3]
gi|312220060|emb|CBY00002.1| hypothetical protein LEMA_P075910.1 [Leptosphaeria maculans JN3]
Length = 729
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 93/248 (37%), Gaps = 54/248 (21%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ +N G S+ L S D R R +VL VK WAK +N+ +G
Sbjct: 446 LDFPKTGCGIQCDINFENPLGIHNSQMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSSYSG 505
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIYP-------------GNLVDDLKGVRANAE 223
T +SY L+VL + P + P L+ P +D+ E
Sbjct: 506 TLSSYGWVLMVLHYLVNVAQPPVCPNLQHSIPLPTEVAALETFFKETTIDNYNVRFWRNE 565
Query: 224 RQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS--------------------- 262
++I I A R S+ NR S+ L F + +S
Sbjct: 566 QEI--IKAAQAGRLSN------NRQSIGALLRGFFQYYSSMSPGYGAQRTPQFYWTTEVL 617
Query: 263 ------GLSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKN 316
GL K S+ + T + R+L IEDPFE N AR V+ +
Sbjct: 618 SLRTPGGLQTKQSKGWVSATTKVTAEKKVTNRYL-----FAIEDPFEIDHNVARTVTHRG 672
Query: 317 LAKISNAF 324
+ I + F
Sbjct: 673 IVAIRDEF 680
>gi|145534217|ref|XP_001452853.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420552|emb|CAK85456.1| unnamed protein product [Paramecium tetraurelia]
Length = 335
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 80/154 (51%), Gaps = 12/154 (7%)
Query: 42 VESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQK 101
+ L+GA + PFGS+ + S DLD + L+ C + ++Q G +R +
Sbjct: 38 MTKLKGA-LYPFGSYCNGFGSEIKDLD-CVFLT--PCDDKSSSLLRQVHAG--IRDYNHQ 91
Query: 102 GGYRRLQF---VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
LQ + HA+VPI+K N+ D+S++N+ G SK L+ SQ+ + + M
Sbjct: 92 NLQPTLQVQAHITHAKVPIIKLVDTTNNVEIDLSVNNINGIANSKLLYEYSQLHPKIKQM 151
Query: 159 VLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
LL+K W K + + KTG+ SYS+ ++ + HF
Sbjct: 152 GLLLKLWGKRNRL--IKTGSLTSYSI-IIFMIHF 182
>gi|183230525|ref|XP_655604.2| poly(A) polymerase [Entamoeba histolytica HM-1:IMSS]
gi|169802884|gb|EAL50218.2| poly(A) polymerase, putative [Entamoeba histolytica HM-1:IMSS]
gi|449704192|gb|EMD44480.1| poly(A) RNA polymerase cid11, putative [Entamoeba histolytica KU27]
Length = 344
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 71/149 (47%), Gaps = 8/149 (5%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS L + GDLDI C S +G++V +L + + + +
Sbjct: 84 YGSTDYGLCLKDGDLDIC-------CSSQSGRQVSAIILESFAECFK-RNNFEIRNVIEK 135
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+K + ++ D+S + QI S+F + + FR + +L+K W K ++N
Sbjct: 136 AKVPIIKMVDLGTKVNIDLSFNQPVAQIHSEFFSTMIHCNKHFRIVAVLLKYWLKTRNLN 195
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
P G +S +L ++L +F + P + P
Sbjct: 196 CPFKGGLSSAALCFMILHYFTSFEPPLFP 224
>gi|70953211|ref|XP_745721.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526133|emb|CAH75138.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 292
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 74/156 (47%), Gaps = 13/156 (8%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF---V 110
GS +N++ + D+D SCI + K S L +L+ ++ +
Sbjct: 91 GSCENNIWIKNSDID--------SCIVVENCEDKNSYLY-ILKVIKSAINLIDPSLTVNI 141
Query: 111 AHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
A VPI K NI CDISI+N + ++ + + ID R + ++K WAK +
Sbjct: 142 IKASVPIAKIYKDQTNI-CDISINNTVAIVNTQLVSSLCSIDERIPIINRIIKYWAKQKN 200
Query: 171 INNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
INN GTF+SY+L LL F FQ +LP K I
Sbjct: 201 INNRSQGTFSSYALFLLTYFFFQNLETPLLPSYKSI 236
>gi|295982236|pdb|3HIY|A Chain A, Minor Editosome-Associated Tutase 1 With Bound Utp And Mg
gi|295982237|pdb|3HIY|B Chain B, Minor Editosome-Associated Tutase 1 With Bound Utp And Mg
gi|295982240|pdb|3HJ4|A Chain A, Minor Editosome-Associated Tutase 1
gi|295982241|pdb|3HJ4|B Chain B, Minor Editosome-Associated Tutase 1
Length = 384
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 82/168 (48%), Gaps = 6/168 (3%)
Query: 31 VISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
VI +L++ V + L A VE FGS VS + D DIS+ N S ++V +
Sbjct: 27 VIHELQKRVLDIGXLAVNKAHVELFGSHVSGFCTPHSDADISLTYRNFSPWLQGXERVDE 86
Query: 89 SLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
R ++ G ++++ AR+P+++F I CD+SI N+ G SK L
Sbjct: 87 QNNKRXTRFGKEASAXGXEDVRYI-RARIPVVQFTDGVTGIHCDVSIGNIGGVENSKILC 145
Query: 147 WISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
I Q+ F + LVK W KA ++ P+ TFNS++++ L Q
Sbjct: 146 AIRQVFPDFYGAYIHLVKAWGKAREVIAPERSTFNSFTVTTXALXVLQ 193
>gi|168031583|ref|XP_001768300.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680478|gb|EDQ66914.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 787
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/338 (23%), Positives = 137/338 (40%), Gaps = 52/338 (15%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D + P E+ + R + + VV+S+ + V+ FGSF + L+ D+D+
Sbjct: 209 DFCEFVAPTEEEQQMRETAVERVSGVVQSI--WPHSQVKVFGSFATGLYLPTSDVDV--- 263
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ +G Q L L +AL + + +Q + ARVPI+KF NI DIS
Sbjct: 264 -----VVLDSGCTALQDGLKALAKALTRGHVGKNIQVIGKARVPIIKFVETVSNIPFDIS 318
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G + F+ R + L++K + + ++N G SY+L +++L H
Sbjct: 319 FDVANGPEAADFIKAAMGAIPPLRPLCLVLKIFLQQRELNEVYQGGIGSYALLVMLLTHL 378
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q ++P R SS ++L
Sbjct: 379 Q------------MHPSK------------------------RRVSSRGQGPPLETNLGI 402
Query: 253 LFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSAR 310
L V FL+ + G +L ++GI G + + + + P L +EDP + P+N
Sbjct: 403 LLVDFLDLY-GRTLNMKDVGISCRGGGRFFPKRDRGFNDSKRPFLLCVEDP-QSPDNDI- 459
Query: 311 AVSEKNLAKISNAFEMTHFRLTS-TNQTRYALLSSLAR 347
+ + K+ +AF M H LT+ + +LS + R
Sbjct: 460 GKNSYAIQKVRSAFMMAHRLLTNLSANNEVGILSRIVR 497
>gi|389742809|gb|EIM83995.1| hypothetical protein STEHIDRAFT_170415 [Stereum hirsutum FP-91666
SS1]
Length = 1212
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 69/116 (59%), Gaps = 7/116 (6%)
Query: 94 LLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDG 153
L RAL Q+ G+ ++ + A VPI+KF+ NI CDI+I++ G ++ + ++
Sbjct: 322 LGRAL-QRAGFVSVECIPGATVPIVKFKDPRTNIHCDININDRLGVKNTELIARYIELLP 380
Query: 154 RFRDMVLLVKEWAKAHDINNP--KTG--TFNSYSLSLLVLFHFQTCVPAILPPLKD 205
R ++ +K+WA H +NNP + G +F+SY+L+++ + FQ + +LP L+D
Sbjct: 381 VLRPLLSAIKKWAGVHGLNNPSGRQGAVSFSSYALTVMSIAFFQ--MKGLLPNLQD 434
>gi|322967062|sp|A9JTS5.1|STPAP_XENTR RecName: Full=Speckle targeted PIP5K1A-regulated poly(A)
polymerase; Short=Star-PAP; AltName: Full=RNA-binding
motif protein 21; Short=RNA-binding protein 21; AltName:
Full=U6 snRNA-specific terminal uridylyltransferase 1;
Short=U6-TUTase
gi|160773864|gb|AAI55458.1| tut1 protein [Xenopus (Silurana) tropicalis]
Length = 843
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 96/224 (42%), Gaps = 25/224 (11%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q V AR P++ F+ + D++++N S FL S +D R +V V
Sbjct: 316 GVHGVQSVPTARRPVIHFQHKTSGLRGDVTLNNRLALRNSSFLRLCSDLDARVPQLVYTV 375
Query: 163 KEWAKAHDI-NNPKTGT--FNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVR 219
+ WA+ + + NP G N+Y+L+LLV F QT P +LP L + R
Sbjct: 376 RYWARVNQLAGNPFGGGPLLNNYALTLLVFFFLQTRNPPVLPTLVHL------------R 423
Query: 220 ANAERQIAEICAFNIARFSSDKYR---KINRSSLAHLFVSFLEKFSGLSLKASELGICPF 276
++ ++ F SD + N+ SL+ L F ++ L L L +CP
Sbjct: 424 EETANEVPQVIDGWDCSFPSDPAQVKESGNQQSLSSLLSEFFSFYASLDLHL--LILCPC 481
Query: 277 TGQWEHIRSNT---RWLPNNH--PLFIEDPFEQPENSARAVSEK 315
G + ++ W PL I+DPFE N VS +
Sbjct: 482 NGLTIPLPFSSPPPAWSEGFRLGPLNIQDPFELSHNVCGNVSSR 525
>gi|358379586|gb|EHK17266.1| hypothetical protein TRIVIDRAFT_41696 [Trichoderma virens Gv29-8]
Length = 1287
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 56/245 (22%), Positives = 110/245 (44%), Gaps = 25/245 (10%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ +I L P + E R K+++ L + + V FGS + L S D+DI
Sbjct: 284 MNEIYNKLLPTEKVEEDRRKLVNKLETIFNTEWPGHDIKVHLFGSSGNLLCSDDSDVDI- 342
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI+++ +++ ++ DLL + G ++ ++ A+VPI+K ++C
Sbjct: 343 -------CITTSWHEMEDVCMIADLL----ARRGMEKVVCISAAKVPIVKIWDPELGLAC 391
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + + D R R + +++K W + +N+ GT +SY+ L+
Sbjct: 392 DMNVNNTLALENTRMVRTYVEADPRVRMLAMILKHWTRRRIVNDAAFGGTLSSYTWICLI 451
Query: 189 LFHFQTCVPAILP-----PLKDIYPGNLV----DDLKGVR--ANAERQIAEICAFNIARF 237
+ Q P +LP P K P V D+LK ++ N + F RF
Sbjct: 452 IAFLQLRNPPVLPALHQLPHKTTKPDGTVSDFADNLKKIKGFGNKNKSTEAELLFQFFRF 511
Query: 238 SSDKY 242
+ ++
Sbjct: 512 YAHEF 516
>gi|18423551|ref|NP_568798.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|27754278|gb|AAO22592.1| unknown protein [Arabidopsis thaliana]
gi|332009022|gb|AED96405.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 530
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 87/326 (26%), Positives = 130/326 (39%), Gaps = 75/326 (23%)
Query: 39 VESVESL-----RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGD 93
VESV S+ VE FGS+ + L+ D+D+ I +G Q L
Sbjct: 146 VESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDV--------VILESGLTNPQLGLRA 197
Query: 94 LLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDG 153
L RAL Q+G + L +A ARVPI+KF NI+ D+S D G ++F+
Sbjct: 198 LSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQDAVSKLP 257
Query: 154 RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
R + L++K + + ++N +G SY+L
Sbjct: 258 PLRPLCLILKVFLQQRELNEVYSGGIGSYAL----------------------------- 288
Query: 214 DLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH----LFVSFLEKFSGLSLKAS 269
+A + AF KY K RS+ H L V F + F G L +
Sbjct: 289 ------------LAMLIAFL-------KYLKDGRSAPEHNLGVLLVKFFD-FYGRKLNTA 328
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
++GI G + N +L P + IEDP + PEN S N +I +AF M
Sbjct: 329 DVGISCKMGGSFFSKYNKGFLNRARPSLISIEDP-QTPENDI-GKSSFNYFQIRSAFAMA 386
Query: 328 HFRLTSTNQT-----RYALLSSLARP 348
LT+T ++L ++ RP
Sbjct: 387 LSTLTNTKAILSLGPNRSILGTIIRP 412
>gi|343172004|gb|AEL98706.1| Nucleotidyltransferase family protein, partial [Silene latifolia]
Length = 501
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 66/344 (19%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L+P E+ +R I + +V++ + VE FGS+ + L+ D+D+ I
Sbjct: 129 DFCDFLSPTPEEGRSRNSAIQRVSDVIKYI--WPNCRVEVFGSYRTGLYLPSSDIDVVIL 186
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
SN K Q L L RAL QKG +++Q + ARVPI+KF ++ D+S
Sbjct: 187 DSN--------IKSPQIGLQALSRALSQKGIAKKIQVIGKARVPIIKFVEKVTDVQFDVS 238
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G ++++ + R + L++K + + ++N +G SY+L +++
Sbjct: 239 FDVDNGPKAAEYIKDVISRLPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAM- 297
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
LK + G +Y + N L
Sbjct: 298 ----------LKSVNEGQ------------------------------RYPEHN---LGV 314
Query: 253 LFVSFLEKFSGLSLKASELGI-CPFTGQWEHIRSNTRWLPNNHPLF--IEDPFEQPENSA 309
L V F E F L ++G+ C G + ++S + N P IEDP + PEN
Sbjct: 315 LLVKFFE-FYAHKLNTWDVGVSCNGRGTF-FLKSRKGFQQNGRPFLICIEDP-QSPENDI 371
Query: 310 RAVSEKNLAKISNAFEMTHFRLTSTNQTR-----YALLSSLARP 348
S N +I AF M + LT+T ++L ++ RP
Sbjct: 372 -GKSSYNYFQIKLAFMMAYASLTNTKAIMNLGPDRSILGTIIRP 414
>gi|343172002|gb|AEL98705.1| Nucleotidyltransferase family protein, partial [Silene latifolia]
Length = 501
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 66/344 (19%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L+P E+ +R I + +V++ + VE FGS+ + L+ D+D+ I
Sbjct: 129 DFCDFLSPTPEEGRSRNSAIQRVSDVIKYI--WPNCRVEVFGSYRTGLYLPSSDIDVVIL 186
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
SN K Q L L RAL QKG +++Q + ARVPI+KF ++ D+S
Sbjct: 187 DSN--------IKSPQIGLQALSRALSQKGIAKKIQVIGKARVPIIKFVEKVTDVQFDVS 238
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G ++++ + R + L++K + + ++N +G SY+L +++
Sbjct: 239 FDVDNGPKAAEYIQDVISRLPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAM- 297
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
LK + G +Y + N L
Sbjct: 298 ----------LKSVNEGQ------------------------------RYPEHN---LGV 314
Query: 253 LFVSFLEKFSGLSLKASELGI-CPFTGQWEHIRSNTRWLPNNHPLF--IEDPFEQPENSA 309
L V F E F L ++G+ C G + ++S + N P IEDP + PEN
Sbjct: 315 LLVKFFE-FYAHKLNTWDVGVSCNGRGTF-FLKSRKGFQQNGRPFLICIEDP-QSPENDI 371
Query: 310 RAVSEKNLAKISNAFEMTHFRLTSTNQTR-----YALLSSLARP 348
S N +I AF M + LT+T ++L ++ RP
Sbjct: 372 -GKSSYNYFQIKLAFMMAYASLTNTKAIMNLGPDRSILGTIIRP 414
>gi|169621199|ref|XP_001804010.1| hypothetical protein SNOG_13807 [Phaeosphaeria nodorum SN15]
gi|160704201|gb|EAT78831.2| hypothetical protein SNOG_13807 [Phaeosphaeria nodorum SN15]
Length = 993
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 93/193 (48%), Gaps = 23/193 (11%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ + P +E+ R K ++ ++ ++E+ V FGS + L++ D+DI
Sbjct: 275 MRELYDRIQPTQENTAVRNKFVAKVQRILETEFPGNEFKVSIFGSSGNMLWTAESDVDI- 333
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI + K++++ + L AL K G R+ + A+V I+K +SCD
Sbjct: 334 -------CIQTPMKRLEEMHM--LAEAL-DKHGMERVVCIPAAKVRIVKVWDPELQLSCD 383
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT-----------GTF 179
++++N+ ++ + QID R R + ++VK W K +N+ + G
Sbjct: 384 MNVNNVAALENTRMINLYVQIDDRVRPLAMIVKHWTKRRILNDAGSFADDLEALRGFGKS 443
Query: 180 NSYSLSLLVLFHF 192
N+ SL L LFHF
Sbjct: 444 NAESLGQL-LFHF 455
>gi|374107870|gb|AEY96777.1| FAEL207Wp [Ashbya gossypii FDAG1]
Length = 626
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 96/199 (48%), Gaps = 14/199 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + +R+ V+ A + FGS+ ++L+ D+D
Sbjct: 199 IKDFVSYISPNKTEIQLRNDALKRIRDAVQDF--WPDANLHCFGSYATDLYLPGSDIDCV 256
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S +G K ++ L L L++ G ++ +A ARVPI+KF I D
Sbjct: 257 VN-------SKSGDKDNKNALYSLASYLKRNGLATQVSVIAKARVPIIKFVEPASQIHID 309
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +N+ G +S+ + +
Sbjct: 310 LSFERTNGVEAAKIIRGWLHDTPG-LRELVLIVKQFLHARRLNDVHIGGLGGFSI-ICLA 367
Query: 190 FHFQTCVPAILPPLKDIYP 208
+ F P I+ +DI P
Sbjct: 368 YSFLKLHPRII--CRDIEP 384
>gi|167384681|ref|XP_001737054.1| poly(A) RNA polymerase cid11 [Entamoeba dispar SAW760]
gi|165900330|gb|EDR26674.1| poly(A) RNA polymerase cid11, putative [Entamoeba dispar SAW760]
Length = 344
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS L + GDLDI C S +G++V +L + + + +
Sbjct: 84 YGSTHYGLCLKDGDLDIC-------CTSQSGRQVSPVVLESFAECFK-RNKFEVKNVIET 135
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+K + + D+S + QI S+F + + FR + +L+K W K ++N
Sbjct: 136 AKVPIIKMIDLETKVKIDLSFNQPVAQIHSEFFYTMIACIKHFRIVAVLLKYWLKIRNLN 195
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
P G +S +L ++ +F T P + P
Sbjct: 196 CPFKGGLSSAALCFMICHYFTTFDPPLFP 224
>gi|254839887|gb|ACT83521.1| mitochondrial editosome-like complex associated TUTase [Trypanosoma
brucei]
Length = 406
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 83/168 (49%), Gaps = 6/168 (3%)
Query: 31 VISDLREVVESVESL--RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQ 88
VI +L++ V + L A VE FGS VS + D IS+ N S ++V +
Sbjct: 28 VIHELQKRVLDIGMLAVNKAHVELFGSHVSGFCTPHSDAAISLTYRNFSPWLQGMERVDE 87
Query: 89 SLLGDLLRALRQKG--GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLF 146
+ R ++ G ++++ AR+P+++F I CD+SI N+ G SK L
Sbjct: 88 QNNKRMTRFGKEASAMGMEDVRYI-RARIPVVQFTDGVTGIHCDVSIGNIGGVENSKILC 146
Query: 147 WISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
I Q+ F + LVK W KA ++ P+ TFNS++++ + L Q
Sbjct: 147 AIRQVFPDFYGAYIHLVKAWGKAREVIAPERSTFNSFTVTTMALMVLQ 194
>gi|45190400|ref|NP_984654.1| AEL207Wp [Ashbya gossypii ATCC 10895]
gi|50401682|sp|Q9HFW3.1|TRF5_ASHGO RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
Full=Topoisomerase 1-related protein TRF5
gi|10444115|gb|AAG17722.1|AF286114_3 Trf5 [Eremothecium gossypii]
gi|44983296|gb|AAS52478.1| AEL207Wp [Ashbya gossypii ATCC 10895]
Length = 626
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 96/199 (48%), Gaps = 14/199 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + + + R + +R+ V+ A + FGS+ ++L+ D+D
Sbjct: 199 IKDFVSYISPNKTEIQLRNDALKRIRDAVQDF--WPDANLHCFGSYATDLYLPGSDIDCV 256
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ S +G K ++ L L L++ G ++ +A ARVPI+KF I D
Sbjct: 257 VN-------SKSGDKDNKNALYSLASYLKRNGLATQVSVIAKARVPIIKFVEPASQIHID 309
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ A +N+ G +S+ + +
Sbjct: 310 LSFERTNGVEAAKIIRGWLHDTPG-LRELVLIVKQFLHARRLNDVHIGGLGGFSI-ICLA 367
Query: 190 FHFQTCVPAILPPLKDIYP 208
+ F P I+ +DI P
Sbjct: 368 YSFLKLHPRII--CRDIEP 384
>gi|134117055|ref|XP_772754.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255372|gb|EAL18107.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 779
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 27/194 (13%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P RE++E R+ +I + + + A V PFGS+ + L+ GD+D+ +
Sbjct: 154 VSPTREEFEVRLFMIELITRTINKL--WPEAEVTPFGSWQTQLYLPQGDIDLVVAHK--- 208
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH------------- 124
+S A K Q LL +L +A+RQ + +A ARVPI+KF T+
Sbjct: 209 YLSDANK---QRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGKSHVSSLEYFSK 265
Query: 125 ----QNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
I+ DIS++ G K + ++ + G R ++L+VK + +N TG
Sbjct: 266 QEGIGKINVDISLNQANGVTAGKIINQYLDALPG-ARQLILIVKYFLSQRSMNEVYTGGL 324
Query: 180 NSYSLSLLVLFHFQ 193
SYS+ +V+ Q
Sbjct: 325 GSYSVICMVISFLQ 338
>gi|444732632|gb|ELW72916.1| Terminal uridylyltransferase 7 [Tupaia chinensis]
Length = 1409
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 58/107 (54%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G R + + A+VPI+KF + + DIS+ N ++ L S ID R + + +
Sbjct: 1024 GLRNILPITTAKVPIVKFFHLRSGLEVDISLYNTLALHNTRLLSAYSAIDPRVKYLCYTM 1083
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
K + K DI + G+ +SY+ +L+VL+ Q P ++P L++IY G
Sbjct: 1084 KVFTKMCDIGDASRGSLSSYAYTLMVLYFLQQRNPPVIPVLQEIYKG 1130
>gi|302834665|ref|XP_002948895.1| hypothetical protein VOLCADRAFT_104084 [Volvox carteri f.
nagariensis]
gi|300266086|gb|EFJ50275.1| hypothetical protein VOLCADRAFT_104084 [Volvox carteri f.
nagariensis]
Length = 1052
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 98/205 (47%), Gaps = 23/205 (11%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESV-ESLRGATVEPFGSFVSNLFS 62
Y+ L ++D + P + RM+VI +R V V + R ++ FGSF + L +
Sbjct: 651 YSPLHYNIEDFCTKVVPTEGEKRQRMEVIDAIRAGVRKVWPNSRQVELQVFGSFANGLST 710
Query: 63 RWGDLDISI-------------ELSNGSCISSAGKKVKQSL-LGDLLRALRQKGGYRRLQ 108
DLD+ + EL++ + I++ +K+ +L + LRQ Q
Sbjct: 711 WSSDLDLVVTGVMEPDRVSGGYELADRAKITARLRKIADALNRAKNIDILRQ-------Q 763
Query: 109 FVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
+ AR+PILK T + D+S+ + G ++++ + + +VL+VK + KA
Sbjct: 764 LIPRARIPILKLWT-KSRVCVDVSVSDDSGPRAARYMVQQCRAFPPVKPLVLVVKTYLKA 822
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQ 193
+N TG +SYSL+ +V+ H Q
Sbjct: 823 CRLNEVNTGGLSSYSLTNMVIAHLQ 847
>gi|341876935|gb|EGT32870.1| CBN-GLD-2 protein [Caenorhabditis brenneri]
Length = 869
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 100/216 (46%), Gaps = 22/216 (10%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS ++ + D+D+ + ++N +K ++ +L+ + Q + Q + A
Sbjct: 345 GSSLNGFGNNSSDMDLCLMITNKDL----DQKNDAVVVLNLILSTLQYEKFVATQKLILA 400
Query: 114 RVPILK------FETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAK 167
+VPIL+ F+ I D++ +N + L + S D R R +V +VKEWAK
Sbjct: 401 KVPILRIKFAAPFDDITV----DLNANNSVAIRNTHLLCYYSSYDWRVRPLVSVVKEWAK 456
Query: 168 AHDINNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYPGNLVDDLKGVRANAERQI 226
IN+ +F SYSL L+V+ + Q PA +LP L+ YP N + VR
Sbjct: 457 RKGINDANKSSFTSYSLVLMVIHYLQCGTPAKVLPNLQQSYP-NRFSNRVDVRTLNVTMP 515
Query: 227 AEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
E +I S+K ++L L + FL+ ++
Sbjct: 516 LEAVPDDIDPILSEK------TTLGELLIGFLDYYA 545
>gi|391341255|ref|XP_003744946.1| PREDICTED: poly(A) RNA polymerase gld-2 homolog B-like [Metaseiulus
occidentalis]
Length = 546
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 115/257 (44%), Gaps = 46/257 (17%)
Query: 94 LLRALR-QKGG---YRRLQFVAHARVPILKF---ETIHQNISCDISIDNLCGQIKSKFLF 146
+LR LR Q G Y+R +A A P+L F + H +S +I++++ G + ++FL+
Sbjct: 279 ILRNLRTQMAGNKVYKRPIVIA-AICPLLTFTYERSRHSPMSVEINVNSQVGIVNTQFLY 337
Query: 147 WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
S++DGR +V K WA I + T +SYSL+L+V+ + Q ++P L ++
Sbjct: 338 AYSRMDGRVAPLVGACKRWATIMGIKDAHKSTLSSYSLTLMVINYLQQ--QEVVPVLHNL 395
Query: 207 YPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSL 266
P + G + R +Y N +L LF FL FS
Sbjct: 396 VPE--FESGLGFKGG-------------DRLILPRYASTNSQNLGTLFKGFLAYFSAFDF 440
Query: 267 KASELGIC-------PFTGQWEHIRSNTRWLPNNHP-------LFIEDPFEQPENSARAV 312
+ IC T Q E I + NNH + I++PF N+ARAV
Sbjct: 441 E----NICISVREGRTLTKQ-EGIDQDPDIPHNNHSSRADWKFINIQEPFNL-TNTARAV 494
Query: 313 SE-KNLAKISNAFEMTH 328
+ K K+ + F+++H
Sbjct: 495 YQPKAFRKVRDVFKISH 511
>gi|58260578|ref|XP_567699.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57229780|gb|AAW46182.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 779
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 27/194 (13%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P RE++E R+ +I + + + A V PFGS+ + L+ GD+D+ +
Sbjct: 154 VSPTREEFEVRLFMIELITRTINKL--WPEAEVTPFGSWQTQLYLPQGDIDLVVAHK--- 208
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH------------- 124
+S A K Q LL +L +A+RQ + +A ARVPI+KF T+
Sbjct: 209 YLSDANK---QRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGKSHVSSLEYFSK 265
Query: 125 ----QNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
I+ DIS++ G K + ++ + G R ++L+VK + +N TG
Sbjct: 266 QEGVGKINVDISLNQANGVTAGKIINQYLDALPG-ARQLILIVKYFLSQRSMNEVYTGGL 324
Query: 180 NSYSLSLLVLFHFQ 193
SYS+ +V+ Q
Sbjct: 325 GSYSVICMVISFLQ 338
>gi|268637610|ref|XP_002649102.1| PAP/25A-associated domain-containing protein [Dictyostelium
discoideum AX4]
gi|256012839|gb|EEU04050.1| PAP/25A-associated domain-containing protein [Dictyostelium
discoideum AX4]
Length = 938
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 88/177 (49%), Gaps = 5/177 (2%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE-LSNG 76
L P + R K+I DL +V+S + A V FGSF SNLF D+DI I ++N
Sbjct: 457 LEPSELESRIRQKIIRDLDAIVKS--NWPKANVVVFGSFSSNLFIPSSDIDIQISGINNA 514
Query: 77 SCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNL 136
++ + + L ++R + + ++ + A+VPI+K + H + + DI D
Sbjct: 515 ESVNKYNQNPIRDLFDIIIR--NHQDSFINVRNIFGAKVPIIKMTSSHSHYNIDICFDTP 572
Query: 137 CGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G + + + + R ++L++K + +++N TG SY+L+L+V+ Q
Sbjct: 573 NGIENTAVVKGLLKQYKSMRTLLLIIKFFLHQNNLNETYTGGIGSYALALMVVSFIQ 629
>gi|294893686|ref|XP_002774596.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|294893688|ref|XP_002774597.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239879989|gb|EER06412.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239879990|gb|EER06413.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 1017
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 83/190 (43%), Gaps = 40/190 (21%)
Query: 53 FGSFVSNLFSRWGDLDISIEL-----------------SNGSC------------ISSAG 83
+GS V+ + D+D+++EL +G C ++
Sbjct: 305 YGSLVNGFPTAHSDIDVAVELRDDVKEELLSKQLDADGEDGGCSDKEENENNQEVLTEKA 364
Query: 84 KKVKQSLLG-DLLRALRQKGGYRRLQFVAHARVPILKF-------ETIHQNISCDISIDN 135
K K ++ +LL K GY + V ARVPIL + + + +IS D+
Sbjct: 365 KDRKATIAAIELLGEEFDKRGYA-VNEVVTARVPILLLVKEVTGPDGEKEKVEFNISFDH 423
Query: 136 LCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTC 195
S+ L S + R +V+LVK WAK D+N+ GT +SYS +LLV+F Q
Sbjct: 424 EITLYNSRLLRCYSMLRPEVRTLVVLVKHWAKTRDVNDACNGTLSSYSYALLVIFFLQQ- 482
Query: 196 VPAILPPLKD 205
ILP L+D
Sbjct: 483 -KGILPSLQD 491
>gi|358056534|dbj|GAA97503.1| hypothetical protein E5Q_04181 [Mixia osmundae IAM 14324]
Length = 734
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 82/168 (48%), Gaps = 18/168 (10%)
Query: 49 TVEPFGSFVSNLFSRWGDLDISIELSN------GSCISSAGKKVKQSLLGDLLRALRQKG 102
T + FGS ++ L + DLD++I +N + G L D+LR +
Sbjct: 413 TAQAFGSHLTGLDNEHSDLDLTILDANFPFGVGPEQLKKTGPIYNLRKLEDILR----RR 468
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +++ V H VPI+KFE I D++++ G SK + +I R + + V
Sbjct: 469 GAQKVTVVRHTLVPIIKFEM--DGIKVDLNVNERLGIYNSKLIAEYCRISPIMRPLCVFV 526
Query: 163 KEWAKAHDINNPK----TGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
K W+K ++N+P +F+SY+L LLV+ + Q LP L+D+
Sbjct: 527 KFWSKRRELNDPAGQAGKKSFSSYALILLVIAYLQQL--GELPNLQDM 572
>gi|297792777|ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp.
lyrata]
gi|297310108|gb|EFH40532.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp.
lyrata]
Length = 530
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 86/326 (26%), Positives = 131/326 (40%), Gaps = 75/326 (23%)
Query: 39 VESVESL-----RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGD 93
VESV S+ VE FGS+ + L+ D+D+ I +G Q L
Sbjct: 146 VESVSSVITYIWPSCKVEVFGSYKTGLYLPTSDIDV--------VILESGLTNPQLGLRA 197
Query: 94 LLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDG 153
L RAL Q+G + L +A ARVPI+KF NI+ D+S D G ++F+
Sbjct: 198 LSRALSQRGIAKNLVVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQDAVSKLP 257
Query: 154 RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
R + L++K + + ++N +G SY+L
Sbjct: 258 PLRPLCLILKVFLQQRELNEVYSGGIGSYAL----------------------------- 288
Query: 214 DLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH----LFVSFLEKFSGLSLKAS 269
+A + AF KY K RS+ H L V F + F G L +
Sbjct: 289 ------------LAMLIAFL-------KYLKDGRSAPEHNLGVLLVKFFD-FYGRKLNTA 328
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
++G+ TG + + +L P + IEDP + PEN S N +I +AF M
Sbjct: 329 DVGVSCKTGGSFFSKYDKGFLNRARPGLISIEDP-QTPENDI-GKSSFNYFQIRSAFAMA 386
Query: 328 HFRLTSTNQT-----RYALLSSLARP 348
LT+T ++L ++ RP
Sbjct: 387 LSTLTNTKAILSLGPNRSILGTIIRP 412
>gi|310799736|gb|EFQ34629.1| hypothetical protein GLRG_09773 [Glomerella graminicola M1.001]
Length = 756
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/191 (24%), Positives = 85/191 (44%), Gaps = 19/191 (9%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D ++ P + E R +++ LR +++ + V PFGS++S L+ D+D+ +
Sbjct: 434 DFYEVVRPRDFEHEMRTQLVERLRRSLKTSHFYKDCDVRPFGSYMSGLYLPTADMDLVV- 492
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGG---------YRRLQFVAHARVPILKFETI 123
C S + G ++ALR G Y ++F+A A+VP++K+
Sbjct: 493 -----CARSWLDGAHSNFFG--MKALRNFGKFLAQNKVTHYNTMEFIASAKVPLVKYIDN 545
Query: 124 HQNISCDISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSY 182
+ DIS D L G Q F W Q +V ++K + +N P G S+
Sbjct: 546 ITGLRVDISFDRLDGPQAVKTFAEWKEQYPA-MPILVTMIKHFLAMRGLNEPVNGGIGSF 604
Query: 183 SLSLLVLFHFQ 193
+++ +V+ Q
Sbjct: 605 TVTCMVVSMLQ 615
>gi|156095663|ref|XP_001613866.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802740|gb|EDL44139.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 377
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
+ A VPI K NI CDISI+N + +K + + D R + ++K WAK
Sbjct: 133 IIKASVPIAKIYREQNNI-CDISINNTVAIVNTKLVSSLCNTDERVTIINRVIKYWAKQK 191
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+INN GTF+SY+L LL + Q +LPP K I
Sbjct: 192 NINNRSQGTFSSYALFLLTYYFLQNLETPLLPPYKSI 228
>gi|321263807|ref|XP_003196621.1| hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
gi|317463098|gb|ADV24834.1| Hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
Length = 784
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 27/194 (13%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P RE++E R+ +I + + + A V PFGS+ + L+ GD+D+ +
Sbjct: 154 VSPTREEFEVRLFMIELITRTINKL--WPEAEVTPFGSWQTQLYLPQGDIDLVVAHK--- 208
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH------------- 124
+S A K Q LL +L +A+RQ + +A ARVPI+KF T+
Sbjct: 209 YLSDANK---QRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGKSHVFSLAYLTK 265
Query: 125 ----QNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
I+ DIS++ G K + ++ + G R ++L+VK + +N TG
Sbjct: 266 QEGIGKINVDISLNQGNGVTAGKIINQYLDALPG-ARQLILIVKYFLSQRSMNEVYTGGL 324
Query: 180 NSYSLSLLVLFHFQ 193
SYS+ +V+ Q
Sbjct: 325 GSYSVICMVISFLQ 338
>gi|294936225|ref|XP_002781666.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239892588|gb|EER13461.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 882
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 101/224 (45%), Gaps = 26/224 (11%)
Query: 3 SYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFS 62
S +L+ L +G + ED ++ + V S + AT+E FGS S L
Sbjct: 88 SGELLDSELSSFVGRVALTEEDHHVHNHILGSILSVARSHLDIE-ATIETFGSAASGLSE 146
Query: 63 RWGDLDISIELSNGSCISSAGKKV------KQSL-------LGDLLRALRQKG---GYRR 106
+ D+D +I C SA KK ++SL LG + ++ G R
Sbjct: 147 KSSDIDATI-----ICRFSALKKRFAAAVDEKSLCSAAVLGLGKAISKFEKEAPGVGLRV 201
Query: 107 LQFVAHARVPILKFETIHQNISC---DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
+Q + A+VPI+ I N + D+SI+N + L ++D R + + L VK
Sbjct: 202 VQVIPSAKVPIVVLSWIGPNGNVQIVDVSINNQLPLHNTALLRNYVEMDKRVQILALCVK 261
Query: 164 EWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLKDI 206
WAK I++ K G +SYS +LL ++ Q A+LP L+ +
Sbjct: 262 RWAKLCGISDAKQGNLSSYSWTLLCIYFLQVRSKGAVLPSLQSM 305
>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
commune H4-8]
Length = 671
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 86/181 (47%), Gaps = 13/181 (7%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P + E R ++ + +++ + A V PFGS+ + L+ GD+D+ ++ +
Sbjct: 170 ISPTPAEDEVRSMIVLLIARIIQ--DKFPDAEVRPFGSYGTKLYLPHGDIDLVVQ---SN 224
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI--HQNISCDISIDN 135
+ KK L DL+R+ R G ++Q + ARVPI+KF T + DIS++
Sbjct: 225 TLEQNNKKTVLQRLADLIRSARLSSG--KVQVIG-ARVPIIKFITAAEYGRFQIDISVNQ 281
Query: 136 LCGQIKSKFLFWIS---QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
G + S + Q R +VL++K + +N TG SYS+ LVL
Sbjct: 282 FSGLVSSDIINGFQRGMQCPIAIRSLVLILKLYLSQRGMNEVYTGGLGSYSIVCLVLSFL 341
Query: 193 Q 193
Q
Sbjct: 342 Q 342
>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum
NRRL Y-27907]
Length = 615
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 94/190 (49%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P ++ TR V++ L+ + + G FGS ++L+ D+D+
Sbjct: 188 IKDFVDYVSPSSDEIVTRNTVVNRLKTQI--AKFWPGTEAHVFGSCATDLYLPGSDIDMV 245
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ IS G +S L L LR K + ++ +A+A+VPI+KF I D
Sbjct: 246 V-------ISETGDYENRSRLYQLSSFLRSKKLAKNVEVIANAKVPIIKFVDPESEIHID 298
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL+VK++ ++ +NN G Y+ ++++
Sbjct: 299 VSFERTNGIDAAKRIRKWLITTPG-LRELVLIVKQFLRSRRLNNVHVGGLGGYA-TIIMC 356
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 357 YHFLRLHPKV 366
>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
Length = 603
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P E+ TR VIS L++ + G FGS ++L+ D+D+
Sbjct: 173 MKDFVNYISPSSEEIVTRNNVISTLKKEIGKF--WPGTETHVFGSCATDLYLPGSDIDMV 230
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ +S G +S L L LR K + ++ +A A+VPI+KF + D
Sbjct: 231 V-------VSETGDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHID 283
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ ++ +NN G Y+ ++++
Sbjct: 284 VSFERTNGLDAAKRIRRWLISTPG-LRELVLVIKQFLRSRRLNNVHVGGLGGYA-TIIMC 341
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 342 YHFLRLHPKL 351
>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
Length = 730
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/177 (29%), Positives = 90/177 (50%), Gaps = 10/177 (5%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
+ P + ETR VI + ++S R A V PFGS + L+ GDLD+ + +SN
Sbjct: 110 MTPTAAEHETRCMVIELISRAIKS--QFRDAEVYPFGSQETKLYLPQGDLDLVV-VSN-- 164
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
S A +V QS L + LR+ +Q +A A+VPI+KF T + + DIS+++
Sbjct: 165 --SMANLRV-QSALRTMAACLRRHNLATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTN 221
Query: 138 GQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G + ++ W+ + R ++L+VK ++ +G SYS+ ++V+ Q
Sbjct: 222 GLTTASYVNSWLRKWP-HIRPLILVVKYLLMQRGMSEVFSGGLGSYSVIIMVISFLQ 277
>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 603
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 93/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P E+ TR VIS L++ + G FGS ++L+ D+D+
Sbjct: 173 MKDFVNYISPSSEEIVTRNNVISTLKKEIGKF--WPGTETHVFGSCATDLYLPGSDIDMV 230
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ +S G +S L L LR K + ++ +A A+VPI+KF + D
Sbjct: 231 V-------VSETGDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPVSELHID 283
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ ++ +NN G Y+ ++++
Sbjct: 284 VSFERTNGLDAAKRIRRWLISTPG-LRELVLVIKQFLRSRRLNNVHVGGLGGYA-TIIMC 341
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 342 YHFLRLHPKL 351
>gi|359493669|ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing protein 5-like [Vitis
vinifera]
gi|302143015|emb|CBI20310.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 56/198 (28%), Positives = 93/198 (46%), Gaps = 16/198 (8%)
Query: 2 GSYNVLEPILK------DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGS 55
G+ + P+LK D L+P ++ R I + V+ + VE FGS
Sbjct: 94 GNSRLRSPMLKLHKEILDFSDFLSPTPKEQSARNAAIESVFNVIRYI--WPNCKVEVFGS 151
Query: 56 FVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARV 115
F + L+ D+D+ I GS I K Q L L RAL QKG +++Q +A ARV
Sbjct: 152 FKTGLYLPTSDIDVVIL---GSDI-----KTPQIGLYALSRALSQKGIAKKIQVIAKARV 203
Query: 116 PILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK 175
PI+KF +++ DIS D G ++++ R + L++K + + ++N
Sbjct: 204 PIIKFIEKRSSVAFDISFDVENGPKAAEYIQDAISKWPPLRPLCLILKVFLQQRELNEVY 263
Query: 176 TGTFNSYSLSLLVLFHFQ 193
+G SY+L +++ Q
Sbjct: 264 SGGIGSYALLAMLIAMLQ 281
>gi|183234091|ref|XP_651039.2| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|169801263|gb|EAL45653.2| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449709777|gb|EMD48978.1| poly(A) RNA polymerase cid11, putative [Entamoeba histolytica KU27]
Length = 344
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 69/149 (46%), Gaps = 8/149 (5%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+GS L + GDLDI C S +G++V +L + + + +
Sbjct: 84 YGSTHYGLCLKDGDLDIC-------CTSQSGRQVNPIILESFADCFK-RSKFEIKNVIET 135
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+K + + D+S + QI S+F + + FR + +L+K W K ++N
Sbjct: 136 AKVPIIKMVDLETKVKIDLSFNQPVAQIHSEFFYTMITCVKHFRIVAVLLKYWLKIRNLN 195
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
P G +S +L ++ +F T P + P
Sbjct: 196 CPFKGGLSSAALCFMICHYFTTFDPPLFP 224
>gi|9759071|dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana]
Length = 533
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 86/326 (26%), Positives = 127/326 (38%), Gaps = 72/326 (22%)
Query: 39 VESVESL-----RGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGD 93
VESV S+ VE FGS+ + L+ D+D+ I +G Q L
Sbjct: 146 VESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDV--------VILESGLTNPQLGLRA 197
Query: 94 LLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDG 153
L RAL Q+G + L +A ARVPI+KF NI+ D+S D G ++F+
Sbjct: 198 LSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQDAVSKLP 257
Query: 154 RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVD 213
R + L++K + + ++N +G SY+L
Sbjct: 258 PLRPLCLILKVFLQQRELNEVYSGGIGSYAL----------------------------- 288
Query: 214 DLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH----LFVSFLEKFSGLSLKAS 269
A IA Y K RS+ H L V F + F G L +
Sbjct: 289 ----------------LAMLIAFLKVQVYLKDGRSAPEHNLGVLLVKFFD-FYGRKLNTA 331
Query: 270 ELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDPFEQPENSARAVSEKNLAKISNAFEMT 327
++GI G + N +L P + IEDP + PEN S N +I +AF M
Sbjct: 332 DVGISCKMGGSFFSKYNKGFLNRARPSLISIEDP-QTPENDI-GKSSFNYFQIRSAFAMA 389
Query: 328 HFRLTSTNQT-----RYALLSSLARP 348
LT+T ++L ++ RP
Sbjct: 390 LSTLTNTKAILSLGPNRSILGTIIRP 415
>gi|345495646|ref|XP_001605838.2| PREDICTED: terminal uridylyltransferase 7-like [Nasonia
vitripennis]
Length = 338
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 138/310 (44%), Gaps = 45/310 (14%)
Query: 30 KVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQS 89
K++ + VV+S L A + FGS +S+L + DLDI ++ N + + ++
Sbjct: 36 KLLESVEAVVKSKYPLAKAYL--FGSRISSLGFKDSDLDIFLDCENQYVKPKSMVESQEQ 93
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS 149
LL + + + ++ + RVPI+K + N++CDIS N G KSK L +
Sbjct: 94 LLTVQDCFHKHQDIWVIMEVIVRTRVPIIKLKHRSTNLNCDISFINGLGVEKSKILGYYV 153
Query: 150 QIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
R ++L +K+W ++ + T +Y++S L +F+ Q + ILP ++ +
Sbjct: 154 DACTPCRKLILFLKKWNLLCRLSGSRAIT--TYAISWLAIFYLQ--IKEILPSVQSLIK- 208
Query: 210 NLVDDLKGVRANAERQIA-EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKA 268
+ + + A E ++ EI A NI SD L F ++ F+ +S A
Sbjct: 209 --LQNRSNIVAGWETGVSKEISAKNIDFSISD---------LLKGFFTYYADFNYISDVA 257
Query: 269 SELGICPFTG------QWEHI---------------RSNTRWLPNNHPLFIEDPFEQPEN 307
CPF G Q+ +I + N + + P+ I+DP + +N
Sbjct: 258 -----CPFLGKVMKKSQFSNIDDLPEEMSIYKSQIKKENVEFFRLDSPMCIQDPIDLSQN 312
Query: 308 SARAVSEKNL 317
+AV++ L
Sbjct: 313 ITKAVTKLQL 322
>gi|189206852|ref|XP_001939760.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187975853|gb|EDU42479.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1240
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 74/333 (22%), Positives = 133/333 (39%), Gaps = 81/333 (24%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P +ED + R + + ++ ++E+ V FGS + L++ D+DI
Sbjct: 265 MRELYDRLEPKQEDTDNRERFVRKVQRILETEFPSTKIMVHVFGSSGNMLWTSESDVDI- 323
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
CI + K++++ + L AL K G +R+ + A+V I+K +SCD
Sbjct: 324 -------CIQTPMKRLEE--MHPLAEAL-DKHGMQRVVCIPAAKVRIVKVWDPELQLSCD 373
Query: 131 ISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLF 190
I+++N+ ++ + Q+D R R GT +SY+ L+L
Sbjct: 374 INVNNVAAIENTRMIKTYIQLDDRVR------------------IGGTISSYTWICLILN 415
Query: 191 HFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSD-----KYRKI 245
QT P +LP L ++ P D+ G +++ F+ D Y K
Sbjct: 416 FLQTRDPPVLPNLHEL-PDRARDETTG-------------QPSLSSFADDVGKLRGYGKD 461
Query: 246 NRSSLAHLFVSFLEKFS--------------GLSLKASELGICPFTGQWEHIRSNTRWLP 291
N+ SL L F + G + E G P GQ E +
Sbjct: 462 NKESLGQLLFHFFRLYGHEIDYEKEAISVRQGKRILREEKGWHPGGGQKEGVNR------ 515
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
L +E+PF +E+NL ++ +
Sbjct: 516 ----LCVEEPFN---------TERNLGNSADDY 535
>gi|432954902|ref|XP_004085587.1| PREDICTED: poly(A) RNA polymerase, mitochondrial-like [Oryzias
latipes]
Length = 241
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 86/183 (46%), Gaps = 25/183 (13%)
Query: 142 SKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNS-YSLSLLVLFHFQTCVPAIL 200
++ L+ ++D R R +V V+ WA+AH I + G + S +SL+++VLF Q P I+
Sbjct: 4 TELLYLYGELDPRVRRLVFTVRCWARAHGITSSIPGAWISNFSLTVMVLFFLQKRNPPII 63
Query: 201 PPLKDIYPGNLVDDLKGVRANAERQIAE--ICAFNIARFSSDKYRKINRSSLAHLFVSFL 258
P L D L+ + A++ + E C F ++ F+ + R+ N +L HL F
Sbjct: 64 PTL---------DQLRDLAGPADKSVIEGNDCTF-VSDFTKIQLRR-NTEALEHLLYEFF 112
Query: 259 EKFSGLSLKASELGICPFTGQWEHIRSNTRW-LPNNHPLFIEDPFEQPENSARAVSEKNL 317
E ++ PF+ IR P PL I++PFE N ++ V+ L
Sbjct: 113 EFYATF----------PFSRMSVDIRKGKEQNKPEVAPLHIQNPFETALNVSKNVNATQL 162
Query: 318 AKI 320
+
Sbjct: 163 ERF 165
>gi|320589406|gb|EFX01867.1| pap 25a associated domain family protein protein [Grosmannia
clavigera kw1407]
Length = 1249
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 90/195 (46%), Gaps = 14/195 (7%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+ ++ L P E E R + + L + V FGS + L S D+DI
Sbjct: 174 MGELYARLRPSDEKQEHRERFVQKLETIFNEEWPGHDIRVHIFGSSGNRLCSDDSDVDI- 232
Query: 71 IELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ K+++ ++ +LL K G ++ V+ A+VPI+K +++C
Sbjct: 233 -------CITTKWKELENVCMIAELL----AKRGMEKVVCVSSAKVPIVKIWDPELSLAC 281
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSLSLLV 188
D++++N ++ + ID R R + +++K W + IN+ GT +SY+ +V
Sbjct: 282 DMNVNNTLALENTRMVLTYVGIDERVRPLAMIIKYWTRQRIINDAAFGGTLSSYTWICMV 341
Query: 189 LFHFQTCVPAILPPL 203
+ Q ILP L
Sbjct: 342 ICFLQLRKVPILPSL 356
>gi|189211747|ref|XP_001942202.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187979401|gb|EDU46027.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 636
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 89/239 (37%), Gaps = 44/239 (18%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ +N G + L S D R R MVL VK WAK +N+ +G
Sbjct: 382 LDFPKTGCGIQCDINFENPLGIHNTHMLKCYSLTDPRVRPMVLFVKSWAKRRKVNSAYSG 441
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
T +SY L+VL + P + P +L+ + E I + R
Sbjct: 442 TLSSYGWVLMVLHYLVNIAYPPVCP------------NLQLIAKKPEHTTTRIISGYQVR 489
Query: 237 FSSDKYRKI----------NRSSLAHLFVSFLEKFSG------------------LSLKA 268
F ++ I N+ SL L F + ++ LSL+
Sbjct: 490 FWRHEHEIIHSAQTGQLTENKESLGSLLRGFFQYYASLSGYNYPRPPQFHWTNEVLSLRT 549
Query: 269 SELGICPFTGQWEHIRSNT---RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ W + + + N + IEDPFE N AR V+ + + I + F
Sbjct: 550 PGGILTKQAKGWVSATTKITAEKKVTNRYLFAIEDPFEVDHNVARTVTHRGIVTIRDEF 608
>gi|302420415|ref|XP_003008038.1| Poly(A) RNA polymerase cid11 [Verticillium albo-atrum VaMs.102]
gi|261353689|gb|EEY16117.1| Poly(A) RNA polymerase cid11 [Verticillium albo-atrum VaMs.102]
Length = 1162
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/302 (21%), Positives = 130/302 (43%), Gaps = 46/302 (15%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
++++ L P + E R K+++ L+++ V FGS + L S D+DI
Sbjct: 135 MRELYDRLLPTPKVEENRQKLVAKLQKIFNDEWPGHDIRVHLFGSSGNLLCSDDSDVDI- 193
Query: 71 IELSNGSCISSAGKKVKQSL-LGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC 129
CI++ +++ + +LL K G ++ ++ A+VPI+K ++C
Sbjct: 194 -------CITTTWAELEGVCKIAELL----HKKGMEKVVCISAAKVPIVKIWDPELGLAC 242
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG-TFNSYSLSLLV 188
D++++N ++ + + D R R + +++K W + +N+ G T +SY+ L+
Sbjct: 243 DMNVNNTSALENTRMVRTYVETDPRVRPLAMIIKYWTRRRIVNDAAFGSTLSSYTWICLI 302
Query: 189 LFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS 248
+ Q P +LP L K +R + + A +I R + N+S
Sbjct: 303 IAFLQLRDPPVLPALHQN---------KAMRLSKKGGPESTFADDIDRLKG--FGDKNKS 351
Query: 249 SLAHLFVSFLEKFSG--------LSLKASELGICPFTGQWEHIRSNTRWLPNNHPLFIED 300
+L L F ++ LS++ + + +RS +W N L +E+
Sbjct: 352 TLGELLFQFFRYYAHEFDYDKHVLSVRQGK----------KLVRSEKKWANN---LCVEE 398
Query: 301 PF 302
PF
Sbjct: 399 PF 400
>gi|195158112|ref|XP_002019938.1| GL12677 [Drosophila persimilis]
gi|194116529|gb|EDW38572.1| GL12677 [Drosophila persimilis]
Length = 541
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 74/339 (21%), Positives = 140/339 (41%), Gaps = 78/339 (23%)
Query: 8 EPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDL 67
+ I +D++G+L P+ +W R + FGS +S + +R DL
Sbjct: 227 DAIEEDLIGVLTPVFPNWAMR---------------------IYKFGSRISGIGTRCSDL 265
Query: 68 DISIELSNGSCISSAGKKVKQSLLGDLLRALR----QKGGYRRLQFVAHARVPILKFETI 123
D+ +++ N I + K++L LRA+R +R + + ARVPI+K +
Sbjct: 266 DVFVDIGNTFDIFE-HRASKETLAK--LRAMRPAFCASNKWRIINVIEQARVPIIKVSHL 322
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
I CDI +++L G + L +I I + M + K W + K ++YS
Sbjct: 323 TTGIECDICLNSL-GFCNTNLLKYIFDIQPLAQYMCIYAKNW-----LERCKQTDISTYS 376
Query: 184 LSLLVLFHFQTCVPAILPPLKDI--------YPGNLVDDLKGVRANAERQIAEICAFNIA 235
++L+V++ Q + +LP + + + G + + G ++ + ++ E
Sbjct: 377 ITLMVIYFMQ--LHGLLPSVFALQHEQPFNQFVGPWIVNF-GQKSLQDLRLPE------- 426
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE-HIRSNTRWLPNNH 294
+D R+ L F +F KF +CP+ G E I+ + +PN +
Sbjct: 427 ---ADTDAPAVRNILGQFF-AFYSKFD-----YERFLVCPYFGSAEVQIQHVEKLMPNRY 477
Query: 295 ----------------PLFIEDPFEQPENSARAVSEKNL 317
P+ ++DP + N +AV+ L
Sbjct: 478 SKYTRENPECTLQLRKPMVVQDPIQLNHNVTKAVTRSAL 516
>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
Length = 689
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 87/177 (49%), Gaps = 10/177 (5%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
+ P + ETR VI + ++S R A V PFGS + L+ GDLD+ + + +
Sbjct: 109 MAPTAAEHETRCMVIELISRAIKS--QFRDAEVHPFGSQETKLYLPQGDLDLVVVSRSMA 166
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
+ + QS L + LR+ +Q +A A+VPI+KF T + + DIS+++
Sbjct: 167 NLRT------QSALRTMAACLRRHNLATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTN 220
Query: 138 GQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G + F+ W+ + R ++++VK ++ +G SYS+ ++V+ Q
Sbjct: 221 GLTTASFVNSWLRKWP-HIRPLIIVVKHLLMQRGMSEVFSGGLGSYSIIIMVISFLQ 276
>gi|405123317|gb|AFR98082.1| DNA polymerase sigma [Cryptococcus neoformans var. grubii H99]
Length = 649
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 54/194 (27%), Positives = 92/194 (47%), Gaps = 27/194 (13%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P RE++E R+ +I + + + A V PFGS+ + L+ GD+D+ +
Sbjct: 19 VSPTREEFEVRLFMIELITRTINKL--WPEAEVTPFGSWQTQLYLPQGDIDLVVAHK--- 73
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH------------- 124
+S A K Q LL +L +A+RQ + +A ARVPI+KF T+
Sbjct: 74 YLSDANK---QRLLAELGKAMRQANITDVVAIIARARVPIIKFVTLEGESHVTSLADSSK 130
Query: 125 ----QNISCDISIDNLCGQIKSKFLF-WISQIDGRFRDMVLLVKEWAKAHDINNPKTGTF 179
I+ DIS++ G K + ++ + G R ++L+VK + +N TG
Sbjct: 131 QGAIGKINVDISLNQGNGVTAGKIINQYLDALPG-ARQLILIVKYFLSQRSMNEVYTGGL 189
Query: 180 NSYSLSLLVLFHFQ 193
SYS+ +V+ Q
Sbjct: 190 GSYSVICMVISFLQ 203
>gi|299752783|ref|XP_002911796.1| Trf5 [Coprinopsis cinerea okayama7#130]
gi|298409998|gb|EFI28302.1| Trf5 [Coprinopsis cinerea okayama7#130]
Length = 816
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 94/202 (46%), Gaps = 30/202 (14%)
Query: 5 NVLEPILKDI---LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLF 61
NV E + K++ + ++P + E R ++ + V+S A+V PFGS+ + L+
Sbjct: 273 NVAEMMHKEVEAFVKWISPTPVEDEIRGLIVKQIAVTVQS--KFPDASVLPFGSYETKLY 330
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
GD+D+ I LS S+ K S+L L L++ G R+ +A ARVPI+KF
Sbjct: 331 LPMGDIDLVI-LSESMAYSN-----KVSVLHTLANTLKRAGITSRVTVIAKARVPIVKFV 384
Query: 122 TIHQNISCDISIDN----LCGQIKSKFLFWIS-------QIDGR--------FRDMVLLV 162
T H + DISI+ + G I + FL + + D R +VL+
Sbjct: 385 TTHGRFNVDISINQENGLVSGNIINGFLRHLHNPTSNTPEFDANGNPKTSLALRSLVLIT 444
Query: 163 KEWAKAHDINNPKTGTFNSYSL 184
K + +N TG SYS+
Sbjct: 445 KAFLAQRSMNEVYTGGLGSYSI 466
>gi|241948905|ref|XP_002417175.1| topoisomerase 1-related protein TRF4, putative [Candida
dubliniensis CD36]
gi|223640513|emb|CAX44767.1| topoisomerase 1-related protein TRF4, putative [Candida
dubliniensis CD36]
Length = 606
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 92/190 (48%), Gaps = 12/190 (6%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P E+ TR VIS L+ + G FGS ++L+ D+D+
Sbjct: 176 MKDFVNYISPSSEEIVTRNNVISTLKTEIGMF--WPGTETHVFGSCATDLYLPGSDIDMV 233
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ +S G +S L L LR K + ++ +A A+VPI+KF + D
Sbjct: 234 V-------VSETGDYENRSRLYQLSTFLRTKKLAKNVEVIASAKVPIIKFVDPISELHID 286
Query: 131 ISIDNLCGQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
+S + G +K + W+ G R++VL++K++ ++ +NN G Y+ ++++
Sbjct: 287 VSFERTNGLDAAKRIRRWLISTPG-LRELVLVIKQFLRSRRLNNVHVGGLGGYA-TIIMC 344
Query: 190 FHFQTCVPAI 199
+HF P +
Sbjct: 345 YHFLRLHPKL 354
>gi|402220735|gb|EJU00806.1| Nucleotidyltransferase, partial [Dacryopinax sp. DJM-731 SS1]
Length = 266
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 87/180 (48%), Gaps = 13/180 (7%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
+ P E+ R+ VI +R + A V FGS + L+ GD+D+ +
Sbjct: 12 IRPRDEEHSVRLMVIECIRSSI--TRKWPSARVLAFGSQETQLYFPNGDIDLVVHYDG-- 67
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
IS K S L ++ L+Q RR+ + ARVPI+KF T + + DIS++
Sbjct: 68 -ISVERKDQIVSFLSEISCLLQQAKVSRRVNLIGKARVPIIKFVTELGHFAVDISVNQTN 126
Query: 138 G----QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
G + ++FL+++ + R +V+++K + +N P +G F SY++ +V+ Q
Sbjct: 127 GLRAVTVVNRFLWYLPAV----RPLVMVIKAFLLQRGLNEPYSGGFGSYTVICMVVSFLQ 182
>gi|254573058|ref|XP_002493638.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
[Komagataella pastoris GS115]
gi|238033437|emb|CAY71459.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p)
[Komagataella pastoris GS115]
gi|328354535|emb|CCA40932.1| DNA polymerase sigma subunit [Komagataella pastoris CBS 7435]
Length = 601
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 84/179 (46%), Gaps = 10/179 (5%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDIS 70
+KD + ++P + E R + LR+ + + V FGSF ++L+ D+D+
Sbjct: 139 IKDFINYISPSIAEIEARNNAVKRLRKEI-TTNLWPDCYVNVFGSFATDLYLPGSDIDMV 197
Query: 71 IELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
I S +GK +S L L LR K ++ +A A+VPI+KF I D
Sbjct: 198 I-------TSDSGKYCAKSYLYQLSSFLRSKNLGVNIETIARAKVPIIKFIEPRSKIHID 250
Query: 131 ISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV 188
+S + G + + W+ + G R++VL+VK++ +NN G +S+ LV
Sbjct: 251 VSFEKTNGLRAAERIQGWLRETPG-LRELVLIVKQFLAVRRMNNVHHGGLGGFSIICLV 308
>gi|118359226|ref|XP_001012854.1| hypothetical protein TTHERM_00094000 [Tetrahymena thermophila]
gi|89294621|gb|EAR92609.1| hypothetical protein TTHERM_00094000 [Tetrahymena thermophila
SB210]
gi|152926621|gb|ABS32301.1| RNA-dependent RNA polymerase-associated nucleotidyltransferase 2
[Tetrahymena thermophila]
Length = 557
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 79/158 (50%), Gaps = 16/158 (10%)
Query: 45 LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRAL--RQKG 102
L+ V P+GS VS D+DISI N C ++V SL+ D ++ + K
Sbjct: 246 LQNVYVHPYGSVVSGFGQNDSDIDISI---NTDC--YLDERVFLSLIYDFMKNYLHKYKI 300
Query: 103 GYRRLQFVAHARVPILKFETIHQNI-------SCDISIDNLCGQIKSKFLFWISQIDGRF 155
Y++L+ AR+P++ N+ S DI I+NL G SK L ISQI
Sbjct: 301 KYQKLELKLDARIPLITLVKQKDNVNLKSNTVSIDICINNLLGCANSKMLKVISQIHPLV 360
Query: 156 RDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
+ + +++K WAK + + + K T +SY+ L+++ Q
Sbjct: 361 KQLGIIIKYWAKQNGLISKK--TLSSYAFILIMICFLQ 396
>gi|219127188|ref|XP_002183822.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404545|gb|EEC44491.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1336
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 76/174 (43%), Gaps = 23/174 (13%)
Query: 47 GATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG-GYR 105
G V FGS + S DLD+ ++L GS ++ + L L K
Sbjct: 955 GTKVVIFGSSANGFGSPKSDLDMCLQLPEGSRLNHEAGGEAMAKLAQYLDTFGMKSVDTA 1014
Query: 106 RLQFVAHARVPILKFETIHQN---------ISCDISIDNLCGQIKSKFLFWISQIDGRFR 156
RL AR+PI+ F+ + I CD+S+ N + + L ++I R
Sbjct: 1015 RLT----ARIPIVMFQCPNPMSTGNGEDDLIECDLSMHNTLAVLNTALLRTYAEITPVTR 1070
Query: 157 DMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT--------CVPAILPP 202
+ ++K WAKA DINNP T +SY +++L HF + V A+ PP
Sbjct: 1071 VLAAIIKRWAKARDINNPARHTLSSYGY-IIMLLHFLSYHKRNGNGLVSAVAPP 1123
>gi|308805789|ref|XP_003080206.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases (ISS) [Ostreococcus tauri]
gi|116058666|emb|CAL54373.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases (ISS) [Ostreococcus tauri]
Length = 555
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/337 (22%), Positives = 129/337 (38%), Gaps = 57/337 (16%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L P + +R + +R+VV+ + A E GSF + ++ D+D
Sbjct: 133 DFSRFLEPTEAEASSRTAAVERVRDVVKGI--WPNARFEVHGSFATGMYLPGSDID---- 186
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ I +G K L L AL ++ ++Q +A ARVPI+KFE + DIS
Sbjct: 187 ----AVILDSGAKNPGVCLKALAIALARRDMAIKIQLIAKARVPIVKFEEVESGHQFDIS 242
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
D G ++ + + R + ++K + +N +G SY+L +V+ H
Sbjct: 243 FDVANGPASAEIVRENMRRFPALRPLTTVLKAFLAQRGLNEVYSGGIGSYALLCMVMAHL 302
Query: 193 Q----TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRS 248
Q TC K + G ++ +E C
Sbjct: 303 QLHNTTC--------KSTWAG----------SHGASDASEGC------------------ 326
Query: 249 SLAHLFVSFLEKFSGLSLKASELGI-CPFTGQWEHIRSNTRWLPNNHPLF--IEDPFEQP 305
L L + F E F G L A E+GI C G + + N P IEDP ++
Sbjct: 327 -LGTLLIDFFELF-GRRLNAEEVGISCGGKGPGFFKKRDRGMFEENRPFLWAIEDPQDET 384
Query: 306 ENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALL 342
+ R + ++ +AF+ +T+ ++ L
Sbjct: 385 NDLGR--NSYACRQVKSAFDHAFTVITAPVDSKNGFL 419
>gi|147800856|emb|CAN64475.1| hypothetical protein VITISV_017481 [Vitis vinifera]
Length = 493
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/198 (29%), Positives = 92/198 (46%), Gaps = 26/198 (13%)
Query: 2 GSYNVLEPILK------DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGS 55
G+ + P+LK D L+P E+ R I ESV VE FGS
Sbjct: 94 GNSRLRSPMLKLHKEILDFSDFLSPTPEEQSARNAAI-------ESV-----FNVEVFGS 141
Query: 56 FVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARV 115
F + L+ D+D+ I GS I K Q L L RAL QKG +++Q +A ARV
Sbjct: 142 FKTGLYLPTSDIDVVIL---GSDI-----KTPQIGLYALSRALSQKGIAKKIQVIAKARV 193
Query: 116 PILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK 175
PI+KF +++ DIS D G ++++ R + L++K + + ++N
Sbjct: 194 PIIKFVEKRSSVAFDISFDVENGPKAAEYIQDAISKWPPLRPLCLILKVFLQQRELNEVY 253
Query: 176 TGTFNSYSLSLLVLFHFQ 193
+G SY+L +++ Q
Sbjct: 254 SGGIGSYALLAMLIAMLQ 271
>gi|351699165|gb|EHB02084.1| U6 snRNA-specific terminal uridylyltransferase 1 [Heterocephalus
glaber]
Length = 833
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 105/251 (41%), Gaps = 42/251 (16%)
Query: 90 LLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDN--LCGQI--KSKFL 145
L+G +LR R G R+ V AR P++KF + D+S+ N L G S+FL
Sbjct: 319 LVGSILR--RCVPGVYRVHSVPSARRPVVKFCHRPSGLHGDVSLGNRYLGGLALHNSQFL 376
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD 205
+++DGR R +V V+ WA+ + ++Y+L+LLV++ QT P +LP +
Sbjct: 377 SLCAELDGRVRPLVYTVRCWAQGRGLAG-SGPLLSNYALTLLVVYFLQTRDPPVLPTVAQ 435
Query: 206 IYPGNLVDDLKGVRANAERQIAEI----CAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
+ R E + E+ C+F R + N L F
Sbjct: 436 L-----------TRKADEGERVEVDGWDCSF--PRDARSLEPSANVEPPGALLAQFFFCV 482
Query: 262 SGLSLKASELGI-----CPFTGQ-----WEHIRSNTRWLPNNHPLFIEDPFEQPENSARA 311
S L+ S L + P G WE +R P+ ++DPFE N A
Sbjct: 483 SCWDLRGSLLSLREGRALPVAGGQPAIFWEGLRLG--------PMNLQDPFELSHNVAAN 534
Query: 312 VSEKNLAKISN 322
V+ + ++ N
Sbjct: 535 VTSRVAGRLQN 545
>gi|313226618|emb|CBY21763.1| unnamed protein product [Oikopleura dioica]
gi|313246685|emb|CBY35564.1| unnamed protein product [Oikopleura dioica]
Length = 622
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/270 (25%), Positives = 124/270 (45%), Gaps = 32/270 (11%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS + ++ D+DI + + + + I + + ++ L +A+RQ G ++ + A
Sbjct: 79 GSTSNGFGTKNSDVDICLVIDHNTEIVNKTESMR--ALKACRKAMRQVGRFQDFSELIPA 136
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS----QI-DGRFRDMVLLVKEWAKA 168
+VPIL+ + + DI+ +NL G + L S QI D R + + + +K+ K
Sbjct: 137 KVPILRLNL--RGVQIDINCNNLTGLRNTWLLNAYSASGNQINDPRVKPLAMFIKKICKK 194
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG-NLVDDLKGVRANAERQIA 227
INN GT SYS++L+++ + QT P ILP L+ + N+ + L+ N R+I
Sbjct: 195 LTINNASEGTLTSYSINLMLINYLQTRSPPILPVLQVLDEEINISEGLE----NLPRRIR 250
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
++ +K+ N +++ L F + ++ I GQ
Sbjct: 251 QV---------PEKWEIKNTATVGQLAFGFFDYYNQFDFNQV---ISTRLGQPVKASDGR 298
Query: 288 RWLPNNH-----PLFIEDPFEQPENSARAV 312
P+N + IE+PF+ N+ARAV
Sbjct: 299 LMFPDNQLFTDKKMRIEEPFDGT-NTARAV 327
>gi|198436697|ref|XP_002130666.1| PREDICTED: similar to PAP associated domain containing 1 [Ciona
intestinalis]
Length = 778
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 90/187 (48%), Gaps = 25/187 (13%)
Query: 43 ESLRGATVEPFGSFVSNLFSRWGDLDISIELSN----------------GSCISSA---G 83
E+ G V+PFGS V+ DLD++ + S+ G+ SA
Sbjct: 193 EAFPGCEVQPFGSSVNGFGVHGCDLDLNFDYSSIHDDVMAGITQNMHETGTAEVSAEDMD 252
Query: 84 KKVKQSLLGDLLRALRQ-KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKS 142
+ K +L + ++Q +++ + +AR+P+++F + CDIS+ N +
Sbjct: 253 RSHKSGVLLAIAEIIKQCVPDCHKIKTILNARLPVVRFYHKTSGVRCDISLKNDLAIHNT 312
Query: 143 KFLFWISQIDGRFRDMVLLVKEWAKAHDI--NNPKTGT-FNSYSLSLLVLFHFQTCVPAI 199
++L SQ+ F +V L++ W K + N GT N+Y+++LLVLF+ Q C
Sbjct: 313 QYLQLCSQLTPNFSLLVFLIRAWMKHWKLAGNLQFNGTSLNNYAVTLLVLFYMQNC--NC 370
Query: 200 LPPLKDI 206
+P LKD+
Sbjct: 371 IPKLKDL 377
>gi|348665580|gb|EGZ05409.1| hypothetical protein PHYSODRAFT_533503 [Phytophthora sojae]
Length = 632
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 55/98 (56%), Gaps = 4/98 (4%)
Query: 109 FVAHARVPILKFETIHQNIS--CDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+HARVP+++F+ ++ CD+ DN G ++ L + D R RD+ L VK WA
Sbjct: 398 LASHARVPVIRFQYRQGDLDYKCDLCFDNELGLRNTRLLRAYASYDDRARDLGLAVKYWA 457
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLK 204
K I++ +G +SYS LL +++ Q + +LP L+
Sbjct: 458 KQRGISDTASGFLSSYSYVLLSIYYLQ--IVHVLPNLQ 493
>gi|330915205|ref|XP_003296937.1| hypothetical protein PTT_07185 [Pyrenophora teres f. teres 0-1]
gi|311330654|gb|EFQ94964.1| hypothetical protein PTT_07185 [Pyrenophora teres f. teres 0-1]
Length = 513
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 92/233 (39%), Gaps = 32/233 (13%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ +N G + L S D R R MVL VK WAK +N+ +G
Sbjct: 245 LDFPKTGCGIQCDINFENPLGIHNTHMLKCYSLTDPRVRPMVLFVKSWAKRRKVNSAYSG 304
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIY---PGNLVDDLKGVRANAERQIAEICAFN 233
T +SY L+VL + P + P L+ + + + G + R EI
Sbjct: 305 TLSSYGWVLMVLHYLVNVAYPPVCPNLQLMSHRPESEMTRIISGYQVRFWRHEQEII--- 361
Query: 234 IARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKA-----------------SELGICP- 275
R + N+ S+ L F + ++ LS + + GI
Sbjct: 362 --RSAQSGRLTENKESVGSLLRGFFQYYASLSGYSYPRPPQFYWTNEVLSLRTPGGILSK 419
Query: 276 ----FTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ G I + + + N + IEDPFE N AR V+ + + I + F
Sbjct: 420 QAKGWVGATTKITAEKK-VTNRYLFAIEDPFEIDHNVARTVTHRGIVTIRDEF 471
>gi|194748723|ref|XP_001956794.1| GF10110 [Drosophila ananassae]
gi|190624076|gb|EDV39600.1| GF10110 [Drosophila ananassae]
Length = 665
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/385 (21%), Positives = 154/385 (40%), Gaps = 45/385 (11%)
Query: 33 SDLREVVESVESLRGAT---------VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAG 83
SDLR++ +R V PFGS V+ L + D+D+ ++ +
Sbjct: 103 SDLRKLESCFGHVRNCIEKEMKGRVKVFPFGSLVTGLSLKESDIDLYLQPCDDQ------ 156
Query: 84 KKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSK 143
+ + L + LR+ + + + HARVPI++ + +S DI++ N S+
Sbjct: 157 NMMHKQLYNRVSHFLRRSKCFTDIFTIRHARVPIIRCKHTLTGLSIDINMSNPNSTYNSR 216
Query: 144 FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL 203
F+ + D R R++ L +K WAK + G+ SY L ++L Q V +LPP+
Sbjct: 217 FVGELILRDERLRELCLFLKIWAKKLKLIG--HGSMTSYCLLSMILVSLQ--VRKLLPPI 272
Query: 204 KDIYPGNLVDDLKGVRANAERQIA-----EICAFNIARFSSDKYRKINRSSLAHLFVSFL 258
K + ++ GV Q+ + ++ R + + ++ L
Sbjct: 273 KQLQSLCPPVNVFGVNYAYCLQLVPPIPRALKTLDLIRGFFEYFSNVDFEKCV------L 326
Query: 259 EKFSGLSLKASEL----GICPFTGQWEHIRSNTRWLPN----NHPLFIEDPFEQPENSAR 310
F G +L L G + Q + + + P + + ++DPFE N A+
Sbjct: 327 SPFLGCALDKETLARPGGFPEYEYQLKTMEESVGEAPEPFQLDRFMCVQDPFELQHNVAK 386
Query: 311 AVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFILQFFGESPVRYANYNNGHRR 370
VS NL+ + RL + LL + A+ + FG + + + G +
Sbjct: 387 GVSPTNLSYLRQC-----LRLAAQACNEQNLLQNPAQLYDCLLFGLADKLVTDVHKG--K 439
Query: 371 ARPQSHKSVNSPLQAQHQSHNAKKE 395
A P + L Q ++ ++ +
Sbjct: 440 ATPPKQQRTEIELSEQPKTTESETK 464
>gi|440796505|gb|ELR17614.1| nucleotidyltransferase domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 911
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 88/189 (46%), Gaps = 12/189 (6%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
++P E+ + R VI+ + +VVE++ + FGS ++++ D+D+ I +N
Sbjct: 272 VSPSAEEKQMREDVIARISKVVETL--WPSVQLRVFGSCATDIYLPTSDIDLCIMGANA- 328
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
S + +L ALR++ R+Q +A ARVPI+K DIS D
Sbjct: 329 --------CSPSPIDELASALRRRS-MGRVQAIATARVPIIKLVDAATGCLVDISFDVPT 379
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
G + + + + LL+K + K +N P TG SY+L ++++ + Q P
Sbjct: 380 GPAHINLIKRYLDEEPSVKPLALLIKYYLKQFGMNEPYTGGLGSYALIIMIISYLQLHKP 439
Query: 198 AILPPLKDI 206
+ +D+
Sbjct: 440 RAVEKQQDL 448
>gi|440467546|gb|ELQ36762.1| caffeine-induced death protein 1 [Magnaporthe oryzae Y34]
gi|440488651|gb|ELQ68366.1| caffeine-induced death protein 1 [Magnaporthe oryzae P131]
Length = 1067
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 62/156 (39%), Gaps = 16/156 (10%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L+F + CDI+ + L S +D R R MVL VK WAKA IN P G
Sbjct: 746 LEFPKSGVGVQCDINFSAHLALQNTLLLRCYSYVDPRVRPMVLFVKHWAKARGINTPYRG 805
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
T +SY L+VL + P + P L+ L AE + + C R
Sbjct: 806 TLSSYGYVLMVLHYLVNVAQPFVCPNLQ-----QLARPPNPHMTPAEMEATQFCKGKDVR 860
Query: 237 F----------SSDKYRKINRSSLAHLFVSFLEKFS 262
F +S NR S+ HL F E ++
Sbjct: 861 FWRDEEEIKGLASQNLLTQNRDSVGHLLRGFFEYYA 896
>gi|85078256|ref|XP_956137.1| hypothetical protein NCU04364 [Neurospora crassa OR74A]
gi|28917186|gb|EAA26901.1| predicted protein [Neurospora crassa OR74A]
gi|40882260|emb|CAF06085.1| related to caffeine-induced death protein 1 Cid1 [Neurospora
crassa]
Length = 1187
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 62/148 (41%), Gaps = 16/148 (10%)
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
I CDI+ + L S D R R MVL VK WAK IN+P GT +SY
Sbjct: 790 GIQCDINFSAHLAMHNTHLLRCYSSCDPRVRPMVLFVKHWAKVRGINSPYRGTLSSYGYV 849
Query: 186 LLVLFHFQTCV-PAILPPLKDIYPG----------NLVDDLKGVRANAERQIAEICAFNI 234
L+VL + V P + P L+ + P N V KG + R E I
Sbjct: 850 LMVLHYLINVVKPFVCPNLQQLAPPLPPDLTPEQLNDVAFCKGKNVHFWRDDQE-----I 904
Query: 235 ARFSSDKYRKINRSSLAHLFVSFLEKFS 262
R ++ NR S+ HL F E ++
Sbjct: 905 QRLAAMGMINQNRDSIGHLLRGFFEYYA 932
>gi|389634385|ref|XP_003714845.1| caffeine-induced death protein 1 [Magnaporthe oryzae 70-15]
gi|351647178|gb|EHA55038.1| caffeine-induced death protein 1 [Magnaporthe oryzae 70-15]
Length = 1067
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 62/156 (39%), Gaps = 16/156 (10%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L+F + CDI+ + L S +D R R MVL VK WAKA IN P G
Sbjct: 746 LEFPKSGVGVQCDINFSAHLALQNTLLLRCYSYVDPRVRPMVLFVKHWAKARGINTPYRG 805
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
T +SY L+VL + P + P L+ L AE + + C R
Sbjct: 806 TLSSYGYVLMVLHYLVNVAQPFVCPNLQ-----QLARPPNPHMTPAEMEATQFCKGKDVR 860
Query: 237 F----------SSDKYRKINRSSLAHLFVSFLEKFS 262
F +S NR S+ HL F E ++
Sbjct: 861 FWRDEEEIKGLASQNLLTQNRDSVGHLLRGFFEYYA 896
>gi|221487135|gb|EEE25381.1| poly(A) polymerase cid, putative [Toxoplasma gondii GT1]
Length = 940
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 55/119 (46%), Gaps = 13/119 (10%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
V A+VPI K H D+S++N S F+ ID R R + +K WA
Sbjct: 585 VVPAQVPIAKVCNAHGKGLIDVSVNNCTALENSIFVETFGAIDDRVRPLGRFIKHWATQR 644
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL----KDIYP---------GNLVDDL 215
+INN GT ++Y+L L + F Q P ILPP K+++P G L+ D
Sbjct: 645 NINNRAEGTLSTYTLMLQLFFFLQQRSPPILPPYTYIRKNLFPEYGHNKETVGELIHDF 703
>gi|170109615|ref|XP_001886014.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164638944|gb|EDR03218.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 397
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 96/198 (48%), Gaps = 18/198 (9%)
Query: 6 VLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWG 65
+L +K + ++P + E R +++ + V++ S A V PFGS+ + L+ G
Sbjct: 101 MLHAEVKAFVHWISPSPVEDEVRGLIVTQISNTVKA--SFPDARVLPFGSYETKLYLPLG 158
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
D+D+ I LS+ S+ K ++L L L++ G + +A A+VPI+KF T H
Sbjct: 159 DIDLVI-LSDSMAYSN-----KVNVLHALANTLKRSGVTSHVTIIAKAKVPIVKFVTTHG 212
Query: 126 NISCDISIDN----LCGQIKSKFL--FWISQIDGR----FRDMVLLVKEWAKAHDINNPK 175
DIS++ L G+I + FL + +G+ R +V++ K + +N
Sbjct: 213 RFHVDISLNQSNGLLSGKIINGFLKDMHGNGAEGKGSMALRSLVMVTKAFLTQRSMNEVY 272
Query: 176 TGTFNSYSLSLLVLFHFQ 193
TG SYS+ L + Q
Sbjct: 273 TGGLGSYSIVCLAVSFLQ 290
>gi|237831433|ref|XP_002365014.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211962678|gb|EEA97873.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|221506820|gb|EEE32437.1| hypothetical protein TGVEG_076640 [Toxoplasma gondii VEG]
Length = 940
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 4/103 (3%)
Query: 110 VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAH 169
V A+VPI K H D+S++N S F+ ID R R + +K WA
Sbjct: 585 VVPAQVPIAKVCNAHGKGLIDVSVNNCTALENSIFVETFGAIDDRVRPLGRFIKHWATQR 644
Query: 170 DINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL----KDIYP 208
+INN GT ++Y+L L + F Q P ILPP K+++P
Sbjct: 645 NINNRAEGTLSTYTLMLQLFFFLQQRSPPILPPYTYIRKNLFP 687
>gi|367043082|ref|XP_003651921.1| hypothetical protein THITE_2112714 [Thielavia terrestris NRRL 8126]
gi|346999183|gb|AEO65585.1| hypothetical protein THITE_2112714 [Thielavia terrestris NRRL 8126]
Length = 1275
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 136/322 (42%), Gaps = 44/322 (13%)
Query: 7 LEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGD 66
LE ++++ L P R +++S L + + V FGS + L S D
Sbjct: 280 LETDMRELYDRLLPTEAIEVNRRELVSKLERLFNTEWPGHDIRVHLFGSSGNLLCSDDSD 339
Query: 67 LDISIELSNGSCISSAGKKVKQ-SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
+DI CI++ ++++ ++ +LL + G ++ V+ A+VPI+K
Sbjct: 340 VDI--------CITTPWRELESVCMIAELL----DRHGMEKVVCVSSAKVPIVKIWDPEL 387
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPK-TGTFNSYSL 184
++CD++++N ++ + ID R R + +++K W + +N+ GT +SY+
Sbjct: 388 KLACDMNVNNTLALENTRMVRTYVSIDDRVRPLAMIIKYWTRRRVVNDAAFGGTLSSYTW 447
Query: 185 SLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRK 244
+++ Q P +LP L + LV G ++ I ++ F
Sbjct: 448 ICMIIAFLQLRDPPVLPALHQQHDLKLVKQ-DGALSDFADDIPKLRGFGAK--------- 497
Query: 245 INRSSLAHLFVSFL---------EKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP 295
N+ SLA L F +K++ LS++ L + W+++ +N
Sbjct: 498 -NKDSLAVLLFQFFRFYAHEFDYDKYT-LSIRMGTL-LTKAEKNWQYLVNNA-------- 546
Query: 296 LFIEDPFEQPENSARAVSEKNL 317
L +E+PF N E +
Sbjct: 547 LCVEEPFNDGRNLGNTADETSF 568
>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
Length = 701
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 92/195 (47%), Gaps = 17/195 (8%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
+ P + ETR VI + ++S R A V PFGS + L+ GDLD+ +
Sbjct: 110 MAPTGAEHETRCMVIELIARAIKS--QFRDAEVRPFGSQETKLYLPQGDLDLVV------ 161
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
S QS L + LR+ +Q +A A+VPI+KF T + + DIS+++
Sbjct: 162 VSRSMANLRTQSALRTMAACLRRHNLATDVQVIAKAKVPIIKFVTTYARLKVDISLNHTN 221
Query: 138 GQIKSKFL-FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCV 196
G + ++ W+ + R ++L++K ++ +G SYS+ ++V+ Q
Sbjct: 222 GLTTASYVNGWLRKWP-HIRPLILVIKHLLMQRGMSEVFSGGLGSYSVIIMVISFLQ--- 277
Query: 197 PAILPPLK--DIYPG 209
+ P L+ +I PG
Sbjct: 278 --LHPKLQRGEIEPG 290
>gi|452839338|gb|EME41277.1| hypothetical protein DOTSEDRAFT_73629 [Dothistroma septosporum
NZE10]
Length = 835
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 91/237 (38%), Gaps = 40/237 (16%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ N G ++ L S D R R MVL VK WAK IN+ +G
Sbjct: 447 LDFPKDGVGIQCDINFFNPLGLHNTQLLRCYSSCDPRVRPMVLFVKSWAKKRRINSSYSG 506
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIY--------PGNLVDDLKGVRANAERQIAE 228
T +SY ++VL + P +LP L+ + PG ++ + R E
Sbjct: 507 TLSSYGYVMMVLHYLVNVAQPPVLPNLQLPWRPHPHCTPPGAAKIEVDNWTVDFWRNEDE 566
Query: 229 I-CAFNIARFSSDKYRKINRSSLAHLFVSFLEKFS--------------------GLSLK 267
I A + + S N SL L F + +S G L
Sbjct: 567 IQAALHNGQMSG------NSESLGSLLAGFFQYYSSQGRGPQFRWTQWVVSIRTPGGILT 620
Query: 268 ASELGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ G T + N + + + + IEDPFE N AR V+ + I + F
Sbjct: 621 KDQKGWVKATTE----EGNGKKIQHRYLFCIEDPFELSHNVARTVTHNGIVAIRDEF 673
>gi|116192867|ref|XP_001222246.1| hypothetical protein CHGG_06151 [Chaetomium globosum CBS 148.51]
gi|88182064|gb|EAQ89532.1| hypothetical protein CHGG_06151 [Chaetomium globosum CBS 148.51]
Length = 1097
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 123/262 (46%), Gaps = 28/262 (10%)
Query: 55 SFVSNLFSRWGDLDISIELSNGSCISS---------AGKKVKQSLLG---DLLRALRQKG 102
S + LFSR D +IE++ + G ++ +L G +LL +
Sbjct: 127 SHIQELFSRLLPTD-AIEMNRRKLVDKLEKLFNDEWPGHDIRVNLFGSSGNLLCSDDSDD 185
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G ++ V+ A+VPI+K +++CD++++N ++ + ID R R + +++
Sbjct: 186 GMEKVVCVSSAKVPIVKIWDPELSLACDMNVNNTLALENTRMVRTYVSIDDRVRPLAIII 245
Query: 163 KEWAKAHDINNPK-TGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRAN 221
K W + IN+ GT +SY+ +++ Q P +LP L + +LK ++A+
Sbjct: 246 KYWTRRRIINDAAFGGTLSSYTWICMIIAFLQLREPPVLPALHQRH------NLKLLKAD 299
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
+R +E A +I++ + N+ +LA L F +F + + TG+
Sbjct: 300 GKR--SEF-ADDISKLRG--FGAKNKDNLATLLFQFF-RFYAHEFDYDKHALSIRTGKLL 353
Query: 282 HIRSNTRW-LPNNHPLFIEDPF 302
++ +W + +N+ L IE+PF
Sbjct: 354 -TKTEKKWHIGSNNALCIEEPF 374
>gi|301624426|ref|XP_002941502.1| PREDICTED: u6 snRNA-specific terminal uridylyltransferase 1
[Xenopus (Silurana) tropicalis]
Length = 843
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 94/224 (41%), Gaps = 25/224 (11%)
Query: 103 GYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLV 162
G +Q V AR P++ F+ + D++++N S FL S +D R +V V
Sbjct: 316 GVHGVQSVPTARRPVIHFQHKTSGLRGDVTLNNRLALRNSSFLRLCSDLDARVPQLVYTV 375
Query: 163 KEWAKAHDI-NNPKTGT--FNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVR 219
+ WA+ + + NP G N+Y+L+LLV F QT P +LP L + R
Sbjct: 376 RYWARVNQLAGNPFGGGPLLNNYALTLLVFFFLQTRNPPVLPTLVHL------------R 423
Query: 220 ANAERQIAEICAFNIARFSSDKYRKINR---SSLAHLFVSFLEKFSGLSLKASELGICPF 276
++ ++ F SD + SS+ L F ++ L L L +CP
Sbjct: 424 EETANEVPQVIDGWDCSFPSDPAQVKESGXYSSIGSLLSEFFSFYASLDLHL--LILCPC 481
Query: 277 TGQWEHIRSNT---RWLPNNH--PLFIEDPFEQPENSARAVSEK 315
G + ++ W PL I+DPFE N VS +
Sbjct: 482 NGLTIPLPFSSPPPAWSEGFRLGPLNIQDPFELSHNVCGNVSSR 525
>gi|353241543|emb|CCA73351.1| related to TRF4-topoisomerase I-related protein [Piriformospora
indica DSM 11827]
Length = 628
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 70/151 (46%), Gaps = 12/151 (7%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKG-GYRR 106
AT FGS+ + L+ GD+D+ IE + + K Q L L LR G RR
Sbjct: 130 ATATAFGSYATGLYLPTGDIDVVIETKYATA---STKNAAQRALSQLATILRSAGLAERR 186
Query: 107 LQFVAHARVPILKFETIHQNISCDISIDNLCG----QIKSKFLFWISQIDGRFRDMVLLV 162
V ARV I+KF+++H I DIS++ G + +++L + R ++++V
Sbjct: 187 KIQVISARVSIIKFDSVHGGIPVDISLNQTTGVSAIPVINRYLEHFPAL----RPLIMVV 242
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ 193
K + +N G SYS+ L + Q
Sbjct: 243 KAFLNQRGMNEVYKGGLGSYSIICLAISFLQ 273
>gi|348665578|gb|EGZ05407.1| hypothetical protein PHYSODRAFT_533748 [Phytophthora sojae]
Length = 1111
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 62/117 (52%), Gaps = 9/117 (7%)
Query: 94 LLRALRQKGGYRRLQFV-AHARVPILKFETIH----QNISCDISIDNLCGQIKSKFLFWI 148
LLRA+ Q+ ++ V A ARVPI++F +H + CD+ +N+ + L
Sbjct: 533 LLRAILQRAAKCEVRHVIAGARVPIIRF--LHTRSGREYECDLCFENVLATRNTPLLRAY 590
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKD 205
+ D R R + L VK WAK I++ G +SYS LL +++ Q V +LP L+D
Sbjct: 591 ASFDDRARALGLAVKHWAKQRSISDASMGFLSSYSFVLLSIYYLQ--VVHVLPNLQD 645
>gi|195344129|ref|XP_002038641.1| GM10512 [Drosophila sechellia]
gi|194133662|gb|EDW55178.1| GM10512 [Drosophila sechellia]
Length = 563
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/317 (23%), Positives = 138/317 (43%), Gaps = 45/317 (14%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ----KGGYR 105
V FGS ++ + +R DLD+ +++ G+ + + + + L RA+++ +R
Sbjct: 265 VYKFGSRITGIGNRSSDLDLFVDI--GNTFHTFEHRASNATVAKL-RAMKKFFCVSEDWR 321
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+ F+ ARVPI+K + I CDI ++++ G + L +I + + M + VK W
Sbjct: 322 LINFIEQARVPIIKTCHLPTGIECDICLNSM-GFCNTNLLKYIFESQPLTQYMCIYVKNW 380
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
+ + T ++YS++L+V++ Q + A+LPP+ + ++D A +
Sbjct: 381 LERCKL----TEQISTYSITLMVIYFLQ--LQALLPPIAMLQ----IED-----AANQAV 425
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLK--ASELGICPFTGQWE-- 281
+ N A+ S + R + + SFL F K +CP+ G+
Sbjct: 426 LVGPWVVNFAQKSFSELRLQKLQATVPVIKSFLRNFFAYFAKFDYEHFVVCPYIGKANVE 485
Query: 282 --------HIRSNTRWLPN-------NHPLFIEDPFEQPENSARAVSEKNLAKISNAFEM 326
H R + N P+ ++DP + N +AV++ L + +
Sbjct: 486 IPKVERMLHARYSAYVSENPECSIQLKKPMVVQDPIQLNHNVTKAVTKYGLQTFVDYCQQ 545
Query: 327 THFRLT--STN-QTRYA 340
T L STN + RYA
Sbjct: 546 TAELLEEPSTNWRQRYA 562
>gi|336471277|gb|EGO59438.1| hypothetical protein NEUTE1DRAFT_79537 [Neurospora tetrasperma FGSC
2508]
gi|350292370|gb|EGZ73565.1| hypothetical protein NEUTE2DRAFT_108267 [Neurospora tetrasperma
FGSC 2509]
Length = 1083
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 63/146 (43%), Gaps = 14/146 (9%)
Query: 127 ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSL 186
I CDI+ + L S D R R MVL VK WAK IN+P GT +SY +
Sbjct: 689 IQCDINFSAHLAMHNTHLLRCYSSCDPRVRPMVLFVKHWAKVRGINSPYRGTLSSYGYVM 748
Query: 187 LVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFN---------IAR 236
+VL + V P + P L+ + P L DL + N +A N I R
Sbjct: 749 MVLHYLINVVKPFVCPNLQQLAP-PLPPDLTAEQLN---DVAFCKGKNVHFWRDDQEIQR 804
Query: 237 FSSDKYRKINRSSLAHLFVSFLEKFS 262
++ NR S+ HL F E ++
Sbjct: 805 LAAMGMINQNRDSIGHLLRGFFEYYA 830
>gi|145348860|ref|XP_001418861.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579091|gb|ABO97154.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 347
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 123/315 (39%), Gaps = 57/315 (18%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGS 77
L P E+ +R + +R VV + A E GSF + ++ D+D +
Sbjct: 76 LEPTEEEATSRAAAVERVRAVVNGI--WPDARFEVHGSFATGMYLPSSDID--------A 125
Query: 78 CISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLC 137
I +G K L L AL ++G ++Q +A ARVPI+KFE + DIS D
Sbjct: 126 VILDSGAKNAGLCLKALAVALARRGMAIKIQLIAKARVPIVKFEEVESGHQFDISFDVAN 185
Query: 138 GQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQ---- 193
G ++ + + R + ++K + +N +G SY+L +V+ H Q
Sbjct: 186 GPASAEIVRENMRRFPALRPLTTVLKAFLHQRGLNEVYSGGIGSYALLCMVMAHLQLHNT 245
Query: 194 TCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAHL 253
TC K + G+ + +E C L L
Sbjct: 246 TC--------KSTWAGS----------HGASDASEGC-------------------LGTL 268
Query: 254 FVSFLEKFSGLSLKASELGI-CPFTGQWEHIRSNTRWLPNNHPLF--IEDPFEQPENSAR 310
+ F E F G L A E+GI C G + + ++ P IEDP ++ + R
Sbjct: 269 LIDFFELF-GRRLVAEEVGISCGGKGPGFFKKRDKGMYEDSRPFLWAIEDPQDETNDLGR 327
Query: 311 AVSEKNLAKISNAFE 325
+ ++ +AFE
Sbjct: 328 --NSYACRQVKSAFE 340
>gi|219124672|ref|XP_002182622.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405968|gb|EEC45909.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 502
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 101/216 (46%), Gaps = 24/216 (11%)
Query: 11 LKDILGMLNPLREDWETRMKVISDLREVVESVESLR--GATVEPFGSFVSNL-FSRWGDL 67
L D+ L+ R + + + I+ L E + ++ R A + +GS +S+L + D+
Sbjct: 41 LWDVACGLHQDRNERQAYTRAINILHEHLSTLVQQRFPDARLGVYGSCLSDLSLGKSSDV 100
Query: 68 DISIELSN----------GSCISSAGKKVKQSLLGDLLRAL-RQKGGYRRLQFVAHARVP 116
D+S++ G C + +SL+ + R + R+K +R +Q V ARVP
Sbjct: 101 DLSLDFKRARKVKDQFEIGKCPVQRYESEMKSLVYAVCRTMERRKHEFRAMQPVTRARVP 160
Query: 117 ILKFE--------TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKA 168
++K T+ +I D+ N + S L S +D R + +++ VK WAKA
Sbjct: 161 VIKGTYLGANNPYTVDGSIDFDVCFLNDIAVVNSSLLREYSIVDDRVKALMIAVKRWAKA 220
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTC--VPAILPP 202
I + + T +SY+ LV+F+ Q VP + P
Sbjct: 221 FGICSSQHNTLSSYAWMNLVIFYLQNVGFVPNLQSP 256
>gi|440637467|gb|ELR07386.1| hypothetical protein GMDG_08401 [Geomyces destructans 20631-21]
Length = 753
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 91/197 (46%), Gaps = 11/197 (5%)
Query: 1 MGSYNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNL 60
MGS+ +E + D P + R +ISDLR V+ V A + PFGS+ + L
Sbjct: 417 MGSWLHME--IMDFFHRFKPAEVEENMRGALISDLRRAVQKV--WHDADILPFGSYPAGL 472
Query: 61 FSRWGDLDISIELSNGSCISSAGKKVKQSLL---GDLLRALRQKGGYRRLQFVAHARVPI 117
+ D+D+ + +S G GK ++ L D L + Y ++ ++ A+VP+
Sbjct: 473 YLPTADMDL-VFVSRGYMDGGYGKYTNKNALFRFRDFLDREKIAAPYS-IEVISKAKVPL 530
Query: 118 LKFETIHQNISCDISIDNLCGQIKSK-FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKT 176
+K+ + + D+S +N G I +K F W +V +VK++ +N P
Sbjct: 531 VKYIDYYTGLRVDVSFENDTGLIANKTFQNWKDTFPA-MPILVTIVKQFLAMRGLNEPVN 589
Query: 177 GTFNSYSLSLLVLFHFQ 193
G ++++ LV+ Q
Sbjct: 590 GGIGGFTVTCLVVSLLQ 606
>gi|348687890|gb|EGZ27704.1| hypothetical protein PHYSODRAFT_343641 [Phytophthora sojae]
Length = 501
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/190 (23%), Positives = 87/190 (45%), Gaps = 12/190 (6%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSR 63
Y L + D + ++P ++ +R ++I ++RE+V+ + ATVE FGS + +F
Sbjct: 129 YACLHEEIMDFVSFISPTEQELSSRAELIEEMREIVKGL--WPEATVETFGSHYTQMFLP 186
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
D+D+ + G ++ L L + L +K L+ + AR+PI+K
Sbjct: 187 QSDIDMVL----------FGVPEGKAPLFKLAQCLEEKELVSYLEVIDKARIPIVKMVHK 236
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
+I D+S + G + ++ FR + L++K + +N TG S+
Sbjct: 237 ASDIHVDVSFNVAGGLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLNETYTGGVGSFL 296
Query: 184 LSLLVLFHFQ 193
L ++V+ Q
Sbjct: 297 LQMMVVSFLQ 306
>gi|384484085|gb|EIE76265.1| hypothetical protein RO3G_00969 [Rhizopus delemar RA 99-880]
Length = 539
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 63/121 (52%), Gaps = 4/121 (3%)
Query: 86 VKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFL 145
+K ++ G + L GG + + V A+VPI++ +SCDI+++N +K +
Sbjct: 14 IKPNVFGSSVNNL---GGMQHIVCVPRAKVPIVRLFDPEMQLSCDINVNNTVALENTKMI 70
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKT-GTFNSYSLSLLVLFHFQTCVPAILPPLK 204
+D R R ++++VK W K +N+ GT +SY+ + +++ Q P ILP L
Sbjct: 71 KVYVSLDPRVRPLIMIVKHWTKQRLLNDAANGGTLSSYTWTCMIINFLQQREPPILPVLH 130
Query: 205 D 205
+
Sbjct: 131 E 131
>gi|391326037|ref|XP_003737532.1| PREDICTED: uncharacterized protein LOC100904685 [Metaseiulus
occidentalis]
Length = 2575
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 125/295 (42%), Gaps = 35/295 (11%)
Query: 37 EVVESVESLRGATVEP-----FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLL 91
E+V+++ES T+ FGS + DLDI ++ N I + VK ++
Sbjct: 1486 EIVQNIESFIQKTMPEAYLTLFGSSRNGFSLEKADLDICLKYKNKEDIDPS-MDVK-DII 1543
Query: 92 GDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQI 151
+ + L + +Q +A A+VPI+KF + DIS+ N+ + L S I
Sbjct: 1544 KRISKILEKHPDISDVQAIASAKVPIVKFHHDPFGVDGDISLYNVLAVHNTAMLKAYSMI 1603
Query: 152 DGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNL 211
D R + K++ K + + G+ +SY+ ++++ + Q V ++P L+ I P
Sbjct: 1604 DERVVRLGAAFKQYVKLCHMGDASRGSLSSYAYIVMLIHYLQ--VENVVPVLQSIPP--- 1658
Query: 212 VDDLKGVRANAERQIAEICAFNIARFSS-DKYRKI------NRSSLAHLFVSFLEKFSGL 264
G A E I +N F +K ++ NR ++ L++ ++ +
Sbjct: 1659 ----IGHPAGEELPKVMIAGWNAFYFKDIEKLSEVWPEYGSNRKTVGQLWLGLIDYY--- 1711
Query: 265 SLKASELGICPFTGQWEHIRSNTR----WLPNNHPLFIEDPFEQPENSARAVSEK 315
A++ F + TR W N PL IEDPFE N +S K
Sbjct: 1712 ---ATKFRFDHFVVSIRQLEPLTRLEKMW--TNKPLCIEDPFELDHNLGTGISAK 1761
Score = 41.6 bits (96), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 48/253 (18%), Positives = 101/253 (39%), Gaps = 39/253 (15%)
Query: 66 DLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
D+++ L N S ++ + ++L+ L + +A +P++K
Sbjct: 1074 DINMDFHLCN-----SGPPNIQAKIYFEVLKILEEWSALIDFDAQLNAAIPMVKAFHRSS 1128
Query: 126 NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLS 185
N + +I + ++ L +D R + + + WAK +++ G ++S +
Sbjct: 1129 NFAVEIVFGGVASLKTNRLLQDYGSLDERVAPLAVNFRYWAKQCSLDDSHIGFLPAHSFA 1188
Query: 186 LLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKI 245
++ +++ Q P +LP + D DD K E+Q + ++
Sbjct: 1189 IMTVYYLQQISPPVLPCIHDSMKDTEDDDYK----KPEQQ--------------NDWKSE 1230
Query: 246 NRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSN------TRWLPNNHPLFIE 299
N S+A L++ L +F +L I IRS TR P+ + IE
Sbjct: 1231 NNMSIAELWLGML-RFYAAEFPVRKLCIS--------IRSRKKTTLATRQWPSRF-IGIE 1280
Query: 300 DPFEQPENSARAV 312
DP+ + ++ A+ +
Sbjct: 1281 DPYSKKKSLAKCI 1293
>gi|326430489|gb|EGD76059.1| hypothetical protein PTSG_00768 [Salpingoeca sp. ATCC 50818]
Length = 425
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 85/182 (46%), Gaps = 15/182 (8%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D + P++E+ R + ++EV+ + TV FGSF + L+ D+D+ +
Sbjct: 103 DFHKYMEPMKEEVTLRRAFVDRVKEVILGLWPKAEVTV--FGSFNTGLYLPTSDIDVVV- 159
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ + L L RALRQ +++ +A ARVPI+KF N+ DIS
Sbjct: 160 FGDWAVPP----------LQTLARALRQVNIPDKMEVIAKARVPIVKFRDKVTNLWMDIS 209
Query: 133 IDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFH 191
+ G Q W +Q G +VL++K++ +N P +G SY++ LLV+
Sbjct: 210 FNQPSGPQDSINVKKWKTQYRG-LVPLVLIIKQFLLQRGLNEPFSGGIGSYAVFLLVMSF 268
Query: 192 FQ 193
Q
Sbjct: 269 LQ 270
>gi|268562716|ref|XP_002646755.1| Hypothetical protein CBG13152 [Caenorhabditis briggsae]
Length = 308
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 80/162 (49%), Gaps = 14/162 (8%)
Query: 43 ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLL--RALRQ 100
E +R V FGSF++ S DLD+ I L+ S V QS+ DL R L+Q
Sbjct: 31 EGVRIDRVAVFGSFITQCVSNNSDLDLCICLNLEFGGKSMPVTVLQSVYRDLQHNRNLKQ 90
Query: 101 KGGYRR---LQFVAHARVPILKFETIHQNISCDISIDNLCGQ-----IKSKFLFWISQID 152
G R L FV+ A+VPI+KF+ I+ D+S C + +KF+ Q+D
Sbjct: 91 FFGDNRITHLSFVSSAKVPIIKFKM--NGIAIDLSAI-FCTSPPSSCVAAKFINAYCQLD 147
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTF-NSYSLSLLVLFHFQ 193
RF +V +K W ++ N F NSY+L++L++ Q
Sbjct: 148 DRFVILVTFIKTWLRSEGDPNDHLREFPNSYALTILLIHALQ 189
>gi|224098465|ref|XP_002311184.1| predicted protein [Populus trichocarpa]
gi|222851004|gb|EEE88551.1| predicted protein [Populus trichocarpa]
Length = 140
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 50/87 (57%), Gaps = 1/87 (1%)
Query: 292 NNHPLFIEDPFEQPENSARAVSEKNLAKISNAFEMTHFRLTSTNQTRYALLSSLARPFIL 351
+NH FI +PEN+ARAVS NL KIS A + T+ +L NQ R+++L +L RP I
Sbjct: 34 DNHWPFIS-LLVKPENTARAVSAGNLVKISEAIQTTYLKLVLVNQNRFSVLEALVRPRIS 92
Query: 352 QFFGESPVRYANYNNGHRRARPQSHKS 378
+F +P ++ +GH P + S
Sbjct: 93 RFIAGTPAGNSSNTDGHHVRAPVGNSS 119
>gi|442617798|ref|NP_001262327.1| CG1091, isoform C [Drosophila melanogaster]
gi|440217142|gb|AGB95710.1| CG1091, isoform C [Drosophila melanogaster]
Length = 563
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 73/322 (22%), Positives = 139/322 (43%), Gaps = 50/322 (15%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELS-NGSCISSAGKKVKQSLLGDLLRALRQ----KGGY 104
V FGS ++ + +R DLD+ +++ +G+ + + + + L RA+R+ +
Sbjct: 262 VYKFGSRITGIGNRSSDLDLFVDIGKSGNTFHTFEHRASNATVAKL-RAMRKFFCDSEDW 320
Query: 105 RRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
R + F+ ARVPI+K + I CDI ++++ G + L +I + + M + VK
Sbjct: 321 RLINFIEQARVPIIKTCHLPTGIECDICLNSM-GFCNTNLLKYIFESQPLTQYMCIYVKN 379
Query: 165 WAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAER 224
W + + T ++YS++L+V++ Q + A+LPP+ + ++D A
Sbjct: 380 WLERCKL----TEQISTYSITLMVIYFLQ--LQALLPPIAMLQ----IED-------AAN 422
Query: 225 QIAEICAFNIARFSSDKYRKINRSSL---AHLFVSFLEKFSGLSLK--ASELGICPFTGQ 279
Q + + + F+ + ++ L + FL F K +CP+ GQ
Sbjct: 423 QAVLVGPW-VVNFAQKSFSELGLQQLKATVPVIKGFLRNFFAYFAKFDYEHFLVCPYIGQ 481
Query: 280 WE----------HIRSNTRWLPN-------NHPLFIEDPFEQPENSARAVSEKNLAKISN 322
H R + N P+ ++DP + N +AV++ L +
Sbjct: 482 ANVEIAKIERMLHARYSAYVSDNPECSIQLKKPMVVQDPIQLNHNVTKAVTKYGLQTFVD 541
Query: 323 AFEMTHFRLT--STN-QTRYAL 341
+ T L STN + RYA
Sbjct: 542 YCQQTAELLEEPSTNWRQRYAF 563
>gi|313218095|emb|CBY41415.1| unnamed protein product [Oikopleura dioica]
Length = 528
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 125/271 (46%), Gaps = 34/271 (12%)
Query: 54 GSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHA 113
GS + ++ D+DI + + + + + + + ++ L +A+RQ G ++ + A
Sbjct: 79 GSTSNGFGTKNSDVDICLVIDHNTEMVNKTESMR--ALKACRKAMRQVGRFQDFSELIPA 136
Query: 114 RVPILKFETIHQNISCDISIDNLCGQIKSKFLFWIS----QI-DGRFRDMVLLVKEWAKA 168
+VPIL+ + + DI+ +NL G + L S QI D R + + + +K+ K
Sbjct: 137 KVPILRLNL--RGVQIDINCNNLTGLRNTWLLNAYSASGNQINDPRVKPLAMFIKKICKK 194
Query: 169 HDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG-NLVDDLKGVRANAERQIA 227
INN GT SYS++L+++ + QT P ILP L+ + N+ + L+ N R+I
Sbjct: 195 LTINNASEGTLTSYSINLMLINYLQTRSPPILPVLQVLDEEINISEGLE----NLPRRIR 250
Query: 228 EICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNT 287
++ +K+ N +++ L F + ++ I GQ
Sbjct: 251 QV---------PEKWEIKNTATVGQLAFGFFDYYNQFDFNQV---ISTRLGQPVKASDGR 298
Query: 288 RWLPNNHPLF------IEDPFEQPENSARAV 312
P+N LF IE+PF+ N+ARAV
Sbjct: 299 LMFPDNQ-LFTDKKIRIEEPFDGT-NTARAV 327
>gi|159112073|ref|XP_001706266.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
gi|157434361|gb|EDO78592.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
Length = 520
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 85/167 (50%), Gaps = 12/167 (7%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
R V+ LR++++ V L ATV+ FGS+ + + S DLDI + + ++SA +
Sbjct: 118 REYVLGQLRDIIQLV--LPDATVDVFGSYSTGMSSYSSDLDICVNVP----VNSAA--MM 169
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC--DISIDNLCGQIKSKFL 145
Q + D+ LR+ + +HARVPI+K +H +S DIS ++ G + +
Sbjct: 170 QCHMHDIATLLRRSISTNYVDVRSHARVPIIK--GVHSELSLEYDISFNSPHGAAHRETI 227
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
+ R ++++VK + K +N P TG +SY L L++ +
Sbjct: 228 LGYIEKHPLARILIMVVKSFLKKRGLNQPYTGGMSSYILLQLIVVYI 274
>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
in germ line development protein 4; AltName:
Full=Germline development defective-4
gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
Length = 845
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 127/320 (39%), Gaps = 61/320 (19%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D+ + P + R KV +R+ V + + FGS +NLF D+D+ +E
Sbjct: 86 DMYHWIKPNEIESRLRTKVFEKVRDSVLRRWKQKTIKISMFGSLRTNLFLPTSDIDVLVE 145
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
+ + + G L + R L + A VPI+K +S DIS
Sbjct: 146 CDD--WVGTPG-----DWLAETARGLEADNIAESVMVYGGAFVPIVKMVDRDTRLSIDIS 198
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
+ + G + ++ + + +VLL+K++ ++N TG +SY L LL++ F
Sbjct: 199 FNTVQGVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSYGLVLLLVNFF 258
Query: 193 QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYRKINRSSLAH 252
Q +Y N+ R I + R +N L H
Sbjct: 259 Q------------LYALNM----------RSRTIYD--------------RGVN---LGH 279
Query: 253 LFVSFLEKFSGLSLKASELGICPFTGQWEHI---RSNTRW-----LPNNHPLFIEDPFEQ 304
L + FLE +S L E+GI P GQ +I S R+ P N L +EDP
Sbjct: 280 LLLRFLELYS-LEFNFEEMGISP--GQCCYIPKSASGARYGHKQAQPGN--LALEDPLLT 334
Query: 305 PENSARAVSEKNLAKISNAF 324
+ R S N + I+NAF
Sbjct: 335 ANDVGR--STYNFSSIANAF 352
>gi|169610601|ref|XP_001798719.1| hypothetical protein SNOG_08406 [Phaeosphaeria nodorum SN15]
gi|160702106|gb|EAT84682.2| hypothetical protein SNOG_08406 [Phaeosphaeria nodorum SN15]
Length = 664
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 93/244 (38%), Gaps = 45/244 (18%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ +N G + L S D R R MVL VK WAK IN+ +G
Sbjct: 362 LDFPKEGVGIQCDINFENPLGIHNTHMLRCYSLTDPRVRLMVLFVKAWAKRRKINSSYSG 421
Query: 178 TFNSYSLSLLVLFHFQTCV-PAILPPLKDIYP-----GNLVDDLKGVRANA--------E 223
T +SY L+VL + P + P L+ P ++ D KG E
Sbjct: 422 TLSSYGWVLMVLHYLVNIAQPPVCPNLQHSIPQPKDISHIEDFFKGPTVAGYTVRFWRNE 481
Query: 224 RQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLS------------------ 265
++I + A R S NR S+ L F + ++ L
Sbjct: 482 QEIMQ--AAQSGRLSQ------NRQSVGDLLRGFFQYYASLPQYNNHGPRAPQFYWTNEV 533
Query: 266 LKASELGICPFTGQWEHIRSNT-----RWLPNNHPLFIEDPFEQPENSARAVSEKNLAKI 320
L LG + + T R + N + IEDPFE N AR V+ + + I
Sbjct: 534 LSLRTLGGIRTKQDKGWVSAKTTITAERKVTNRYLFAIEDPFELDHNVARTVTHRGIVAI 593
Query: 321 SNAF 324
+ F
Sbjct: 594 RDEF 597
>gi|328699572|ref|XP_001949146.2| PREDICTED: hypothetical protein LOC100166285 [Acyrthosiphon pisum]
Length = 1645
Score = 62.0 bits (149), Expect = 5e-07, Method: Composition-based stats.
Identities = 53/186 (28%), Positives = 89/186 (47%), Gaps = 22/186 (11%)
Query: 31 VISDLREVVESVES-----LRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKK 85
+I L +V S+ S +G+ FGS +S L + D+D+ +++ + G +
Sbjct: 1308 IIDRLFLLVNSIHSCAGQHFKGSKTYAFGSRMSGLALKDSDVDLYFDIA-----GTFGGE 1362
Query: 86 VKQSLLG--DLLRAL-----RQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCG 138
+ L DL+R Q Y+ +Q + ARVPI+KF + + CD+S +
Sbjct: 1363 LSNDLYAQEDLVRYFGKVFRSQNNDYKHIQQITGARVPIVKFLHVPSGLYCDLSFKSGLS 1422
Query: 139 QIKSKFLFWISQIDGRFRDMVL-LVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVP 197
+K + +D R +V +VK WA +D+ N F SY+L+ LVLF+ T
Sbjct: 1423 THNTKLVRLYLALDERVHWIVCAVVKRWALQNDLKN--QSMFTSYALAWLVLFYLMTI-- 1478
Query: 198 AILPPL 203
++PPL
Sbjct: 1479 DVVPPL 1484
>gi|432113341|gb|ELK35753.1| Terminal uridylyltransferase 7 [Myotis davidii]
Length = 268
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 61/117 (52%)
Query: 93 DLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQID 152
+L R L++ R + + A+VPI+K + DI + N ++ LF S ID
Sbjct: 20 ELARVLKKHSDLRNILPITTAKVPIVKSYHWRSGLEVDICLYNTLALHNTRLLFAYSAID 79
Query: 153 GRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPG 209
R + + +K +AK DI + G+ +SY+ +L+VL+ Q ++P L++IY G
Sbjct: 80 PRVKYLCYTMKVFAKICDIGDASRGSLSSYAYTLMVLYFLQQRKSPVIPVLQEIYKG 136
>gi|390179639|ref|XP_003736949.1| GA10633, isoform C [Drosophila pseudoobscura pseudoobscura]
gi|388859932|gb|EIM53022.1| GA10633, isoform C [Drosophila pseudoobscura pseudoobscura]
Length = 444
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 73/339 (21%), Positives = 139/339 (41%), Gaps = 78/339 (23%)
Query: 8 EPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDL 67
+ I +D++ +L P+ +W R + FGS +S + +R DL
Sbjct: 130 DAIEEDLISVLTPVFPNWAMR---------------------IYKFGSRISGIGTRCSDL 168
Query: 68 DISIELSNGSCISSAGKKVKQSLLGDLLRALR----QKGGYRRLQFVAHARVPILKFETI 123
D+ +++ N I + K++L LRA+R +R + + ARVPI+K +
Sbjct: 169 DVFVDIGNTFDIFE-HRASKETLAK--LRAMRPAFCASNKWRIINVIEQARVPIIKVSHL 225
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
I CDI +++L G + L +I I + M + K W + K ++YS
Sbjct: 226 TTGIECDICLNSL-GFCNTNLLKYIFDIQPLAQYMCIYAKNW-----LERCKQTDISTYS 279
Query: 184 LSLLVLFHFQTCVPAILPPLKDI--------YPGNLVDDLKGVRANAERQIAEICAFNIA 235
++L+V++ Q + +LP + + + G + + G ++ + ++ E
Sbjct: 280 ITLMVIYFMQ--LHGLLPSVFALQHEQPFNQFVGPWIVNF-GQKSLQDLRLPE------- 329
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE-HIRSNTRWLPNNH 294
+D R+ L F +F KF +CP+ G E I+ + +PN +
Sbjct: 330 ---ADTDAPAVRNILGQFF-AFYSKFD-----YERFLVCPYFGSAEVQIQHVEKLMPNRY 380
Query: 295 ----------------PLFIEDPFEQPENSARAVSEKNL 317
P+ ++DP + N +AV+ L
Sbjct: 381 SKYTRENPECTLQLRKPMVVQDPIQLNHNVTKAVTRSAL 419
>gi|345495286|ref|XP_001606670.2| PREDICTED: speckle targeted PIP5K1A-regulated poly(A)
polymerase-like, partial [Nasonia vitripennis]
Length = 678
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 116/296 (39%), Gaps = 42/296 (14%)
Query: 53 FGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF--- 109
FGS VS L R DLDI I+ C + K + ++ A ++ Y R
Sbjct: 195 FGSTVSGLGFRNCDLDIYIDPGFPVCQENNSKLGPNVVTASVIFAEVKRILYARTYIFSK 254
Query: 110 ---VAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
+ A+ PI+KF I SCDIS N S + +D R R ++++K W
Sbjct: 255 VVPIPKAKTPIIKFFYIPSKTSCDISFKNSLAVHNSLLVKHCLSLDPRLRPAMMVIKYWV 314
Query: 167 KAHDINNPKTG-TFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
++ K G + YSL+LL LF+ Q ++PPL ++ + ++G + N +
Sbjct: 315 SNFEL---KGGDKMSKYSLTLLFLFYLQQKSVKLVPPLIELKRRVVPQIIEGWQVNFDN- 370
Query: 226 IAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQ------ 279
+ + + D N ++ L F + ++ K + +CP GQ
Sbjct: 371 -------SKSANNEDHEGAGNSKTIPELLHGFFDFYARYEFKHN--VVCPINGQSHKKVS 421
Query: 280 ----------------WEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAK 319
+ + N P N + ++DP E N + K+L K
Sbjct: 422 FNDVDNIPESMDRYKEYLKTKENPLPFPINKTMCVQDPNELSHNVTGNICPKHLEK 477
>gi|125778590|ref|XP_001360053.1| GA10633, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|54639804|gb|EAL29206.1| GA10633, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 541
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 73/339 (21%), Positives = 139/339 (41%), Gaps = 78/339 (23%)
Query: 8 EPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDL 67
+ I +D++ +L P+ +W R + FGS +S + +R DL
Sbjct: 227 DAIEEDLISVLTPVFPNWAMR---------------------IYKFGSRISGIGTRCSDL 265
Query: 68 DISIELSNGSCISSAGKKVKQSLLGDLLRALR----QKGGYRRLQFVAHARVPILKFETI 123
D+ +++ N I + K++L LRA+R +R + + ARVPI+K +
Sbjct: 266 DVFVDIGNTFDIFE-HRASKETLAK--LRAMRPAFCASNKWRIINVIEQARVPIIKVSHL 322
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
I CDI +++L G + L +I I + M + K W + K ++YS
Sbjct: 323 TTGIECDICLNSL-GFCNTNLLKYIFDIQPLAQYMCIYAKNW-----LERCKQTDISTYS 376
Query: 184 LSLLVLFHFQTCVPAILPPLKDI--------YPGNLVDDLKGVRANAERQIAEICAFNIA 235
++L+V++ Q + +LP + + + G + + G ++ + ++ E
Sbjct: 377 ITLMVIYFMQ--LHGLLPSVFALQHEQPFNQFVGPWIVNF-GQKSLQDLRLPE------- 426
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE-HIRSNTRWLPNNH 294
+D R+ L F +F KF +CP+ G E I+ + +PN +
Sbjct: 427 ---ADTDAPAVRNILGQFF-AFYSKFD-----YERFLVCPYFGSAEVQIQHVEKLMPNRY 477
Query: 295 ----------------PLFIEDPFEQPENSARAVSEKNL 317
P+ ++DP + N +AV+ L
Sbjct: 478 SKYTRENPECTLQLRKPMVVQDPIQLNHNVTKAVTRSAL 516
>gi|28573195|ref|NP_649693.3| CG1091, isoform B [Drosophila melanogaster]
gi|17945369|gb|AAL48740.1| RE16970p [Drosophila melanogaster]
gi|28381162|gb|AAF54068.3| CG1091, isoform B [Drosophila melanogaster]
gi|220960146|gb|ACL92609.1| CG1091-PA [synthetic construct]
Length = 560
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 138/321 (42%), Gaps = 51/321 (15%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ----KGGYR 105
V FGS ++ + +R DLD+ +++ G+ + + + + L RA+R+ +R
Sbjct: 262 VYKFGSRITGIGNRSSDLDLFVDI--GNTFHTFEHRASNATVAKL-RAMRKFFCDSEDWR 318
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+ F+ ARVPI+K + I CDI ++++ G + L +I + + M + VK W
Sbjct: 319 LINFIEQARVPIIKTCHLPTGIECDICLNSM-GFCNTNLLKYIFESQPLTQYMCIYVKNW 377
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
+ + T ++YS++L+V++ Q + A+LPP+ + ++D A Q
Sbjct: 378 LERCKL----TEQISTYSITLMVIYFLQ--LQALLPPIAMLQ----IED-------AANQ 420
Query: 226 IAEICAFNIARFSSDKYRKINRSSL---AHLFVSFLEKFSGLSLK--ASELGICPFTGQW 280
+ + + F+ + ++ L + FL F K +CP+ GQ
Sbjct: 421 AVLVGPW-VVNFAQKSFSELGLQQLKATVPVIKGFLRNFFAYFAKFDYEHFLVCPYIGQA 479
Query: 281 E----------HIRSNTRWLPN-------NHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
H R + N P+ ++DP + N +AV++ L +
Sbjct: 480 NVEIAKIERMLHARYSAYVSDNPECSIQLKKPMVVQDPIQLNHNVTKAVTKYGLQTFVDY 539
Query: 324 FEMTHFRLT--STN-QTRYAL 341
+ T L STN + RYA
Sbjct: 540 CQQTAELLEEPSTNWRQRYAF 560
>gi|24644730|ref|NP_731129.1| CG1091, isoform A [Drosophila melanogaster]
gi|23170625|gb|AAN13359.1| CG1091, isoform A [Drosophila melanogaster]
Length = 505
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 138/321 (42%), Gaps = 51/321 (15%)
Query: 50 VEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ----KGGYR 105
V FGS ++ + +R DLD+ +++ G+ + + + + L RA+R+ +R
Sbjct: 207 VYKFGSRITGIGNRSSDLDLFVDI--GNTFHTFEHRASNATVAKL-RAMRKFFCDSEDWR 263
Query: 106 RLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEW 165
+ F+ ARVPI+K + I CDI ++++ G + L +I + + M + VK W
Sbjct: 264 LINFIEQARVPIIKTCHLPTGIECDICLNSM-GFCNTNLLKYIFESQPLTQYMCIYVKNW 322
Query: 166 AKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQ 225
+ + T ++YS++L+V++ Q + A+LPP+ + ++D A Q
Sbjct: 323 LERCKL----TEQISTYSITLMVIYFLQ--LQALLPPIAMLQ----IED-------AANQ 365
Query: 226 IAEICAFNIARFSSDKYRKINRSSL---AHLFVSFLEKFSGLSLK--ASELGICPFTGQW 280
+ + + F+ + ++ L + FL F K +CP+ GQ
Sbjct: 366 AVLVGPW-VVNFAQKSFSELGLQQLKATVPVIKGFLRNFFAYFAKFDYEHFLVCPYIGQA 424
Query: 281 E----------HIRSNTRWLPN-------NHPLFIEDPFEQPENSARAVSEKNLAKISNA 323
H R + N P+ ++DP + N +AV++ L +
Sbjct: 425 NVEIAKIERMLHARYSAYVSDNPECSIQLKKPMVVQDPIQLNHNVTKAVTKYGLQTFVDY 484
Query: 324 FEMTHFRLT--STN-QTRYAL 341
+ T L STN + RYA
Sbjct: 485 CQQTAELLEEPSTNWRQRYAF 505
>gi|392562566|gb|EIW55746.1| Nucleotidyltransferase [Trametes versicolor FP-101664 SS1]
Length = 382
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 46/186 (24%), Positives = 85/186 (45%), Gaps = 14/186 (7%)
Query: 23 EDWETRMKVISDLREVVESVESL----RGATVE--PFGSFVSNLFSRWGDLDISIELSNG 76
E W+ M+ + E ++ + L G T + PFGS S D+D+ I ++
Sbjct: 133 EAWQQTMERRKEREETLKRLTQLIRFHYGDTYDARPFGSTCYGASSSTSDIDVVIIDADR 192
Query: 77 SCISSAGKKVKQSLLGDLLR--ALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISID 134
AG K + D+ R L ++ GY+ + + +A VP++K +SCD++I+
Sbjct: 193 PYGIPAGDKTALPPIYDVRRLAKLLKEEGYKSVSSIPYAAVPLVKLTDPDTGMSCDVNIN 252
Query: 135 NLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG------TFNSYSLSLLV 188
N G + L + +K W K+ D+NNP +F+SY+++L+
Sbjct: 253 NRLGVFNTALLRQYCLRAPSLARYLRTIKLWVKSVDLNNPSGEIDKGPRSFSSYAITLMT 312
Query: 189 LFHFQT 194
+ + Q+
Sbjct: 313 VAYLQS 318
>gi|7019642|emb|CAB75789.1| putative protein [Arabidopsis thaliana]
Length = 442
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 82/152 (53%), Gaps = 8/152 (5%)
Query: 18 LNPLREDWETRMKVISDLREVV-----ESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
P+ D+ TR +++ +L + +S ES +E +GSF N FS DLD+SI
Sbjct: 54 FRPVSADYNTRKELVKNLNAMAIDIFGKSEES--SPVLEAYGSFAMNTFSSQKDLDVSIN 111
Query: 73 LSNGSCISSAGKKVK-QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
S+G+ KK++ + LR+L +G R + + ARVPI++F I CD+
Sbjct: 112 FSSGTSEFYREKKLEILTRFATKLRSLEGQGFVRNVVPILSARVPIVRFCDQGTGIECDL 171
Query: 132 SIDNLCGQIKSKFLFWISQIDGRFRDMVLLVK 163
++++ G + S+ + ISQID RF+ + LL +
Sbjct: 172 TVESKDGILTSQIIRIISQIDDRFQKLCLLTQ 203
>gi|397644340|gb|EJK76352.1| hypothetical protein THAOC_01889 [Thalassiosira oceanica]
Length = 604
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 45/89 (50%), Gaps = 6/89 (6%)
Query: 113 ARVPILKFETIHQN------ISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWA 166
AR+PI+KF H + + CD+S+ N + + L S + R + +VK WA
Sbjct: 287 ARIPIVKFNVPHGDGDGRLLVECDLSLQNPLAVLNTALLRAYSSMSSDLRVLASIVKRWA 346
Query: 167 KAHDINNPKTGTFNSYSLSLLVLFHFQTC 195
KA DIN P T +SY L+++ TC
Sbjct: 347 KARDINCPSRHTLSSYGYVLMLISFLTTC 375
>gi|341895667|gb|EGT51602.1| hypothetical protein CAEBREN_28562 [Caenorhabditis brenneri]
Length = 510
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 65/284 (22%), Positives = 120/284 (42%), Gaps = 30/284 (10%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNG----SCISSAGKKVKQSLLGDLLRALRQKGG 103
++ +GS + +R+ D+D+S+ S + S + L D +A+ ++
Sbjct: 174 VVLDIYGSTRNGFGTRFCDVDMSLSFSPAPPPWATNSDRVMRAVAKALVDFPKAMDER-- 231
Query: 104 YRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQID-GRFRDMVLLV 162
+A+VPI++F + ++ DIS N ++ L + D R + + V
Sbjct: 232 ------YVNAKVPIVRFRSSDMDMEADISYKNDLALHNTQLLHQYCKWDPERLPTLGVWV 285
Query: 163 KEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPA-ILPPLKDIYPGNLVDDLKGVRAN 221
K WAK I G+ +SY+ ++++ + Q P +LP L+++ + N
Sbjct: 286 KAWAKRSGIGEASKGSLSSYAWIVMLIHYLQQVEPVPVLPCLQEM----------NHQKN 335
Query: 222 AERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWE 281
+ + + + R+ RSS+ LFV FL+ ++ S I T + E
Sbjct: 336 ENVYVQGYNTYYWKFVDASRARRC-RSSIVDLFVGFLDYYATY-FDYSANVIQMVTKRLE 393
Query: 282 HIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAFE 325
+ RWL +P+ I DPFE N A+ V + I E
Sbjct: 394 Y--KPDRWL--KYPMCIADPFETDHNLAQGVDQPMFEYIRACME 433
>gi|281203028|gb|EFA77229.1| hypothetical protein PPL_12439 [Polysphondylium pallidum PN500]
Length = 788
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 80/169 (47%), Gaps = 20/169 (11%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVE---------SVESLRGATVEPF--GSFVSNLF 61
D L+P + R ++I +++VE +V+S +G E F GS + L
Sbjct: 579 DKYSHLDPNHPLAQQRKRIIETTKKLVEINHSMKELNNVKSFKGYNPELFLFGSSSNGLA 638
Query: 62 SRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFE 121
+ DLDIS+ S + ++ DLL+ + ++ +Q + RVPI+KF
Sbjct: 639 FQSSDLDISLVTSKPLDQTRGTFRI-----ADLLK----RNNFKDIQPITRTRVPIVKFR 689
Query: 122 TIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
+SCD+SI+N SK ++ ID R R + L++K+W +D
Sbjct: 690 DEDSKLSCDLSINNPLAIYNSKMIYDYCSIDNRVRPLALVIKKWLCQYD 738
>gi|118359234|ref|XP_001012858.1| hypothetical protein TTHERM_00094040 [Tetrahymena thermophila]
gi|89294625|gb|EAR92613.1| hypothetical protein TTHERM_00094040 [Tetrahymena thermophila
SB210]
Length = 622
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 147/356 (41%), Gaps = 74/356 (20%)
Query: 7 LEPILKDILGMLNPLREDWETRM--------KVISDLREVVESVESLRGATVEPFGSFVS 58
LE IL D+ + E + R+ K+IS +R ++ + L TV+P+GS VS
Sbjct: 265 LEQILLDVYNQ-QKIEESFSKRLLQEVNFIRKIISMMR--LKELFGLEYVTVQPYGSIVS 321
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF--VAHARVP 116
+ D+DISI N +C ++ LL + ++ + ++ V A+ P
Sbjct: 322 GFAQKSSDVDISI---NTNCYIDESSFIQ--LLHNFIKQYCSNKSIKNVETEPVLQAQTP 376
Query: 117 ILKF---ETIHQ---NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
+LK+ +T+ Q I DI ++N+ G S L+ +SQ+ + + + +++K WAK
Sbjct: 377 LLKYTRKDTVDQQQIKIDIDICVNNILGCTNSLMLYTLSQLHPKIQQLGIIIKHWAKQRG 436
Query: 171 INNPKTGTFNSYSLSLLVLFHF-------------------QTCVPAILPPLKD---IYP 208
+++ + S+ +L++F F C I KD I+
Sbjct: 437 VSSKSHLSSYSF---ILMMFSFLFREKILNSEFVKKSKSKENQCQVKIKRKKKDGEQIFQ 493
Query: 209 GNL-----VDDLKGVRA--NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
NL V+DLK A E+ + ++ SL+ LF+ F++ +
Sbjct: 494 TNLYFYSNVEDLKMKLAIWRKEKNLPS----------------LDDVSLSQLFIDFIKFY 537
Query: 262 -SGLSLKASE-LGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
S + K + I F + + N+ IEDPF+ N + K
Sbjct: 538 QSSFNFKDKRFISIFNFDVNYSNNNFKYELYDENYYFNIEDPFDTKHNPGQKTQNK 593
>gi|301114445|ref|XP_002998992.1| Poly(A) polymerase, putative [Phytophthora infestans T30-4]
gi|262111086|gb|EEY69138.1| Poly(A) polymerase, putative [Phytophthora infestans T30-4]
Length = 1062
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 65/123 (52%), Gaps = 11/123 (8%)
Query: 94 LLRALRQKGGYRRLQFV-AHARVPILKFETIH----QNISCDISIDNLCGQIKSKFLFWI 148
L+RA+ ++ ++ V A ARVPI++F +H ++ CD+ DN+ + L
Sbjct: 506 LVRAILERAAKCEVRHVIAGARVPIIRF--LHTRSGRDYECDLCFDNVLATWNTPLLRAY 563
Query: 149 SQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDIYP 208
+ D R R + L VK WAK I++ G +SYS LL +++ Q V +LP L+ P
Sbjct: 564 ASFDDRARTLGLAVKHWAKQRGISDASMGFLSSYSFVLLSIYYLQ--VVRVLPNLQ--AP 619
Query: 209 GNL 211
G L
Sbjct: 620 GLL 622
>gi|408389494|gb|EKJ68941.1| hypothetical protein FPSE_10866 [Fusarium pseudograminearum CS3096]
Length = 708
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 88/190 (46%), Gaps = 10/190 (5%)
Query: 12 KDILGMLNPLR-EDWETRMK--VISDLREVVE-SVESLRGATVEPFGSFVSNLFSRWGDL 67
K+++ + +R D+E R++ ++ +LR+ + + A+V PFGSF+S L+ D+
Sbjct: 379 KEVMDFYDYVRPRDFEQRIRDNLVENLRKAMRRDGRNFASASVHPFGSFMSGLYLPTADM 438
Query: 68 DI---SIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIH 124
D+ S G + G K L A +Q ++ +AHAR+P++KF
Sbjct: 439 DLVVCSASFMRGGPPTYLGAKSWLYKFQKFLVA-QQVAEQHSIEVIAHARIPLVKFVDKQ 497
Query: 125 QNISCDISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
+ D+S +NL G FL W Q +V ++K + +N P G +S
Sbjct: 498 TGLKVDVSFENLGGVNAIDTFLQWKEQYPA-MPILVTVIKHFLLMRGLNEPVNGGIGGFS 556
Query: 184 LSLLVLFHFQ 193
+ LV+ Q
Sbjct: 557 VICLVVSMLQ 566
>gi|159470731|ref|XP_001693510.1| poly(a) polymerase [Chlamydomonas reinhardtii]
gi|158283013|gb|EDP08764.1| poly(a) polymerase [Chlamydomonas reinhardtii]
Length = 517
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 89/184 (48%), Gaps = 13/184 (7%)
Query: 20 PLREDWETRMKVISDLREVVESV-ESLRGATVEPFGSFVSNLFSRWGDLDISI------- 71
P + R +VI +R V V RG ++ FGSF + L + DLD+ +
Sbjct: 116 PTEGERRQRQEVIEAVRGGVRRVWPGARGVELQVFGSFANGLSTWNSDLDLVVTGIYEPD 175
Query: 72 ELSNGSCISSAGKKVKQSLLGDLLRAL-RQKG-GYRRLQFVAHARVPILKFETIHQNISC 129
++ G I+ G+ + L + AL R K R Q + AR+PILK T ++
Sbjct: 176 RMTGGYEINDRGRITAK--LRKIAEALNRSKAIDIERQQLIPRARIPILKLWT-KARVTV 232
Query: 130 DISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
D+S+ + G ++++ + + +VL++K + KA +N +G +SYSL+ +V+
Sbjct: 233 DVSMSDDSGPRAARYMAQQCRAYPPLKPLVLVLKAYLKACRLNEVNSGGLSSYSLTNMVI 292
Query: 190 FHFQ 193
H Q
Sbjct: 293 AHLQ 296
>gi|84468450|dbj|BAE71308.1| hypothetical protein [Trifolium pratense]
Length = 518
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 83/177 (46%), Gaps = 10/177 (5%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D L+P E+ R I + EV++ + VE FGSF + L+ D+D+
Sbjct: 117 DFCEFLSPTPEEKAKRDAAIESVFEVIKHI--WPHCQVEIFGSFRTGLYLPTSDIDV--- 171
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
I +G Q L + R+L Q+ +++Q + ARVPI+KF +S DIS
Sbjct: 172 -----VILKSGLPNPQIGLNAISRSLSQRSMAKKIQVIGKARVPIIKFVEKKSGLSFDIS 226
Query: 133 IDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVL 189
D G ++++ + R + L++K + + ++N +G SY+L +++
Sbjct: 227 FDIDNGPKAAEYIQEAVAKWPQLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMLM 283
>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
98AG31]
Length = 987
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 73/150 (48%), Gaps = 14/150 (9%)
Query: 15 LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELS 74
+ + P RE+ E R+ +I +R+ V A V PFGSF + L+ GD+D+ I
Sbjct: 242 VAYIRPTREEDELRLMIIEMIRKAV--TMQWPDADVVPFGSFGTKLYLPGGDIDLVI--- 296
Query: 75 NGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDISID 134
+ K K +L L LR++ + + +A A+VPI+KF+TI N DISI+
Sbjct: 297 ---LSTRMMKDAKSKILYRLAPLLREQNIGQDVVVIAKAKVPIIKFKTIFGNFQVDISIN 353
Query: 135 NLCGQIKSKFLFWISQIDGRFRDMVLLVKE 164
G L + +++ D+ L K+
Sbjct: 354 QSNG------LVALEKVNELLDDVKYLSKD 377
>gi|171688616|ref|XP_001909248.1| hypothetical protein [Podospora anserina S mat+]
gi|170944270|emb|CAP70380.1| unnamed protein product [Podospora anserina S mat+]
Length = 1136
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 65/147 (44%), Gaps = 7/147 (4%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L+F + + CDI+ + L S D R R ++L VK WAK IN+P G
Sbjct: 838 LEFPKSNIGVQCDINFSAHLAVENTTLLRCYSLCDPRVRPLILFVKHWAKVRQINSPYRG 897
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIYP-GNLVDDLKGVRANAERQIAEICAFNIA 235
T SY ++++L + P ++P L+ + P G KG + R A+ I
Sbjct: 898 TLGSYGYAIMMLHYLINVARPFVVPNLQLLGPSGQPPQMCKGYPIHFWRDEAQ-----IE 952
Query: 236 RFSSDKYRKINRSSLAHLFVSFLEKFS 262
R + +NR SL L F E ++
Sbjct: 953 RLAKGNELTMNRESLGMLLRGFFEYYA 979
>gi|429852855|gb|ELA27970.1| poly rna polymerase cid13 [Colletotrichum gloeosporioides Nara gc5]
Length = 1059
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 61/151 (40%), Gaps = 8/151 (5%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L+F + CDI+ + L S D R R +VL +K WAK IN P G
Sbjct: 711 LEFPKSGVGVQCDINFSAHLALHNTLLLRCYSHTDPRVRPLVLFIKHWAKVRGINTPYRG 770
Query: 178 TFNSYSLSLLVLFHFQTCV-PAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAF---- 232
T +SY L++L + V P I P L+ + P + + + I F
Sbjct: 771 TLSSYGYVLMMLHYLVNVVQPFICPNLQSLGPAPPPEGISPT--GLDDSIGAFVGFWRDE 828
Query: 233 -NIARFSSDKYRKINRSSLAHLFVSFLEKFS 262
I R + NR S+ HL F E ++
Sbjct: 829 PEIRRLAQMNLINSNRESIGHLLRGFFEYYA 859
>gi|66823977|ref|XP_645343.1| hypothetical protein DDB_G0271962 [Dictyostelium discoideum AX4]
gi|60473471|gb|EAL71415.1| hypothetical protein DDB_G0271962 [Dictyostelium discoideum AX4]
Length = 466
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 99/214 (46%), Gaps = 23/214 (10%)
Query: 3 SYNVLEPILKDILGMLNPLRED---WETRMKVISDLREVVESVESLRG---ATVEPFGSF 56
+Y + IL+ + +N L E + R ++ L EV++ SL VE FGS
Sbjct: 137 NYKLNGEILRKLSVDINKLSERVRCYRDRTIILKRLEEVIKRETSLNKFGEIKVEIFGSS 196
Query: 57 VSNLFSRWGDLDISI----ELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAH 112
+ L + D+DI + EL+ + I+ + +LRA G+ ++ + H
Sbjct: 197 STQLALKKSDVDIVMSFETELTKRNDITKWCYQ-----FSSILRA----NGFYNIKPIIH 247
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A+VPI+KF CDI++ G + + Q+ +++ K WA +IN
Sbjct: 248 AKVPIVKFFDPKTEFHCDITLTKDSGN--TGVVKEFCQLLPILPVLIIFCKNWASVLNIN 305
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILPPLKDI 206
+ GT +SYS++ +V+F Q +LP KDI
Sbjct: 306 DASQGTLSSYSITNMVIFVLQK--KGLLPSYKDI 337
>gi|152926619|gb|ABS32300.1| RNA-dependent RNA polymerase-associated nucleotidyltransferase 1
[Tetrahymena thermophila]
Length = 543
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/356 (23%), Positives = 147/356 (41%), Gaps = 74/356 (20%)
Query: 7 LEPILKDILGMLNPLREDWETRM--------KVISDLREVVESVESLRGATVEPFGSFVS 58
LE IL D+ + E + R+ K+IS +R ++ + L TV+P+GS VS
Sbjct: 188 LEQILLDVYNQ-QKIEESFSKRLLQEVNFIRKIISMMR--LKELFGLEYVTVQPYGSIVS 244
Query: 59 NLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQF--VAHARVP 116
+ D+DISI N +C ++ LL + ++ + ++ V A+ P
Sbjct: 245 GFAQKSSDVDISI---NTNCYIDESSFIQ--LLHNFIKQYCSNKSIKNVETEPVLQAQTP 299
Query: 117 ILKF---ETIHQ---NISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHD 170
+LK+ +T+ Q I DI ++N+ G S L+ +SQ+ + + + +++K WAK
Sbjct: 300 LLKYTRKDTVDQQQIKIDIDICVNNILGCTNSLMLYTLSQLHPKIQQLGIIIKHWAKQRG 359
Query: 171 INNPKTGTFNSYSLSLLVLFHF-------------------QTCVPAILPPLKD---IYP 208
+++ + S+ +L++F F C I KD I+
Sbjct: 360 VSSKSHLSSYSF---ILMMFSFLFREKILNSEFVKKSKSKENQCQVKIKRKKKDGEQIFQ 416
Query: 209 GNL-----VDDLKGVRA--NAERQIAEICAFNIARFSSDKYRKINRSSLAHLFVSFLEKF 261
NL V+DLK A E+ + ++ SL+ LF+ F++ +
Sbjct: 417 TNLYFYSNVEDLKMKLAIWRKEKNLPS----------------LDDVSLSQLFIDFIKFY 460
Query: 262 -SGLSLKASE-LGICPFTGQWEHIRSNTRWLPNNHPLFIEDPFEQPENSARAVSEK 315
S + K + I F + + N+ IEDPF+ N + K
Sbjct: 461 QSSFNFKDKRFISIFNFDVNYSNNNFKYELYDENYYFNIEDPFDTKHNPGQKTQNK 516
>gi|414881049|tpg|DAA58180.1| TPA: hypothetical protein ZEAMMB73_639297, partial [Zea mays]
Length = 260
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 65/133 (48%), Gaps = 10/133 (7%)
Query: 13 DILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIE 72
D ++P E+ +R + D+ +VV+ + VE FGSF + L+ D+D+ I
Sbjct: 137 DFCDFISPSTEEQSSRAAAVQDVSDVVKHI--WPQCKVEVFGSFRTGLYLPTSDIDVVIF 194
Query: 73 LSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDIS 132
S K Q L L +AL QKG +++Q +A ARVPI+KF I+ DIS
Sbjct: 195 ESR--------VKTPQVGLYALAKALSQKGVAKKIQVIAKARVPIVKFVERKSGIAFDIS 246
Query: 133 IDNLCGQIKSKFL 145
D G + F+
Sbjct: 247 FDIDGGPQAADFI 259
>gi|66826981|ref|XP_646845.1| hypothetical protein DDB_G0268926 [Dictyostelium discoideum AX4]
gi|60475116|gb|EAL73052.1| hypothetical protein DDB_G0268926 [Dictyostelium discoideum AX4]
Length = 109
Score = 61.2 bits (147), Expect = 1e-06, Method: Composition-based stats.
Identities = 32/89 (35%), Positives = 49/89 (55%)
Query: 113 ARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDIN 172
A++PI++F+ I I D+ +++ S L ID R D+ LLVK WA + D+N
Sbjct: 3 AKIPIIRFKEISSGIHFDMCFNSMISYHNSLLLGEYCSIDNRCIDLALLVKWWAISKDLN 62
Query: 173 NPKTGTFNSYSLSLLVLFHFQTCVPAILP 201
N TF+S+ L +V+ Q+ P ILP
Sbjct: 63 NAAEKTFSSFCLVNMVIHFLQSLNPPILP 91
>gi|308162052|gb|EFO64479.1| Topoisomerase I-related protein [Giardia lamblia P15]
Length = 520
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/167 (27%), Positives = 82/167 (49%), Gaps = 12/167 (7%)
Query: 28 RMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVK 87
R V+ LR++++ V L ATV+ FGS+ + + S DLDI + + + +
Sbjct: 118 REYVLGQLRDIIQLV--LPDATVDVFGSYSTGMSSYSSDLDICVHVPMNNTAT------M 169
Query: 88 QSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISC--DISIDNLCGQIKSKFL 145
Q + D+ LR+ + +HARVPI+K +H +S DIS ++ G + +
Sbjct: 170 QCHMHDIATLLRRSISTNYVDVRSHARVPIIK--GVHSELSLEYDISFNSPHGAAHRETI 227
Query: 146 FWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHF 192
+ R ++++VK + K +N P TG +SY L L++ +
Sbjct: 228 LGYIEKHPLARILIMVVKSFLKKRGLNQPYTGGMSSYILLQLIVVYI 274
>gi|294877870|ref|XP_002768168.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239870365|gb|EER00886.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 621
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 52/172 (30%), Positives = 83/172 (48%), Gaps = 15/172 (8%)
Query: 48 ATVEPFGSFVSNLFSRWGDLDISIELSNGSC---ISSAG--KKVKQSLLGDLLRALRQ-- 100
AT+E FGS S L + D+D +I + +AG K + + + L +A+ +
Sbjct: 130 ATIETFGSAASRLSEKSSDIDATIICRFAALKKRFGAAGDEKSLCSAAVMGLGKAISKFE 189
Query: 101 ----KGGYRRLQFVAHARVPILKFETIHQNISC---DISIDNLCGQIKSKFLFWISQIDG 153
G R +Q + A+VPI+ I N + D+SI+N + L ++D
Sbjct: 190 KEAPGVGLRVVQVIPSAKVPIVVLSWIGPNGNVQIVDVSINNQLPLHNTALLRNYVEMDK 249
Query: 154 RFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQT-CVPAILPPLK 204
R + + L VK WAK I++ K G +SYS +LL ++ Q AILP L+
Sbjct: 250 RVQILALCVKRWAKLCGISDAKQGNLSSYSWTLLCIYFLQVRSKGAILPSLQ 301
>gi|325181595|emb|CCA16045.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
gi|325191995|emb|CCA26462.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
Length = 494
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 117/302 (38%), Gaps = 54/302 (17%)
Query: 4 YNVLEPILKDILGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSR 63
Y L + D + ++P E+ + R +I+ ++ +V ++ A VE FGS + +F
Sbjct: 123 YLCLHEEILDFVHFISPHDEELQARENLIAQMKNLVSNLWP--RAAVETFGSHETQMFLP 180
Query: 64 WGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETI 123
D+D+ I G + L L L + L+ + AR+PI+KF
Sbjct: 181 QSDIDLVI----------FGAPTGKESLFVLAAELEARDMVSYLEVIDKARIPIVKFVDK 230
Query: 124 HQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYS 183
+ I DIS + G + + +I FR +VL++K + ++N G S+
Sbjct: 231 NSAIQVDISFNISSGLATADLIKQYMRIFPSFRPLVLVLKYFLAQRELNETFQGGIGSFL 290
Query: 184 LSLLVLFHFQTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIARFSSDKYR 243
L L+V+ Q + G L DD +
Sbjct: 291 LQLMVVSFLQQYRRQL---------GTLYDDFR--------------------------- 314
Query: 244 KINRSSLAHLFVSFLEKFSGLSLKASELGICPFTGQWEHIRSNTRWLPNNHP--LFIEDP 301
++L L V F + G ++GI G + + N WL +N P L +E+P
Sbjct: 315 ---YNNLGKLLVEFFTLY-GREFNYEQVGISVQKGGFYFNKENRDWLDHNRPFLLSVENP 370
Query: 302 FE 303
E
Sbjct: 371 NE 372
>gi|302405651|ref|XP_003000662.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
gi|261360619|gb|EEY23047.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
Length = 723
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 84/187 (44%), Gaps = 10/187 (5%)
Query: 12 KDILGMLNPLR-EDWETRMK--VISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLD 68
K+I+ +R D+E RM+ +I +R+ + RG V PFGS++S L+ D+D
Sbjct: 409 KEIVDFYEHVRPRDFEQRMRGELIERIRDSLRRNPKYRGCEVHPFGSYMSGLYLPTADMD 468
Query: 69 ISI---ELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQ 125
I I E +G + G G L + + ++ +A ARVP++K+
Sbjct: 469 IVICSKEWLSGRMTAFPGGSSLYKFRGFLTQ--NRLADPSSVEVIAKARVPLVKYIDAVT 526
Query: 126 NISCDISIDNLCGQIKSK-FLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSL 184
+ DIS D + G K FL W Q +V ++K + +N P G S+S
Sbjct: 527 GLRVDISFDRMDGPAAIKTFLDWKEQYPA-LPILVTIIKHFLAMRGLNEPVNGGIGSFSS 585
Query: 185 SLLVLFH 191
LV H
Sbjct: 586 KNLVPEH 592
>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 876
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/132 (32%), Positives = 68/132 (51%), Gaps = 14/132 (10%)
Query: 15 LGMLNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLD---ISI 71
+ + P E+ + R +I +R+ V S A VEPFGSF + L+ GD+D IS
Sbjct: 80 VAYIQPTHEEHQLRQMIIQMIRKTVHS--RWPDADVEPFGSFGTKLYLPAGDIDLVIIST 137
Query: 72 ELSNGSCISSAGKKVKQSLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCDI 131
++ N + K +L L +R+ + + +A A+VPI+KF+TI NI+ DI
Sbjct: 138 QMMN---------EQKSRILYKLAPLIRENNIGQDVVVIAKAKVPIIKFKTIFGNINVDI 188
Query: 132 SIDNLCGQIKSK 143
SI+ G + K
Sbjct: 189 SINQTNGIVAMK 200
>gi|159123133|gb|EDP48253.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
A1163]
Length = 703
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 101/220 (45%), Gaps = 24/220 (10%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSN-- 75
+ P++ + R +I+ L+ +S G + PFGSF S L+ D+D+ + SN
Sbjct: 273 VKPMQYEQIVRADLITRLQVAFQS--RYYGVQLRPFGSFASGLYLPTADIDLVLLSSNFM 330
Query: 76 GSCISSAGKKVKQ-----SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ I + G++ Q + L +L A+ ++ +AHARVPILKF + D
Sbjct: 331 RNGIKTFGERKGQIYAFAAFLKNLEIAVPNS-----IETIAHARVPILKFVDKMTGLRVD 385
Query: 131 ISIDNLCGQI-KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV- 188
+S DN G I + F W S+ +V +VK++ +N TG +S++ LV
Sbjct: 386 LSFDNDSGLIANNTFQNWKSEYPA-MPVIVAVVKQFLLLRGLNEVPTGGLGGFSITCLVT 444
Query: 189 -----LFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRAN 221
L H T + +IL + Y N + G+R N
Sbjct: 445 SLLQHLPHGHTAPNLGSILMDFFEFYGNNFDFENVGIRLN 484
>gi|70987233|ref|XP_749095.1| topoisomerase family protein TRF4 [Aspergillus fumigatus Af293]
gi|66846725|gb|EAL87057.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
Af293]
Length = 702
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 101/220 (45%), Gaps = 24/220 (10%)
Query: 18 LNPLREDWETRMKVISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLDISIELSN-- 75
+ P++ + R +I+ L+ +S G + PFGSF S L+ D+D+ + SN
Sbjct: 272 VKPMQYEQIVRADLITRLQVAFQS--RYYGVQLRPFGSFASGLYLPTADIDLVLLSSNFM 329
Query: 76 GSCISSAGKKVKQ-----SLLGDLLRALRQKGGYRRLQFVAHARVPILKFETIHQNISCD 130
+ I + G++ Q + L +L A+ ++ +AHARVPILKF + D
Sbjct: 330 RNGIKTFGERKGQIYAFAAFLKNLEIAVPNS-----IETIAHARVPILKFVDKMTGLRVD 384
Query: 131 ISIDNLCGQI-KSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTGTFNSYSLSLLV- 188
+S DN G I + F W S+ +V +VK++ +N TG +S++ LV
Sbjct: 385 LSFDNDSGLIANNTFQNWKSEYPA-MPVIVAVVKQFLLLRGLNEVPTGGLGGFSITCLVT 443
Query: 189 -----LFHFQTC--VPAILPPLKDIYPGNLVDDLKGVRAN 221
L H T + +IL + Y N + G+R N
Sbjct: 444 SLLQHLPHGHTAPNLGSILMDFFEFYGNNFDFENVGIRLN 483
>gi|451996743|gb|EMD89209.1| hypothetical protein COCHEDRAFT_1196132 [Cochliobolus
heterostrophus C5]
Length = 718
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 92/239 (38%), Gaps = 37/239 (15%)
Query: 118 LKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDMVLLVKEWAKAHDINNPKTG 177
L F I CDI+ N G + L S D R R +VL VK WAK +N+ +G
Sbjct: 440 LDFPKDGCGIQCDINFANPLGIHNTHMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSAYSG 499
Query: 178 TFNSYSLSLLVLFHF-QTCVPAILPPLKDIYPGNLVDDLKGVRANAERQIAEICAFNIAR 236
T +SY L+VL + P + P L+ P L D + + +I + + R
Sbjct: 500 TLSSYGWVLMVLHYLVNVASPPVCPNLQHAVP--LPTDAAALEQYFKS--TKISGYEV-R 554
Query: 237 FSSDKYRKI----------NRSSLAHLFVSFLEKFSG------------------LSLKA 268
F ++ I N S+ L F + F+ LSL+
Sbjct: 555 FWRNEEEIIKAAQEGRLTQNTQSIGALLRGFFQYFAALSGYGYPRPPQFHWTNEVLSLRT 614
Query: 269 SELGICPFTGQWEHIRSN---TRWLPNNHPLFIEDPFEQPENSARAVSEKNLAKISNAF 324
+ + W + + + N + IEDPFE N AR V+ + + I + F
Sbjct: 615 PGGIVSKQSKGWVSATTKITAEKKVTNRYLFAIEDPFETDHNVARTVTHRGIVAIRDEF 673
>gi|389642869|ref|XP_003719067.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|351641620|gb|EHA49483.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|440474598|gb|ELQ43333.1| DNA polymerase sigma [Magnaporthe oryzae Y34]
gi|440486580|gb|ELQ66430.1| DNA polymerase sigma [Magnaporthe oryzae P131]
Length = 703
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 93/196 (47%), Gaps = 25/196 (12%)
Query: 12 KDILGMLNPLR-EDWETRMK--VISDLREVVESVESLRGATVEPFGSFVSNLFSRWGDLD 68
K+I+ N + D+E +++ +I +L +++ + + R ATV PFGSF SNL+ GD+D
Sbjct: 376 KEIVDFYNYAKPRDFEEKLRQGLIDELAKLIRNSQ-FRDATVYPFGSFKSNLYLPTGDMD 434
Query: 69 ISIELSNGSCISS--AGKKVKQSLLGDLLR-----ALRQKGGYRRLQFVAHARVPILKFE 121
+ C S +G+ + S + + +Q ++ ++ ARVP++K+
Sbjct: 435 LVF------CSDSYMSGRAARYSSKNHVFKFGAFIERKQLAVDNHVEKISKARVPLVKYV 488
Query: 122 TIHQNISCDISIDNLCG-QIKSKFLFWISQIDGRFRDMVLLV---KEWAKAHDINNPKTG 177
+ D+S +N+ G + FL W Q F DM +LV K + +N P G
Sbjct: 489 DSRTGLKVDVSFENITGIRAIETFLAWREQ----FPDMPVLVTCIKHFLAMRGLNEPANG 544
Query: 178 TFNSYSLSLLVLFHFQ 193
++ LV+ Q
Sbjct: 545 GIGGTTVICLVVSMLQ 560
>gi|195498865|ref|XP_002096708.1| GE25820 [Drosophila yakuba]
gi|194182809|gb|EDW96420.1| GE25820 [Drosophila yakuba]
Length = 559
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 86/165 (52%), Gaps = 17/165 (10%)
Query: 43 ESLRGATVEPFGSFVSNLFSRWGDLDISIELSNGSCISSAGKKVKQSLLGDLLRALRQ-- 100
+SLR V FGS ++ + +R DLD+ +++ G+ + + + + L RA+R+
Sbjct: 258 QSLR---VYKFGSRITGIGNRSSDLDVFVDI--GNTFHTFEHRASNATIAKL-RAMRKFF 311
Query: 101 --KGGYRRLQFVAHARVPILKFETIHQNISCDISIDNLCGQIKSKFLFWISQIDGRFRDM 158
+R + F+ ARVPI+K + I CDI +++L G + L +I + + M
Sbjct: 312 CVSNDWRLINFIEQARVPIIKTCHLPTGIECDICLNSL-GFCNTNLLKYIFESQPLTQYM 370
Query: 159 VLLVKEWAKAHDINNPKTGTFNSYSLSLLVLFHFQTCVPAILPPL 203
+ VK W + + T ++YS++L+V++ Q + +LPP+
Sbjct: 371 CIYVKNWLERCKL----TEQISTYSITLMVIYFLQ--LQNLLPPI 409
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.134 0.397
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,716,394,562
Number of Sequences: 23463169
Number of extensions: 259644304
Number of successful extensions: 748740
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1034
Number of HSP's successfully gapped in prelim test: 894
Number of HSP's that attempted gapping in prelim test: 744417
Number of HSP's gapped (non-prelim): 3347
length of query: 445
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 299
effective length of database: 8,933,572,693
effective search space: 2671138235207
effective search space used: 2671138235207
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)