BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000465
(1478 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224129926|ref|XP_002320705.1| predicted protein [Populus trichocarpa]
gi|222861478|gb|EEE99020.1| predicted protein [Populus trichocarpa]
Length = 1566
Score = 1729 bits (4479), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 925/1495 (61%), Positives = 1097/1495 (73%), Gaps = 71/1495 (4%)
Query: 1 MSKSNPRPNLSQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSIL 60
MS NP N S++ + N+LM QN LI SLTSHISLYHS S + + RSSIL
Sbjct: 1 MSVPNPNNNFSKH---QNIPNALMTQNHLIGSLTSHISLYHSQSNPPSSPNPNP-RSSIL 56
Query: 61 KWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCY 120
KWF SL+VHQRQ+HLT VD KF Q+L+QML KL ++GH FIILPDL SRD LP LC+
Sbjct: 57 KWFKSLSVHQRQSHLTTVDFKFTQILLQMLAKLHSHGHCRFIILPDLLSRD---LPSLCF 113
Query: 121 KKSRGLLSRVAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDR 180
KKSRGLLSR+AESNES R +FESTRLFSSREGEK+++ LD+ TVS + +ENV++
Sbjct: 114 KKSRGLLSRIAESNESERLIFESTRLFSSREGEKVDDCRSGAEGLDSVTVSEDLIENVEK 173
Query: 181 FIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNN 240
F+++MD+ISNGGFLRGEESEL DWVE +WLK +GYY IEAF+ N+LEV LRLAWLNC N
Sbjct: 174 FVELMDDISNGGFLRGEESELGTDWVELEWLKVRGYYCIEAFLANKLEVALRLAWLNCGN 233
Query: 241 GKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVL 300
GKKRGVKLKEKL+AAG+AANV+WR+KGCVDWW NLD +RRKVL LGKAAKSLT E+L
Sbjct: 234 GKKRGVKLKEKLSAAGVAANVFWRRKGCVDWWRNLDAEVRRKVLNFALGKAAKSLTREIL 293
Query: 301 KEASNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLA 360
K+ S D + LF AG+++ R H++S QR L D E GLA P S SG AS A
Sbjct: 294 KDVSGVSGDELSLFRAGVQRPWRDLHAESRQRIFLKLPADAEFGLAPKP-SFSGKDASFA 352
Query: 361 TVFSGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCT 420
+F+ LFVLQDI ++VL Q +EYD IFFS L + T +DC+LRKLRGL+MV+SLDCT
Sbjct: 353 NIFNSLFVLQDIVSLVLPDQGSEYDTSHIFFSMLGSLGTLSDCILRKLRGLVMVISLDCT 412
Query: 421 KLELFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKR-QNPLPKSALDELSLDKLPKDPE 479
+LEL GEG SS NK EK RRKK + + K+ NP P ++DE S KL +D +
Sbjct: 413 RLELLGEGTSNSSANKPSEKLGAGSRRKKGKTQNMKKLMNPTPVKSVDESSFKKLAEDIK 472
Query: 480 GALTDTEKVDLMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKT 539
A +K +LM S+++PGI + + +R+ S+ +EM EH + LV K RT RK
Sbjct: 473 CAPACIKKTELMESNEMPGIPHENENHRDISSPTVEM-----EHTQGLVHEKKRTAGRKN 527
Query: 540 KTVKNKNKNCTYNNPVPVKDPKVSVLETSS-SISLQDEVEKYDKLSAQNVSVDNSTCSNV 598
+ +NK K +++NPV V+ P+++V E S S+ DE K +LS DN T
Sbjct: 528 RKGRNKKKKSSFSNPVEVRKPEIAVSEAPSFSVCSSDEEAKLCRLS------DNLTTQK- 580
Query: 599 LASNQSSCT-SASVPAREGIATQSTQEDCVVNSVNS------ECRRFSNGRIDNQTQHFL 651
ASN S S + P R+ I ED V E R SNG +DN++
Sbjct: 581 -ASNDSLIDPSINEPTRKEIDALGIPEDHAVGCTEGISDAGLEHYRSSNGFVDNKSMPSR 639
Query: 652 QETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEI 711
+ET V NII ++L N G +F N E + + +K + E+
Sbjct: 640 RETR-CGVGQNIIYQVATTKELITVSSNE--GTSFLNKKTEVKLD----VGNKLVRTHEV 692
Query: 712 KKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHN 771
K+ + + + +E+F + K +C SYEWP++ PVYFPSI+SHL PAT RLHLDVGHN
Sbjct: 693 KEVPTLNRREESENFHESGSKGLSDCLSYEWPSLGPVYFPSINSHLPPATYRLHLDVGHN 752
Query: 772 WHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSG 831
WHNH+ QPF+PT+HQARN P +GG N++LSQPLPMSLDWPPMV++ G+AP++TCNYDSG
Sbjct: 753 WHNHIHQPFLPTVHQARNSPIEGGSNRMLSQPLPMSLDWPPMVRSNCGLAPTMTCNYDSG 812
Query: 832 FISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEE 891
FIS QS FQ+++ K MQ+ +KT DDE +CSGD +D E T++QE DE ++HW+SEEE
Sbjct: 813 FISRWQSTFQKSYTAKNMQYISKTFDDERRCSGDAIDFTEATSSQELMDEYENHWISEEE 872
Query: 892 LEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMV 951
EVH VSGIDYNQ+FGGGVMYW+ SDHPGTGFSRPPSLSSDDS W WHEA++ RAVDDMV
Sbjct: 873 YEVHAVSGIDYNQHFGGGVMYWDPSDHPGTGFSRPPSLSSDDSGWPWHEAELNRAVDDMV 932
Query: 952 AFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEE 1011
AFSSSYST GLTSPTAASFCS FDPL PGHQA YV+ GNEVPGK + SST TD A EE
Sbjct: 933 AFSSSYSTTGLTSPTAASFCSAFDPLVPGHQALGYVMSGNEVPGKAM-LSSTVTDAAAEE 991
Query: 1012 EISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRRE 1071
++SGS ASLS DV+ KA D+LP PILRPIIIPN+SRERSRSDFKRS +HKSPCVPP+RRE
Sbjct: 992 DVSGSLASLSSDVEGKAGDSLPYPILRPIIIPNMSRERSRSDFKRSLDHKSPCVPPTRRE 1051
Query: 1072 QPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTT 1131
PRIKRPPSPVVLCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPR WGVRGWYH+GT
Sbjct: 1052 HPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRQWGVRGWYHDGTN 1111
Query: 1132 SEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEH------- 1184
EE C RMDG+EVVWPSWRNK LS HPM+QPL GALLQD LIA+S LARDQ+H
Sbjct: 1112 LEEACGRMDGAEVVWPSWRNKKLSTHPMVQPLPGALLQDRLIAMSHLARDQDHVSVLLYC 1171
Query: 1185 --PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRV 1242
PDV FPLQ E+QNCPTRKASL L+ SLLH+EIDSFCKQVAA N ARKP+INWAVKRV
Sbjct: 1172 AIPDVLFPLQRAEIQNCPTRKASLCLVQSLLHDEIDSFCKQVAAANMARKPFINWAVKRV 1231
Query: 1243 TRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1302
TRSLQVLWPRSR NIFGS+ATGL+LP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET
Sbjct: 1232 TRSLQVLWPRSRINIFGSSATGLALPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKET 1291
Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
CLQ IP+IMLVVEVP DLI SAAS+VQSPKE+ H T
Sbjct: 1292 CLQ------------------------IPVIMLVVEVPTDLITSAASNVQSPKEEPIHLT 1327
Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
+HD V S+MV L+DS SPKC+ + D+ + S+RLDISFKSPSHTGLQTT LVK+LT
Sbjct: 1328 GEHDIQVQSNMVVLEDSISPKCTQLNCDSKRDVKSIRLDISFKSPSHTGLQTTQLVKDLT 1387
Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
EQFPA+TPLALVLKQFLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1388 EQFPAATPLALVLKQFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1442
>gi|147819078|emb|CAN65347.1| hypothetical protein VITISV_000637 [Vitis vinifera]
Length = 1500
Score = 1609 bits (4166), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 882/1474 (59%), Positives = 1042/1474 (70%), Gaps = 134/1474 (9%)
Query: 11 SQNPLAEYFTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQ 70
S NP F +S NQL+DSLT+HISLYH+ S SS+PNP+ NPRSSILKWF+SLTV Q
Sbjct: 45 SSNPPGNLFMDS----NQLVDSLTAHISLYHNRSPSSSPNPNPNPRSSILKWFSSLTVQQ 100
Query: 71 RQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRV 130
RQ++++ VDS F Q+L+QM KL +GHGFFIILPDLPSRD P+LP LC++KSRGLL+RV
Sbjct: 101 RQSYISAVDSNFVQILLQMQFKLYTHGHGFFIILPDLPSRDRPHLPSLCFRKSRGLLARV 160
Query: 131 AESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISN 190
+ESN+ R + +S RLF S+EGE++E+ SC + LD+ TV EFV NVDRF+ MD +SN
Sbjct: 161 SESNDLERLINDSVRLFGSKEGERVEDCSCSASFLDSLTVCEEFVSNVDRFVAAMDSVSN 220
Query: 191 GGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC-NNGKKRGVKLK 249
GGFLRGEES L DWVE +WLKAKGYYSIE+F+ NRLEV LRLAW NC NNGKKRGVKLK
Sbjct: 221 GGFLRGEESGLGSDWVELEWLKAKGYYSIESFVANRLEVALRLAWFNCGNNGKKRGVKLK 280
Query: 250 EKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALED 309
EK+N AG+AANV+WRKKGC+DWW NLD AMRRK++ V+LGKAAKSLT E+LK A +ALED
Sbjct: 281 EKVNVAGIAANVFWRKKGCIDWWQNLDCAMRRKMIIVVLGKAAKSLTDEILKGAYSALED 340
Query: 310 GMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVL 369
WLFNAG Q ++ ++ S QRT LS D E G + P+S+SG
Sbjct: 341 EKWLFNAGGGQPVKYKYTASSQRTDQALSDDAEAGSIMIPSSVSG-------------KT 387
Query: 370 QDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGN 429
QDI ++L+ QH+EYD +KIFFS+L +ST +DC+ RKLRGLLMVV LD TKLEL GEGN
Sbjct: 388 QDILNIILTCQHSEYDRDKIFFSTLGSISTISDCIFRKLRGLLMVVWLDFTKLELLGEGN 447
Query: 430 FKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVD 489
KS PNKSKEK T R+K+ + + K+ NP+P+S D+ K KD L + VD
Sbjct: 448 LKSPPNKSKEKLGTGXRKKRGKTRNMKKLNPVPRSCGDBSKSLKPLKDHGCRLAYAKCVD 507
Query: 490 LMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNC 549
+ S+++ G D+ E S+S +EM + +GK + ARK++ +NKN+
Sbjct: 508 FVESNRMAGELQQSDLRMEASSSVVEM-------ENDMFSGKVQNAARKSRKERNKNRIY 560
Query: 550 TYNNPVPVKDPKVSVLETSS-SISLQDEVEKY----DKLSAQNVSVDNSTCSNVLASNQS 604
+ +PV V+D + E S+ S+ Q E K D ++NV D S + S S
Sbjct: 561 SLKDPVEVRDLETITTEPSAPSVISQSEPSKSNWKSDSSVSENVPNDASIGCDKFIS--S 618
Query: 605 SCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNII 664
C + P+R QS +ED VV+S+
Sbjct: 619 PCKPTNGPSRAETTAQSIREDPVVSSI--------------------------------- 645
Query: 665 SPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNE 724
++D AF S I FQNS H SET V DK I+A E+++E Q+Q
Sbjct: 646 -------EVDVAF--SGEDIKFQNSEHLSETDTKCV-SDKPIKATELEEEIVQNQEQERG 695
Query: 725 SFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPA-TDRLHLDVGHNWHNHVRQPFVPT 783
F T SS ECPSYEWPT+AP++F SI+S LPA TDRLHLDVG NWHNH Q FVP+
Sbjct: 696 KFCNTGSTSSSECPSYEWPTVAPIHFTSINSQHLPAATDRLHLDVGRNWHNHFHQSFVPS 755
Query: 784 LHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQN 843
+HQ RN P D GC+QILS+PLPMSLDWPPMV+++S +APS+TCNYD GFIS QS F+Q
Sbjct: 756 IHQTRNPPLDAGCSQILSRPLPMSLDWPPMVRSISRLAPSMTCNYDPGFISRMQSSFRQG 815
Query: 844 FATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYN 903
F +Q N TS+DE K SGD MDL + T QE DECDSHW+SEEE E+H VSG+DY+
Sbjct: 816 FPAHNVQVNTATSEDERKYSGDLMDLSDLTNVQELADECDSHWISEEEFELHAVSGLDYS 875
Query: 904 QYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLT 963
QYFGGGVMYWN+SDHPG+GFSRPPSLSSDDSSWAWHEAD+ RAVDDMVAFSSSYSTNGL
Sbjct: 876 QYFGGGVMYWNSSDHPGSGFSRPPSLSSDDSSWAWHEADMNRAVDDMVAFSSSYSTNGLA 935
Query: 964 SPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGD 1023
SPTAASFCSPFDPLG GHQ YV+ GNE PGKVLHSSS + D EE++SGS A+L D
Sbjct: 936 SPTAASFCSPFDPLGAGHQPLGYVISGNEGPGKVLHSSSASADAMPEEKVSGSLANLPVD 995
Query: 1024 VDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVV 1083
V+ K D LP +L PIIIPN+SRERSRS+FKR+ + KSPCVPP+RREQPRIKRPPSPVV
Sbjct: 996 VEGKTGDPLPYSLLPPIIIPNMSRERSRSEFKRNFDRKSPCVPPARREQPRIKRPPSPVV 1055
Query: 1084 LCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSE 1143
LCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPRHWG+RGWYH+G+ EE CV +DG+E
Sbjct: 1056 LCVPRAPRPPPPSPVSDSRKNRGFPTVRSGSSSPRHWGMRGWYHDGSNLEEACVCIDGAE 1115
Query: 1144 VVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRK 1203
VVWPSWRNKNLS PMIQPL GALLQD LIAISQLARDQEHPDVAFPLQP ++ +C RK
Sbjct: 1116 VVWPSWRNKNLSTRPMIQPLPGALLQDRLIAISQLARDQEHPDVAFPLQPPDLLSCSMRK 1175
Query: 1204 ASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT 1263
+LS+MHSLLHEEIDSF K+VAAEN RKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT
Sbjct: 1176 TALSMMHSLLHEEIDSFWKKVAAENMIRKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT 1235
Query: 1264 GLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLK 1323
GLSLP+SDVDLV+CLPPVRNLEPIKEAGILEGRNGIKETCLQ
Sbjct: 1236 GLSLPTSDVDLVICLPPVRNLEPIKEAGILEGRNGIKETCLQ------------------ 1277
Query: 1324 TVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPK 1383
IPIIMLVVEVP DL SAA ++Q+ KE+ +H+ SD
Sbjct: 1278 ------IPIIMLVVEVPPDLTTSAAPNLQTSKEEPTPMPGGQGSHI-SDRNGW------- 1323
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
F+ T VKELTEQFPA+TPLALVLKQFLADRS
Sbjct: 1324 --------------------FRELCFTK------VKELTEQFPAATPLALVLKQFLADRS 1357
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
LDQSYSGGLSSYCL+LLITRFLQHEHHLGRPINQ
Sbjct: 1358 LDQSYSGGLSSYCLVLLITRFLQHEHHLGRPINQ 1391
>gi|297734532|emb|CBI16583.3| unnamed protein product [Vitis vinifera]
Length = 1331
Score = 1547 bits (4006), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 822/1301 (63%), Positives = 960/1301 (73%), Gaps = 87/1301 (6%)
Query: 185 MDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC-NNGKK 243
MD +SNGGFLRGEES L DWVE +WLKAKGYYSIE+F+ NRLEV LRLAW NC NNGKK
Sbjct: 1 MDSVSNGGFLRGEESGLGSDWVELEWLKAKGYYSIESFVANRLEVALRLAWFNCGNNGKK 60
Query: 244 RGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEA 303
RGVKLKEK+N AG+AANV+WRKKGC+DWW NLD AMRRK++ V+LGKAAKSLT E+LK A
Sbjct: 61 RGVKLKEKVNVAGIAANVFWRKKGCIDWWQNLDCAMRRKMIIVVLGKAAKSLTDEILKGA 120
Query: 304 SNALEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVF 363
+ALED WLFNAG Q ++ ++ S QRT LS D E G + P+S+SG P S
Sbjct: 121 YSALEDEKWLFNAGGGQPVKYKYTASSQRTDQALSDDAEAGSIMIPSSVSGKPKSFFNFS 180
Query: 364 SGLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLE 423
+GLFV+QDI ++L+ QH+EYD +KIFFS+L +ST +DC+ RKLRGLLMVV LD TKLE
Sbjct: 181 NGLFVVQDILNIILTCQHSEYDRDKIFFSTLGSISTISDCIFRKLRGLLMVVWLDFTKLE 240
Query: 424 LFGEGNFKSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALT 483
L GEGN KS PNKSKEK T GR+K+ R + K+ NP+P+S D+ K KD L
Sbjct: 241 LLGEGNLKSPPNKSKEKLGTGGRKKRGRTRNMKKLNPVPRSCGDDSKSLKPLKDHGCGLA 300
Query: 484 DTEKVDLMGSDKVPGISNGKDINRETSTS-EMEMVVCHQEHARALVAGKGRTNARKTKTV 542
+ VD + S+++ G D++ E S+S E +M +GK + ARK++
Sbjct: 301 YAKCVDFVESNRMAGELQQSDLHMEASSSVENDMF-----------SGKVQNAARKSRKE 349
Query: 543 KNKNKNCTYNNPVPVKDPKVSVLETSS-SISLQDEVEKY----DKLSAQNVSVDNSTCSN 597
+NKN+ + +PV V+D + E S+ S+ Q E K D ++NV D S +
Sbjct: 350 RNKNRIYSLKDPVEVRDLETITTEPSAPSVISQSEPSKSNWKSDSSVSENVPNDASIGCD 409
Query: 598 VLASNQSSCTSASVPAREGIATQSTQEDCVVNSVNSECRRFSNGRIDNQTQHFLQETTDS 657
S S C + P+R QS +ED VV+S+
Sbjct: 410 KFIS--SPCKPTNGPSRAETTAQSIREDPVVSSI-------------------------- 441
Query: 658 KVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAV 717
++D AF S I FQNS H SET V DK I+A E+++E
Sbjct: 442 --------------EVDVAF--SGEDIKFQNSEHLSETDTKCV-SDKPIKATELEEEIVQ 484
Query: 718 TQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPA-TDRLHLDVGHNWHNHV 776
Q+Q F T SS ECPSYEWPT+AP++F SI+S LPA TDRLHLDVG NWHNH
Sbjct: 485 NQEQERGKFCNTGSTSSSECPSYEWPTVAPIHFTSINSQHLPAATDRLHLDVGRNWHNHF 544
Query: 777 RQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSR 836
Q FVP++HQ RN D GC+QILS+PLPMSLDWPPMV+++S +APS+TCNYD GFIS
Sbjct: 545 HQSFVPSIHQTRNPSLDAGCSQILSRPLPMSLDWPPMVRSISRLAPSMTCNYDPGFISRM 604
Query: 837 QSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHT 896
Q K SGD MDL + T QE DECDSHW+SEEE E+H
Sbjct: 605 Q-----------------------KYSGDLMDLSDLTNVQELADECDSHWISEEEFELHA 641
Query: 897 VSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSS 956
VSG+DY+QYFGGGVMYWN+SDHPG+GFSRPPSLSSDDSSWAWHEAD+ RAVDDMVAFSSS
Sbjct: 642 VSGLDYSQYFGGGVMYWNSSDHPGSGFSRPPSLSSDDSSWAWHEADMNRAVDDMVAFSSS 701
Query: 957 YSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGS 1016
YSTNGL SPTAASFCSPFDPLG GHQ YV+ GNE PGKVLHSSS + D EE++SGS
Sbjct: 702 YSTNGLASPTAASFCSPFDPLGAGHQPLGYVISGNEGPGKVLHSSSASADAMPEEKVSGS 761
Query: 1017 FASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIK 1076
A+L DV+ K D LP +L PIIIPN+SRERSRS+FKR+ + KSPCVPP+RREQPRIK
Sbjct: 762 LANLPVDVEGKTGDPLPYSLLPPIIIPNMSRERSRSEFKRNFDRKSPCVPPARREQPRIK 821
Query: 1077 RPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGC 1136
RPPSPVVLCVPRAPRPPPPSPVSDSRK RGFPTVRSGSSSPRHWG+RGWYH+G+ EE C
Sbjct: 822 RPPSPVVLCVPRAPRPPPPSPVSDSRKNRGFPTVRSGSSSPRHWGMRGWYHDGSNLEEAC 881
Query: 1137 VRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEV 1196
V +DG+EVVWPSWRNKNLS PMIQPL GALLQD LIAISQLARDQEHPDVAFPLQP ++
Sbjct: 882 VCIDGAEVVWPSWRNKNLSTRPMIQPLPGALLQDRLIAISQLARDQEHPDVAFPLQPPDL 941
Query: 1197 QNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
+C RK +LS+MHSLLHEEIDSF K+VAAEN RKPYINWAVKRVTRSLQVLWPRSRTN
Sbjct: 942 LSCSMRKTALSMMHSLLHEEIDSFWKKVAAENMIRKPYINWAVKRVTRSLQVLWPRSRTN 1001
Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
IFGSNATGLSLP+SDVDLV+CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW
Sbjct: 1002 IFGSNATGLSLPTSDVDLVICLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1061
Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
VK+DSLKTVENTAIPIIMLVVEVP DL SAA ++Q+ KE+ +H+ ++M L
Sbjct: 1062 VKNDSLKTVENTAIPIIMLVVEVPPDLTTSAAPNLQTSKEEPTPMPGGQGSHIQTEMGGL 1121
Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
++SASPKC+ + DN K + SVR+DISFKSPSHTGLQTT+LVKELTEQFPA+TPLALVLK
Sbjct: 1122 ENSASPKCAQINYDNSKDSKSVRIDISFKSPSHTGLQTTELVKELTEQFPAATPLALVLK 1181
Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
QFLADRSLDQSYSGGLSSYCL+LLITRFLQHEHHLGRPINQ
Sbjct: 1182 QFLADRSLDQSYSGGLSSYCLVLLITRFLQHEHHLGRPINQ 1222
>gi|449432996|ref|XP_004134284.1| PREDICTED: uncharacterized protein LOC101221970 [Cucumis sativus]
Length = 1526
Score = 1529 bits (3958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 833/1484 (56%), Positives = 1027/1484 (69%), Gaps = 97/1484 (6%)
Query: 24 MAQNQLIDSLTSHISLYHSHSLSSNPNPSSN--PRSSILKWFASLTVHQRQAHLTIVDSK 81
MAQNQLIDSLTSHISLYHS SL NP+ +SN PRSSILKWF+SL+VHQRQAHLT+VD K
Sbjct: 1 MAQNQLIDSLTSHISLYHSTSLPLNPDTNSNLNPRSSILKWFSSLSVHQRQAHLTVVDFK 60
Query: 82 FAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVF 141
F Q+LIQM+ ++R GHGFFIILPD+ S DP +LP LC+KKSRGLLSRV++SNES R +F
Sbjct: 61 FVQILIQMVAEVRKRGHGFFIILPDILSTDPLHLPSLCFKKSRGLLSRVSQSNESQRMIF 120
Query: 142 ESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESEL 201
ESTRLF SREG+K+EE SC + +D+ TVS EFV NVD+F++ MD +SNG FLRGE +L
Sbjct: 121 ESTRLFGSREGDKLEECSCSLKNIDSITVSEEFVSNVDKFVEAMDGVSNGAFLRGEGGDL 180
Query: 202 AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANV 261
A +W E +WLKAKGYYS+EAF+ N+LEV LRL+W+N NNGKKR VK KEK A GMA NV
Sbjct: 181 ASNWAELNWLKAKGYYSMEAFVANKLEVALRLSWMNLNNGKKRSVKFKEKATATGMATNV 240
Query: 262 YWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKS-LTHEVLKEASNALEDGMWLFNAGMKQ 320
+WRKKGCVDWW LD + R+ +LT ILGK+AK+ LTHE+L+ S E M LF+A +
Sbjct: 241 FWRKKGCVDWWDKLDYSSRKNILTAILGKSAKNLLTHEILRWTSGLAEHEMGLFSAEWNR 300
Query: 321 SSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSSQ 380
R+ + S R++ T D+ I P + SG P L+ +F L VLQDI TMV S
Sbjct: 301 PFRYNCTTSPPRSMLTSQADLHIDFNIIPDTHSGKPYLLSNIFRNLLVLQDIVTMVSSCL 360
Query: 381 HNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEK 440
H+EY +F+S+L + DC+LRKLR LM +SLDCTK EL GEGN KS P+KS+E+
Sbjct: 361 HDEYYKCNLFYSTLGSICAIPDCILRKLREFLMFISLDCTKFELLGEGNGKSFPSKSREQ 420
Query: 441 PSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPE----------GALTDTEKVDL 490
RRKK + S K QNP ++ +D+LS + K E +TD+ + +
Sbjct: 421 VGASSRRKKGK--SRKSQNPALRACVDDLSSNNFTKRQEFDKECGHRGREVMTDSTTMSI 478
Query: 491 MGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNAR--KTKTVKNKNKN 548
M S G + RE + V H + + GK + R K KN N
Sbjct: 479 M--------SKGNETCREIPADVSKTV-----HDQKMSVGKDQGTVRKKKKHKSKNSGGN 525
Query: 549 CTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLS---------AQNVSVDNSTCSNVL 599
P P V +S S S QD+V + DK S +N S +N S ++
Sbjct: 526 SRLVEIRPSVGPAVKF--SSPSFSSQDQVAELDKDSIFIKPSISNIKNDSTNNFDSSTLI 583
Query: 600 ASNQSSCTSASVPAREGIATQSTQEDCV------VNSVNSECRRFSNGRIDNQTQHFLQE 653
S ++ P RE + + V V+ + +FS G I+NQ E
Sbjct: 584 P---SPLVLSNEPNREYESILKIEVHEVSGITKSVSQIGPGESQFSKGIIENQFLSSTLE 640
Query: 654 TTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKK 713
+ S ++C+ + +P+ +L N S +N ++S E G S L DK +++K+
Sbjct: 641 NSSSFMDCSAVPSHLPSLELKNIVK---SDVNVKSSVRTCEVGNKSSLLDKLPRTIDVKE 697
Query: 714 ESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWH 773
+S ++ Q + + LE YEW +A +Y PS +SHL PATDRLHLDVGHNWH
Sbjct: 698 KSCSSRHQFSGDTCNARTLNPLEHSPYEWHGVASLYIPSFNSHLPPATDRLHLDVGHNWH 757
Query: 774 NHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFI 833
NH R+ F P +HQ+RN G CN IL++PL MSLDWPP++++ SG+A ++T N+DSG +
Sbjct: 758 NHFRRSFTPAMHQSRNSSAKGSCNPILTRPLLMSLDWPPVLRSASGLASTMTSNHDSGKL 817
Query: 834 SSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELE 893
+ D P+ + Q+ DECD +W+SEEE+E
Sbjct: 818 T---------------------------------DFPDLSNNQDLADECDGNWISEEEME 844
Query: 894 VHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAF 953
+H VSGIDYNQYFGGGVMYWN SDH G GFSRPPSLSSDDSSWAW EAD+ R VDDMVAF
Sbjct: 845 MHAVSGIDYNQYFGGGVMYWNPSDHHGAGFSRPPSLSSDDSSWAWREADMNRTVDDMVAF 904
Query: 954 SSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEI 1013
SSSYS NGLTSPTA SFCS FDPLG G QA YVV G ++P +LHSS+T D TEE+
Sbjct: 905 SSSYS-NGLTSPTATSFCS-FDPLGSGKQALGYVVQGTDLPNNMLHSSTTMKDTVTEEDD 962
Query: 1014 SGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQP 1073
S +L DV+ KA D+ PILRPI+IP++SRERSRS+F ++HKSPC+PP+RREQ
Sbjct: 963 PRSLPNLPSDVEGKA-DSHSFPILRPIVIPSMSRERSRSEFCHGYDHKSPCIPPTRREQS 1021
Query: 1074 RIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSE 1133
R+KRPPSPVVLCVPRAP PPPPSPVSDSRK RGFPTVRSGSSSPRHWGV+GWY +GT E
Sbjct: 1022 RVKRPPSPVVLCVPRAPIPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVKGWYPDGTNLE 1081
Query: 1134 EGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQP 1193
E C+R+DG+EVVWP+WRNK+ S +QPLS LIA+ Q+A DQEHPDVAFPL P
Sbjct: 1082 EACLRIDGAEVVWPNWRNKSNSNCSRVQPLS-------LIAMPQIALDQEHPDVAFPLFP 1134
Query: 1194 LEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS 1253
+ +C +K SLSLMHS LH+EIDSFCK VAAEN A+KPYI WAVKRVTRSLQVLWPRS
Sbjct: 1135 PTI-SCSVKKESLSLMHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRVTRSLQVLWPRS 1193
Query: 1254 RTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLAN 1313
RTNIFGSNATGLSLP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL+N
Sbjct: 1194 RTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLSN 1253
Query: 1314 QEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDM 1373
QEWVKSDSLKTVENTAIPIIMLVVEVPH+L+ S+ S++QSPKE+++ + + D + +DM
Sbjct: 1254 QEWVKSDSLKTVENTAIPIIMLVVEVPHELVTSSTSNMQSPKEESSAVSGEQDANNLNDM 1313
Query: 1374 VALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLAL 1433
+L+DS PKC + D+ + SVR+DISFK+PSHTGLQT++LVKELTEQFPA+ PLAL
Sbjct: 1314 ASLEDSILPKCLEVNYDSSISTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLAL 1373
Query: 1434 VLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
VLK+FLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1374 VLKKFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1417
>gi|449478246|ref|XP_004155262.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101221970 [Cucumis
sativus]
Length = 1514
Score = 1527 bits (3953), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 834/1485 (56%), Positives = 1029/1485 (69%), Gaps = 98/1485 (6%)
Query: 24 MAQNQLIDSLTSHISLYHSHSLSSNPNPSSN--PRSSILKWFASLTVHQRQAHLTIVDSK 81
MAQNQLIDSLTSHISLYHS SL NP+ +SN PRSSILKWF+SL+VHQRQAHLT+VD K
Sbjct: 1 MAQNQLIDSLTSHISLYHSTSLPLNPDTNSNLNPRSSILKWFSSLSVHQRQAHLTVVDFK 60
Query: 82 FAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVF 141
F Q+LIQM+ ++R GHGFFIILPD+ S DP +LP LC+KKSRGLLSRV++SNES R +F
Sbjct: 61 FVQILIQMVAEVRKRGHGFFIILPDILSTDPLHLPSLCFKKSRGLLSRVSQSNESQRMIF 120
Query: 142 ESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESEL 201
ESTRLF SREG+K+EE SC + +D+ TVS EFV NVD+F++ MD +SNG FLRGE +L
Sbjct: 121 ESTRLFGSREGDKLEECSCSLKNIDSITVSEEFVSNVDKFVEAMDGVSNGAFLRGEGGDL 180
Query: 202 AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANV 261
A +W E +WLKAKGYYS+EAF+ N+LEV LRL+W+N NNGKKR VK KEK A GMA NV
Sbjct: 181 ASNWAELNWLKAKGYYSMEAFVANKLEVALRLSWMNLNNGKKRSVKFKEKATATGMATNV 240
Query: 262 YWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKS--LTHEVLKEASNALEDGMWLFNAGMK 319
+WRKKGCVDWW LD + R+ +LT ILGK+AK+ LTHE+L+ S E M LF+A
Sbjct: 241 FWRKKGCVDWWDKLDYSSRKNILTAILGKSAKNLILTHEILRWTSGLAEHEMGLFSAEWN 300
Query: 320 QSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVLSS 379
+ R+ + S R++ T D+ I P + SG P L+ +F L VLQDI TMV S
Sbjct: 301 RPFRYNCTTSPPRSMLTSQADLHIDFNIIPDTHSGKPYLLSNIFRNLLVLQDIVTMVSSC 360
Query: 380 QHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKE 439
H+EY +F+S+L + DC+LRKLR LM +SLDCTK EL GEGN KS P+KS+E
Sbjct: 361 LHDEYYKCNLFYSTLGSICAIPDCILRKLREFLMFISLDCTKFELLGEGNGKSFPSKSRE 420
Query: 440 KPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPE----------GALTDTEKVD 489
+ RRKK + S K QNP ++ +D+LS + K E +TD+ +
Sbjct: 421 QVGASSRRKKGK--SRKSQNPALRACVDDLSSNNFTKRQEFDKECGHRGREVMTDSTTMS 478
Query: 490 LMGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNK-- 547
+M S G + RE + V H + + GK + RK K K+KN
Sbjct: 479 IM--------SKGNETCREIPADVSKTV-----HDQKMSVGKDQGTVRKKKKHKSKNSGG 525
Query: 548 NCTYNNPVPVKDPKVSVLETSSSISLQDEVEKYDKLS---------AQNVSVDNSTCSNV 598
N P P V +S S S QD+V + DK S +N S +N S +
Sbjct: 526 NSRLVEIRPSVGPAVKF--SSPSFSSQDQVAELDKDSIFIKPSISNIKNDSTNNFDSSTL 583
Query: 599 LASNQSSCTSASVPAREGIATQSTQEDCV------VNSVNSECRRFSNGRIDNQTQHFLQ 652
+ S ++ P RE + + V V+ + +FS G I+NQ
Sbjct: 584 IP---SPLVLSNEPNREYESILKIEVHEVSGITKSVSQIGPGESQFSKGIIENQFLSSTL 640
Query: 653 ETTDSKVECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIK 712
E + S ++C+ + +P+ +L N S +N ++S E G S L DK +++K
Sbjct: 641 ENSSSFMDCSAVPSHLPSLELKNIVK---SDVNVKSSVRTCEVGNKSSLLDKLPRTIDVK 697
Query: 713 KESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNW 772
++S ++ Q + + LE YEW +A +Y PS +SHL PATDRLHLDVGHNW
Sbjct: 698 EKSCSSRHQFSGDTCNARTLNPLEHSPYEWHGVASLYIPSFNSHLPPATDRLHLDVGHNW 757
Query: 773 HNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGF 832
HNH R+ F P +HQ+RN G CN IL++PL MSLDWPP++++ SG+A ++T N+DSG
Sbjct: 758 HNHFRRSFTPAMHQSRNSSAKGSCNPILTRPLLMSLDWPPVLRSASGLASTMTSNHDSGK 817
Query: 833 ISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEEL 892
++ D P+ + Q+ DECD +W+SEEE+
Sbjct: 818 LT---------------------------------DFPDLSNNQDLADECDGNWISEEEM 844
Query: 893 EVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVA 952
E+H VSGIDYNQYFGGGVMYWN SDH G GFSRPPSLSSDDSSWAW EAD+ R VDDMVA
Sbjct: 845 EMHAVSGIDYNQYFGGGVMYWNPSDHHGAGFSRPPSLSSDDSSWAWREADMNRTVDDMVA 904
Query: 953 FSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEE 1012
FSSSYS NGLTSPTA SFCS FDPLG G QA YVV G ++P +LHSS+T D TEE+
Sbjct: 905 FSSSYS-NGLTSPTATSFCS-FDPLGSGKQALGYVVQGTDLPNNMLHSSTTMKDTVTEED 962
Query: 1013 ISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQ 1072
S +L DV+ KA D+ PILRPI+IP++SRERSRS+F ++HKSPC+PP+RREQ
Sbjct: 963 DPRSLPNLPSDVEGKA-DSHSFPILRPIVIPSMSRERSRSEFCHGYDHKSPCIPPTRREQ 1021
Query: 1073 PRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTS 1132
R+KRPPSPVVLCVPRAP PPPPSPVSDSRK RGFPTVRSGSSSPRHWGV+GWY +GT
Sbjct: 1022 SRVKRPPSPVVLCVPRAPIPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGVKGWYPDGTNL 1081
Query: 1133 EEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQ 1192
EE C+R+DG+EVVWP+WRNK+ S +QPLS LIA+ Q+A DQEHPDVAFPL
Sbjct: 1082 EEACLRIDGAEVVWPNWRNKSNSNCSRVQPLS-------LIAMPQIALDQEHPDVAFPLF 1134
Query: 1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
P + +C +K SLSLMHS LH+EIDSFCK VAAEN A+KPYI WAVKR TRSLQVLWPR
Sbjct: 1135 PPTI-SCSVKKESLSLMHSRLHDEIDSFCKHVAAENMAKKPYITWAVKRXTRSLQVLWPR 1193
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
SRTNIFGSNATGLSLP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL+
Sbjct: 1194 SRTNIFGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLS 1253
Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
NQEWVKSDSLKTVENTAIPIIMLVVEVPH+L+ S+ S++QSPKE+++ + + D + +D
Sbjct: 1254 NQEWVKSDSLKTVENTAIPIIMLVVEVPHELVTSSTSNMQSPKEESSAVSGEQDANNLND 1313
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
M +L+DS PKC + D+ + SVR+DISFK+PSHTGLQT++LVKELTEQFPA+ PLA
Sbjct: 1314 MASLEDSILPKCLEVNYDSSISTKSVRIDISFKTPSHTGLQTSELVKELTEQFPATIPLA 1373
Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
LVLK+FLADRSLDQSYSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1374 LVLKKFLADRSLDQSYSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1418
>gi|334186240|ref|NP_191917.3| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|332656419|gb|AEE81819.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 1481
Score = 1451 bits (3755), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 826/1470 (56%), Positives = 1029/1470 (70%), Gaps = 114/1470 (7%)
Query: 16 AEYFTN-SLMAQNQLIDSLTSHISLYHSHSLSSN-PNPSSNPRSSILKWFASLTVHQRQA 73
+++F++ S MAQNQLIDSLTSHISLYHSHS SS+ N NPRS+IL+WF+SL+VHQR +
Sbjct: 8 SKFFSSASSMAQNQLIDSLTSHISLYHSHSSSSSMANTIPNPRSAILRWFSSLSVHQRLS 67
Query: 74 HLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAES 133
HLT+VD KF Q+L+QMLG +R G FIILPDLPS LP LC+KKSRGL+SRV+ES
Sbjct: 68 HLTVVDPKFVQILLQMLGYIRTKGPCSFIILPDLPSSSD--LPSLCFKKSRGLISRVSES 125
Query: 134 NESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGF 193
NES R+VF+STRLF S EGE+ ++ SC VN LD+ ++ EF+ NVDRF++ MD +S+G F
Sbjct: 126 NESERFVFDSTRLFGSGEGERAQDCSCSVNSLDSVVMAEEFLTNVDRFVETMDVLSDGAF 185
Query: 194 LRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLN 253
LRGEES+L +WVE +WLKAKGYYS+EAF+ NRLEV +RLAWLN N+GK+RG+KLKEKLN
Sbjct: 186 LRGEESDLGSNWVELEWLKAKGYYSMEAFVANRLEVSMRLAWLNTNSGKRRGIKLKEKLN 245
Query: 254 AAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWL 313
AA AAN YWRKK CVDWW NLD A +K+ T + GK+AKS+ +E+L+EA+ A + MWL
Sbjct: 246 AAAAAANSYWRKKACVDWWQNLDAATHKKIWTCLFGKSAKSVIYEILREANQAQQGEMWL 305
Query: 314 FNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDIT 373
FN F ++ + S +S C + + P S+ P ++A+ SGL+VLQ+
Sbjct: 306 FN--------FASARKGRTDTSAVSF---CDMILEPNSVPRKPITVASNLSGLYVLQEFA 354
Query: 374 TMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSS 433
++++ Q+ + +FFSS+ ++T DC+LRKLRG LMV+S+D K EL + K S
Sbjct: 355 SLLILCQNGLVPVHSVFFSSMGTITTLVDCILRKLRGFLMVISIDSVKSELLDDNTHKCS 414
Query: 434 PNKS-KEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMG 492
P+ S +K + R++K + + K+ P KS + K K + L + + +
Sbjct: 415 PSSSSNQKLGSTNRKQKGKTRNMKKPTPEAKSDKNVNLSTKNGKKDQAKLEFNKSREAIE 474
Query: 493 SDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYN 552
KVP S + + E S + ME+V LVA K +K + KNK+K CT
Sbjct: 475 CKKVPTASTMIN-DPEASAATMEVV-------PGLVARK--GRTKKKRKEKNKSKKCT-- 522
Query: 553 NPVPVKDPKVSVLETSSSISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVP 612
SL++ E N SV NS+ + SSCTSA+
Sbjct: 523 -------------------SLENNGEV-------NKSVVNSSAIVKASKCDSSCTSANQH 556
Query: 613 AREGIATQSTQEDCVVNSVNSECRRFSNGRIDN-----QTQHFLQETTDSKVECNIISPD 667
+E I Q +E + C R +G + ++ +E + SK E ++IS D
Sbjct: 557 PQEYINAQIIEEHGSFS-----CERNRSGTCASVNGAANCEYSGEEESHSKAETHVISSD 611
Query: 668 MPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFF 727
+ + +D A G S +N Q S + +P++ L+ + ES QR E+ +
Sbjct: 612 LSS--VDPAGGPSCENVNPQKSCCRGDRKEKLTMPNERSRTLD-EGESHRIHHQRREAGY 668
Query: 728 GTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQA 787
G A SS E SYEWP +AP+YF +SSHL ATDRLHLDVGHN H +VRQPFV T+ A
Sbjct: 669 GFASSSS-EFVSYEWPAVAPMYFSHVSSHLPTATDRLHLDVGHNLHPYVRQPFVSTVQHA 727
Query: 788 RNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATK 847
RN +G Q+LS+P+PMSLDWPPMV + G+ + TCNYDSG +
Sbjct: 728 RNPSIEGSHKQVLSRPMPMSLDWPPMVHSNCGLTTAFTCNYDSGIL-------------- 773
Query: 848 GMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFG 907
+D+PE E G+EC+++W+ EE+ EVHTVSG+DYNQYFG
Sbjct: 774 -------------------VDIPEQKNKHELGNECENNWMLEEDFEVHTVSGVDYNQYFG 814
Query: 908 GGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTA 967
GGVMYWN SDH GTGFSRPPSLSSDDSSWAWHEA++KR+VDDMVAFSSSYS NGL SPTA
Sbjct: 815 GGVMYWNPSDHLGTGFSRPPSLSSDDSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPTA 874
Query: 968 ASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSK 1027
ASFCSPF PLGP +Q YVVPGNE+ K+L + TT + A EEE+SG+ ASLSGDV+
Sbjct: 875 ASFCSPFHPLGPPNQPLGYVVPGNEISTKILQAPPTTIEGAGEEEVSGTLASLSGDVEGN 934
Query: 1028 ALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVP 1087
+ D+LP PILRPIIIPN+S+ S++KRS++ KSP VPP+RRE PRIKRPPSPVVLCVP
Sbjct: 935 SGDSLPYPILRPIIIPNMSK----SEYKRSYDTKSPNVPPTRREHPRIKRPPSPVVLCVP 990
Query: 1088 RAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWP 1147
RAPRPPPPSPVS+SR RGFPTVRSGSSSPRHWG+RGW+H+G EE G+E+V P
Sbjct: 991 RAPRPPPPSPVSNSRARRGFPTVRSGSSSPRHWGMRGWFHDGVNWEEP----RGAEIVLP 1046
Query: 1148 SWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLS 1207
WRNK+L+ P+IQPL GALLQDHLIA+SQL RDQEHPDVAFPLQP E+ NCP + SLS
Sbjct: 1047 -WRNKSLAVRPIIQPLPGALLQDHLIAMSQLGRDQEHPDVAFPLQPPELLNCPMQGESLS 1105
Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
L+H +L++EIDSFCKQVAAEN ARKPYINWA+KRVTRSLQVLWPRSRTNIFGS+ATGLSL
Sbjct: 1106 LIHGILNDEIDSFCKQVAAENMARKPYINWAIKRVTRSLQVLWPRSRTNIFGSSATGLSL 1165
Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVK+DSLKTVEN
Sbjct: 1166 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKTDSLKTVEN 1225
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
TAIPIIMLVVEVP DLI S+QSPK+ T+ D++ +++MV +DSA+ T
Sbjct: 1226 TAIPIIMLVVEVPCDLIC----SIQSPKDGPDCITVDQDSNGNTEMVGFEDSAAANSLPT 1281
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
++ N+ A SVRLDISFK+PSHTGLQTT LVK+LTEQFPA+TPLALVLKQFLADR+LDQS
Sbjct: 1282 NTGNLAIAKSVRLDISFKTPSHTGLQTTQLVKDLTEQFPAATPLALVLKQFLADRTLDQS 1341
Query: 1448 YSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
YSGGLSSYCL+LLITRFLQHEHHLGR INQ
Sbjct: 1342 YSGGLSSYCLVLLITRFLQHEHHLGRSINQ 1371
>gi|255541176|ref|XP_002511652.1| nucleotidyltransferase, putative [Ricinus communis]
gi|223548832|gb|EEF50321.1| nucleotidyltransferase, putative [Ricinus communis]
Length = 1420
Score = 1285 bits (3325), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 689/1120 (61%), Positives = 813/1120 (72%), Gaps = 103/1120 (9%)
Query: 371 DITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNF 430
DI ++L QH EYD+ KIFFS+LR VS+ +DC+LRKLRGL+MV+SLDCTKLEL GEGN
Sbjct: 279 DIVALILPGQHREYDVSKIFFSTLRSVSSISDCILRKLRGLVMVISLDCTKLELLGEGNM 338
Query: 431 KSSPNKSKEKPSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDL 490
K +K KEKP+T RRKK + + K+ P ++E ++K K
Sbjct: 339 KHLTSKQKEKPNTGSRRKKAKTHNMKKPESAPDIVVNEAYINKPLK-------------- 384
Query: 491 MGSDKVPGISNGKDINRETSTSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKN-- 548
EH++ LV KGRT ARK + +NKNKN
Sbjct: 385 -------------------------------EHSQGLVLAKGRTAARKNRRGRNKNKNRN 413
Query: 549 CTYNNPVPVKDPKVSVLETSSS-ISLQDEVEKY----DKLSAQNVSVDNSTCSNVLASNQ 603
+ N+PV +++ + SV E + DE D L+ QNV D+ S N
Sbjct: 414 SSLNDPVDIRNSERSVAEAPCVPVISSDEAAMLGRASDDLAIQNVFSDDLVESANFTLNT 473
Query: 604 SSCTSASVPAREGIATQSTQEDCV-----VNSVNSECRRFSNGRIDNQTQHFLQETTDSK 658
S C + P +EGI + Q V S+NSEC++ SN I+++T E + K
Sbjct: 474 SFCGCVTEPRKEGIDAKRVQGRVVGCNGGTCSINSECKQTSNVMIEDRTISSRAEGVNFK 533
Query: 659 VECNIISPDMPARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVT 718
+E +IS + +LD N I F+N ET S + + + +K+ S +
Sbjct: 534 MEDKVISHVVQTPELDTVSSNE--DIKFRNE----ETKGKSNFSYRTVRNINVKEGSTLI 587
Query: 719 QDQRNESFFGTALKSSL-ECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVR 777
+++ A ++L E SYEWP++APVYFPSI+SHLLPA DRLHLDVG NWH+H+R
Sbjct: 588 KNK----ILNEARSTNLSEYISYEWPSLAPVYFPSITSHLLPAADRLHLDVGRNWHSHIR 643
Query: 778 QPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQ 837
QPFVPT+HQAR P + G N+ LS+PLPMSLDWPPMV+++SG+APS+TCNYDSGFIS Q
Sbjct: 644 QPFVPTVHQARTSPIESGYNRTLSRPLPMSLDWPPMVRSISGLAPSMTCNYDSGFISRLQ 703
Query: 838 SGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTV 897
+ F +E E +SH +SEEE+E+H V
Sbjct: 704 TAFH---------------------------------PKEPMAEYESHCISEEEMEMHAV 730
Query: 898 SGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSY 957
SGIDYNQYFGGGVMYWN SD+PGTGFSRPPSLSSDDS+WAWHE D+ RAVDDMVAFSSSY
Sbjct: 731 SGIDYNQYFGGGVMYWNPSDYPGTGFSRPPSLSSDDSTWAWHE-DMNRAVDDMVAFSSSY 789
Query: 958 STNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSF 1017
STNGL SPTAASFCSPFDP+G GHQA YVVPGNE+ GKVL SSST TD A EE++GS
Sbjct: 790 STNGLASPTAASFCSPFDPIGSGHQALGYVVPGNELTGKVLQSSSTVTDTAALEELTGSL 849
Query: 1018 ASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKR 1077
A++SGDV+ KA D+LP PIL PIIIPN+SRE+SRSDFKRSH+HKSPCVPPSRRE+PRIKR
Sbjct: 850 ANVSGDVEGKAGDSLPYPILPPIIIPNISREKSRSDFKRSHDHKSPCVPPSRRERPRIKR 909
Query: 1078 PPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCV 1137
PPSPVVLCVPRAP PPPPSPVS+SRK RGFPTVRSGSSSPRHW +RGWY E T SEE +
Sbjct: 910 PPSPVVLCVPRAPHPPPPSPVSNSRKQRGFPTVRSGSSSPRHWSMRGWY-ERTNSEEAYM 968
Query: 1138 RMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQ 1197
MDG+EVVWPSWRNKNLS HPMIQPL G LLQDHLIA+SQLARDQEHPDV+FPLQP E+
Sbjct: 969 HMDGTEVVWPSWRNKNLSTHPMIQPLPGGLLQDHLIAMSQLARDQEHPDVSFPLQPPELH 1028
Query: 1198 NCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNI 1257
NCP RKASLSLMHSLLH+EID FCK+VAAEN RKP+INWAVKRVTRSLQVLWPRSRTN+
Sbjct: 1029 NCPARKASLSLMHSLLHDEIDFFCKKVAAENMDRKPFINWAVKRVTRSLQVLWPRSRTNV 1088
Query: 1258 FGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV 1317
+GSNATGLSLP+SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV
Sbjct: 1089 YGSNATGLSLPTSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV 1148
Query: 1318 KSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALD 1377
K+DSLKTVENTAIPIIMLVVEVP DLI SA S++QS K++ T +++N V+SD+V +
Sbjct: 1149 KNDSLKTVENTAIPIIMLVVEVPSDLIISATSNIQSTKDEPTRMTAENENCVNSDIVISE 1208
Query: 1378 DSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQ 1437
+S+SPKC + D+ K S+RLDISFKSPSHTGLQTT+LVKELTEQFPA+TPLALVLKQ
Sbjct: 1209 ESSSPKCLQVNHDSRKDVKSIRLDISFKSPSHTGLQTTELVKELTEQFPAATPLALVLKQ 1268
Query: 1438 FLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
FLADRSLDQSYSGGLSSYCL+LLITRFLQHEHHLGRPINQ
Sbjct: 1269 FLADRSLDQSYSGGLSSYCLVLLITRFLQHEHHLGRPINQ 1308
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 203/277 (73%), Positives = 226/277 (81%), Gaps = 5/277 (1%)
Query: 19 FTNSLMAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIV 78
TNSL+ QNQ +DSLT+HISLYHS SL NP+P+ NPRSSILKWF+SLTVHQRQAHLT V
Sbjct: 7 LTNSLVTQNQFVDSLTAHISLYHSKSLPLNPSPNPNPRSSILKWFSSLTVHQRQAHLTTV 66
Query: 79 DSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGR 138
D KF QL+IQMLGKLR GHG FIILPDLP D LP LCYKKSRGLLSR AESNES R
Sbjct: 67 DFKFVQLVIQMLGKLRTQGHGRFIILPDLPVSD---LPSLCYKKSRGLLSRDAESNESER 123
Query: 139 WVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEE 198
+FES RLF SREGE E SC + CLD+ TVS + + NVD+F++ MD +SNG FLRGE+
Sbjct: 124 LIFESIRLFGSREGE--ESTSCSIKCLDSLTVSEDLIGNVDKFVETMDRVSNGEFLRGED 181
Query: 199 SELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMA 258
SEL DWVE DWLKAKGYYSIEAF+ NRLEV LRLAWLNC NGKKRGVK+KEK+NAAG A
Sbjct: 182 SELGSDWVELDWLKAKGYYSIEAFVANRLEVALRLAWLNCGNGKKRGVKVKEKVNAAGAA 241
Query: 259 ANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSL 295
ANV+ RKKGCVDWW+NLD RRK TV+LGKAA SL
Sbjct: 242 ANVFLRKKGCVDWWLNLDAETRRKFFTVVLGKAANSL 278
>gi|356506428|ref|XP_003521985.1| PREDICTED: uncharacterized protein LOC100809291 [Glycine max]
Length = 1436
Score = 1178 bits (3047), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 658/1110 (59%), Positives = 788/1110 (70%), Gaps = 77/1110 (6%)
Query: 381 HNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEK 440
++EYDIE +FFSSL V T +DC+LRK+RG LMV+SLDCTKLEL GE + KSS K KEK
Sbjct: 282 NSEYDIENLFFSSLGSVCTISDCILRKMRGFLMVISLDCTKLELLGEEHDKSSSGKPKEK 341
Query: 441 PSTIGRRKKCRACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGIS 500
PS R+KK R + KRQNP+ K+ +D++S + KD + + + +K DL+ S ++P +
Sbjct: 342 PSVSNRKKKGRNRNNKRQNPVSKTCVDDISHENPLKDIDCKVDNKKKTDLVASSELPAVH 401
Query: 501 NGKDINRETSTSEMEMVVCHQEHARALVAGK--GRTNARKTKTVKNKNKNCTYNNPVPVK 558
GK+I+ E +S +H + L GK RT +R K KNK+KN +
Sbjct: 402 MGKEISMEFPSS--------TDHTQGLDIGKIKVRTTSRSRKE-KNKSKNILIS--AGGD 450
Query: 559 DPKVSVLETSSSISLQDEVEKYDKL----SAQNVSVDNSTCSNVLASNQSSCTSASVPAR 614
K S+ S+++ + EV D+ + QNV DN+ +++LASN S C+S S +R
Sbjct: 451 SQKSSIHAASTTVISEGEVAICDRFLNSSTIQNVKNDNAIGNDILASNSSLCSSLSGLSR 510
Query: 615 EGIATQSTQ---EDCVV--NSVNSECRRFSNGRIDNQTQHFLQETTDSKVECNI-ISPDM 668
E +T+ + ED NS+ +C S+ R +T +T ++C +P +
Sbjct: 511 ENSSTRKVEGKTEDLAESGNSLGPQCCLLSDER---KTLCSGLDTLTCDIDCTAATTPPV 567
Query: 669 PARDLDNAFGNSISGINFQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFG 728
PA + F S NS ++ + +PDK I + K+ + + R+ F
Sbjct: 568 PAVKQGSFF--SKEDTCPLNSSCAAKADLKTTVPDKPIREVNAKEFGLLKE--RDRCLFE 623
Query: 729 TALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQAR 788
+ + +C YEWP + +YFPS +SHL PATDRLHLDVGHNWHNH PFVPTL QAR
Sbjct: 624 SRNSAFSKCSPYEWPGLPSIYFPSFNSHLPPATDRLHLDVGHNWHNHFCHPFVPTLQQAR 683
Query: 789 NHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKG 848
N P +GGCN ILS+P+PMS DWPP+ + G+ PS NYDSGFIS
Sbjct: 684 NPPIEGGCNPILSRPIPMSFDWPPVFRG--GMTPSPNYNYDSGFIS-------------- 727
Query: 849 MQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGG 908
DLP+ T T E DE D+H +SEEE EVHTVSGIDYNQYFGG
Sbjct: 728 -------------------DLPDLTNTLELADEFDNHCVSEEEYEVHTVSGIDYNQYFGG 768
Query: 909 GVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAA 968
GVMYWN SD+PG GFSRPPSLSSDDS WA +AD+ R VDDMVAFSSSYSTNGLTSPTAA
Sbjct: 769 GVMYWNPSDYPGKGFSRPPSLSSDDSLWALRDADMNRTVDDMVAFSSSYSTNGLTSPTAA 828
Query: 969 SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFAS-LSGDVDSK 1027
+FCSPFDP+G Q YV+ GNEVPGK+LHSSS T D A +E+ SGS + L G+V+ K
Sbjct: 829 TFCSPFDPVGTATQTIGYVMSGNEVPGKMLHSSSVT-DAAVDEDTSGSLGNNLPGEVEGK 887
Query: 1028 ALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVP 1087
A D+ P PILRPIIIPNLSRER +HKSPCVPPSRREQPRIKRPPSPVVLCVP
Sbjct: 888 AGDSHPYPILRPIIIPNLSRER--------FDHKSPCVPPSRREQPRIKRPPSPVVLCVP 939
Query: 1088 RAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWP 1147
RAPRPPPPSPVSDSRK RGFPTVRSGSSSPRHWG+RGWYH+G+ EE C+RMDG+EVVWP
Sbjct: 940 RAPRPPPPSPVSDSRKHRGFPTVRSGSSSPRHWGMRGWYHDGSNFEEACLRMDGAEVVWP 999
Query: 1148 SWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLS 1207
WR+ NL+ P+IQPL ALLQD LIA+SQ+ARDQEHPDV FPLQP ++Q+C + ASL+
Sbjct: 1000 -WRSNNLAVRPLIQPLPAALLQDRLIALSQIARDQEHPDVTFPLQPPDLQSCSAQSASLT 1058
Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
LMH +LH+EIDSFCKQVAAEN AR+PYINWAVKRVTR LQVLWPRSRTNIFGSNATG+SL
Sbjct: 1059 LMHGILHDEIDSFCKQVAAENMARRPYINWAVKRVTRFLQVLWPRSRTNIFGSNATGMSL 1118
Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
P+SDVDLVV LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ+WVK+DSLKTVEN
Sbjct: 1119 PTSDVDLVVGLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQDWVKNDSLKTVEN 1178
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
TAIPIIMLVVEVP D+I S A +QS E+ T +H N SD + L+DSA PK S
Sbjct: 1179 TAIPIIMLVVEVPQDVITSLAPMIQSLNEEPHCTPGEHGNDNQSDSIRLEDSALPKGSQM 1238
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
D +K + SVRLDISFKSPSHTGLQTT++VKELT QFPA+TPLALVLKQFLADRSLDQS
Sbjct: 1239 KFDALK-SKSVRLDISFKSPSHTGLQTTEMVKELTAQFPAATPLALVLKQFLADRSLDQS 1297
Query: 1448 YSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1298 YSGGLSSYCLVLLIIRFLQHEHHLGRPINQ 1327
Score = 368 bits (945), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 182/274 (66%), Positives = 218/274 (79%), Gaps = 3/274 (1%)
Query: 24 MAQNQLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKFA 83
MA +QLID+LTSHISLYHS S + NPNP+ NPRSSILKWF+SL++H RQAHLTIVD+ F
Sbjct: 1 MAHHQLIDTLTSHISLYHSQSPNPNPNPNPNPRSSILKWFSSLSIHHRQAHLTIVDANFV 60
Query: 84 QLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWVFES 143
Q+L+QML KLR++GHG FI+LPDLPSRD LP LC+KKSRGLL+RVA+S+ +GR VFES
Sbjct: 61 QILLQMLAKLRSHGHGSFILLPDLPSRD--NLPTLCFKKSRGLLARVADSDAAGRAVFES 118
Query: 144 TRLFSSREGEKIEEWSCP-VNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESELA 202
+RLF SREGE+ + P LD ++ FV +VDRF++ MD IS GGFLRGEE+EL
Sbjct: 119 SRLFDSREGEEAAIATLPSARRLDALALAEGFVGDVDRFVEAMDRISGGGFLRGEEAELG 178
Query: 203 GDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAANVY 262
DWVE WLK+KGYY IEAFI NR+EV +RLAWLNC G+KRGVKLKEK+ AAG+ NV+
Sbjct: 179 EDWVELHWLKSKGYYGIEAFIANRIEVSMRLAWLNCCGGRKRGVKLKEKMGAAGVGVNVF 238
Query: 263 WRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLT 296
WRKKGCVDWW NLD RRKV++ L KAAK L
Sbjct: 239 WRKKGCVDWWGNLDAGTRRKVISTFLMKAAKPLV 272
>gi|297810181|ref|XP_002872974.1| hypothetical protein ARALYDRAFT_912246 [Arabidopsis lyrata subsp.
lyrata]
gi|297318811|gb|EFH49233.1| hypothetical protein ARALYDRAFT_912246 [Arabidopsis lyrata subsp.
lyrata]
Length = 840
Score = 1031 bits (2665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/779 (67%), Positives = 615/779 (78%), Gaps = 51/779 (6%)
Query: 701 LPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPA 760
+P+ LE + ES QR E+ +G A SS E SYEWP +AP+YF ++SHL A
Sbjct: 1 MPNGRSRTLE-EGESHRIHHQRREAGYGIASSSS-EFVSYEWPAVAPMYFSHVNSHLPTA 58
Query: 761 TDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGI 820
TDRLHLDVGHN H +VRQPFV T+ ARN +G Q+LS+P+PMSLDWPPMV + G+
Sbjct: 59 TDRLHLDVGHNLHAYVRQPFVSTVQHARNPSIEGSHKQVLSRPMPMSLDWPPMVHSNCGL 118
Query: 821 APSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGD 880
+ CNYDSG + +D+PE E G+
Sbjct: 119 TTAFACNYDSGIL---------------------------------VDIPEQKNKPELGN 145
Query: 881 ECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHE 940
EC+++W+ EE+ EVHTVSG+DYNQYFGGGVMYWN SDH GTGFSRPPSLSSDDSSWAWHE
Sbjct: 146 ECENNWMLEEDFEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSRPPSLSSDDSSWAWHE 205
Query: 941 ADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHS 1000
A++KR+VDDMVAFSSSYS NGL SPTAASFCSPF PLGP +Q YVVPGNE+ K+L +
Sbjct: 206 AEMKRSVDDMVAFSSSYSANGLDSPTAASFCSPFHPLGPANQPLGYVVPGNEISTKILQA 265
Query: 1001 SSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEH 1060
TT++VA EEE+SG+ ASLSGDV+ + D+LP PILRPIIIPN+ S+SD+KRS++
Sbjct: 266 PPTTSEVAGEEEVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNM----SKSDYKRSYDT 321
Query: 1061 KSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHW 1120
KSP VPP+RRE PRIKRPPSPVVLCVPRAPR PPPSPVS+SR RGFPTVRSGSSSPRHW
Sbjct: 322 KSPNVPPTRREHPRIKRPPSPVVLCVPRAPRLPPPSPVSNSRARRGFPTVRSGSSSPRHW 381
Query: 1121 GVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLAR 1180
G+RGW+H+G EE C G+EVV P WRNK+L+ P+IQPL GALLQDHLIA+SQL R
Sbjct: 382 GMRGWFHDGVNWEEPC----GAEVVLP-WRNKSLAVRPIIQPLPGALLQDHLIAMSQLGR 436
Query: 1181 DQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVK 1240
DQEHPDVAFPLQP E+ NCP ++ SLSL+H LL++EIDSFCKQVAAEN ARKPYI+WA+K
Sbjct: 437 DQEHPDVAFPLQPPELLNCPMQE-SLSLIHGLLNDEIDSFCKQVAAENMARKPYISWAIK 495
Query: 1241 RVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK 1300
RVTRSLQVLWPRSRTNIFGS+ATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK
Sbjct: 496 RVTRSLQVLWPRSRTNIFGSSATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIK 555
Query: 1301 ETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKE--DA 1358
ETCLQHAARYLANQEWVK+DSLKTVENTAIPIIMLVVEVP DL+ S+QSPK+ D
Sbjct: 556 ETCLQHAARYLANQEWVKTDSLKTVENTAIPIIMLVVEVPCDLVC----SIQSPKDGPDC 611
Query: 1359 AHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
T+ D++ +++MV +DSA+ T++ N+ A SVRLDISFK+PSHTGLQTT LV
Sbjct: 612 ISITVDQDSNGNTEMVGFEDSAAANSLPTNTGNLANAKSVRLDISFKTPSHTGLQTTQLV 671
Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
K+LT+QFPA+TPLALVLKQFLADR+LDQSYSGGLSSYCL+LLITRFLQHEHHLG INQ
Sbjct: 672 KDLTDQFPAATPLALVLKQFLADRTLDQSYSGGLSSYCLVLLITRFLQHEHHLGLSINQ 730
>gi|19699089|gb|AAL90911.1| AT4g00060/F6N15_10 [Arabidopsis thaliana]
gi|25090419|gb|AAN72296.1| At4g00060/F6N15_10 [Arabidopsis thaliana]
Length = 839
Score = 999 bits (2584), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/766 (68%), Positives = 608/766 (79%), Gaps = 47/766 (6%)
Query: 712 KKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAPVYFPSISSHLLPATDRLHLDVGHN 771
+ ES QR E+ +G A SS E SYEWP +AP+YF +SSHL ATDRLHLDVGHN
Sbjct: 11 EGESHRIHHQRREAGYGFASSSS-EFVSYEWPAVAPMYFSHVSSHLPTATDRLHLDVGHN 69
Query: 772 WHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSG 831
H +VRQPFV T+ ARN +G Q+LS+P+PMSLDWPPMV + G+ + TCNYDSG
Sbjct: 70 LHPYVRQPFVSTVQHARNPSIEGSHKQVLSRPMPMSLDWPPMVHSNCGLTTAFTCNYDSG 129
Query: 832 FISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEE 891
+ +D+PE E G+EC+++W+ EE+
Sbjct: 130 IL---------------------------------VDIPEQKNKHELGNECENNWMLEED 156
Query: 892 LEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMV 951
EVHTVSG+DYNQYFGGGVMYWN SDH GTGFSRPPSLSSDDSSWAWHEA++KR+VDDMV
Sbjct: 157 FEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSRPPSLSSDDSSWAWHEAEMKRSVDDMV 216
Query: 952 AFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEE 1011
AFSSSYS NGL SPTAASFCSPF PLGP +Q YVVPGNE+ K+L + TT + A EE
Sbjct: 217 AFSSSYSANGLDSPTAASFCSPFHPLGPPNQPLGYVVPGNEISTKILQAPPTTIEGAGEE 276
Query: 1012 EISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRRE 1071
E+SG+ ASLSGDV+ + D+LP PILRPIIIPN+S+ S++KRS++ KSP VPP+RRE
Sbjct: 277 EVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNMSK----SEYKRSYDTKSPNVPPTRRE 332
Query: 1072 QPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTT 1131
PRIKRPPSPVVLCVPRAPRPPPPSPVS+SR RGFPTVRSGSSSPRHWG+RGW+H+G
Sbjct: 333 HPRIKRPPSPVVLCVPRAPRPPPPSPVSNSRARRGFPTVRSGSSSPRHWGMRGWFHDGVN 392
Query: 1132 SEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPL 1191
EE G+E+V P WRNK+L+ P+IQPL GALLQDHLIA+SQL RDQEHPDVAFPL
Sbjct: 393 WEEP----RGAEIVLP-WRNKSLAVRPIIQPLPGALLQDHLIAMSQLGRDQEHPDVAFPL 447
Query: 1192 QPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWP 1251
QP E+ NCP + SLSL+H +L++EIDSFCKQVAAEN ARKPYINWA+KRVTRSLQVLWP
Sbjct: 448 QPPELLNCPMQGESLSLIHGILNDEIDSFCKQVAAENMARKPYINWAIKRVTRSLQVLWP 507
Query: 1252 RSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 1311
RSRTNIFGS+ATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL
Sbjct: 508 RSRTNIFGSSATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 567
Query: 1312 ANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHS 1371
ANQEWVK+DSLKTVENTAIPIIMLVVEVP DLI S+QSPK+ T+ D++ ++
Sbjct: 568 ANQEWVKTDSLKTVENTAIPIIMLVVEVPCDLIC----SIQSPKDGPDCITVDQDSNGNT 623
Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
+MV +DSA+ T++ N+ A SVRLDISFK+PSHTGLQTT LVK+LTEQFPA+TPL
Sbjct: 624 EMVGFEDSAAANSLPTNTGNLAIAKSVRLDISFKTPSHTGLQTTQLVKDLTEQFPAATPL 683
Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
ALVLKQFLADR+LDQSYSGGLSSYCL+LLITRFLQHEHHLGR INQ
Sbjct: 684 ALVLKQFLADRTLDQSYSGGLSSYCLVLLITRFLQHEHHLGRSINQ 729
>gi|3193327|gb|AAC19309.1| F6N15.10 gene product [Arabidopsis thaliana]
gi|7267093|emb|CAB80764.1| hypothetical protein [Arabidopsis thaliana]
Length = 1260
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/1135 (48%), Positives = 692/1135 (60%), Gaps = 161/1135 (14%)
Query: 392 SSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKS-KEKPSTIGRRKKC 450
SS+ ++T DC+LRKLRG LMV+S+D K EL + K SP+ S +K + R++K
Sbjct: 218 SSMGTITTLVDCILRKLRGFLMVISIDSVKSELLDDNTHKCSPSSSSNQKLGSTNRKQKG 277
Query: 451 RACSTKRQNPLPKSALDELSLDKLPKDPEGALTDTEKVDLMGSDKVPGISNGKDINRETS 510
+ + K+ P KS + K K + L + + + KVP S + + E S
Sbjct: 278 KTRNMKKPTPEAKSDKNVNLSTKNGKKDQAKLEFNKSREAIECKKVPTASTMIN-DPEAS 336
Query: 511 TSEMEMVVCHQEHARALVAGKGRTNARKTKTVKNKNKNCTYNNPVPVKDPKVSVLETSSS 570
+ ME+V LVA K +K + KNK+K CT
Sbjct: 337 AATMEVV-------PGLVARK--GRTKKKRKEKNKSKKCT-------------------- 367
Query: 571 ISLQDEVEKYDKLSAQNVSVDNSTCSNVLASNQSSCTSASVPAREGIATQSTQEDCVVNS 630
SL++ E N SV NS+ + SSCTSA+ +E I Q +E +
Sbjct: 368 -SLENNGEV-------NKSVVNSSAIVKASKCDSSCTSANQHPQEYINAQIIEEHGSFS- 418
Query: 631 VNSECRRFSNGRIDN-----QTQHFLQETTDSKVECNIISPDMPARDLDNAFGNSISGIN 685
C R +G + ++ +E + SK E ++IS D+ + +D A G S +N
Sbjct: 419 ----CERNRSGTCASVNGAANCEYSGEEESHSKAETHVISSDLSS--VDPAGGPSCENVN 472
Query: 686 FQNSFHESETGAISVLPDKGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTI 745
Q S + +P++ L+ + ES QR E+ +G A SS E SYEWP +
Sbjct: 473 PQKSCCRGDRKEKLTMPNERSRTLD-EGESHRIHHQRREAGYGFASSSS-EFVSYEWPAV 530
Query: 746 APVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILSQPLP 805
AP+YF +SSHL ATDRLHLDVGHN H +VRQPFV T+ ARN +G Q+LS+P+P
Sbjct: 531 APMYFSHVSSHLPTATDRLHLDVGHNLHPYVRQPFVSTVQHARNPSIEGSHKQVLSRPMP 590
Query: 806 MSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGD 865
MSLDWPPMV + G+ + TCNYDSG +
Sbjct: 591 MSLDWPPMVHSNCGLTTAFTCNYDSGIL-------------------------------- 618
Query: 866 FMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSR 925
+D+PE E G+EC+++W+ EE+ EVHTVSG+DYNQYFGGGVMYWN SDH GTGFSR
Sbjct: 619 -VDIPEQKNKHELGNECENNWMLEEDFEVHTVSGVDYNQYFGGGVMYWNPSDHLGTGFSR 677
Query: 926 PPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFS 985
PPSLSSDDSSWAWHEA++KR+VDDMVAFSSSYS NGL SPTAASFCSPF PLGP +Q
Sbjct: 678 PPSLSSDDSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPTAASFCSPFHPLGPPNQPLG 737
Query: 986 YVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNL 1045
YVVPGNE+ K+L + TT + A EEE+SG+ ASLSGDV+ + D+LP PILRPIIIPN+
Sbjct: 738 YVVPGNEISTKILQAPPTTIEGAGEEEVSGTLASLSGDVEGNSGDSLPYPILRPIIIPNM 797
Query: 1046 SRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKTR 1105
S+S++KRS++ KSP VPP+RRE PRIKRPPSPVVLCVPRAPRPPPPSPVS+SR R
Sbjct: 798 ----SKSEYKRSYDTKSPNVPPTRREHPRIKRPPSPVVLCVPRAPRPPPPSPVSNSRARR 853
Query: 1106 GFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLSG 1165
GFPTVRSGSSSPRHWG+RGW+H+G EE G+E+V P WRNK+L+ P+IQPL G
Sbjct: 854 GFPTVRSGSSSPRHWGMRGWFHDGVNWEEP----RGAEIVLP-WRNKSLAVRPIIQPLPG 908
Query: 1166 ALLQDHLIAISQLARDQEH--------PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEI 1217
ALLQDHLIA+SQL RDQEH PDVAFPLQP E+ NCP + SLSL+H + I
Sbjct: 909 ALLQDHLIAMSQLGRDQEHVSNVFDIYPDVAFPLQPPELLNCPMQGESLSLIHE-NYMSI 967
Query: 1218 DSFC------------------KQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
C +VAAEN ARKPYINWA+KRVTRSLQVLWPRSRTNIFG
Sbjct: 968 FEVCIYFSWDGVSSNPLNYILLAKVAAENMARKPYINWAIKRVTRSLQVLWPRSRTNIFG 1027
Query: 1260 SNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKS 1319
S+ATGLSLPSSDVDLVVCLPPVRNL + + I+ L + NQ+ V
Sbjct: 1028 SSATGLSLPSSDVDLVVCLPPVRNL--VSKPKIIPHLVLKIYISLLLKWVVVYNQDSVLK 1085
Query: 1320 DSLKTV--ENTAIPIIM-------------------------------LVVEVPHDLIAS 1346
+ + NT + I + LVVEVP DLI
Sbjct: 1086 FYIYFILMMNTCLYIFVLGASLIIPYMVLFFLICGCSGHVIYEIPIIMLVVEVPCDLIC- 1144
Query: 1347 AASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKS 1406
S+QSPK+ T+ D++ +++MV +DSA+ T++ N+ A SVRLDISFK+
Sbjct: 1145 ---SIQSPKDGPDCITVDQDSNGNTEMVGFEDSAAANSLPTNTGNLAIAKSVRLDISFKT 1201
Query: 1407 PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
PSHTGLQTT LVK+LTEQFPA+TPLALVLKQFLADR+LDQSYSGGLSSYCL+ I
Sbjct: 1202 PSHTGLQTTQLVKDLTEQFPAATPLALVLKQFLADRTLDQSYSGGLSSYCLVRCI 1256
Score = 249 bits (637), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 140/232 (60%), Positives = 179/232 (77%), Gaps = 5/232 (2%)
Query: 16 AEYFTN-SLMAQNQLIDSLTSHISLYHSHSLSSN-PNPSSNPRSSILKWFASLTVHQRQA 73
+++F++ S MAQNQLIDSLTSHISLYHSHS SS+ N NPRS+IL+WF+SL+VHQR +
Sbjct: 8 SKFFSSASSMAQNQLIDSLTSHISLYHSHSSSSSMANTIPNPRSAILRWFSSLSVHQRLS 67
Query: 74 HLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAES 133
HLT+VD KF Q+L+QMLG +R G FIILPDLPS LP LC+KKSRGL+SRV+ES
Sbjct: 68 HLTVVDPKFVQILLQMLGYIRTKGPCSFIILPDLPSSSD--LPSLCFKKSRGLISRVSES 125
Query: 134 NESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGF 193
NES R+VF+STRLF S EGE+ ++ SC VN LD+ ++ EF+ NVDRF++ MD +S+G F
Sbjct: 126 NESERFVFDSTRLFGSGEGERAQDCSCSVNSLDSVVMAEEFLTNVDRFVETMDVLSDGAF 185
Query: 194 LRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRG 245
LRGEES+L +WVE +WLKAKGYYS+EAF+ N +G ++C K RG
Sbjct: 186 LRGEESDLGSNWVELEWLKAKGYYSMEAFVANS-SMGTITTLVDCILRKLRG 236
>gi|218198719|gb|EEC81146.1| hypothetical protein OsI_24054 [Oryza sativa Indica Group]
Length = 1254
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/752 (56%), Positives = 522/752 (69%), Gaps = 14/752 (1%)
Query: 732 KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
+++L SYEWP++AP +F S PAT DRLHLDVG+ W QPF+P HQ RN
Sbjct: 401 RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 460
Query: 791 PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
P + GCNQ+LS +P+S DWPP+ + + + YD + QS F +
Sbjct: 461 PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 520
Query: 850 QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
Q + ++ + K GD +P + GD+ +S+W SEEE + +SG D NQYFGGG
Sbjct: 521 QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 576
Query: 910 VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V +Y+ N ++SP +
Sbjct: 577 VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 636
Query: 969 SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
S CS + P Q+ + + N + + L S S+ D ++ IS S + G K
Sbjct: 637 SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVSKSVSCGSEVIKG 696
Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
DTLP +LRPI++P +SR SRS+FK H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 697 -DTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 754
Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R + E +DG EVVWPS
Sbjct: 755 VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 813
Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
W NK A + Q + +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 814 WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 872
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
MH+ LHEEID FCKQV+AEN RKPYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP
Sbjct: 873 MHNALHEEIDQFCKQVSAENLVRKPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALP 932
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
+SDVDLV+ LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR LANQ+WV++DSLKTVENT
Sbjct: 933 TSDVDLVISLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLANQDWVRTDSLKTVENT 992
Query: 1329 AIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS--PKCS 1385
AIP+IMLV +VP D ++S SSV + + D S S P CS
Sbjct: 993 AIPVIMLVAQVPCDTNLSSEYSSVLDSSHEQLSINVLGDQGSPSRSDNSSSEGSSMPMCS 1052
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
+ D+ S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+FLADRSLD
Sbjct: 1053 RINKDDGNVVQSIRLDISFKSPSHTGLQTTELVCELTQQFPAAVPLALILKKFLADRSLD 1112
Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1113 HPYSGGLSSYCLVLLIVRFLQHEHHLGRPINQ 1144
Score = 122 bits (306), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 125/247 (50%), Gaps = 23/247 (9%)
Query: 207 EFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----------NNGKKRGVKLKEKLNAAG 256
E WLKAKGYY IE F+ N +E+ LR++W + G +KEK AG
Sbjct: 8 ELPWLKAKGYYVIEEFVANWVEIALRMSWAAAGGGGGGGGGGRKAVRVGKGVKEK---AG 64
Query: 257 MAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNA 316
+AAN +WR+KG VDWWM L+ +R +++ GK +L +E++ E + + F
Sbjct: 65 LAANAFWREKGYVDWWMRLEPRVRARIMGAFFGKGTTALANEII-EGTYIASSEKFSFCL 123
Query: 317 GMKQS--SRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASL-ATVFSGLFVLQDIT 373
G +S + Y+ + Q S + C L + P++LS + L ++Q+I
Sbjct: 124 GEPESFVAETYYESTRQ---SFFRRNRPCCLDV-PSTLSCKKNPIFVKELQRLMLVQEI- 178
Query: 374 TMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSS 433
+ L S + + IFF+ L T D +L KLRG+LMVVS + LEL G+ K++
Sbjct: 179 -VYLKSNISNRSGDAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLELIGDEAPKAA 237
Query: 434 PNKSKEK 440
K EK
Sbjct: 238 KKKDVEK 244
>gi|222636058|gb|EEE66190.1| hypothetical protein OsJ_22308 [Oryza sativa Japonica Group]
Length = 1486
Score = 772 bits (1993), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/752 (56%), Positives = 522/752 (69%), Gaps = 14/752 (1%)
Query: 732 KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
+++L SYEWP++AP +F S PAT DRLHLDVG+ W QPF+P HQ RN
Sbjct: 633 RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 692
Query: 791 PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
P + GCNQ+LS +P+S DWPP+ + + + YD + QS F +
Sbjct: 693 PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 752
Query: 850 QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
Q + ++ + K GD +P + GD+ +S+W SEEE + +SG D NQYFGGG
Sbjct: 753 QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 808
Query: 910 VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V +Y+ N ++SP +
Sbjct: 809 VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 868
Query: 969 SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
S CS + P Q+ + + N + + L S S+ D ++ IS S + G K
Sbjct: 869 SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVSKSVSCGSEVIKG 928
Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
DTLP +LRPI++P +SR SRS+FK H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 929 -DTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 986
Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R + E +DG EVVWPS
Sbjct: 987 VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 1045
Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
W NK A + Q + +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 1046 WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 1104
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
MH+ LHEEID FCKQV+AEN RKPYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP
Sbjct: 1105 MHNALHEEIDQFCKQVSAENLVRKPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALP 1164
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
+SDVDLV+ LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR LANQ+WV++DSLKTVENT
Sbjct: 1165 TSDVDLVISLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLANQDWVRTDSLKTVENT 1224
Query: 1329 AIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS--PKCS 1385
AIP+IMLV +VP D ++S SSV + + D S S P CS
Sbjct: 1225 AIPVIMLVAQVPCDTNLSSEYSSVLDSSHEQLSINVLGDQGSPSRSDNSSSEGSSMPMCS 1284
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
+ D+ S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+FLADRSLD
Sbjct: 1285 RINKDDGNVVQSIRLDISFKSPSHTGLQTTELVCELTQQFPAAVPLALILKKFLADRSLD 1344
Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1345 HPYSGGLSSYCLVLLIVRFLQHEHHLGRPINQ 1376
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 126/376 (33%), Positives = 188/376 (50%), Gaps = 42/376 (11%)
Query: 89 MLGKLRANGHG-FFIILPDLPS-----RDPPYLPGLCYKKSRGLLSRVAESNESGRWVFE 142
ML +LRA GH FF++ P PS + P + + SRGLL+R AE++ VF+
Sbjct: 1 MLRRLRARGHSSFFVLHPPSPSSAAVAEEGAAAPTVLSRLSRGLLARAAEASRGQGLVFD 60
Query: 143 STRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE-ESEL 201
LF S LD TV+ E + ++D F+ MDEIS G FL G+ E +L
Sbjct: 61 HLLLFPSSPTSSSR--------LDAITVAEELLGDLDGFVAAMDEISGGRFLCGDGEVDL 112
Query: 202 AG----DWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----------NNGKKRGVK 247
++ E WLKAKGYY IE F+ N +E+ LR++W + G
Sbjct: 113 VAMVGEEFPELPWLKAKGYYVIEEFVANWVEIALRMSWAAAGGGGGGGGGGRKAVRVGKG 172
Query: 248 LKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNAL 307
+KEK AG+AAN +WR+KG VDWWM L+ +R +++ GK +L +E++ E +
Sbjct: 173 VKEK---AGLAANAFWREKGYVDWWMRLEPRVRARIMGAFFGKGTTALANEII-EGTYIA 228
Query: 308 EDGMWLFNAGMKQS--SRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASL-ATVFS 364
+ F G +S + Y+ + Q S + C L + P++LS +
Sbjct: 229 SSEKFSFCLGEPESFVAETYYESTRQ---SFFRRNRPCCLDV-PSTLSCKKNPIFVKELQ 284
Query: 365 GLFVLQDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLEL 424
L ++Q+I + L S + + IFF+ L T D +L KLRG+LMVVS + LEL
Sbjct: 285 RLMLVQEI--VYLKSNISNRSGDAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLEL 342
Query: 425 FGEGNFKSSPNKSKEK 440
G+ K++ K EK
Sbjct: 343 IGDEAPKAAKKKDVEK 358
>gi|52075837|dbj|BAD45445.1| nucleotidyltransferase-like [Oryza sativa Japonica Group]
Length = 1461
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 423/752 (56%), Positives = 522/752 (69%), Gaps = 14/752 (1%)
Query: 732 KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
+++L SYEWP++AP +F S PAT DRLHLDVG+ W QPF+P HQ RN
Sbjct: 634 RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 693
Query: 791 PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
P + GCNQ+LS +P+S DWPP+ + + + YD + QS F +
Sbjct: 694 PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 753
Query: 850 QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
Q + ++ + K GD +P + GD+ +S+W SEEE + +SG D NQYFGGG
Sbjct: 754 QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 809
Query: 910 VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V +Y+ N ++SP +
Sbjct: 810 VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 869
Query: 969 SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
S CS + P Q+ + + N + + L S S+ D ++ IS S + G K
Sbjct: 870 SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVSKSVSCGSEVIKG 929
Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
DTLP +LRPI++P +SR SRS+FK H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 930 -DTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 987
Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R + E +DG EVVWPS
Sbjct: 988 VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 1046
Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
W NK A + Q + +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 1047 WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 1105
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
MH+ LHEEID FCKQV+AEN RKPYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP
Sbjct: 1106 MHNALHEEIDQFCKQVSAENLVRKPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALP 1165
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
+SDVDLV+ LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR LANQ+WV++DSLKTVENT
Sbjct: 1166 TSDVDLVISLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLANQDWVRTDSLKTVENT 1225
Query: 1329 AIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS--PKCS 1385
AIP+IMLV +VP D ++S SSV + + D S S P CS
Sbjct: 1226 AIPVIMLVAQVPCDTNLSSEYSSVLDSSHEQLSINVLGDQGSPSRSDNSSSEGSSMPMCS 1285
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
+ D+ S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+FLADRSLD
Sbjct: 1286 RINKDDGNVVQSIRLDISFKSPSHTGLQTTELVCELTQQFPAAVPLALILKKFLADRSLD 1345
Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
YSGGLSSYCL+LLI RFLQHEHHLGRPINQ
Sbjct: 1346 HPYSGGLSSYCLVLLIVRFLQHEHHLGRPINQ 1377
Score = 160 bits (406), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 119/341 (34%), Positives = 176/341 (51%), Gaps = 39/341 (11%)
Query: 28 QLIDSLTSHISLYHSHSLSSNPNPSSNPRSSILKWFASLTVHQRQAHLT-IVDSKFAQLL 86
QL D+LT+H+SLYH+ + + S R++IL+W ASL+ R A T ++ A L
Sbjct: 15 QLADALTAHLSLYHAATPPPPSSSSP--RAAILRWLASLSPPCRAAAATSLLTPAAAAAL 72
Query: 87 IQMLGKLRANGHG-FFIILPDLPS-----RDPPYLPGLCYKKSRGLLSRVAESNESGRWV 140
+ ML +LRA GH FF++ P PS + P + + SRGLL+R AE++ V
Sbjct: 73 LSMLRRLRARGHSSFFVLHPPSPSSAAVAEEGAAAPTVLSRLSRGLLARAAEASRGQGLV 132
Query: 141 FESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE-ES 199
F+ LF S LD TV+ E + ++D F+ MDEIS G FL G+ E
Sbjct: 133 FDHLLLFPSSPTSSSR--------LDAITVAEELLGDLDGFVAAMDEISGGRFLCGDGEV 184
Query: 200 ELAG----DWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----------NNGKKRG 245
+L ++ E WLKAKGYY IE F+ N +E+ LR++W + G
Sbjct: 185 DLVAMVGEEFPELPWLKAKGYYVIEEFVANWVEIALRMSWAAAGGGGGGGGGGRKAVRVG 244
Query: 246 VKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLK-EAS 304
+KEK AG+AAN +WR+KG VDWWM L+ +R +++ GK +L LK S
Sbjct: 245 KGVKEK---AGLAANAFWREKGYVDWWMRLEPRVRARIMGAFFGKGTTALEIVYLKSNIS 301
Query: 305 NALEDGMW---LFNAGMKQSSRFYHSKSLQRTISTLSVDVE 342
N D ++ L +AG + + +ST S+++E
Sbjct: 302 NRSGDAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLE 342
Score = 46.6 bits (109), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 32/54 (59%)
Query: 387 EKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSPNKSKEK 440
+ IFF+ L T D +L KLRG+LMVVS + LEL G+ K++ K EK
Sbjct: 306 DAIFFTKLMSAGTVADHILMKLRGILMVVSTESINLELIGDEAPKAAKKKDVEK 359
>gi|242096678|ref|XP_002438829.1| hypothetical protein SORBIDRAFT_10g026810 [Sorghum bicolor]
gi|241917052|gb|EER90196.1| hypothetical protein SORBIDRAFT_10g026810 [Sorghum bicolor]
Length = 1031
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 416/743 (55%), Positives = 512/743 (68%), Gaps = 33/743 (4%)
Query: 739 SYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCN 797
SYEWP++ P +F S PA DRLHLDVG+ W QPF+P HQ RN P D GCN
Sbjct: 208 SYEWPSVTPSHFVSPQMQQRPAAEDRLHLDVGYKWPTQFSQPFLPANHQVRNPPIDAGCN 267
Query: 798 QIL-SQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTS 856
Q+L S +P+S DWPP+ + G + +YD + QS F + +Q S
Sbjct: 268 QMLPSLAVPLSFDWPPVFR---GYGKNAAVSYDPLYTPQMQSSAWSGFPAQLVQRGGICS 324
Query: 857 DDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTS 916
D + D+ +S+W SEEE + SG D NQYFGGGVMYW+ +
Sbjct: 325 D------------------SDVADDTESYWFSEEESDSRAHSGRDINQYFGGGVMYWSPA 366
Query: 917 DHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDP 976
+H GTGFSRPPSLSSDDS+WAWHEAD+ R VDD+ S TNG++SP + CS +
Sbjct: 367 EHAGTGFSRPPSLSSDDSAWAWHEADVIRVVDDVANGIPSTYTNGVSSPPSTPSCSQNES 426
Query: 977 LGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPI 1036
L PG ++ + GNE+ + L S S+ D + E++I+ ++S + DTLP +
Sbjct: 427 LDPG----AHSITGNEINNEALTSPSSVQD-SPEDKITSVAKAVSCGSEVVKADTLPYAM 481
Query: 1037 LRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPS 1096
LRPI++PN+SR SRSD K H+H+SPCVP +RR+ P ++RPPSPVVL VPR PRPPPPS
Sbjct: 482 LRPIVVPNISRRSSRSDIKGGHDHRSPCVPSTRRDIPILRRPPSPVVLSVPRVPRPPPPS 541
Query: 1097 PVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSA 1156
P +SRK RGFP VRSGSSSPRHWG+RG + E +DG EVVWPSW NK S
Sbjct: 542 PAGESRK-RGFPIVRSGSSSPRHWGMRGLFSEDKIFHRAQFCLDGPEVVWPSWGNKGTSG 600
Query: 1157 HPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEE 1216
++Q + +LQDHL+ ISQL+RDQ HPDVA P+QPL++ N + K SLSLMH+ LHEE
Sbjct: 601 T-LVQTIEDTVLQDHLVKISQLSRDQ-HPDVALPVQPLDMLNGLSHKTSLSLMHNALHEE 658
Query: 1217 IDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
ID FCKQVAA + R+PYINWAV+RVTR LQVLWPRSRTN+FGSNATGL+LP+SDVDLVV
Sbjct: 659 IDQFCKQVAAADLVRRPYINWAVRRVTRCLQVLWPRSRTNLFGSNATGLALPTSDVDLVV 718
Query: 1277 CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR L NQ+WV+SDSLKTVENTAIP+IMLV
Sbjct: 719 SLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLTNQDWVRSDSLKTVENTAIPVIMLV 778
Query: 1337 VEVPHDLIAS--AASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKA 1394
+VP D S +S + +E + + + + SD + + S CS + D+
Sbjct: 779 ADVPCDTNMSNEYSSVLDGSQEYSVNMLGEQGSPPRSDTSSSEGSNMLVCSKLNKDDCDI 838
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
S+RLDISFKS SHTGLQTT+LV+ELT+QFPA PLAL+LK+FLADRSLD YSGGLSS
Sbjct: 839 VQSIRLDISFKSSSHTGLQTTELVRELTQQFPAVVPLALILKKFLADRSLDHPYSGGLSS 898
Query: 1455 YCLMLLITRFLQHEHHLGRPINQ 1477
YCL+LLI RFLQHEHHLGRPINQ
Sbjct: 899 YCLVLLIVRFLQHEHHLGRPINQ 921
>gi|357117360|ref|XP_003560438.1| PREDICTED: uncharacterized protein LOC100844132 [Brachypodium
distachyon]
Length = 1247
Score = 712 bits (1838), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/801 (51%), Positives = 519/801 (64%), Gaps = 41/801 (5%)
Query: 685 NFQNSFHESETGAISVLPD---KGIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYE 741
N + FH S VLP I I V ++ E + ++ L SYE
Sbjct: 462 NMDHKFHNSVVTTDKVLPPVIPANILQSAINDNGTVMKNGGGEYYVFK--RNLLGGTSYE 519
Query: 742 WPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQILS 801
WP++AP + A DRLHLD+G+ W QPF+PT HQ R+ P + GCNQ+LS
Sbjct: 520 WPSVAPHFVSPEIQQRPAAADRLHLDIGYRWPTQFDQPFLPTNHQLRSPPIESGCNQMLS 579
Query: 802 Q-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEG 860
+P+S DWPP+ + ++ + +YD + QS F + MQ S+ +
Sbjct: 580 SLAVPLSFDWPPVFRGYGKLSQNAALSYDPVYAPQMQSSAWPGFPAQLMQRGGFCSEKDR 639
Query: 861 KCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPG 920
K D +P T + GD+ +S+W SEEE + VSG D NQYFGGGVMYW+ ++H G
Sbjct: 640 KYFAD----SDPRNTSDVGDDTESYWCSEEESDGRAVSGRDINQYFGGGVMYWSPAEHAG 695
Query: 921 TGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAASFCSPFDPLGP 979
TGFSRPPSLSSDDS+WAWHEAD+ R VDD+ V S+Y+ G +SP + CS + P
Sbjct: 696 TGFSRPPSLSSDDSAWAWHEADVSRVVDDLGVGIPSAYNA-GASSPPSTPSCSQNETSDP 754
Query: 980 GHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRP 1039
Q + + GN++ + LHSSS+T D + E++ + + S S + DTLP +LRP
Sbjct: 755 STQPVCHSMVGNDINNEALHSSSSTQD-SPEDKTTSAARSPSSVSEIVKGDTLPYAMLRP 813
Query: 1040 IIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVS 1099
I++PN+SR SRSDF+ H+H+SPCV +RR+ P ++RPPSPVVL VPR PRPPPPSPV
Sbjct: 814 IVVPNISRRLSRSDFRGGHDHRSPCVSSTRRDIPVVRRPPSPVVLSVPRMPRPPPPSPVG 873
Query: 1100 DSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPM 1159
+SRK RGFP VRSGSSSPRHWG+R + + +DG EVVWPSW NK A +
Sbjct: 874 ESRK-RGFPIVRSGSSSPRHWGMRSLFSDDKIFNRAQFCLDGPEVVWPSWVNKGTYAGTL 932
Query: 1160 IQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDS 1219
+Q + +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSLM + LHEEID
Sbjct: 933 VQSIEDTVLQDHLVKISQLSRDQ-HPDVAVPLQPPDMSNCSSPKASLSLMLNALHEEIDQ 991
Query: 1220 FCKQV-------------------AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGS 1260
FCKQV AA N KPYINWAVKRVTR LQVLWPRSRTN+FGS
Sbjct: 992 FCKQVSPLSPAFCITTFIRNTCKVAAGNLVTKPYINWAVKRVTRCLQVLWPRSRTNLFGS 1051
Query: 1261 NATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSD 1320
NATGL+LP+SDVDLVV LPPVRNLEPIKEAGILEGRNGIKETCLQHAAR L NQ+WV+SD
Sbjct: 1052 NATGLALPTSDVDLVVSLPPVRNLEPIKEAGILEGRNGIKETCLQHAARCLGNQDWVRSD 1111
Query: 1321 SLKTVENTAIPIIMLVVEVPHDLIASA--ASSVQSPKEDAAHTTLKHDNHVHSDMVALDD 1378
SLKT IP+IMLV +VP D S +S + S ++++ + + + SD + +
Sbjct: 1112 SLKT-----IPVIMLVAQVPCDTNMSIEYSSVLDSSQDNSVNVLAEQASPPRSDNSSSEG 1166
Query: 1379 SASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
S + S + D+ A S+RLDISFKSPSHTGLQTT+LV ELT+QFPA+ PLAL+LK+F
Sbjct: 1167 SNTLMGSKMNKDDCDAVKSIRLDISFKSPSHTGLQTTELVCELTQQFPAALPLALILKKF 1226
Query: 1439 LADRSLDQSYSGGLSSYCLML 1459
LADRSLD YSGGLSSYCL L
Sbjct: 1227 LADRSLDHPYSGGLSSYCLHL 1247
Score = 122 bits (306), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 157/336 (46%), Gaps = 79/336 (23%)
Query: 28 QLIDSLTSHISLYHSHSLSSNPNPSSNPRSS-------ILKWFASLTVHQRQAHLTIVDS 80
QL+D+LT+H+S YH+ + S + SS+ S IL+W ASL R A T
Sbjct: 15 QLVDALTAHLSHYHASNPSPASSSSSSSSSPSSSPRAAILRWLASLPPASRAAACTS--- 71
Query: 81 KFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESGRWV 140
+LP S P + + SRGLL+R A ++ + +
Sbjct: 72 ----------------------LLPPAASG-----PTVLSRLSRGLLARAAAASRAHELL 104
Query: 141 FESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGEESE 200
F S LF+S D TV+ + ++D F+ MDEIS G FLR + E
Sbjct: 105 FSSALLFASSPASPRP---------DAITVAEALLADLDAFVAAMDEISGGRFLRCGDGE 155
Query: 201 L------AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGK--------KRGV 246
+ ++ E WLKAKGYY IE F+ N +E+ LR++W G + G
Sbjct: 156 VDLAALACEEFPELPWLKAKGYYVIEEFVANWVEIALRMSWAAAAGGGGGGGKKAVRVGK 215
Query: 247 KLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNA 306
+K+K AG+A+N +WR+KG VDWWM L+ +R +++ GK A A
Sbjct: 216 SVKDK---AGLASNTFWREKGYVDWWMRLELPVRVRIMGAFFGKGAI------------A 260
Query: 307 LEDGMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVE 342
L++G ++ Q+SR SKS QRT+S +E
Sbjct: 261 LDNGC---SSTESQNSRVV-SKSNQRTLSVGDTTIE 292
>gi|413943401|gb|AFW76050.1| hypothetical protein ZEAMMB73_219043 [Zea mays]
Length = 1447
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/776 (52%), Positives = 515/776 (66%), Gaps = 39/776 (5%)
Query: 691 HESETGAISVLPDK---GIEALEIKKESAVTQDQRNESFFGTALKSSLECPSYEWPTIAP 747
H+SE +LP + I K V ++ E + ++ L SYEWP++ P
Sbjct: 610 HKSEVTKDRILPSAIPTDMLQSAISKNGGVMKNGGGECYVYN--RNLLGGTSYEWPSVTP 667
Query: 748 VYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQIL-SQPLP 805
+F S PA DRLHLDVG+ W QPF+P HQ RN P D G NQ+L S +P
Sbjct: 668 SHFVSPQMQQRPAAADRLHLDVGYRWPTQFNQPFLPANHQVRNPPIDAGHNQMLPSLAVP 727
Query: 806 MSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDDEGKCSGD 865
+S DWPP+ + G + + +YD + QS F + +Q SD
Sbjct: 728 LSFDWPPVFR---GYGKNASVSYDPLYTPQMQSSAWSGFPAQLVQRGGICSD-------- 776
Query: 866 FMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDHPGTGFSR 925
+ D+ +S+W SEEE + SG D NQYFGGGVMYW+ ++H GTGFSR
Sbjct: 777 ----------SDVADDTESYWFSEEESDSRAHSGRDINQYFGGGVMYWSPAEHAGTGFSR 826
Query: 926 PPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPTAASFCSPFDPLGPGHQAFS 985
PPSLSSDDS+WAWHEAD+ R VDD+ S TNG++SP + CS + L P +
Sbjct: 827 PPSLSSDDSAWAWHEADVIRVVDDVANGIPSTYTNGVSSPPSTPSCSQNESLDPA----A 882
Query: 986 YVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNL 1045
+++ GNE+ + L S S+ D + E++I ++S + DTLP +LRPI++P++
Sbjct: 883 HLITGNEINNEALTSPSSVQD-SPEDKIMQVAKAVSCGSEVVKADTLPYAMLRPIVVPSI 941
Query: 1046 SRERSRSDFKRSHEHK-SPCVPPSRREQPRIKRPPSPVVLCVPRAPRPPPPSPVSDSRKT 1104
SR SRS+ K +H+H+ SPCVP +RR+ P ++RPPSPVVL VPR PRPPPPSP +SRK
Sbjct: 942 SRRSSRSEIKGAHDHRRSPCVPSTRRDIPILRRPPSPVVLSVPRVPRPPPPSPAGESRK- 1000
Query: 1105 RGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSAHPMIQPLS 1164
RGFP VRSGSSSPRHWG+RG + E +DG EVVWPSW NK S ++Q +
Sbjct: 1001 RGFPIVRSGSSSPRHWGMRGLFSEDKIFHRAQFCLDGPEVVWPSWGNKGTSG-TLVQTIE 1059
Query: 1165 GALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQV 1224
+LQDHL+ ISQL+ DQ HPDVA P+QPL++ N + KASLSLMH+ LHEEID FCKQV
Sbjct: 1060 DTVLQDHLVKISQLSCDQ-HPDVALPVQPLDMLNGSSHKASLSLMHNALHEEIDQFCKQV 1118
Query: 1225 AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNL 1284
AA N R+PYINWAVKRVTR LQVLWPRSRTN+FGSNATGL+LP+SDVDLVV LPPVRNL
Sbjct: 1119 AAGNLVRRPYINWAVKRVTRCLQVLWPRSRTNLFGSNATGLALPTSDVDLVVSLPPVRNL 1178
Query: 1285 EPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDL- 1343
EPIKEAGILEGRNGIKETCLQHAAR L NQ+WV+SDSLKTVENTAIP+IMLV +VP D
Sbjct: 1179 EPIKEAGILEGRNGIKETCLQHAARCLTNQDWVRSDSLKTVENTAIPVIMLVADVPCDTN 1238
Query: 1344 IASAASSVQSPKEDAAHTTL-KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDI 1402
+++ SSV +D + L + + SD + + S CS + D+ S+RLDI
Sbjct: 1239 MSNEYSSVLDGSQDYSVNMLGEQGSPPRSDTSSSEGSNMLMCSKLNKDDCDIVQSIRLDI 1298
Query: 1403 SFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
SFKSPSHTGLQTT+LV+EL++QFPA PLAL+LK+FLADRSLD YSGGLSSYCL+
Sbjct: 1299 SFKSPSHTGLQTTELVRELSQQFPAVIPLALILKKFLADRSLDHPYSGGLSSYCLV 1354
Score = 103 bits (256), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 96/265 (36%), Positives = 139/265 (52%), Gaps = 35/265 (13%)
Query: 28 QLIDSLTSHISLYHSHS-----LSSNPNPSSNPRSSILKWFASLTVHQRQAHLTIVDSKF 82
+L+D+LT+H+SLYH+ + ++ SS+PR++IL+W ASL+ R A T + S
Sbjct: 13 ELMDALTAHLSLYHAAANPSPATAAASTSSSSPRAAILRWLASLSPAARAAAATSLLSPA 72
Query: 83 AQLLIQ-MLGKLRANGHGFFIIL----PDLPSRDPPYLPGLCYKKSRGLLSRVAESNESG 137
A + M+ +LR GH F +L P +R P + + SRGLL+R A S+ +
Sbjct: 73 AAAALLSMVRRLRLRGHSSFFVLHSSSPSTSARGAEE-PTVLSRLSRGLLARAAASSRAQ 131
Query: 138 RWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE 197
+F +T LF S D TV+ FV ++D F+ MDEIS G FL
Sbjct: 132 ALLFANTLLFPSSPASSRRP--------DVITVAEAFVADLDGFVAAMDEISGGRFLCCG 183
Query: 198 ESEL------AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGK-------KR 244
E E+ D+ E WL AKGYY IE F+ NRL++ LR++W G +
Sbjct: 184 EGEVDLAALACQDFPELPWLNAKGYYVIEEFLANRLDIALRMSWAAAGGGSAGGRKAVRI 243
Query: 245 GVKLKEKLNAAGMAANVYWRKKGCV 269
G KEK AG+AAN +WR+KG +
Sbjct: 244 GKSAKEK---AGLAANAFWREKGIL 265
>gi|297810179|ref|XP_002872973.1| hypothetical protein ARALYDRAFT_912245 [Arabidopsis lyrata subsp.
lyrata]
gi|297318810|gb|EFH49232.1| hypothetical protein ARALYDRAFT_912245 [Arabidopsis lyrata subsp.
lyrata]
Length = 504
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 258/486 (53%), Positives = 339/486 (69%), Gaps = 20/486 (4%)
Query: 19 FTNSLMAQNQLIDSLTSHISLYHSHSLSSN-PNPSSNPRSSILKWFASLTVHQRQAHLTI 77
F+ S MAQNQLIDSLTSHISLYHSHS SS+ N NPRS+IL+WF+SL+VHQR +HLT
Sbjct: 12 FSGSSMAQNQLIDSLTSHISLYHSHSSSSSMANSIPNPRSAILRWFSSLSVHQRLSHLTF 71
Query: 78 VDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYLPGLCYKKSRGLLSRVAESNESG 137
VD KF Q+L+QMLG +R G G FIILPDLPS D LP LC+KKSRGL+SRVAESNE
Sbjct: 72 VDPKFVQILLQMLGYIRTKGPGSFIILPDLPSSD---LPSLCFKKSRGLISRVAESNEPE 128
Query: 138 RWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEISNGGFLRGE 197
R+VF+STRLF SREGE ++ SC VN LD+ ++ + + NVDRF++ MD +SNG FLRGE
Sbjct: 129 RFVFDSTRLFGSREGENAQDCSCSVNSLDSVVMADDLLTNVDRFVETMDALSNGAFLRGE 188
Query: 198 ESELAGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGM 257
ES+L +WVE +WLKAKGYYS+EAFI NRLEV LRLAWLN N+GKKRG+KLKEKLNAA
Sbjct: 189 ESDLGSNWVELEWLKAKGYYSMEAFIANRLEVSLRLAWLNTNSGKKRGIKLKEKLNAAAA 248
Query: 258 AANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTHEVLKEASNALEDGMWLFNAG 317
AAN YWRKK CVDWW NLD A RK+ T +LGK+AKS+ +E+L+EA+ A + +WLFN
Sbjct: 249 AANSYWRKKACVDWWQNLDAATHRKIWTCLLGKSAKSVIYEILREANQAQQGDIWLFN-- 306
Query: 318 MKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVLQDITTMVL 377
F ++ + S +S C + + P S+S P ++A+ SGL+VLQ+ ++++
Sbjct: 307 ------FASARKRRTETSEVSF---CDMILEPNSVSRKPITVASNLSGLYVLQEFASLLI 357
Query: 378 SSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELFGEGNFKSSP-NK 436
Q+ ++ +FFSSL ++T DC+LRKLRG LMV+S+D K+EL + K SP +
Sbjct: 358 LCQNGLVPVQSVFFSSLGTITTLVDCILRKLRGFLMVISIDSVKVELLDDNTHKCSPSSS 417
Query: 437 SKEKPSTIGRRKKCRACSTKRQNPLPKSALDE-LSLD-KLPKDPEGALTDTEKVDLMGSD 494
S +K R+ K + + K+ P P++ D+ +SL K K + L T+ + +
Sbjct: 418 SNQKLGLTSRKHKGKTRNLKK--PTPEATSDKNVSLSTKSGKKDQAKLEFTKNREAIECK 475
Query: 495 KVPGIS 500
KVP S
Sbjct: 476 KVPAAS 481
>gi|168032650|ref|XP_001768831.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679943|gb|EDQ66384.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1918
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 314/804 (39%), Positives = 428/804 (53%), Gaps = 110/804 (13%)
Query: 728 GTALKSSLECP------SYEWPTIAPVYFPSISSHLLP-ATDRLHLDVGHNWHNHVRQPF 780
G +L+S P S+EWP + V P S + P AT+RLHLDV H+W +
Sbjct: 1055 GGSLRSQHNTPAFAMFYSHEWPGYSQVRVPG--STVRPTATERLHLDVVHDWPKCLGASS 1112
Query: 781 VPTLHQARNHPFDGGCNQILSQPLPMSLDWPPMVQNVSGIAPSVTCN------------Y 828
V + P P +LDW P+V N + P V +
Sbjct: 1113 VSISNTV---------------PAPGNLDWSPLVHNGYSLTPMVGYADSNEPKVVTLSPF 1157
Query: 829 DSGFISSRQ--SGFQQNFATKGMQFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHW 886
S +SR +G + ++ +D E +GDF DL ++E+G
Sbjct: 1158 SSSPYASRTPPNGHTRLGEDSEVEERLYDTDLEIFDAGDFGDLDGYLVSEEEG------- 1210
Query: 887 LSEEELEVHT-VSGIDYNQYFGGGVMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKR 945
E ++E + V DYNQ FGGGVMYWN++D+ G G+SRP S+SSDDSSWA EAD+
Sbjct: 1211 --ERQIEDNAHVKAEDYNQVFGGGVMYWNSADYTGMGYSRPGSMSSDDSSWARREADLGA 1268
Query: 946 AVDDMVA--FSSSYSTNGLTSPTAASFCS---------PFDPLGPGHQAFSYVVPGNEVP 994
+DD+V + Y + G TS +S PFD +G G++V
Sbjct: 1269 VLDDIVGIPYGVPYGSKGSTSSNLSSTSPSQPSTSLPVPFDQIG-----------GSQVG 1317
Query: 995 GKVLHSSSTTTDVAT-------EEEISGSFASLSGDVDSKALDTLPCPILRPIII---PN 1044
+V + T +D + + + +G+ S V + P LRPI++ P
Sbjct: 1318 PRVCLTRPTGSDFSVSYRAFSDDSQTAGTNVGNSAGVVDGIMSDGFMPRLRPIVVMRDPG 1377
Query: 1045 LSRERSRSDFKRSHEHKSPCV--------PPSRREQPRIKRPPSPVVLCVPRAPRPPPPS 1096
LSR +++ R E +SP + P RR +RP PV AP PPPPS
Sbjct: 1378 LSRSMGKNEAVRLRELRSPHIVSRGAPDFPGDRR-----RRPSPPVTR---HAPPPPPPS 1429
Query: 1097 PVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPSWRNKNLSA 1156
PV+ ++ +G+ + RSGSSSPRHWG+ W+++ + R L++
Sbjct: 1430 PVAGLKRRKGWMSARSGSSSPRHWGLTSWWNKDDVDNFEVQAAGRDNALENGRRMAILAS 1489
Query: 1157 HPMIQPLSGALLQDHLIAISQLARDQEH-PDVAFPLQPLEVQN-CPTRKASLSLMHSLLH 1214
P I+P+S L +H +AI A DQEH D A +Q + N P+ + ++L H LH
Sbjct: 1490 TPPIRPMSRVLPLEHSVAIHSSALDQEHMADEALVMQSTVLFNKNPSLQLVITLFHKSLH 1549
Query: 1215 EEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
+EI++FC QVAAE R P+IN AVK+VT SLQVLWPRSR IFGSNATGL+LP+SDVD+
Sbjct: 1550 KEIETFCLQVAAEKRLRMPFINTAVKKVTHSLQVLWPRSRAKIFGSNATGLALPTSDVDI 1609
Query: 1275 VVCLPPVRNLE-PIKEAGILEGR-NGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
VV LPPVR PIK+AGILEGR +G+KETCL HAAR L +Q+WVKS L+ E T +PI
Sbjct: 1610 VVSLPPVRKARNPIKQAGILEGRIDGVKETCLLHAARNLEDQDWVKS--LQVFERTMVPI 1667
Query: 1333 IMLVVEV-PHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
I L + PH + + SV+ + + + D + P S+
Sbjct: 1668 IRLEAHILPHQCELGDISLSVKETCPMGSSNDAQDGSPTSQDSGDSGEKEGPWGSN---- 1723
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
K VRLDISF+S ++TGL+T++LV+EL QFP TPLALV KQFLADRSLD +Y G
Sbjct: 1724 --KDRQIVRLDISFESLANTGLRTSELVRELVGQFPPLTPLALVSKQFLADRSLDHAYKG 1781
Query: 1451 GLSSYCLMLLITRFLQHEHHLGRP 1474
GLSSYC +LLITRFLQH+ HLGRP
Sbjct: 1782 GLSSYCQVLLITRFLQHQQHLGRP 1805
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 165/356 (46%), Gaps = 64/356 (17%)
Query: 111 DPPYLPGLCYKKSRGLLSRVAESNESGRWVFESTRLFSSREGEK--IEEWSCPVNCLDTF 168
D LPGLCY+K+ GLL+R+ + + ++ ++FSS++G K I S PV D
Sbjct: 315 DSVTLPGLCYRKANGLLARLKSQQLAEEKLRKAVQIFSSQDGHKSLISCGSEPV--FDAL 372
Query: 169 TVSVEFVENVDRFIDIMDEISNGGFLRGEESELAGDWVEFDWLKAKGYYSIEAFIVNRLE 228
++S E EN+D F+++M EIS G FL A W E WL+ +GYYS+ AFI N+LE
Sbjct: 373 SLSSELAENLDVFLEVMGEISYGEFLEHPLGLTASPWEETPWLQVQGYYSVAAFIANKLE 432
Query: 229 VGLRLAWLNCNNGKKRGVK----------LKEK---------LNAAGMAANVYWRKKGCV 269
+ + ++ + + G KR K L+ K L +A A +++ RK+GC+
Sbjct: 433 ICIWSSY-SLSMGSKRSFKNQWIIKTSESLRSKWTSGRSVDVLESAAAAQSLHERKRGCI 491
Query: 270 DWWMNLDDAMRRKVLTVILGKAAK--------------------SLTHEVLKEASNALED 309
DWW +D ++ K + L A K +HE+L+ ++
Sbjct: 492 DWWNRVDCDLKEKTVRSALIAATKFEVISAVSSARSAIRSIKPAHRSHELLEATKYRGKN 551
Query: 310 GMWLFNAGMKQSSRFYHSKSLQRTISTLSVDVECGLAISPASLSGIPASLATVFSGLFVL 369
W + + +H L+ + ST + +SP L G+ A SG+
Sbjct: 552 TRW---TSLSDTEVEHHKLDLKNSWSTST--------LSPL-LLGLAALKEAAESGILSG 599
Query: 370 QDITTMVLSSQHNEYDIEKIFFSSLRFVSTTTDCLLRKLRGLLMVVSLDCTKLELF 425
+ +T S+ +FFS+L V T D ++R+ R +L V + + EL
Sbjct: 600 ESLTIAPRSNN--------LFFSNLESVHTLPDRIVRRSRNVLARVMNEAVEYELL 647
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 57/104 (54%), Gaps = 16/104 (15%)
Query: 22 SLMAQNQLIDSLTSHISLYHSHS------------LSSNPNPSSNP----RSSILKWFAS 65
S M Q +D+LT H+++YH + L +NP P + + +++W +
Sbjct: 136 STMDTQQAVDALTDHMAIYHRSAERLKAGRSTGGDLKNNPKPGAKGGNVVKQRVIEWLSE 195
Query: 66 LTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGFFIILPDLPS 109
L+ QRQA LT+ D + LL+QM KL NG G FIILPD+P+
Sbjct: 196 LSTSQRQAVLTVFDRSWVLLLLQMQQKLNENGPGNFIILPDIPA 239
>gi|297725171|ref|NP_001174949.1| Os06g0669275 [Oryza sativa Japonica Group]
gi|255677308|dbj|BAH93677.1| Os06g0669275, partial [Oryza sativa Japonica Group]
Length = 615
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 243/511 (47%), Positives = 323/511 (63%), Gaps = 11/511 (2%)
Query: 732 KSSLECPSYEWPTIAPVYFPSISSHLLPAT-DRLHLDVGHNWHNHVRQPFVPTLHQARNH 790
+++L SYEWP++AP +F S PAT DRLHLDVG+ W QPF+P HQ RN
Sbjct: 110 RNTLGGTSYEWPSVAPHHFVSPEMQQRPATTDRLHLDVGYRWPAQFEQPFLPPNHQMRNP 169
Query: 791 PFDGGCNQILSQ-PLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGM 849
P + GCNQ+LS +P+S DWPP+ + + + YD + QS F +
Sbjct: 170 PVEAGCNQVLSSLSVPLSFDWPPVFRGYGKLNQNNALGYDPLYTPQMQSSAWSGFHAPLI 229
Query: 850 QFNAKTSDDEGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGG 909
Q + ++ + K GD +P + GD+ +S+W SEEE + +SG D NQYFGGG
Sbjct: 230 QRGSICNEKDRKYFGD----SDPRNKSDVGDDTESYWFSEEESDGRGLSGRDINQYFGGG 285
Query: 910 VMYWNTSDHPGTGFSRPPSLSSDDSSWAWHEADIKRAVDDM-VAFSSSYSTNGLTSPTAA 968
VMYW+ ++H GTGFSRPPSLSSDDS+WAWHEAD+ + +DD+ V +Y+ N ++SP +
Sbjct: 286 VMYWSPAEHAGTGFSRPPSLSSDDSAWAWHEADVSQVLDDIPVGIPCTYNANSVSSPPST 345
Query: 969 SFCSPFDPLGPGHQAFSYVVPGNEVPGKVLHSSSTTTDVATEEEISGSFASLSGDVDSKA 1028
S CS + P Q+ + + N + + L S S+ D ++ IS S S+S +
Sbjct: 346 SVCSQNESPDPLPQSVGHSMTANGMNSESLQSPSSMQDSPEDKTISVS-KSVSCGSEVIK 404
Query: 1029 LDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVLCVPR 1088
DTLP +LRPI++P +SR SRS+FK H+H+SPCVP +RR+ P ++RPPSPVVL VPR
Sbjct: 405 GDTLPYAMLRPIVVP-ISRRPSRSEFKGGHDHRSPCVPSNRRDIPLVRRPPSPVVLSVPR 463
Query: 1089 APRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEVVWPS 1148
PRPPPPSPV +SRK RGFP VRSGSSSPRHWG+R + E +DG EVVWPS
Sbjct: 464 VPRPPPPSPVGESRK-RGFPIVRSGSSSPRHWGMRSLFTEDKILNRAQFCLDGPEVVWPS 522
Query: 1149 WRNKNLSAHPMIQPLSGALLQDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSL 1208
W NK A + Q + +LQDHL+ ISQL+RDQ HPDVA PLQP ++ NC + KASLSL
Sbjct: 523 WGNKGSPAGTLGQSIEDTVLQDHLVKISQLSRDQ-HPDVALPLQPPDMLNCSSSKASLSL 581
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAV 1239
MH+ LHEEID FCKQV Y+ + +
Sbjct: 582 MHNALHEEIDQFCKQVCPFCATSFMYVRYVI 612
>gi|345291705|gb|AEN82344.1| AT4G00060-like protein, partial [Capsella grandiflora]
gi|345291707|gb|AEN82345.1| AT4G00060-like protein, partial [Capsella grandiflora]
gi|345291709|gb|AEN82346.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291711|gb|AEN82347.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291713|gb|AEN82348.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291715|gb|AEN82349.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291717|gb|AEN82350.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291719|gb|AEN82351.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291721|gb|AEN82352.1| AT4G00060-like protein, partial [Capsella rubella]
gi|345291723|gb|AEN82353.1| AT4G00060-like protein, partial [Neslia paniculata]
Length = 197
Score = 253 bits (647), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 131/228 (57%), Positives = 159/228 (69%), Gaps = 33/228 (14%)
Query: 739 SYEWPTIAPVYFPSISSHLLPATDRLHLDVGHNWHNHVRQPFVPTLHQARNHPFDGGCNQ 798
SYEWP +AP+YF ++SHL ATDRLHLDVGHN H +VRQPFV ++H RN +G Q
Sbjct: 2 SYEWPAVAPMYFSHVNSHLPTATDRLHLDVGHNLHAYVRQPFVSSVHHTRNPSIEGSHKQ 61
Query: 799 ILSQPLPMSLDWPPMVQNVSGIAPSVTCNYDSGFISSRQSGFQQNFATKGMQFNAKTSDD 858
+LS+P+PMSLDWPPMV + G+ + CNYDSG +
Sbjct: 62 VLSRPMPMSLDWPPMVHSNCGLTTAFACNYDSGIL------------------------- 96
Query: 859 EGKCSGDFMDLPEPTTTQEQGDECDSHWLSEEELEVHTVSGIDYNQYFGGGVMYWNTSDH 918
+D+PE E G+EC+++W+ EE+ E+HTVSG+DYNQYFGGGVMYWN SDH
Sbjct: 97 --------VDIPEQKNKPELGNECENNWMLEEDFEMHTVSGVDYNQYFGGGVMYWNPSDH 148
Query: 919 PGTGFSRPPSLSSDDSSWAWHEADIKRAVDDMVAFSSSYSTNGLTSPT 966
GTGFSRPPSLSSDDSSWAWHEA++KR+VDDMVAFSSSYS NGL SPT
Sbjct: 149 LGTGFSRPPSLSSDDSSWAWHEAEMKRSVDDMVAFSSSYSANGLDSPT 196
>gi|428166591|gb|EKX35564.1| hypothetical protein GUITHDRAFT_118270 [Guillardia theta CCMP2712]
Length = 1889
Score = 172 bits (437), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/266 (36%), Positives = 148/266 (55%), Gaps = 40/266 (15%)
Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
LMH++L ++ F +++ +E R Y A+ ++ + LWPR++ IFGS AT LS+
Sbjct: 1485 LMHAMLDSDMLLFSRRIQSELDRRSHYRLCAIAKIQSAASSLWPRAQCKIFGSVATSLSV 1544
Query: 1268 PSSDVDLVVCLPPV--RNLEP-IKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
P+SDVD++VCLP V R EP + AG++EGR IK T + A+ LAN +W++S++LKT
Sbjct: 1545 PTSDVDIIVCLPKVMSRESEPSLARAGVMEGRRVIKGTLQANLAQVLANADWIQSETLKT 1604
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
+EN +P+I + P L LD S
Sbjct: 1605 IENAVVPVITFSTK-PMSLF-------------------------------LDPS----- 1627
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
H +S+ + + +RLD++F+ P H GL LV+ L + FPA L V+K L+D+ L
Sbjct: 1628 QHKASEEEEKISVIRLDVTFEGPDHRGLPAVRLVQSLVQDFPALKHLFFVVKSLLSDKGL 1687
Query: 1445 DQSYSGGLSSYCLMLLITRFLQHEHH 1470
D Y+GGL+S+ L+LL+TR+LQH +
Sbjct: 1688 DNPYTGGLTSFGLVLLVTRYLQHSNQ 1713
>gi|387219009|gb|AFJ69213.1| hypothetical protein NGATSA_2001900, partial [Nannochloropsis
gaditana CCMP526]
gi|422292689|gb|EKU19991.1| hypothetical protein NGA_2001900, partial [Nannochloropsis gaditana
CCMP526]
Length = 1119
Score = 170 bits (431), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/272 (37%), Positives = 147/272 (54%), Gaps = 18/272 (6%)
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
+ S L ++I +F ++V+ +R P A+ R + +Q LWPR++ FGS +GL+LP
Sbjct: 694 IRSRLCDDIAAFVERVSVCTGSRLPQQTVAINRCRQVIQSLWPRAQVKAFGSFVSGLALP 753
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
SSDVDL++CLP VR P + G+LEGRN IKET Q AR L WV S+K + +T
Sbjct: 754 SSDVDLIICLPNVRRDAPAQAPGVLEGRNAIKETWQQELARRLRATSWVNPASIKIISHT 813
Query: 1329 AIPIIMLVVEV----------PHDLIASAASSVQSPK--EDAAHTTLKHDNHVHSDMVAL 1376
AIP+I + VE P + +V S + E AA + H +D+ A
Sbjct: 814 AIPVIKMTVESQPRPSPASPHPTSPVEGGTGTVASGQHSEMAAASQRPHSCRNEADLSAE 873
Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
+ + T V LD+S + H GL ++ +L + A PL LVLK
Sbjct: 874 PEGVALNMGWT------VEYGVHLDVSIEGGQHNGLLANRVIAKLLGENFALRPLVLVLK 927
Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQHE 1468
QF+ +R L +SYSGGLSSY L+L++ R+LQ +
Sbjct: 928 QFMKERGLMESYSGGLSSYGLVLMVARYLQEQ 959
>gi|348685799|gb|EGZ25614.1| hypothetical protein PHYSODRAFT_297213 [Phytophthora sojae]
Length = 1731
Score = 167 bits (424), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 101/259 (38%), Positives = 134/259 (51%), Gaps = 56/259 (21%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
S LH ++ F Q+ A++ AR P A + T ++Q LWPR++ +GS+ + L LPSS
Sbjct: 1062 SKLHWDLCEFAAQLQADSDARLPAQLAAQRLCTATVQSLWPRAQVRPYGSHVSRLVLPSS 1121
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
DVDLV+CLP VR P AG+LEGRN IKET Q+ AR L + WV DS+KT+ + A+
Sbjct: 1122 DVDLVICLPKVRRDAPADAAGVLEGRNAIKETWQQNLARKLRQEPWVVRDSVKTLPHAAV 1181
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
PII L+ P++ V LD
Sbjct: 1182 PIITLLTAPPYN-------------------------------VRLD------------- 1197
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
ISF+ P H GL T D+V L +FP P+ LVLK F +R L +YSG
Sbjct: 1198 -----------ISFEGPGHNGLATNDVVLSLLHEFPPLAPMMLVLKSFAIERGLAVAYSG 1246
Query: 1451 GLSSYCLMLLITRFLQHEH 1469
GLSSY L+LL+ R+LQ EH
Sbjct: 1247 GLSSYALLLLVARYLQ-EH 1264
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 113/278 (40%), Gaps = 64/278 (23%)
Query: 56 RSSILKWFASLTVHQRQAHLTIVDSKFAQLLIQMLGKLRANGHGF--FIILPDL------ 107
+ ++LKW L+ + + + D F + L+ M + R + G F +LP
Sbjct: 31 KRAVLKWLRGLSGEELASLCCVEDVGFVKTLLHMAARSRGSSGGVQEFQLLPQTVGAGSG 90
Query: 108 ----------PSRDPP--YLPGLCYKKSRG-----LLSRVAESNESGRWVFESTRLFSSR 150
P R P ++ K + G SR E ES R + R+ ++
Sbjct: 91 VQSKRTATKTPPRTAPREFVKRPVVKTADGDVVGSFCSR--EYEESSRKLLSGMRVLNTL 148
Query: 151 EGEKIEEWSCPVNCLDTFTVSVEFVENVDR--------FIDIMDEISNGGFLRG--EESE 200
+ SC DT +S+EF V+R +M+ IS G FL ES
Sbjct: 149 Q-------SC-----DTVALSMEFFHPVERKKGAKFAECFQLMEVISRGAFLIECPPEST 196
Query: 201 LAGD-WVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNCNNGKKRGVKLKEKLNAAGMAA 259
L W E WLK +GYYS++A VN++E+ + W K+R K+ G+A+
Sbjct: 197 LKNRVWGETKWLKEQGYYSLQALFVNQIELNVWAFW------KQRVKDASTKIPLQGLAS 250
Query: 260 NVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSLTH 297
++ V W DA R L V+ K+++H
Sbjct: 251 KLH-----LVREWEAASDAQRTAALKVL---GDKTISH 280
>gi|325184829|emb|CCA19321.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1322
Score = 164 bits (415), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 55/256 (21%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
S LH ++ F + A++ RK A + T ++Q LWPR++ +GS T L+LPSS
Sbjct: 695 SRLHWDMCEFVTLLQAQSNGRKSAQTAAQRLCTTTVQSLWPRAQVRAYGSFVTQLALPSS 754
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
D+DLV+CLP VR+ P + AG+LEGRN IKET Q+ AR L + WV +S+KT+ + A+
Sbjct: 755 DLDLVICLPKVRHDAPAEAAGVLEGRNAIKETWQQNLARKLRREPWVVPESVKTIPHAAM 814
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
PII L+ + P++ V LD
Sbjct: 815 PIITLLTKAPYN-------------------------------VRLD------------- 830
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
ISF+ P H GL T D+V +FP+ PL LVLK F+ +R +YSG
Sbjct: 831 -----------ISFEGPGHNGLATNDVVHAFIHEFPSVVPLMLVLKSFVIERGYAFAYSG 879
Query: 1451 GLSSYCLMLLITRFLQ 1466
GLSSY L+L++ R+LQ
Sbjct: 880 GLSSYALLLMVVRYLQ 895
>gi|219121652|ref|XP_002181176.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407162|gb|EEC47099.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1458
Score = 164 bits (414), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 93/265 (35%), Positives = 130/265 (49%), Gaps = 53/265 (20%)
Query: 1205 SLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATG 1264
SL M S L ++I F + AV+R+TR + +WPR++ ++GS+ TG
Sbjct: 1100 SLRGMQSRLSKDILHFLDATNMHLRKQDSKRMKAVERMTRLVNTVWPRAQVKVYGSHVTG 1159
Query: 1265 LSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
L LPSSD+D V+CLP V P G LEGRN I ET + AR L ++ W+ S+K
Sbjct: 1160 LCLPSSDLDFVICLPAVHKRAPAVAPGALEGRNAINETSQKLLARKLKSESWIDPRSMKL 1219
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
+E +P+I +
Sbjct: 1220 IERAVVPVITV------------------------------------------------- 1230
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
S+ + KA T ++LDISF P H GLQ D+V E+ E+ P PL L+LKQFL DR L
Sbjct: 1231 ---STKDTKART-IQLDISFDGPGHHGLQAIDMVSEILEELPMLRPLVLILKQFLLDRGL 1286
Query: 1445 DQSYSGGLSSYCLMLLITRFLQHEH 1469
+Y+GGLSSYCL L++ R+LQ +
Sbjct: 1287 LTAYTGGLSSYCLFLMVARYLQEQQ 1311
>gi|301111622|ref|XP_002904890.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095220|gb|EEY53272.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 1620
Score = 164 bits (414), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 100/259 (38%), Positives = 132/259 (50%), Gaps = 56/259 (21%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
S LH ++ F Q+ A++ AR P A + ++Q LWPR++ +GS+ T L LPSS
Sbjct: 958 SKLHWDLCEFAAQLQADSDARLPAQLAAQRLCMATVQSLWPRAQVRPYGSHVTRLVLPSS 1017
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
DVDLV+CLP VR P AG+LEGRN IKET Q+ AR L + WV DS+KT+ + A+
Sbjct: 1018 DVDLVICLPKVRRDAPADAAGVLEGRNAIKETWQQNLARKLKQEPWVVRDSVKTLPHAAV 1077
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
PII L+ P++ V LD S
Sbjct: 1078 PIITLLTAPPYN-------------------------------VRLDIS----------- 1095
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
F+ P H GL T D+V L +FP P+ LVLK F +R L +YSG
Sbjct: 1096 -------------FEGPGHNGLATNDVVLALLHEFPPLAPMMLVLKSFAIERGLAVAYSG 1142
Query: 1451 GLSSYCLMLLITRFLQHEH 1469
GLSSY L+LL+ R+LQ EH
Sbjct: 1143 GLSSYALLLLVARYLQ-EH 1160
>gi|242096680|ref|XP_002438830.1| hypothetical protein SORBIDRAFT_10g026820 [Sorghum bicolor]
gi|241917053|gb|EER90197.1| hypothetical protein SORBIDRAFT_10g026820 [Sorghum bicolor]
Length = 303
Score = 138 bits (348), Expect = 2e-29, Method: Composition-based stats.
Identities = 103/299 (34%), Positives = 148/299 (49%), Gaps = 42/299 (14%)
Query: 28 QLIDSLTSHISLYHSHSLSSNPNPSSNPRSS----------ILKWFASLT-VHQRQAHLT 76
+L+D+LT+H+SLYH+ + S +S SS IL+W ASLT + A +
Sbjct: 13 ELMDALTAHLSLYHAAANPSPAAAASASSSSSSSSSSPRAAILRWLASLTPAARAAAAAS 72
Query: 77 IVDSKFAQLLIQMLGKLRANGHGFFIILPDLPSRDPPYL-------PGLCYKKSRGLLSR 129
++ A L+ M+ +LR GH F +L S P + + SRGLL+R
Sbjct: 73 LLSPAAAAALLSMVRRLRLRGHSSFFVLHSSSSSPSTSGSARGAEEPTVLSRLSRGLLAR 132
Query: 130 VAESNESGRWVFESTRLFSSREGEKIEEWSCPVNCLDTFTVSVEFVENVDRFIDIMDEIS 189
A + + +F + LF S C D TV+ F+ ++D F+ MDEIS
Sbjct: 133 AAAGSRAQTLLFANILLFPSSPASS--------RCPDAITVAEAFLADLDGFVAAMDEIS 184
Query: 190 NGGFLRGEESEL------AGDWVEFDWLKAKGYYSIEAFIVNRLEVGLRLAWLNC----- 238
G FL E E+ D+ E WL KGYY IE F+ NRLE+ LR++W
Sbjct: 185 GGRFLCCGEGEVDLAVLACQDFPELPWLNDKGYYVIEEFLANRLEIALRMSWAAAGGGGV 244
Query: 239 --NNGKKRGVKLKEKLNAAGMAANVYWRKKGCVDWWMNLDDAMRRKVLTVILGKAAKSL 295
+ G +KEK AG+AAN +WR+KG VDWWM L+ MR +++ GK A +L
Sbjct: 245 GGRKAVRIGKGVKEK---AGLAANAFWREKGYVDWWMRLEPRMRARIIGAFFGKGAIAL 300
>gi|323454423|gb|EGB10293.1| hypothetical protein AURANDRAFT_71230 [Aureococcus anophagefferens]
Length = 2400
Score = 132 bits (333), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/264 (34%), Positives = 126/264 (47%), Gaps = 52/264 (19%)
Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
S L +++ +F ++V+ E AR A ++V +++ LWPR+R +GS T LSL
Sbjct: 840 QSRLADDVRAFAEKVSREQRARDFAWRCAREKVRGAVRSLWPRARVEPYGSCVTRLSLAG 899
Query: 1270 -SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
SD+DLV+ LP VR P G LEGRN IKET Q AR L ++ WV + S++ V +
Sbjct: 900 VSDLDLVIRLPRVRVATPAMTPGDLEGRNAIKETWPQELARRLRSEPWVAAMSVRVVASA 959
Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
+P++ L V L D
Sbjct: 960 RVPVVKLAT------------------------------------VPLGD---------- 973
Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
++ A RLD+SF+ P H GL LV + Q P + + LVLKQ ADR L SY
Sbjct: 974 ---VREAV--RLDVSFEGPWHRGLDANRLVGRVLGQHPHARQILLVLKQHAADRGLCASY 1028
Query: 1449 SGGLSSYCLMLLITRFLQHEHHLG 1472
+GGLSSY L LL+ RFL + G
Sbjct: 1029 TGGLSSYALTLLVARFLSEQPRRG 1052
>gi|298705070|emb|CBJ28529.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2789
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/259 (34%), Positives = 117/259 (45%), Gaps = 68/259 (26%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
S L E++ F V E R W A +R ++ LWPR++ +GS TGLSLP
Sbjct: 1666 SRLCEDLSRFLHLVT-ERERRTFDAKWVAYQRCRAVVRGLWPRAQVKPYGSFVTGLSLPW 1724
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+DLV+CLP VR P G LEGRN IKET Q AR L + WV +N +
Sbjct: 1725 SDLDLVICLPKVRRDAPADTPGALEGRNAIKETWQQELARSLGGERWV--------DNNS 1776
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+ II SHT+
Sbjct: 1777 LKII---------------------------------------------------SHTAV 1785
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
IK T F+ H G LVK+L + FPA PLAL+LKQF+ ++ L SY+
Sbjct: 1786 PVIKLTT-------FEGQGHNGQAANQLVKDLVDGFPALRPLALLLKQFMMEKGLSVSYT 1838
Query: 1450 GGLSSYCLMLLITRFLQHE 1468
GGLSSY L L++ R+LQ +
Sbjct: 1839 GGLSSYALTLMVARYLQEQ 1857
>gi|440803146|gb|ELR24057.1| nucleotidyltransferase domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 1552
Score = 108 bits (269), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 128/272 (47%), Gaps = 49/272 (18%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LH+EI+ + + + + R+ Y + ++ + LW S +GS AT L +PSSD
Sbjct: 1217 FLHKEIEDYVDHIVSLTSVRRVYQELMISKIRAIVAKLWSNSLVECYGSYATSLCIPSSD 1276
Query: 1272 VDLVVCLPPVRNL----EPIKEAGILEGRNGIKE--TCLQHAARYLANQEWVKSDSLKTV 1325
+DLVV L P L + ++ N ++ T +Q L +WV SL+ +
Sbjct: 1277 LDLVVLLSPASTLLCGYKSVETNNAGSNDNLLRRDWTHMQILVEELRKHDWVH--SLQFI 1334
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ +P+I L+ D H + +D + S S
Sbjct: 1335 DTAKVPVIKLMTVA--------------------------DGHT----ILMDVTFSFSNS 1364
Query: 1386 HTSSDNIKAATSVRL-DISFKSPS----------HTGLQTTDLVKELTEQFPASTPLALV 1434
++S + A+T+ L D+ F S H+G+ + LV+ PA TPLALV
Sbjct: 1365 TSTSSPLHASTAESLADLPFNPTSLLRSNASTFEHSGIAASHLVRSYIRSLPALTPLALV 1424
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL++R L+ +Y+GGLSSYCL+L+I FL+
Sbjct: 1425 LKQFLSERGLNNTYTGGLSSYCLVLMIVAFLR 1456
>gi|224002835|ref|XP_002291089.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972865|gb|EED91196.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1570
Score = 100 bits (249), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 74/127 (58%)
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
M S L +I F + + ++ + WAV+ +TR++ +WPR++ +GS+ T L LP
Sbjct: 1407 MQSRLSTDIHRFMRALKSQLKKNERRRLWAVENITRAVNAVWPRAQIKQYGSHVTKLCLP 1466
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
SSD+D V+CLP V P G LEGRN I ET + +R L + W+ S+K +E T
Sbjct: 1467 SSDMDFVICLPAVHKNAPAVAPGDLEGRNAINETNQKVLSRKLKGEPWLDQRSIKVIERT 1526
Query: 1329 AIPIIML 1335
A+P+I +
Sbjct: 1527 AVPVIKV 1533
>gi|357620388|gb|EHJ72600.1| hypothetical protein KGM_01329 [Danaus plexippus]
Length = 564
Score = 94.4 bits (233), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 115/274 (41%), Gaps = 80/274 (29%)
Query: 1195 EVQNCPTRKASLSLMHSLL--HEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + + +L HEEI+ F ++ T V R+ ++ LWP+
Sbjct: 94 EYGGCPWRIPNYNYKPGVLGLHEEIEHFYMYMSPSETEHLVRTT-VVTRIRSAILSLWPQ 152
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
+R +FGS TGL LP+SD+DLVV ++ L R L
Sbjct: 153 ARVEVFGSFRTGLYLPTSDIDLVVI-------------------GQWEKLPLWTLERELV 193
Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
Q+ + DS+K +E +P
Sbjct: 194 AQDIAEQDSIKVLEKATVP----------------------------------------- 212
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
+V + D S V++DISF S G+++ +L+K+ EQ+P + L
Sbjct: 213 IVKMTDKYS---------------DVKVDISFNMSS--GVKSAELIKQFKEQYPELSRLV 255
Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 256 MVLKQFLLQRDLNEVFTGGISSYSLILMCISFLQ 289
>gi|298715261|emb|CBJ27910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 2092
Score = 93.2 bits (230), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 117/251 (46%), Gaps = 37/251 (14%)
Query: 1220 FCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
F +++ R+ + + RV + +W +R ++GS TGL LPSSDVD+VVC
Sbjct: 1430 FMQKLGRAMRPRRRAWSQVIARVRELVSGIWKEARVEMYGSCFTGLELPSSDVDVVVCG- 1488
Query: 1280 PVRNLEPIKEAGILEGRNGIKETC-LQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVE 1338
+ L+ G G + ++ LQ A L Q WV+ +K +E A+P+I ++
Sbjct: 1489 -INGLK-YSYGGSRAGYSSMQTVANLQKLANVLQEQTWVR--GMKVIETAAVPVIKILA- 1543
Query: 1339 VPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSV 1398
D +++ + DDS + + + +
Sbjct: 1544 ---DPVSNG-----------------------NGQFMFDDSRNGRDGWLGGGS--GLDYI 1575
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTE--QFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
LDISF+SP H G+ ++ V++ + ++ + P +VLK+ L R L+Q +SGGLSSY
Sbjct: 1576 PLDISFESPQHGGIASSKWVRDCIQGGKYGFALPAVMVLKEILCQRGLNQPWSGGLSSYS 1635
Query: 1457 LMLLITRFLQH 1467
L+ ++ LQ
Sbjct: 1636 LINMMITVLQR 1646
>gi|281209393|gb|EFA83561.1| hypothetical protein PPL_02627 [Polysphondylium pallidum PN500]
Length = 1489
Score = 91.7 bits (226), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 115/266 (43%), Gaps = 71/266 (26%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI F V + ++ + ++ LWP + I+GS TGL LPSSDV
Sbjct: 1095 LHQEILDFVSDVGERTYPHVQNCHQVIEMIRTVVKRLWPNAELEIYGSFYTGLWLPSSDV 1154
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV + K+ S+++ +N +
Sbjct: 1155 DIVV--------------------------------------NYGKNMSIRS-KNAQFLL 1175
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
+L ++ HDL N+ M+ + + P + +NI
Sbjct: 1176 EVLEKQIRHDL-----------------------NNFILTMMCIPSARIPVIKLVTKNNI 1212
Query: 1393 KAATSVRLDISFK-SPS--HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+DISF+ SP+ HTG+ +LV E E PLA+VLK FL +R L+ +YS
Sbjct: 1213 A------VDISFRESPTSPHTGIAARNLVSENVETLVGLYPLAIVLKWFLRERGLNNTYS 1266
Query: 1450 GGLSSYCLMLLITRFLQHEHHLGRPI 1475
GGLSSYCL+L++ FL++ H + I
Sbjct: 1267 GGLSSYCLVLMLISFLKNNEHCPKKI 1292
>gi|313232447|emb|CBY24115.1| unnamed protein product [Oikopleura dioica]
Length = 887
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 110/260 (42%), Gaps = 78/260 (30%)
Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F + E T R V RV +++ +P ++ +FGS TGL LP+
Sbjct: 144 LHEEIIDFHNWIRSTPEEYTMRHD----VVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPT 199
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAAR-YLANQEWVKSDSLKTVENT 1328
SD+D+VV +E R G + + + L Q + S+K +++
Sbjct: 200 SDIDMVVL------------GEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVIDSA 247
Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
A+PII + DM+
Sbjct: 248 AVPIIKM-----------------------------------RDMI-------------- 258
Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
T +++DISF TG+ LVK QFPA L LVLKQFL R +++ +
Sbjct: 259 -------TDIKVDISFNM--KTGVTAIGLVKGYIRQFPALRYLVLVLKQFLLQRDMNEVW 309
Query: 1449 SGGLSSYCLMLLITRFLQHE 1468
+GG+SSY L+L++ FLQH+
Sbjct: 310 TGGISSYGLILMVVSFLQHQ 329
>gi|320169914|gb|EFW46813.1| hypothetical protein CAOG_04771 [Capsaspora owczarzaki ATCC 30864]
Length = 1503
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/254 (24%), Positives = 105/254 (41%), Gaps = 62/254 (24%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH++I F + V+ + P + A+ V + +WP S ++GS + GLSLP SD+
Sbjct: 1157 LHDDILRFYQYVSKTAQSNAPNVQTAIHHVQGVVLSIWPNSYVVLYGSYSYGLSLPHSDI 1216
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
DLVV PP + ++ + A L W ++ +E + IP+
Sbjct: 1217 DLVVVRPP-----------MAPNSGAPAQSAVAELAARLTAFPWCS--GVRAIETSRIPV 1263
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
+ V + ++ P + + H
Sbjct: 1264 VKFVATI---------GGLRIPVDVSFH-------------------------------- 1282
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
D S SH+G+ D V+ +P PL +VLKQ+L + L+++YSGGL
Sbjct: 1283 --------DTSRGPLSHSGILARDFVRSQISAYPMLAPLVMVLKQYLHSKGLNEAYSGGL 1334
Query: 1453 SSYCLMLLITRFLQ 1466
+SY ++LL+T LQ
Sbjct: 1335 TSYGIVLLVTFLLQ 1348
>gi|313242854|emb|CBY39607.1| unnamed protein product [Oikopleura dioica]
Length = 833
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 110/260 (42%), Gaps = 78/260 (30%)
Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F + E T R V RV +++ +P ++ +FGS TGL LP+
Sbjct: 90 LHEEIIDFHNWIRSTPEEYTMRHD----VVLRVEEAIKQEFPGAQVEVFGSFQTGLYLPT 145
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAAR-YLANQEWVKSDSLKTVENT 1328
SD+D+VV +E R G + + + L Q + S+K +++
Sbjct: 146 SDIDMVVL------------GEKIEPRYGNPQNGPHYRLQDRLLKQGIAERYSIKVIDSA 193
Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
A+PII + DM+
Sbjct: 194 AVPIIKM-----------------------------------RDMI-------------- 204
Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
T +++DISF TG+ LVK QFPA L LVLKQFL R +++ +
Sbjct: 205 -------TDIKVDISFNM--KTGVTAIGLVKGYIRQFPALRYLVLVLKQFLLQRDMNEVW 255
Query: 1449 SGGLSSYCLMLLITRFLQHE 1468
+GG+SSY L+L++ FLQH+
Sbjct: 256 TGGISSYGLILMVVSFLQHQ 275
>gi|301119917|ref|XP_002907686.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106198|gb|EEY64250.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 647
Score = 89.0 bits (219), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 110/258 (42%), Gaps = 77/258 (29%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI ++ + +I + V S+Q LWP+S+ FGS +TG+ LPSSDV
Sbjct: 148 LHEEILAYTQYTKKTVDTMAVHIEQMISNVRASVQSLWPQSKVETFGSYSTGIWLPSSDV 207
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNG-IKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
DLV+ ++E + + L+ A+ L ++WV +SL ++ +P
Sbjct: 208 DLVIL-------------DVVEVNDSKLTAKHLRQLAKVLEKKKWV--ESLLCLDAAKVP 252
Query: 1332 IIMLV---VEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
++ LV VP D+ +A++ HS ++A
Sbjct: 253 VLKLVSAETSVPIDITFESAAT-------------------HSGLLA------------- 280
Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
DL+K + P PLA+V KQ L +R L+ +Y
Sbjct: 281 --------------------------RDLIKRYADNMPELYPLAIVFKQLLRERDLNDAY 314
Query: 1449 SGGLSSYCLMLLITRFLQ 1466
+GGLSSY ++L+I F Q
Sbjct: 315 TGGLSSYSVVLMIIHFSQ 332
>gi|328872412|gb|EGG20779.1| hypothetical protein DFA_00644 [Dictyostelium fasciculatum]
Length = 1767
Score = 87.8 bits (216), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 114/282 (40%), Gaps = 69/282 (24%)
Query: 1193 PLEVQNCPTRKASLSLMHSL----LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQV 1248
PL+V P L H L LH+EI F V + + + ++
Sbjct: 1355 PLQVYLQPWYDPPLPYPHPLFSIKLHQEILDFVSDVGDRTYPHVQSCHHVIALIKGVVKR 1414
Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
LWP + +I+GS TGL LPSSD+D+VV + P +LE L+
Sbjct: 1415 LWPNAEIDIYGSFMTGLWLPSSDIDIVVDYNKNMAIRPKNAQFLLE--------VLEKQI 1466
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
R+ N + ++ + + IP+I LV KE+ A
Sbjct: 1467 RHDLNSFIL---TMLCIPSAKIPVIKLVT-----------------KENIA--------- 1497
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
V + SP H TG+ DLV + ++
Sbjct: 1498 -----VDISFRESPTSPH-----------------------TGIAARDLVADNVKKVIGL 1529
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
PLA+VLK FL DR L+ +YSGGLSSYCL+L++ FL++ H
Sbjct: 1530 YPLAIVLKWFLRDRGLNNTYSGGLSSYCLVLMLISFLKNNEH 1571
>gi|325183687|emb|CCA18146.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 737
Score = 86.7 bits (213), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 108/255 (42%), Gaps = 75/255 (29%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH EI Q + K +I ++ VT+S+Q LW S FGS +TGL LP+SDV
Sbjct: 191 LHSEIVHIASQTKKTVDSMKIHIAKIIENVTKSVQELWHDSTIETFGSYSTGLWLPTSDV 250
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETC--LQHAARYLANQEWVKSDSLKTVENTAI 1330
DLV+ +LE + ++ T L A L+ Q+WV S
Sbjct: 251 DLVIL-------------NVLENCD-LRTTVANLGKLADVLSQQDWVAS----------- 285
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
+V L+ + P S D
Sbjct: 286 ------------------------------------------IVVLETAKVPVIKLVSKD 303
Query: 1391 NIKAATSVRLDISFKSPS-HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
S+ +DISF++ S H+GL DLV ++ P P A+++KQ L +R L+ +Y+
Sbjct: 304 -----VSIPIDISFETSSTHSGLLARDLVSLYVKELPELYPAAIIMKQLLRERDLNDAYT 358
Query: 1450 GGLSSYCLMLLITRF 1464
GGLSSY ++L++ F
Sbjct: 359 GGLSSYSIVLMLIHF 373
>gi|348690009|gb|EGZ29823.1| hypothetical protein PHYSODRAFT_466551 [Phytophthora sojae]
Length = 666
Score = 84.7 bits (208), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 67/257 (26%), Positives = 109/257 (42%), Gaps = 75/257 (29%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI + + +I + V S+ LWP+S+ FGS +TG+ LPSSDV
Sbjct: 161 LHEEILEYTEYTKKTVDKMAVHIEQMIANVRASVLSLWPQSKVETFGSYSTGIWLPSSDV 220
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
DLV+ ++ + ++ + L+ A+ L ++WV +SL ++ +P+
Sbjct: 221 DLVIL-----DVVDVNDSQLTAKH-------LKELAKVLEKKKWV--ESLLVLDTAKVPV 266
Query: 1333 IMLV---VEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+ LV VP D+ +A++ HS ++A
Sbjct: 267 LKLVSAETSVPIDITFESAAT-------------------HSGLLA-------------- 293
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
DL+K + P PLA+V KQ L +R L+ +Y+
Sbjct: 294 -------------------------RDLIKRYADNMPELYPLAIVFKQLLRERDLNDAYT 328
Query: 1450 GGLSSYCLMLLITRFLQ 1466
GGLSSY ++L+I F Q
Sbjct: 329 GGLSSYSVVLMIIHFSQ 345
>gi|299115714|emb|CBN74279.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 871
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 73/267 (27%), Positives = 108/267 (40%), Gaps = 56/267 (20%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
+LHEEI F + RK + +R+ + +LWP + +GS A G+ P SD
Sbjct: 309 ILHEEILDFAARCRDVQEKRKRVLENLWRRIEDVICLLWPDTVVVPYGSYACGMMTPESD 368
Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRN--------GIKETCLQHAARYLANQEWVKSDSLK 1323
VD++V +N P EA + R G T + A +L S +K
Sbjct: 369 VDIMVFPGDCKNSSPPFEAKDDQPRRRGSRTDSVGGMLTRIHELADHLTQHGTFSS--MK 426
Query: 1324 TVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPK 1383
+ + +PII A + S +D + T+
Sbjct: 427 VIPHAMVPII------------KARTGALSDGDDGSGDTV-------------------- 454
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S +DIS SHTGL T+ L + P PL +VLKQ L R
Sbjct: 455 -------------SFSIDISIDCQSHTGLATSAFTSYLADHLPNLCPLTMVLKQLLQSRG 501
Query: 1444 LDQSYSGGLSSYCLMLLIT-RFLQHEH 1469
L+ ++GGLSSY L+L++T LQ +H
Sbjct: 502 LNDPFTGGLSSYGLVLMVTFALLQRDH 528
>gi|124481633|gb|AAI33102.1| LOC568678 protein [Danio rerio]
Length = 535
Score = 84.0 bits (206), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 108/262 (41%), Gaps = 94/262 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F + ++ E R V R+ R ++ LWP + +FGS +TGL LP+
Sbjct: 50 LHEEIKDFYEYISPRPEEEQMRHE----VVARIQRVIKDLWPNAEVCVFGSFSTGLYLPT 105
Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+DLVV LP L ++EA L ++ +S+K
Sbjct: 106 SDIDLVVFGNWETLP----LWTLEEA--------------------LRKRKVADENSIKV 141
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
++ +PII L+ +HT +K D
Sbjct: 142 LDKATVPIIKLM---------------------DSHTEVKVD------------------ 162
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
ISF S G++ +L+K+ +Q+P L LVLKQFL R L
Sbjct: 163 -----------------ISFNVQS--GVKAANLIKDYKQQYPVLPYLVLVLKQFLLQREL 203
Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
++ ++GG+ SY L L+ FLQ
Sbjct: 204 NEVFTGGIGSYSLFLMAVSFLQ 225
>gi|68363844|ref|XP_697115.1| PREDICTED: PAP-associated domain-containing protein 5 [Danio rerio]
Length = 653
Score = 84.0 bits (206), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 108/262 (41%), Gaps = 94/262 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F + ++ E R V R+ R ++ LWP + +FGS +TGL LP+
Sbjct: 168 LHEEIKDFYEYISPRPEEEQMRHE----VVARIQRVIKDLWPNAEVCVFGSFSTGLYLPT 223
Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+DLVV LP L ++EA L ++ +S+K
Sbjct: 224 SDIDLVVFGNWETLP----LWTLEEA--------------------LRKRKVADENSIKV 259
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
++ +PII L+ +HT +K D
Sbjct: 260 LDKATVPIIKLM---------------------DSHTEVKVD------------------ 280
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
ISF S G++ +L+K+ +Q+P L LVLKQFL R L
Sbjct: 281 -----------------ISFNVQS--GVKAANLIKDYKQQYPVLPYLVLVLKQFLLQREL 321
Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
++ ++GG+ SY L L+ FLQ
Sbjct: 322 NEVFTGGIGSYSLFLMAVSFLQ 343
>gi|307200518|gb|EFN80680.1| PAP-associated domain-containing protein 5 [Harpegnathos saltator]
Length = 636
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 111/274 (40%), Gaps = 80/274 (29%)
Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + + LHEEI+ F + + + VKR+ + LWP
Sbjct: 159 EYGGCPWRMPNKHYSKGVIGLHEEIEDFFAYMCPSHEEHVLRMR-VVKRIEYVIYDLWPD 217
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
S+ +FGS TGL LP+SD+DL V + +NL L+ R L
Sbjct: 218 SKVEVFGSFRTGLYLPTSDIDL-VVIGMWKNLP------------------LRTLERALL 258
Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
+Q + S+K ++ ++P
Sbjct: 259 DQNIAEPSSIKVLDKASVP----------------------------------------- 277
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
+V L D S +++DISF + G+++ +L+ Q+P L
Sbjct: 278 IVKLTDKES---------------EIKVDISFNM--NNGVKSAELINSFKRQYPVLEKLV 320
Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 321 MVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 354
>gi|66557991|ref|XP_625041.1| PREDICTED: PAP-associated domain-containing protein 5-like [Apis
mellifera]
Length = 539
Score = 83.2 bits (204), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 114/278 (41%), Gaps = 88/278 (31%)
Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + + LHEEI+ F + N I VKR+ + + LWP
Sbjct: 101 EYGGCPWRVPNKHYSKGVIGLHEEIEDFFAYMCPSNEEHSLRIR-VVKRIEQVIYDLWPD 159
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
S+ +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 160 SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 196
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
R L +Q + S+K ++ ++PI+ L + KE T +K D
Sbjct: 197 RALLDQNIAEPSSIKVLDKASVPIVKL-----------------TDKE----TEIKVD-- 233
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
S ++ +K+A + SFK ++FP
Sbjct: 234 ---------------ISFNMNNGVKSAELIN---SFK-----------------KRFPVL 258
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L +VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 259 EKLVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 296
>gi|383852647|ref|XP_003701838.1| PREDICTED: PAP-associated domain-containing protein 5-like [Megachile
rotundata]
Length = 573
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 114/278 (41%), Gaps = 88/278 (31%)
Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + + LHEEI+ F + N I VKR+ + + LWP
Sbjct: 101 EYGGCPWRVPNKHYSKGVIGLHEEIEDFFAYMCPSNEEHSLRIR-VVKRIEQVIYDLWPD 159
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
S+ +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 160 SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 196
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
R L +Q + S+K ++ ++PI+ L + KE T +K D
Sbjct: 197 RALLDQNIAEPSSIKVLDKASVPIVKL-----------------TDKE----TEIKVD-- 233
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
S S+ +K+A + SFK +++P
Sbjct: 234 ---------------ISFNMSNGVKSAELIN---SFK-----------------KRYPVL 258
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L +VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 259 EKLVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 296
>gi|412986788|emb|CCO15214.1| predicted protein [Bathycoccus prasinos]
Length = 687
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 105/254 (41%), Gaps = 77/254 (30%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH EI FC+ + K + AV+RV ++ +WP SR + GS AT + LPSSDV
Sbjct: 169 LHGEIVDFCRFLEPTEEEEKSRLA-AVERVREAVMTIWPSSRFEVHGSFATKMYLPSSDV 227
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D A IL+ CL+ A YLA + E T I +
Sbjct: 228 D----------------AVILDSGAKSPAVCLKALALYLA----------RRGEATNIQL 261
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I S + P +V +++ S
Sbjct: 262 I---------------SKARVP------------------IVKFEETKS----------- 277
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
V+ D+SF + G + ++VK T +FPA PL VLK FL+ R L++ YSGG+
Sbjct: 278 ----GVQFDVSFDVAN--GPASAEIVKRNTRRFPALRPLTTVLKCFLSQRGLNEVYSGGI 331
Query: 1453 SSYCLMLLITRFLQ 1466
SY L+ +I FLQ
Sbjct: 332 GSYALLCMIMTFLQ 345
>gi|348687890|gb|EGZ27704.1| hypothetical protein PHYSODRAFT_343641 [Phytophthora sojae]
Length = 501
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/262 (25%), Positives = 109/262 (41%), Gaps = 85/262 (32%)
Query: 1209 MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
+++ LHEEI F ++ E ++R I + + ++ LWP + FGS+ T +
Sbjct: 128 VYACLHEEIMDFVSFISPTEQELSSRAELI----EEMREIVKGLWPEATVETFGSHYTQM 183
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
LP SD+D+V+ G+ EG+ L A+ L +E V L+ +
Sbjct: 184 FLPQSDIDMVLF-------------GVPEGK-----APLFKLAQCLEEKELV--SYLEVI 223
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ IPI+ K
Sbjct: 224 DKARIPIV-------------------------------------------------KMV 234
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
H +SD + +D+SF GL T DLVK +P+ PL LVLK F+A R L+
Sbjct: 235 HKASD-------IHVDVSFNVAG--GLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLN 285
Query: 1446 QSYSGGLSSYCLMLLITRFLQH 1467
++Y+GG+ S+ L +++ FLQH
Sbjct: 286 ETYTGGVGSFLLQMMVVSFLQH 307
>gi|345493399|ref|XP_001604785.2| PREDICTED: PAP-associated domain-containing protein 5-like [Nasonia
vitripennis]
Length = 462
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 105/258 (40%), Gaps = 86/258 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI+ F + N + +KR+ + LWP S+ IFGS TGL LP+SD+
Sbjct: 46 LHEEIEDFFTYMCPTNEEHLLRVK-VIKRIENVIYDLWPDSKVEIFGSFRTGLYLPTSDI 104
Query: 1273 DLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
DLVV P+ LE R L +Q V+ S+K ++
Sbjct: 105 DLVVIGMWTNLPLHTLE-----------------------RALIDQNIVEPSSVKVLDRA 141
Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
++PI+ L T +K D S
Sbjct: 142 SVPIVKLT---------------------DRETEIKVD-----------------ISFNM 163
Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
++ +K+A ++ +FK Q+PA L +VLKQFL R L++ +
Sbjct: 164 NNGVKSAELIK---TFK-----------------RQYPALEKLVMVLKQFLLQRDLNEVF 203
Query: 1449 SGGLSSYCLMLLITRFLQ 1466
+GG+SSY L+L+ FLQ
Sbjct: 204 TGGISSYSLILMTISFLQ 221
>gi|301093772|ref|XP_002997731.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
gi|262109980|gb|EEY68032.1| Poly(A) RNA polymerase, putative [Phytophthora infestans T30-4]
Length = 489
Score = 81.3 bits (199), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 106/262 (40%), Gaps = 85/262 (32%)
Query: 1209 MHSLLHEEIDSFCKQVA-AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
++ LH+EI F ++ E+ R V+ + ++ LWP + FGS+ T + L
Sbjct: 117 VYECLHDEIMDFVTFISPTEDELRSR--AQLVEEMRGVVKGLWPEATVETFGSHYTQMFL 174
Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKE--TCLQHAARYLANQEWVKSDSLKTV 1325
P SD+D+V+ G+ EG+ + + CL+ R L+ +
Sbjct: 175 PQSDIDMVLF-------------GVPEGKEPLYKLAQCLEEKDR---------VSYLEVI 212
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ IPI+ +V
Sbjct: 213 DKARIPIVKMV------------------------------------------------- 223
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
H SD + +D+SF GL T DLVK +P+ PL LVLK F+A R L+
Sbjct: 224 HKGSD-------IHVDVSFNVAG--GLATGDLVKHYMRVYPSFRPLTLVLKYFMAQRGLN 274
Query: 1446 QSYSGGLSSYCLMLLITRFLQH 1467
++YSGG+ S+ L +++ FLQH
Sbjct: 275 ETYSGGVGSFLLQMMVVSFLQH 296
>gi|294948742|ref|XP_002785872.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239899980|gb|EER17668.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 436
Score = 80.5 bits (197), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 81/264 (30%)
Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
MH +LHEE+ + K V E AR+ + V R+ + LWP+ +FGS +GL
Sbjct: 118 MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELKVFGSFLSGL 173
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SLPS D+DL + PV +E ++ + L + +KE L+ AR
Sbjct: 174 SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS-PK 1383
+PI+ LV +P HV D+ D+S+ P
Sbjct: 219 -----VPIVKLVDVLPP--------------------------HVEVDVSVHDNSSGEPD 247
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S+T T +++E E++PA PL + LK +LA R
Sbjct: 248 SSNT--------------------------TKFVIREGVEKYPAFKPLMIFLKTYLASRK 281
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
L+ +++GG+ SY L+ FLQ
Sbjct: 282 LNLTFTGGVGSYLAACLVIAFLQQ 305
>gi|241855549|ref|XP_002416033.1| PAP-associated domain-containing protein, putative [Ixodes
scapularis]
gi|215510247|gb|EEC19700.1| PAP-associated domain-containing protein, putative [Ixodes
scapularis]
Length = 347
Score = 80.1 bits (196), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 108/262 (41%), Gaps = 94/262 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI+ F + + AE+ R ++R+ + LWP++ IFGS TGL LP+
Sbjct: 1 LHDEIEEFYRYMQPSPAEHEMRLG----VIQRIKEVILSLWPQAEVEIFGSFRTGLYLPT 56
Query: 1270 SDVDLVVC----LPPVRNLE-PIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+D+VV P+ LE + GI E R S+K
Sbjct: 57 SDIDVVVLGKWETLPMWTLEKALLTHGIAEPR------------------------SIKV 92
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
++ ++PI+ L A TT+K D
Sbjct: 93 LDKASVPIVKLT---------------------DARTTVKVD-----------------I 114
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
S ++ +K+A L+K E+FPA L LVLKQFL R L
Sbjct: 115 SFNMNNGVKSAR--------------------LIKSFKEKFPALAKLVLVLKQFLLQRDL 154
Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
++ ++GG+SSY L+L+ FLQ
Sbjct: 155 NEVFTGGISSYSLILMTVSFLQ 176
>gi|348678978|gb|EGZ18795.1| hypothetical protein PHYSODRAFT_502025 [Phytophthora sojae]
Length = 778
Score = 80.1 bits (196), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 112/273 (41%), Gaps = 64/273 (23%)
Query: 1207 SLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS-RTNIFGSNATGL 1265
++M +LH+EI + + E ++P+I A+ VT +Q ++ + +GS AT L
Sbjct: 288 AMMQVVLHDEIVDYYQHRRREAREKRPWIQKAIAMVTEEIQRVYNHCVQVEGYGSFATRL 347
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SSDVDLVV G N + C+ A +L + + +
Sbjct: 348 HGESSDVDLVVF-------------GATGEFNFTSQQCVAALAEHL--HKLPAFVDVSAI 392
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+IP++ +VV +V +D C
Sbjct: 393 TRASIPLLKVVV-----------------------------------LVTID------CE 411
Query: 1386 HTSSDNIKAATSVRL----DISFKSPS---HTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
T S+N V L DI+F P H G+ + LV+ L + F L LVLK F
Sbjct: 412 ETQSENAAREGGVELRIPFDITFDEPHGIHHNGVASVTLVQGLVDAFYGLRELTLVLKHF 471
Query: 1439 LADRSLDQSYSGGLSSYCLMLLITRFLQHEHHL 1471
L +R L+ Y GGLSSY L+L++ L+ + L
Sbjct: 472 LVERGLNDPYVGGLSSYGLLLMVVYVLREQGAL 504
>gi|115673160|ref|XP_796681.2| PREDICTED: uncharacterized protein LOC592046 [Strongylocentrotus
purpuratus]
Length = 830
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 93/229 (40%), Gaps = 77/229 (33%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRN 1297
V+R+ ++ +WP+++ I+GS T L LP+SD+DLV+
Sbjct: 159 VVQRIQGIVRSIWPKAKVEIYGSTRTMLYLPTSDIDLVLF-------------------G 199
Query: 1298 GIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKED 1357
I E+ L + S+K ++ ++PI+ L
Sbjct: 200 DIGESPFFRLGNELEKSGIAEQGSIKVLDKASVPIVKL---------------------- 237
Query: 1358 AAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDL 1417
+DN+ T VR+DISF TG L
Sbjct: 238 -------------------------------TDNV---TKVRVDISFN--MQTGTDCAKL 261
Query: 1418 VKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++E QFP+ + VLKQFL R L++ ++GG+SSY L+L+I FLQ
Sbjct: 262 IEEFICQFPSLPFMVFVLKQFLLQRDLNEVWTGGISSYSLILMIVSFLQ 310
>gi|427795543|gb|JAA63223.1| Putative pap-associated domain-containing protein 5, partial
[Rhipicephalus pulchellus]
Length = 627
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 108/258 (41%), Gaps = 86/258 (33%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI+ F + + AE+ R I +R+ + LWP++ IFGS TGL LP+
Sbjct: 142 LHDEIEDFYRYMQPTPAEHQMRLGVI----QRIKDVILGLWPQAEVEIFGSFRTGLYLPT 197
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETC-LQHAARYLANQEWVKSDSLKTVENT 1328
SD+D+VV G ET + + L + + S+K ++
Sbjct: 198 SDIDVVVL--------------------GKWETLPMWTLEKALLSHGIAEPQSIKVLDKA 237
Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
++PI+ L A TT+K D
Sbjct: 238 SVPIVKLT---------------------DAKTTVKVD---------------------- 254
Query: 1389 SDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
ISF + G+++ L++ E+FPA L LVLKQFL R L++ +
Sbjct: 255 -------------ISFN--MNNGVKSACLIQSFKEKFPALPKLVLVLKQFLLQRDLNEVF 299
Query: 1449 SGGLSSYCLMLLITRFLQ 1466
+GG+SSY L+L+ FLQ
Sbjct: 300 TGGISSYSLILMTVSFLQ 317
>gi|119603156|gb|EAW82750.1| PAP associated domain containing 5, isoform CRA_c [Homo sapiens]
Length = 371
Score = 79.7 bits (195), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 102/255 (40%), Gaps = 80/255 (31%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 28 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 86
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGR-NGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
DLVV G+ + L+ A R + DS+K ++ +P
Sbjct: 87 DLVVF-----------------GKWENLPLWTLEEALR---KHKVADEDSVKVLDKATVP 126
Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
II L + T +K D S +
Sbjct: 127 IIKLT---------------------DSFTEVKVD-----------------ISFNVQNG 148
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++AA DL+K+ T+++P L LVLKQFL R L++ ++GG
Sbjct: 149 VRAA--------------------DLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGG 188
Query: 1452 LSSYCLMLLITRFLQ 1466
+ SY L L+ FLQ
Sbjct: 189 IGSYSLFLMAVSFLQ 203
>gi|270009939|gb|EFA06387.1| hypothetical protein TcasGA2_TC009265 [Tribolium castaneum]
Length = 513
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/278 (25%), Positives = 111/278 (39%), Gaps = 88/278 (31%)
Query: 1195 EVQNCPTRKASLSLMHSLL--HEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP RK +L H+EID F + ++ T + V R+ + + WP
Sbjct: 91 EFGGCPWRKDPHVYARGILGLHQEIDHFYQYMSPTRTEHLVRDD-VVCRIKQIILSKWPE 149
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
++ +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 150 AQVEVFGSYRTGLYLPTSDIDLVVIGKWSNLPLRTLE----------------------Q 187
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
+L N + + +S+K ++ ++PI+ L T +K D
Sbjct: 188 EFLDN-DVAQENSIKVLDKASVPIVKLT---------------------DKQTEIKVD-- 223
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
S S+ +K+A +L+K QFP
Sbjct: 224 ---------------ISFNMSNGVKSA--------------------ELIKTYIHQFPVL 248
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L VLKQFL +R L++ ++GG+SSY L+L+ FLQ
Sbjct: 249 PKLVYVLKQFLLERDLNEVFTGGISSYSLILMCISFLQ 286
>gi|119603155|gb|EAW82749.1| PAP associated domain containing 5, isoform CRA_b [Homo sapiens]
Length = 374
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 102/255 (40%), Gaps = 80/255 (31%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 28 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 86
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGR-NGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
DLVV G+ + L+ A R + DS+K ++ +P
Sbjct: 87 DLVVF-----------------GKWENLPLWTLEEALR---KHKVADEDSVKVLDKATVP 126
Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
II L + T +K D S +
Sbjct: 127 IIKLT---------------------DSFTEVKVD-----------------ISFNVQNG 148
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++AA DL+K+ T+++P L LVLKQFL R L++ ++GG
Sbjct: 149 VRAA--------------------DLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGG 188
Query: 1452 LSSYCLMLLITRFLQ 1466
+ SY L L+ FLQ
Sbjct: 189 IGSYSLFLMAVSFLQ 203
>gi|294896867|ref|XP_002775743.1| PAP-associated domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239882023|gb|EER07559.1| PAP-associated domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 588
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 81/264 (30%)
Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
MH +LHEE+ + K V E AR+ + V R+ + LWP+ +FGS +GL
Sbjct: 118 MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELKVFGSFLSGL 173
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SLPS D+DL + PV +E ++ + L + +KE L+ AR
Sbjct: 174 SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS-PK 1383
+PI+ LV +P HV D+ D+S+ P
Sbjct: 219 -----VPIVKLVDVLP--------------------------PHVEVDVSVHDNSSGEPD 247
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S+T T +++E E++PA PL + LK +LA R
Sbjct: 248 SSNT--------------------------TKFVIREGVEKYPAFKPLMIFLKTYLASRK 281
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
L+ +++GG+ SY L+ FLQ
Sbjct: 282 LNLTFTGGVGSYLAACLVIAFLQQ 305
>gi|294929610|ref|XP_002779288.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239888351|gb|EER11083.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 589
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 108/264 (40%), Gaps = 81/264 (30%)
Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
MH +LHEE+ + K V E AR+ + V R+ + LWP+ +FGS +GL
Sbjct: 118 MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELRVFGSFLSGL 173
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SLPS D+DL + PV +E ++ + L + +KE L+ AR
Sbjct: 174 SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS-ASPK 1383
+PI+ LV +P HV D+ D+S P
Sbjct: 219 -----VPIVKLVDVLP--------------------------PHVEVDVSVQDNSTGEPD 247
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S T T +++E E++PA PL + LK +LA R
Sbjct: 248 SSDT--------------------------TKFVIREGVEKYPAFKPLMIFLKTYLASRK 281
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
L+ +++GG+ SY L+ FLQ
Sbjct: 282 LNLTFTGGVGSYLAACLVIAFLQQ 305
>gi|343172004|gb|AEL98706.1| Nucleotidyltransferase family protein, partial [Silene latifolia]
Length = 501
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 109/271 (40%), Gaps = 91/271 (33%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI FC ++ E +R N A++RV+ ++ +WP R +FGS TGL LPS
Sbjct: 123 LHNEILDFCDFLSPTPEEGRSR----NSAIQRVSDVIKYIWPNCRVEVFGSYRTGLYLPS 178
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V IL+ + LQ +R L+ + K ++ +
Sbjct: 179 SDIDVV----------------ILDSNIKSPQIGLQALSRALSQKGIAK--KIQVIGKAR 220
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PII V +V T ++ D V+ D PK +
Sbjct: 221 VPIIKFVEKV---------------------TDVQFD-------VSFDVDNGPKAA---- 248
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+ +K++ + P PL L+LK FL R L++ YS
Sbjct: 249 --------------------------EYIKDVISRLPPLRPLCLILKVFLQQRELNEVYS 282
Query: 1450 GGLSSYCLMLLITRFLQH--------EHHLG 1472
GG+ SY L+ ++ L+ EH+LG
Sbjct: 283 GGIGSYALLAMLIAMLKSVNEGQRYPEHNLG 313
>gi|334311788|ref|XP_003339660.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Monodelphis domestica]
Length = 809
Score = 79.0 bits (193), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 108/275 (39%), Gaps = 93/275 (33%)
Query: 1202 RKASLSLMHS-----LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
RK S +HS LHEEI F + ++ K + V R+ ++ LWP +
Sbjct: 341 RKFSTMPVHSTVPGTYLHEEISDFYEYMSPRPEEEKMRME-VVNRIENVIKELWPSADVQ 399
Query: 1257 IFGSNATGLSLPSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 1311
IFGS TGL LP+SD+DLVV LP L ++EA L
Sbjct: 400 IFGSFKTGLYLPTSDIDLVVFGKWENLP----LWTLEEA--------------------L 435
Query: 1312 ANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHS 1371
+ DS+K ++ +PII L + T +K D
Sbjct: 436 RKHKVADEDSVKVLDKATVPIIKLT---------------------DSFTEVKVD----- 469
Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
S + +KAA L+K+ T+++P L
Sbjct: 470 ------------ISFNVQNGVKAA--------------------QLIKDFTKKYPVLPYL 497
Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 498 VLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 532
>gi|350415058|ref|XP_003490519.1| PREDICTED: PAP-associated domain-containing protein 5-like [Bombus
impatiens]
Length = 572
Score = 78.6 bits (192), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 113/278 (40%), Gaps = 88/278 (31%)
Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + + LHEEI+ F + N + VKR+ + + LW
Sbjct: 101 EYGGCPWRVPNKHYSKGVIGLHEEIEDFFAYMCPSNEEHSLRMR-VVKRIEQVIYDLWQD 159
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
S+ +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 160 SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 196
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
R L +Q + S+K ++ ++PI+ L + KE T +K D
Sbjct: 197 RALLDQNIAEPSSIKVLDKASVPIVKL-----------------TDKE----TEIKVD-- 233
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
S ++ +K+A + SFK ++FP
Sbjct: 234 ---------------ISFNMNNGVKSAELIN---SFK-----------------KRFPVL 258
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L +VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 259 EKLVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 296
>gi|325185090|emb|CCA19582.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 328
Score = 78.2 bits (191), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 112/256 (43%), Gaps = 69/256 (26%)
Query: 1217 IDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
+++F + + AE + + A+ + + +Q +W ++ I+GS T L+LP+SD+D V+
Sbjct: 1 MEAFVEHLDAEVFYLEQHQRQAISTLQQVVQEIWQDAQIEIYGSTYTRLALPTSDIDCVL 60
Query: 1277 CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
P + P LQ A+ +A+Q + L+ + N IP++ ++
Sbjct: 61 VSPSCEHESP--------------SFILQEIAQGIASQP--ELHRLELLTNAKIPVLKVM 104
Query: 1337 VEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAAT 1396
+ PK+ + LD + C+H
Sbjct: 105 YQ---------------PKDQM-------------RAILLDLT----CAH---------- 122
Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
SP H+GL DLV + PA PL LVLK+ L L+ +++GGLSSY
Sbjct: 123 ---------SPGHSGLSARDLVFSYQTRMPALRPLVLVLKRHLKALGLNCAFTGGLSSYA 173
Query: 1457 LMLLITRFLQH--EHH 1470
L+LL+ RFLQ +HH
Sbjct: 174 LVLLVVRFLQACGDHH 189
>gi|294879926|ref|XP_002768827.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239871765|gb|EER01545.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 509
Score = 78.2 bits (191), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 108/264 (40%), Gaps = 81/264 (30%)
Query: 1209 MHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
MH +LHEE+ + K V E AR+ + V R+ + LWP+ +FGS +GL
Sbjct: 118 MHLMLHEELVDYTKWVVLNDDEVKAREEF----VARINGVCKDLWPQCELKVFGSFLSGL 173
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKE-AGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SLPS D+DL + PV +E ++ + L + +KE L+ AR
Sbjct: 174 SLPSGDIDLSLQNVPVGAVEAVRMLSDKLLSQGQLKELELRETAR--------------- 218
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS-ASPK 1383
+PI+ LV +P HV D+ D+S P
Sbjct: 219 -----VPIVKLVDLLP--------------------------PHVEVDVSVQDNSTGEPD 247
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S T T +++E E++PA PL + LK +LA R
Sbjct: 248 SSDT--------------------------TKFVMQEGVEKYPAFKPLMIFLKTYLASRK 281
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQH 1467
L+ +++GG+ SY L+ FLQ
Sbjct: 282 LNLTFTGGVGSYLAACLVIAFLQQ 305
>gi|343172002|gb|AEL98705.1| Nucleotidyltransferase family protein, partial [Silene latifolia]
Length = 501
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/271 (25%), Positives = 110/271 (40%), Gaps = 91/271 (33%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC ++ E +R N A++RV+ ++ +WP R +FGS TGL LPS
Sbjct: 123 LHKEILDFCDFLSPTPEEGRSR----NSAIQRVSDVIKYIWPNCRVEVFGSYRTGLYLPS 178
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V IL+ + LQ +R L+ + K ++ +
Sbjct: 179 SDIDVV----------------ILDSNIKSPQIGLQALSRALSQKGIAK--KIQVIGKAR 220
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PII V +V T ++ D V+ D PK +
Sbjct: 221 VPIIKFVEKV---------------------TDVQFD-------VSFDVDNGPKAA---- 248
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+ ++++ + P PL L+LK FL R L++ YS
Sbjct: 249 --------------------------EYIQDVISRLPPLRPLCLILKVFLQQRELNEVYS 282
Query: 1450 GGLSSYCLMLLITRFLQH--------EHHLG 1472
GG+ SY L+ ++ L+ EH+LG
Sbjct: 283 GGIGSYALLAMLIAMLKSVNEGQRYPEHNLG 313
>gi|255566595|ref|XP_002524282.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223536473|gb|EEF38121.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 526
Score = 77.8 bits (190), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 109/267 (40%), Gaps = 89/267 (33%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC ++ E AR N AVK V ++ +WP + +FGS TGL LP+
Sbjct: 125 LHKEIVDFCDFLSPTPEEEDAR----NTAVKCVFDVIKYIWPNCKVEVFGSYKTGLYLPT 180
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIK--ETCLQHAARYLANQEWVKSDSLKTVEN 1327
SD+D+V+ R+GIK + LQ +R L+ + K ++ +
Sbjct: 181 SDIDVVIF------------------RSGIKNPQIGLQALSRALSQKGIAK--KIQVIAK 220
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PI+ V + + D ++ D PK +
Sbjct: 221 ARVPIVKFV---------------------EKRSGVSFD-------ISFDVDNGPKAAEF 252
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
D ++ ++PA PL+L+LK FL R L++
Sbjct: 253 IKDAVR------------------------------KWPALRPLSLILKVFLQQRELNEV 282
Query: 1448 YSGGLSSYCLMLLITRFLQ--HEHHLG 1472
YSGG+ SY L+ ++ L+ EH+LG
Sbjct: 283 YSGGIGSYALLTMLMAVLKASSEHNLG 309
>gi|384248025|gb|EIE21510.1| Nucleotidyltransferase, partial [Coccomyxa subellipsoidea C-169]
Length = 291
Score = 77.4 bits (189), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 107/262 (40%), Gaps = 83/262 (31%)
Query: 1216 EIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
E+ FC+ +A +E +R+ A+ RVT ++Q +WP + +FGS TGL LPSSD+
Sbjct: 1 ELVEFCRFLAPSSSELASRQA----ALARVTDAVQSIWPSASVQVFGSFVTGLYLPSSDM 56
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV ++ + G + L+ A L + K +++ + +PI
Sbjct: 57 DIVV----------------MDSQCGDIRSALKAVANSLVRKNMAK--NIQIIAKAKVPI 98
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I K + + +K D
Sbjct: 99 I---------------------KFEDIESGIKFD-------------------------- 111
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
ISF + + G + D VK L ++ P PL L+LK FL R L++ Y GG+
Sbjct: 112 ---------ISFDAAN--GPEAADFVKGLMQRLPPMRPLVLILKVFLHQRELNEVYQGGI 160
Query: 1453 SSYCLMLLITRFLQHEHHLGRP 1474
SY L++++ FL G P
Sbjct: 161 GSYALLVMVAGFLLLHPSRGGP 182
>gi|301097760|ref|XP_002897974.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106419|gb|EEY64471.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 735
Score = 76.6 bits (187), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 107/269 (39%), Gaps = 69/269 (25%)
Query: 1207 SLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS-RTNIFGSNATGL 1265
++M +LH+EI F E ++P++ A+ VT + +LW + +GS AT L
Sbjct: 286 AMMQVVLHDEIVDFYNHHRREARDKRPWMQQAIAMVTEEIDLLWNHCVQVEGYGSFATRL 345
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SSDVDLVV G E N + C++ A ++ +E + +
Sbjct: 346 HGESSDVDLVV-------------FGATEEFNFTSQQCVEALADHV--RELAAYVDVSAI 390
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+IP++ +VV +V +D
Sbjct: 391 TRASIPLLKVVV-----------------------------------LVTIDG------- 408
Query: 1386 HTSSDNIKAATSVRLDISFKSPS---HTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
+ DI+F P H G+ + LV+ L F L LVLK FL +R
Sbjct: 409 --------VELRIPFDITFDEPHGIHHNGVASVTLVQGLASAFYGLRELTLVLKHFLVER 460
Query: 1443 SLDQSYSGGLSSYCLMLLITRFLQHEHHL 1471
L+ Y GGLSSY L+L++ L+ + L
Sbjct: 461 GLNDPYVGGLSSYGLLLMVVYVLREQGAL 489
>gi|348512677|ref|XP_003443869.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
Length = 805
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/261 (24%), Positives = 102/261 (39%), Gaps = 92/261 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEE+ F K ++ E RK +N R+ ++ LWP I+GS +TGL LP+
Sbjct: 253 LHEEVMDFYKYISPRPEEAAMRKEVVN----RIEVVIKELWPTVDVQIYGSFSTGLYLPT 308
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DL+V PP++ LE + L E +S+K +
Sbjct: 309 SDIDLMVTGKWERPPLQELE-----------------------QALRKHEVADPNSIKVL 345
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ +PII L +
Sbjct: 346 DKATVPIIKLTDQ----------------------------------------------- 358
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
++K S ++ K+ S +K +++P L VLKQFL R L+
Sbjct: 359 ---RTDVKVDISFNVETGVKAAS--------FIKGYMKKYPVLPYLIFVLKQFLLQRDLN 407
Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
+ ++GG+SSY L+L++ FLQ
Sbjct: 408 EVFTGGISSYSLILMVISFLQ 428
>gi|297698707|ref|XP_002826459.1| PREDICTED: PAP-associated domain-containing protein 5 [Pongo abelii]
Length = 588
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 113/287 (39%), Gaps = 100/287 (34%)
Query: 1185 PDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTR 1244
P+VA ++Q C + KA LLHEEI F + ++ K + V R+
Sbjct: 73 PEVA-----TQLQTCSSFKA-------LLHEEISDFYEYMSPRPEEEKMRME-VVNRIES 119
Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGI 1299
++ LWP + IFGS TGL LP+SD+DLVV LP L ++EA
Sbjct: 120 VIKELWPSADVQIFGSFKTGLYLPTSDIDLVVFGKWENLP----LWTLEEA--------- 166
Query: 1300 KETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAA 1359
L + DS+K ++ +PII L +
Sbjct: 167 -----------LRKHKVADEDSVKVLDKATVPIIKLT---------------------DS 194
Query: 1360 HTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVK 1419
T +K D S + ++AA ++
Sbjct: 195 FTEVKVD-----------------ISFNVQNGVRAADLIK-------------------- 217
Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ T+++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 218 DFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 264
>gi|66821005|ref|XP_644037.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
gi|60472363|gb|EAL70316.1| hypothetical protein DDB_G0274847 [Dictyostelium discoideum AX4]
Length = 1880
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 69/260 (26%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI F V + + + ++ LW + +I+GS TGL LPSSD+
Sbjct: 1473 LHQEILDFVSDVGERTYPHVQSCHHVIGLIRSVVKKLWSNADLDIYGSFMTGLWLPSSDI 1532
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV ++P +L+ L+ R N + ++ + + IP+
Sbjct: 1533 DIVVNYGKNMAIKPKNAQFLLK--------VLEKQFRTDLNGMIL---TMLCIPSAKIPV 1581
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS--ASPKCSHTSSD 1390
I LV ++ +++D S SP HT
Sbjct: 1582 IKLVT---------------------------------TENISVDISFRESPTSVHT--- 1605
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
G+ DLV + + PLA+VLK FL +R L+ +Y+G
Sbjct: 1606 --------------------GIAARDLVADCVREIIGLYPLAIVLKWFLRERGLNNTYTG 1645
Query: 1451 GLSSYCLMLLITRFLQHEHH 1470
GLSSYCL+L++ FL++ H
Sbjct: 1646 GLSSYCLVLMLISFLKNNEH 1665
>gi|21240658|gb|AAM44369.1| putative protein contains similarity to topoisomerase-related
function protein [Dictyostelium discoideum]
Length = 1880
Score = 76.3 bits (186), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 69/260 (26%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI F V + + + ++ LW + +I+GS TGL LPSSD+
Sbjct: 1473 LHQEILDFVSDVGERTYPHVQSCHHVIGLIRSVVKKLWSNADLDIYGSFMTGLWLPSSDI 1532
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV ++P +L+ L+ R N + ++ + + IP+
Sbjct: 1533 DIVVNYGKNMAIKPKNAQFLLK--------VLEKQFRTDLNGMIL---TMLCIPSAKIPV 1581
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS--ASPKCSHTSSD 1390
I LV ++ +++D S SP HT
Sbjct: 1582 IKLVT---------------------------------TENISVDISFRESPTSVHT--- 1605
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
G+ DLV + + PLA+VLK FL +R L+ +Y+G
Sbjct: 1606 --------------------GIAARDLVADCVREIIGLYPLAIVLKWFLRERGLNNTYTG 1645
Query: 1451 GLSSYCLMLLITRFLQHEHH 1470
GLSSYCL+L++ FL++ H
Sbjct: 1646 GLSSYCLVLMLISFLKNNEH 1665
>gi|330792667|ref|XP_003284409.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
gi|325085656|gb|EGC39059.1| hypothetical protein DICPUDRAFT_93688 [Dictyostelium purpureum]
Length = 1460
Score = 75.9 bits (185), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 107/270 (39%), Gaps = 83/270 (30%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI F V + + + ++ LW + ++FGS TGL LPSSD+
Sbjct: 1067 LHQEILDFVSDVGERTYPHVQSCHQVIGWIRAVVKKLWSHADLDLFGSFMTGLWLPSSDI 1126
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRN-GIKETCLQHAARYLANQEWVKSD------SLKTV 1325
D+VV G N IK Q + L Q +++D S+ +
Sbjct: 1127 DIVVNY----------------GNNMSIKPKNAQFLLKVLEKQ--IRNDLDGFILSMVCI 1168
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS--ASPK 1383
+ IP+I LV ++ +++D S SP
Sbjct: 1169 PSAKIPVIKLVT---------------------------------TENISVDISFRESPT 1195
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
HT G+ DL+ + + PLA+VLK FL +R
Sbjct: 1196 SIHT-----------------------GIAARDLIADCVKDVVGLYPLAIVLKWFLRERG 1232
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQHEHHLGR 1473
L+ +Y+GGLSSYCL+L++ FL++ H R
Sbjct: 1233 LNNTYTGGLSSYCLVLMLISFLKNNEHCPR 1262
>gi|18423551|ref|NP_568798.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
gi|27754278|gb|AAO22592.1| unknown protein [Arabidopsis thaliana]
gi|332009022|gb|AED96405.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
Length = 530
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 106/268 (39%), Gaps = 85/268 (31%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC A K + AV+ V+ ++ +WP + +FGS TGL LP+SD+
Sbjct: 121 LHKEIVDFC-DFLLPTQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDI 179
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+V+ E+G+ + G++ + R +A +L + +PI
Sbjct: 180 DVVIL-----------ESGLTNPQLGLRALSRALSQRGIAK-------NLLVIAKARVPI 221
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I V + +++ D+ + D PK +
Sbjct: 222 IKFV---------------------------EKKSNIAFDL-SFDMENGPKAAE------ 247
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
+++ + P PL L+LK FL R L++ YSGG+
Sbjct: 248 ------------------------FIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGI 283
Query: 1453 SSYCLMLLITRFLQH--------EHHLG 1472
SY L+ ++ FL++ EH+LG
Sbjct: 284 GSYALLAMLIAFLKYLKDGRSAPEHNLG 311
>gi|405123317|gb|AFR98082.1| DNA polymerase sigma [Cryptococcus neoformans var. grubii H99]
Length = 649
Score = 75.1 bits (183), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 111/264 (42%), Gaps = 58/264 (21%)
Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
S L EEI++F K V+ + + + ++ +TR++ LWP + FGS T L LP
Sbjct: 5 QSCLTEEINAFYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQ 63
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
D+DLVV + + K+ L + A ++ +D + +
Sbjct: 64 GDIDLVVAHKYLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARAR 107
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PII V TL+ ++HV S L DS+
Sbjct: 108 VPIIKFV-------------------------TLEGESHVTS----LADSSKQG------ 132
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
A + +DIS G+ ++ + + P + L L++K FL+ RS+++ Y+
Sbjct: 133 ----AIGKINVDISLNQ--GNGVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYT 186
Query: 1450 GGLSSYCLMLLITRFLQHEHHLGR 1473
GGL SY ++ ++ FLQ L R
Sbjct: 187 GGLGSYSVICMVISFLQLHPKLRR 210
>gi|168031583|ref|XP_001768300.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680478|gb|EDQ66914.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 787
Score = 74.7 bits (182), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 105/257 (40%), Gaps = 83/257 (32%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC+ VA ++ AV+RV+ +Q +WP S+ +FGS ATGL LP+SDV
Sbjct: 203 LHQEIVDFCEFVAPTEEEQQ-MRETAVERVSGVVQSIWPHSQVKVFGSFATGLYLPTSDV 261
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV ++G ++G+K A+ L K +++ + +PI
Sbjct: 262 DVVVL-----------DSGCTALQDGLKAL-----AKALTRGHVGK--NIQVIGKARVPI 303
Query: 1333 IMLVVEV---PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
I V V P D ++ D + P+ +
Sbjct: 304 IKFVETVSNIPFD-------------------------------ISFDVANGPEA----A 328
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D IKAA P PL LVLK FL R L++ Y
Sbjct: 329 DFIKAAMGA--------------------------IPPLRPLCLVLKIFLQQRELNEVYQ 362
Query: 1450 GGLSSYCLMLLITRFLQ 1466
GG+ SY L++++ LQ
Sbjct: 363 GGIGSYALLVMLLTHLQ 379
>gi|410983511|ref|XP_003998082.1| PREDICTED: PAP-associated domain-containing protein 5 [Felis catus]
Length = 514
Score = 74.7 bits (182), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/269 (25%), Positives = 106/269 (39%), Gaps = 88/269 (32%)
Query: 1203 KASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNA 1262
+ +L L + LHEEI F + ++ K + V R+ ++ LWP + IFGS
Sbjct: 52 EVTLELSKADLHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFK 110
Query: 1263 TGLSLPSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWV 1317
TGL LP+SD+DLVV LP L ++EA L +
Sbjct: 111 TGLYLPTSDIDLVVFGKWENLP----LWTLEEA--------------------LRKHKVA 146
Query: 1318 KSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALD 1377
DS+K ++ +PII L + T +K D
Sbjct: 147 DEDSVKVLDKATVPIIKLT---------------------DSFTEVKVD----------- 174
Query: 1378 DSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQ 1437
S + ++AA ++ + T+++P L LVLKQ
Sbjct: 175 ------ISFNVQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQ 208
Query: 1438 FLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
FL R L++ ++GG+ SY L L+ FLQ
Sbjct: 209 FLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 237
>gi|449446931|ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cucumis
sativus]
Length = 544
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 112/271 (41%), Gaps = 91/271 (33%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC+ ++ E AR + AV+RV ++ +WP + +FGS TGL LP+
Sbjct: 126 LHKEIVDFCEFLSPTEEERVAR----DSAVERVFSVVKHIWPHCKVEVFGSFQTGLYLPT 181
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V+ +GI + + G LQ +R L+ + K ++ +
Sbjct: 182 SDIDVVIL-----------GSGIPKPQLG-----LQALSRALSQKGIAK--KIQVIGKAR 223
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PII + + + D ++ D PK ++
Sbjct: 224 VPIIKFI---------------------EKQSGISFD-------ISFDVQNGPK----AA 251
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D IK A S ++P PL L+LK FL R L++ YS
Sbjct: 252 DFIKGAVS--------------------------KWPPLRPLCLILKVFLQQRELNEVYS 285
Query: 1450 GGLSSYCLMLLITRFLQH--------EHHLG 1472
GGL SY L+ ++ LQ EH+LG
Sbjct: 286 GGLGSYALLTMLMAMLQSINVPPSSLEHNLG 316
>gi|119910013|ref|XP_001256516.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
gi|297485254|ref|XP_002694925.1| PREDICTED: PAP-associated domain-containing protein 5 [Bos taurus]
gi|296478153|tpg|DAA20268.1| TPA: DNA polymerase sigma-like [Bos taurus]
Length = 467
Score = 74.3 bits (181), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 102/260 (39%), Gaps = 88/260 (33%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD
Sbjct: 14 LLHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSD 72
Query: 1272 VDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
+DLVV LP L ++EA L + DS+K ++
Sbjct: 73 IDLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLD 108
Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
+PII L + T +K D S
Sbjct: 109 KATVPIIKLT---------------------DSFTEVKVD-----------------ISF 130
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 131 NVQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNE 170
Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 171 VFTGGIGSYSLFLMAVSFLQ 190
>gi|145479659|ref|XP_001425852.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392924|emb|CAK58454.1| unnamed protein product [Paramecium tetraurelia]
Length = 1013
Score = 74.3 bits (181), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 114/262 (43%), Gaps = 52/262 (19%)
Query: 1216 EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLW---PRSRTNIFGSNATGLSLPSSDV 1272
+I F + +E P+ A RV +Q ++ P +FGS ATGL+L SD+
Sbjct: 670 DILDFTDNIMSEYENMLPFRLLAFDRVKSVIQKVFLGIPDGMIMLFGSCATGLALIDSDI 729
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+ + V N RN +K + + ++WV + + N+A+PI
Sbjct: 730 DIGINGLEVYN------------RNMLK-SHFDNLFFEFTRKKWVVKAN--PIFNSAVPI 774
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I L ++ ++ + LDD +
Sbjct: 775 IKLEIDPQINIFEYEGRN-------------------------LDDQ---QIQLWKKLKQ 806
Query: 1393 KAATSVRLDISFK------SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
K + +++DISF P+H G TTDLVK+ E++P+ + L+LK + L +
Sbjct: 807 KLKSGIKVDISFNFNGNGNYPTHIGSITTDLVKKWMEEYPSLQQIVLILKSMIKKLKLSE 866
Query: 1447 SYSGGLSSYCLMLLITRFLQHE 1468
SY+GGLSSY L++++ +++ +
Sbjct: 867 SYTGGLSSYSLIIMVYSYMREQ 888
>gi|145530896|ref|XP_001451220.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418864|emb|CAK83823.1| unnamed protein product [Paramecium tetraurelia]
Length = 367
Score = 73.9 bits (180), Expect = 7e-10, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 42/68 (61%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R+DISF+ P H G+Q L+K P PL +VLKQFL+ SL Y GGLSSY L
Sbjct: 176 RIDISFQEPRHNGIQCVSLIKSYLNYLPELRPLTMVLKQFLSQSSLLDPYQGGLSSYGLN 235
Query: 1459 LLITRFLQ 1466
L+I FLQ
Sbjct: 236 LMIISFLQ 243
>gi|363738258|ref|XP_414098.3| PREDICTED: PAP-associated domain-containing protein 5 [Gallus gallus]
Length = 534
Score = 73.9 bits (180), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 107/268 (39%), Gaps = 88/268 (32%)
Query: 1204 ASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNAT 1263
A +++ ++LHEEI F K ++ + + + V R+ ++ LWP + IFGS T
Sbjct: 25 AIAAVVKAILHEEIIDFYKYMSPRHEEERMRME-VVNRIENVIKELWPNADVQIFGSFKT 83
Query: 1264 GLSLPSSDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVK 1318
GL LP+SD+DLVV LP L ++EA L
Sbjct: 84 GLYLPTSDIDLVVFGKWETLP----LWTLEEA--------------------LRKHNVAD 119
Query: 1319 SDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDD 1378
+S+K ++ +PII L + T +K D
Sbjct: 120 ENSVKVLDKATVPIIKLT---------------------DSFTEVKVD------------ 146
Query: 1379 SASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
S + +KAA ++ D +K ++P L LVLKQF
Sbjct: 147 -----ISFNVQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQF 181
Query: 1439 LADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L R L++ ++GG+ SY L L+ FLQ
Sbjct: 182 LLQRDLNEVFTGGIGSYSLFLMAVSFLQ 209
>gi|344289184|ref|XP_003416325.1| PREDICTED: PAP-associated domain-containing protein 5-like [Loxodonta
africana]
Length = 595
Score = 73.9 bits (180), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 69/264 (26%), Positives = 103/264 (39%), Gaps = 88/264 (33%)
Query: 1208 LMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
L LLHEEI F + ++ K + V R+ ++ LWP + IFGS TGL L
Sbjct: 91 LSSELLHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYL 149
Query: 1268 PSSDVDLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
P+SD+DLVV LP L ++EA L + DS+
Sbjct: 150 PTSDIDLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSV 185
Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
K ++ +PII L + T +K D
Sbjct: 186 KVLDKATVPIIKLT---------------------DSFTEVKVD---------------- 208
Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
S + ++AA ++ + T+++P L LVLKQFL R
Sbjct: 209 -ISFNVQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQR 247
Query: 1443 SLDQSYSGGLSSYCLMLLITRFLQ 1466
L++ ++GG+ SY L L+ FLQ
Sbjct: 248 DLNEVFTGGIGSYSLFLMAVSFLQ 271
>gi|395505923|ref|XP_003757286.1| PREDICTED: PAP-associated domain-containing protein 5, partial
[Sarcophilus harrisii]
Length = 615
Score = 73.9 bits (180), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 116 LHEEISDFYEYMSPRPEEEKMRME-VVNRIENVIKELWPSADVQIFGSFKTGLYLPTSDI 174
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 175 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 210
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 211 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 232
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ +KAA ++ + T+++P L LVLKQFL R L++
Sbjct: 233 VQNGVKAAQLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 272
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 273 FTGGIGSYSLFLMAVSFLQ 291
>gi|58260578|ref|XP_567699.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57229780|gb|AAW46182.1| hypothetical protein CNK02250 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 779
Score = 73.9 bits (180), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 123/314 (39%), Gaps = 67/314 (21%)
Query: 1169 QDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHS---------LLHEEIDS 1219
Q+H S RD A P E Q R A + + LL EEI++
Sbjct: 90 QEHKARKSDRKRDNSRGIEAGPRNKKEEQKAAERHAPWTELVDVDLCRDPVDLLTEEINA 149
Query: 1220 FCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
F K V+ + + + ++ +TR++ LWP + FGS T L LP D+DLVV
Sbjct: 150 FYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHK 208
Query: 1280 PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEV 1339
+ + K+ L + A ++ +D + + +PII V
Sbjct: 209 YLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARARVPIIKFV--- 249
Query: 1340 PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVR 1399
TL+ +HV S + S + +
Sbjct: 250 ----------------------TLEGKSHVSS------------LEYFSKQ--EGVGKIN 273
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DIS + G+ ++ + + P + L L++K FL+ RS+++ Y+GGL SY ++
Sbjct: 274 VDISLNQAN--GVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYTGGLGSYSVIC 331
Query: 1460 LITRFLQHEHHLGR 1473
++ FLQ L R
Sbjct: 332 MVISFLQLHPKLRR 345
>gi|9759071|dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana]
Length = 533
Score = 73.6 bits (179), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 59/259 (22%), Positives = 104/259 (40%), Gaps = 77/259 (29%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC + A K + AV+ V+ ++ +WP + +FGS TGL LP+SD+
Sbjct: 121 LHKEIVDFCDFLLP-TQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDI 179
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+V+ E+G+ + G++ + R +A +L + +PI
Sbjct: 180 DVVIL-----------ESGLTNPQLGLRALSRALSQRGIAK-------NLLVIAKARVPI 221
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I V + +++ D+ + D PK +
Sbjct: 222 IKFV---------------------------EKKSNIAFDL-SFDMENGPKAAE------ 247
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
+++ + P PL L+LK FL R L++ YSGG+
Sbjct: 248 ------------------------FIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGI 283
Query: 1453 SSYCLMLLITRFLQHEHHL 1471
SY L+ ++ FL+ + +L
Sbjct: 284 GSYALLAMLIAFLKVQVYL 302
>gi|297792777|ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp.
lyrata]
gi|297310108|gb|EFH40532.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp.
lyrata]
Length = 530
Score = 73.6 bits (179), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 63/268 (23%), Positives = 105/268 (39%), Gaps = 85/268 (31%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC + A K + AV+ V+ + +WP + +FGS TGL LP+SD+
Sbjct: 121 LHKEIVDFCDFLLP-TQAEKAERDAAVESVSSVITYIWPSCKVEVFGSYKTGLYLPTSDI 179
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+V+ E+G+ + G++ + R +A +L + +PI
Sbjct: 180 DVVIL-----------ESGLTNPQLGLRALSRALSQRGIAK-------NLVVIAKARVPI 221
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I V + +++ D+ + D PK + D +
Sbjct: 222 IKFV---------------------------EKKSNIAFDL-SFDMENGPKAAEFIQDAV 253
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
+ P PL L+LK FL R L++ YSGG+
Sbjct: 254 S------------------------------KLPPLRPLCLILKVFLQQRELNEVYSGGI 283
Query: 1453 SSYCLMLLITRFLQH--------EHHLG 1472
SY L+ ++ FL++ EH+LG
Sbjct: 284 GSYALLAMLIAFLKYLKDGRSAPEHNLG 311
>gi|348500306|ref|XP_003437714.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Oreochromis niloticus]
Length = 672
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 70/259 (27%), Positives = 100/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F ++ K + V R+ + LWP + +FGS +TGL LP+SD+
Sbjct: 180 LHEEIIDFYNYISPRPEEEKMRLE-VVDRIKEVIHDLWPSAEVEVFGSFSTGLYLPTSDI 238
Query: 1273 DLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + +S+K ++
Sbjct: 239 DLVVFGKWESLP----LWTLEEA--------------------LRKKNVADENSIKVLDK 274
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L ++T +K D S
Sbjct: 275 ATVPIIKLT---------------------DSYTEVKVD-----------------ISFN 296
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+KAA RL FK E++P L LVLKQFL R L++
Sbjct: 297 VMSGVKAA---RLIKEFK-----------------EKYPVLPYLVLVLKQFLLQRDLNEV 336
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 337 FTGGIGSYSLFLMAVSFLQ 355
>gi|312077329|ref|XP_003141256.1| PAP/25A associated domain-containing protein [Loa loa]
gi|307763579|gb|EFO22813.1| PAP/25A associated domain-containing protein [Loa loa]
Length = 419
Score = 73.6 bits (179), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 109/268 (40%), Gaps = 86/268 (32%)
Query: 1202 RKASLSLMHSLLHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIF 1258
R+ +LSL LHEE+ ++ K E R +RV LQ +WP ++ ++F
Sbjct: 56 RRYALSLRG--LHEELLDLYAWLKPSPLEKALRLR----VFERVRGVLQRIWPTAKIDVF 109
Query: 1259 GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVK 1318
GS T L LP+SD+D+VV V P+ + + +GI E
Sbjct: 110 GSLYTSLFLPTSDIDVVVESDLVSEEPPLWKTAVALKESGITE----------------- 152
Query: 1319 SDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDD 1378
S+ ++ +PI+ +V D ++ D+
Sbjct: 153 --SINVLDKAFVPIVKMV---------------------------DKDTKIYLDI----- 178
Query: 1379 SASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQF 1438
S + ++ S R ++++ ++P PL LVLKQF
Sbjct: 179 ---------SFNTVQGVRSAR-----------------FIEDMKMRYPVLEPLVLVLKQF 212
Query: 1439 LADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L R L+Q ++GGLSSY L+L++ FLQ
Sbjct: 213 LMQRQLNQVFTGGLSSYGLILMLISFLQ 240
>gi|134117055|ref|XP_772754.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50255372|gb|EAL18107.1| hypothetical protein CNBK1280 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 779
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 123/314 (39%), Gaps = 67/314 (21%)
Query: 1169 QDHLIAISQLARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHS---------LLHEEIDS 1219
Q+H S RD A P E Q R A + + LL EEI++
Sbjct: 90 QEHKARKSDRKRDNSRGIEAGPRNKKEEQKAAERHAPWTELVDVDLCRDPVDLLTEEINA 149
Query: 1220 FCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLP 1279
F K V+ + + + ++ +TR++ LWP + FGS T L LP D+DLVV
Sbjct: 150 FYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGDIDLVVAHK 208
Query: 1280 PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEV 1339
+ + K+ L + A ++ +D + + +PII V
Sbjct: 209 YLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARARVPIIKFV--- 249
Query: 1340 PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVR 1399
TL+ +HV S + S + +
Sbjct: 250 ----------------------TLEGKSHVSS------------LEYFSKQ--EGIGKIN 273
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DIS + G+ ++ + + P + L L++K FL+ RS+++ Y+GGL SY ++
Sbjct: 274 VDISLNQAN--GVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYTGGLGSYSVIC 331
Query: 1460 LITRFLQHEHHLGR 1473
++ FLQ L R
Sbjct: 332 MVISFLQLHPKLRR 345
>gi|59800139|sp|Q8NDF8.2|PAPD5_HUMAN RecName: Full=PAP-associated domain-containing protein 5; AltName:
Full=Terminal uridylyltransferase 3; Short=TUTase 3;
AltName: Full=Topoisomerase-related function protein 4-2;
Short=TRF4-2
Length = 572
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 120 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 178
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 179 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 214
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 215 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 236
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 237 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 276
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 277 FTGGIGSYSLFLMAVSFLQ 295
>gi|358341786|dbj|GAA49377.1| DNA polymerase sigma subunit [Clonorchis sinensis]
Length = 826
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 67/119 (56%), Gaps = 10/119 (8%)
Query: 1362 TLKH---DNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
TLKH + + S++ LD + P T + T +++DISF + ++ L+
Sbjct: 655 TLKHALSSSGISSEITVLDKATVPIVKMTDKE-----TGLKVDISFNMIN--SVRAAVLI 707
Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
++ FP L VLKQFL R+L++ ++GGLSSY L+L++ RFLQ HL P+N+
Sbjct: 708 QDYMRTFPCMPYLVFVLKQFLLQRNLNEVWTGGLSSYALILMVVRFLQVRAHLIYPLNR 766
Score = 44.3 bits (103), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 35/67 (52%), Gaps = 7/67 (10%)
Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI F K E AR + V ++ + +WP ++FGS TGL LP+
Sbjct: 582 LHNEIKDFYNYIKPTDEEQYAR----DVVVSKIKDVVHSMWPDCEVDVFGSFKTGLYLPT 637
Query: 1270 SDVDLVV 1276
SD+D+V+
Sbjct: 638 SDIDMVI 644
>gi|301756837|ref|XP_002914273.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Ailuropoda melanoleuca]
Length = 593
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 141 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 199
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 200 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 235
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 236 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 257
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 258 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 297
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 298 FTGGIGSYSLFLMAVSFLQ 316
>gi|224004818|ref|XP_002296060.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586092|gb|ACI64777.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 775
Score = 73.2 bits (178), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 115/261 (44%), Gaps = 25/261 (9%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYI-NWAVKRVTRSLQVLWPRSRTNI--FGSNATGLSLPS 1269
LH E+ SF K + E T + + VKRVT + + + + + FGS TGL LP+
Sbjct: 160 LHNEMVSFVKLM--EPTKEELEVRERMVKRVTELAERTFGKGKCEVLPFGSQVTGLCLPN 217
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETC--LQHAARYLANQEWVKSDSLKTVEN 1327
SD+D V+ P + + G G K L + + ++ + + +
Sbjct: 218 SDIDFVIRFPKKKKDATKIKKG------GSKPDAINLDESEKKTKEKDGGQEEGEDLEDL 271
Query: 1328 TAIPIIMLVVEVPHDL-IASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
+ P+ L V + I S ++ +D +H +++ ++ + P
Sbjct: 272 ISNPLHKLAEAVREEFGIRSELDTMDPAVDDEVDDASEHLSYLE----VIEQTRVPLVKF 327
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
T A ++ +D+ F P G ++ +L+ E P PL +LK FLA R +++
Sbjct: 328 TV-----APYNLDIDVCFDQPG--GPESAELMHRFMESMPPLRPLTFILKYFLASRDMNK 380
Query: 1447 SYSGGLSSYCLMLLITRFLQH 1467
Y+GG+ SY L ++I FLQ
Sbjct: 381 PYTGGIGSYLLQIMIVSFLQQ 401
>gi|256818782|ref|NP_001035375.2| PAP-associated domain-containing protein 5 isoform b [Homo sapiens]
Length = 651
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 199 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 257
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 258 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 293
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 294 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 315
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 316 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 355
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 356 FTGGIGSYSLFLMAVSFLQ 374
>gi|332845909|ref|XP_003315148.1| PREDICTED: PAP-associated domain-containing protein 5 [Pan
troglodytes]
Length = 586
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 87 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 145
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 146 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 181
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 182 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 203
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 204 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 243
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 244 FTGGIGSYSLFLMAVSFLQ 262
>gi|296231051|ref|XP_002760982.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Callithrix jacchus]
Length = 664
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 212 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 270
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 271 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 306
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 307 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 328
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 329 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 368
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 369 FTGGIGSYSLFLMAVSFLQ 387
>gi|441597299|ref|XP_003263084.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Nomascus leucogenys]
Length = 666
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 214 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 272
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 273 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 308
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 309 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 330
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 331 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 370
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 371 FTGGIGSYSLFLMAVSFLQ 389
>gi|355710188|gb|EHH31652.1| hypothetical protein EGK_12764, partial [Macaca mulatta]
Length = 564
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 65 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 123
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 124 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 159
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 160 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 181
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 182 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 221
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 222 FTGGIGSYSLFLMAVSFLQ 240
>gi|297283970|ref|XP_002802516.1| PREDICTED: PAP-associated domain-containing protein 5 [Macaca
mulatta]
Length = 653
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 201 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 259
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 260 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 295
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 296 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 317
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 318 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 357
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 358 FTGGIGSYSLFLMAVSFLQ 376
>gi|380798533|gb|AFE71142.1| PAP-associated domain-containing protein 5 isoform a, partial [Macaca
mulatta]
Length = 618
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 119 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 177
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 178 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 213
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 214 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 235
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 236 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 275
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 276 FTGGIGSYSLFLMAVSFLQ 294
>gi|281338901|gb|EFB14485.1| hypothetical protein PANDA_002140 [Ailuropoda melanoleuca]
Length = 632
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 133 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 191
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 192 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 227
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 228 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 249
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 250 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 289
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 290 FTGGIGSYSLFLMAVSFLQ 308
>gi|119603153|gb|EAW82747.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
gi|119603154|gb|EAW82748.1| PAP associated domain containing 5, isoform CRA_a [Homo sapiens]
Length = 527
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 28 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 86
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 87 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 122
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 123 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 144
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 145 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 184
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 185 FTGGIGSYSLFLMAVSFLQ 203
>gi|441597295|ref|XP_003263083.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Nomascus leucogenys]
gi|348031139|emb|CCB84642.1| PAP associated domain containing 5 [Homo sapiens]
Length = 631
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 132 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 191 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 227 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 249 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 288
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 289 FTGGIGSYSLFLMAVSFLQ 307
>gi|321263807|ref|XP_003196621.1| hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
gi|317463098|gb|ADV24834.1| Hypothetical protein CGB_K1560W [Cryptococcus gattii WM276]
Length = 784
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 110/262 (41%), Gaps = 58/262 (22%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LL EEI++F K V+ + + + ++ +TR++ LWP + FGS T L LP D
Sbjct: 142 LLTEEINAFYKYVSPTREEFEVRL-FMIELITRTINKLWPEAEVTPFGSWQTQLYLPQGD 200
Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
+DLVV + + K+ L + A ++ +D + + +P
Sbjct: 201 IDLVVAHKYLSDAN--------------KQRLLAELGK--AMRQANITDVVAIIARARVP 244
Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
II V TL+ +HV S + L T +
Sbjct: 245 IIKFV-------------------------TLEGKSHVFS-LAYL----------TKQEG 268
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I + +DIS G+ ++ + + P + L L++K FL+ RS+++ Y+GG
Sbjct: 269 I---GKINVDISLNQ--GNGVTAGKIINQYLDALPGARQLILIVKYFLSQRSMNEVYTGG 323
Query: 1452 LSSYCLMLLITRFLQHEHHLGR 1473
L SY ++ ++ FLQ L R
Sbjct: 324 LGSYSVICMVISFLQLHPKLRR 345
>gi|403292555|ref|XP_003937307.1| PREDICTED: PAP-associated domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 631
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 132 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 191 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 227 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 249 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 288
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 289 FTGGIGSYSLFLMAVSFLQ 307
>gi|256818780|ref|NP_001035374.2| PAP-associated domain-containing protein 5 isoform a [Homo sapiens]
gi|194374871|dbj|BAG62550.1| unnamed protein product [Homo sapiens]
Length = 698
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 199 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 257
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 258 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 293
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 294 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 315
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 316 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 355
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 356 FTGGIGSYSLFLMAVSFLQ 374
>gi|397498213|ref|XP_003819879.1| PREDICTED: PAP-associated domain-containing protein 5, partial [Pan
paniscus]
Length = 593
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 94 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 152
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 153 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 188
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 189 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 210
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 211 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 250
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 251 FTGGIGSYSLFLMAVSFLQ 269
>gi|395839409|ref|XP_003792582.1| PREDICTED: PAP-associated domain-containing protein 5 [Otolemur
garnettii]
Length = 629
Score = 72.8 bits (177), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 130 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 188
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 189 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 224
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 225 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 246
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 247 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 286
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 287 FTGGIGSYSLFLMAVSFLQ 305
>gi|426382139|ref|XP_004057678.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 664
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 212 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 270
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 271 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 306
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 307 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 328
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 329 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 368
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 369 FTGGIGSYSLFLMAVSFLQ 387
>gi|390477686|ref|XP_002760981.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Callithrix jacchus]
Length = 631
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 132 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 191 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 227 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 249 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 288
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 289 FTGGIGSYSLFLMAVSFLQ 307
>gi|359319041|ref|XP_535307.4| PREDICTED: PAP-associated domain-containing protein 5 [Canis lupus
familiaris]
Length = 641
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 142 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 200
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 201 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 236
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 237 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 258
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 259 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 298
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 299 FTGGIGSYSLFLMAVSFLQ 317
>gi|354474676|ref|XP_003499556.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Cricetulus griseus]
Length = 464
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 12 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 70
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 71 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 106
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 107 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 128
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 129 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 168
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 169 FTGGIGSYSLFLMAVSFLQ 187
>gi|335308290|ref|XP_003361170.1| PREDICTED: PAP-associated domain-containing protein 5-like [Sus
scrofa]
Length = 511
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 12 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 70
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 71 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 106
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 107 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 128
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 129 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 168
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 169 FTGGIGSYSLFLMAVSFLQ 187
>gi|297283968|ref|XP_001083145.2| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Macaca mulatta]
Length = 700
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 201 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 259
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 260 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 295
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 296 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 317
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 318 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 357
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 358 FTGGIGSYSLFLMAVSFLQ 376
>gi|402908342|ref|XP_003916909.1| PREDICTED: PAP-associated domain-containing protein 5 [Papio anubis]
Length = 605
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 106 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 164
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 165 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 200
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 201 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 222
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 223 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 262
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 263 FTGGIGSYSLFLMAVSFLQ 281
>gi|312384741|gb|EFR29395.1| hypothetical protein AND_01692 [Anopheles darlingi]
Length = 1101
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/74 (45%), Positives = 48/74 (64%), Gaps = 2/74 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF S G+Q+ L+KE + +P L LVLKQFL R L++ ++GG+SSY
Sbjct: 378 TQVKVDISFNMES--GVQSAKLIKEYKQDYPVLEKLVLVLKQFLLQRDLNEVFTGGISSY 435
Query: 1456 CLMLLITRFLQHEH 1469
L+L+ FLQ H
Sbjct: 436 SLILMCISFLQQHH 449
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 71/150 (47%), Gaps = 31/150 (20%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EID F + T + V R+ + + LWP +R +FGS TGL LP+SD+
Sbjct: 271 LHQEIDQFYNHMIPTPTEHALRV-MVVNRIEQIVLNLWPSARVEMFGSFRTGLYLPTSDI 329
Query: 1273 DLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
DLVV P+R LE ++ RY+A + +S++ ++
Sbjct: 330 DLVVIGQWEKLPLRTLE------------------MELINRYIA-----EPNSVRVLDKA 366
Query: 1329 AIPIIMLV---VEVPHDLIASAASSVQSPK 1355
++PI+ L +V D+ + S VQS K
Sbjct: 367 SVPIVKLTDRETQVKVDISFNMESGVQSAK 396
>gi|431914108|gb|ELK15367.1| PAP-associated domain-containing protein 5 [Pteropus alecto]
Length = 530
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 31 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 89
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 90 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 125
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 126 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 147
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 148 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 187
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 188 FTGGIGSYSLFLMAVSFLQ 206
>gi|51328369|gb|AAH80314.1| Papd5 protein [Mus musculus]
Length = 583
Score = 72.4 bits (176), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 84 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 142
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 143 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 178
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 179 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 200
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 201 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 240
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 241 FTGGIGSYSLFLMAVSFLQ 259
>gi|327278603|ref|XP_003224050.1| PREDICTED: PAP-associated domain-containing protein 5-like [Anolis
carolinensis]
Length = 665
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI+ F K ++ ++ + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 169 LHEEINDFYKYMSPRPEEQRMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDI 227
Query: 1273 DLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L S+K ++
Sbjct: 228 DLVVFGKWETLP----LWTLEEA--------------------LRKHNVADKGSVKVLDK 263
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 264 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 285
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ +KAA ++ D +K ++P L LVLKQFL R L++
Sbjct: 286 VQNGVKAADLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDLNEV 325
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 326 FTGGIGSYSLFLMAVSFLQ 344
>gi|426243516|ref|XP_004015600.1| PREDICTED: PAP-associated domain-containing protein 5 [Ovis aries]
Length = 588
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 136 LHEEISDFYEYMSPRPEEEKMRME-VVNRIEGVIKELWPSADVQIFGSFKTGLYLPTSDI 194
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 195 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 230
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 231 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 252
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 253 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 292
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 293 FTGGIGSYSLFLMAVSFLQ 311
>gi|291410211|ref|XP_002721395.1| PREDICTED: DNA polymerase sigma-like [Oryctolagus cuniculus]
Length = 522
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 23 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 81
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 82 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 117
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 118 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 139
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 140 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 179
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 180 FTGGIGSYSLFLMAVSFLQ 198
>gi|60392891|sp|Q68ED3.2|PAPD5_MOUSE RecName: Full=PAP-associated domain-containing protein 5; AltName:
Full=Topoisomerase-related function protein 4-2;
Short=TRF4-2
gi|148878177|gb|AAI45738.1| Papd5 protein [Mus musculus]
gi|219519562|gb|AAI44797.1| Papd5 protein [Mus musculus]
Length = 633
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 134 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 192
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 193 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 228
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 229 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 250
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 251 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 290
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 291 FTGGIGSYSLFLMAVSFLQ 309
>gi|256818784|ref|NP_001157969.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
gi|256818786|ref|NP_001157970.1| PAP-associated domain-containing protein 5 isoform a [Mus musculus]
Length = 680
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 181 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 239
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 240 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 275
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 276 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 297
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 298 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 337
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 338 FTGGIGSYSLFLMAVSFLQ 356
>gi|410911160|ref|XP_003969058.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
Length = 803
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/261 (26%), Positives = 103/261 (39%), Gaps = 92/261 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEE+ F ++ E RK +N R+ ++ LWP + IFGS +TGL LP+
Sbjct: 254 LHEEVIDFYNFMSPRPEEAAMRKEVVN----RIETIIKELWPTADVQIFGSFSTGLYLPT 309
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV PP++ LE + L + S+K +
Sbjct: 310 SDIDLVVFGKWERPPLQELE-----------------------QALRKHNVAEPFSIKVL 346
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ +PII L + T +K D S
Sbjct: 347 DKATVPIIKLTDQ---------------------ETEVKVD-----------------IS 368
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
+KAA+ ++ D VK +P L VLKQFL R L+
Sbjct: 369 FNVETGVKAASFIK----------------DYVK----MYPVLPYLIFVLKQFLLQRDLN 408
Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
+ ++GG+SSY L+L++ FLQ
Sbjct: 409 EVFTGGISSYSLILMVISFLQ 429
>gi|351712688|gb|EHB15607.1| PAP-associated domain-containing protein 5, partial [Heterocephalus
glaber]
Length = 599
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 101/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 100 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 158
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 159 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 194
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 195 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 216
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ + T+++P L LVLKQFL R L++
Sbjct: 217 VQNGVRAADLIK--------------------DFTKKYPVLPYLVLVLKQFLLQRDLNEV 256
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 257 FTGGIGSYSLFLMAVSFLQ 275
>gi|326927263|ref|XP_003209812.1| PREDICTED: PAP-associated domain-containing protein 5-like [Meleagris
gallopavo]
Length = 571
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 102/260 (39%), Gaps = 88/260 (33%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
+LHEEI F K ++ + + + V R+ ++ LWP + IFGS TGL LP+SD
Sbjct: 70 VLHEEIIDFYKYMSPRHEEERMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSD 128
Query: 1272 VDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
+DLVV LP L ++EA L +S+K ++
Sbjct: 129 IDLVVFGKWETLP----LWTLEEA--------------------LRKHNVADENSVKVLD 164
Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
+PII L + T +K D S
Sbjct: 165 KATVPIIKLT---------------------DSFTEVKVD-----------------ISF 186
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
+ +KAA ++ D +K ++P L LVLKQFL R L++
Sbjct: 187 NVQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDLNE 226
Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 227 VFTGGIGSYSLFLMAVSFLQ 246
>gi|159478216|ref|XP_001697200.1| trf4 poly(A) polymerase [Chlamydomonas reinhardtii]
gi|158274674|gb|EDP00455.1| trf4 poly(A) polymerase [Chlamydomonas reinhardtii]
Length = 1144
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/256 (26%), Positives = 102/256 (39%), Gaps = 80/256 (31%)
Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH+E+ C+ Q E A + AV+ V S+ WP +R +FGS ATGL +P+S
Sbjct: 308 LHQELVELCELVQPTPEEAAARAGAVAAVREVVGSI---WPSARVEVFGSYATGLYVPTS 364
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
DVDLV+ N+ + LQ A+ L + S++ + +
Sbjct: 365 DVDLVILDSGCTNI----------------QAGLQALAKALTKRGV--GQSIQVIGKARV 406
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
PI+ E + L D V+ D + P+ +
Sbjct: 407 PIVKF--------------------ETVDYGNLAFD-------VSFDVANGPQAAE---- 435
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
LVKE+T ++P PL L LK FL R L++ Y+G
Sbjct: 436 --------------------------LVKEMTGRWPMMRPLILALKLFLQQRELNEVYTG 469
Query: 1451 GLSSYCLMLLITRFLQ 1466
G+ SY L+ L++ FLQ
Sbjct: 470 GIGSYALITLVSAFLQ 485
>gi|403337445|gb|EJY67938.1| Nucleotidyltransferase domain containing protein [Oxytricha
trifallax]
Length = 2816
Score = 72.0 bits (175), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 109/232 (46%), Gaps = 36/232 (15%)
Query: 1234 YINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGIL 1293
YI VK+ + + + ++GS ATGL++ SSD+D++V V N E G++
Sbjct: 2448 YIEQFVKQAYQE----FVNVKVQMYGSMATGLAIDSSDMDVLVS--GVFNQE-----GLI 2496
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQS 1353
+ I++ H +YL ++S+SL +E +P+I +++++ Q
Sbjct: 2497 DRPQLIEQMRRLH--KYLNELYCLESNSL--IETATVPVIKIILDL------------QK 2540
Query: 1354 PKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQ 1413
++ T N + DM L + ++ I F P H G++
Sbjct: 2541 IRDQNQEQT--SSNIIEEDMKHLKIDITFNEQEGENNYITG-------FGFAQPIHMGIK 2591
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
+K+ ++P L L+LK+FLA R L+ ++GGLSSY L+LLI F+
Sbjct: 2592 CCHFIKQKLHEYPQLQSLTLLLKKFLALRDLNSPFTGGLSSYGLILLIIAFI 2643
>gi|170059968|ref|XP_001865594.1| sigma DNA polymerase [Culex quinquefasciatus]
gi|167878539|gb|EDS41922.1| sigma DNA polymerase [Culex quinquefasciatus]
Length = 618
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 55/92 (59%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ Q+P L LV
Sbjct: 94 VLDKASVPIVKLTDRE-----TQVKVDISFNMQS--GVQSAELIKDFKRQYPVLAKLVLV 146
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 147 LKQFLLQRDLNEVFTGGISSYSLILMCISFLQ 178
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
+LHEEI+ F + T + V R+ + LWP +R +FGS TGL LP+SD
Sbjct: 2 ILHEEIEQFYAHMIPTGTEHTLRVQ-VVARIEAIVLNLWPMARVEMFGSFRTGLYLPTSD 60
Query: 1272 VDLVVC----LPPVRNLE 1285
+DLVV P+R LE
Sbjct: 61 IDLVVIGRWEKLPLRTLE 78
>gi|313218878|emb|CBY43207.1| unnamed protein product [Oikopleura dioica]
Length = 677
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/73 (46%), Positives = 48/73 (65%), Gaps = 2/73 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++DISF TG+ LVK QFPA L LVLKQFL R +++ ++GG+SSY
Sbjct: 48 TDIKVDISFNM--KTGVTAIGLVKGYIRQFPALRYLVLVLKQFLLQRDMNEVWTGGISSY 105
Query: 1456 CLMLLITRFLQHE 1468
L+L++ FLQH+
Sbjct: 106 GLILMVVSFLQHQ 118
>gi|402583139|gb|EJW77083.1| PAP/25A associated domain-containing protein, partial [Wuchereria
bancrofti]
Length = 416
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 61/254 (24%), Positives = 101/254 (39%), Gaps = 78/254 (30%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEE+ + R + +RV LQ +WP ++ ++FGS T L LP+SD+
Sbjct: 1 LHEELLDLYAWLKPSPLERALRLR-VFERVRGVLQRIWPTAKIDVFGSLYTSLFLPTSDI 59
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV V P+ + I +GI E S+ ++ +PI
Sbjct: 60 DVVVESDLVSEEPPLWKTAIALKESGITE-------------------SINVLDKAFVPI 100
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
+ +V D ++ D+ S + +
Sbjct: 101 VKMV---------------------------DKDTKIYLDI-----------SFNTVQGV 122
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
++A + +++ ++P PL LVLKQFL R L+Q ++GGL
Sbjct: 123 RSAKFI--------------------EDMKMRYPVLEPLVLVLKQFLMQRQLNQVFTGGL 162
Query: 1453 SSYCLMLLITRFLQ 1466
SSY L+L++ FLQ
Sbjct: 163 SSYGLILMLISFLQ 176
>gi|340371638|ref|XP_003384352.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Amphimedon queenslandica]
Length = 462
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 10/111 (9%)
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
EDA LK + ++ LD + P T + T V++DISF TG+ +
Sbjct: 168 EDAF---LKAQIAIEDSIMVLDKTTVPIIKFTDRE-----TEVKVDISFNQ--ETGIYSA 217
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+L+ + +QFP LAL++KQFLA R L++ Y GG++SY L+L++ F Q
Sbjct: 218 NLICQYVQQFPYLPYLALIVKQFLAQRQLNEVYYGGINSYSLILMLVSFFQ 268
Score = 43.9 bits (102), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
HEEI F ++ + + R + ++V + LWPR++ FGS T L LP+SD+
Sbjct: 93 FHEEIVHFYSYMSPKRSERCMRLR-VFEKVKAIILKLWPRAQVYPFGSFCTNLYLPTSDI 151
Query: 1273 DLVV 1276
D+VV
Sbjct: 152 DIVV 155
>gi|49899785|gb|AAH76872.1| LOC445836 protein, partial [Xenopus laevis]
Length = 563
Score = 71.6 bits (174), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F K ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 69 LHEEIIDFYKYMSPRPEEEKMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDI 127
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L +S+K ++
Sbjct: 128 DLVVFGKWENLP----LWTLEEA--------------------LRKHNVADENSVKVLDK 163
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 164 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 185
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ +KAA ++ D +K ++P L LVLKQFL R L++
Sbjct: 186 VQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDLNEV 225
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 226 FTGGIGSYSLFLMAVSFLQ 244
>gi|392598007|gb|EIW87329.1| Nucleotidyltransferase [Coniophora puteana RWD-64-598 SS2]
Length = 383
Score = 71.2 bits (173), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 109/255 (42%), Gaps = 74/255 (29%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKR-VTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LH+EI +F V T+++ Y+ V+ V R L+ + RS N+FGS A GLSLP D
Sbjct: 73 LHDEITAFVTYVGL--TSQEEYMREQVRALVHRILRNRFRRSEVNVFGSAAQGLSLPGGD 130
Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIP 1331
VD+V+ LP + R+ +K + Q AA + + V SD ++ IP
Sbjct: 131 VDIVLSLPHA------------DSRSELKTSLFQLAA--MLKRTGVASD-VEVRHQARIP 175
Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
II S Q+ E TL+ D V+++
Sbjct: 176 II----------------SFQTVPE---LCTLRFDIGVNNE------------------- 197
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
GL+ +V++ + P PL LVLK FL+ R L + +GG
Sbjct: 198 ------------------DGLKALPIVEKYMQDHPVLKPLMLVLKSFLSRRGLSSASNGG 239
Query: 1452 LSSYCLMLLITRFLQ 1466
++S+ L+ ++ F+Q
Sbjct: 240 INSFVLLCMVVSFIQ 254
>gi|326676716|ref|XP_686065.4| PREDICTED: DNA polymerase sigma [Danio rerio]
Length = 706
Score = 70.9 bits (172), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/261 (26%), Positives = 104/261 (39%), Gaps = 92/261 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F ++ E T R+ V R+ ++ LWP + IFGS +TGL LP+
Sbjct: 210 LHEEIMDFYNFMSPRPEEATMRQE----VVDRIESVIKELWPTADVQIFGSFSTGLFLPT 265
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV PP++ LE + +H+ + S+K +
Sbjct: 266 SDIDLVVFGKWEKPPLQQLE---------------QALRKHSV--------AEPYSIKVL 302
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ +PII L + T +K D S
Sbjct: 303 DKATVPIIKLTDQ---------------------ETEVKVD-----------------IS 324
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
IKAA+ ++ E +++ L VLKQFL R L+
Sbjct: 325 FNVETGIKAASFIK--------------------EYVKKYTVLPYLIFVLKQFLLQRDLN 364
Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
+ ++GG+SSY L+L++ FLQ
Sbjct: 365 EVFTGGISSYSLILMVISFLQ 385
>gi|449282422|gb|EMC89255.1| PAP-associated domain-containing protein 5, partial [Columba livia]
Length = 501
Score = 70.5 bits (171), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 101/262 (38%), Gaps = 94/262 (35%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F K ++ E R +N R+ ++ LWP + IFGS TGL LP+
Sbjct: 1 LHEEIIDFYKYMSPRPEEERMRMEVVN----RIENVIKELWPNADVQIFGSFKTGLYLPT 56
Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+DLVV LP L ++EA L +S+K
Sbjct: 57 SDIDLVVFGKWETLP----LWTLEEA--------------------LRKHNVADENSVKV 92
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
++ +PII L + T +K D
Sbjct: 93 LDKATVPIIKLT---------------------DSFTEVKVD-----------------I 114
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
S + +KAA ++ D +K ++P L LVLKQFL R L
Sbjct: 115 SFNVQNGVKAAQLIK----------------DFIK----KYPVLPYLVLVLKQFLLQRDL 154
Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
++ ++GG+ SY L L+ FLQ
Sbjct: 155 NEVFTGGIGSYSLFLMAVSFLQ 176
>gi|260798238|ref|XP_002594107.1| hypothetical protein BRAFLDRAFT_118785 [Branchiostoma floridae]
gi|229279340|gb|EEN50118.1| hypothetical protein BRAFLDRAFT_118785 [Branchiostoma floridae]
Length = 946
Score = 70.5 bits (171), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 59/105 (56%), Gaps = 8/105 (7%)
Query: 1363 LKHDNHVHSDMV-ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
L+H N D + LD + P T T V++DISF P+ G+++ L+K+
Sbjct: 376 LRHSNIADPDSIKVLDKATVPIVKLTDRK-----TDVKVDISFNMPN--GVKSAKLIKKF 428
Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ +P L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 429 MQDYPTLKYLVLVLKQFLLQRDLNEVFTGGISSYSLILMTVSFLQ 473
Score = 47.4 bits (111), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 37/60 (61%), Gaps = 5/60 (8%)
Query: 1213 LHEEIDSFCKQVA--AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LHEEI F K ++ E A + N V+R+T ++ +WPR++ IFGS TGL LP+S
Sbjct: 307 LHEEILDFYKYMSPRMEEFAMR---NEVVQRITNIIRSIWPRAKVEIFGSFETGLYLPTS 363
>gi|403338429|gb|EJY68454.1| hypothetical protein OXYTRI_10932 [Oxytricha trifallax]
Length = 1545
Score = 70.5 bits (171), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 95/225 (42%), Gaps = 46/225 (20%)
Query: 1242 VTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKE 1301
+T +Q P S ++GS+AT L LP SD+DLV+ + K + R E
Sbjct: 1183 ITSVIQKALPNSEVKVYGSHATKLCLPWSDIDLVIKTNSTDHYSTPKHVLSIITR----E 1238
Query: 1302 TCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHT 1361
H + W++ +K VEN ++P++ + ++ H + S +S
Sbjct: 1239 LQSDHTTK------WIQE--VKFVENASVPVVKVKCQIDHIMQTSGLAS----------- 1279
Query: 1362 TLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
++ L+ S +DI+ + +H GL+ LV+E
Sbjct: 1280 -----QNISKYQTFLEQPFS------------------IDITQLTDNHNGLECVKLVQEF 1316
Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ PL LVLKQ+L + Y GG+SSY L L+I +LQ
Sbjct: 1317 LSENEVIEPLILVLKQYLKVCQYNDPYFGGISSYALFLMIVSYLQ 1361
>gi|328713984|ref|XP_001948412.2| PREDICTED: PAP-associated domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 595
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 57/95 (60%), Gaps = 7/95 (7%)
Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
++ LD ++ P T AT VR+DISF + G+++ +++K + FP L
Sbjct: 205 NIQVLDKASVPIVKMTDK-----ATDVRVDISFNM--NNGVKSAEMIKHYMDVFPMLPKL 257
Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LVLKQFLA R +++ + GG+SSY L+L++ FLQ
Sbjct: 258 VLVLKQFLAQRDMNEVFYGGISSYSLILMVVSFLQ 292
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 42/68 (61%), Gaps = 7/68 (10%)
Query: 1213 LHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI+ F ++A E++ R ++RVT + WP+++ +FGS TGL LP+
Sbjct: 118 LHKEIEDFFAYMSATPEEHSMRLD----VLERVTNVIHAEWPKAKVEVFGSFRTGLYLPT 173
Query: 1270 SDVDLVVC 1277
SD+DLVV
Sbjct: 174 SDMDLVVI 181
>gi|325181595|emb|CCA16045.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
gi|325191995|emb|CCA26462.1| Poly(A) RNA polymerase putative [Albugo laibachii Nc14]
Length = 494
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 51/78 (65%), Gaps = 3/78 (3%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+++++DISF S GL T DL+K+ FP+ PL LVLK FLA R L++++ GG+ S+
Sbjct: 232 SAIQVDISFNISS--GLATADLIKQYMRIFPSFRPLVLVLKYFLAQRELNETFQGGIGSF 289
Query: 1456 CLMLLITRFL-QHEHHLG 1472
L L++ FL Q+ LG
Sbjct: 290 LLQLMVVSFLQQYRRQLG 307
>gi|157133264|ref|XP_001656207.1| sigma DNA polymerase, putative [Aedes aegypti]
gi|108870912|gb|EAT35137.1| AAEL012692-PA [Aedes aegypti]
Length = 491
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 55/92 (59%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ ++P L LV
Sbjct: 81 VLDKASVPIVKLTDRE-----TQVKVDISFNMQS--GVQSAELIKDFKRKYPVLAKLVLV 133
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 134 LKQFLLQRDLNEVFTGGISSYSLILMCISFLQ 165
Score = 46.6 bits (109), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 32/52 (61%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
V R+ + + LWP +R +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 14 VVARIEQIVLNLWPAARVEMFGSFRTGLYLPTSDIDLVVIGLWEKLPLRTLE 65
>gi|145478719|ref|XP_001425382.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392452|emb|CAK57984.1| unnamed protein product [Paramecium tetraurelia]
Length = 592
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 45/74 (60%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K + RLDIS + H GLQ L+++ + + PL L++KQFL L +YSGGL
Sbjct: 374 KQYMNKRLDISIQETKHNGLQCVQLIRKYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGL 433
Query: 1453 SSYCLMLLITRFLQ 1466
SSY L+L+I FLQ
Sbjct: 434 SSYGLILMIVSFLQ 447
>gi|198451695|ref|XP_001358484.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
gi|198131603|gb|EAL27623.2| GA14518 [Drosophila pseudoobscura pseudoobscura]
Length = 400
Score = 70.1 bits (170), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 111/275 (40%), Gaps = 92/275 (33%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
SLLH EI+ F + ++TA + Y+ AV+ + L +WP + +FGS TGL+LP
Sbjct: 26 SLLHLEIEGFYNFI--QSTATEFYLRAEAVRCIEDMLVSIWPGASVEVFGSFRTGLNLPL 83
Query: 1270 SDVDLVV-------CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
SD+DLVV PP+ LE + AR +A+ + ++
Sbjct: 84 SDIDLVVENRKSYWYTPPLYELES------------------ELVARGVADPQ-----TV 120
Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
V+ A+P++ V + +K D
Sbjct: 121 NVVDTAAVPVVKFTERV---------------------SQIKFD---------------- 143
Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
S +KAA ++ DL++E FP L +VLKQFLA
Sbjct: 144 -ISFNVGSGVKAAELIK----------------DLIQE----FPDLPKLVMVLKQFLAMH 182
Query: 1443 SLDQSY-SGGLSSYCLMLLITRFLQHEHHLGRPIN 1476
L+ Y SGG+SSY + L+ FLQH+ + N
Sbjct: 183 GLNDVYRSGGISSYGITLMCIGFLQHQSQSNKKYN 217
>gi|440900205|gb|ELR51393.1| PAP-associated domain-containing protein 5, partial [Bos grunniens
mutus]
Length = 563
Score = 70.1 bits (170), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 64 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 122
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 123 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 158
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 159 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 180
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ D K ++P L LVLKQFL R L++
Sbjct: 181 VQNGVRAADLIK----------------DFTK----KYPVLPYLVLVLKQFLLQRDLNEV 220
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 221 FTGGIGSYSLFLMAVSFLQ 239
>gi|145530898|ref|XP_001451221.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124418865|emb|CAK83824.1| unnamed protein product [Paramecium tetraurelia]
Length = 583
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 45/74 (60%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K + RLDIS + H GLQ L+++ + + PL L++KQFL L +YSGGL
Sbjct: 365 KQYMNKRLDISIQETKHNGLQCVQLIRKYIKNYEPLKPLTLIMKQFLHKSDLSDTYSGGL 424
Query: 1453 SSYCLMLLITRFLQ 1466
SSY L+L+I FLQ
Sbjct: 425 SSYGLILMIVSFLQ 438
>gi|118794510|ref|XP_321547.3| AGAP001564-PA [Anopheles gambiae str. PEST]
gi|19572384|emb|CAD27930.1| putative DNA topoisomerase [Anopheles gambiae]
gi|116116329|gb|EAA01236.4| AGAP001564-PA [Anopheles gambiae str. PEST]
Length = 1039
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 47/75 (62%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF S G+Q+ L+K ++P L LVLKQFL R L++ ++GG+SSY
Sbjct: 378 TQVKVDISFNMES--GVQSAKLIKGFKREYPVLEKLVLVLKQFLLQRDLNEVFTGGISSY 435
Query: 1456 CLMLLITRFLQHEHH 1470
L+L+ FLQ H
Sbjct: 436 SLILMCISFLQQHHQ 450
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 76/166 (45%), Gaps = 33/166 (19%)
Query: 1199 CPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
CP R + + LH+EI+ F + A T + V R+ + +Q LWP +R
Sbjct: 255 CPWRPPNFKYGRGIIGLHQEIEQFYAHMIATPTEHALRV-MVVSRIEKIVQNLWPSARVE 313
Query: 1257 IFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
+FGS TGL LP+SD+DLVV + P+R LE L
Sbjct: 314 MFGSFRTGLYLPTSDIDLVVIGQWTMLPLRTLE-----------------------MELI 350
Query: 1313 NQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
+Q + +S++ ++ ++PI+ L +V D+ + S VQS K
Sbjct: 351 SQGIAEPNSVRVLDKASVPIVKLTDRQTQVKVDISFNMESGVQSAK 396
>gi|426382137|ref|XP_004057677.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 631
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 132 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 190
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 191 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 226
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 227 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 248
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ D K ++P L LVLKQFL R L++
Sbjct: 249 VQNGVRAADLIK----------------DFTK----KYPVLPYLVLVLKQFLLQRDLNEV 288
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 289 FTGGIGSYSLFLMAVSFLQ 307
>gi|5565687|gb|AAD45198.1|AF089896_1 topoisomerase-related function protein [Homo sapiens]
Length = 517
Score = 69.7 bits (169), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 101/264 (38%), Gaps = 98/264 (37%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 30 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 82
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 83 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 119
Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
K ++ +PII L + V D+
Sbjct: 120 KVLDKATVPIIKLT---------------------------DQETEVKVDI--------- 143
Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
S ++AA + +K +++ L LVLKQFL R
Sbjct: 144 --SFNMETGVRAA--------------------EFIKNYMKKYSLLPYLILVLKQFLLQR 181
Query: 1443 SLDQSYSGGLSSYCLMLLITRFLQ 1466
L++ ++GG+SSY L+L+ FLQ
Sbjct: 182 DLNEVFTGGISSYSLILMAISFLQ 205
>gi|313241181|emb|CBY33472.1| unnamed protein product [Oikopleura dioica]
Length = 422
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 103/256 (40%), Gaps = 82/256 (32%)
Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LHEEI+ F K Q A + + W +++V + L WP ++ FGS TGL LP
Sbjct: 88 LHEEIEDFIKFMQPTESEQAMRDDVVWRIRQVVKEL---WPSAKLETFGSYNTGLYLPDG 144
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
D+D+V+ ++ + + RN + E R +A +E ++ +E +
Sbjct: 145 DIDMVIQ-------GQWEQLPMWQLRNKLVE-------RRIAREE-----NITVIEKAVV 185
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
PII L+ + + VH D+ S +S+
Sbjct: 186 PIIKLI---------------------------ESNTLVHVDI-----------SFNTSN 207
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
+AA LVK+ ++P L ++LK L R L++ + G
Sbjct: 208 GREAAA--------------------LVKKYMAEYPNLKQLVVLLKYILNHRGLNEVWKG 247
Query: 1451 GLSSYCLMLLITRFLQ 1466
GL SY L LL+ FLQ
Sbjct: 248 GLGSYALTLLVVNFLQ 263
>gi|145514940|ref|XP_001443375.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410753|emb|CAK75978.1| unnamed protein product [Paramecium tetraurelia]
Length = 962
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 107/248 (43%), Gaps = 52/248 (20%)
Query: 1235 INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPVRNLEPIKE 1289
IN+ + + L S+ +FGS ATGL+LP SD+D+ + C P NL PI+
Sbjct: 626 INYLYRGFIHHIYQLDFNSQVCLFGSCATGLALPESDIDIGITGFEMCSPSQLNL-PIQ- 683
Query: 1290 AGILEGRNGIKETCLQHAARYLANQEWVKSDSL-KTVENTAIPIIMLVVEVPHDLIASAA 1348
K T R++ N + KS + + ++A+P+I L V+ P ++
Sbjct: 684 ----------KLTEFLQKMRWVNNIVYFKSIYFFRAITSSAMPLIKLQVD-PTISFVQSS 732
Query: 1349 SSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISF---- 1404
+ P + D+V D PK + +D+SF
Sbjct: 733 LPIGLP---------------YIDLVLNPDEDIPKQIFS------------VDVSFFQYS 765
Query: 1405 --KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLIT 1462
K H GL +TDL + + P+ L+ K L R L+ Y GG+SS+C++ ++
Sbjct: 766 GPKQNQHLGLISTDLTLQWLSFYSELRPIVLLFKSLLKKRGLNDQYKGGISSFCIIQMVL 825
Query: 1463 RFLQHEHH 1470
FL+ +H
Sbjct: 826 AFLECFYH 833
>gi|194208578|ref|XP_001491462.2| PREDICTED: PAP-associated domain-containing protein 5 [Equus
caballus]
Length = 528
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 100/259 (38%), Gaps = 88/259 (33%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 29 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKGLWPSADVQIFGSFKTGLYLPTSDI 87
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 88 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 123
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII L + T +K D S
Sbjct: 124 ATVPIIKLT---------------------DSFTEVKVD-----------------ISFN 145
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+ ++AA ++ D K ++P L LVLKQFL R L++
Sbjct: 146 VQNGVRAADLIK----------------DFTK----KYPVLPYLVLVLKQFLLQRDLNEV 185
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
++GG+ SY L L+ FLQ
Sbjct: 186 FTGGIGSYSLFLMAVSFLQ 204
>gi|403331574|gb|EJY64740.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 316
Score = 69.3 bits (168), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 49/79 (62%), Gaps = 2/79 (2%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
V +DISF + G+ LVK+L + FP PL +VLK FL R L++ YSGG+ S+ L
Sbjct: 124 VNVDISFNRTN--GVYCVKLVKQLLQMFPELKPLMMVLKCFLKSRQLNEPYSGGVGSFLL 181
Query: 1458 MLLITRFLQHEHHLGRPIN 1476
+++T FLQ ++ LG N
Sbjct: 182 TMMVTSFLQRQYKLGNTNN 200
>gi|170592851|ref|XP_001901178.1| PAP/25A associated domain containing protein [Brugia malayi]
gi|158591245|gb|EDP29858.1| PAP/25A associated domain containing protein [Brugia malayi]
Length = 421
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/265 (24%), Positives = 105/265 (39%), Gaps = 80/265 (30%)
Query: 1202 RKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSN 1261
R+ +LSL LHEE+ + R + +RV LQ +WP ++ ++FGS
Sbjct: 58 RRYALSLRG--LHEELLDLYAWLKPSPLERALRLR-VFERVRGVLQRIWPTAKIDVFGSL 114
Query: 1262 ATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS 1321
T L LP+SD+D+VV V P+ + I +GI E S
Sbjct: 115 YTSLFLPTSDIDVVVESDLVSEEPPLWKTAIALKESGITE-------------------S 155
Query: 1322 LKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSAS 1381
+ ++ +PI+ +V D ++ D+
Sbjct: 156 INVLDKAFVPIVKMV---------------------------DKDTKIYLDI-------- 180
Query: 1382 PKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLAD 1441
S + +++A + D+ + P PL LVLKQFL
Sbjct: 181 ---SFNTVQGVRSAKFIE-DMKMRYP-------------------VLEPLVLVLKQFLMQ 217
Query: 1442 RSLDQSYSGGLSSYCLMLLITRFLQ 1466
R L+Q ++GGLSSY L+L++ FLQ
Sbjct: 218 RQLNQVFTGGLSSYGLILMLISFLQ 242
>gi|145491674|ref|XP_001431836.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398942|emb|CAK64438.1| unnamed protein product [Paramecium tetraurelia]
Length = 685
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/267 (22%), Positives = 109/267 (40%), Gaps = 73/267 (27%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P K S+ + L EI++F ++++ +KP I+ V+ V ++Q + +SR ++G
Sbjct: 324 PQHKISIDDIVKQLTFEINTFTQELSKYLDEQKPIIDKIVQLVDETVQNVQFKSRAFLYG 383
Query: 1260 SNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKS 1319
S TGL+L SD+D+V+ E ++ I+ L A ++K
Sbjct: 384 SCQTGLNLLDSDIDIVI--------ETVESEEII---------LLFKLAEQFKTTSFIK- 425
Query: 1320 DSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS 1379
+K +EN P++ +
Sbjct: 426 -DVKVIENAKKPVLKM-------------------------------------------- 440
Query: 1380 ASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFL 1439
+CS D + +DI+ H+G +T + + E ++F LAL+LK +
Sbjct: 441 ---QCSKEFQDKL-------IDITISRNDHSGRKTANSMIEFQKEFKQFRSLALMLKFYF 490
Query: 1440 ADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+L SY GGL+SYC++ +I LQ
Sbjct: 491 KSINLLNSYQGGLNSYCILTMILALLQ 517
>gi|145539774|ref|XP_001455577.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423385|emb|CAK88180.1| unnamed protein product [Paramecium tetraurelia]
Length = 1013
Score = 68.9 bits (167), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 130/302 (43%), Gaps = 59/302 (19%)
Query: 1178 LARDQEHPDVAFPLQPLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINW 1237
L+ QE P+ + Q VQ +K L+ + + +I F + +E P+
Sbjct: 635 LSIKQESPEKSLKEQLENVQ----QKGKQKLIEQI-NSDILDFTDNIMSEYEEMLPFRFL 689
Query: 1238 AVKRVTRSLQVLW-----PRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGI 1292
A +RV +Q ++ + +FGS ATGL+L SD+D+ + V N
Sbjct: 690 AFERVKSVIQKVFLGIPDGMITSRLFGSCATGLALLDSDIDIGINGLEVYN--------- 740
Query: 1293 LEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQ 1352
RN +K + + + ++WV + + N+++PII L ++ P I
Sbjct: 741 ---RNMLK-SHFDNLYFEFSRKKWVVKAN--PIFNSSVPIIKLEID-PQINIFEYEGRNL 793
Query: 1353 SPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFK------S 1406
++ LK K + +++DISF
Sbjct: 794 DEQQIQLWKRLKQ---------------------------KLKSGIKVDISFNFNGNGIY 826
Query: 1407 PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+H G TTDLVK+ E++P+ + L+LK + L +SY+GGLSSY L++++ +++
Sbjct: 827 STHLGSITTDLVKKWMEEYPSLQQIVLILKSMIKKLKLSESYTGGLSSYSLIIMVYSYMR 886
Query: 1467 HE 1468
+
Sbjct: 887 EQ 888
>gi|195053696|ref|XP_001993762.1| GH19417 [Drosophila grimshawi]
gi|193895632|gb|EDV94498.1| GH19417 [Drosophila grimshawi]
Length = 340
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 13/111 (11%)
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
+ +D++ LD ++ P T + T ++ DISF TG++ +L++ EQFP
Sbjct: 108 IATDVLVLDKASVPVVKFT-----EHITQIKFDISFNKT--TGVKAAELIQRYIEQFPEL 160
Query: 1429 TPLALVLKQFLADRSLDQSYS-GGLSSYCLMLLITRFLQH-----EHHLGR 1473
L +VLKQFL R L++ YS GG+SSY + L+ FLQ ++ LGR
Sbjct: 161 PKLVIVLKQFLMLRGLNEVYSTGGISSYAITLMCISFLQQHSSKNDNKLGR 211
Score = 47.4 bits (111), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH EI+ F + T + AV+R+ L LWP +FGS TGL+LP SD+
Sbjct: 23 LHMEIEHFYNYIVNTRTEYMMRLE-AVQRIENLLLGLWPEVSIEVFGSFRTGLNLPISDI 81
Query: 1273 DLVV 1276
D+ V
Sbjct: 82 DIAV 85
>gi|195045442|ref|XP_001991977.1| GH24509 [Drosophila grimshawi]
gi|193892818|gb|EDV91684.1| GH24509 [Drosophila grimshawi]
Length = 489
Score = 68.6 bits (166), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/92 (40%), Positives = 55/92 (59%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +FP L LV
Sbjct: 359 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKQFKREFPVLGKLVLV 411
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 412 LKQFLLLRDLNEVFTGGISSYSLILMCICFLQ 443
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 79/172 (45%), Gaps = 39/172 (22%)
Query: 1194 LEVQNCPTRKASLSLMHSL--LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQV 1248
++ + CP R + LHEEI+ F + V E+ R N VKR+ +Q
Sbjct: 247 VKYKGCPWRVRDFQYGDGIIGLHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVQS 302
Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCL 1304
+WP++ IFGS TGL LP+SD+DLVV P+R LE L R GI E C
Sbjct: 303 IWPQAVVEIFGSFRTGLFLPTSDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC- 355
Query: 1305 QHAARYLANQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+++ ++ ++PII L +V D+ + S VQS
Sbjct: 356 ----------------TVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQS 391
>gi|145541945|ref|XP_001456660.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424473|emb|CAK89263.1| unnamed protein product [Paramecium tetraurelia]
Length = 805
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 102/256 (39%), Gaps = 53/256 (20%)
Query: 1216 EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLV 1275
+I F Q+ EN + P R+ ++Q+L+ + +FGS T L+LP SD+D+
Sbjct: 480 DIKEFTDQIRRENDQQFPIRQLIFNRIQFTIQLLFKEAGVCLFGSCVTRLALPDSDIDI- 538
Query: 1276 VCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
GI + + +L+ WVK T +P+I L
Sbjct: 539 ---------------GITGLETHLLNQKMDAIIEFLSKMNWVKRIKPIYPTQTTLPLIKL 583
Query: 1336 VVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAA 1395
V+ I + ++ P H D+V +
Sbjct: 584 WVDPS---IPFRSGNMNLP---------------HIDLVC------------------QS 607
Query: 1396 TSVRLDISFKS-PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
+++DISF H GL +T+L +++ + L+ K L R L+ GG+SS
Sbjct: 608 QLIQVDISFFGHIQHQGLTSTELTCYWLQEYQELKTITLLFKSLLKKRGLNDQSKGGISS 667
Query: 1455 YCLMLLITRFLQHEHH 1470
+CL+L++ FL++ +
Sbjct: 668 FCLVLIVVAFLEYHYQ 683
>gi|432853107|ref|XP_004067543.1| PREDICTED: PAP-associated domain-containing protein 5-like [Oryzias
latipes]
Length = 679
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 53/96 (55%), Gaps = 7/96 (7%)
Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
S + LD + P T S T V++DISF S G++ L+KE E++P
Sbjct: 273 SAIKVLDKATVPIIKLTDS-----VTEVKVDISFNVES--GVKAARLIKEFKEKYPVLPY 325
Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 326 LVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 361
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 69/151 (45%), Gaps = 33/151 (21%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F ++ K + V R+ + LWP + +FGS +TGL LP+SD+
Sbjct: 186 LHEEIKDFYDYISPRPEEEKMRLE-VVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDI 244
Query: 1273 DLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L RN ++ ++K ++
Sbjct: 245 DLVVFGKWETLP----LWTLEEA--LRKRNVADKS------------------AIKVLDK 280
Query: 1328 TAIPIIML---VVEVPHDLIASAASSVQSPK 1355
+PII L V EV D+ + S V++ +
Sbjct: 281 ATVPIIKLTDSVTEVKVDISFNVESGVKAAR 311
>gi|308805789|ref|XP_003080206.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases (ISS) [Ostreococcus tauri]
gi|116058666|emb|CAL54373.1| S-M checkpoint control protein CID1 and related
nucleotidyltransferases (ISS) [Ostreococcus tauri]
Length = 555
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 99/257 (38%), Gaps = 83/257 (32%)
Query: 1213 LHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI F + AE ++R AV+RV ++ +WP +R + GS ATG+ LP
Sbjct: 127 LHNEIVDFSRFLEPTEAEASSR----TAAVERVRDVVKGIWPNARFEVHGSFATGMYLPG 182
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D V+ +N CL+ A LA ++ + ++ +
Sbjct: 183 SDIDAVILDSGAKN----------------PGVCLKALAIALARRDM--AIKIQLIAKAR 224
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PI+ V+S H ++ D + P +
Sbjct: 225 VPIVKF-------------EEVESG---------------HQFDISFDVANGPASAEIVR 256
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+N++ +FPA PL VLK FLA R L++ YS
Sbjct: 257 ENMR------------------------------RFPALRPLTTVLKAFLAQRGLNEVYS 286
Query: 1450 GGLSSYCLMLLITRFLQ 1466
GG+ SY L+ ++ LQ
Sbjct: 287 GGIGSYALLCMVMAHLQ 303
>gi|162317662|gb|AAI56330.1| PAP associated domain containing 5 [synthetic construct]
gi|162318878|gb|AAI57080.1| PAP associated domain containing 5 [synthetic construct]
Length = 442
Score = 68.2 bits (165), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T S T V++DISF G++ DL+K+ T+
Sbjct: 70 KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 123 KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 165
>gi|357130698|ref|XP_003566984.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Brachypodium distachyon]
Length = 619
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 99/260 (38%), Gaps = 89/260 (34%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC + A E ++R AV+ V+ ++ +WP + +FGS TGL LP+
Sbjct: 176 LHKEILDFCDFISPSAEEQSSRTA----AVQAVSDVVKHIWPHCKVEVFGSFRTGLYLPT 231
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V I E R + L A+ L+ + K ++ +
Sbjct: 232 SDIDVV----------------IFESRVKTPQVGLYALAKALSQKGVAK--KIQVIAKAR 273
Query: 1330 IPIIMLVVEV---PHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
+PI+ V V P D ++ D P+ +
Sbjct: 274 VPIVKFVERVSGIPFD-------------------------------ISFDIDGGPQAAD 302
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
D I+ + PA PL ++LK FL R L++
Sbjct: 303 FIKDAIR------------------------------KMPALRPLCMILKVFLHQRELNE 332
Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
Y+GG+ SY L+ ++ LQ
Sbjct: 333 VYTGGVGSYALLTMLITHLQ 352
>gi|297597347|ref|NP_001043830.2| Os01g0672700 [Oryza sativa Japonica Group]
gi|56201854|dbj|BAD73304.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
gi|56201907|dbj|BAD73357.1| polymerase (DNA directed) sigma-like [Oryza sativa Japonica Group]
gi|255673541|dbj|BAF05744.2| Os01g0672700 [Oryza sativa Japonica Group]
Length = 578
Score = 67.8 bits (164), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 63/276 (22%), Positives = 106/276 (38%), Gaps = 96/276 (34%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC + A E ++R AVK V+ ++ +WP+ + +FGS TGL LP+
Sbjct: 142 LHKEILDFCDFISPSAEEQSSRTA----AVKAVSNVIKHIWPQCKVEVFGSFRTGLFLPT 197
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V I + R + L A+ L+ + K ++ +
Sbjct: 198 SDIDVV----------------IFDSRVKTPQVGLYALAKALSQKGVAK--KIQVIAKAR 239
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PI+ V + + + D ++ D P+ +
Sbjct: 240 VPIVKFV---------------------------ERKSEIAFD-ISFDMDGGPQAADFIK 271
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D +K +FPA L ++LK FL R L++ Y+
Sbjct: 272 DYVK------------------------------KFPALRHLCMILKVFLHQRELNEVYT 301
Query: 1450 GGLSSYCLMLLITRFLQ-------------HEHHLG 1472
GG+ SY L+ ++ LQ EH+LG
Sbjct: 302 GGIGSYALLTMLITHLQLIWGGKDILGYRKKEHNLG 337
>gi|116235017|dbj|BAF34948.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 578
Score = 67.8 bits (164), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 63/276 (22%), Positives = 106/276 (38%), Gaps = 96/276 (34%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC + A E ++R AVK V+ ++ +WP+ + +FGS TGL LP+
Sbjct: 142 LHKEILDFCDFISPSAEEQSSRTA----AVKAVSNVIKHIWPQCKVEVFGSFRTGLFLPT 197
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V I + R + L A+ L+ + K ++ +
Sbjct: 198 SDIDVV----------------IFDSRVKTPQVGLYALAKALSQKGVAK--KIQVIAKAR 239
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PI+ V + + + D ++ D P+ +
Sbjct: 240 VPIVKFV---------------------------ERKSEIAFD-ISFDMDGGPQAADFIK 271
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D +K +FPA L ++LK FL R L++ Y+
Sbjct: 272 DYVK------------------------------KFPALRHLCMILKVFLHQRELNEVYT 301
Query: 1450 GGLSSYCLMLLITRFLQ-------------HEHHLG 1472
GG+ SY L+ ++ LQ EH+LG
Sbjct: 302 GGIGSYALLTMLITHLQLIWGGKDILGYRKKEHNLG 337
>gi|198427134|ref|XP_002121817.1| PREDICTED: similar to PAP-associated domain-containing protein 5
(Topoisomerase-related function protein 4-2) (TRF4-2)
[Ciona intestinalis]
Length = 391
Score = 67.4 bits (163), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/104 (39%), Positives = 60/104 (57%), Gaps = 13/104 (12%)
Query: 1369 VHSDMVA------LDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
V D+VA LD +A P + T V++DISF S G+Q+ +L+K+
Sbjct: 158 VSKDIVAEGSVKVLDRAAVPLIKFQHKE-----TLVKVDISFNIQS--GVQSVELIKDFM 210
Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+++PA L VLKQFL R L++ ++GGLSSY L+L+ FLQ
Sbjct: 211 KKYPALPKLIFVLKQFLLVRELNEVWTGGLSSYSLILMAISFLQ 254
>gi|118366560|ref|XP_001016496.1| Nucleotidyltransferase domain containing protein [Tetrahymena
thermophila]
gi|89298263|gb|EAR96251.1| Nucleotidyltransferase domain containing protein [Tetrahymena
thermophila SB210]
Length = 1873
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 123/284 (43%), Gaps = 46/284 (16%)
Query: 1203 KASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN----IF 1258
K + L ++ +I SF + + E+ +P R+ L+ + + + IF
Sbjct: 1459 KIAKRLFRRKINNDILSFTEMINLESYRHQPSRFLIFYRIQHLLKKQFEKVESQVDLRIF 1518
Query: 1259 GSNATGLSLPSSDVDLVV----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ 1314
GS ATGLSLP+SD+DL + C P + ++ L A L
Sbjct: 1519 GSCATGLSLPTSDIDLGITGFECHP---------RSDLI--------YILNQLANILEQF 1561
Query: 1315 EWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMV 1374
+WVKS ++ + +IPII + V+ D + +D + +++
Sbjct: 1562 KWVKS--IERIFTASIPIIKMEVDPFVDF-------KECINQDELLLFFDPIEIQNLNLL 1612
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPS-----HTGLQTTDLVKELTEQFPAST 1429
+++ S K I + ++ +SP+ H G +TD V +
Sbjct: 1613 VTEENQSLK-------GINLKVDITVECQMQSPNTEILNHMGFLSTDFVVRWMSYYVRLQ 1665
Query: 1430 PLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGR 1473
P+ +V+K FL R+L+ ++ GG+SSYCL+++I F + ++ +
Sbjct: 1666 PMTIVMKCFLHARNLNTNFKGGISSYCLLIMIIAFFRDNQNISQ 1709
>gi|355709229|gb|AES03522.1| PAP associated domain containing 5 [Mustela putorius furo]
Length = 458
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T S T V++DISF G++ DL+K+ T+
Sbjct: 40 KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 92
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 93 KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 135
>gi|21739473|emb|CAD38778.1| hypothetical protein [Homo sapiens]
Length = 410
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T S T V++DISF G++ DL+K+ T+
Sbjct: 38 KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 90
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 91 KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 133
>gi|62898381|dbj|BAD97130.1| PAP associated domain containing protein 5 variant [Homo sapiens]
Length = 466
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T S T V++DISF G++ DL+K+ T+
Sbjct: 47 KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 99
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 100 KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 142
>gi|349603773|gb|AEP99516.1| PAP-associated domain-containing protein 5-like protein, partial
[Equus caballus]
Length = 459
Score = 67.4 bits (163), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T S T V++DISF G++ DL+K+ T+
Sbjct: 40 KHKVADEDSVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTK 92
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 93 KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 135
>gi|5565689|gb|AAD45199.1|AF089897_1 topoisomerase-related function protein [Homo sapiens]
Length = 379
Score = 67.4 bits (163), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 54/95 (56%), Gaps = 7/95 (7%)
Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPL 1431
++ LD + P T S T V++DISF G++ DL+K+ T+++P L
Sbjct: 2 EVKVLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVRAADLIKDFTKKYPVLPYL 54
Query: 1432 ALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 55 VLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 89
>gi|242014489|ref|XP_002427922.1| PAP-associated domain-containing protein, putative [Pediculus humanus
corporis]
gi|212512406|gb|EEB15184.1| PAP-associated domain-containing protein, putative [Pediculus humanus
corporis]
Length = 531
Score = 67.0 bits (162), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V++DISF S G+Q+ L+K ++FP L LVLKQFL R L++ ++GG+SSY
Sbjct: 222 SDVKVDISFNMSS--GVQSAQLIKTFKKKFPVLAKLVLVLKQFLLQRDLNEVFTGGISSY 279
Query: 1456 CLMLLITRFLQ 1466
L+L+ FLQ
Sbjct: 280 SLILMTISFLQ 290
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 70/150 (46%), Gaps = 31/150 (20%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI F ++ + + V ++ ++ LWP+++ +FGS TGL LP+SD+
Sbjct: 115 LHDEIIEFYNYMSPRKCEHELRLK-VVHKIKSAIHELWPQAKVEVFGSFRTGLYLPTSDI 173
Query: 1273 DLVV--CLP--PVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
DLVV C P P+ LE + L E + ++K ++
Sbjct: 174 DLVVIGCWPTLPLWKLE-----------------------QKLLENEIAEQHTIKVLDKA 210
Query: 1329 AIPIIMLV---VEVPHDLIASAASSVQSPK 1355
++PII L +V D+ + +S VQS +
Sbjct: 211 SVPIIKLRDKESDVKVDISFNMSSGVQSAQ 240
>gi|242053947|ref|XP_002456119.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
gi|241928094|gb|EES01239.1| hypothetical protein SORBIDRAFT_03g030810 [Sorghum bicolor]
Length = 568
Score = 67.0 bits (162), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 62/254 (24%), Positives = 100/254 (39%), Gaps = 77/254 (30%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC ++ +T + AV+ V+ ++ +WP+ + +FGS TGL LP+SD+
Sbjct: 141 LHKEILDFCDFISP-STEEQSSRTAAVQDVSDVVKHIWPQCKVEVFGSFRTGLYLPTSDI 199
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV E R + L A+ L+ + K ++ + +PI
Sbjct: 200 DVVV----------------FESRVKTPQVGLYALAKALSQKGVAK--KIQVIAKARVPI 241
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
+ V +A D S ++D I
Sbjct: 242 VKFV--------------------------------ERKSGIAFDISFDMDGGPQAADFI 269
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K D VK+L PA PL ++LK FL R L++ Y+GG+
Sbjct: 270 K----------------------DAVKKL----PALRPLCMILKVFLHQRELNEVYTGGI 303
Query: 1453 SSYCLMLLITRFLQ 1466
SY L+ ++ LQ
Sbjct: 304 GSYALLTMLITHLQ 317
>gi|195145882|ref|XP_002013919.1| GL24402 [Drosophila persimilis]
gi|194102862|gb|EDW24905.1| GL24402 [Drosophila persimilis]
Length = 400
Score = 66.6 bits (161), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 108/267 (40%), Gaps = 92/267 (34%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
SLLH EI+ F + ++TA + Y+ AV+ + L +WP + +FGS TGL+LP
Sbjct: 26 SLLHLEIERFYNFI--QSTATEFYLRAEAVRCIEDMLVSIWPGASVEVFGSFRTGLNLPL 83
Query: 1270 SDVDLVV-------CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
SD+DLVV PP+ LE + AR +A+ + ++
Sbjct: 84 SDIDLVVENRRSYWYNPPLYELES------------------ELVARGVADPQ-----TV 120
Query: 1323 KTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASP 1382
V+ A+P++ V + +K D
Sbjct: 121 NVVDTAAVPVVKFTERV---------------------SQIKFD---------------- 143
Query: 1383 KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADR 1442
S +KAA ++ DL+ E FP L +VLKQFLA
Sbjct: 144 -ISFNVGSGVKAAELIK----------------DLIHE----FPDLPKLVMVLKQFLAMH 182
Query: 1443 SLDQSY-SGGLSSYCLMLLITRFLQHE 1468
L+ Y SGG+SSY + L+ FLQH+
Sbjct: 183 GLNDVYRSGGISSYGITLMCIGFLQHQ 209
>gi|195131647|ref|XP_002010257.1| GI15833 [Drosophila mojavensis]
gi|193908707|gb|EDW07574.1| GI15833 [Drosophila mojavensis]
Length = 807
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 391 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKQFKRDYPVLGKLVLV 443
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 444 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 475
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 77/171 (45%), Gaps = 39/171 (22%)
Query: 1197 QNCPTRKASLSLMHSL--LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWP 1251
+ CP R + LHEEID F + V A E+ R N VKR+ + +WP
Sbjct: 282 KGCPWRVRDYQYGKGIIGLHEEIDHFYQYVLPTACEHAIR----NEVVKRIESVVHSIWP 337
Query: 1252 RSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHA 1307
++ IFGS TGL LP+SD+DLVV P+R LE A GI E C
Sbjct: 338 QAVVEIFGSFRTGLFLPTSDIDLVVLGLWEKLPLRTLEFELVA------RGIAEAC---- 387
Query: 1308 ARYLANQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
+++ ++ ++PII L +V D+ + S VQS +
Sbjct: 388 -------------TVRVLDKASVPIIKLTDRETQVKVDISFNMQSGVQSAE 425
>gi|195447518|ref|XP_002071250.1| GK25231 [Drosophila willistoni]
gi|194167335|gb|EDW82236.1| GK25231 [Drosophila willistoni]
Length = 1048
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 377 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 429
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 430 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 461
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 286 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEGVVHSIWPQAVVEIFGSFRTGLFLPT 341
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 342 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 378
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 379 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 409
>gi|281211597|gb|EFA85759.1| PAP/25A-associated domain-containing protein [Polysphondylium
pallidum PN500]
Length = 918
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/263 (22%), Positives = 106/263 (40%), Gaps = 87/263 (33%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
L +EI+ F + + EN R+ +N V ++ WP+++ +FGS +T L +PS
Sbjct: 460 LEKEIELFYQWLVPTQYENRMRQKIVNDIEAIVKQN----WPKAKVIVFGSFSTDLCIPS 515
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGR------NGIKETCLQHAARYLANQEWVKSDSLK 1323
SD+D+ + I E GR N + T +H R N ++
Sbjct: 516 SDIDIQIS--------GITEVASGNGRTYSNPINDLYNTLSKHHQREFTN--------IR 559
Query: 1324 TVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPK 1383
+ +PII K
Sbjct: 560 LIAAAKVPII-------------------------------------------------K 570
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
+H S T +DI F +P+ G++ T++VK+ ++ + L L+LK F+ +
Sbjct: 571 MAHKS-------TWYNVDICFDTPN--GIENTEIVKQFLRKYKSMKILLLILKYFMFQNN 621
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQ 1466
++++YSGG+ SY L L++ ++Q
Sbjct: 622 MNETYSGGIGSYALALMVVSYIQ 644
>gi|194909177|ref|XP_001981903.1| GG11338 [Drosophila erecta]
gi|190656541|gb|EDV53773.1| GG11338 [Drosophila erecta]
Length = 405
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 98/206 (47%), Gaps = 27/206 (13%)
Query: 1285 EPIKEAGILEGRNGIKETCLQHAA-----RYLAN---QEWVKSDSLKTVENTAIPIIMLV 1336
+P + +G+L G NGI CL H Y+ + + +++D+++ +E+ + I
Sbjct: 9 KPWQLSGLLYG-NGIPALCLLHQEIEQFYSYMQSTPTEFCLRADAVRRIEDVVLSIWPGA 67
Query: 1337 VEVPHDLIASAASSVQSPKEDAAHTT---------LKHD--NHVHSDMVALDDSAS--PK 1383
D+ S + + P D L HD N + S V DS S K
Sbjct: 68 CV---DVFGSFRTGLNLPNSDIDLVVYNGYYWSPRLLHDLENELVSRGVTDPDSVSVLDK 124
Query: 1384 CSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S + +R D++F + + +G+Q +L+K+ +FP L +VLKQFL+ +
Sbjct: 125 ASVPVVKFTDRVSRIRFDVTFNAAA-SGVQAAELIKDFIREFPELPKLVMVLKQFLSLQG 183
Query: 1444 LDQSY-SGGLSSYCLMLLITRFLQHE 1468
++ Y SGG+SSY L L++ FLQ +
Sbjct: 184 FNEVYSSGGVSSYALTLMVISFLQQQ 209
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 41/65 (63%), Gaps = 1/65 (1%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLH+EI+ F + + T + AV+R+ + +WP + ++FGS TGL+LP+SD
Sbjct: 27 LLHQEIEQFYSYMQSTPTEFCLRAD-AVRRIEDVVLSIWPGACVDVFGSFRTGLNLPNSD 85
Query: 1272 VDLVV 1276
+DLVV
Sbjct: 86 IDLVV 90
>gi|324522375|gb|ADY48050.1| PAP-associated domain-containing protein 5 [Ascaris suum]
Length = 244
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 52/92 (56%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD + P D T + LDISF + G++ ++++ ++P PL L+
Sbjct: 155 VLDKAFVPIVKMVDKD-----TKIFLDISFNTVQ--GVRAAKYIEQMKSRYPVLEPLVLI 207
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L+Q ++GGLSSY L+L++ FLQ
Sbjct: 208 LKQFLMQRQLNQVFTGGLSSYGLILMLISFLQ 239
Score = 46.6 bits (109), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 39/64 (60%), Gaps = 6/64 (9%)
Query: 1240 KRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPV------RNLEPIKEAGIL 1293
+RV ++ +WP ++ +FGS TGL LP+SD+D+VV + P+ + E +K +GI
Sbjct: 91 ERVRGVIERVWPSAKVAVFGSLFTGLFLPTSDIDVVVEVQPLDEPPLWKTAEALKASGIA 150
Query: 1294 EGRN 1297
E N
Sbjct: 151 ERIN 154
>gi|161077646|ref|NP_001096913.1| Trf4-1, isoform F [Drosophila melanogaster]
gi|161077648|ref|NP_001096914.1| Trf4-1, isoform G [Drosophila melanogaster]
gi|158031752|gb|ABW09362.1| Trf4-1, isoform F [Drosophila melanogaster]
gi|158031753|gb|ABW09363.1| Trf4-1, isoform G [Drosophila melanogaster]
Length = 802
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 163 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLV 215
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 216 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 247
Score = 57.4 bits (137), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 72 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 127
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 128 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 164
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 165 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 195
>gi|323445977|gb|EGB02334.1| hypothetical protein AURANDRAFT_68978 [Aureococcus anophagefferens]
Length = 587
Score = 66.6 bits (161), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT + +D+SF +GL+T LV+ L ++ P PL +VLK FLA R L+++++GG+ S
Sbjct: 496 ATGIPVDVSFDV--ESGLRTGRLVRGLMDRMPPLRPLVIVLKFFLAQRGLNETFTGGVGS 553
Query: 1455 YCLMLLITRFLQHEHH 1470
+ + +++ FLQ H
Sbjct: 554 FMMQMMVVSFLQMRHR 569
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/66 (46%), Positives = 45/66 (68%), Gaps = 5/66 (7%)
Query: 1213 LHEEIDSFCKQVA--AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH+EI +FC VA A T R+ + AV + ++ +WP++ ++FGS+ TGLSLPSS
Sbjct: 392 LHDEILAFCALVAPSAGETRRR---DAAVANIEAAVAGVWPKATVHVFGSSLTGLSLPSS 448
Query: 1271 DVDLVV 1276
DVD+VV
Sbjct: 449 DVDVVV 454
>gi|45554544|ref|NP_996381.1| Trf4-1, isoform E [Drosophila melanogaster]
gi|161077643|ref|NP_996382.2| Trf4-1, isoform C [Drosophila melanogaster]
gi|442615523|ref|NP_996383.2| Trf4-1, isoform H [Drosophila melanogaster]
gi|442615525|ref|NP_572490.2| Trf4-1, isoform I [Drosophila melanogaster]
gi|442615527|ref|NP_001259340.1| Trf4-1, isoform J [Drosophila melanogaster]
gi|33636605|gb|AAQ23600.1| RE04457p [Drosophila melanogaster]
gi|45446864|gb|AAS65290.1| Trf4-1, isoform E [Drosophila melanogaster]
gi|158031751|gb|AAS65289.2| Trf4-1, isoform C [Drosophila melanogaster]
gi|440216540|gb|AAS65288.2| Trf4-1, isoform H [Drosophila melanogaster]
gi|440216541|gb|AAF46390.3| Trf4-1, isoform I [Drosophila melanogaster]
gi|440216542|gb|AGB95183.1| Trf4-1, isoform J [Drosophila melanogaster]
Length = 1001
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 362 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLV 414
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 415 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 446
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 271 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 326
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 327 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 363
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 364 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 394
>gi|198469802|ref|XP_001355128.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
gi|198147042|gb|EAL32185.2| GA10875 [Drosophila pseudoobscura pseudoobscura]
Length = 1202
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 402 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 454
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 455 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 486
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 71/152 (46%), Gaps = 39/152 (25%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 311 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIENVVHTIWPQAVVEIFGSFRTGLFLPT 366
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRN-GIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+DLVV P+R LE E N GI E C +++
Sbjct: 367 SDIDLVVLGLWEKLPLRTLE-------YELVNRGIAEAC-----------------TVRV 402
Query: 1325 VENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
++ ++PII L +V D+ + S VQS
Sbjct: 403 LDKASVPIIKLTDRETQVKVDISFNMQSGVQS 434
>gi|194891045|ref|XP_001977427.1| GG18260 [Drosophila erecta]
gi|190649076|gb|EDV46354.1| GG18260 [Drosophila erecta]
Length = 1008
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 350 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 402
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 403 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 434
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 72/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V A E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 259 LHEEIEHFYQYVLPTACEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 314
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 315 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 351
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 352 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 382
>gi|195163618|ref|XP_002022646.1| GL14678 [Drosophila persimilis]
gi|194104669|gb|EDW26712.1| GL14678 [Drosophila persimilis]
Length = 1197
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 403 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 455
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 456 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 487
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 71/152 (46%), Gaps = 39/152 (25%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 312 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIENVVHTIWPQAVVEIFGSFRTGLFLPT 367
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRN-GIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+DLVV P+R LE E N GI E C +++
Sbjct: 368 SDIDLVVLGLWEKLPLRTLE-------YELVNRGIAEAC-----------------TVRV 403
Query: 1325 VENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
++ ++PII L +V D+ + S VQS
Sbjct: 404 LDKASVPIIKLTDRETQVKVDISFNMQSGVQS 435
>gi|391342828|ref|XP_003745717.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Metaseiulus occidentalis]
Length = 512
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 58/96 (60%), Gaps = 7/96 (7%)
Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
S + LD ++ P T + +T+V++DISF + G+++ L+K+ + FP
Sbjct: 181 STIKVLDRASVPIVKFT-----EQSTNVKVDISFNQKN--GVKSAKLIKDFCKTFPPLPK 233
Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L VLKQ+L R L++ ++GG+SSY L+LL+ FLQ
Sbjct: 234 LVFVLKQYLLQRDLNEVFTGGISSYSLILLVVSFLQ 269
Score = 46.2 bits (108), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 28/39 (71%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
V RV ++ LWP+++ +FGS TGL LP+SD+DLV+
Sbjct: 118 VVNRVKEVVRQLWPQAQCEVFGSFCTGLYLPTSDIDLVI 156
>gi|195480074|ref|XP_002101128.1| GE15792 [Drosophila yakuba]
gi|194188652|gb|EDX02236.1| GE15792 [Drosophila yakuba]
Length = 1014
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 351 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 403
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 404 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 435
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 260 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 315
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 316 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 352
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 353 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 383
>gi|194769888|ref|XP_001967033.1| GF21834 [Drosophila ananassae]
gi|190622828|gb|EDV38352.1| GF21834 [Drosophila ananassae]
Length = 1092
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 445 VLDKASVPIIKLTDRE-----THVKVDISFNMQS--GVQSAELIKKFKRDYPVLGKLVLV 497
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 498 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 529
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 70/151 (46%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 354 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 409
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 410 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 446
Query: 1326 ENTAIPIIMLVVEVPH---DLIASAASSVQS 1353
+ ++PII L H D+ + S VQS
Sbjct: 447 DKASVPIIKLTDRETHVKVDISFNMQSGVQS 477
>gi|195355122|ref|XP_002044042.1| GM21691 [Drosophila sechellia]
gi|194129295|gb|EDW51338.1| GM21691 [Drosophila sechellia]
Length = 1000
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 54/92 (58%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + T V++DISF S G+Q+ +L+K+ +P L LV
Sbjct: 363 VLDKASVPIIKLTDRE-----TQVKVDISFNMQS--GVQSAELIKKFKRDYPVLEKLVLV 415
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 416 LKQFLLLRDLNEVFTGGISSYSLILMCISFLQ 447
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 272 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 327
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L R GI E C +++ +
Sbjct: 328 SDIDLVVLGLWEKLPLRTLE-----FELVSR-GIAEAC-----------------TVRVL 364
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 365 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 395
>gi|212645230|ref|NP_492446.3| Protein GLD-4 [Caenorhabditis elegans]
gi|403399397|sp|G5EFL0.1|GLD4_CAEEL RecName: Full=Poly(A) RNA polymerase gld-4; AltName: Full=Defective
in germ line development protein 4; AltName:
Full=Germline development defective-4
gi|194686198|emb|CAB02138.3| Protein GLD-4 [Caenorhabditis elegans]
gi|226972859|gb|ACO95123.1| germline defective-4 [Caenorhabditis elegans]
Length = 845
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +DISF + G++ + ++ E+FP PL L+LKQFL R+L+Q+++GGLSSY
Sbjct: 191 TRLSIDISFNTVQ--GVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 248
Query: 1456 CLMLLITRFLQ 1466
L+LL+ F Q
Sbjct: 249 GLVLLLVNFFQ 259
>gi|195504602|ref|XP_002099149.1| GE23533 [Drosophila yakuba]
gi|194185250|gb|EDW98861.1| GE23533 [Drosophila yakuba]
Length = 405
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 58/97 (59%), Gaps = 7/97 (7%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
++ LD ++ P T + + +R D++F + + +G+Q +L+K+ QFP L
Sbjct: 119 VIVLDKASVPVVKFT-----ERISRIRFDVTFNAAA-SGVQAAELIKDFIRQFPELPKLV 172
Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQHE 1468
+VLKQFL+ + ++ Y SGG+SSY L L++ FLQ +
Sbjct: 173 MVLKQFLSLQGFNEVYSSGGISSYALTLMVISFLQQQ 209
Score = 49.3 bits (116), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 40/68 (58%), Gaps = 7/68 (10%)
Query: 1212 LLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
LLH+EI+ F + + E R AV+R+ + +WP + ++FGS TGL+LP
Sbjct: 27 LLHQEIEHFYSYIQSTPTEFCLRAE----AVRRIEDVVLTIWPGACVDVFGSFRTGLNLP 82
Query: 1269 SSDVDLVV 1276
SD+DLVV
Sbjct: 83 CSDIDLVV 90
>gi|195393956|ref|XP_002055618.1| GJ18687 [Drosophila virilis]
gi|194150128|gb|EDW65819.1| GJ18687 [Drosophila virilis]
Length = 1111
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF S G+Q+ +L+K+ +P L LVLKQFL R L++ ++GG+SSY
Sbjct: 376 TQVKVDISFNMQS--GVQSAELIKQFKRDYPVLGKLVLVLKQFLLLRDLNEVFTGGISSY 433
Query: 1456 CLMLLITRFLQ 1466
L+L+ FLQ
Sbjct: 434 SLILMCISFLQ 444
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 71/151 (47%), Gaps = 37/151 (24%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEID F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 269 LHEEIDHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 324
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+DLVV P+R LE L RN I E C +++ +
Sbjct: 325 SDIDLVVLGLWEKLPLRTLE-----FELVSRN-IAEAC-----------------TVRVL 361
Query: 1326 ENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
+ ++PII L +V D+ + S VQS
Sbjct: 362 DKASVPIIKLTDRETQVKVDISFNMQSGVQS 392
>gi|449674689|ref|XP_002159462.2| PREDICTED: poly(A) RNA polymerase gld-4-like [Hydra magnipapillata]
Length = 710
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 46/71 (64%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V +DISF + T +Q+ ++VK ++F L L LK FL+ R L++ ++GG+SSY
Sbjct: 227 TRVHVDISFNT--DTAVQSAEMVKSYIKEFKVLPHLYLTLKYFLSQRELNEVFTGGISSY 284
Query: 1456 CLMLLITRFLQ 1466
CLMLL FLQ
Sbjct: 285 CLMLLTVSFLQ 295
Score = 46.6 bits (109), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 38/67 (56%), Gaps = 7/67 (10%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI F +A E R N ++R+T + +WP ++ +FGS T + LP+
Sbjct: 120 LHQEIIDFYNYMAPMPEEEMMR----NDVIQRITDLVVSIWPSAKVEVFGSFRTNIYLPT 175
Query: 1270 SDVDLVV 1276
SD+DLVV
Sbjct: 176 SDIDLVV 182
>gi|341895116|gb|EGT51051.1| hypothetical protein CAEBREN_16945 [Caenorhabditis brenneri]
Length = 901
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +DISF + G++ + ++ E+FP PL L+LKQFL R+L+Q+++GGLSSY
Sbjct: 199 TRLSIDISFNTVQ--GVRAASYIDKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 256
Query: 1456 CLMLLITRFLQ 1466
L+LL+ F Q
Sbjct: 257 GLVLLLVNFFQ 267
>gi|403370926|gb|EJY85334.1| hypothetical protein OXYTRI_16805 [Oxytricha trifallax]
Length = 1384
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/211 (28%), Positives = 98/211 (46%), Gaps = 25/211 (11%)
Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
++GS A+ LS+ +SDVDL V + +E + E R I++ L
Sbjct: 1072 LYGSMASNLSIDTSDVDLAVV---GLDFSGSREKQVNEMRKLIEQINLIMK--------- 1119
Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
KS LK +E +P+I L V DL+ A +S K + + K N + +M L
Sbjct: 1120 -KSTKLKFIETATVPVIKLEV----DLVQIAKGHERSEKNEFGSKSAKK-NIIDDNMRYL 1173
Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
T D IK L I S + G++ + +KEL + P + LVLK
Sbjct: 1174 G------IDITYEDTIKQHY-YGLHIENLSRINLGVKCINYIKELCKDLPGLQTIVLVLK 1226
Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
+ L L+Q+++GGL+SY ++L+ T +++
Sbjct: 1227 KLLQVNDLNQTFTGGLNSYSIVLMTTAYIKQ 1257
>gi|341883718|gb|EGT39653.1| hypothetical protein CAEBREN_22894 [Caenorhabditis brenneri]
Length = 901
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +DISF + G++ + ++ E+FP PL L+LKQFL R+L+Q+++GGLSSY
Sbjct: 199 TRLSIDISFNTVQ--GVRAASYIDKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 256
Query: 1456 CLMLLITRFLQ 1466
L+LL+ F Q
Sbjct: 257 GLVLLLVNFFQ 267
>gi|268566431|ref|XP_002639720.1| Hypothetical protein CBG12446 [Caenorhabditis briggsae]
Length = 897
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +DISF + G++ + ++ E+FP PL L+LKQFL R+L+Q+++GGLSSY
Sbjct: 193 TRLSIDISFNTVQ--GVRAASYIAKVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSY 250
Query: 1456 CLMLLITRFLQ 1466
L+LL+ F Q
Sbjct: 251 GLVLLLVNFFQ 261
>gi|256078812|ref|XP_002575688.1| hypothetical protein [Schistosoma mansoni]
gi|360044186|emb|CCD81733.1| hypothetical protein Smp_145600 [Schistosoma mansoni]
Length = 672
Score = 65.9 bits (159), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 57/99 (57%), Gaps = 7/99 (7%)
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
+ S++ LD + P T + T +R+DISF + ++ +L++ +++P
Sbjct: 139 ISSEIKVLDKATVPIVKMTDKE-----TELRVDISFNMIN--SVKAAELIRVFMKRYPCL 191
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
L VLKQFL RSL++ ++GGLSSY L+L+ FLQH
Sbjct: 192 PCLVFVLKQFLLQRSLNEVWTGGLSSYALILMCVSFLQH 230
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 56/108 (51%), Gaps = 10/108 (9%)
Query: 1175 ISQLARDQEHPDVAFPL-QPLEVQNCPTRKASLSLMHSL--LHEEIDSFCKQVA---AEN 1228
++ + + E+ + FPL Q ++++ P + + LH EI F ++ AE
Sbjct: 15 VTSVLKPDEYKNYGFPLGQSFKLEDIPWKSTEKEYDFGVIGLHAEIKDFLNYISPSPAEQ 74
Query: 1229 TARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
AR+ V +V + LWP + ++FGS TGL LP+SD+D+V+
Sbjct: 75 FAREV----VVAKVKDIVYSLWPNCQVDVFGSFKTGLYLPTSDIDMVI 118
>gi|448101749|ref|XP_004199636.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359381058|emb|CCE81517.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 646
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 91/231 (39%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N VK + R + LWP + T++FGS+AT L LP SD+D+VV + G E
Sbjct: 210 NRVVKDLKREINSLWPDTETHVFGSSATDLYLPGSDIDMVVT----------SKTGDYEN 259
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + YL N++ K ++ + +PII V P
Sbjct: 260 R-----SKLYQLSSYLRNRKLAK--DIEVIAKAKVPIIKFV----------------DPS 296
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+++H D+ S + I+AA +R
Sbjct: 297 -----------SNIHIDI-----------SFERRNGIEAAKRIR---------------- 318
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ ++ P L L++KQFL R L+ + GGL Y ++L FL+
Sbjct: 319 ----KWLDKTPGLRELVLIIKQFLRSRRLNNVHVGGLGGYSTIILCYHFLR 365
>gi|145348860|ref|XP_001418861.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579091|gb|ABO97154.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 347
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/257 (24%), Positives = 100/257 (38%), Gaps = 83/257 (32%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI F + + E T+R AV+RV + +WP +R + GS ATG+ LPS
Sbjct: 65 LHTEIVDFSRYLEPTEEEATSRAA----AVERVRAVVNGIWPDARFEVHGSFATGMYLPS 120
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D V+ +N AG+ CL+ A LA + + ++ +
Sbjct: 121 SDIDAVILDSGAKN------AGL----------CLKALAVALARRGM--AIKIQLIAKAR 162
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PI+ V+S H ++ D + P +
Sbjct: 163 VPIVKF-------------EEVESG---------------HQFDISFDVANGPASAEIVR 194
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+N++ +FPA PL VLK FL R L++ YS
Sbjct: 195 ENMR------------------------------RFPALRPLTTVLKAFLHQRGLNEVYS 224
Query: 1450 GGLSSYCLMLLITRFLQ 1466
GG+ SY L+ ++ LQ
Sbjct: 225 GGIGSYALLCMVMAHLQ 241
>gi|72392291|ref|XP_846946.1| DNA polymerase sigma [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62176235|gb|AAX70351.1| DNA polymerase sigma, putative [Trypanosoma brucei]
gi|70802976|gb|AAZ12880.1| DNA polymerase sigma, putative [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 746
Score = 65.1 bits (157), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)
Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH+EI F K Q A + + V + + L WP+ +FGS ATGL LP S
Sbjct: 324 LHQEICDFVKFHQPTEAEIAIRCLVEAEVVTIAKQL---WPKCEPVVFGSMATGLLLPLS 380
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
DVD+ + V P +EA L+ AR + SL + NTA
Sbjct: 381 DVDITILNVDV----PTEEA-------------LRLLAREI---------SLSGICNTAY 414
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
P ++L +VP L H S LD
Sbjct: 415 PQLVLKAKVP----------------------LLKFQHCGS---LLD------------- 436
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
+DIS + G + T +V +L + + + PL +V+K FL R +D+ Y G
Sbjct: 437 ---------VDISIN--ARDGPRNTLIVIDLLKHYREAEPLIVVVKYFLHQRGMDEPYHG 485
Query: 1451 GLSSYCLMLLITRFL-QHEHHLGRP 1474
GL S+ L L+ FL QH + G P
Sbjct: 486 GLGSFALTLMAISFLQQHPIYTGSP 510
>gi|224064673|ref|XP_002197521.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 1
[Taeniopygia guttata]
Length = 443
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 93/245 (37%), Gaps = 91/245 (37%)
Query: 1227 ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPV 1281
E T R +N R+ ++ LWP + IFGS TGL LP+SD+DLVV LP
Sbjct: 7 EETMRMEVVN----RIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP-- 60
Query: 1282 RNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPH 1341
L ++EA L +S+K ++ +PII L
Sbjct: 61 --LWTLEEA--------------------LRKHNVADENSVKVLDKATVPIIKLT----- 93
Query: 1342 DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLD 1401
+ T +K D S + +KAA ++
Sbjct: 94 ----------------DSFTEVKVD-----------------ISFNVQNGVKAAQLIK-- 118
Query: 1402 ISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
D +K ++P L LVLKQFL R L++ ++GG+ SY L L+
Sbjct: 119 --------------DFIK----KYPVLPFLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMA 160
Query: 1462 TRFLQ 1466
FLQ
Sbjct: 161 VSFLQ 165
>gi|145509541|ref|XP_001440709.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407937|emb|CAK73312.1| unnamed protein product [Paramecium tetraurelia]
Length = 962
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/226 (25%), Positives = 97/226 (42%), Gaps = 55/226 (24%)
Query: 1257 IFGSNATGLSLPSSDVDLVVC---LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLAN 1313
+FGS ATGL+LP SD+D+ + L P L NG +Q +L N
Sbjct: 657 LFGSCATGLALPESDIDIGITGFELLPSNQL------------NG----PIQKIIEFLQN 700
Query: 1314 QEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDM 1373
+WV +++ + +++P+I L V+ P ++ + P+ D+
Sbjct: 701 MKWVT--NIRAITTSSMPLIKLQVD-PSISFVESSHKILLPQ---------------IDL 742
Query: 1374 VALDDSASPKCSHTSSDNIKAATSVRLDISF------KSPSHTGLQTTDLVKELTEQFPA 1427
+ D SP + +DISF K H G +T+ + +
Sbjct: 743 ITNYDKESP------------SRIFSVDISFFQYQGSKQNWHLGQISTEQTLQWLSFYSE 790
Query: 1428 STPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGR 1473
P+ L+ K L R L+ Y GG+SS+C++ +I FL+ +H G+
Sbjct: 791 LRPIVLLFKSLLKKRGLNDQYKGGISSFCIIQMILAFLESCYHQGQ 836
>gi|195398455|ref|XP_002057837.1| GJ17879 [Drosophila virilis]
gi|194141491|gb|EDW57910.1| GJ17879 [Drosophila virilis]
Length = 416
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 52/94 (55%), Gaps = 7/94 (7%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
M +D + P T S T ++LD+SF G+ + +L+K +P L
Sbjct: 115 MHVVDTALVPIIKFTDSK-----TQIKLDVSFNISD--GIHSAELIKTYLRDYPGLGKLV 167
Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LVLKQF+ R L++++SGG+SSY L+L+ FLQ
Sbjct: 168 LVLKQFMLQRKLNEAFSGGISSYSLLLMCISFLQ 201
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 3/80 (3%)
Query: 1199 CPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTN 1256
CP ++A+ + + LHEEI+ F + + T I VKR+ + LWP++
Sbjct: 10 CPWQQANYNYGDGIIGLHEEIEHFYRYILPTPTEHAVRIE-LVKRIESIIHELWPQALVE 68
Query: 1257 IFGSNATGLSLPSSDVDLVV 1276
IFGS TGL LP+SD+DLVV
Sbjct: 69 IFGSFRTGLFLPNSDIDLVV 88
>gi|449472874|ref|XP_004176276.1| PREDICTED: PAP-associated domain-containing protein 5 isoform 2
[Taeniopygia guttata]
Length = 490
Score = 64.7 bits (156), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 93/245 (37%), Gaps = 91/245 (37%)
Query: 1227 ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPV 1281
E T R +N R+ ++ LWP + IFGS TGL LP+SD+DLVV LP
Sbjct: 7 EETMRMEVVN----RIENVIKELWPNADVQIFGSFKTGLYLPTSDIDLVVFGKWETLP-- 60
Query: 1282 RNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPH 1341
L ++EA L +S+K ++ +PII L
Sbjct: 61 --LWTLEEA--------------------LRKHNVADENSVKVLDKATVPIIKLT----- 93
Query: 1342 DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLD 1401
+ T +K D S + +KAA ++
Sbjct: 94 ----------------DSFTEVKVD-----------------ISFNVQNGVKAAQLIK-- 118
Query: 1402 ISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
D +K ++P L LVLKQFL R L++ ++GG+ SY L L+
Sbjct: 119 --------------DFIK----KYPVLPFLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMA 160
Query: 1462 TRFLQ 1466
FLQ
Sbjct: 161 VSFLQ 165
>gi|345325980|ref|XP_001507597.2| PREDICTED: PAP-associated domain-containing protein 5-like
[Ornithorhynchus anatinus]
Length = 578
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 35/92 (38%), Positives = 51/92 (55%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD + P T S T V++DISF G++ L+K+ T+++P L LV
Sbjct: 170 VLDKATVPIIKLTDS-----FTEVKVDISFNV--QNGVKAAQLIKDFTKKYPVLPYLVLV 222
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 223 LKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 254
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 38/64 (59%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI+ F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 79 LHEEINDFYEYMSPRPEEEKMRME-VVNRIENVIKELWPTADVQIFGSFKTGLYLPTSDI 137
Query: 1273 DLVV 1276
DLVV
Sbjct: 138 DLVV 141
>gi|310799736|gb|EFQ34629.1| hypothetical protein GLRG_09773 [Glomerella graminicola M1.001]
Length = 756
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 109/287 (37%), Gaps = 85/287 (29%)
Query: 1201 TRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQV--LWPRSRTNIF 1258
T + S M LH+EI F + V + + V+R+ RSL+ + F
Sbjct: 416 TDHSKTSDMSVWLHKEIIDFYEVVRPRDFEHE-MRTQLVERLRRSLKTSHFYKDCDVRPF 474
Query: 1259 GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG--RNGIKETCLQHAARYLANQEW 1316
GS +GL LP++D+DLVVC L+G N L++ ++LA +
Sbjct: 475 GSYMSGLYLPTADMDLVVCA-----------RSWLDGAHSNFFGMKALRNFGKFLAQNKV 523
Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
++++ + + +P++ +
Sbjct: 524 THYNTMEFIASAKVPLVKYI---------------------------------------- 543
Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLK 1436
DNI T +R+DISF G Q E EQ+PA L ++K
Sbjct: 544 -------------DNI---TGLRVDISFDR--LDGPQAVKTFAEWKEQYPAMPILVTMIK 585
Query: 1437 QFLADRSLDQSYSGGLSSYCLMLLITRFLQ-----------HEHHLG 1472
FLA R L++ +GG+ S+ + ++ LQ EHHLG
Sbjct: 586 HFLAMRGLNEPVNGGIGSFTVTCMVVSMLQLMPQVQSRNLIPEHHLG 632
>gi|307168873|gb|EFN61797.1| PAP-associated domain-containing protein 5 [Camponotus floridanus]
Length = 643
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 56/96 (58%), Gaps = 7/96 (7%)
Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
S + LD ++ P T + + +++DISF + G+++ DL+ +++P
Sbjct: 274 SSIKVLDKASVPIVKLTDKE-----SEIKVDISFNM--NNGVKSADLINSFKKRYPVLEK 326
Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L +VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 327 LVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 362
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 65/147 (44%), Gaps = 30/147 (20%)
Query: 1195 EVQNCPTRKASLSLMHSL--LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + + LHEEI+ F + + I VKR+ + + LWP
Sbjct: 167 EYGGCPWRVPNKHYSRGVIGLHEEIEDFFAYMCPSHEEHVLRIR-VVKRIEQVIYDLWPN 225
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
S+ +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 226 SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 262
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIML 1335
R L +Q + S+K ++ ++PI+ L
Sbjct: 263 RALLDQNIAEPSSIKVLDKASVPIVKL 289
>gi|219130012|ref|XP_002185169.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403348|gb|EEC43301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 425
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/260 (25%), Positives = 105/260 (40%), Gaps = 70/260 (26%)
Query: 1213 LHEEIDSFCKQV--AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH EI F + + E ++ + VKRV + + ++ ++FGS ATGL LPSS
Sbjct: 74 LHNEIVQFVRLMEPMPEEIRQREDL---VKRVEELVHRTFDNAQVHVFGSQATGLFLPSS 130
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARY--LANQEWVKSDS-LKTVEN 1327
DVDL+V + N + E + + ++ R+ + ++W+ S L+ + N
Sbjct: 131 DVDLLV----ITNEKANDETSQPDRQEDWQKPSGSPLDRFESVLREDWLMELSYLEVIGN 186
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
T +P++ K A T + D V D + P +
Sbjct: 187 TKVPLV---------------------KFTHAPTNISVD-------VCFDQESGPGAAQL 218
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
++A +R PL VLK FL+ R L++
Sbjct: 219 MKTYLEALPPLR------------------------------PLTFVLKYFLSARGLNEP 248
Query: 1448 YSGGLSSYCLMLLITRFLQH 1467
YSGG+ SY L L+I FLQH
Sbjct: 249 YSGGVGSYLLQLMIVSFLQH 268
>gi|224483450|gb|ACN51269.1| nuclear non-canonical poly(A) polymerase 1 [Trypanosoma brucei]
Length = 746
Score = 64.3 bits (155), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)
Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH+EI F K Q A + + V + + L WP+ +FGS ATGL LP S
Sbjct: 324 LHQEIWDFVKFHQPTEAEIAIRCLVEAEVVTIAKQL---WPKCEPVVFGSMATGLLLPLS 380
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
DVD+ + V P +EA L+ AR + SL + NTA
Sbjct: 381 DVDITILNVDV----PTEEA-------------LRLLAREI---------SLSGICNTAY 414
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
P ++L +VP L H S LD
Sbjct: 415 PQLVLKAKVP----------------------LLKFQHCGS---LLD------------- 436
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
+DIS + G + T +V +L + + + PL +V+K FL R +D+ Y G
Sbjct: 437 ---------VDISIN--ARDGPRNTLIVIDLLKHYREAEPLIVVVKYFLHQRGMDEPYHG 485
Query: 1451 GLSSYCLMLLITRFL-QHEHHLGRP 1474
GL S+ L L+ FL QH + G P
Sbjct: 486 GLGSFALTLMAISFLQQHPIYTGSP 510
>gi|50557292|ref|XP_506054.1| YALI0F30525p [Yarrowia lipolytica]
gi|49651924|emb|CAG78867.1| YALI0F30525p [Yarrowia lipolytica CLIB122]
Length = 716
Score = 63.9 bits (154), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 102/256 (39%), Gaps = 77/256 (30%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
L EE+ F K ++ + K + V+RV ++ LW ++ ++ GS T + LP SD+
Sbjct: 249 LQEEVMDFVKYISPTQSEIKARQDL-VERVRGAVNGLWGDAKVHVLGSFTTNMHLPQSDI 307
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
DLVVC P G G + C+ + + ++ +K L+T+ +PI
Sbjct: 308 DLVVCSP--------------HGHYG-ERACIYQLSSVIRSR--MKVAELQTITKARVPI 350
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I + VH D+ S I
Sbjct: 351 IKFI---------------------------DSRTGVHVDI-----------SFEKDGGI 372
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K A++ + + ++ PA PL L++KQFL+ R L++ ++GGL
Sbjct: 373 KTAST--------------------ITKWSKSMPALRPLVLIVKQFLSSRRLNEVFTGGL 412
Query: 1453 SSYCLMLLITRFL-QH 1467
+ ++ ++ FL QH
Sbjct: 413 GGFSVICMVYSFLIQH 428
>gi|391346299|ref|XP_003747415.1| PREDICTED: PAP-associated domain-containing protein 5-like
[Metaseiulus occidentalis]
Length = 491
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 65/272 (23%), Positives = 104/272 (38%), Gaps = 89/272 (32%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P R SL L L EEI F A N A + +++V +++ WP +FG
Sbjct: 75 PNRSYSLGL--EGLTEEIHDFF-MYAQPNAADQSRREQVIEKVRAAIREKWPDCVVEVFG 131
Query: 1260 SNATGLSLPSSDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ 1314
S TGL LP+ D+D+V+ +PP+ +LE R L +
Sbjct: 132 SYKTGLYLPTGDIDMVIQGNWEIIPPLFDLE-----------------------RQLIEK 168
Query: 1315 EWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMV 1374
+ + ++ K ++ ++P+I +DA D + D+
Sbjct: 169 KVGEKNTFKVLDKASVPLIKF--------------------KDA-------DTEIRVDL- 200
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
S ++ +AA V K+ FP L V
Sbjct: 201 ----------SFNQANCTEAAAFV--------------------KQCCRTFPPLAKLIFV 230
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQ+L+ L++ + GG+SSY L L+I FLQ
Sbjct: 231 LKQYLSLHGLNEVFHGGISSYSLTLMILSFLQ 262
>gi|159112073|ref|XP_001706266.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
gi|157434361|gb|EDO78592.1| Topoisomerase I-related protein [Giardia lamblia ATCC 50803]
Length = 520
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 103/253 (40%), Gaps = 78/253 (30%)
Query: 1219 SFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCL 1278
++C AE AR+ + + ++ +Q++ P + ++FGS +TG+S SSD+D+ V +
Sbjct: 106 AYCTLTEAEKAARE----YVLGQLRDIIQLVLPDATVDVFGSYSTGMSSYSSDLDICVNV 161
Query: 1279 PPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVE 1338
P+ A +++ C H L + + ++ + + +PII
Sbjct: 162 -------PVNSAAMMQ--------CHMHDIATLLRRS-ISTNYVDVRSHARVPII----- 200
Query: 1339 VPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSV 1398
VHS++ S+
Sbjct: 201 ----------------------------KGVHSEL-----------------------SL 209
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
DISF SP G + + E+ P + L +V+K FL R L+Q Y+GG+SSY L+
Sbjct: 210 EYDISFNSPH--GAAHRETILGYIEKHPLARILIMVVKSFLKKRGLNQPYTGGMSSYILL 267
Query: 1459 LLITRFLQHEHHL 1471
LI ++ H++
Sbjct: 268 QLIVVYISERHNV 280
>gi|195354696|ref|XP_002043832.1| GM17777 [Drosophila sechellia]
gi|194129070|gb|EDW51113.1| GM17777 [Drosophila sechellia]
Length = 236
Score = 63.9 bits (154), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 7/96 (7%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
+ LD ++ P T D I + +R D++F S + +G+Q +L+K+ QFP L
Sbjct: 118 VTVLDKASVPVVKFT--DRI---SRIRFDVTFNSVA-SGVQAAELIKDFIRQFPELPKLV 171
Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQH 1467
+VLKQFL+ ++ Y SGG+SSY L L++ FLQ
Sbjct: 172 MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQ 207
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLH+EI+ F + + T AV+R+ + +WPR+ ++FGS TGL+LP SD
Sbjct: 27 LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPRAFVDLFGSFRTGLNLPDSD 85
Query: 1272 VDLVV 1276
+DLVV
Sbjct: 86 IDLVV 90
>gi|195331816|ref|XP_002032595.1| GM26647 [Drosophila sechellia]
gi|195354690|ref|XP_002043829.1| GM17772 [Drosophila sechellia]
gi|194121538|gb|EDW43581.1| GM26647 [Drosophila sechellia]
gi|194129067|gb|EDW51110.1| GM17772 [Drosophila sechellia]
Length = 403
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 7/96 (7%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
+ LD ++ P T D I + +R D++F S + +G+Q +L+K+ QFP L
Sbjct: 118 VTVLDKASVPVVKFT--DRI---SRIRFDVTFNSVA-SGVQAAELIKDFIRQFPELPKLV 171
Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQH 1467
+VLKQFL+ ++ Y SGG+SSY L L++ FLQ
Sbjct: 172 MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQ 207
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLH+EI+ F + + T AV+R+ + +WPR+ ++FGS TGL+LP SD
Sbjct: 27 LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPRAFVDLFGSFRTGLNLPDSD 85
Query: 1272 VDLVV 1276
+DLVV
Sbjct: 86 IDLVV 90
>gi|195391176|ref|XP_002054239.1| GJ24338 [Drosophila virilis]
gi|194152325|gb|EDW67759.1| GJ24338 [Drosophila virilis]
Length = 397
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 104/257 (40%), Gaps = 80/257 (31%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI+ F + + T + AV R+ R + +WP + +FGS TGL+LP SD+
Sbjct: 25 LHQEIEHFYNYIVSTPTEYMMRME-AVHRIERVVLSIWPDACIEVFGSFRTGLNLPISDI 83
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+ V N A +LE +N + AR +A D++ ++ ++P+
Sbjct: 84 DIAV------NNFYWHGAPLLELKNAL-------MARGVA-------DNVNVLDKASVPV 123
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
+ + + +K D S ++ +
Sbjct: 124 VKFTERI---------------------SEIKFD-----------------ISFNTTTGV 145
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY-SGG 1451
KAA V + EQFP L ++LKQ+L + L++ Y SGG
Sbjct: 146 KAAELV--------------------QRFIEQFPELPKLVIILKQYLMLQGLNEVYSSGG 185
Query: 1452 LSSYCLMLLITRFLQHE 1468
+SSY + L+ FLQ +
Sbjct: 186 ISSYAVTLMCISFLQQQ 202
>gi|195573661|ref|XP_002104810.1| GD21148 [Drosophila simulans]
gi|194200737|gb|EDX14313.1| GD21148 [Drosophila simulans]
Length = 403
Score = 63.5 bits (153), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 56/96 (58%), Gaps = 7/96 (7%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
+ LD ++ P T D I + +R D++F S + +G+Q +L+K+ QFP L
Sbjct: 118 VTVLDKASVPVVKFT--DRI---SRIRFDVTFNSVA-SGVQAAELIKDFIRQFPELPKLV 171
Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQH 1467
+VLKQFL+ ++ Y SGG+SSY L L++ FLQ
Sbjct: 172 MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQ 207
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLH+EI+ F + + T AV+R+ + +WP + ++FGS TGL+LP SD
Sbjct: 27 LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPSAFVDLFGSFRTGLNLPDSD 85
Query: 1272 VDLVV 1276
+DLVV
Sbjct: 86 IDLVV 90
>gi|195053247|ref|XP_001993538.1| GH13010 [Drosophila grimshawi]
gi|193900597|gb|EDV99463.1| GH13010 [Drosophila grimshawi]
Length = 359
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 53/94 (56%), Gaps = 7/94 (7%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
M L +A P T + T +++D+SF + G+Q+ +L+K +PA L
Sbjct: 121 MRVLATAAVPIIKFTDRE-----TKIKVDLSFNMAN--GVQSAELIKIFKRDYPALGYLV 173
Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+VLKQFL R L++ ++ GLSSY L+LL FLQ
Sbjct: 174 MVLKQFLLQRDLNEVFTAGLSSYSLILLCISFLQ 207
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 43/67 (64%), Gaps = 7/67 (10%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + + +E+ AR N ++R+ ++ LWP++ IFGS TGL LP+
Sbjct: 32 LHEEIEHFYRYILPTPSEHAAR----NKVIQRIENIIRALWPQALVEIFGSFRTGLFLPN 87
Query: 1270 SDVDLVV 1276
SD+DLVV
Sbjct: 88 SDIDLVV 94
>gi|59807821|gb|AAH89405.1| PAPD5 protein, partial [Homo sapiens]
Length = 390
Score = 63.5 bits (153), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 44/68 (64%), Gaps = 2/68 (2%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
++DISF G++ DL+K+ T+++P L LVLKQFL R L++ ++GG+ SY L
Sbjct: 1 KVDISFNV--QNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLF 58
Query: 1459 LLITRFLQ 1466
L+ FLQ
Sbjct: 59 LMAVSFLQ 66
>gi|67989518|ref|NP_001018181.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe 972h-]
gi|81175166|sp|Q9UTN3.2|CID14_SCHPO RecName: Full=Poly(A) RNA polymerase cid14; Short=PAP; AltName:
Full=Caffeine-induced death protein 14; AltName:
Full=Polynucleotide adenylyltransferase cid14
gi|62554069|emb|CAI79317.1| poly(A) polymerase Cid14 [Schizosaccharomyces pombe]
Length = 684
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 46/71 (64%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V +DISF P GL+T +V +++PA PL +++K FL R+L++ + GGLSSY
Sbjct: 351 TKVHVDISFNQPG--GLKTCLVVNGFMKKYPALRPLVIIIKHFLNMRALNEVFLGGLSSY 408
Query: 1456 CLMLLITRFLQ 1466
++ L+ FLQ
Sbjct: 409 AIVCLVVSFLQ 419
>gi|24649854|ref|NP_651313.1| Trf4-2, isoform A [Drosophila melanogaster]
gi|442621019|ref|NP_001262941.1| Trf4-2, isoform B [Drosophila melanogaster]
gi|7301238|gb|AAF56369.1| Trf4-2, isoform A [Drosophila melanogaster]
gi|21430468|gb|AAM50912.1| LP06848p [Drosophila melanogaster]
gi|220950110|gb|ACL87598.1| CG17462-PA [synthetic construct]
gi|440217872|gb|AGB96321.1| Trf4-2, isoform B [Drosophila melanogaster]
Length = 407
Score = 63.5 bits (153), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 57/106 (53%), Gaps = 7/106 (6%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
+ LD ++ P T + +R D++F S + +G+Q DL+K+ FP L
Sbjct: 118 VTVLDKASVPVVKFTD-----LISRIRFDVTFNSVA-SGVQAADLIKDFIRHFPELPKLV 171
Query: 1433 LVLKQFLADRSLDQSY-SGGLSSYCLMLLITRFLQHEHHLGRPINQ 1477
+VLKQFL+ ++ Y SGG+SSY L L++ FLQ R +++
Sbjct: 172 MVLKQFLSLHGFNEVYNSGGVSSYALTLMVISFLQQHARSNRRLSE 217
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLH+EI+ F + + T AV+R+ + +WP + ++FGS TGL+LP SD
Sbjct: 27 LLHQEIEQFYNYIRSTPTEFCLRAG-AVRRIEDVVLSIWPSASVDLFGSFRTGLNLPDSD 85
Query: 1272 VDLVV 1276
+DLVV
Sbjct: 86 IDLVV 90
>gi|332030078|gb|EGI69903.1| PAP-associated domain-containing protein 5 [Acromyrmex echinatior]
Length = 662
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 55/96 (57%), Gaps = 7/96 (7%)
Query: 1371 SDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTP 1430
S + LD ++ P T + + +++DISF + G+++ +L+ Q+P
Sbjct: 294 SSIKVLDKASVPIVKLTDKE-----SEIKVDISFNM--NNGVKSAELINSYKRQYPVLEK 346
Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L +VLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 347 LVMVLKQFLLQRDLNEVFTGGISSYSLILMTISFLQ 382
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 65/147 (44%), Gaps = 30/147 (20%)
Query: 1195 EVQNCPTRKAS--LSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
E CP R + S+ LHEEI+ F + + + VKR+ + LWP
Sbjct: 187 EYGGCPWRVTNKHYSMGTIGLHEEIEDFFSYMCPSHEEHVLRLR-VVKRIENVIYDLWPD 245
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLP----PVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
S+ +FGS TGL LP+SD+DLVV P+R LE
Sbjct: 246 SKVEVFGSFRTGLYLPTSDIDLVVIGMWTNLPLRTLE----------------------- 282
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIML 1335
R L ++ + S+K ++ ++PI+ L
Sbjct: 283 RALLDRNIAEPSSIKVLDKASVPIVKL 309
>gi|261330131|emb|CBH13115.1| DNA polymerase sigma, putative [Trypanosoma brucei gambiense DAL972]
Length = 746
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)
Query: 1213 LHEEIDSFCK--QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH+EI F + Q A + + V + + L WP+ +FGS ATGL LP S
Sbjct: 324 LHQEICDFVEFHQPTEAEIAIRCLVEAEVVTIAKQL---WPKCEPVVFGSMATGLLLPLS 380
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAI 1330
DVD+ + V P +EA L+ AR + SL + NTA
Sbjct: 381 DVDITILNVDV----PTEEA-------------LRLLAREI---------SLSGICNTAY 414
Query: 1331 PIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSD 1390
P ++L +VP L H S LD
Sbjct: 415 PQLVLKAKVP----------------------LLKFQHCGS---LLD------------- 436
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
+DIS + G + T +V +L + + + PL +V+K FL R +D+ Y G
Sbjct: 437 ---------VDISIN--ARDGPRNTLIVIDLLKHYREAEPLIVVVKYFLHQRGMDEPYHG 485
Query: 1451 GLSSYCLMLLITRFL-QHEHHLGRP 1474
GL S+ L L+ FL QH + G P
Sbjct: 486 GLGSFALTLMAISFLQQHPIYTGSP 510
>gi|403373923|gb|EJY86891.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 403
Score = 63.2 bits (152), Expect = 1e-06, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 2/78 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
TS +DISF + G+ LVK L ++P PL +VLK FL R L+++YSGG+SS+
Sbjct: 214 TSCNIDISFNRTN--GIYCVKLVKTLMIKYPELRPLMIVLKAFLKCRGLNETYSGGISSF 271
Query: 1456 CLMLLITRFLQHEHHLGR 1473
L +L T +LQ + G+
Sbjct: 272 LLTMLATSYLQMAYKSGK 289
Score = 47.4 bits (111), Expect = 0.069, Method: Composition-based stats.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 7/74 (9%)
Query: 1212 LLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
LH EI F K + + AR+ V R+ + ++ + +++ IFGS ATGL LP
Sbjct: 106 FLHNEILDFVKYIGTTTEDQQARRK----VVSRIHKIVKECFSQAKVMIFGSCATGLDLP 161
Query: 1269 SSDVDLVVCLPPVR 1282
+SDVDL+V P R
Sbjct: 162 NSDVDLLVYYPDQR 175
>gi|387219307|gb|AFJ69362.1| poly rna polymerase, partial [Nannochloropsis gaditana CCMP526]
gi|422292753|gb|EKU20055.1| poly rna polymerase, partial [Nannochloropsis gaditana CCMP526]
Length = 1647
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 47/77 (61%), Gaps = 3/77 (3%)
Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELT---EQFPASTPLALVLKQFLADRSLDQSYSG 1450
A T V +DIS + P H G+ ++ L+++L + + + PL LVLK L R L+Q + G
Sbjct: 1265 ATTLVPVDISLEGPQHGGIASSLLIRDLVGPGKPYAHAVPLTLVLKALLMQRGLNQPWCG 1324
Query: 1451 GLSSYCLMLLITRFLQH 1467
G+SSY LML++ LQ
Sbjct: 1325 GISSYALMLMVITVLQQ 1341
Score = 48.1 bits (113), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 39/82 (47%), Gaps = 1/82 (1%)
Query: 1206 LSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
+ M S L E I F K + + ++ V +Q LW SR ++GS TGL
Sbjct: 986 IDAMRSQLSENIADFMKNRETKMMEHRAARERILEEVRSIIQSLWAGSRVELYGSCFTGL 1045
Query: 1266 SLPSSDVDLVVC-LPPVRNLEP 1286
L SSD+DLVVC L P P
Sbjct: 1046 DLISSDLDLVVCGLKPAGTASP 1067
>gi|195115910|ref|XP_002002499.1| GI12386 [Drosophila mojavensis]
gi|193913074|gb|EDW11941.1| GI12386 [Drosophila mojavensis]
Length = 348
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 55/92 (59%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
+D +A P T + T +++DISF P+ G+ +++L+K+ + P L LV
Sbjct: 117 VVDTAAVPIIRFTDCE-----THLKVDISFNMPN--GIDSSELIKKFLHEHPVLGKLVLV 169
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R+L+ + +GG+SSY L+++ FLQ
Sbjct: 170 LKQFLEQRNLNSTLNGGISSYNLIIMCINFLQ 201
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 61/126 (48%), Gaps = 30/126 (23%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI+ F + + T I + RV R +Q LWP + IFGS G++LP+SD+
Sbjct: 26 LHEEIEHFYRYMLPTPTEHAARIE-LLSRVERVIQGLWPEALVEIFGSFRLGINLPNSDI 84
Query: 1273 DLVV--CLP--PVRNLE-PIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV C P+R+LE ++ +GI V +L+ V+
Sbjct: 85 DLVVLGCWEHLPLRSLESELRSSGI------------------------VLPGTLQVVDT 120
Query: 1328 TAIPII 1333
A+PII
Sbjct: 121 AAVPII 126
>gi|301100842|ref|XP_002899510.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103818|gb|EEY61870.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 270
Score = 63.2 bits (152), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/70 (44%), Positives = 47/70 (67%), Gaps = 1/70 (1%)
Query: 1398 VRLDISF-KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
V LD++ S HTGL DL+ L + PA PL L+LK L +++L+ +++GG+SSY
Sbjct: 78 VLLDLTCGHSVGHTGLGARDLIYSLQTEMPALRPLVLILKSHLVNKNLNCAFTGGISSYV 137
Query: 1457 LMLLITRFLQ 1466
L++L+ RFLQ
Sbjct: 138 LVILVIRFLQ 147
>gi|119188673|ref|XP_001244943.1| hypothetical protein CIMG_04384 [Coccidioides immitis RS]
Length = 785
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 124/318 (38%), Gaps = 89/318 (27%)
Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
D ++ + ++ E PD F L +Q T R+A+ L +L +E
Sbjct: 297 DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 356
Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
+D + K E+ R I R R +Q +P S+ + FGS A+GL LP +DVDL
Sbjct: 357 VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 412
Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
V+ + + I +GR + K + Y+ + E S++T+ + +PI
Sbjct: 413 VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 462
Query: 1333 IMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNI 1392
I V + T LK D ++ D+++ + T
Sbjct: 463 IKFVDRL---------------------TGLKVD-------LSFDNNSGLAANRT----- 489
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
++ E FPA + V+KQFL R L++ +GGL
Sbjct: 490 -------------------------FQQWKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGL 524
Query: 1453 SSYCLMLLITRFLQHEHH 1470
+ ++ L+T LQH H
Sbjct: 525 GGFSIICLVTSLLQHLPH 542
>gi|47229501|emb|CAF99489.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1518
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 45/71 (63%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF TG++ +K+ + +P L VLKQFL R L++ ++GG+SSY
Sbjct: 11 TEVKVDISFNV--ETGVKAASFIKDYVKMYPVLPYLIFVLKQFLLQRDLNEVFTGGISSY 68
Query: 1456 CLMLLITRFLQ 1466
L+L++ FLQ
Sbjct: 69 SLILMVISFLQ 79
>gi|326430489|gb|EGD76059.1| hypothetical protein PTSG_00768 [Salpingoeca sp. ATCC 50818]
Length = 425
Score = 62.8 bits (151), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 45/72 (62%), Gaps = 2/72 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
T++ +DISF PS G Q + VK+ Q+ PL L++KQFL R L++ +SGG+ S
Sbjct: 201 VTNLWMDISFNQPS--GPQDSINVKKWKTQYRGLVPLVLIIKQFLLQRGLNEPFSGGIGS 258
Query: 1455 YCLMLLITRFLQ 1466
Y + LL+ FLQ
Sbjct: 259 YAVFLLVMSFLQ 270
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 11/79 (13%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI F K + E T R+ +++ RV + LWP++ +FGS TGL LP+
Sbjct: 97 LHQEIVDFHKYMEPMKEEVTLRRAFVD----RVKEVILGLWPKAEVTVFGSFNTGLYLPT 152
Query: 1270 SDVDLVV----CLPPVRNL 1284
SD+D+VV +PP++ L
Sbjct: 153 SDIDVVVFGDWAVPPLQTL 171
>gi|145497122|ref|XP_001434550.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401677|emb|CAK67153.1| unnamed protein product [Paramecium tetraurelia]
Length = 801
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/256 (23%), Positives = 101/256 (39%), Gaps = 53/256 (20%)
Query: 1216 EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLV 1275
+I F Q+ +N + P R+ +Q L+ + +FGS AT L+LP SD+D+
Sbjct: 476 DIKEFIDQIRRDNDLQFPIRQLVFNRIQFIIQFLFKDAGVCLFGSCATRLALPDSDIDIG 535
Query: 1276 VCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
+ L K I+E +L W+K T +P+I L
Sbjct: 536 ITGLETHQLNQ-KMDVIIE---------------FLYKMNWIKRIKPIYPTQTTLPLIKL 579
Query: 1336 VVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAA 1395
V+ I ++ P H D+V +
Sbjct: 580 WVDPS---IPFRNGNMNLP---------------HIDLVC------------------QS 603
Query: 1396 TSVRLDISF-KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
+++DISF H GL +T+L +++ + L+ K L R L+ GG+SS
Sbjct: 604 QLIQVDISFFGHIQHQGLTSTELTCFWLQEYQELKTITLLFKSLLKKRGLNDQSKGGISS 663
Query: 1455 YCLMLLITRFLQHEHH 1470
+CL+L++ FL++ +
Sbjct: 664 FCLVLIVVAFLEYYYQ 679
>gi|47208265|emb|CAF92498.1| unnamed protein product [Tetraodon nigroviridis]
Length = 297
Score = 62.8 bits (151), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
+ +DISF S G++ L+KE E++P L LVLKQFL R L++ ++GG+ SY L
Sbjct: 93 IFVDISFNMKS--GVKAAQLIKEFKEKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSL 150
Query: 1458 MLLITRFLQHEHH 1470
L+ FLQ +
Sbjct: 151 FLMAVSFLQLHYR 163
Score = 45.4 bits (106), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 26/39 (66%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
V R+ + LWP + +FGS +TGL LP+SD+DLVV
Sbjct: 14 VVDRIKGVIHDLWPSAEVQVFGSFSTGLYLPTSDIDLVV 52
>gi|213403316|ref|XP_002172430.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
gi|212000477|gb|EEB06137.1| Poly(A) RNA polymerase cid14 [Schizosaccharomyces japonicus yFS275]
Length = 667
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 46/71 (64%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +D+SF P GL+T +V ++FPA PL +++K FL R+L++ + GGLSSY
Sbjct: 342 TRIHVDVSFNQPG--GLKTCLVVNGFLKKFPAVRPLTMLVKHFLNMRALNEVFLGGLSSY 399
Query: 1456 CLMLLITRFLQ 1466
++ L+ FLQ
Sbjct: 400 AIVCLVVSFLQ 410
Score = 43.9 bits (102), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 7/68 (10%)
Query: 1212 LLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
LLH++I +F + E+ RK I ++ R+++ WP +FGS T L LP
Sbjct: 231 LLHQDILNFINYLEPTPQEHAVRKSLIT----KLDRAIRAKWPEVTVYVFGSFETRLYLP 286
Query: 1269 SSDVDLVV 1276
+SD+D+VV
Sbjct: 287 TSDIDMVV 294
>gi|348512463|ref|XP_003443762.1| PREDICTED: DNA polymerase sigma-like [Oreochromis niloticus]
Length = 789
Score = 62.4 bits (150), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF T ++ +K +++P PL VLKQFL R L++ ++GG+SSY
Sbjct: 338 TRVKVDISFNI--ETAVKAAQFIKSYLKKYPVLPPLIFVLKQFLLQRELNEVFTGGISSY 395
Query: 1456 CLMLLITRFLQ 1466
L+L+ FLQ
Sbjct: 396 SLILMAISFLQ 406
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 51/95 (53%), Gaps = 11/95 (11%)
Query: 1213 LHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI SF E + R+ +N R+ ++ LWP + IFGS +TGL LP+
Sbjct: 231 LHEEILDFFSFMSPKPEEESMRRDVVN----RIEGIIKDLWPTVQVEIFGSFSTGLYLPT 286
Query: 1270 SDVDLVVC----LPPVRNLEPIKEAGILEGRNGIK 1300
SD+DLVV PP++ LE + + G + IK
Sbjct: 287 SDIDLVVFGNWDHPPLQELEQALKKHNVSGSHPIK 321
>gi|326917206|ref|XP_003204892.1| PREDICTED: DNA polymerase sigma-like [Meleagris gallopavo]
Length = 429
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +KE +
Sbjct: 70 KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARFIKEYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|61098442|ref|NP_001012968.1| DNA polymerase sigma [Gallus gallus]
gi|53130894|emb|CAG31774.1| hypothetical protein RCJMB04_10o8 [Gallus gallus]
Length = 429
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +KE +
Sbjct: 70 KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARFIKEYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|327270606|ref|XP_003220080.1| PREDICTED: DNA polymerase sigma-like [Anolis carolinensis]
Length = 686
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ L+K+ +
Sbjct: 271 KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARLIKDYMK 323
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 324 KYTLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 366
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 45/80 (56%), Gaps = 11/80 (13%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F ++ E RK VKR+ ++ LWP + IFGS +TGL LP+
Sbjct: 191 LHEEIMDFYDFMSPRPEEAVMRKE----VVKRIETVIKDLWPTADVQIFGSFSTGLYLPT 246
Query: 1270 SDVDLVVC----LPPVRNLE 1285
SD+DLVV PP++ LE
Sbjct: 247 SDIDLVVFGKWESPPLQLLE 266
>gi|405972624|gb|EKC37384.1| PAP-associated domain-containing protein 5 [Crassostrea gigas]
Length = 672
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 45/71 (63%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF + +++ +L+ + E+FP L LVLKQFL R L++ ++GG+SSY
Sbjct: 224 TEVKVDISFNT--QNSVESANLITKYMEEFPNLKYLVLVLKQFLLQRDLNEVFTGGISSY 281
Query: 1456 CLMLLITRFLQ 1466
L+ + F Q
Sbjct: 282 SLIYMTVSFFQ 292
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 70/152 (46%), Gaps = 39/152 (25%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F + + A E R N V+R+ ++ LWP ++ IFGS TGL LP+
Sbjct: 117 LHEEIKDFYEYMSPKAEEANMR----NEVVRRIKDVVEDLWPEAKVEIFGSFKTGLYLPT 172
Query: 1270 SDVDLVV-----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SD+DLVV LP + ++E G + S+K
Sbjct: 173 SDIDLVVYGKWDSLPLFTLEKALRERG------------------------YADPASVKV 208
Query: 1325 VENTAIPIIMLV---VEVPHDLIASAASSVQS 1353
++ ++PII +V EV D+ + +SV+S
Sbjct: 209 LDKASVPIIKMVDLQTEVKVDISFNTQNSVES 240
>gi|449493801|ref|XP_002190556.2| PREDICTED: DNA polymerase sigma [Taeniopygia guttata]
Length = 429
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +KE +
Sbjct: 70 KHNVAEPYSIKVLDKATVPIIKLTDQE-----TEVKVDISFNV--ETGVKAARFIKEYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVIKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|387196341|gb|AFJ68755.1| DNA polymerase sigma subunit, partial [Nannochloropsis gaditana
CCMP526]
Length = 419
Score = 62.0 bits (149), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 45/71 (63%), Gaps = 2/71 (2%)
Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
++++DISF S S GL++ + E E P PL LVLK FLA R L+Q+Y GG S+
Sbjct: 177 NLQVDISFSSDS--GLKSARYMLEKMEAMPPLRPLILVLKYFLAQRELNQTYMGGCGSFL 234
Query: 1457 LMLLITRFLQH 1467
L L++ +LQH
Sbjct: 235 LQLMVIAYLQH 245
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 42/74 (56%), Gaps = 16/74 (21%)
Query: 1214 HEEIDSFCKQVA---AENTARKPYINWAVKRVTR----SLQVLWPRSRTNIFGSNATGLS 1266
H E+ FC +A AE AR+ +VTR +++ LWP ++FGS AT +
Sbjct: 68 HNELLDFCDLLAPTRAELEARQ--------KVTRISADTVKKLWPSFDVHVFGSEATKVF 119
Query: 1267 LPSSDVDLVVCLPP 1280
LP SD+D+VV LPP
Sbjct: 120 LPDSDIDMVV-LPP 132
>gi|261192276|ref|XP_002622545.1| topoisomerase TRF4 [Ajellomyces dermatitidis SLH14081]
gi|239589420|gb|EEQ72063.1| topoisomerase TRF4 [Ajellomyces dermatitidis SLH14081]
Length = 681
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 81/290 (27%)
Query: 1192 QPLEVQNCPTRKASL--SLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTR 1244
+P E QN SL S++H S LH EI SF K EN+ R N + R+ R
Sbjct: 227 KPFEDQNTTPWIDSLGPSIVHLGSRLHNEILSFYHWVKPKPFENSIR----NDLIARLQR 282
Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KET 1302
+ S+ FGS A+GL LP++D+DLV+ + I + R + +
Sbjct: 283 HFERRHYSSQLRAFGSFASGLYLPTADMDLVL----------LSRQFIRQDRKVLCQRPR 332
Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
+ A YL + + S++T+ + +PII V + T
Sbjct: 333 EIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------TG 371
Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
LK D ++ D+S TG+ + +
Sbjct: 372 LKVD-------LSFDNS------------------------------TGIAANETFQVWK 394
Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QFPA + V+KQFL R L++ +GGL + ++ L+T LQH + G
Sbjct: 395 SQFPAMPIIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLLQHLPYSG 444
>gi|440796505|gb|ELR17614.1| nucleotidyltransferase domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 911
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ AAT +DISF P TG +L+K ++ P+ PLAL++K +L +++ Y+GG
Sbjct: 363 VDAATGCLVDISFDVP--TGPAHINLIKRYLDEEPSVKPLALLIKYYLKQFGMNEPYTGG 420
Query: 1452 LSSYCLMLLITRFLQ 1466
L SY L+++I +LQ
Sbjct: 421 LGSYALIIMIISYLQ 435
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 45/80 (56%), Gaps = 7/80 (8%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI SFCK V A E R+ I R+++ ++ LWP + +FGS AT + LP+
Sbjct: 261 LHDEIISFCKWVSPSAEEKQMREDVI----ARISKVVETLWPSVQLRVFGSCATDIYLPT 316
Query: 1270 SDVDLVVCLPPVRNLEPIKE 1289
SD+DL + + PI E
Sbjct: 317 SDIDLCIMGANACSPSPIDE 336
>gi|320164013|gb|EFW40912.1| PAP associated domain containing 5 [Capsaspora owczarzaki ATCC 30864]
Length = 558
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 46/74 (62%), Gaps = 2/74 (2%)
Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLS 1453
A + V +DISF S GL+T + ++ +++ L V+KQFLA R L++ YSGGL
Sbjct: 315 ALSGVFVDISFNLES--GLRTVEFIRACVDEYRMLYHLTFVIKQFLAQRQLNEPYSGGLG 372
Query: 1454 SYCLMLLITRFLQH 1467
SY ++LL+ FLQ
Sbjct: 373 SYSVVLLVVSFLQR 386
Score = 45.1 bits (105), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 37/70 (52%), Gaps = 6/70 (8%)
Query: 1209 MHSLLHEEID--SFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
++SL E D F K E+ R+ V+R+ + W +R +FGS ATGL
Sbjct: 209 LYSLEQEMYDFVEFIKPTPLEHQMREE----IVQRIREVITGAWKHARVEVFGSFATGLY 264
Query: 1267 LPSSDVDLVV 1276
LP SD+D+VV
Sbjct: 265 LPMSDIDIVV 274
>gi|296423712|ref|XP_002841397.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637635|emb|CAZ85588.1| unnamed protein product [Tuber melanosporum]
Length = 802
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 98/259 (37%), Gaps = 78/259 (30%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI F + A E+ R+ ++ RV + LWP + +FGS A L LP+
Sbjct: 391 LHNEILDFIAYIQPRAYEHAVRRELVH----RVRVVVTKLWPDTDLRVFGSFAAELYLPT 446
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGI--LEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
SD+DLVV + GI RN + + L+ A + + K SL + N
Sbjct: 447 SDIDLVVI------STSFAKIGIPRYSERNALYK--LRQA---ILSSNTAKPGSLAVIAN 495
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII V + ++H D
Sbjct: 496 AKVPIIKFV---------------------------DRETNIHVD--------------- 513
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
ISF++ S GL + Q+PA L V+KQFLA R L++
Sbjct: 514 --------------ISFENSS--GLVANETFIRWKAQYPAMPILLTVIKQFLAMRGLNEV 557
Query: 1448 YSGGLSSYCLMLLITRFLQ 1466
Y GGL S+ + LI Q
Sbjct: 558 YLGGLGSFSVTCLIVSMFQ 576
>gi|327349744|gb|EGE78601.1| topoisomerase TRF4 [Ajellomyces dermatitidis ATCC 18188]
Length = 681
Score = 61.6 bits (148), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 81/290 (27%)
Query: 1192 QPLEVQNCPTRKASL--SLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTR 1244
+P E QN SL S++H S LH EI SF K EN+ R N + R+ R
Sbjct: 227 KPFEDQNTTPWIDSLGPSIVHLGSRLHNEILSFYHWVKPKPFENSIR----NDLIARLQR 282
Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KET 1302
+ S+ FGS A+GL LP++D+DLV+ + I + R + +
Sbjct: 283 HFERRHYGSQLRAFGSFASGLYLPTADMDLVL----------LSRQFIRQDRKVLCQRPR 332
Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
+ A YL + + S++T+ + +PII V + T
Sbjct: 333 EIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------TG 371
Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
LK D ++ D+S TG+ + +
Sbjct: 372 LKVD-------LSFDNS------------------------------TGIAANETFQVWK 394
Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QFPA + V+KQFL R L++ +GGL + ++ L+T LQH + G
Sbjct: 395 SQFPAMPIIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLLQHLPYSG 444
>gi|74003095|ref|XP_545180.2| PREDICTED: DNA polymerase sigma [Canis lupus familiaris]
Length = 672
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +L+K +
Sbjct: 200 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 252
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 253 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 295
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 120 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPAADVQIFGSFSTGLY 172
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 173 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 209
Query: 1323 KTVENTAIPIIML 1335
K ++ +PII L
Sbjct: 210 KVLDKATVPIIKL 222
>gi|322697398|gb|EFY89178.1| topoisomerase family protein TRF4 [Metarhizium acridum CQMa 102]
Length = 702
Score = 61.6 bits (148), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 113/291 (38%), Gaps = 84/291 (28%)
Query: 1199 CPTRKASLSLMHSL---LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPR 1252
CP + S H++ LH+EI F V A E R + K + R + +
Sbjct: 352 CPWQTKDHSATHNMAFRLHKEILDFYDYVRPRAFEQRMRDNLVENLCKAMRRDRRN-FAS 410
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLA 1312
++ + FGS +GL LP++D+DLVVC P G ++ L ++L
Sbjct: 411 AQVHPFGSFMSGLYLPTADMDLVVCSASYMRGGPPTYLGA--------KSWLYKFQKFLV 462
Query: 1313 NQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSD 1372
Q+ SDS++ + + IP++ V ++ T LK D
Sbjct: 463 MQKVADSDSIEVIAHARIPLVKFVDKL---------------------TGLKVD------ 495
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
S +N+ T+V +S+K +Q+PA L
Sbjct: 496 --------------VSFENLGGVTAVDTFLSWK-----------------KQYPAMPILV 524
Query: 1433 LVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH-----------EHHLG 1472
V+K FL R L++ +GG+ + ++ L+ LQ EHHLG
Sbjct: 525 TVIKHFLLMRGLNEPVNGGIGGFSVICLVVSMLQMMPHVQSRSLIPEHHLG 575
>gi|410949807|ref|XP_003981609.1| PREDICTED: DNA polymerase sigma [Felis catus]
Length = 542
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +L+K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|403374868|gb|EJY87397.1| Nucleotidyltransferase domain containing protein [Oxytricha
trifallax]
Length = 2589
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 99/228 (43%), Gaps = 54/228 (23%)
Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRN-LEPIKEAGILEGRNGIKETCLQHAARYLANQE 1315
+FGS TGL+L SSD+DL V + + I E L G A Q+
Sbjct: 2075 LFGSLVTGLALESSDMDLAVTGLYIEDRYAVICELNKLSG----------------ALQD 2118
Query: 1316 WVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDM-- 1373
W +S K++E +IP+I + V DL ++ NH SD
Sbjct: 2119 WDILESFKSIETASIPVIKMKV----DLYQLREKEMK--------------NHGFSDFDL 2160
Query: 1374 -------------VALDDSASPKCSHTSSDNIKAATSV-RLDISFKSPSHTGLQTTDLVK 1419
+ DDS+ + + + NI + +L+I+ +H GLQ+ LV+
Sbjct: 2161 RPIPEELRYFSVDITFDDSSCSQMMNQVTSNIFQQDLISQLNIT---KTHLGLQSCSLVQ 2217
Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
+ ++++LK+FLA + + Y GG+SSY ++L+ ++ +
Sbjct: 2218 FFIANYQNLKEVSILLKRFLALNNFNSPYYGGISSYSAVILLVAYMNY 2265
>gi|448097882|ref|XP_004198786.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
gi|359380208|emb|CCE82449.1| Piso0_002176 [Millerozyma farinosa CBS 7064]
Length = 650
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 88/231 (38%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N VK + R + LWP + ++FGS+AT L LP SD+D+VV G E
Sbjct: 211 NRVVKDLKREINNLWPDTEAHVFGSSATDLYLPGSDIDMVVT----------SNTGDYEN 260
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + YL N++ K ++ + +PI+ V P
Sbjct: 261 R-----SKLYQLSSYLRNRKLAK--DIEVIAKAKVPIVKFV----------------DPS 297
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+++H D+ S + I+AA +R
Sbjct: 298 -----------SNIHIDI-----------SFERRNGIEAAKRIR---------------- 319
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ P L L++KQFL R L+ + GGL Y ++L FL+
Sbjct: 320 ----RWLDRTPGLRELVLIVKQFLRSRRLNNVHVGGLGGYSTIILCYHFLR 366
>gi|281354437|gb|EFB30021.1| hypothetical protein PANDA_012597 [Ailuropoda melanoleuca]
Length = 542
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +L+K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|443713391|gb|ELU06261.1| hypothetical protein CAPTEDRAFT_115913 [Capitella teleta]
Length = 431
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 51/92 (55%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T T V++DISF + G+++ +L+ + P L LV
Sbjct: 175 VLDKASVPIVKLTDK-----KTEVKVDISFNM--NNGVRSANLIMDYMRDMPTLPYLVLV 227
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+LL FLQ
Sbjct: 228 LKQFLLQRDLNEVFTGGISSYSLILLTVSFLQ 259
Score = 48.5 bits (114), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 41/67 (61%), Gaps = 7/67 (10%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI F + ++ E+ R+ V+R++ +Q LWP ++ IFGS T L LP+
Sbjct: 84 LHQEIMDFYEYMSPQPEEHYMRED----VVQRISAVIQGLWPCAKVEIFGSFRTHLYLPT 139
Query: 1270 SDVDLVV 1276
SD+DLVV
Sbjct: 140 SDIDLVV 146
>gi|194745606|ref|XP_001955278.1| GF16314 [Drosophila ananassae]
gi|190628315|gb|EDV43839.1| GF16314 [Drosophila ananassae]
Length = 394
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 57/103 (55%), Gaps = 7/103 (6%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD ++ P T + + ++ D++F + + +G+Q +L++E Q+P L +V
Sbjct: 126 VLDKASMPVVKFTET-----ISQIKFDVNFNAAA-SGVQAAELIREFIRQYPDLPKLVMV 179
Query: 1435 LKQFLADRSLDQSYS-GGLSSYCLMLLITRFLQHEHHLGRPIN 1476
LKQFL + L++ YS GG+SSY L L+ FL+ + + N
Sbjct: 180 LKQFLTIQGLNEVYSTGGVSSYALTLMCISFLKQQARCKKKYN 222
Score = 49.3 bits (116), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 69/149 (46%), Gaps = 28/149 (18%)
Query: 1212 LLHEEID---SFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
LLH+EI+ +F + E R AV+R+ + +WP S ++FGS TGL+LP
Sbjct: 32 LLHQEIEKFYNFMQSTPIEFLLRAE----AVRRIEDVVLAIWPSSCVDVFGSFRTGLNLP 87
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
SD+D+VV N + L R L + + DS+ ++
Sbjct: 88 VSDIDMVV-----------------YHANHWHKAPLHELQRELIARGVAEPDSISVLDKA 130
Query: 1329 AIPIIML---VVEVPHDL-IASAASSVQS 1353
++P++ + ++ D+ +AAS VQ+
Sbjct: 131 SMPVVKFTETISQIKFDVNFNAAASGVQA 159
>gi|145488179|ref|XP_001430094.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397189|emb|CAK62696.1| unnamed protein product [Paramecium tetraurelia]
Length = 986
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 94/215 (43%), Gaps = 49/215 (22%)
Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
IFGS+ TGL+L SDVD+VV PV +A + + + E + +W
Sbjct: 681 IFGSSRTGLALHDSDVDMVVFGLPV-----FTKAQLFDPMRKLLEV--------FSQMKW 727
Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
+ S K + IP+I + ++ + + + + + H N
Sbjct: 728 --AISYKHIFQATIPLIKITIDPSTGFLEFVGNPL--------YYVMNHRN--------- 768
Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPS----------HTGLQTTDLVKELTEQFP 1426
D + K S + +++DI+F+ H GL +TD +++
Sbjct: 769 IDQFNLKYGELSQN-------IQIDITFELQQPYPMYGYQGYHVGLLSTDYQRQVGNSVK 821
Query: 1427 ASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
T +A+VLK+ L R L+ SY+GG+SS+CL +++
Sbjct: 822 GFTEVAIVLKKLLKHRGLNDSYTGGVSSFCLTVML 856
>gi|388580693|gb|EIM21006.1| Nucleotidyltransferase, partial [Wallemia sebi CBS 633.66]
Length = 360
Score = 61.2 bits (147), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 67/128 (52%), Gaps = 11/128 (8%)
Query: 1342 DLIASAASSVQSPKEDAAHTT---LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSV 1398
DL+ S Q K+ HT L+ N S V + + P TSS
Sbjct: 134 DLVVMRKSVNQYNKQSMLHTMASMLRQANLAQSIQV-ISKARVPIIKFTSS-----FGGY 187
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
+DIS + G+ +V E+ +++PA+ PL+++LK FL+ RS+++ Y+GG+SSY ++
Sbjct: 188 PIDISLNQTN--GVDAGRMVNEILDRYPAARPLSMLLKCFLSQRSMNEVYTGGVSSYSVI 245
Query: 1459 LLITRFLQ 1466
L+ FLQ
Sbjct: 246 CLVVSFLQ 253
Score = 40.8 bits (94), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 1/70 (1%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LLH E+++F K ++ T K + ++ + R + W + FGS T L LP D
Sbjct: 74 LLHREVNAFTKFISPSLTEHKTR-EYTIECIRRCITSRWADAEVFAFGSFETRLYLPDGD 132
Query: 1272 VDLVVCLPPV 1281
+DLVV V
Sbjct: 133 IDLVVMRKSV 142
>gi|239615134|gb|EEQ92121.1| topoisomerase TRF4 [Ajellomyces dermatitidis ER-3]
Length = 681
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 81/290 (27%)
Query: 1192 QPLEVQNCPTRKASL--SLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVTR 1244
+P E QN SL S++H S LH EI SF K EN+ R N + R+ R
Sbjct: 227 KPFEDQNTTPWIDSLGPSIVHLGSRLHNEILSFYHWVKPKPFENSIR----NDLIARLQR 282
Query: 1245 SLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KET 1302
+ S+ FGS A+GL LP++D+DLV+ + I + R + +
Sbjct: 283 HFERRHYGSQLRAFGSFASGLYLPTADMDLVL----------LSRQFIRQDRKVLCQRPR 332
Query: 1303 CLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTT 1362
+ A YL + + S++T+ + +PII V + T
Sbjct: 333 EIYSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------TG 371
Query: 1363 LKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELT 1422
LK D ++ D+S TG+ + +
Sbjct: 372 LKVD-------LSFDNS------------------------------TGIAANETFQVWK 394
Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QFPA + V+KQFL R L++ +GGL + ++ L+T LQH + G
Sbjct: 395 SQFPAMPIIVSVIKQFLLLRGLNEVPTGGLGGFSIICLVTSLLQHLPYSG 444
>gi|395510809|ref|XP_003759662.1| PREDICTED: DNA polymerase sigma [Sarcophilus harrisii]
Length = 594
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ L+K+ +
Sbjct: 122 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVKAARLIKDYMK 174
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 175 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 217
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 69/156 (44%), Gaps = 43/156 (27%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 42 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVIKDLWPTADVQIFGSFSTGLY 94
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 95 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 131
Query: 1323 KTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
K ++ +PII L EV D+ + + V++ +
Sbjct: 132 KVLDKATVPIIKLTDQETEVKVDISFNMETGVKAAR 167
>gi|301776096|ref|XP_002923468.1| PREDICTED: DNA polymerase sigma-like [Ailuropoda melanoleuca]
Length = 656
Score = 61.2 bits (147), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ +L+K +
Sbjct: 184 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAELIKNYMK 236
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 237 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 279
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 104 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPAADVQIFGSFSTGLY 156
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 157 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 193
Query: 1323 KTVENTAIPIIML 1335
K ++ +PII L
Sbjct: 194 KVLDKATVPIIKL 206
>gi|334325414|ref|XP_001371537.2| PREDICTED: DNA polymerase sigma-like [Monodelphis domestica]
Length = 860
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 56/103 (54%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ L+K+ +
Sbjct: 388 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVKAARLIKDYMK 440
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 441 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 483
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 69/156 (44%), Gaps = 43/156 (27%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 308 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVIKDLWPTADVQIFGSFSTGLY 360
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 361 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 397
Query: 1323 KTVENTAIPIIMLV---VEVPHDLIASAASSVQSPK 1355
K ++ +PII L EV D+ + + V++ +
Sbjct: 398 KVLDKATVPIIKLTDQETEVKVDISFNMETGVKAAR 433
>gi|145494404|ref|XP_001433196.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400313|emb|CAK65799.1| unnamed protein product [Paramecium tetraurelia]
Length = 940
Score = 60.8 bits (146), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 59/241 (24%), Positives = 103/241 (42%), Gaps = 60/241 (24%)
Query: 1235 INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC---LPPVRNLEPIKEAG 1291
I++ + ++ R S +FGS ATGL+LP SD+D+ + L P L
Sbjct: 618 ISFVINQLFREF-----NSNVRLFGSCATGLALPESDIDIGITGFELFPSTQL------- 665
Query: 1292 ILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSV 1351
NG +Q +L +WVK+ ++ + + +P+I L V+ +
Sbjct: 666 -----NG----PIQKIIDFLQKMKWVKN--IRAITTSNMPLIKLQVDPTISFV------- 707
Query: 1352 QSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISF------K 1405
D++H + + D++ D P SH S +D+SF K
Sbjct: 708 -----DSSHVLVLP----YIDLIPNSDEEIP--SHLFS----------VDVSFFQYQGAK 746
Query: 1406 SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
H G +T+L + + P+ L+ K L R L+ Y GG+SS+C++ ++ FL
Sbjct: 747 QNWHLGQISTELTLQWLSFYNELRPIVLLFKSLLKKRGLNDQYKGGISSFCIIQMVLAFL 806
Query: 1466 Q 1466
+
Sbjct: 807 E 807
>gi|366992111|ref|XP_003675821.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
gi|342301686|emb|CCC69457.1| hypothetical protein NCAS_0C04670 [Naumovozyma castellii CBS 4309]
Length = 586
Score = 60.8 bits (146), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 93/231 (40%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + +V +++ LWP + ++FGS AT L LP SD+D V+ +AG E
Sbjct: 199 NQTISKVRNAVKQLWPDADLHVFGSYATDLYLPGSDIDCVIN----------SKAGDKEN 248
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN L A +L Q + ++ + T +PII V
Sbjct: 249 RN-----SLYSLASFLKQQGL--ATQIEVIAKTRVPIIKFV------------------- 282
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ ++++H D+ S ++ ++AA
Sbjct: 283 --------EPESNIHIDV-----------SFERTNGLEAAK------------------- 304
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L++E + P L L++KQFL R L+ ++GGL + ++ ++ FLQ
Sbjct: 305 -LIREWLQDTPGLRELVLIIKQFLHSRRLNNVHTGGLGGFSIICIVFSFLQ 354
>gi|440905114|gb|ELR55540.1| DNA polymerase sigma [Bos grunniens mutus]
Length = 542
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 47.4 bits (111), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
V+R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|237512970|ref|NP_001153629.1| DNA polymerase sigma [Bos taurus]
Length = 541
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 47.4 bits (111), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
V+R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|157819111|ref|NP_001100803.1| DNA polymerase sigma [Rattus norvegicus]
gi|149032753|gb|EDL87608.1| polymerase (DNA directed) sigma (predicted) [Rattus norvegicus]
Length = 541
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPSADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|145513226|ref|XP_001442524.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124409877|emb|CAK75127.1| unnamed protein product [Paramecium tetraurelia]
Length = 220
Score = 60.5 bits (145), Expect = 7e-06, Method: Composition-based stats.
Identities = 30/81 (37%), Positives = 50/81 (61%), Gaps = 8/81 (9%)
Query: 1393 KAATSVRLDISFK-------SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
+ + +++DISF SP H+G TT+LVK+ +++P L L+LK + L
Sbjct: 14 RIKSCIKVDISFNFSGSTYDSP-HSGFITTNLVKQWMKEYPVIQQLVLILKSMIKKLGLS 72
Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
+SY+GGLSSY L++++ FL+
Sbjct: 73 ESYTGGLSSYSLIIMVYSFLR 93
>gi|296475642|tpg|DAA17757.1| TPA: DNA polymerase sigma [Bos taurus]
Length = 541
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 47.4 bits (111), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
V+R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|387273371|gb|AFJ70180.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
Length = 542
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|431900711|gb|ELK08155.1| DNA polymerase sigma [Pteropus alecto]
Length = 539
Score = 60.5 bits (145), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 33/52 (63%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
V R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVGRIETVVKDLWPAADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|395735628|ref|XP_002815457.2| PREDICTED: DNA polymerase sigma [Pongo abelii]
Length = 542
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|284507293|ref|NP_001165276.1| DNA polymerase sigma isoform 2 [Homo sapiens]
gi|219518042|gb|AAI43883.1| POLS protein [Homo sapiens]
Length = 541
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|426246851|ref|XP_004017201.1| PREDICTED: LOW QUALITY PROTEIN: DNA polymerase sigma [Ovis aries]
Length = 602
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 47.0 bits (110), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 34/51 (66%), Gaps = 4/51 (7%)
Query: 1239 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
V+R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 15 VRRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|417402533|gb|JAA48112.1| Putative dna polymerase sigma [Desmodus rotundus]
Length = 542
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|5902142|ref|NP_008930.1| DNA polymerase sigma isoform 1 [Homo sapiens]
gi|397472993|ref|XP_003808011.1| PREDICTED: DNA polymerase sigma [Pan paniscus]
gi|60392922|sp|Q5XG87.2|PAPD7_HUMAN RecName: Full=DNA polymerase sigma; AltName: Full=DNA polymerase
kappa; AltName: Full=LAK-1; AltName: Full=PAP-associated
domain-containing protein 7; AltName: Full=Terminal
uridylyltransferase 5; Short=TUTase 5; AltName:
Full=Topoisomerase-related function protein 4-1;
Short=TRF4-1
gi|5139669|dbj|BAA24434.2| LAK-1 [Homo sapiens]
gi|47777665|gb|AAT38110.1| polymerase (DNA directed) sigma [Homo sapiens]
gi|109658946|gb|AAI17138.1| Polymerase (DNA directed) sigma [Homo sapiens]
gi|116496943|gb|AAI26107.1| Polymerase (DNA directed) sigma [Homo sapiens]
gi|119628506|gb|EAX08101.1| polymerase (DNA directed) sigma [Homo sapiens]
gi|127799472|gb|AAH84567.2| Polymerase (DNA directed) sigma [Homo sapiens]
gi|158260737|dbj|BAF82546.1| unnamed protein product [Homo sapiens]
gi|313883146|gb|ADR83059.1| PAP associated domain containing 7 [synthetic construct]
gi|410210372|gb|JAA02405.1| PAP associated domain containing 7 [Pan troglodytes]
gi|410259248|gb|JAA17590.1| PAP associated domain containing 7 [Pan troglodytes]
gi|410303968|gb|JAA30584.1| PAP associated domain containing 7 [Pan troglodytes]
gi|410337733|gb|JAA37813.1| PAP associated domain containing 7 [Pan troglodytes]
Length = 542
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|386782099|ref|NP_001248228.1| DNA polymerase sigma [Macaca mulatta]
gi|355691204|gb|EHH26389.1| DNA polymerase sigma [Macaca mulatta]
gi|355749808|gb|EHH54146.1| DNA polymerase sigma [Macaca fascicularis]
gi|380809116|gb|AFE76433.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
gi|383415413|gb|AFH30920.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
gi|384945014|gb|AFI36112.1| DNA polymerase sigma isoform 1 [Macaca mulatta]
Length = 542
Score = 60.5 bits (145), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|281306714|ref|NP_001162602.1| DNA polymerase sigma isoform 2 [Mus musculus]
gi|74220251|dbj|BAE31304.1| unnamed protein product [Mus musculus]
Length = 541
Score = 60.1 bits (144), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|403282221|ref|XP_003932554.1| PREDICTED: DNA polymerase sigma [Saimiri boliviensis boliviensis]
Length = 542
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|444729764|gb|ELW70170.1| DNA polymerase sigma [Tupaia chinensis]
Length = 729
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 137 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 189
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 190 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 232
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 57 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPAADVQIFGSFSTGLY 109
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 110 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 146
Query: 1323 KTVENTAIPIIML 1335
K ++ +PII L
Sbjct: 147 KVLDKATVPIIKL 159
>gi|281306711|ref|NP_941002.2| DNA polymerase sigma isoform 1 [Mus musculus]
gi|148705067|gb|EDL37014.1| polymerase (DNA directed) sigma [Mus musculus]
Length = 542
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|325186972|emb|CCA21516.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 766
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 44/75 (58%), Gaps = 3/75 (4%)
Query: 1395 ATSVRLDISFKSPS---HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
A SV DI+F P+ H G+ T LV L ++F L +VLK FLA R L+ Y GG
Sbjct: 476 AQSVHFDITFDDPNVSPHDGVATAVLVNCLIDRFYGLGQLNIVLKYFLAKRDLNDPYMGG 535
Query: 1452 LSSYCLMLLITRFLQ 1466
LSSY L L+IT ++
Sbjct: 536 LSSYGLFLMITHIVR 550
>gi|354479063|ref|XP_003501733.1| PREDICTED: DNA polymerase sigma-like [Cricetulus griseus]
Length = 624
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 151 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 203
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 204 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 246
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 58/133 (43%), Gaps = 40/133 (30%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA V R+ ++ LWP + IFGS +TGL
Sbjct: 71 LHEEIIDFYNFMSPCPEEAAMRRE-------VVTRIETVVKDLWPTADVQIFGSFSTGLY 123
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 124 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 160
Query: 1323 KTVENTAIPIIML 1335
K ++ +PII L
Sbjct: 161 KVLDKATVPIIKL 173
>gi|432104646|gb|ELK31258.1| DNA polymerase sigma [Myotis davidii]
Length = 513
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPGSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 34/52 (65%), Gaps = 4/52 (7%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLE 1285
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLE 65
>gi|60392923|sp|Q6PB75.1|PAPD7_MOUSE RecName: Full=DNA polymerase sigma; AltName: Full=PAP-associated
domain-containing protein 7
gi|37590586|gb|AAH59846.1| Polymerase (DNA directed) sigma [Mus musculus]
Length = 542
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|449326076|gb|AGE92663.1| PAPD7 [Homo sapiens]
Length = 772
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 300 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 352
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 353 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 395
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 40/134 (29%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 220 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 272
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 273 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 309
Query: 1323 KTVENTAIPIIMLV 1336
K ++ +PII L
Sbjct: 310 KVLDKATVPIIKLT 323
>gi|344255499|gb|EGW11603.1| DNA polymerase sigma [Cricetulus griseus]
Length = 534
Score = 60.1 bits (144), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 61 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 113
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 114 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 156
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 48/102 (47%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
V R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 5 VVTRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 60
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 61 --KHNVAEPC-----------------SIKVLDKATVPIIKL 83
>gi|344272726|ref|XP_003408182.1| PREDICTED: DNA polymerase sigma-like [Loxodonta africana]
Length = 776
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 46/71 (64%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF TG++ +L+K +++ L LVLKQFL R L++ ++GG+SSY
Sbjct: 331 TEVKVDISFNV--ETGVRAAELIKSYMKKYSLLPYLILVLKQFLLQRDLNEVFTGGISSY 388
Query: 1456 CLMLLITRFLQ 1466
L+L+ FLQ
Sbjct: 389 SLILMAISFLQ 399
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 45/83 (54%), Gaps = 17/83 (20%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA T VKR+ ++ LWP + IFGS +TGL
Sbjct: 124 LHEEIIDFYNFMSPCPEEAAMRTE-------VVKRIETVIKDLWPTADVQIFGSFSTGLY 176
Query: 1267 LPSSDVDLVVC----LPPVRNLE 1285
LP+SD+DLVV PP++ LE
Sbjct: 177 LPTSDIDLVVFGKWERPPLQLLE 199
>gi|296194961|ref|XP_002745186.1| PREDICTED: DNA polymerase sigma [Callithrix jacchus]
Length = 866
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 395 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 447
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 448 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 490
Score = 48.5 bits (114), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 49/101 (48%), Gaps = 27/101 (26%)
Query: 1239 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGILE 1294
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 340 VKRIETVVKDLWPFFQVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR----- 394
Query: 1295 GRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 395 -KHNVAEPC-----------------SIKVLDKATVPIIKL 417
>gi|338718783|ref|XP_001501238.3| PREDICTED: DNA polymerase sigma [Equus caballus]
Length = 574
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 102 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 154
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 155 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 197
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 22 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 74
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 75 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 111
Query: 1323 KTVENTAIPIIML 1335
K ++ +PII L
Sbjct: 112 KVLDKATVPIIKL 124
>gi|402871130|ref|XP_003899535.1| PREDICTED: DNA polymerase sigma [Papio anubis]
Length = 775
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 303 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 355
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 356 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 398
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 40/134 (29%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 223 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTADVQIFGSFSTGLY 275
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 276 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 312
Query: 1323 KTVENTAIPIIMLV 1336
K ++ +PII L
Sbjct: 313 KVLDKATVPIIKLT 326
>gi|145533933|ref|XP_001452711.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420410|emb|CAK85314.1| unnamed protein product [Paramecium tetraurelia]
Length = 262
Score = 60.1 bits (144), Expect = 1e-05, Method: Composition-based stats.
Identities = 29/78 (37%), Positives = 49/78 (62%), Gaps = 8/78 (10%)
Query: 1396 TSVRLDISFK-------SPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSY 1448
+ +++DISF SP H+G TT+LVK+ +++P L L+LK + L +SY
Sbjct: 59 SCIKVDISFNFSGSTYDSP-HSGFMTTNLVKQWMKEYPVIQQLVLILKSMIKKLGLSESY 117
Query: 1449 SGGLSSYCLMLLITRFLQ 1466
+GGLSSY L++++ +L+
Sbjct: 118 TGGLSSYSLIIMVYSYLR 135
>gi|291411166|ref|XP_002721862.1| PREDICTED: DNA polymerase sigma [Oryctolagus cuniculus]
Length = 542
Score = 60.1 bits (144), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 70 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKSYMK 122
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 123 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 48.9 bits (115), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
VKR+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVKRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGKWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHNVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|240276761|gb|EER40272.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus H143]
Length = 680
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 117/291 (40%), Gaps = 83/291 (28%)
Query: 1192 QPLEVQNCPT---RKASLSLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVT 1243
+P E QN PT + S +H S LH EI SF K + E++ R N V+R+
Sbjct: 227 KPFEDQN-PTPWMDSSGPSTLHLGSRLHNEILSFYHWVKPKSFEHSIR----NDLVERLQ 281
Query: 1244 RSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KE 1301
R + S+ FGS A+GL LP++D+DLV+C + + R + +
Sbjct: 282 RHFERRHYGSQLCAFGSFASGLYLPTADMDLVLC----------SRQFVRQDRKVLCQRP 331
Query: 1302 TCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHT 1361
+ A YL + + S++T+ + +PII V + T
Sbjct: 332 REIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------T 370
Query: 1362 TLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
LK D ++ D+S TG+ +
Sbjct: 371 GLKVD-------LSFDNS------------------------------TGIAANKTFQVW 393
Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QFPA + V+KQFL R L++ +GGL + ++ L+T LQH + G
Sbjct: 394 KSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGFSIICLVTSLLQHLPYSG 444
>gi|225556159|gb|EEH04448.1| Poly(A) RNA polymerase cid14 [Ajellomyces capsulatus G186AR]
Length = 680
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 117/291 (40%), Gaps = 83/291 (28%)
Query: 1192 QPLEVQNCPT---RKASLSLMH--SLLHEEIDSF---CKQVAAENTARKPYINWAVKRVT 1243
+P E QN PT + S +H S LH EI SF K + E++ R N V+R+
Sbjct: 227 KPFEDQN-PTPWMDSSGPSTLHLGSRLHNEILSFYHWVKPKSFEHSIR----NDLVERLQ 281
Query: 1244 RSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KE 1301
R + S+ FGS A+GL LP++D+DLV+C + + R + +
Sbjct: 282 RHFERRHYGSQLCAFGSFASGLYLPTADMDLVLC----------SRQFVRQDRKVLCQRP 331
Query: 1302 TCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHT 1361
+ A YL + + S++T+ + +PII V + T
Sbjct: 332 REIHSFAAYLKDLDIAVPGSVETIAHAKVPIIKFVDRL---------------------T 370
Query: 1362 TLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKEL 1421
LK D ++ D+S TG+ +
Sbjct: 371 GLKVD-------LSFDNS------------------------------TGIAANKTFQVW 393
Query: 1422 TEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QFPA + V+KQFL R L++ +GGL + ++ L+T LQH + G
Sbjct: 394 KSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGFSIICLVTSLLQHLPYSG 444
>gi|219518895|gb|AAI43707.1| PAPD5 protein [Homo sapiens]
Length = 277
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 38/58 (65%)
Query: 1409 HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
G++ DL+K+ T+++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 1 QNGVRAADLIKDFTKKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 58
>gi|335304248|ref|XP_003134203.2| PREDICTED: DNA polymerase sigma [Sus scrofa]
Length = 625
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T + T V++DISF TG++ + +K +
Sbjct: 153 KHKVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 205
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 206 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 248
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 59/133 (44%), Gaps = 40/133 (30%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 73 LHEEIIDFYNFMSPCPEEAAMRRE-------VVKRIETVVKDLWPTAEVQIFGSFSTGLY 125
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 126 LPTSDIDLVVFGKWERPPLQLLEQALR------KHKVAEPC-----------------SI 162
Query: 1323 KTVENTAIPIIML 1335
K ++ +PII L
Sbjct: 163 KVLDKATVPIIKL 175
>gi|395857085|ref|XP_003800943.1| PREDICTED: DNA polymerase sigma [Otolemur garnettii]
Length = 837
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 55/103 (53%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH+ + LD + P T + T V++DISF TG++ + +K +
Sbjct: 365 KHNVAEPCSIKVLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMK 417
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 418 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 460
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 40/134 (29%)
Query: 1213 LHEEIDSF------CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
LHEEI F C + AA VKR+ ++ LWP + IFGS +TGL
Sbjct: 285 LHEEIIDFYNFMSPCPEEAAMR-------REVVKRIETVVKDLWPTADVQIFGSFSTGLY 337
Query: 1267 LPSSDVDLVVC----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL 1322
LP+SD+DLVV PP++ LE ++ + E C S+
Sbjct: 338 LPTSDIDLVVFGKWERPPLQLLEQALR------KHNVAEPC-----------------SI 374
Query: 1323 KTVENTAIPIIMLV 1336
K ++ +PII L
Sbjct: 375 KVLDKATVPIIKLT 388
>gi|351706462|gb|EHB09381.1| DNA polymerase sigma [Heterocephalus glaber]
Length = 542
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 51/92 (55%), Gaps = 7/92 (7%)
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
LD + P T + T V++DISF TG++ + +K +++ L LV
Sbjct: 81 VLDKATVPIIKLTDQE-----TEVKVDISFNM--ETGVRAAEFIKNYMKEYSLLPYLILV 133
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 134 LKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 165
Score = 47.8 bits (112), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 49/102 (48%), Gaps = 27/102 (26%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVC----LPPVRNLEPIKEAGIL 1293
V+R+ ++ LWP + IFGS +TGL LP+SD+DLVV PP++ LE
Sbjct: 14 VVQRIETVVKDLWPTADVQIFGSFSTGLYLPTSDIDLVVFGEWERPPLQLLEQALR---- 69
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIML 1335
++ + E C S+K ++ +PII L
Sbjct: 70 --KHSVAEPC-----------------SIKVLDKATVPIIKL 92
>gi|258575851|ref|XP_002542107.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902373|gb|EEP76774.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 675
Score = 59.7 bits (143), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 104/263 (39%), Gaps = 77/263 (29%)
Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI F K E+ R I R+ +Q +P S+ + FGS A+GL LP
Sbjct: 245 LHDEIIGFYHWVKPKPFEDVIRTDLIT----RLEMHMQRRFPGSQLHAFGSYASGLYLPV 300
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVEN 1327
+DVDLV+ + + + +G+ + K + + Y+ + E S++T+ +
Sbjct: 301 ADVDLVL----------LSRSFLRQGKKFLCQKPKDIWSLSAYIKDTEIAVPGSIETIAH 350
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII V + T LK D ++ D+S
Sbjct: 351 ARVPIIKFVDRL---------------------TGLKVD-------LSFDNS-------- 374
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
+GL + QFPA + V+KQFL R L++
Sbjct: 375 ----------------------SGLTANRTFQIWKTQFPAMPLIVSVIKQFLLLRGLNEV 412
Query: 1448 YSGGLSSYCLMLLITRFLQHEHH 1470
+GGL + ++ L+T LQH H
Sbjct: 413 PTGGLGGFSIICLVTSLLQHLPH 435
>gi|219110415|ref|XP_002176959.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411494|gb|EEC51422.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1606
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 114/267 (42%), Gaps = 76/267 (28%)
Query: 1245 SLQVLWPRS-RTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEA----GILEGR--- 1296
S+ LWP S + ++GS AT L LPSSD+D+VV R ++ G L+G+
Sbjct: 904 SVAKLWPASCQVELYGSCATHLDLPSSDLDVVVVGLDRRRGTLMQVGTQNNGCLDGKMES 963
Query: 1297 ------NGIKETCL-------------------QHAARYLANQEWVKSDSLKTVENTAIP 1331
+ +++ L + A LA Q W +K + ++P
Sbjct: 964 KSDFDVDDVRQKILNASLPPYIPTSTSLNAERIKRLAAELATQPWAVQ--VKAIPTASVP 1021
Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
++ ++ + P+ L ++ + K D H +A + S K S SSD+
Sbjct: 1022 VVKVLAD-PYRL------------QNVSGNDWKLDKH---PKIAAETS---KVSQVSSDS 1062
Query: 1392 IKAATS------------VRLDISFKSPSHTGLQTTDL-VKELTEQF------PASTPLA 1432
+ + + DI+F+ P H G+ +T+ ++ + E P TP
Sbjct: 1063 ARLQSFQPWRGADAMNGLLSFDITFEGPEHGGIGSTEFSIRTVNEACRETGLPPEGTPFV 1122
Query: 1433 ---LVLKQFLADRSLDQSYSGGLSSYC 1456
+VLK+ LA R L++ YSGGLSSY
Sbjct: 1123 QVIMVLKELLAQRKLNEPYSGGLSSYA 1149
>gi|345313578|ref|XP_001519035.2| PREDICTED: DNA polymerase sigma-like, partial [Ornithorhynchus
anatinus]
Length = 641
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 54/103 (52%), Gaps = 7/103 (6%)
Query: 1364 KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTE 1423
KH + LD + P T + T V++DISF TG++ +K+ +
Sbjct: 170 KHSVAEPKTIKVLDKATVPIIKLTDQE-----TDVKVDISFNM--ETGVKAARFIKDYMK 222
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 223 KYSLLPYLILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 265
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 48/87 (55%), Gaps = 11/87 (12%)
Query: 1206 LSLMHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNA 1262
LSL LHEEI F ++ E R+ VKR+ ++ LWP + IFGS +
Sbjct: 83 LSLCSLKLHEEIIDFYNFMSPCPEEGIMRRE----VVKRIETVIKDLWPTADVQIFGSFS 138
Query: 1263 TGLSLPSSDVDLVVC----LPPVRNLE 1285
TGL LP+SD+DLVV PP++ LE
Sbjct: 139 TGLYLPTSDIDLVVFGKWERPPLQLLE 165
>gi|145491624|ref|XP_001431811.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398917|emb|CAK64413.1| unnamed protein product [Paramecium tetraurelia]
Length = 985
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 94/215 (43%), Gaps = 49/215 (22%)
Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
IFGS+ TGL+L SDVD+VV PV + + E + E Q +W
Sbjct: 680 IFGSSRTGLALHDSDVDMVVFGLPV-----FTKPQLFEPMRKLLEVFCQ--------MKW 726
Query: 1317 VKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL 1376
+ S K + +IP+I + ++ + + + + + H N ++
Sbjct: 727 --AISYKHIFQASIPLIKITIDPSIGFLEFVGNPL--------YYVMNHRNIEQMNLKIG 776
Query: 1377 DDSASPKCSHTSSDNIKAATSVRLDISFKSPS----------HTGLQTTDLVKELTEQFP 1426
+ S + +++DI+F+ H GL +TD +++
Sbjct: 777 EQSQN----------------IQIDITFELQQPYPMYGYQGYHVGLLSTDYQRQVGISVK 820
Query: 1427 ASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
T +A+VLK+ L R L+ SY+GG+SS+CL +++
Sbjct: 821 GFTEVAIVLKKILKQRGLNDSYTGGVSSFCLTIML 855
>gi|170084663|ref|XP_001873555.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651107|gb|EDR15347.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 300
Score = 59.3 bits (142), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
SV +DI + GLQT DLVK EQ PA PL LV K+FL R L+ + GL SY
Sbjct: 11 SVAIDIGINNVD--GLQTRDLVKSYLEQMPALRPLILVFKRFLEQRELNNASKSGLGSYA 68
Query: 1457 LMLLITRFLQ 1466
L+ FLQ
Sbjct: 69 ATLMCINFLQ 78
>gi|397613693|gb|EJK62373.1| hypothetical protein THAOC_17017 [Thalassiosira oceanica]
Length = 458
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
++ +D+ F P+ G ++ DL+ E P PL LK FL+ R +++ ++GG+ SY
Sbjct: 134 NIDVDVCFDQPN--GPESADLMHRFMESMPPLRPLTFALKYFLSSRDINKPFTGGIGSYL 191
Query: 1457 LMLLITRFLQH 1467
L L+I FLQH
Sbjct: 192 LQLMIVSFLQH 202
>gi|294883728|ref|XP_002771045.1| hypothetical protein Pmar_PMAR026020 [Perkinsus marinus ATCC 50983]
gi|239874251|gb|EER02861.1| hypothetical protein Pmar_PMAR026020 [Perkinsus marinus ATCC 50983]
Length = 651
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 42/72 (58%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+SV++ IS P+H+G++T+ V L + + PL LV+K L L Y GG+SSY
Sbjct: 383 SSVKVQISVTGPAHSGIETSAYVAMLCQHYDCLRPLVLVIKHVLVCDGLVTPYDGGMSSY 442
Query: 1456 CLMLLITRFLQH 1467
L+L++ FL
Sbjct: 443 TLVLMVVSFLNQ 454
>gi|294945352|ref|XP_002784637.1| hypothetical protein Pmar_PMAR021029 [Perkinsus marinus ATCC 50983]
gi|239897822|gb|EER16433.1| hypothetical protein Pmar_PMAR021029 [Perkinsus marinus ATCC 50983]
Length = 681
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 42/70 (60%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+SV++ IS P+H+G++T+ V L + + PL LV+K L L Y GG+SSY
Sbjct: 413 SSVKVQISVTGPAHSGIETSAYVAMLCQHYDCLRPLVLVIKHVLVCDGLVTPYDGGMSSY 472
Query: 1456 CLMLLITRFL 1465
L+L++ FL
Sbjct: 473 TLVLMVVSFL 482
>gi|410905163|ref|XP_003966061.1| PREDICTED: DNA polymerase sigma-like [Takifugu rubripes]
Length = 778
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 47/80 (58%), Gaps = 11/80 (13%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F ++ E R+ +N R+ R ++ LWP +R IFGS +TGL LP+
Sbjct: 237 LHEEIMDFFNFISPRPEEEAMRRDVVN----RIERVIKDLWPTARVEIFGSFSTGLYLPT 292
Query: 1270 SDVDLVVC----LPPVRNLE 1285
SD+DLVV PP++ LE
Sbjct: 293 SDIDLVVFGKWDHPPLQELE 312
Score = 57.8 bits (138), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++DISF T ++ +K +++ PL VLKQFL R L++ ++GG+ SY
Sbjct: 344 TEVKVDISFNV--ETAVKAAQFIKSYLKKYTVLPPLIFVLKQFLLQRDLNEVFTGGIGSY 401
Query: 1456 CLMLLITRFLQ 1466
L+L+ FLQ
Sbjct: 402 SLILMAISFLQ 412
>gi|344305107|gb|EGW35339.1| hypothetical protein SPAPADRAFT_48344 [Spathaspora passalidarum NRRL
Y-27907]
Length = 615
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 90/231 (38%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N V R+ + WP + ++FGS AT L LP SD+D+VV I E G E
Sbjct: 206 NTVVNRLKTQIAKFWPGTEAHVFGSCATDLYLPGSDIDMVV----------ISETGDYEN 255
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + +L +++ K +++ + N +PII V P
Sbjct: 256 R-----SRLYQLSSFLRSKKLAK--NVEVIANAKVPIIKFV----------------DP- 291
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
++ +H D+ S ++ I AA +R L TT
Sbjct: 292 ----------ESEIHIDV-----------SFERTNGIDAAKRIR----------KWLITT 320
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
P L L++KQFL R L+ + GGL Y +++ FL+
Sbjct: 321 ----------PGLRELVLIVKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 361
>gi|401837753|gb|EJT41641.1| PAP2-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 592
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 87/234 (37%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +L+ LWP + ++FGS +T L LP SD+D VV E G E
Sbjct: 203 NQTISTIREALKQLWPDADLHVFGSYSTDLYLPGSDIDCVVN----------SELGGKES 252
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + LA + ++ V +PII V PH
Sbjct: 253 RNNLYSLASHLKKNNLATE-------IEVVAKARVPIIKFV--EPH-------------- 289
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D +SF+ + GL+
Sbjct: 290 -----------SRIHID-----------------------------VSFERTN--GLEAA 307
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++E P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 308 KLIREWLNDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 360
>gi|410077415|ref|XP_003956289.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
gi|372462873|emb|CCF57154.1| hypothetical protein KAFR_0C01610 [Kazachstania africana CBS 2517]
Length = 537
Score = 58.5 bits (140), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 91/234 (38%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + R+ +++ LWP + ++FGS +T L LP SD+D VV E G +
Sbjct: 167 NITISRIRDAVKELWPDADLHVFGSYSTDLYLPGSDIDCVVN----------SERGNKDS 216
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
+N CL A++L ++ + ++ V +PII V PH
Sbjct: 217 KN-----CLYQLAKFLTTKKL--ATDVEVVSKARVPIIKFV--EPH-------------- 253
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+H D +SF+ + GL+
Sbjct: 254 -----------TGIHID-----------------------------VSFERTN--GLEAA 271
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++ + L LV+KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 272 KLIRSWLDSTAGLRELVLVIKQFLHARRLNNVHTGGLGGFSIICLVFTFL-HMH 324
>gi|344230457|gb|EGV62342.1| hypothetical protein CANTEDRAFT_126141 [Candida tenuis ATCC 10573]
Length = 615
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 94/231 (40%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N VK + + ++V WP + ++FGS AT L LP SD+D+VV + + G E
Sbjct: 202 NSVVKTLKQQIKVCWPDAEAHVFGSFATDLYLPGSDIDMVV----------VSKNGDCEN 251
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R+ L + +L +++ K ++ + +PII V PK
Sbjct: 252 RHK-----LYQLSSFLRSKKLAK--DIEVIAGAKVPIIKFV----------------DPK 288
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
++H D+ S ++ + AA +R L+TT
Sbjct: 289 -----------TNIHLDI-----------SFERTNGLDAARRIR----------KWLETT 316
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++E L LV+KQFL R L+ + GGL Y ++L F++
Sbjct: 317 AGLRE----------LVLVVKQFLRSRKLNNVHVGGLGGYATIILCYHFIK 357
>gi|145513903|ref|XP_001442862.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124410223|emb|CAK75465.1| unnamed protein product [Paramecium tetraurelia]
Length = 627
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 70/314 (22%), Positives = 120/314 (38%), Gaps = 87/314 (27%)
Query: 1194 LEVQNCPTRKA-------------SLSLMHSLLHEEIDSFCKQVAAENTA-----RKPY- 1234
LEVQN P ++ + +LM L ++ F + +N R Y
Sbjct: 235 LEVQNSPFKQTVKEDQIDQHIMNQARALMIKKLDMDMREFNDIIQGQNQQILKLRRLIYD 294
Query: 1235 -----INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKE 1289
IN+ + + L S+ +FGS ATGL+LP SD+D+
Sbjct: 295 RLQFVINYLFRGFIHHISQLDFNSQVCLFGSCATGLALPESDIDI--------------- 339
Query: 1290 AGILEGRNGIK--ETC-----LQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHD 1342
G G + TC +Q +L WV +S+ + ++ +P+I L V+ P
Sbjct: 340 -----GITGFEMCSTCQLYVPIQKLTEFLQRMRWV--NSIVAITSSTMPLIKLQVD-PTI 391
Query: 1343 LIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDI 1402
+ + P D L D ++ + ++D I
Sbjct: 392 SFVQSTLPIGLPYIDLI---LNPDENITRQIFSVD------------------------I 424
Query: 1403 SF------KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
SF K H GL +T+L + + + L+ K L R L+ + GG+SS+C
Sbjct: 425 SFFQYSGPKQNQHLGLISTELTLQWLSFYSELRSIVLLFKSLLKKRGLNDQFKGGMSSFC 484
Query: 1457 LMLLITRFLQHEHH 1470
++ ++ FL+ +H
Sbjct: 485 IIQMVLAFLECFYH 498
>gi|365758533|gb|EHN00370.1| Pap2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 514
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 87/234 (37%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +L+ LWP + ++FGS +T L LP SD+D VV E G E
Sbjct: 203 NQTISTIREALKQLWPDADLHVFGSYSTDLYLPGSDIDCVVN----------SELGGKES 252
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + LA + ++ V +PII V PH
Sbjct: 253 RNNLYSLASHLKKNNLATE-------IEVVAKARVPIIKFV--EPH-------------- 289
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D +SF+ + GL+
Sbjct: 290 -----------SRIHID-----------------------------VSFERTN--GLEAA 307
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++E P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 308 KLIREWLNDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 360
>gi|308499447|ref|XP_003111909.1| CRE-GLD-4 protein [Caenorhabditis remanei]
gi|308268390|gb|EFP12343.1| CRE-GLD-4 protein [Caenorhabditis remanei]
Length = 907
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 36/47 (76%)
Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ E+FP PL L+LKQFL R+L+Q+++GGLSSY L+LL+ F Q
Sbjct: 230 QVKEEFPLIEPLVLLLKQFLHYRNLNQTFTGGLSSYGLVLLLVNFFQ 276
>gi|432908932|ref|XP_004078069.1| PREDICTED: uncharacterized protein LOC101163355 [Oryzias latipes]
Length = 779
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 43/71 (60%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V++DISF T ++ +K +++ PL VLKQFL R L++ ++GG+SSY
Sbjct: 324 SKVKVDISFNV--ETAVKAAQFIKSYLKKYAVLPPLIFVLKQFLLQRDLNEVFTGGISSY 381
Query: 1456 CLMLLITRFLQ 1466
L+L+ FLQ
Sbjct: 382 SLILMAISFLQ 392
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 46/80 (57%), Gaps = 11/80 (13%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F ++ E R+ +N ++ ++ LWP +R IFGS +TGL LP+
Sbjct: 217 LHEEILDFFHFMSPKPEEEAMRRDVVN----KIEGVIKDLWPTARVEIFGSFSTGLYLPT 272
Query: 1270 SDVDLVVC----LPPVRNLE 1285
SD+DLVV PP++ LE
Sbjct: 273 SDIDLVVFGKWEHPPLQELE 292
>gi|260948920|ref|XP_002618757.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
gi|238848629|gb|EEQ38093.1| hypothetical protein CLUG_02216 [Clavispora lusitaniae ATCC 42720]
Length = 567
Score = 58.2 bits (139), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 89/231 (38%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N ++R+ R + WP+++ ++FGS AT L LP SD+D+VV I G E
Sbjct: 180 NTVIRRLKRRIAEFWPQTQAHVFGSCATDLYLPGSDIDMVV----------ISTTGDYEQ 229
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R L + +L + K +++ + +PII V P+
Sbjct: 230 RGK-----LYQLSSFLRTNKLAK--NIEVIATAKVPIIKFV----------------DPQ 266
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ + AA +R
Sbjct: 267 YN-----------IHVDI-----------SFERTNGLDAARRIR---------------- 288
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ + P L L++KQFL R L+ + GGL Y ++L+ FL+
Sbjct: 289 ----KWLDSMPGLRELVLIVKQFLRSRRLNNVHVGGLGGYATIILMYHFLR 335
>gi|342182382|emb|CCC91860.1| putative DNA polymerase sigma [Trypanosoma congolense IL3000]
Length = 403
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 42/68 (61%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
LD+ + G + T +V +L +++P + PL +V+K FL R +D+ Y GGL S+ + L
Sbjct: 92 LDVDISINAEDGQRNTSIVIDLLKRYPEARPLTVVVKYFLLQRGMDEPYHGGLGSFAVTL 151
Query: 1460 LITRFLQH 1467
L+ FLQ+
Sbjct: 152 LVISFLQN 159
>gi|146419896|ref|XP_001485907.1| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC 6260]
Length = 588
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 91/235 (38%), Gaps = 83/235 (35%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + R+ ++ WP + ++FGS+AT L LP SD+D+VV I G E
Sbjct: 190 NNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVV----------ISRDGDREQ 239
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R+ + + ++ LA +++ + +PI+ V P
Sbjct: 240 RSRLYQLSTHLRSKKLAK-------NIEVIAKAKVPIVKFV----------------DP- 275
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
D+++H D+ S S+ I AA +R
Sbjct: 276 ----------DSNIHIDV-----------SFERSNGIDAAIKIR---------------- 298
Query: 1416 DLVKELTEQFPASTP----LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ ASTP L LV+KQFL R L+ + GGL Y ++L FL+
Sbjct: 299 --------EWLASTPGLRELVLVVKQFLRSRRLNNVHVGGLGGYSTIILCYHFLK 345
>gi|190345571|gb|EDK37480.2| hypothetical protein PGUG_01578 [Meyerozyma guilliermondii ATCC 6260]
Length = 588
Score = 57.8 bits (138), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 91/235 (38%), Gaps = 83/235 (35%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + R+ ++ WP + ++FGS+AT L LP SD+D+VV I G E
Sbjct: 190 NNVIGRLKSTITKFWPDTEVHVFGSSATDLYLPGSDIDMVV----------ISRDGDREQ 239
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R+ + + ++ LA +++ + +PI+ V P
Sbjct: 240 RSRLYQLSTHLRSKKLAK-------NIEVIAKAKVPIVKFV----------------DP- 275
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
D+++H D+ S S+ I AA +R
Sbjct: 276 ----------DSNIHIDV-----------SFERSNGIDAAIKIR---------------- 298
Query: 1416 DLVKELTEQFPASTP----LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ ASTP L LV+KQFL R L+ + GGL Y ++L FL+
Sbjct: 299 --------EWLASTPGLRELVLVVKQFLRSRRLNNVHVGGLGGYSTIILCYHFLK 345
>gi|359493669|ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing protein 5-like [Vitis
vinifera]
gi|302143015|emb|CBI20310.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 57.4 bits (137), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 10/89 (11%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ +SV DISF G + + +++ ++P PL L+LK FL R L++ YSGG
Sbjct: 209 IEKRSSVAFDISFDV--ENGPKAAEYIQDAISKWPPLRPLCLILKVFLQQRELNEVYSGG 266
Query: 1452 LSSYCLMLLITRFLQH--------EHHLG 1472
+ SY L+ ++ LQ+ EH+LG
Sbjct: 267 IGSYALLAMLIAMLQNLQEWNASVEHNLG 295
>gi|268637610|ref|XP_002649102.1| PAP/25A-associated domain-containing protein [Dictyostelium
discoideum AX4]
gi|256012839|gb|EEU04050.1| PAP/25A-associated domain-containing protein [Dictyostelium
discoideum AX4]
Length = 938
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 45/67 (67%), Gaps = 2/67 (2%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DI F +P+ G++ T +VK L +Q+ + L L++K FL +L+++Y+GG+ SY L L
Sbjct: 565 IDICFDTPN--GIENTAVVKGLLKQYKSMRTLLLIIKFFLHQNNLNETYTGGIGSYALAL 622
Query: 1460 LITRFLQ 1466
++ F+Q
Sbjct: 623 MVVSFIQ 629
>gi|150951520|ref|XP_001387852.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
gi|149388662|gb|EAZ63829.2| topoisomerase I-related protein [Scheffersomyces stipitis CBS 6054]
Length = 605
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 55/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ S+ WP + T++FGS+AT L LP SD+D+V+ + G E
Sbjct: 200 NRLINKLKSSISSYWPETETHVFGSSATDLYLPGSDIDIVI----------VSRTGDYEN 249
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + YL ++ K +++ + +PII V P+
Sbjct: 250 R-----SRLYQLSSYLRHKGLAK--NMEVIAKAKVPIIKFV----------------DPE 286
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ V +D S + I+AA +R + TT
Sbjct: 287 SN----------------VNID------VSFERRNGIEAAKKIR----------RWMTTT 314
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
++EL L++KQFL+ R L+ +SGGL Y ++L FL
Sbjct: 315 PGLREL----------VLIIKQFLSSRRLNNVHSGGLGGYATIILCYHFL 354
>gi|307102621|gb|EFN50891.1| hypothetical protein CHLNCDRAFT_37639 [Chlorella variabilis]
Length = 509
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLS 1453
A + DISF G + + V+ L + P PL +VLK FL R L++ YSGGL
Sbjct: 264 AESGYNFDISFDV--ANGPEAAENVRALMDALPPMRPLVMVLKVFLQQRELNEVYSGGLG 321
Query: 1454 SYCLMLLITRFLQ 1466
SY L++++ FLQ
Sbjct: 322 SYALLVMVATFLQ 334
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 42/66 (63%), Gaps = 5/66 (7%)
Query: 1213 LHEEIDSFCKQVA--AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH+EI FC+ +A AE A++ A+ R+ + +WP++R FGS ATGL LP+S
Sbjct: 162 LHQEIVEFCRYLAPSAEEQAQR---QAAMDRIEAVVTSIWPKARLQAFGSFATGLYLPTS 218
Query: 1271 DVDLVV 1276
DVD VV
Sbjct: 219 DVDAVV 224
>gi|255072677|ref|XP_002500013.1| predicted protein [Micromonas sp. RCC299]
gi|226515275|gb|ACO61271.1| predicted protein, partial [Micromonas sp. RCC299]
Length = 292
Score = 57.0 bits (136), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/261 (22%), Positives = 102/261 (39%), Gaps = 91/261 (34%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI FC+ + A E++AR AV+RV ++ +WP +R + GS ATG+ LP+
Sbjct: 4 LHHEIVDFCRYLEPTAEESSARTA----AVERVRGAVLSIWPGARFEVHGSFATGMYLPN 59
Query: 1270 SDVDLVV----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
SD+D V+ C P TCL+ A L+ + + ++ +
Sbjct: 60 SDIDAVILGSGCKSPA--------------------TCLKALALSLSRKGMAR--KIQLI 97
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+PI+ K + + + D V+ D + P +
Sbjct: 98 AKARVPIV---------------------KFEERPSGFQFD-------VSFDVANGPASA 129
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
N++ ++R P +T L K FL R+L+
Sbjct: 130 EIVRANMRRFPALR--------------------------PLTTVL----KAFLQQRALN 159
Query: 1446 QSYSGGLSSYCLMLLITRFLQ 1466
+ Y+GG+ SY L+ ++ LQ
Sbjct: 160 EVYTGGVGSYALLCMVMAHLQ 180
>gi|406604992|emb|CCH43591.1| Poly(A) RNA polymerase protein 1 [Wickerhamomyces ciferrii]
Length = 624
Score = 57.0 bits (136), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 51/230 (22%), Positives = 90/230 (39%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N V+++ ++ LWP ++FGS AT L LP SD+D+V+ + E G E
Sbjct: 234 NNTVRKLREAIMELWPDCEVHVFGSYATDLYLPGSDIDMVI----------VSEHGGYES 283
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN L + +L + K +++ + +PII
Sbjct: 284 RN-----SLYSLSSFLKRKNLAK--NVEVIAKAKVPIIKFT------------------- 317
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +++H D+ S ++ I AA ++R I+
Sbjct: 318 --------ESTSNIHIDV-----------SFERTNGIDAAKTIRSWIT------------ 346
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
+ P + L++KQFL+ R L+ + GGL Y ++ L+ FL
Sbjct: 347 --------ETPGLREIVLIVKQFLSSRKLNNVHVGGLGGYSIICLVYSFL 388
>gi|348677916|gb|EGZ17733.1| hypothetical protein PHYSODRAFT_503192 [Phytophthora sojae]
Length = 269
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 1398 VRLDISF-KSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
V LD++ S HTGL DL+ + PA PL L+LK L L ++ GG+SSY
Sbjct: 77 VLLDLTCGHSVGHTGLGARDLIYSFQAEMPALRPLVLILKSHLLRNDLKCAFIGGISSYV 136
Query: 1457 LMLLITRFLQ 1466
L++L+ RFLQ
Sbjct: 137 LVILVIRFLQ 146
>gi|164656242|ref|XP_001729249.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
gi|159103139|gb|EDP42035.1| hypothetical protein MGL_3716 [Malassezia globosa CBS 7966]
Length = 527
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DIS + GLQ + V ++ PA PL +V+KQFL R+L + Y+GGL SY + L
Sbjct: 92 VDISINQAN--GLQASKFVNGWLKKQPAIRPLVMVIKQFLQQRALSEVYTGGLGSYSVTL 149
Query: 1460 LITRFLQHEHHLGR 1473
++ FLQ L R
Sbjct: 150 MVLSFLQLHPKLQR 163
>gi|147800856|emb|CAN64475.1| hypothetical protein VITISV_017481 [Vitis vinifera]
Length = 493
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 10/89 (11%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ +SV DISF G + + +++ ++P PL L+LK FL R L++ YSGG
Sbjct: 199 VEKRSSVAFDISFDV--ENGPKAAEYIQDAISKWPPLRPLCLILKVFLQQRELNEVYSGG 256
Query: 1452 LSSYCLMLLITRFLQH--------EHHLG 1472
+ SY L+ ++ LQ+ EH+LG
Sbjct: 257 IGSYALLAMLIAMLQNLQEWNASVEHNLG 285
>gi|440637467|gb|ELR07386.1| hypothetical protein GMDG_08401 [Geomyces destructans 20631-21]
Length = 753
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I T +R+D+SF++ TGL + + FPA L ++KQFLA R L++ +GG
Sbjct: 534 IDYYTGLRVDVSFEN--DTGLIANKTFQNWKDTFPAMPILVTIVKQFLAMRGLNEPVNGG 591
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ + + L+ LQH EHHLG
Sbjct: 592 IGGFTVTCLVVSLLQHMPQIQSGNMIPEHHLG 623
>gi|349581056|dbj|GAA26214.1| K7_Pap2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 584
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 89/234 (38%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +++ LWP + ++FGS +T L LP SD+D VV E G E
Sbjct: 201 NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 250
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + + LA + ++ V +PII V PH
Sbjct: 251 RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPH-------------- 287
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ I+AA
Sbjct: 288 -----------SGIHIDV-----------SFERTNGIEAAK------------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++E + P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 307 -LIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 358
>gi|6324457|ref|NP_014526.1| non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
S288c]
gi|1717744|sp|P53632.1|PAP2_YEAST RecName: Full=Poly(A) RNA polymerase protein 2; AltName: Full=DNA
polymerase kappa; AltName: Full=DNA polymerase sigma;
AltName: Full=Topoisomerase 1-related protein TRF4
gi|663237|emb|CAA88145.1| ORF [Saccharomyces cerevisiae]
gi|950226|gb|AAC49091.1| Trf4p [Saccharomyces cerevisiae]
gi|1419987|emb|CAA99134.1| TRF4 [Saccharomyces cerevisiae]
gi|51830518|gb|AAU09782.1| YOL115W [Saccharomyces cerevisiae]
gi|285814775|tpg|DAA10668.1| TPA: non-canonical poly(A) polymerase PAP2 [Saccharomyces cerevisiae
S288c]
gi|392296670|gb|EIW07772.1| Pap2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 584
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 89/234 (38%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +++ LWP + ++FGS +T L LP SD+D VV E G E
Sbjct: 201 NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 250
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + + LA + ++ V +PII V PH
Sbjct: 251 RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPH-------------- 287
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ I+AA
Sbjct: 288 -----------SGIHIDV-----------SFERTNGIEAAK------------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++E + P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 307 -LIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 358
>gi|453083258|gb|EMF11304.1| hypothetical protein SEPMUDRAFT_47828 [Mycosphaerella populorum
SO2202]
Length = 672
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 47/76 (61%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF++ S GL+ +E +++P L +++KQFL R L++ + GG
Sbjct: 345 VDATTGIKVDVSFENDS--GLRAIPTFQEWKQRYPQMPVLVVIIKQFLTMRGLNEVFLGG 402
Query: 1452 LSSYCLMLLITRFLQH 1467
+ + ++ L+T LQH
Sbjct: 403 IGGFTIICLVTSMLQH 418
>gi|196004468|ref|XP_002112101.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
gi|190586000|gb|EDV26068.1| hypothetical protein TRIADDRAFT_23436 [Trichoplax adhaerens]
Length = 289
Score = 56.6 bits (135), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 52/93 (55%), Gaps = 6/93 (6%)
Query: 1378 DSASPKCSHTSSDNIKAAT----SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLAL 1433
D + KC H + I T + ++DI F + +++ D+++ ++ L L
Sbjct: 95 DRDNIKCIHNAMVPIIKLTEKTCNYKMDIEFNIEN--SVKSADIIQTYIRKYEPLKYLVL 152
Query: 1434 VLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
VLKQFL R L++ +SGG+SSY L++++ FLQ
Sbjct: 153 VLKQFLFQRELNEVFSGGVSSYTLVMMVVNFLQ 185
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 60/126 (47%), Gaps = 26/126 (20%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F + ++ E R+ V+ V + LWP + +FGS TGL LP+
Sbjct: 10 LHEEIIDFYQYISPRPEEKNMRET----VVEGVKEVILTLWPHVQVEVFGSFRTGLYLPT 65
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+DLV+ GI +G+ ++ L+ A L E D++K + N
Sbjct: 66 SDIDLVI-------------FGI-DGKGAFED--LEKA---LMQHEVCDRDNIKCIHNAM 106
Query: 1330 IPIIML 1335
+PII L
Sbjct: 107 VPIIKL 112
>gi|294654384|ref|XP_456434.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
gi|199428840|emb|CAG84386.2| DEHA2A02200p [Debaryomyces hansenii CBS767]
Length = 600
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 90/231 (38%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N V ++ + + WP + ++FGS AT L LP SD+D+VV + E G E
Sbjct: 194 NRVVNQLKQQIGQFWPATELHVFGSCATDLYLPGSDIDMVV----------VSETGDYEH 243
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + +L N++ K +++ + +PII V P
Sbjct: 244 R-----SRLYQLSSFLRNKKLAK--NIEVIAKAKVPIIKFV----------------DPT 280
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+++H D+ S ++ I AA +R L +T
Sbjct: 281 -----------SNIHIDI-----------SFERTNGIDAAKKIR----------RWLSST 308
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
P L L++KQFL R L+ + GGL Y ++L FL+
Sbjct: 309 ----------PGLRELVLIVKQFLRSRKLNNVHVGGLGGYSTIILCYHFLK 349
>gi|440795046|gb|ELR16187.1| PAP/25A associated domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 471
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 48/75 (64%), Gaps = 1/75 (1%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + T+ ++DISF + ++ G T +VK+ + PLALVLK +L+ L++ Y+GG
Sbjct: 208 VDSETNCQVDISFATTAN-GAVNTAIVKKFCVEAEEIRPLALVLKYYLSQMGLNEPYTGG 266
Query: 1452 LSSYCLMLLITRFLQ 1466
+ SY L+L+I +LQ
Sbjct: 267 IGSYTLLLMIISYLQ 281
Score = 40.8 bits (94), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%), Gaps = 1/56 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
LHEEI +F Q + K ++R++ + +WP + N+FGS+A+ L LP
Sbjct: 134 LHEEIKAFA-QFMSPTPGEKKMREEVIRRISSVITTVWPYAEVNVFGSSASELYLP 188
>gi|313226931|emb|CBY22076.1| unnamed protein product [Oikopleura dioica]
Length = 424
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I++ T V +DISF + + G + LVK+ ++P L ++LK L R L++ + GG
Sbjct: 193 IESNTLVHVDISFNTSN--GREAAALVKKYMAEYPNLKQLVVLLKYILNHRGLNEVWKGG 250
Query: 1452 LSSYCLMLLITRFLQ 1466
L SY L LL+ FLQ
Sbjct: 251 LGSYALTLLVVNFLQ 265
>gi|345563688|gb|EGX46674.1| hypothetical protein AOL_s00097g578 [Arthrobotrys oligospora ATCC
24927]
Length = 1706
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + +DISF++PS GL + + FP LAL+LKQFLA R +++ ++GGL S+
Sbjct: 1390 TGLSVDISFENPS--GLIAIRTFNQWRQVFPEMPKLALLLKQFLAIRGINEPFNGGLGSF 1447
Query: 1456 CLMLLITRFLQ 1466
L+ +I LQ
Sbjct: 1448 SLICMIVSMLQ 1458
>gi|222619028|gb|EEE55160.1| hypothetical protein OsJ_02969 [Oryza sativa Japonica Group]
Length = 536
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 15/94 (15%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + DISF G Q D +K+ ++FPA L ++LK FL R L++ Y+GG
Sbjct: 204 VERKSEIAFDISFDM--DGGPQAADFIKDYVKKFPALRHLCMILKVFLHQRELNEVYTGG 261
Query: 1452 LSSYCLMLLITRFLQ-------------HEHHLG 1472
+ SY L+ ++ LQ EH+LG
Sbjct: 262 IGSYALLTMLITHLQLIWGGKDILGYRKKEHNLG 295
>gi|367014043|ref|XP_003681521.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
gi|359749182|emb|CCE92310.1| hypothetical protein TDEL_0E00670 [Torulaspora delbrueckii]
Length = 663
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 53/234 (22%), Positives = 89/234 (38%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ +++ LWP + +FGS AT L LP SD+D VV +G
Sbjct: 229 NKTISKIRAAVRELWPDADLQVFGSYATDLYLPGSDIDCVVN---------------SKG 273
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R+ L A +L ++E ++ + +PII V P+
Sbjct: 274 RDKENRNSLYSLASFLKSKELAT--RVEVIAKARVPIIKFV----------------EPQ 315
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S + I + RL
Sbjct: 316 -----------SQIHIDV--------------SFERINGLEAARL--------------- 335
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
++E E+ P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 336 --IREWLEETPGLRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLVYSFL-HLH 386
>gi|159470731|ref|XP_001693510.1| poly(a) polymerase [Chlamydomonas reinhardtii]
gi|158283013|gb|EDP08764.1| poly(a) polymerase [Chlamydomonas reinhardtii]
Length = 517
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/273 (23%), Positives = 106/273 (38%), Gaps = 68/273 (24%)
Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT---NIFGSNATGLS 1266
+S LH I+ FC++V R+ ++ V ++ +WP +R +FGS A GLS
Sbjct: 100 YSPLHYNIEEFCQRVVPTEGERRQR-QEVIEAVRGGVRRVWPGARGVELQVFGSFANGLS 158
Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETC-LQHAARYLANQEWVKSDSLKTV 1325
+SD+DLVV EP + G E + + T L+ A L + + + + +
Sbjct: 159 TWNSDLDLVV----TGIYEPDRMTGGYEINDRGRITAKLRKIAEALNRSKAIDIERQQLI 214
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
IPI+ L + A T+ V++ D + P+ +
Sbjct: 215 PRARIPILKLWTK--------------------ARVTVD---------VSMSDDSGPRAA 245
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
+ +A +P PL LVLK +L L+
Sbjct: 246 RYMAQQCRA------------------------------YPPLKPLVLVLKAYLKACRLN 275
Query: 1446 QSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
+ SGGLSSY L ++ LQ E G ++ +
Sbjct: 276 EVNSGGLSSYSLTNMVIAHLQEELKSGHDVSDL 308
>gi|410075647|ref|XP_003955406.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
gi|372461988|emb|CCF56271.1| hypothetical protein KAFR_0A08370 [Kazachstania africana CBS 2517]
Length = 630
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/234 (23%), Positives = 93/234 (39%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N AV ++ R+++ LW S +FGS AT L LP SD+D VV +G E
Sbjct: 191 NKAVSKLGRAIKELWSDSELLVFGSYATDLYLPGSDIDCVVN----------SASGNKEH 240
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L AR+L + + S++ + +PII +
Sbjct: 241 R-----SYLYELARFLKKKNL--ATSIEVIARARVPIIKFI------------------- 274
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ ++ VH D+ S ++ ++AA
Sbjct: 275 --------EPESGVHIDI-----------SFERTNGVEAAK------------------- 296
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
L++E + P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 297 -LIREWLDMTPGLRELVLIVKQFLTARRLNDVHTGGLGGFSIICLVYSFL-HLH 348
>gi|412993216|emb|CCO16749.1| predicted protein [Bathycoccus prasinos]
Length = 767
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 106/262 (40%), Gaps = 71/262 (27%)
Query: 1213 LHEEIDSFCK---QVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH++I+ F Q AE +AR I V+ + + +++ + FGS ATGL+L
Sbjct: 267 LHDDIERFASENVQNEAETSARDE-IEQTVRETIQKYS--FSKAQVHRFGSGATGLALKD 323
Query: 1270 SDVDLVVC-LPPVRNLEPIKEAGILEGRNGIKETCLQH---AARYLANQEWVKSDSLKTV 1325
+DVDLV+ + P + +K G R+ K+ +QH A+ L N+ K + +
Sbjct: 324 ADVDLVILGVGP----QSVKGGGGGFTRSE-KQLLVQHLRTIAKTLRNKGVCKRAEI--I 376
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+ +PI L DA H K
Sbjct: 377 SSAKVPIAKL---------------------DAYHAETK--------------------- 394
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLD 1445
T ++LD++ + GL ++E +FPA PL LVLK+ L L+
Sbjct: 395 ----------THIKLDLAIGVSN--GLAAAQWIREQVGEFPALKPLVLVLKRLLQIHKLN 442
Query: 1446 QSYSGGLSSYCLMLLITRFLQH 1467
+ +GG Y L+ L+ L+
Sbjct: 443 NAATGGCGGYLLISLVVSHLKQ 464
>gi|218188825|gb|EEC71252.1| hypothetical protein OsI_03224 [Oryza sativa Indica Group]
Length = 571
Score = 55.8 bits (133), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + DISF G Q D +K+ ++FPA L ++LK FL R L++ Y+GG
Sbjct: 236 VERKSEIAFDISFDM--DGGPQAADFIKDYVKKFPALRHLCMILKVFLHQRELNEVYTGG 293
Query: 1452 LSSYCLMLLITRFLQ 1466
+ SY L+ ++ LQ
Sbjct: 294 IGSYALLTMLITHLQ 308
>gi|429327416|gb|AFZ79176.1| nucleotidyltransferase domain containing protein [Babesia equi]
Length = 520
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 4/76 (5%)
Query: 1392 IKAATSVRLDIS-FKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
+ T VRLDIS F+ S+T TT +KE F PL L++K FL R+L +Y G
Sbjct: 234 VDKRTGVRLDISVFQESSNT---TTKFIKEKCSAFKYMQPLILLIKLFLQARNLGDTYFG 290
Query: 1451 GLSSYCLMLLITRFLQ 1466
G+ SY L ++ FLQ
Sbjct: 291 GVGSYLLYCMVLSFLQ 306
>gi|389742365|gb|EIM83552.1| Nucleotidyltransferase [Stereum hirsutum FP-91666 SS1]
Length = 540
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 97/257 (37%), Gaps = 75/257 (29%)
Query: 1213 LHEEI---DSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI S+ AE AR+ + + R W ++ +++GS +T L LP
Sbjct: 52 LHDEIVAHASYILPTPAERLARQALYDQIASVLVRGK---WRNAKLHLYGSISTDLCLPR 108
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+DLV + + +EA ++ L A RYL Q+++ +
Sbjct: 109 SDMDLVF---EIHQIADTQEAC---------KSVLFEAKRYLFEQQYLAEE--------- 147
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+ VV H+ + L+ + P+
Sbjct: 148 ----LFVV-------------------------------AHARVPVLNMVSHPQTG---- 168
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
R ++ G+ T VKE + PA PL L++K L L + S
Sbjct: 169 ---------RFNVDITVNGADGVPTIAPVKEYLKTMPALRPLVLLIKSLLEQHGLHSAQS 219
Query: 1450 GGLSSYCLMLLITRFLQ 1466
GLSSYC++ ++ FLQ
Sbjct: 220 SGLSSYCVICMVISFLQ 236
>gi|50302781|ref|XP_451327.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640458|emb|CAH02915.1| KLLA0A07359p [Kluyveromyces lactis]
Length = 684
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/231 (24%), Positives = 88/231 (38%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N A+ ++ ++ LWP S N FGS AT L LP SD+D VV +G E
Sbjct: 213 NQAIAKLKEAVVELWPDSSLNCFGSYATDLYLPGSDIDCVVR----------SASGDKEN 262
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + + LA Q ++ + +PII V
Sbjct: 263 RNALYSLASFLKRKQLATQ-------VEVIAKARVPIIKFV------------------- 296
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ ++ +H D+ S ++ ++AA +R
Sbjct: 297 --------EPESKIHIDV-----------SFERTNGLEAARVIR---------------- 321
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L EQ P L L++KQFL R L+ ++GGL Y ++ L+ FL+
Sbjct: 322 ---GWLEEQ-PGLRELVLIVKQFLHARRLNNVHTGGLGGYSIICLVYTFLK 368
>gi|47209824|emb|CAF91228.1| unnamed protein product [Tetraodon nigroviridis]
Length = 964
Score = 55.5 bits (132), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 47/80 (58%), Gaps = 11/80 (13%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI F ++ E R+ +N R+ + ++ LWP ++ IFGS +TGL LP+
Sbjct: 450 LHEEIMDFFNFISPRPEEEAMRRDVVN----RIEKVIKDLWPTAQVEIFGSFSTGLYLPT 505
Query: 1270 SDVDLVVC----LPPVRNLE 1285
SD+DLVV PP++ LE
Sbjct: 506 SDIDLVVFGKWDHPPLQELE 525
>gi|84995372|ref|XP_952408.1| topoisomerase-related nucleotidyltransferase [Theileria annulata
strain Ankara]
gi|65302569|emb|CAI74676.1| topoisomerase-related nucleotidyltransferase, putative [Theileria
annulata]
Length = 532
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T VRLD+S + S +TT ++E+ ++ PL L++K FL RSL +Y GG
Sbjct: 233 VDKRTGVRLDLSVYNDS--AFKTTKFIQEMCLKYKYMQPLILLIKLFLQSRSLGDTYFGG 290
Query: 1452 LSSYCLMLLITRFLQ 1466
+ SY L ++ FLQ
Sbjct: 291 VGSYLLYCMVLSFLQ 305
>gi|340055136|emb|CCC49447.1| putative DNA polymerase sigma [Trypanosoma vivax Y486]
Length = 744
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
LD+ + G + T +V +L Q+P + L +V+K FL R +D+ Y GGL S+ L
Sbjct: 429 LDVDVSINAVDGSRNTSIVVDLLRQYPEARSLIVVVKYFLQQRGMDEPYHGGLGSFATTL 488
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + RP
Sbjct: 489 LVISFLQQHPIYTNRP 504
>gi|149244754|ref|XP_001526920.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449314|gb|EDK43570.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 664
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 87/231 (37%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N V + + + WP + ++FGS+AT L LP SD+D+VV + + G E
Sbjct: 245 NKVVNTLKTQIALFWPGTEAHVFGSSATDLYLPGSDIDMVV----------LSDTGDYEN 294
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R+ + + A+ LA +++ + + +PII V P
Sbjct: 295 RSRLYQLSSFLKAKKLAT-------NVEVIASAKVPIIKFV----------------DP- 330
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
D+++H D ISF+ + GL
Sbjct: 331 ----------DSNIHVD-----------------------------ISFERKN--GLDAA 349
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++ P L LV+KQFL R L+ + GGL Y +++ FL+
Sbjct: 350 RRIRRWLASTPGLRELVLVVKQFLRSRKLNNVHVGGLGGYATIIICYHFLR 400
>gi|302834665|ref|XP_002948895.1| hypothetical protein VOLCADRAFT_104084 [Volvox carteri f.
nagariensis]
gi|300266086|gb|EFJ50275.1| hypothetical protein VOLCADRAFT_104084 [Volvox carteri f.
nagariensis]
Length = 1052
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 104/272 (38%), Gaps = 66/272 (24%)
Query: 1210 HSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT---NIFGSNATGLS 1266
+S LH I+ FC +V ++ + + + ++ +WP SR +FGS A GLS
Sbjct: 651 YSPLHYNIEDFCTKVVPTEGEKRQRME-VIDAIRAGVRKVWPNSRQVELQVFGSFANGLS 709
Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
SSD+DLVV +EP + +G E L A+ A + +D+L +
Sbjct: 710 TWSSDLDLVV----TGVMEPDRVSGGYE---------LADRAKITARLRKI-ADALNRAK 755
Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
N I L+ P+ L + V D+ DDS P+ +
Sbjct: 756 NIDILRQQLI-----------------PRARIPILKLWTKSRVCVDVSVSDDSG-PRAAR 797
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
+A FP PL LV+K +L L++
Sbjct: 798 YMVQQCRA------------------------------FPPVKPLVLVVKTYLKACRLNE 827
Query: 1447 SYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
+GGLSSY L ++ LQ E G I+ +
Sbjct: 828 VNTGGLSSYSLTNMVIAHLQEELKSGHDISDL 859
>gi|253741392|gb|EES98263.1| Topoisomerase I-related protein [Giardia intestinalis ATCC 50581]
Length = 524
Score = 55.1 bits (131), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + S+ DISF SP G + + E+ P + L +V+K FL R L+Q Y+GG
Sbjct: 203 VHSELSLEYDISFNSPH--GAAHRESILNYIEKHPLARVLIMVVKSFLKKRGLNQPYTGG 260
Query: 1452 LSSYCLMLLITRFLQHEHHL 1471
+SSY L+ LI ++ H++
Sbjct: 261 MSSYILLQLIVVYISERHNV 280
>gi|303277243|ref|XP_003057915.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460572|gb|EEH57866.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 658
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 44/74 (59%), Gaps = 2/74 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
+A + + DISF + G + ++V+ +FPA PL VLK FL R+L++ Y+GG+
Sbjct: 244 EAPSGFQFDISFDVAN--GPASAEIVRMNMRRFPALRPLTTVLKAFLQQRALNEVYTGGI 301
Query: 1453 SSYCLMLLITRFLQ 1466
SY L+ ++ LQ
Sbjct: 302 GSYALLCMVMAHLQ 315
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 40/67 (59%), Gaps = 7/67 (10%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH EI FC+ + A E RK AV RV ++ +WP+SR + GS ATG+ LP+
Sbjct: 139 LHAEIVDFCRFLEPTAEEAEGRKA----AVDRVRAAVLSIWPQSRFEVHGSFATGMYLPN 194
Query: 1270 SDVDLVV 1276
SD+D V+
Sbjct: 195 SDIDAVI 201
>gi|308162052|gb|EFO64479.1| Topoisomerase I-related protein [Giardia lamblia P15]
Length = 520
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 2/80 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + S+ DISF SP G + + E+ P + L +V+K FL R L+Q Y+GG
Sbjct: 203 VHSELSLEYDISFNSPH--GAAHRETILGYIEKHPLARILIMVVKSFLKKRGLNQPYTGG 260
Query: 1452 LSSYCLMLLITRFLQHEHHL 1471
+SSY L+ LI ++ H++
Sbjct: 261 MSSYILLQLIVVYISERHNV 280
>gi|71030472|ref|XP_764878.1| hypothetical protein [Theileria parva strain Muguga]
gi|68351834|gb|EAN32595.1| hypothetical protein TP02_0312 [Theileria parva]
Length = 408
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T VRLD+S + S +TT ++E+ ++ PL L++K FL RSL +Y GG
Sbjct: 233 VDKRTGVRLDLSVYNDS--AFKTTKFIQEMCLKYKYMQPLILLIKLFLQSRSLGDTYFGG 290
Query: 1452 LSSYCLMLLITRFLQ 1466
+ SY L ++ FLQ
Sbjct: 291 VGSYLLYCMVLSFLQ 305
>gi|67539048|ref|XP_663298.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
gi|40743597|gb|EAA62787.1| hypothetical protein AN5694.2 [Aspergillus nidulans FGSC A4]
gi|259484832|tpe|CBF81391.1| TPA: topoisomerase family protein TRF4, putative (AFU_orthologue;
AFUA_7G04130) [Aspergillus nidulans FGSC A4]
Length = 694
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 63/261 (24%), Positives = 103/261 (39%), Gaps = 73/261 (27%)
Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI SF K V E+ R+ V R+ + Q + + FGS A+GL LP+
Sbjct: 255 LHDEILSFYHWVKPVRYEHIVRQDL----VARLQAAFQSRYYGVEIHAFGSFASGLYLPN 310
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
+D+DLV+ R +K G +G+ + + +L NQ S++T+ +
Sbjct: 311 ADIDLVLLSTSFRRTG-VKTFGERKGQ-------IYAFSAFLKNQNIAVPGSVETIAHAR 362
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PI+ V ++ T LK D
Sbjct: 363 VPILKFVDKL---------------------TGLKVD----------------------- 378
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+SF + S GL ++ ++PA + V+KQFL R L++ +
Sbjct: 379 ------------LSFDNDS--GLIANRTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPT 424
Query: 1450 GGLSSYCLMLLITRFLQHEHH 1470
GL + + L+T LQH H
Sbjct: 425 LGLGGFSITCLVTSLLQHMPH 445
>gi|395333834|gb|EJF66211.1| hypothetical protein DICSQDRAFT_152192 [Dichomitus squalens LYAD-421
SS1]
Length = 647
Score = 54.7 bits (130), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 38/68 (55%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R + G+ T +VK+ + PA L L++K FL+ RS+++ ++GGL SY ++
Sbjct: 269 RFSVDISVNQGNGVDTGKMVKQFLRELPALRSLVLIIKNFLSQRSMNEVFTGGLGSYSIV 328
Query: 1459 LLITRFLQ 1466
L FLQ
Sbjct: 329 CLAISFLQ 336
>gi|393216777|gb|EJD02267.1| hypothetical protein FOMMEDRAFT_141374 [Fomitiporia mediterranea
MF3/22]
Length = 732
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 40/67 (59%), Gaps = 2/67 (2%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DIS + G++T D++ +FPA + L++K FL R+L++ YSGGL SY ++
Sbjct: 263 VDISMNQSN--GVKTGDMINRFLNEFPALRAIVLIVKSFLKQRNLNEVYSGGLGSYAIVC 320
Query: 1460 LITRFLQ 1466
L LQ
Sbjct: 321 LAVSHLQ 327
>gi|255710469|ref|XP_002551518.1| KLTH0A01276p [Lachancea thermotolerans]
gi|238932895|emb|CAR21076.1| KLTH0A01276p [Lachancea thermotolerans CBS 6340]
Length = 609
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 92/231 (39%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ +++ LWP + ++FGS AT L LP SD+D V+ +G E
Sbjct: 191 NSTISKIRNAVKDLWPDADLHVFGSFATDLYLPGSDIDCVIN----------SSSGDKEN 240
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN CL A +L ++ ++ + +PII V +
Sbjct: 241 RN-----CLYSLASFLKRRKLAT--QVEVISKARVPIIKFVEPI---------------- 277
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S + ++AA +R L+ T
Sbjct: 278 -----------SQIHIDI-----------SFERVNGLEAARVIR----------GWLKET 305
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL L++KQFLA R L+ ++GGL + ++ L+ FLQ
Sbjct: 306 PGLRELV----------LIVKQFLAARRLNMVHTGGLGGFSIICLVYAFLQ 346
>gi|242775359|ref|XP_002478628.1| topoisomerase family protein TRF4, putative [Talaromyces stipitatus
ATCC 10500]
gi|218722247|gb|EED21665.1| topoisomerase family protein TRF4, putative [Talaromyces stipitatus
ATCC 10500]
Length = 692
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 47/81 (58%), Gaps = 3/81 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I T +R+D++F + S G++ ++ E +PA + V+KQFL RSL++ +GG
Sbjct: 403 IDRLTGLRVDMAFDNDS--GIRAIQTFEKWKEAYPAMPAIVAVIKQFLLLRSLNEVPTGG 460
Query: 1452 LSSYCLMLLITRFLQH-EHHL 1471
L + ++ L+ LQH HHL
Sbjct: 461 LGGFAIICLVVSLLQHMPHHL 481
>gi|145497035|ref|XP_001434507.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124401633|emb|CAK67110.1| unnamed protein product [Paramecium tetraurelia]
Length = 648
Score = 54.3 bits (129), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 54/267 (20%), Positives = 102/267 (38%), Gaps = 74/267 (27%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P + SL + L EI++F +++ ++P I+ V+ V ++Q + +SR ++G
Sbjct: 304 PQHRISLDDIVKQLSFEINTFSQELNKLLEEQRPIIDKIVQMVDETVQSVCSKSRAFLYG 363
Query: 1260 SNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKS 1319
S TG L D D+ + + N + L A Q ++K
Sbjct: 364 SCQTG--LNLLDSDIDIVIET----------------NEQERISLYKIAEQFKTQGFIKE 405
Query: 1320 DSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDS 1379
+K ++N P++ +
Sbjct: 406 --VKVIDNARKPVLKM-------------------------------------------- 419
Query: 1380 ASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFL 1439
+CS + + +DI+ H+G +T + + E ++F LAL+LK +
Sbjct: 420 ---QCSQEFQNKL-------IDITISKNDHSGRKTANSMIEFQKEFKQFKSLALILKFYF 469
Query: 1440 ADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+L +Y GGL+SYC++ +I LQ
Sbjct: 470 KSINLLNAYQGGLNSYCILTMILALLQ 496
>gi|392567029|gb|EIW60204.1| hypothetical protein TRAVEDRAFT_164816 [Trametes versicolor FP-101664
SS1]
Length = 660
Score = 54.3 bits (129), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 39/71 (54%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T R + G+ +VK+ E+ PA L L++K FL+ RS+++ ++GGL SY
Sbjct: 276 THGRFSVDISINQGNGVTAGKMVKQFLEELPALRSLVLIIKSFLSQRSMNEVFTGGLGSY 335
Query: 1456 CLMLLITRFLQ 1466
++ L FLQ
Sbjct: 336 SIVCLAISFLQ 346
>gi|302508901|ref|XP_003016411.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
gi|291179980|gb|EFE35766.1| hypothetical protein ARB_05810 [Arthroderma benhamiae CBS 112371]
Length = 577
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF + S GL D K +++PA +A ++K FL R L+ +GG
Sbjct: 337 VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 394
Query: 1452 LSSYCLMLLITRFLQHE-HHLGRP 1474
L + ++ L+T LQH +G P
Sbjct: 395 LGGFSIICLVTSILQHSPRGIGEP 418
>gi|389585006|dbj|GAB67737.1| hypothetical protein PCYB_123030 [Plasmodium cynomolgi strain B]
Length = 853
Score = 53.9 bits (128), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 43/71 (60%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++D+ S QTTD +K E++ PL ++LK FL R+L+++Y GG+ S+
Sbjct: 527 TGVQVDVCTNQVSSK--QTTDFIKSKVEKYLYLRPLVILLKFFLNTRNLNETYIGGIGSF 584
Query: 1456 CLMLLITRFLQ 1466
L ++T FLQ
Sbjct: 585 LLCCMVTHFLQ 595
>gi|328772133|gb|EGF82172.1| hypothetical protein BATDEDRAFT_23561 [Batrachochytrium dendrobatidis
JAM81]
Length = 752
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 68/137 (49%), Gaps = 32/137 (23%)
Query: 1211 SLLHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSL 1267
++L++EID + V AE++ RK + RV + ++ +W + ++FGS T L L
Sbjct: 176 TMLNQEIDDYVDYVRPTEAEHSLRK----LTIARVRKIVKQIWADAEVHVFGSFQTKLYL 231
Query: 1268 PSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSL---KT 1324
PSSDVD+VV G + + CL+ A+ + K+D+L +
Sbjct: 232 PSSDVDIVVV-----------------GDSCVLPKCLRQLAK-----AFEKADTLSRMEV 269
Query: 1325 VENTAIPIIMLVVEVPH 1341
+E T +PII V ++ H
Sbjct: 270 IEKTKVPIIKGVDKLTH 286
>gi|83763953|emb|CAJ12156.1| cg11265 protein [Drosophila simulans]
Length = 279
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 45/80 (56%), Gaps = 11/80 (13%)
Query: 1213 LHEEIDSFCKQV---AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEEI+ F + V E+ R N VKR+ + +WP++ IFGS TGL LP+
Sbjct: 178 LHEEIEHFYQYVLPTPCEHAIR----NEVVKRIEAVVHSIWPQAVVEIFGSFRTGLFLPT 233
Query: 1270 SDVDLVVC----LPPVRNLE 1285
SD+DLVV P+R LE
Sbjct: 234 SDIDLVVLGLWEKLPLRTLE 253
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 37/64 (57%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
V++DIS H+GL L + L + P P AL ++ LA L+ +Y+GGL SY +
Sbjct: 675 VKVDISLDCAGHSGLAAAALCERLVSRLPELAPAALAVRCLLAAAGLNDAYTGGLPSYAV 734
Query: 1458 MLLI 1461
+L++
Sbjct: 735 LLMV 738
>gi|296821440|ref|XP_002850129.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
gi|238837683|gb|EEQ27345.1| Poly(A) RNA polymerase cid14 [Arthroderma otae CBS 113480]
Length = 620
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF + ++GL D K E++PA +A ++K FL R L+ +GG
Sbjct: 380 VDGLTGLKVDLSFDN--NSGLAAIDTFKAWKEEYPAMPVIASIIKHFLLIRGLNDVATGG 437
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+T LQH
Sbjct: 438 LGGFSIICLVTSILQH 453
>gi|302657626|ref|XP_003020531.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
gi|291184373|gb|EFE39913.1| hypothetical protein TRV_05360 [Trichophyton verrucosum HKI 0517]
Length = 577
Score = 53.9 bits (128), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF + S GL D K +++PA +A ++K FL R L+ +GG
Sbjct: 337 VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 394
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+T LQH
Sbjct: 395 LGGFSIICLVTSILQH 410
>gi|326479586|gb|EGE03596.1| topoisomerase TRF4 [Trichophyton equinum CBS 127.97]
Length = 631
Score = 53.9 bits (128), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF + S GL D K +++PA +A ++K FL R L+ +GG
Sbjct: 391 VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 448
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+T LQH
Sbjct: 449 LGGFSIICLVTSILQH 464
>gi|326470670|gb|EGD94679.1| topoisomerase TRF4 [Trichophyton tonsurans CBS 112818]
Length = 631
Score = 53.9 bits (128), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF + S GL D K +++PA +A ++K FL R L+ +GG
Sbjct: 391 VDALTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 448
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+T LQH
Sbjct: 449 LGGFSIICLVTSILQH 464
>gi|398406264|ref|XP_003854598.1| topoisomerase-related protein, partial [Zymoseptoria tritici IPO323]
gi|339474481|gb|EGP89574.1| topoisomerase-related protein [Zymoseptoria tritici IPO323]
Length = 565
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I A T +++DISF++ S GL+ + + EQFP L ++KQFL R++++ ++GG
Sbjct: 362 IDAKTGLKVDISFENDS--GLRAIETFEAWKEQFPEMPVLVTIIKQFLLMRNMNEVHTGG 419
Query: 1452 LSSYCLMLLITRFLQ 1466
+ + ++ L+ LQ
Sbjct: 420 IGGFTIICLVVSMLQ 434
>gi|327307976|ref|XP_003238679.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
gi|326458935|gb|EGD84388.1| topoisomerase TRF4 [Trichophyton rubrum CBS 118892]
Length = 631
Score = 53.5 bits (127), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF + S GL D K +++PA +A ++K FL R L+ +GG
Sbjct: 391 VDALTGLKVDLSFDNSS--GLTAIDTFKAWKQEYPAMPVIASIIKHFLLIRGLNDVATGG 448
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+T LQH
Sbjct: 449 LGGFSIICLVTSILQH 464
>gi|432884542|ref|XP_004074488.1| PREDICTED: uncharacterized protein LOC101158959 [Oryzias latipes]
Length = 421
Score = 53.5 bits (127), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 45/80 (56%), Gaps = 11/80 (13%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LHEE+ F K ++ E RK VKR+ ++ WP + IFGS +TGL LP+
Sbjct: 242 LHEEVMDFYKYMSPRPEEAAMRKE----VVKRIETIIKEQWPSADVQIFGSFSTGLYLPT 297
Query: 1270 SDVDLVVC----LPPVRNLE 1285
SD+DLVV PP++ LE
Sbjct: 298 SDIDLVVFGKWERPPLQELE 317
>gi|353241543|emb|CCA73351.1| related to TRF4-topoisomerase I-related protein [Piriformospora
indica DSM 11827]
Length = 628
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 2/67 (2%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DIS TG+ ++ E FPA PL +V+K FL R +++ Y GGL SY ++
Sbjct: 209 VDISLNQT--TGVSAIPVINRYLEHFPALRPLIMVVKAFLNQRGMNEVYKGGLGSYSIIC 266
Query: 1460 LITRFLQ 1466
L FLQ
Sbjct: 267 LAISFLQ 273
>gi|84468450|dbj|BAE71308.1| hypothetical protein [Trifolium pratense]
Length = 518
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 10/89 (11%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + DISF G + + ++E ++P PL L+LK FL R L++ YSGG
Sbjct: 215 VEKKSGLSFDISFDI--DNGPKAAEYIQEAVAKWPQLRPLCLILKVFLQQRELNEVYSGG 272
Query: 1452 LSSYCLMLLITRFLQH--------EHHLG 1472
+ SY L+ ++ L++ EH+LG
Sbjct: 273 IGSYALLTMLMAMLRNVRQSQPTAEHNLG 301
>gi|323352825|gb|EGA85127.1| Trf5p [Saccharomyces cerevisiae VL3]
Length = 510
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 164 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 213
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 214 RNYIYEL-----ARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 250
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 251 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 272
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 273 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 318
>gi|156837261|ref|XP_001642660.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
70294]
gi|156113216|gb|EDO14802.1| hypothetical protein Kpol_1076p8 [Vanderwaltozyma polyspora DSM
70294]
Length = 524
Score = 53.5 bits (127), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 99/244 (40%), Gaps = 76/244 (31%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ ++Q WP + ++FGS AT L LP SD+D VV +AG +
Sbjct: 137 NQTIGKLRDAVQHHWPDANLHVFGSYATDLYLPGSDIDCVVN----------SKAGDKQS 186
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN CL A +L + ++ ++ + +PII V +
Sbjct: 187 RN-----CLYSLASHLKKEGL--AEDIEIIAKARVPIIKFVEPL---------------- 223
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R L +T
Sbjct: 224 -----------SKIHVDV-----------SFERTNGLEAAKLIR----------GWLDST 251
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ-HEHHLGRP 1474
+ ++EL L++KQFL R L++ ++GGL + ++ L+ FL H L
Sbjct: 252 NGLRELV----------LIVKQFLQARRLNKVHTGGLGGFSIICLVYSFLHLHPRILANE 301
Query: 1475 INQV 1478
IN +
Sbjct: 302 INPI 305
>gi|323335976|gb|EGA77253.1| Trf5p [Saccharomyces cerevisiae Vin13]
Length = 576
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|1493831|gb|AAC49397.1| Trf5p [Saccharomyces cerevisiae]
Length = 625
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|392296994|gb|EIW08095.1| Trf5p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 608
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 91/230 (39%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 164 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 213
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ ++ + T +PII + P+
Sbjct: 214 RNYIYEL-----ARHLKNKGLAI--RMEVIVKTRVPIIKFI----------------EPQ 250
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 251 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 272
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 273 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 318
>gi|207341968|gb|EDZ69877.1| YNL299Wp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 642
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|403419742|emb|CCM06442.1| predicted protein [Fibroporia radiculosa]
Length = 1487
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
+ +DIS G+ ++K + PA L L++K FL+ RS+++ Y+GGL SY +
Sbjct: 268 IPVDISINQ--GNGVTAGTMIKHFLAELPALRSLVLIVKSFLSQRSMNEVYTGGLGSYSI 325
Query: 1458 MLLITRFLQHEHHLGR 1473
+ L+ FLQ + R
Sbjct: 326 VCLVISFLQMHPKIRR 341
>gi|1050861|gb|AAC49099.1| Ynl0440p [Saccharomyces cerevisiae]
gi|1302392|emb|CAA96217.1| TRF5 [Saccharomyces cerevisiae]
Length = 625
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|323346953|gb|EGA81231.1| Trf5p [Saccharomyces cerevisiae Lalvin QA23]
Length = 616
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|301615931|ref|XP_002937419.1| PREDICTED: PAP-associated domain-containing protein 5-like [Xenopus
(Silurana) tropicalis]
Length = 572
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F K ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 156 LHEEIMDFYKYMSPRPEEEKMRME-VVNRIENVIKELWPNADVQIFGSFKTGLYLPTSDI 214
Query: 1273 DLVV 1276
DLVV
Sbjct: 215 DLVV 218
Score = 46.6 bits (109), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 28/43 (65%)
Query: 1424 QFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 253 KYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 295
>gi|256271295|gb|EEU06367.1| Trf5p [Saccharomyces cerevisiae JAY291]
gi|349580652|dbj|GAA25811.1| K7_Trf5p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 642
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|190409265|gb|EDV12530.1| hypothetical protein SCRG_03424 [Saccharomyces cerevisiae RM11-1a]
Length = 642
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|81360384|ref|NP_014100.2| non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
S288c]
gi|148887014|sp|P48561.2|TRF5_YEAST RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
Full=Topoisomerase 1-related protein TRF5
gi|151944249|gb|EDN62528.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
gi|285814367|tpg|DAA10261.1| TPA: non-canonical poly(A) polymerase TRF5 [Saccharomyces cerevisiae
S288c]
Length = 642
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|365763604|gb|EHN05131.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|302148910|pdb|3NYB|A Chain A, Structure And Function Of The Polymerase Core Of Tramp, A
Rna Surveillance Complex
Length = 323
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 85/234 (36%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +++ LWP + ++FGS +T L LP SD+D VV E G E
Sbjct: 43 NQTISTIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVVT----------SELGGKES 92
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + + LA + ++ V +PII V PH I A
Sbjct: 93 RNNLYSLASHLKKKNLATE-------VEVVAKARVPIIKFV--EPHSGIHIA-------- 135
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
S ++ I+AA +R
Sbjct: 136 ----------------------------VSFERTNGIEAAKLIR---------------- 151
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
E + P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 152 ----EWLDDTPGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVFSFL-HMH 200
>gi|323331833|gb|EGA73245.1| Trf5p [Saccharomyces cerevisiae AWRI796]
Length = 453
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|323307579|gb|EGA60848.1| Trf5p [Saccharomyces cerevisiae FostersO]
Length = 575
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 131 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 180
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 181 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 217
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 218 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 239
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 240 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 285
>gi|259149072|emb|CAY82314.1| Trf5p [Saccharomyces cerevisiae EC1118]
Length = 642
Score = 53.1 bits (126), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 92/230 (40%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 198 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 247
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN I E AR+L N+ + ++ + T +PII + P+
Sbjct: 248 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI----------------EPQ 284
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R
Sbjct: 285 -----------SQLHIDV-----------SFERTNGLEAAKLIR---------------- 306
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 307 ----EWLRDSPGLRELVLIIKQFLHSRRLNNVHTGGLGGFTVICLVYSFL 352
>gi|302847239|ref|XP_002955154.1| hypothetical protein VOLCADRAFT_96037 [Volvox carteri f. nagariensis]
gi|300259446|gb|EFJ43673.1| hypothetical protein VOLCADRAFT_96037 [Volvox carteri f. nagariensis]
Length = 1983
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 63/131 (48%), Gaps = 14/131 (10%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
SLLHEE+ +F ++ A ++ + V R +WP +R +FGS A GL+LP S
Sbjct: 1276 SLLHEEVAAFARESAPREDELL-ALSRGLAAVQRVATGIWPTARVVLFGSQANGLALPGS 1334
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS----LKTVE 1326
D+DLVV L N+ A EG +G+ AAR+L D+ TV
Sbjct: 1335 DLDLVV-LGVTPNM-----ARAAEGLSGLARAT---AARHLETLAHRLRDAGLTRRHTVV 1385
Query: 1327 NTAIPIIMLVV 1337
+PI+ V+
Sbjct: 1386 KAKVPIVKTVL 1396
>gi|256818788|ref|NP_001157971.1| PAP-associated domain-containing protein 5 isoform b [Mus musculus]
Length = 637
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 44/165 (26%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 181 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 239
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 240 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 275
Query: 1328 TAIPIIMLV---VEVPHDLIASAASSVQS-----------PKEDA 1358
+PII L EV D+ + + V++ P+EDA
Sbjct: 276 ATVPIIKLTDSFTEVKVDISFNVQNGVRAADLIKDFTKLHPREDA 320
>gi|452988727|gb|EME88482.1| hypothetical protein MYCFIDRAFT_159726 [Pseudocercospora fijiensis
CIRAD86]
Length = 658
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + T++++DISF++ S GL T + +QFP L ++KQFLA R L++ ++GG
Sbjct: 327 VDSRTNIKVDISFENDS--GLNTIPTLLAWKQQFPEMPVLVAIIKQFLAMRGLNEVHTGG 384
Query: 1452 LSSYCLMLLITRFLQ 1466
+ + ++ L+ LQ
Sbjct: 385 IGGFTIICLVVSMLQ 399
>gi|145548044|ref|XP_001459703.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124427529|emb|CAK92306.1| unnamed protein product [Paramecium tetraurelia]
Length = 650
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 51/99 (51%), Gaps = 11/99 (11%)
Query: 1372 DMVALDDSASP----KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPA 1427
D+ +D++ P +CS + + +DI+ H+G +T + + E ++F
Sbjct: 406 DVKVIDNARKPVLKMQCSQEFQNKL-------IDITISKNDHSGRKTANSMIEFQKEFKQ 458
Query: 1428 STPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LAL+LK + +L +Y GGL+SYC++ +I LQ
Sbjct: 459 FKSLALILKFYFKSINLLNAYQGGLNSYCILTMILALLQ 497
Score = 44.7 bits (104), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 44/77 (57%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P + +L + L EI++F +++ ++P I+ V+ V +Q + +SR ++G
Sbjct: 305 PQHRTTLDDIVKQLSFEINTFSQELNKLLEEQRPIIDKIVQMVDEIVQTVSSKSRAFLYG 364
Query: 1260 SNATGLSLPSSDVDLVV 1276
S TGL+L SD+D+V+
Sbjct: 365 SCYTGLNLLDSDIDIVI 381
>gi|449498731|ref|XP_004160618.1| PREDICTED: uncharacterized protein LOC101229001 [Cucumis sativus]
Length = 384
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 64/127 (50%), Gaps = 25/127 (19%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC+ ++ E AR + AV+RV ++ +WP + +FGS TGL LP+
Sbjct: 197 LHKEIVDFCEFLSPTEEERVAR----DSAVERVFSVVKHIWPHCKVEVFGSFQTGLYLPT 252
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D+V+ +GI + + G LQ +R L+ + K ++ +
Sbjct: 253 SDIDVVIL-----------GSGIPKPQLG-----LQALSRALSQKGIAK--KIQVIGKAR 294
Query: 1330 IPIIMLV 1336
+PII +
Sbjct: 295 VPIIKFI 301
>gi|219518398|gb|AAI44798.1| Papd5 protein [Mus musculus]
Length = 590
Score = 52.8 bits (125), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 71/165 (43%), Gaps = 44/165 (26%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 134 LHEEISDFYEYMSPRPEEEKMRME-VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 192
Query: 1273 DLVVC-----LPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVEN 1327
DLVV LP L ++EA L + DS+K ++
Sbjct: 193 DLVVFGKWENLP----LWTLEEA--------------------LRKHKVADEDSVKVLDK 228
Query: 1328 TAIPIIMLV---VEVPHDLIASAASSVQS-----------PKEDA 1358
+PII L EV D+ + + V++ P+EDA
Sbjct: 229 ATVPIIKLTDSFTEVKVDISFNVQNGVRAADLIKDFTKLHPREDA 273
>gi|358394414|gb|EHK43807.1| hypothetical protein TRIATDRAFT_163212, partial [Trichoderma
atroviride IMI 206040]
Length = 690
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 47/92 (51%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I AT +R+D+SF++ G++ D ++FPA L V+K FL R L++ +GG
Sbjct: 481 IDRATGLRVDVSFEN--LGGVKAIDTFLRWKKEFPAMPILVTVIKHFLLMRGLNEPVNGG 538
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
L + ++ L+ LQ EHHLG
Sbjct: 539 LGGFSVICLVVSMLQLMPQVQSRSLIPEHHLG 570
>gi|156096154|ref|XP_001614111.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802985|gb|EDL44384.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 808
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I T V++D+ S QTTD +K ++F PL ++LK FL R+L+++Y GG
Sbjct: 486 IDRETGVQVDVCTNQLSSR--QTTDFIKSKLQKFIYMRPLVILLKFFLNTRNLNETYIGG 543
Query: 1452 LSSYCLMLLITRFLQ 1466
+ S+ L ++ FLQ
Sbjct: 544 IGSFLLCCMVLHFLQ 558
>gi|403159818|ref|XP_003320384.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375168256|gb|EFP75965.2| hypothetical protein PGTG_01296 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 876
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 56/267 (20%), Positives = 105/267 (39%), Gaps = 35/267 (13%)
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
+ +L EI SF + + + ++ + +++ WP + FGS T L LP
Sbjct: 68 VQQMLTAEIGSFVAYIQPTHEEHQ-LRQMIIQMIRKTVHSRWPDADVEPFGSFGTKLYLP 126
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
+ D+DLV+ + N + K L A L + + D + +
Sbjct: 127 AGDIDLVIISTQMMNEQ--------------KSRILYKLA-PLIRENNIGQDVV-VIAKA 170
Query: 1329 AIPII---MLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
+PII + + D+ + + + +K N + D+ L S
Sbjct: 171 KVPIIKFKTIFGNINVDISINQTNGI---------VAMKKVNELLDDIKCLSLDISGSTR 221
Query: 1386 HTSSDNIKAATSVRLDISFKSPSHTGLQTTDL------VKELTEQFPASTPLALVLKQFL 1439
H+ + ++ + S H + DL + ++ E A+ L LV+K L
Sbjct: 222 HSRDLDQPLSSRKEQNGPAHSRRHRRTSSPDLSDDEKAISQIIEDLGAAKCLILVIKSVL 281
Query: 1440 ADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
R +++ Y+GGL SY ++ L+ FLQ
Sbjct: 282 KQRGMNEVYTGGLGSYSIICLVVSFLQ 308
>gi|156088769|ref|XP_001611791.1| hypothetical protein [Babesia bovis T2Bo]
gi|154799045|gb|EDO08223.1| hypothetical protein BBOV_III006620 [Babesia bovis]
Length = 468
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V+LDIS + TT+ VK +Q+ PL L++K FL R+L+ +Y GG+ SY
Sbjct: 179 SGVKLDISIWQAE--AINTTNFVKAKCKQYKYLQPLVLLIKLFLQLRNLNDTYIGGIGSY 236
Query: 1456 CLMLLITRFLQ 1466
L ++ FLQ
Sbjct: 237 LLYCMVLSFLQ 247
>gi|402220735|gb|EJU00806.1| Nucleotidyltransferase, partial [Dacryopinax sp. DJM-731 SS1]
Length = 266
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 34/63 (53%)
Query: 1411 GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
GL+ +V PA PL +V+K FL R L++ YSGG SY ++ ++ FLQ
Sbjct: 127 GLRAVTVVNRFLWYLPAVRPLVMVIKAFLLQRGLNEPYSGGFGSYTVICMVVSFLQMHPK 186
Query: 1471 LGR 1473
+ R
Sbjct: 187 IRR 189
>gi|313243660|emb|CBY42319.1| unnamed protein product [Oikopleura dioica]
Length = 513
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 48/72 (66%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V++DISF TG+++ L+K ++ L LVLKQFL ++L++ ++GG+SSY
Sbjct: 52 SDVKVDISFNV--KTGIKSVSLIKAFIREYQVLPVLVLVLKQFLLQQNLNEVWTGGISSY 109
Query: 1456 CLMLLITRFLQH 1467
L+L++ FLQ+
Sbjct: 110 GLILMVLCFLQN 121
>gi|393245685|gb|EJD53195.1| Nucleotidyltransferase [Auricularia delicata TFB-10046 SS5]
Length = 584
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 39/71 (54%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T R + G+ + +V ++ PA PLA+ +K FL +R++++ Y+GGL SY
Sbjct: 222 THGRFAVDISVNQDNGIASGRIVNGFLQELPALRPLAMTVKAFLRERNMNEVYNGGLGSY 281
Query: 1456 CLMLLITRFLQ 1466
+ L+ FLQ
Sbjct: 282 STVCLLVSFLQ 292
>gi|407927280|gb|EKG20178.1| PAP/25A-associated [Macrophomina phaseolina MS6]
Length = 691
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 46/76 (60%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++DISF++ +TG+ +E E +PA +A ++KQ+LA R L++ ++GG
Sbjct: 477 VDRKTGLKVDISFEN--NTGIIANRTFREWKEMYPAMPVIATLIKQYLAMRGLNEVFTGG 534
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+ L +
Sbjct: 535 LGGFSVICLVVSLLHN 550
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 64/141 (45%), Gaps = 17/141 (12%)
Query: 1199 CPTRKASLSLMHSLLHEEIDSFCKQVAA---ENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
C ++ M LH+EI F + V EN R ++ +++VTR +P S
Sbjct: 351 CAIDHSATQSMGFWLHKEIADFYEFVRPHPFENEIRLDLVH-RIEQVTRRE---FPGSEL 406
Query: 1256 NIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQE 1315
+ FGS A GL LP++D+DLV+ + + I GR + + L + L +
Sbjct: 407 HYFGSFAAGLYLPTADMDLVL----------LSKHFIRTGRGNVTKKNLYKMSTLLEREG 456
Query: 1316 WVKSDSLKTVENTAIPIIMLV 1336
+ + + + +PII +V
Sbjct: 457 ISQPGATEVISGARVPIIKIV 477
>gi|403221893|dbj|BAM40025.1| topoisomerase-related nucleotidyltransferase [Theileria orientalis
strain Shintoku]
Length = 520
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T VR+D+S + S +TT ++E+ ++ PL L++K FL R+L +Y GG
Sbjct: 232 VDKRTGVRIDLSCYNES--AFKTTKFIQEMCVKYKYMQPLILLVKLFLQSRNLGDTYFGG 289
Query: 1452 LSSYCLMLLITRFLQ 1466
+ S+ L +I FLQ
Sbjct: 290 VGSFLLYCMILSFLQ 304
>gi|402083045|gb|EJT78063.1| DNA polymerase sigma [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 732
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++DISF++ TG+ + E+FPA L V+K FLA R L++ +GG
Sbjct: 515 VDARTGLKVDISFEN--STGITAVNTFLAWKEEFPAMPILVTVIKHFLAMRGLNEPVNGG 572
Query: 1452 LSSYCLMLLITRFLQ 1466
L + ++ L+ LQ
Sbjct: 573 LGGFSVICLVVSMLQ 587
>gi|444720754|gb|ELW61529.1| HEAT repeat-containing protein 3 [Tupaia chinensis]
Length = 1047
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 71/160 (44%), Gaps = 34/160 (21%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 646 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 704
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
DLVV NL P+ L+ A R + DS+K ++ +PI
Sbjct: 705 DLVV-FGKWENL-PL--------------WTLEEALR---KHKVADEDSVKVLDKATVPI 745
Query: 1333 IMLV---VEVPHDLIASAASSVQS-----------PKEDA 1358
I L EV D+ + + V++ P+EDA
Sbjct: 746 IKLTDSFTEVKVDISFNVQNGVRAADLIKDFTKLHPREDA 785
>gi|366997671|ref|XP_003683572.1| hypothetical protein TPHA_0A00530 [Tetrapisispora phaffii CBS 4417]
gi|357521867|emb|CCE61138.1| hypothetical protein TPHA_0A00530 [Tetrapisispora phaffii CBS 4417]
Length = 678
Score = 52.0 bits (123), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 51/88 (57%), Gaps = 3/88 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + G++ L++E + P L LV+KQFLA R L+ ++GG
Sbjct: 281 VEPKSQIHIDVSFERTN--GVEAAKLIREWLDDTPGLRELVLVIKQFLATRRLNNVHTGG 338
Query: 1452 LSSYCLMLLITRFLQ-HEHHLGRPINQV 1478
L + ++ L+ FL+ H + IN++
Sbjct: 339 LGGFSIICLVFCFLKMHPKIITNAINEL 366
>gi|221058739|ref|XP_002260015.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193810088|emb|CAQ41282.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 821
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T V++D+ S QTTD +K E++ PL ++LK FL R+L+++Y GG+ S+
Sbjct: 499 TGVQVDVCTNQLS--SRQTTDFIKSKMEKYIYLRPLVILLKFFLNTRNLNETYIGGIGSF 556
Query: 1456 CLMLLITRFLQ 1466
L ++ FLQ
Sbjct: 557 MLCCMVLHFLQ 567
>gi|443927237|gb|ELU45748.1| Trf5 [Rhizoctonia solani AG-1 IA]
Length = 622
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
++DIS + G+ ++ ++ PA PL + K FL+ R +++ Y+GGL SY ++
Sbjct: 208 KVDISLNQAN--GVSAGKIINNFLKRLPALRPLVFIFKAFLSQREMNEVYNGGLGSYSIV 265
Query: 1459 LLITRFLQ 1466
++ FLQ
Sbjct: 266 CMVLSFLQ 273
>gi|355756767|gb|EHH60375.1| hypothetical protein EGM_11721, partial [Macaca fascicularis]
Length = 483
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LHEEI F + ++ K + V R+ ++ LWP + IFGS TGL LP+SD+
Sbjct: 19 LHEEISDFYEYMSPRPEEEKMRME-VVNRIESVIKELWPSADVQIFGSFKTGLYLPTSDI 77
Query: 1273 DLVV 1276
DLVV
Sbjct: 78 DLVV 81
Score = 47.8 bits (112), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 33/53 (62%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ ++ + T ++P L LVLKQFL R L++ ++GG+ SY L L+ FLQ
Sbjct: 107 SVKVLDKATVKYPVLPYLVLVLKQFLLQRDLNEVFTGGIGSYSLFLMAVSFLQ 159
>gi|380483151|emb|CCF40795.1| Poly(A) RNA polymerase cid14 [Colletotrichum higginsianum]
Length = 269
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 45/92 (48%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I + T +R+DISF G Q E EQFPA L ++K FLA R L++ +GG
Sbjct: 56 IDSITGLRVDISFDRLD--GPQAVKTFAEWKEQFPAMPILVTMIKHFLAMRGLNEPVNGG 113
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ S+ + ++ LQ EHHLG
Sbjct: 114 IGSFTVTCMVVSMLQLMPQVQSRNLIPEHHLG 145
>gi|313231448|emb|CBY08562.1| unnamed protein product [Oikopleura dioica]
Length = 587
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 48/72 (66%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V++DISF TG+++ L+K ++ L LVLKQFL ++L++ ++GG+SSY
Sbjct: 116 SDVKVDISFNV--KTGIKSVSLIKAFIREYQVLPVLVLVLKQFLLQQNLNEVWTGGISSY 173
Query: 1456 CLMLLITRFLQH 1467
L+L++ FLQ+
Sbjct: 174 GLILMVLCFLQN 185
>gi|449547164|gb|EMD38132.1| hypothetical protein CERSUDRAFT_49354 [Ceriporiopsis subvermispora B]
Length = 547
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 37/75 (49%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R + G+ ++ + PA L V+K FL+ RS+++ ++GGL SY ++
Sbjct: 283 RFSVDISINQGNGVAAGKMINHFLSELPALRALVFVVKSFLSQRSMNEVFTGGLGSYSIV 342
Query: 1459 LLITRFLQHEHHLGR 1473
L FLQH + R
Sbjct: 343 CLAISFLQHHPKIRR 357
>gi|320035002|gb|EFW16944.1| Poly(A) polymerase [Coccidioides posadasii str. Silveira]
Length = 680
Score = 51.6 bits (122), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + ++GL ++ E FPA + V+KQFL R L++ +GGL +
Sbjct: 365 TGLKVDLSFDN--NSGLAANRTFQQWKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGF 422
Query: 1456 CLMLLITRFLQHEHH 1470
++ L+T LQH H
Sbjct: 423 SIICLVTSLLQHLPH 437
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 31/184 (16%)
Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
D ++ + ++ E PD F L +Q T R+A+ L +L +E
Sbjct: 192 DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 251
Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
+D + K E+ R I R R +Q +P S+ + FGS A+GL LP +DVDL
Sbjct: 252 VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 307
Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
V+ + + I +GR + K + Y+ + E S++T+ + +PI
Sbjct: 308 VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 357
Query: 1333 IMLV 1336
I V
Sbjct: 358 IKFV 361
>gi|299738269|ref|XP_001838228.2| PAP associated domain containing 5 [Coprinopsis cinerea okayama7#130]
gi|298403230|gb|EAU83596.2| PAP associated domain containing 5 [Coprinopsis cinerea okayama7#130]
Length = 431
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 127/339 (37%), Gaps = 89/339 (26%)
Query: 1159 MIQPLSGALLQDHLIAISQLARDQEHPD-VAFPLQPL-------EVQNCPTRKASLSLMH 1210
M L G L+D++IA R + + D P+ P N P +L+ H
Sbjct: 1 MSSSLLGQTLRDNIIA-GVPTRTKAYSDYTGLPVPPCGDFAPRSTSINAPPWMPTLTSPH 59
Query: 1211 SL-LHEEIDSFC---KQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
L LH EI+ F K E R IN R T+ ++ L P+ GS +GL
Sbjct: 60 RLSLHNEIEHFVAYMKPTDKELLLRTDLIN----RFTKLIETLDPQVSVQAVGSTVSGLH 115
Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
P+SD+D+VV L R L R + T L++ R ++S +
Sbjct: 116 FPTSDIDMVVTLAGAR--AGYYSTSTLSARTTL--TLLEYKIRSSGFARKIES-----IL 166
Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
N ++P++ I A + + A +K N V S M
Sbjct: 167 NASVPLLR---------ITDAVTGIDIDLTAADEHGVKATNMVQSYM------------- 204
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
KS + T +++ L +VLK FLA R L
Sbjct: 205 ------------------KSENATVIKS----------------LVMVLKLFLATRKLGT 230
Query: 1447 SYSGGLSSYCLMLLITRFLQHEH-------HLGRPINQV 1478
+++GG++SY L+ ++ +++ E H RP N V
Sbjct: 231 TFTGGINSYLLVWMMVAWVKLEMPKLVGGAHSERPSNNV 269
>gi|212532361|ref|XP_002146337.1| topoisomerase family protein TRF4, putative [Talaromyces marneffei
ATCC 18224]
gi|210071701|gb|EEA25790.1| topoisomerase family protein TRF4, putative [Talaromyces marneffei
ATCC 18224]
Length = 683
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 46/77 (59%), Gaps = 3/77 (3%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D++F + S G++ ++ E +PA + V+KQFL RSL++ +GGL +
Sbjct: 400 TGLKVDMAFDNDS--GIRAIQTFEKWKEAYPAMPAIVAVVKQFLLLRSLNEVPTGGLGGF 457
Query: 1456 CLMLLITRFLQH-EHHL 1471
++ L+ LQH HHL
Sbjct: 458 SIICLVVSLLQHMPHHL 474
>gi|403213331|emb|CCK67833.1| hypothetical protein KNAG_0A01440 [Kazachstania naganishii CBS 8797]
Length = 526
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + GL+ L++E E P L L++KQFL R L+ ++GG
Sbjct: 245 VEPESRIHIDVSFERIN--GLEAAKLIREWLESTPGLRELVLIIKQFLHSRRLNNVHTGG 302
Query: 1452 LSSYCLMLLITRFL 1465
L + ++ L+ FL
Sbjct: 303 LGGFSIICLVYSFL 316
>gi|254579541|ref|XP_002495756.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
gi|238938647|emb|CAR26823.1| ZYRO0C02332p [Zygosaccharomyces rouxii]
Length = 531
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 53/230 (23%), Positives = 88/230 (38%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N ++ + +++ LWP + +FGS AT L LP SD+D V+ + G E
Sbjct: 123 NKTIRTLRHAVRKLWPGADLQVFGSYATDLYLPGSDIDCVIN----------SKTGDKEN 172
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R+ + E R LA Q ++ + +PII V P
Sbjct: 173 RSSLYELAHFLKNRKLATQ-------VEVIAKARVPIIKFV----------------EPT 209
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ ++AA +R + LQ T
Sbjct: 210 -----------SQIHVDV-----------SFERTNGLEAAKLIR----------SWLQQT 237
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
P L L++KQFL R L+ ++GGL + ++ L+ FL
Sbjct: 238 ----------PGLRELVLIVKQFLHARRLNNVHTGGLGGFSIICLVYAFL 277
>gi|303323645|ref|XP_003071814.1| PAP/25A associated domain containing protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240111516|gb|EER29669.1| PAP/25A associated domain containing protein [Coccidioides posadasii
C735 delta SOWgp]
Length = 680
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + ++GL ++ E FPA + V+KQFL R L++ +GGL +
Sbjct: 365 TGLKVDLSFDN--NSGLAANRTFQQWKEHFPAMPLIVSVIKQFLLLRGLNEVPTGGLGGF 422
Query: 1456 CLMLLITRFLQHEHH 1470
++ L+T LQH H
Sbjct: 423 SIICLVTSLLQHLPH 437
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 31/184 (16%)
Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
D ++ + ++ E PD F L +Q T R+A+ L +L +E
Sbjct: 192 DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 251
Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
+D + K E+ R I R R +Q +P S+ + FGS A+GL LP +DVDL
Sbjct: 252 VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 307
Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
V+ + + I +GR + K + Y+ + E S++T+ + +PI
Sbjct: 308 VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 357
Query: 1333 IMLV 1336
I V
Sbjct: 358 IKFV 361
>gi|406701338|gb|EKD04487.1| hypothetical protein A1Q2_01263 [Trichosporon asahii var. asahii CBS
8904]
Length = 624
Score = 51.6 bits (122), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A + +DIS + G+ +V + P + L LV+K FL+ RS+++ Y+GG
Sbjct: 211 VTAEGGINVDISLNQTN--GVSAVKIVNHYLKALPGARELILVIKAFLSQRSMNEVYTGG 268
Query: 1452 LSSYCLMLLITRFLQ 1466
L SY ++ L FLQ
Sbjct: 269 LGSYSVICLALSFLQ 283
>gi|401882466|gb|EJT46724.1| hypothetical protein A1Q1_04689 [Trichosporon asahii var. asahii CBS
2479]
Length = 631
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A + +DIS + G+ +V + P + L LV+K FL+ RS+++ Y+GG
Sbjct: 211 VTAEGGINVDISLNQTN--GVSAVKIVNHYLKALPGARELILVIKAFLSQRSMNEVYTGG 268
Query: 1452 LSSYCLMLLITRFLQ 1466
L SY ++ L FLQ
Sbjct: 269 LGSYSVICLALSFLQ 283
>gi|242212981|ref|XP_002472321.1| predicted protein [Postia placenta Mad-698-R]
gi|220728598|gb|EED82489.1| predicted protein [Postia placenta Mad-698-R]
Length = 1512
Score = 51.2 bits (121), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 36/63 (57%)
Query: 1411 GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
G+ ++K + PA L LV+K FL+ RS+++ Y+GGL SY ++ L FLQ
Sbjct: 269 GVTAGKMIKHYLAELPALRSLILVIKSFLSQRSMNEVYTGGLGSYSIVCLAISFLQMHPK 328
Query: 1471 LGR 1473
+ R
Sbjct: 329 IRR 331
>gi|392867853|gb|EAS33553.2| topoisomerase TRF4 [Coccidioides immitis RS]
Length = 680
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + ++GL ++ E FPA + V+KQFL R L++ +GGL +
Sbjct: 365 TGLKVDLSFDN--NSGLAANRTFQQWKEHFPAMPLIVSVVKQFLLLRGLNEVPTGGLGGF 422
Query: 1456 CLMLLITRFLQHEHH 1470
++ L+T LQH H
Sbjct: 423 SIICLVTSLLQHLPH 437
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 48/184 (26%), Positives = 80/184 (43%), Gaps = 31/184 (16%)
Query: 1170 DHLIAISQLARDQEHPDVAFPLQPLEVQNCPT-------------RKASLSLMHSLLHEE 1216
D ++ + ++ E PD F L +Q T R+A+ L +L +E
Sbjct: 192 DEIVGPAPRSQRYEKPDTRFNLDGSIIQAYRTIAGQDSTPWFDRSRQATFHLGAALQNEI 251
Query: 1217 IDSF--CKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
+D + K E+ R I R R +Q +P S+ + FGS A+GL LP +DVDL
Sbjct: 252 VDFYHWVKPRPFEDVIRTDLIT----RFERLMQNRFPGSQLHAFGSYASGLYLPVADVDL 307
Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGI--KETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
V+ + + I +GR + K + Y+ + E S++T+ + +PI
Sbjct: 308 VL----------LSRSFIRQGRKFLCQKIKDIYSLTAYIRDTEIAVPGSIETIAHARVPI 357
Query: 1333 IMLV 1336
I V
Sbjct: 358 IKFV 361
>gi|401623740|gb|EJS41828.1| trf4p [Saccharomyces arboricola H-6]
Length = 573
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 51/234 (21%), Positives = 87/234 (37%), Gaps = 76/234 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + +++ LWP + ++FGS +T L LP SD+D V+ E G E
Sbjct: 193 NQTISMIREAVKQLWPDADLHVFGSYSTDLYLPGSDIDCVIT----------SELGGKES 242
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
RN + + LA + ++ V +PII V
Sbjct: 243 RNNLFSLASHLKKKNLATE-------IEVVAKARVPIIKFV------------------- 276
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ ++ +H D+ S ++ ++AA +R
Sbjct: 277 --------EPNSGIHIDV-----------SFERTNGLEAAKLIR---------------- 301
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEH 1469
E P L L++KQFL R L+ ++GGL + ++ L+ FL H H
Sbjct: 302 ----EWLNDTPGLRELVLIVKQFLHSRRLNNVHTGGLGGFSIICLVFSFL-HMH 350
>gi|449296989|gb|EMC93008.1| hypothetical protein BAUCODRAFT_49110, partial [Baudoinia
compniacensis UAMH 10762]
Length = 611
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I T +++DISF++ TG+ +E +PA + +++KQ LA R L++ +SGG
Sbjct: 407 IDKKTGIKVDISFEN--RTGIVANKTFQEWKVLYPAMPAIVVLIKQLLAMRGLNEVFSGG 464
Query: 1452 LSSYCLMLLITRFLQH 1467
+ + + L+ +QH
Sbjct: 465 IGGFTTICLVVSMMQH 480
>gi|429964503|gb|ELA46501.1| hypothetical protein VCUG_01993 [Vavraia culicis 'floridensis']
Length = 401
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 2/67 (2%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R D+S S G+ T LVKE+ + PA +AL LK FL R L+++ GGL+SY
Sbjct: 207 RYDLSLNQES--GVIHTKLVKEILHEQPAIKDMALFLKHFLKCRGLNENRRGGLNSYAQF 264
Query: 1459 LLITRFL 1465
L+I FL
Sbjct: 265 LMIASFL 271
>gi|426385175|ref|XP_004059104.1| PREDICTED: PAP-associated domain-containing protein 5-like, partial
[Gorilla gorilla gorilla]
Length = 240
Score = 51.2 bits (121), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 9/79 (11%)
Query: 1213 LHEEIDSFCKQVAA--ENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LHEEI F ++ E A + + VKR+ ++ LWP + IFGS +TGL LP+S
Sbjct: 136 LHEEIIDFYNFMSPCPEEAAMRREV---VKRIETVVKDLWPTADVQIFGSFSTGLYLPTS 192
Query: 1271 DVDLVVC----LPPVRNLE 1285
D+DLVV PP++ LE
Sbjct: 193 DIDLVVFGKWERPPLQLLE 211
>gi|443895250|dbj|GAC72596.1| DNA polymerase sigma [Pseudozyma antarctica T-34]
Length = 689
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 4/70 (5%)
Query: 1398 VRLDISFKSPSHT-GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
+++DIS +HT GL T V ++P PL +V+K L R + + +SGGL SY
Sbjct: 210 LKVDISL---NHTNGLTTASFVNSWLRKWPHIRPLIIVVKHLLMQRGMSEVFSGGLGSYS 266
Query: 1457 LMLLITRFLQ 1466
+++++ FLQ
Sbjct: 267 IIIMVISFLQ 276
>gi|409045762|gb|EKM55242.1| hypothetical protein PHACADRAFT_93478 [Phanerochaete carnosa
HHB-10118-sp]
Length = 478
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 36/68 (52%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R + G++ ++K + PA L L+ K FL+ RS+++ ++GGL SY ++
Sbjct: 264 RFSVDISLNQMNGVKAGTMIKRFLDHIPALQALVLITKSFLSQRSMNEVFTGGLGSYSIV 323
Query: 1459 LLITRFLQ 1466
L FLQ
Sbjct: 324 CLAISFLQ 331
>gi|156057775|ref|XP_001594811.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980]
gi|154702404|gb|EDO02143.1| hypothetical protein SS1G_04619 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 794
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF++ TGL +E QFPA L ++K LA R L++ +GG
Sbjct: 573 VDGFTGLKVDMSFEN--DTGLIANKTFQEWKAQFPAMPILVTIIKHLLAMRGLNEPVNGG 630
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ + + L+ LQ+ EHHLG
Sbjct: 631 IGGFSVTCLVVSLLQNMPQVKSKTMIPEHHLG 662
>gi|444314869|ref|XP_004178092.1| hypothetical protein TBLA_0A07840 [Tetrapisispora blattae CBS 6284]
gi|387511131|emb|CCH58573.1| hypothetical protein TBLA_0A07840 [Tetrapisispora blattae CBS 6284]
Length = 711
Score = 50.8 bits (120), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/258 (22%), Positives = 97/258 (37%), Gaps = 84/258 (32%)
Query: 1212 LLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSD 1271
LL EEI F ++ + N + ++ ++++ LW S +FGS AT L LP SD
Sbjct: 204 LLTEEIKDFVAYISP-SREEIELRNQTIGKIRKAVKKLWKNSDLYVFGSYATDLYLPGSD 262
Query: 1272 VDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT----VEN 1327
+D V+ I AG E R+ + + W+K + L T +
Sbjct: 263 IDCVI----------ISSAGDKENRHSLYQLS-----------SWLKQNKLATKVEVIAK 301
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHT 1387
+PII V + +++H D+ S
Sbjct: 302 ARVPIIKFV---------------------------EPSSNIHIDV-----------SFE 323
Query: 1388 SSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
S+ ++AA ++R L +T P L L++KQFL R L+
Sbjct: 324 RSNGLEAAKTIR----------DWLNST----------PGLRELVLIIKQFLNSRRLNDV 363
Query: 1448 YSGGLSSYCLMLLITRFL 1465
+ GGL ++ ++ FL
Sbjct: 364 HLGGLGGLSIICMVYSFL 381
>gi|71657648|ref|XP_817336.1| DNA polymerase sigma [Trypanosoma cruzi strain CL Brener]
gi|70882521|gb|EAN95485.1| DNA polymerase sigma, putative [Trypanosoma cruzi]
Length = 680
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
LD+ + G + + +V ++ FP + PL +++K FL R + + Y GGL S+ L
Sbjct: 374 LDVDVSINAEDGQRNSAIVADMLRSFPEARPLIVLVKYFLQQRGMHEPYHGGLGSFATTL 433
Query: 1460 LITRFLQ-HEHHLGRP 1474
L+ FLQ H + RP
Sbjct: 434 LVISFLQNHPIYTERP 449
>gi|255082912|ref|XP_002504442.1| predicted protein [Micromonas sp. RCC299]
gi|226519710|gb|ACO65700.1| predicted protein [Micromonas sp. RCC299]
Length = 998
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 52/241 (21%), Positives = 93/241 (38%), Gaps = 20/241 (8%)
Query: 1235 INWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP-SSDVDLVVCLPPVRNLEPIKEAGIL 1293
++ ++ + +Q L+P + +FGS T +P +S++D+ + LP +P K+ L
Sbjct: 620 VDMLIRTMNAIVQSLYPTASVEVFGSFPTASWVPGASNLDIALSLPEAVTSDPQKK---L 676
Query: 1294 EGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQS 1353
+ N + A AN WV + V + P+I+L + Q
Sbjct: 677 DALNSL-------AVALRANPHWV--SEVNVVPSPFRPLILLTTHSAFFQPPPPPQAAQG 727
Query: 1354 PKEDAAHTTLKHDNH--VHSDMVALDDSASPKCSHTSSDNIKAATS-----VRLDISFKS 1406
P+ + +SD +S + + IS K
Sbjct: 728 PRPPPGAPPTANGGSPGANSDSPGAPPLPPGPPPPAASVGFGLGNGGVGLPLEVAISIKG 787
Query: 1407 PSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
H G +++ ++PA P+ V K +LA+R L Y GG+ SY L L+ +Q
Sbjct: 788 KDHKGAAAMKFLRQAETEYPALAPVLAVQKAYLANRGLRGVYKGGIGSYSLALMALHSMQ 847
Query: 1467 H 1467
Sbjct: 848 Q 848
>gi|367040851|ref|XP_003650806.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
gi|346998067|gb|AEO64470.1| hypothetical protein THITE_2110633 [Thielavia terrestris NRRL 8126]
Length = 759
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ T +++DISF++ TGL + K EQ+P L ++K FL R L++ +GG
Sbjct: 506 IEHRTGLKVDISFEN--RTGLTAIETFKAWREQYPGMPALVTLIKHFLLMRGLNEPVNGG 563
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLGR 1473
+ + ++ L+ LQ HHLG+
Sbjct: 564 IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQ 596
>gi|66363120|ref|XP_628526.1| TRF4/5 nucletotidyl transferase [Cryptosporidium parvum Iowa II]
gi|46229540|gb|EAK90358.1| TRF4/5 nucletotidyl transferase [Cryptosporidium parvum Iowa II]
gi|323509243|dbj|BAJ77514.1| cgd7_3630 [Cryptosporidium parvum]
gi|323510373|dbj|BAJ78080.1| cgd7_3630 [Cryptosporidium parvum]
Length = 480
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + + +DIS S + TT + E ++F PL +V+K FL R LD++Y GG
Sbjct: 233 VDSEACISVDISLNQES--SIDTTKHISECLKKFSTLRPLLIVIKLFLRQRGLDETYLGG 290
Query: 1452 LSSYCLMLLITRFLQH 1467
L SY L+ FLQ
Sbjct: 291 LGSYSQFCLVLSFLQQ 306
>gi|67623901|ref|XP_668233.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54659437|gb|EAL38014.1| hypothetical protein Chro.70404 [Cryptosporidium hominis]
Length = 480
Score = 50.8 bits (120), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + + +DIS S + TT + E ++F PL +V+K FL R LD++Y GG
Sbjct: 233 VDSEACISVDISLNQES--SIDTTKHISECLKKFSTLRPLLIVIKLFLRQRGLDETYLGG 290
Query: 1452 LSSYCLMLLITRFLQH 1467
L SY L+ FLQ
Sbjct: 291 LGSYSQFCLVLSFLQQ 306
>gi|388851758|emb|CCF54564.1| related to TRF4-topoisomerase I-related protein [Ustilago hordei]
Length = 701
Score = 50.8 bits (120), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 1398 VRLDISFKSPSHT-GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
+++DIS +HT GL T V ++P PL LV+K L R + + +SGGL SY
Sbjct: 211 LKVDISL---NHTNGLTTASYVNGWLRKWPHIRPLILVIKHLLMQRGMSEVFSGGLGSYS 267
Query: 1457 LMLLITRFLQHEHHLGR 1473
+++++ FLQ L R
Sbjct: 268 VIIMVISFLQLHPKLQR 284
>gi|238879008|gb|EEQ42646.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 603
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 91/231 (39%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + + + WP + T++FGS AT L LP SD+D+VV + E G E
Sbjct: 191 NNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVV----------VSETGDYEN 240
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + +L ++ K +++ + + +PII V V
Sbjct: 241 R-----SRLYQLSTFLRTKKLAK--NVEVIASAKVPIIKFVDPV---------------- 277
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ + AA +R L +T
Sbjct: 278 -----------SELHIDV-----------SFERTNGLDAAKRIR----------RWLIST 305
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL LV+KQFL R L+ + GGL Y +++ FL+
Sbjct: 306 PGLREL----------VLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 346
>gi|392580130|gb|EIW73257.1| hypothetical protein TREMEDRAFT_22292, partial [Tremella mesenterica
DSM 1558]
Length = 303
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 12/109 (11%)
Query: 1370 HSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSH------------TGLQTTDL 1417
H+ L+D A +DN+ + R+ I H G+ +
Sbjct: 91 HNKKNLLNDLARTMRYAMITDNVVVISKARVPIIKFVTKHGKLNVDISLNQVNGISAGKI 150
Query: 1418 VKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ + + P + L LV+K FL+ RS+++ Y+GGL SY ++ L+ FLQ
Sbjct: 151 INQYLDVIPGARQLILVVKAFLSQRSMNEVYTGGLGSYSVICLVISFLQ 199
>gi|68480208|ref|XP_715914.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
gi|68480321|ref|XP_715864.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437507|gb|EAK96852.1| hypothetical protein CaO19.429 [Candida albicans SC5314]
gi|46437559|gb|EAK96903.1| hypothetical protein CaO19.8059 [Candida albicans SC5314]
Length = 603
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 53/231 (22%), Positives = 91/231 (39%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + + + WP + T++FGS AT L LP SD+D+VV + E G E
Sbjct: 191 NNVISTLKKEIGKFWPGTETHVFGSCATDLYLPGSDIDMVV----------VSETGDYEN 240
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + +L ++ K +++ + + +PII V V
Sbjct: 241 R-----SRLYQLSTFLRTKKLAK--NVEVIASAKVPIIKFVDPV---------------- 277
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ + AA +R L +T
Sbjct: 278 -----------SELHIDV-----------SFERTNGLDAAKRIR----------RWLIST 305
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL LV+KQFL R L+ + GGL Y +++ FL+
Sbjct: 306 PGLREL----------VLVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 346
>gi|315054687|ref|XP_003176718.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
gi|311338564|gb|EFQ97766.1| Poly(A) RNA polymerase cid14 [Arthroderma gypseum CBS 118893]
Length = 627
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + T +++D+SF + S GL D K +++P +A ++K FL R L+ +GG
Sbjct: 387 VDSLTGLKVDLSFDNSS--GLAAIDTFKAWKQEYPTMPVIASIIKHFLLIRGLNDVATGG 444
Query: 1452 LSSYCLMLLITRFLQH 1467
L + ++ L+T LQH
Sbjct: 445 LGGFSIICLVTSILQH 460
>gi|403356668|gb|EJY77933.1| Poly(A) RNA polymerase putative [Oxytricha trifallax]
Length = 397
Score = 50.4 bits (119), Expect = 0.007, Method: Composition-based stats.
Identities = 22/56 (39%), Positives = 37/56 (66%)
Query: 1411 GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
GL+ LV L ++FP P+ V+K FL +++L + Y+GG+ S+ L+ +IT +LQ
Sbjct: 216 GLKGLSLVTTLIKEFPELRPMYFVIKAFLKNKNLHKPYTGGIGSFVLINMITFYLQ 271
>gi|348684587|gb|EGZ24402.1| hypothetical protein PHYSODRAFT_349922 [Phytophthora sojae]
Length = 592
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 63/125 (50%), Gaps = 8/125 (6%)
Query: 1215 EEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
EE+ +C A + +I+ A+ + ++ +WP + FGS +TGL LPSSDVD+
Sbjct: 288 EEMARYCAYTQRVVDAARQHIDAAIAHIADGVRKIWPHASVMCFGSYSTGLWLPSSDVDV 347
Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQE-WVKSDSLKTVENTAIPII 1333
VV + N E ++ +G+KE L+ AR L Q WV+ + V +P+
Sbjct: 348 VVM--GLSNSED-RDVPTRFVPDGVKE--LEQLARQLRPQRSWVQ--RVDVVRGAKVPVA 400
Query: 1334 MLVVE 1338
L++
Sbjct: 401 KLILR 405
>gi|403365629|gb|EJY82602.1| PAP-associated domain-containing protein, putative [Oxytricha
trifallax]
Length = 1647
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 50/211 (23%), Positives = 97/211 (45%), Gaps = 31/211 (14%)
Query: 1257 IFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEW 1316
++GS A+ L++ SDVDL + + + IKE I E R + + L Q
Sbjct: 1331 MYGSMASKLAIEESDVDLAII---GLDFKGIKELQIKEMR-----VLYEQLMQVLKAQST 1382
Query: 1317 VKSDSLKTVENTAIPIIMLVVEVP--HDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMV 1374
V+ +E IP+I L V++ + I S + ++ + ++++ +
Sbjct: 1383 VQ-----FIETATIPVIKLRVDLEKIRESITCNQESQEHQQQSSGCSSIEDQMRYLGVDI 1437
Query: 1375 ALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALV 1434
+DS S +++ ++ R+++ G+Q +++L P+ L
Sbjct: 1438 TFEDS--------SMNSLYSSDGQRINL--------GIQCISYIQQLCAAQAELKPIVLF 1481
Query: 1435 LKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
LK+ L + SL++ Y GGL+SY L+L+ + FL
Sbjct: 1482 LKKLLQNNSLNEPYHGGLNSYSLILMTSAFL 1512
>gi|145488123|ref|XP_001430066.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124397161|emb|CAK62668.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 50.4 bits (119), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 50/99 (50%), Gaps = 11/99 (11%)
Query: 1372 DMVALDDSASP----KCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPA 1427
D+ +D++ P +CS + + +DI+ H G +T + + E ++F
Sbjct: 393 DVKVIDNAKKPVLKMQCSKEFQEKL-------IDITISRNDHFGRKTANSMMEFQKEFKQ 445
Query: 1428 STPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
LAL+LK + +L +Y GGL+SYC++ +I LQ
Sbjct: 446 FKSLALMLKFYFKSINLLNAYQGGLNSYCVLTMILALLQ 484
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 45/75 (60%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P K SL + L EI++F ++++ +KP I+ V+ V ++Q + P+SR ++G
Sbjct: 299 PQHKISLDDIVKQLTFEINTFTQELSKYLDEQKPIIDKVVQLVDETVQNVQPKSRAFLYG 358
Query: 1260 SNATGLSLPSSDVDL 1274
S TGL+L SD+D+
Sbjct: 359 SCQTGLNLLDSDIDI 373
>gi|428186552|gb|EKX55402.1| hypothetical protein GUITHDRAFT_83772 [Guillardia theta CCMP2712]
Length = 440
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 60/127 (47%), Gaps = 19/127 (14%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH EI QV N+ R V V +++ W + ++GS +T L LP SDV
Sbjct: 97 LHSEIVELLNQVYPTNSTRTRR-QRIVDLVDEAVKSCWEGAHVEVYGSFSTDLYLPHSDV 155
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D++V KE I E+ ++H A L + W + ++++ENT++P+
Sbjct: 156 DVLVI--------GAKEWSI--------ESKMKHLASRLKSMPWCR--YMRSIENTSVPV 197
Query: 1333 IMLVVEV 1339
I L +V
Sbjct: 198 IKLSADV 204
>gi|328770392|gb|EGF80434.1| hypothetical protein BATDEDRAFT_25005 [Batrachochytrium dendrobatidis
JAM81]
Length = 547
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 2/79 (2%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
+R+DI + S G ++T+L+K ++P P+ALVLK + LD+ ++GGL Y L
Sbjct: 161 IRIDIVANTDS--GNESTELMKTWVGKYPTLIPMALVLKLLIRQHRLDKVFTGGLGGYAL 218
Query: 1458 MLLITRFLQHEHHLGRPIN 1476
+ +I L + H I+
Sbjct: 219 LNMIVAVLIRDQHCSDEID 237
>gi|154323524|ref|XP_001561076.1| hypothetical protein BC1G_00161 [Botryotinia fuckeliana B05.10]
Length = 692
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF++ TGL +E QFPA L ++K LA R L++ +GG
Sbjct: 524 VDAITGLKVDMSFEN--DTGLIANKTFQEWKAQFPAMPILVTIIKHLLAMRGLNEPVNGG 581
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ + + L+ LQ+ EH+LG
Sbjct: 582 IGGFSVTCLVVSLLQNLPQVKSGTMVPEHNLG 613
>gi|429966451|gb|ELA48448.1| hypothetical protein VCUG_00057 [Vavraia culicis 'floridensis']
Length = 400
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 38/67 (56%), Gaps = 2/67 (2%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R D+S S G+ T VK++ +Q P +AL LK FL R L++S GGL SY +
Sbjct: 206 RYDLSLNQDS--GIVHTKFVKKVLQQQPYIKDIALFLKYFLKSRGLNESKRGGLCSYAQL 263
Query: 1459 LLITRFL 1465
L+I FL
Sbjct: 264 LMIISFL 270
>gi|393218042|gb|EJD03530.1| Nucleotidyltransferase [Fomitiporia mediterranea MF3/22]
Length = 534
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+E+ F +A R+ + V R+ L+ WP ++ +FGSNATGL L D+
Sbjct: 56 LHDELVQFIGYIAPSKQEREAR-SMLVARIEDLLKSRWPDAQVTVFGSNATGLELTGGDI 114
Query: 1273 DLVVCL 1278
DLVV
Sbjct: 115 DLVVAF 120
>gi|71005312|ref|XP_757322.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
gi|46096726|gb|EAK81959.1| hypothetical protein UM01175.1 [Ustilago maydis 521]
Length = 730
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 41/70 (58%), Gaps = 4/70 (5%)
Query: 1398 VRLDISFKSPSHT-GLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
+++DIS +HT GL T V ++P PL LV+K L R + + +SGGL SY
Sbjct: 211 LKVDISL---NHTNGLTTASYVNSWLRKWPHIRPLILVVKYLLMQRGMSEVFSGGLGSYS 267
Query: 1457 LMLLITRFLQ 1466
+++++ FLQ
Sbjct: 268 VIIMVISFLQ 277
>gi|134080951|emb|CAK41466.1| unnamed protein product [Aspergillus niger]
Length = 614
Score = 50.4 bits (119), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ +GGL +
Sbjct: 311 TGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 368
Query: 1456 CLMLLITRFLQHEHH 1470
+ L+T LQH H
Sbjct: 369 SITCLVTSLLQHLPH 383
>gi|358373036|dbj|GAA89636.1| topoisomerase family protein Trf4 [Aspergillus kawachii IFO 4308]
Length = 682
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 44/79 (55%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ +GG
Sbjct: 375 VDKLTGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGG 432
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 433 LGGFSITCLVTSLLQHLPH 451
>gi|168036465|ref|XP_001770727.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677945|gb|EDQ64409.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1560
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 117/263 (44%), Gaps = 35/263 (13%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH EI F + AR A+ V +++ LWP + +FGS ATGL L SDV
Sbjct: 1039 LHSEILEFAHTASPSAEARS-CAEAAIDCVRGAVKKLWPNADVEVFGSYATGLCLSHSDV 1097
Query: 1273 DLVVCLPPVRNLEPIK---EAGILEGRNGIKETCLQHAARYLA----NQEWVKSDSLKTV 1325
D+VV P P+K G+ R + + L + R L + +W K S++T+
Sbjct: 1098 DVVVVDAP----PPLKLPDTTGLTGTR--LVASLLAPSIRLLGAALQDYDWCK--SIRTI 1149
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCS 1385
++ +P+I L L+ S D A +K D + + A+ S
Sbjct: 1150 DSATMPVIKLQCR---PLVTSL---------DPAAPVVKIDITIGGKKTNEEAVATASQS 1197
Query: 1386 HTSSDNIKAATSVRLDISFKSPS--HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
S+ TS +D F+ H G + V + +Q PA PL L+LK +L +
Sbjct: 1198 SEGSE-----TSRHVDSQFELARKFHNGAAAREYVIKRLQQQPALAPLVLLLKSYLQSKG 1252
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQ 1466
L Y+GGL S+ L +++ +L+
Sbjct: 1253 LKDVYTGGLGSFSLTIMLVFYLE 1275
>gi|347830146|emb|CCD45843.1| hypothetical protein [Botryotinia fuckeliana]
Length = 788
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ A T +++D+SF++ TGL +E QFPA L ++K LA R L++ +GG
Sbjct: 564 VDAITGLKVDMSFEN--DTGLIANKTFQEWKAQFPAMPILVTIIKHLLAMRGLNEPVNGG 621
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ + + L+ LQ+ EH+LG
Sbjct: 622 IGGFSVTCLVVSLLQNLPQVKSGTMVPEHNLG 653
>gi|350638915|gb|EHA27270.1| hypothetical protein ASPNIDRAFT_44704 [Aspergillus niger ATCC 1015]
Length = 682
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ +GGL +
Sbjct: 379 TGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 436
Query: 1456 CLMLLITRFLQHEHH 1470
+ L+T LQH H
Sbjct: 437 SITCLVTSLLQHLPH 451
>gi|317034217|ref|XP_001396205.2| topoisomerase family protein TRF4 [Aspergillus niger CBS 513.88]
Length = 682
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ +GGL +
Sbjct: 379 TGLRVDLSFDNDS--GLVANETFQQWKAEYPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 436
Query: 1456 CLMLLITRFLQHEHH 1470
+ L+T LQH H
Sbjct: 437 SITCLVTSLLQHLPH 451
>gi|241948905|ref|XP_002417175.1| topoisomerase 1-related protein TRF4, putative [Candida dubliniensis
CD36]
gi|223640513|emb|CAX44767.1| topoisomerase 1-related protein TRF4, putative [Candida dubliniensis
CD36]
Length = 606
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 52/231 (22%), Positives = 91/231 (39%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + + + WP + T++FGS AT L LP SD+D+VV + E G E
Sbjct: 194 NNVISTLKTEIGMFWPGTETHVFGSCATDLYLPGSDIDMVV----------VSETGDYEN 243
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R + L + +L ++ K +++ + + +PII V +
Sbjct: 244 R-----SRLYQLSTFLRTKKLAK--NVEVIASAKVPIIKFVDPI---------------- 280
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ +H D+ S ++ + AA +R L +T
Sbjct: 281 -----------SELHIDV-----------SFERTNGLDAAKRIR----------RWLIST 308
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL LV+KQFL R L+ + GGL Y +++ FL+
Sbjct: 309 PGLRELV----------LVIKQFLRSRRLNNVHVGGLGGYATIIMCYHFLR 349
>gi|115395044|ref|XP_001213471.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114193040|gb|EAU34740.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 680
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 45/79 (56%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + ++ +++PA + V+KQFL R L++ +GG
Sbjct: 371 VDKLTGLRVDLSFDNDS--GLIANNTFQQWKQEYPAMPVIVSVVKQFLLLRGLNEVPTGG 428
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 429 LGGFSITCLVTSLLQHLPH 447
>gi|403218109|emb|CCK72601.1| hypothetical protein KNAG_0K02380 [Kazachstania naganishii CBS 8797]
Length = 632
Score = 50.1 bits (118), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 59/256 (23%), Positives = 106/256 (41%), Gaps = 82/256 (32%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
L +EI F ++ +E +R N AV+++ ++++ LW S ++FGS AT L +P
Sbjct: 165 LTQEIKDFVAYISPTGSEIISR----NRAVQKIRKAVRSLWRDSDLHVFGSYATDLYMPG 220
Query: 1270 SDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTA 1329
SD+D VV + ++ N L AR+L ++ ++ + T
Sbjct: 221 SDIDCVV------------NSTSMDKENT---QYLYELARHLRDENLAV--QIEVISRTR 263
Query: 1330 IPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
+PII + PH +++H D+ S
Sbjct: 264 VPIIKFI--EPH-------------------------SNLHIDV-----------SFERL 285
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
+ I+AA +R L+ T ++EL L++KQFLA R L+ ++
Sbjct: 286 NGIEAARLIR----------GWLRETPGLRELV----------LIIKQFLAARRLNDVHT 325
Query: 1450 GGLSSYCLMLLITRFL 1465
GGL + ++ L+ F+
Sbjct: 326 GGLGGFSIICLVYSFM 341
>gi|407835610|gb|EKF99338.1| DNA polymerase sigma, putative [Trypanosoma cruzi]
Length = 398
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
LD+ + G + + +V ++ FP + PL +++K FL R + + Y GGL S+ L
Sbjct: 92 LDVDVSINAEDGQRNSAIVADMLRSFPEARPLIVLVKYFLQQRGMHEPYHGGLGSFATTL 151
Query: 1460 LITRFLQ-HEHHLGRP 1474
L+ FLQ H + RP
Sbjct: 152 LVISFLQNHPIYTERP 167
>gi|302762448|ref|XP_002964646.1| hypothetical protein SELMODRAFT_22150 [Selaginella moellendorffii]
gi|300168375|gb|EFJ34979.1| hypothetical protein SELMODRAFT_22150 [Selaginella moellendorffii]
Length = 313
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 61/121 (50%), Gaps = 16/121 (13%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC ++ + A+ R++ +++ LW + IFGS ATGL LP+SD+
Sbjct: 8 LHQEILDFCTFIS-PTVEEEAERAAAINRISNAIKSLWSHCKIEIFGSYATGLYLPTSDI 66
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS----LKTVENT 1328
D V+ ++G +G++ ++ R LA + + S + +K VE+T
Sbjct: 67 DAVIL-----------DSGCDRIVSGLRAISIELRKRKLARKITIISKARVPIIKFVEST 115
Query: 1329 A 1329
+
Sbjct: 116 S 116
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+++ + + DISF G + ++ P PL +VLK FL R L++ Y GG
Sbjct: 112 VESTSGISFDISFDL--RNGPEAAKFIQRAIGTLPPLRPLCMVLKIFLQQRELNEVYCGG 169
Query: 1452 LSSYCLMLLITRFLQH---EHHLG 1472
+ SY L++++ LQ HH G
Sbjct: 170 IGSYGLLIMLITHLQMHPSRHHGG 193
>gi|302815631|ref|XP_002989496.1| hypothetical protein SELMODRAFT_11961 [Selaginella moellendorffii]
gi|300142674|gb|EFJ09372.1| hypothetical protein SELMODRAFT_11961 [Selaginella moellendorffii]
Length = 307
Score = 50.1 bits (118), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 61/121 (50%), Gaps = 16/121 (13%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC ++ + A+ R++ +++ LW + IFGS ATGL LP+SD+
Sbjct: 8 LHQEILDFCTFISP-TVEEEAERAAAINRISNAIKSLWSHCKIEIFGSYATGLYLPTSDI 66
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDS----LKTVENT 1328
D V+ ++G +G++ ++ R LA + + S + +K VE+T
Sbjct: 67 DAVIL-----------DSGCDRIVSGLRAISIELRKRKLARKITIISKARVPIIKFVEST 115
Query: 1329 A 1329
+
Sbjct: 116 S 116
Score = 48.1 bits (113), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 5/84 (5%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+++ + + DISF G + ++ P PL +VLK FL R L++ Y GG
Sbjct: 112 VESTSGISFDISFDL--RNGPEAAKFIQRAIGTLPPLRPLCMVLKIFLQQRELNEVYCGG 169
Query: 1452 LSSYCLMLLITRFLQH---EHHLG 1472
+ SY L++++ LQ HH G
Sbjct: 170 IGSYGLLIMLITHLQMHPSRHHGG 193
>gi|406862982|gb|EKD16031.1| topoisomerase family protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 868
Score = 50.1 bits (118), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 63/272 (23%), Positives = 97/272 (35%), Gaps = 79/272 (29%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI F V + ++ + ++ Q + FGS GL LP+SD+
Sbjct: 539 LHKEIMDFYDYVKPRDFEQEIRLRLVEDLRSKFKQYFYSDDDIQPFGSFPAGLYLPTSDM 598
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQ-EWVKSDSLKTVENTAIP 1331
DLV I I +GR R LAN E++K
Sbjct: 599 DLVC----------ISREYIADGRK-----IFGQGTRGLANFCEFLKK------------ 631
Query: 1332 IIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDN 1391
E P D S + P +V DS
Sbjct: 632 -----YEFPLDKKIEVISKAKVP------------------LVKYMDSL----------- 657
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
T +++DISF++ TG+ + ++FPA L ++K LA R L++ +GG
Sbjct: 658 ----TGLKVDISFEN--TTGIVANTTFQNWKKEFPAVPILVTLIKHLLAMRGLNEPVNGG 711
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ + + L+ LQ+ EHHLG
Sbjct: 712 IGGFSVTCLVVSLLQNMPQVQSRNLVPEHHLG 743
>gi|340518876|gb|EGR49116.1| hypothetical protein TRIREDRAFT_61322 [Trichoderma reesei QM6a]
Length = 679
Score = 50.1 bits (118), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ AT +R+D+SF++ G++ + + +++PA L V+K +L R L++ +GG
Sbjct: 463 VDKATGLRVDVSFENLG--GVRAVETFQRWKKEYPAMPALVTVIKHYLLMRGLNEPVNGG 520
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
L + ++ L+ LQ EHHLG
Sbjct: 521 LGGFTVICLVVSMLQLMPEVQSGSLVPEHHLG 552
>gi|145346336|ref|XP_001417645.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577873|gb|ABO95938.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 781
Score = 50.1 bits (118), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 34/69 (49%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
+ + IS K +H GL + V+ EQ A PL V K LA + L Y GGL SY L
Sbjct: 561 LEVHISLKDANHKGLSSMQFVQAAEEQHGALAPLVCVQKAVLASKGLRGVYRGGLGSYAL 620
Query: 1458 MLLITRFLQ 1466
L+ +Q
Sbjct: 621 TLMALTAIQ 629
>gi|328860813|gb|EGG09918.1| hypothetical protein MELLADRAFT_115680 [Melampsora larici-populina
98AG31]
Length = 987
Score = 50.1 bits (118), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 102/262 (38%), Gaps = 37/262 (14%)
Query: 1209 MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
+ +L EI SF + E+ R I K VT + WP + FGS T L
Sbjct: 230 VQQMLTAEIGSFVAYIRPTREEDELRLMIIEMIRKAVT----MQWPDADVVPFGSFGTKL 285
Query: 1266 SLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTV 1325
LP D+DLV+ + +K+A K L A L Q + +
Sbjct: 286 YLPGGDIDLVIL-----STRMMKDA---------KSKILYRLAPLLREQNI--GQDVVVI 329
Query: 1326 ENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVAL-DDSASPKC 1384
+PII + + + S + L+ N + D+ L D + +
Sbjct: 330 AKAKVPIIKFKT-----IFGNFQVDI-SINQSNGLVALEKVNELLDDVKYLSKDLRAEQR 383
Query: 1385 SHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSL 1444
H S N + ++ + + + ++ E+ A+ + LV+K L R +
Sbjct: 384 HHQHSRNQSKRRNEETNLEMD-------EEEEAISKVVEELGAAKCMILVIKSVLKQRGM 436
Query: 1445 DQSYSGGLSSYCLMLLITRFLQ 1466
++ YSGGL SY ++ L+ FLQ
Sbjct: 437 NEVYSGGLGSYSIICLVISFLQ 458
>gi|407044805|gb|EKE42834.1| nucleotidyltransferase domain containing protein [Entamoeba nuttalli
P19]
Length = 338
Score = 49.7 bits (117), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
L I G+ TT +K L +FP + L+L +K L SL++ Y GGL SY ++L
Sbjct: 131 LSIDISVDCDNGIVTTSYIKTLLTEFPLARTLSLFIKFLLFQNSLNEPYHGGLGSYAIIL 190
Query: 1460 LITRFLQH 1467
L+ +L++
Sbjct: 191 LVCTYLKN 198
>gi|365758850|gb|EHN00675.1| Trf5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 642
Score = 49.7 bits (117), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ + + +D+SF+ + GL+ L++E P L LV+KQFL R L+ ++GG
Sbjct: 281 IEPLSQLHIDVSFERTN--GLEAARLIREWLRDSPGLRELVLVIKQFLHSRRLNNVHTGG 338
Query: 1452 LSSYCLMLLITRFL 1465
L + ++ L+ FL
Sbjct: 339 LGGFTVICLVYSFL 352
>gi|401837953|gb|EJT41787.1| TRF5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 642
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ + + +D+SF+ + GL+ L++E P L LV+KQFL R L+ ++GG
Sbjct: 281 IEPLSQLHIDVSFERTN--GLEAARLIREWLRDSPGLRELVLVIKQFLHSRRLNNVHTGG 338
Query: 1452 LSSYCLMLLITRFL 1465
L + ++ L+ FL
Sbjct: 339 LGGFTVICLVYSFL 352
>gi|67465459|ref|XP_648914.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56465219|gb|EAL43527.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449703139|gb|EMD43640.1| PAPassociated domain containing protein [Entamoeba histolytica KU27]
Length = 338
Score = 49.7 bits (117), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
L I G+ TT +K L +FP + L+L +K L SL++ Y GGL SY ++L
Sbjct: 131 LSIDISVDCDNGIVTTSYIKTLLTEFPLARTLSLFIKFLLFQNSLNEPYHGGLGSYAIIL 190
Query: 1460 LITRFLQH 1467
L+ +L++
Sbjct: 191 LVCTYLKN 198
>gi|295671679|ref|XP_002796386.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226283366|gb|EEH38932.1| Poly(A) RNA polymerase cid14 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 725
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + TGL + QFPA + ++KQFL R L++ +GGL +
Sbjct: 413 TGLKVDLSFDN--STGLAANKTFQVWKSQFPAMPVIVSIIKQFLLLRGLNEVPTGGLGGF 470
Query: 1456 CLMLLITRFLQHEHHLG 1472
++ L+T LQH + G
Sbjct: 471 SIICLVTSLLQHLPYSG 487
>gi|255953511|ref|XP_002567508.1| Pc21g04620 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211589219|emb|CAP95359.1| Pc21g04620 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 708
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL D ++ Q+P + V+KQFL R L++ +GG
Sbjct: 366 VDKLTGLRVDLSFDNDS--GLIANDTFQKWKTQYPMMPVILAVIKQFLLIRGLNEVPTGG 423
Query: 1452 LSSYCLMLLITRFLQH 1467
L + + L+T LQH
Sbjct: 424 LGGFSITCLVTSLLQH 439
>gi|452823485|gb|EME30495.1| DNA polymerase sigma subunit [Galdieria sulphuraria]
Length = 417
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 42/75 (56%), Gaps = 6/75 (8%)
Query: 1206 LSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGL 1265
++L + +L E++ F E RK I +RVT ++ +WP S ++FGS AT L
Sbjct: 95 IALHYEIL--ELERFVSGTREETKQRKQLI----ERVTEIIRQIWPNSSVHVFGSFATNL 148
Query: 1266 SLPSSDVDLVVCLPP 1280
LP+SD+DL + P
Sbjct: 149 YLPTSDIDLCILSSP 163
Score = 44.7 bits (104), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 47/83 (56%), Gaps = 7/83 (8%)
Query: 1373 MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLA 1432
++A+D + P T + T ++ DISF + G++ +++ +++P+ PL
Sbjct: 188 VMAIDKARVPIIKVTDRE-----TGIQCDISFGRTN--GIENVRHIQKYLKRYPSLRPLM 240
Query: 1433 LVLKQFLADRSLDQSYSGGLSSY 1455
+V+K FL R+L++ + GG+ SY
Sbjct: 241 MVIKCFLHQRALNEVHEGGIGSY 263
>gi|226288571|gb|EEH44083.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 684
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + TGL + QFPA + V+KQFL R L++ +GGL +
Sbjct: 375 TGLKVDLSFDN--STGLAANKTFQVWKSQFPAMPVIVSVIKQFLLLRGLNEVPTGGLGGF 432
Query: 1456 CLMLLITRFLQHEHHLG 1472
++ L+T LQH + G
Sbjct: 433 SIICLVTSLLQHLPYSG 449
>gi|119482616|ref|XP_001261336.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
NRRL 181]
gi|119409491|gb|EAW19439.1| topoisomerase family protein TRF4, putative [Neosartorya fischeri
NRRL 181]
Length = 701
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + + ++PA + V+KQFL R L++ +GG
Sbjct: 374 VDKMTGLRVDLSFDNDS--GLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGG 431
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 432 LGGFSITCLVTSLLQHLPH 450
>gi|159123133|gb|EDP48253.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
A1163]
Length = 703
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + + ++PA + V+KQFL R L++ +GG
Sbjct: 376 VDKMTGLRVDLSFDNDS--GLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGG 433
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 434 LGGFSITCLVTSLLQHLPH 452
>gi|167391312|ref|XP_001739718.1| PAP-associated domain-containing protein [Entamoeba dispar SAW760]
gi|165896477|gb|EDR23882.1| PAP-associated domain-containing protein, putative [Entamoeba dispar
SAW760]
Length = 337
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
L I G+ TT +K L +FP + L+L +K L SL++ Y GGL SY ++L
Sbjct: 131 LSIDISVDCDNGIVTTTYIKTLLTEFPLARTLSLFIKFLLFQNSLNEPYHGGLGSYAIIL 190
Query: 1460 LITRFLQH 1467
L+ +L++
Sbjct: 191 LVCTYLKN 198
>gi|70987233|ref|XP_749095.1| topoisomerase family protein TRF4 [Aspergillus fumigatus Af293]
gi|66846725|gb|EAL87057.1| topoisomerase family protein TRF4, putative [Aspergillus fumigatus
Af293]
Length = 702
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + + ++PA + V+KQFL R L++ +GG
Sbjct: 375 VDKMTGLRVDLSFDNDS--GLIANNTFQNWKSEYPAMPVIVAVVKQFLLLRGLNEVPTGG 432
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 433 LGGFSITCLVTSLLQHLPH 451
>gi|320587855|gb|EFX00330.1| topoisomerase family protein [Grosmannia clavigera kw1407]
Length = 816
Score = 49.7 bits (117), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF++ TG+ + EQ+PA L ++KQFL R L++ +GG
Sbjct: 596 VDVKTWLKVDVSFEN--MTGVAAVRTFTKWREQYPAMPVLVTLVKQFLTMRGLNEPVNGG 653
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
+ + ++ L+ LQ EHHLG
Sbjct: 654 IGGFSMICLVVSMLQLMPPIQSKDMKGEHHLG 685
>gi|365982357|ref|XP_003668012.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
gi|343766778|emb|CCD22769.1| hypothetical protein NDAI_0A06140 [Naumovozyma dairenensis CBS 421]
Length = 684
Score = 49.7 bits (117), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 51/230 (22%), Positives = 88/230 (38%), Gaps = 75/230 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ ++++ LW S+ +IFGS AT L LP SD+D VV + G E
Sbjct: 227 NKTISKLRKAVKELWSDSQLHIFGSYATDLYLPGSDIDCVVN----------SKMGDKEQ 276
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
R L AR+L + + ++ + +PII V
Sbjct: 277 RQ-----YLYDLARHLKQKGL--TSQVEVIAKARVPIIKFV------------------- 310
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+ + +H D+ S ++ ++AA +R
Sbjct: 311 --------EKSSQIHIDV-----------SFERTNGVEAAKLIR---------------- 335
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
E P L L++KQFL+ R L+ ++GGL + ++ L+ FL
Sbjct: 336 ----EWLSATPGLRELILIVKQFLSARRLNDVHTGGLGGFTIICLVYSFL 381
>gi|366988339|ref|XP_003673936.1| hypothetical protein NCAS_0A09970 [Naumovozyma castellii CBS 4309]
gi|342299799|emb|CCC67555.1| hypothetical protein NCAS_0A09970 [Naumovozyma castellii CBS 4309]
Length = 683
Score = 49.7 bits (117), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + G + L++E + P L LVLKQFLA + L++ +GG
Sbjct: 303 VEPRSKIHIDVSFEKTN--GAEAAKLIREWIKDTPGLRELVLVLKQFLAVKKLNEVVNGG 360
Query: 1452 LSSYCLMLLITRFLQ 1466
L + ++ L+ FL+
Sbjct: 361 LGGFSIICLVYAFLR 375
>gi|444317060|ref|XP_004179187.1| hypothetical protein TBLA_0B08530 [Tetrapisispora blattae CBS 6284]
gi|387512227|emb|CCH59668.1| hypothetical protein TBLA_0B08530 [Tetrapisispora blattae CBS 6284]
Length = 461
Score = 49.7 bits (117), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ +++ +DISF+ + GL+ L++ E P L L++KQFL R L+ +GG
Sbjct: 176 IEPNSNIHIDISFERTN--GLEAAKLIRSWLETTPGLRELVLIIKQFLHARRLNNVRTGG 233
Query: 1452 LSSYCLMLLITRFLQHEH 1469
L + ++ L+ FL H H
Sbjct: 234 LGGFSIICLVYSFL-HLH 250
>gi|145519187|ref|XP_001445460.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412915|emb|CAK78063.1| unnamed protein product [Paramecium tetraurelia]
Length = 1073
Score = 49.7 bits (117), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 59/272 (21%), Positives = 121/272 (44%), Gaps = 47/272 (17%)
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
+ S+ +EID F + A + + ++RV ++ + T IFGS+ TGL+L
Sbjct: 723 VKSIYQKEIDQFNRYKVARDIS--------IQRVQHVIKSYFQNCETEIFGSSTTGLALK 774
Query: 1269 SSDVDLVV----CLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKT 1324
SDVD+VV + EP+++ ++E + +K T K
Sbjct: 775 DSDVDMVVYGLHVYTKQQLFEPMRK--LIEIFSELKWTV-----------------QCKH 815
Query: 1325 VENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKC 1384
+ ++P+I ++V+ D + S K + + ++ N + L+
Sbjct: 816 IFQASVPLIKVLVDPSIDFL--------SFKGEPKYILMQCRN------LDLNLKYGDPS 861
Query: 1385 SHTSSD-NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
H D + R+ + + G Q+T ++ ++ + +A+ LK+ L +
Sbjct: 862 QHIFIDITFELTPPYRIYNPYIQAYNIGFQSTQYTIDICDKIQGFSEVAIYLKKLLKIKD 921
Query: 1444 LDQSYSGGLSSYCLMLLITRFLQHEHHLGRPI 1475
L+ SY+GG+SS+CL +++ Q +H++G+ +
Sbjct: 922 LNDSYTGGISSFCLTIMLAAIGQ-DHNIGQKL 952
>gi|71659078|ref|XP_821264.1| DNA polymerase sigma [Trypanosoma cruzi strain CL Brener]
gi|70886638|gb|EAN99413.1| DNA polymerase sigma, putative [Trypanosoma cruzi]
Length = 685
Score = 49.7 bits (117), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
LD+ + G + + +V ++ FP + PL +++K FL R + + Y GGL S+ L
Sbjct: 379 LDVDVSINAEDGQRNSAIVADMLCSFPEARPLIVLVKYFLQQRGMHEPYHGGLGSFATTL 438
Query: 1460 LITRFLQ-HEHHLGRP 1474
L+ FLQ H + RP
Sbjct: 439 LVISFLQNHPIYTERP 454
>gi|190407236|gb|EDV10503.1| DNA polymerase sigma [Saccharomyces cerevisiae RM11-1a]
gi|259149371|emb|CAY86175.1| Pap2p [Saccharomyces cerevisiae EC1118]
Length = 584
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + G++ L++E + P L L++KQFL R L+ ++GG
Sbjct: 284 VEPHSGIHIDVSFERTN--GIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGG 341
Query: 1452 LSSYCLMLLITRFLQHEH 1469
L + ++ L+ FL H H
Sbjct: 342 LGGFSIICLVFSFL-HMH 358
>gi|256271045|gb|EEU06149.1| Pap2p [Saccharomyces cerevisiae JAY291]
Length = 584
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + G++ L++E + P L L++KQFL R L+ ++GG
Sbjct: 284 VEPHSGIHIDVSFERTN--GIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGG 341
Query: 1452 LSSYCLMLLITRFLQHEH 1469
L + ++ L+ FL H H
Sbjct: 342 LGGFSIICLVFSFL-HMH 358
>gi|50286703|ref|XP_445781.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525087|emb|CAG58700.1| unnamed protein product [Candida glabrata]
Length = 485
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + +DISF+ + GL+ L+++ P L LV+KQFL R L+ +SGG
Sbjct: 208 VEVESRTHMDISFERLN--GLEAAKLIRDWLASTPGLRELVLVVKQFLHSRRLNNVHSGG 265
Query: 1452 LSSYCLMLLITRFLQ 1466
L + ++ L+ FL+
Sbjct: 266 LGGFSIICLVYSFLR 280
>gi|410730487|ref|XP_003671423.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
gi|401780241|emb|CCD26180.2| hypothetical protein NDAI_0G04030 [Naumovozyma dairenensis CBS 421]
Length = 582
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I + + +D+SF+ + GL+ L++E P + L++KQFL R L+ ++GG
Sbjct: 274 IDPRSKIHIDVSFERIN--GLEAAKLIREWLNDTPGLREITLIVKQFLHARRLNNVHTGG 331
Query: 1452 LSSYCLMLLITRFLQ 1466
L + ++ L+ FLQ
Sbjct: 332 LGGFSIICLVFSFLQ 346
>gi|151945519|gb|EDN63760.1| DNA polymerase sigma [Saccharomyces cerevisiae YJM789]
Length = 584
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + G++ L++E + P L L++KQFL R L+ ++GG
Sbjct: 284 VEPHSGIHIDVSFERTN--GIEAAKLIREWLDDTPGLRELVLIVKQFLHARRLNNVHTGG 341
Query: 1452 LSSYCLMLLITRFLQHEH 1469
L + ++ L+ FL H H
Sbjct: 342 LGGFSIICLVFSFL-HMH 358
>gi|409052310|gb|EKM61786.1| hypothetical protein PHACADRAFT_190976 [Phanerochaete carnosa
HHB-10118-sp]
Length = 434
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 49/109 (44%), Gaps = 8/109 (7%)
Query: 1366 DNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRL--------DISFKSPSHTGLQTTDL 1417
+ +H ++VA + A+P + + A + D+ L
Sbjct: 68 EQRLHDEIVAFMNYATPNATERTVRERLIARLTEIIQRRYWCSDVKVFGSMAQDLSLLSG 127
Query: 1418 VKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
V+E + PA PL +V+K FLA + L + +GGL SY L L++ FLQ
Sbjct: 128 VQEYLDTMPALRPLIMVIKMFLAAKGLHSAGTGGLGSYALTLMVISFLQ 176
>gi|154272285|ref|XP_001536995.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150408982|gb|EDN04438.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 673
Score = 49.3 bits (116), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + TG+ + QFPA + V+KQFL R L++ +GGL +
Sbjct: 363 TGLKVDLSFDN--STGITANKTFQVWKSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGF 420
Query: 1456 CLMLLITRFLQHEHHLG 1472
++ L+T LQH + G
Sbjct: 421 SIICLVTSLLQHLPYSG 437
>gi|50294195|ref|XP_449509.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528823|emb|CAG62485.1| unnamed protein product [Candida glabrata]
Length = 626
Score = 49.3 bits (116), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + GL+ L++E P L LV+KQFL R L+ ++GG
Sbjct: 292 VEPESDIHIDVSFERSN--GLEAAKLIREWIGDTPGLRELTLVVKQFLHARRLNDVHTGG 349
Query: 1452 LSSYCLMLLITRFLQ 1466
L + ++ L+ FL+
Sbjct: 350 LGGFSIICLVFSFLR 364
>gi|225681443|gb|EEH19727.1| polymerase sigma [Paracoccidioides brasiliensis Pb03]
Length = 684
Score = 49.3 bits (116), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + TGL + QFPA + ++KQFL R L++ +GGL +
Sbjct: 375 TGLKVDLSFDN--STGLAANKTFQVWKSQFPAMPVIVSIIKQFLLLRGLNEVPTGGLGGF 432
Query: 1456 CLMLLITRFLQHEHHLG 1472
++ L+T LQH + G
Sbjct: 433 SIICLVTSLLQHLPYSG 449
>gi|189204442|ref|XP_001938556.1| PAP-associated domain containing protein 5 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985655|gb|EDU51143.1| PAP-associated domain containing protein 5 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 599
Score = 49.3 bits (116), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 2/73 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++DISF++ S G+Q E ++P + ++KQFL R L++ ++GG+ Y
Sbjct: 379 TKLQVDISFENLS--GVQAQATFVEWKAKYPDMIYMVALMKQFLVMRGLNEVHTGGIGGY 436
Query: 1456 CLMLLITRFLQHE 1468
++ LI +LQ E
Sbjct: 437 SIICLIVSYLQLE 449
>gi|323303298|gb|EGA57094.1| Trf5p [Saccharomyces cerevisiae FostersB]
Length = 419
Score = 49.3 bits (116), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ + + +D+SF+ GL+ L++E P L L++KQFL R L+ ++GG
Sbjct: 247 IEPQSQLHIDVSFERT--XGLEAAKLIREWLRDSPGLRELVLIIKQFLHSRRLNNVHTGG 304
Query: 1452 LSSYCLMLLITRFL 1465
L + ++ L+ FL
Sbjct: 305 LGGFTVICLVYSFL 318
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 51/101 (50%), Gaps = 17/101 (16%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ R+++ LW + ++FGS AT L LP SD+D C+ RN + E
Sbjct: 164 NRTIDKLRRAVKELWSDADLHVFGSFATDLYLPGSDID---CVVNSRNRDK-------ED 213
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
RN I E AR+L N+ + ++ + T +PII +
Sbjct: 214 RNYIYE-----LARHLKNKGL--AIRMEVIVKTRVPIIKFI 247
>gi|367030285|ref|XP_003664426.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
42464]
gi|347011696|gb|AEO59181.1| hypothetical protein MYCTH_2307240 [Myceliophthora thermophila ATCC
42464]
Length = 772
Score = 49.3 bits (116), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ T +++D+SF++ +TG+ K EQ+P L ++K FL R L++ +GG
Sbjct: 517 IEHKTGLKVDVSFEN--NTGVNAIKTFKAWREQYPGMPALVTLIKHFLLMRGLNEPVNGG 574
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLGR 1473
+ + ++ L+ LQ HHLG+
Sbjct: 575 IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQ 607
>gi|308803585|ref|XP_003079105.1| nucleotidyltransferase family protein (ISS) [Ostreococcus tauri]
gi|116057560|emb|CAL53763.1| nucleotidyltransferase family protein (ISS) [Ostreococcus tauri]
Length = 1238
Score = 49.3 bits (116), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 33/69 (47%)
Query: 1398 VRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCL 1457
+ + IS K SH GL V+ EQ A PL V K LA + L Y GGL SY L
Sbjct: 1019 LEVHISLKDGSHKGLSALRFVQAAEEQHGALAPLVCVQKAVLASKGLRGVYKGGLGSYAL 1078
Query: 1458 MLLITRFLQ 1466
++ +Q
Sbjct: 1079 TMMALTSIQ 1087
>gi|124513398|ref|XP_001350055.1| nucleotidyltransferase, putative [Plasmodium falciparum 3D7]
gi|23615472|emb|CAD52463.1| nucleotidyltransferase, putative [Plasmodium falciparum 3D7]
Length = 858
Score = 49.3 bits (116), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I V++DI S +TTD + +++ PL L+LK FL R+L+++Y GG
Sbjct: 530 IDKELGVQIDICINQKS--SKETTDFIITQMKKYIYLRPLVLLLKFFLHSRNLNETYIGG 587
Query: 1452 LSSYCLMLLITRFLQ 1466
+ S+ L ++ FLQ
Sbjct: 588 IGSFLLSCMVLHFLQ 602
>gi|82596357|ref|XP_726229.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23481547|gb|EAA17794.1| topoisomerase-related function protein [Plasmodium yoelii yoelii]
Length = 600
Score = 48.9 bits (115), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 37/54 (68%)
Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++TD VK+ +++ PL +++K FL R+L+++Y+GG+ S+ L ++ FLQ
Sbjct: 400 ESTDFVKKKIKEYIYLRPLVILIKFFLNSRNLNETYTGGIGSFMLCCMVLHFLQ 453
>gi|146077494|ref|XP_001463285.1| topoisomerase-related function protein-like protein [Leishmania
infantum JPCM5]
gi|134067369|emb|CAM65641.1| topoisomerase-related function protein-like protein [Leishmania
infantum JPCM5]
Length = 917
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+D+ + G + ++ + + +P + PL LV+K F+ R + + Y GGL SY L
Sbjct: 518 IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVMQRGMHEPYYGGLGSYAATL 577
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + +P
Sbjct: 578 LVVAFLRQHPIYTTQP 593
>gi|157864534|ref|XP_001680977.1| topoisomerase-related function protein-like protein [Leishmania major
strain Friedlin]
gi|68124270|emb|CAJ07032.1| topoisomerase-related function protein-like protein [Leishmania major
strain Friedlin]
Length = 916
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+D+ + G + ++ + + +P + PL LV+K F+ R + + Y GGL SY L
Sbjct: 515 IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVMQRGMHEPYYGGLGSYAATL 574
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + +P
Sbjct: 575 LVVAFLRQHPIYTTQP 590
>gi|336466097|gb|EGO54262.1| hypothetical protein NEUTE1DRAFT_124553 [Neurospora tetrasperma FGSC
2508]
gi|350287057|gb|EGZ68304.1| hypothetical protein NEUTE2DRAFT_118045 [Neurospora tetrasperma FGSC
2509]
Length = 708
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + T +++D+SF++ TGLQ +QFP L ++K FL R L++ +GG
Sbjct: 489 VDSRTGLKIDVSFEN--DTGLQAIKTFHAWRDQFPVMPVLVTLIKHFLCMRGLNEPVNGG 546
Query: 1452 LSSYCLMLLITRFLQ 1466
+ + + L+T LQ
Sbjct: 547 IGGFTVTCLVTSMLQ 561
>gi|398010636|ref|XP_003858515.1| topoisomerase-related function protein-like protein [Leishmania
donovani]
gi|322496723|emb|CBZ31792.1| topoisomerase-related function protein-like protein [Leishmania
donovani]
Length = 916
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+D+ + G + ++ + + +P + PL LV+K F+ R + + Y GGL SY L
Sbjct: 517 IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVMQRGMHEPYYGGLGSYAATL 576
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + +P
Sbjct: 577 LVVAFLRQHPIYTTQP 592
>gi|401415443|ref|XP_003872217.1| topoisomerase-related function protein-like protein [Leishmania
mexicana MHOM/GT/2001/U1103]
gi|322488440|emb|CBZ23686.1| topoisomerase-related function protein-like protein [Leishmania
mexicana MHOM/GT/2001/U1103]
Length = 912
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+D+ + G + ++ + + +P + PL LV+K F+ R + + Y GGL SY L
Sbjct: 512 IDVDISVGAVDGKRNSECIVQYMNMYPEALPLTLVVKYFVTQRGMHEPYYGGLGSYAATL 571
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + +P
Sbjct: 572 LVVAFLRQHPIYTTQP 587
>gi|238482039|ref|XP_002372258.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
NRRL3357]
gi|220700308|gb|EED56646.1| topoisomerase family protein TRF4, putative [Aspergillus flavus
NRRL3357]
Length = 608
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ GG
Sbjct: 299 VDKMTGLRVDLSFDNDS--GLVANNTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPCGG 356
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 357 LGGFSITCLVTSLLQHLPH 375
>gi|46127561|ref|XP_388334.1| hypothetical protein FG08158.1 [Gibberella zeae PH-1]
Length = 708
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF++ G+ D + EQ+PA L V+K FL R L++ +GG
Sbjct: 494 VDKQTGLKVDVSFEN--LGGVNAIDTFLQWKEQYPAMPILVTVIKHFLLMRGLNEPVNGG 551
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
+ + ++ L+ LQ EHHLG
Sbjct: 552 IGGFSVICLVVSMLQLMPQVQSRSLVPEHHLG 583
>gi|255081724|ref|XP_002508084.1| predicted protein [Micromonas sp. RCC299]
gi|226523360|gb|ACO69342.1| predicted protein [Micromonas sp. RCC299]
Length = 1070
Score = 48.9 bits (115), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 41/68 (60%), Gaps = 1/68 (1%)
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
++ LH ++ SF +++ + + K+V+ ++ L+P +R +FGS ATGL+L
Sbjct: 627 LYGTLHSQVSSFASTAGPDDSDLQDRL-VVQKKVSEAITALYPEARLEVFGSGATGLALK 685
Query: 1269 SSDVDLVV 1276
+D+DLVV
Sbjct: 686 CADIDLVV 693
>gi|317139015|ref|XP_001817214.2| topoisomerase family protein TRF4 [Aspergillus oryzae RIB40]
Length = 659
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ GG
Sbjct: 350 VDKMTGLRVDLSFDNDS--GLVANNTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPCGG 407
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 408 LGGFSITCLVTSLLQHLPH 426
>gi|408389494|gb|EKJ68941.1| hypothetical protein FPSE_10866 [Fusarium pseudograminearum CS3096]
Length = 708
Score = 48.9 bits (115), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF++ G+ D + EQ+PA L V+K FL R L++ +GG
Sbjct: 494 VDKQTGLKVDVSFENLG--GVNAIDTFLQWKEQYPAMPILVTVIKHFLLMRGLNEPVNGG 551
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
+ + ++ L+ LQ EHHLG
Sbjct: 552 IGGFSVICLVVSMLQLMPQVQSRSLVPEHHLG 583
>gi|85097984|ref|XP_960550.1| hypothetical protein NCU05588 [Neurospora crassa OR74A]
gi|28922043|gb|EAA31314.1| predicted protein [Neurospora crassa OR74A]
Length = 715
Score = 48.9 bits (115), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + T +++D+SF++ TGLQ +QFP L ++K FL R L++ +GG
Sbjct: 496 VDSRTGLKIDVSFEN--DTGLQAIKTFHAWRDQFPVMPVLVTLIKHFLCMRGLNEPVNGG 553
Query: 1452 LSSYCLMLLITRFLQ 1466
+ + + L+T LQ
Sbjct: 554 IGGFTVTCLVTSMLQ 568
>gi|145511526|ref|XP_001441685.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408946|emb|CAK74288.1| unnamed protein product [Paramecium tetraurelia]
Length = 622
Score = 48.9 bits (115), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 64/293 (21%), Positives = 120/293 (40%), Gaps = 55/293 (18%)
Query: 1193 PLEVQNCPTRKASLSLMHSLLHE--EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLW 1250
P Q P ++ L+L S H+ E+ S Q + +N RK + R +++
Sbjct: 282 PHSTQTTPIKRKKLNLTKSKTHDIDELSSAILQKSYQNRIRKQIC------LNRLNYLIY 335
Query: 1251 PRSRTNIF--GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG-RNGIKETCLQHA 1307
N++ GS TGL L SDVD+ V +NL + L+ N +K+T
Sbjct: 336 TNYHLNMYPYGSFETGLDLEISDVDVGVW--GSQNLSYAQIVSFLQLLNNTLKQTPFLIK 393
Query: 1308 ARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDN 1367
++ + Q +PI+ L + +PK D
Sbjct: 394 SKLIQGQ---------------MPILKLEL---------------NPKSGF----YDEDQ 419
Query: 1368 HVHSD--MVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQF 1425
H+ + LD + + I +++V + +P H G +T +VK+ +F
Sbjct: 420 HIQQNWSYFHLDHNDQGRIIQVDITWIYQSSNV-----YNNP-HLGFASTTIVKDWIHRF 473
Query: 1426 PASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
+ L+LKQ + ++L+ +++GG+SS+CL +++ +H+ Q+
Sbjct: 474 VWYRDIMLILKQLVKSKNLNDAHTGGISSFCLSIMLAAIYMCKHYTQNDKKQI 526
>gi|358385776|gb|EHK23372.1| hypothetical protein TRIVIDRAFT_114164, partial [Trichoderma virens
Gv29-8]
Length = 674
Score = 48.9 bits (115), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ AT +R+D+SF++ G+ + +++PA L V+K FL R L++ +GG
Sbjct: 465 VDKATGLRVDVSFEN--LGGVTAIETFARWKKEYPAMPILVTVIKHFLLMRGLNEPVNGG 522
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
L + ++ L+ LQ EHHLG
Sbjct: 523 LGGFTIICLVVSMLQLMPQVQSRNLIPEHHLG 554
>gi|145528001|ref|XP_001449800.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417389|emb|CAK82403.1| unnamed protein product [Paramecium tetraurelia]
Length = 1063
Score = 48.9 bits (115), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 117/268 (43%), Gaps = 39/268 (14%)
Query: 1209 MHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLP 1268
+ + +EID F + AA + + ++RV ++ + T IFGS+ TGL+L
Sbjct: 713 VKQIYQKEIDQFNRYKAARDIS--------IQRVQHVIKSYFKNCDTQIFGSSTTGLALK 764
Query: 1269 SSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENT 1328
SDVD+VV L+ + + E + E +W + K +
Sbjct: 765 DSDVDMVVY-----GLQVFTKQQLFEPMRRLIEI--------FTELKW--AVQCKHIFQA 809
Query: 1329 AIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS 1388
++P+I ++V+ D + S K + + ++ N + L+ H
Sbjct: 810 SVPLIKVLVDPSIDFL--------SFKGEPKYIIMQCRN------LDLNLKYGDPSQHIF 855
Query: 1389 SD-NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQS 1447
D + A S + + + G Q+T ++ + + +A+ LK+ L + L+ S
Sbjct: 856 IDITFELAPSYAIYNPYIQAFNIGFQSTQFTIDICGKIQGFSEVAIYLKKLLKIKDLNDS 915
Query: 1448 YSGGLSSYCLMLLITRFLQHEHHLGRPI 1475
Y+GG+SS+CL +++ Q +H +G+ +
Sbjct: 916 YTGGISSFCLTIMLAAIGQ-DHSIGQKL 942
>gi|336276454|ref|XP_003352980.1| hypothetical protein SMAC_03298 [Sordaria macrospora k-hell]
gi|380092465|emb|CCC09742.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 781
Score = 48.5 bits (114), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + T +++D+SF++ TGLQ +QFP L ++K FL R L++ +GG
Sbjct: 562 VDSRTGLKIDVSFEN--DTGLQAIKTFHAWRDQFPVMPVLVTLIKHFLCMRGLNEPVNGG 619
Query: 1452 LSSYCLMLLITRFLQ 1466
+ + + L+T LQ
Sbjct: 620 IGGFTVTCLVTSMLQ 634
>gi|414881049|tpg|DAA58180.1| TPA: hypothetical protein ZEAMMB73_639297, partial [Zea mays]
Length = 260
Score = 48.5 bits (114), Expect = 0.026, Method: Composition-based stats.
Identities = 24/64 (37%), Positives = 41/64 (64%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI FC ++ +T + AV+ V+ ++ +WP+ + +FGS TGL LP+SD+
Sbjct: 131 LHKEILDFCDFISP-STEEQSSRAAAVQDVSDVVKHIWPQCKVEVFGSFRTGLYLPTSDI 189
Query: 1273 DLVV 1276
D+V+
Sbjct: 190 DVVI 193
>gi|325095205|gb|EGC48515.1| topoisomerase TRF4 [Ajellomyces capsulatus H88]
Length = 443
Score = 48.5 bits (114), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++D+SF + TG+ + QFPA + V+KQFL R L++ +GGL +
Sbjct: 133 TGLKVDLSFDNS--TGIAANKTFQVWKSQFPAMPVIVSVVKQFLLLRGLNEVATGGLGGF 190
Query: 1456 CLMLLITRFLQH 1467
++ L+T LQH
Sbjct: 191 SIICLVTSLLQH 202
>gi|229595481|ref|XP_001017236.2| Nucleotidyltransferase domain containing protein [Tetrahymena
thermophila]
gi|225565997|gb|EAR96991.2| Nucleotidyltransferase domain containing protein [Tetrahymena
thermophila SB210]
Length = 1151
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 38/64 (59%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
L+EEI F ++V N +P + +K+V ++ +P + N +GS AT L LP SD+
Sbjct: 832 LNEEIREFEREVNEYNKRMEPTFDELIKKVEDIIKQQYPNIKLNKYGSYATKLCLPWSDI 891
Query: 1273 DLVV 1276
D+VV
Sbjct: 892 DIVV 895
>gi|209879195|ref|XP_002141038.1| PAP/25A associated domain-containing protein [Cryptosporidium muris
RN66]
gi|209556644|gb|EEA06689.1| PAP/25A associated domain-containing protein [Cryptosporidium muris
RN66]
Length = 485
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ + + + +DIS S + TT + E ++F PL + +K FL R LD++Y GG
Sbjct: 236 VDSESCISVDISLNQES--CIDTTQHICECLKRFDVLRPLIITIKLFLHQRGLDETYLGG 293
Query: 1452 LSSYCLMLLITRFLQH 1467
L SY LI F+Q
Sbjct: 294 LGSYAQFCLILSFIQQ 309
>gi|156843407|ref|XP_001644771.1| hypothetical protein Kpol_1020p21 [Vanderwaltozyma polyspora DSM
70294]
gi|156115421|gb|EDO16913.1| hypothetical protein Kpol_1020p21 [Vanderwaltozyma polyspora DSM
70294]
Length = 640
Score = 48.5 bits (114), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 44/75 (58%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +DISF+ + GL+ +++E + P L L++KQFLA R L+ ++GG
Sbjct: 255 VEPNSKLHIDISFERLN--GLEVAKIIREWLDDTPGLRELVLIVKQFLAARRLNNVHTGG 312
Query: 1452 LSSYCLMLLITRFLQ 1466
L ++ L+ FL+
Sbjct: 313 LGGLSIICLVYSFLK 327
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 44/101 (43%), Gaps = 17/101 (16%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + ++ SL+ LW S ++FGS AT L LPSSD+D VV E G E
Sbjct: 172 NVTISKIRNSLKELWSDSDLHVFGSYATDLYLPSSDIDCVVN----------SENGDKEN 221
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
RN + LA Q ++ + +PII V
Sbjct: 222 RNDLYSLATHFKRNGLAIQ-------VEVIAKARVPIIKFV 255
>gi|323452526|gb|EGB08400.1| hypothetical protein AURANDRAFT_64151 [Aureococcus anophagefferens]
Length = 1245
Score = 48.5 bits (114), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 25/56 (44%), Positives = 37/56 (66%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+DISF++P H GL ++ E ++P PL LVLK+ L R L++ ++GGLSSY
Sbjct: 972 VDISFEAPDHGGLASSRYALEAMSRWPELAPLMLVLKELLVQRGLNEPFTGGLSSY 1027
>gi|440492821|gb|ELQ75356.1| DNA polymerase sigma [Trachipleistophora hominis]
Length = 402
Score = 48.5 bits (114), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 24/54 (44%), Positives = 34/54 (62%)
Query: 1412 LQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
L T VK++ + PA +AL LKQFL R L+++ GGL+SY +L+I FL
Sbjct: 219 LAHTKFVKDILHKQPAIKDMALFLKQFLKCRGLNENRRGGLNSYAQLLMIISFL 272
>gi|70945383|ref|XP_742517.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521545|emb|CAH76038.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 411
Score = 48.1 bits (113), Expect = 0.034, Method: Composition-based stats.
Identities = 24/71 (33%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V +DI S +TT+ VK+ +++ PL +++K FL R L+ +YSGG+ S+
Sbjct: 114 SGVNVDICINQESSR--ETTEFVKKKIKEYIYLRPLVIIIKFFLNSRGLNDTYSGGIGSF 171
Query: 1456 CLMLLITRFLQ 1466
L ++ FLQ
Sbjct: 172 MLCCMVLHFLQ 182
>gi|302405651|ref|XP_003000662.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
gi|261360619|gb|EEY23047.1| Poly(A) RNA polymerase cid14 [Verticillium albo-atrum VaMs.102]
Length = 723
Score = 48.1 bits (113), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 7/81 (8%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I A T +R+DISF G + EQ+PA L ++K FLA R L++ +GG
Sbjct: 522 IDAVTGLRVDISFDR--MDGPAAIKTFLDWKEQYPALPILVTIIKHFLAMRGLNEPVNGG 579
Query: 1452 LSSYCLMLLITRFLQHEHHLG 1472
+ S+ L+ EHHLG
Sbjct: 580 IGSFSSKNLVP-----EHHLG 595
>gi|320583303|gb|EFW97518.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Ogataea
parapolymorpha DL-1]
Length = 538
Score = 48.1 bits (113), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 29/125 (23%), Positives = 61/125 (48%), Gaps = 2/125 (1%)
Query: 1342 DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLD 1401
D+ S ++ K D + + +++ + + ++ A K + AT + +D
Sbjct: 169 DIDMVVVSKARNNKYDNRSSLYQLSSYIRNHRLGVNVEAIAKAKVPIIKFVDPATKIHID 228
Query: 1402 ISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLI 1461
ISF+ + G++ +L+ + P L L++KQFL+ R L+ ++GGL + + L+
Sbjct: 229 ISFERTN--GIKAAELIISWLKDTPGLRELVLIVKQFLSVRKLNIVHTGGLGGFATICLV 286
Query: 1462 TRFLQ 1466
FL+
Sbjct: 287 RSFLK 291
>gi|154332398|ref|XP_001562573.1| topoisomerase-related function protein-like protein [Leishmania
braziliensis MHOM/BR/75/M2904]
gi|134059463|emb|CAM41689.1| topoisomerase-related function protein-like protein [Leishmania
braziliensis MHOM/BR/75/M2904]
gi|449310632|gb|AGE92543.1| topoisomerase-related protein [Leishmania braziliensis]
Length = 892
Score = 48.1 bits (113), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+D+ + G + ++ + + +P + PL LV+K F+ R + + Y GGL SY L
Sbjct: 500 IDVDISVGAVDGKRNSECIVQYMNAYPEALPLILVVKYFVMQRGMHEPYHGGLGSYATTL 559
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + +P
Sbjct: 560 LVVAFLRQHPIYTTQP 575
>gi|301105803|ref|XP_002901985.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099323|gb|EEY57375.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 444
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 64/125 (51%), Gaps = 8/125 (6%)
Query: 1215 EEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDL 1274
+E+ +C + +I+ A+ +T ++ +WP + FGS +TGL LPSSDVD+
Sbjct: 290 QEMTRYCTYTQRVVDEARQHIDAAIAHITEGVRKIWPHASVTCFGSYSTGLWLPSSDVDV 349
Query: 1275 VVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQE-WVKSDSLKTVENTAIPII 1333
VV + N E ++ +G+KE L+ AR L Q+ WV+ + V +P+
Sbjct: 350 VVM--GLSNSED-RDVPTRFLPDGVKE--LEQLARQLRPQKSWVQ--RVDVVRGAKVPVA 402
Query: 1334 MLVVE 1338
+++
Sbjct: 403 KVILR 407
>gi|449310610|gb|AGE92532.1| topoisomerase-related protein [Leishmania braziliensis]
Length = 893
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+D+ + G + ++ + + +P + PL LV+K F+ R + + Y GGL SY L
Sbjct: 501 IDVDISVGAVDGKRNSECIVQYMNAYPEALPLILVVKYFVMQRGMHEPYHGGLGSYATTL 560
Query: 1460 LITRFL-QHEHHLGRP 1474
L+ FL QH + +P
Sbjct: 561 LVVAFLRQHPIYTTQP 576
>gi|83765069|dbj|BAE55212.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391870459|gb|EIT79642.1| DNA polymerase sigma [Aspergillus oryzae 3.042]
Length = 390
Score = 48.1 bits (113), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + ++ ++PA + V+KQFL R L++ GG
Sbjct: 177 VDKMTGLRVDLSFDNDS--GLVANNTFQQWKSEYPAMPVIVSVIKQFLLLRGLNEVPCGG 234
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 235 LGGFSITCLVTSLLQHLPH 253
>gi|121711275|ref|XP_001273253.1| topoisomerase family protein TRF4, putative [Aspergillus clavatus
NRRL 1]
gi|119401404|gb|EAW11827.1| topoisomerase family protein TRF4, putative [Aspergillus clavatus
NRRL 1]
Length = 704
Score = 48.1 bits (113), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 2/79 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +R+D+SF + S GL + + ++PA + V+KQFL R L++ +GG
Sbjct: 377 VDKLTGLRVDLSFDNDS--GLIANNTFQSWKSEYPAMPVILSVVKQFLLLRGLNEVPTGG 434
Query: 1452 LSSYCLMLLITRFLQHEHH 1470
L + + L+T LQH H
Sbjct: 435 LGGFSITCLVTSLLQHLPH 453
>gi|451844986|gb|EMD58301.1| hypothetical protein COCSADRAFT_165704 [Cochliobolus sativus ND90Pr]
Length = 642
Score = 48.1 bits (113), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 43/72 (59%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++DISF++ S G+Q + +++P + +LKQFL L++ ++GG+ +
Sbjct: 447 TQLQVDISFENLS--GVQAQATFAQWKQEYPDMIYMVALLKQFLVMHGLNEVHTGGIGGF 504
Query: 1456 CLMLLITRFLQH 1467
++ LI ++QH
Sbjct: 505 SIICLIVSYIQH 516
>gi|432948655|ref|XP_004084114.1| PREDICTED: DNA polymerase sigma-like, partial [Oryzias latipes]
Length = 383
Score = 47.8 bits (112), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 31/44 (70%)
Query: 1423 EQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+++P L VLKQFL R L++ ++GG+SSY L+L++ FLQ
Sbjct: 24 QKYPVLPYLIFVLKQFLLQRDLNEVFTGGISSYSLILMVISFLQ 67
>gi|302691928|ref|XP_003035643.1| hypothetical protein SCHCODRAFT_104957 [Schizophyllum commune H4-8]
gi|300109339|gb|EFJ00741.1| hypothetical protein SCHCODRAFT_104957, partial [Schizophyllum
commune H4-8]
Length = 671
Score = 47.8 bits (112), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTE--QFP-ASTPLALVLKQFLADRSLDQSY 1448
I AA R I +GL ++D++ Q P A L L+LK +L+ R +++ Y
Sbjct: 265 ITAAEYGRFQIDISVNQFSGLVSSDIINGFQRGMQCPIAIRSLVLILKLYLSQRGMNEVY 324
Query: 1449 SGGLSSYCLMLLITRFLQ 1466
+GGL SY ++ L+ FLQ
Sbjct: 325 TGGLGSYSIVCLVLSFLQ 342
>gi|145535359|ref|XP_001453418.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124421129|emb|CAK86021.1| unnamed protein product [Paramecium tetraurelia]
Length = 634
Score = 47.8 bits (112), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 61/299 (20%), Positives = 126/299 (42%), Gaps = 58/299 (19%)
Query: 1193 PLEVQNCPTRKASLSLMHSLLHE--EIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLW 1250
P Q P ++ L+L+ S ++ E+ ++ +Q + +N RK + R +++
Sbjct: 285 PHSTQTTPIKRKQLNLVKSKTYDIDELSNYIQQKSYQNRIRKQIC------LNRLNYLIY 338
Query: 1251 PRSRTNIF--GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG-RNGIKETCLQHA 1307
N++ GS TGL L SDVD V + +NL + L+ N +K+T
Sbjct: 339 TNYHLNMYPYGSFETGLDLEISDVD--VGIWGSQNLSYSQIVSFLQLLNNTLKQTPFLIK 396
Query: 1308 ARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDN 1367
++ + +Q +PI+ L + +PK + D
Sbjct: 397 SKLIQSQ---------------MPILKLEL---------------NPKS----SFYDEDQ 422
Query: 1368 HVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDIS--------FKSPSHTGLQTTDLVK 1419
H+ + + + SS + +++DI+ + +P H G +T +VK
Sbjct: 423 HIQQNWSYFHLDQNDQGIIQSSQFL--GKIIQVDITWIYQWSNVYNNP-HLGFASTTIVK 479
Query: 1420 ELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLGRPINQV 1478
+ +F + L+LK + ++L+ +++GG+SS+CL +++ +H+ Q+
Sbjct: 480 DWVSRFVWYRDIMLILKHLVKSKNLNDAHTGGISSFCLSIMLAAIYMCKHYTQNDKKQI 538
>gi|452839453|gb|EME41392.1| hypothetical protein DOTSEDRAFT_46399 [Dothistroma septosporum NZE10]
Length = 754
Score = 47.8 bits (112), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF++ S GL ++ + +P L +++KQ LA R L++ ++GG
Sbjct: 442 VDGHTGIKVDLSFENDS--GLTANQTFQQWKKDYPEMPVLVMIIKQMLAMRGLNEVHTGG 499
Query: 1452 LSSYCLMLLITRFLQ 1466
+ + ++ L+ LQ
Sbjct: 500 IGGFTIICLVVSMLQ 514
>gi|196008079|ref|XP_002113905.1| hypothetical protein TRIADDRAFT_57804 [Trichoplax adhaerens]
gi|190582924|gb|EDV22995.1| hypothetical protein TRIADDRAFT_57804 [Trichoplax adhaerens]
Length = 1063
Score = 47.4 bits (111), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 64/279 (22%), Positives = 106/279 (37%), Gaps = 69/279 (24%)
Query: 1198 NCPTRKASLS---LMHSLLHEEIDSFC---KQVAAENTARKPYINWAVKRVTRSLQVLWP 1251
NCP ++L+ M+ ID FC K A + K + + RVT+++Q ++P
Sbjct: 426 NCPESISTLNPLPEMNIDYLRAIDEFCYYVKDYYAMSAEEKQFREEILHRVTKAMQAIFP 485
Query: 1252 RSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYL 1311
+ ++FGS+ G SDVD+ + +P +L + E L G Q A R +
Sbjct: 486 EATLHLFGSSKNGFGTKQSDVDMCMMIPD-DSLNCLDEK--LRG---------QEAIRRI 533
Query: 1312 ANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHS 1371
A Q KS V++ + + +V
Sbjct: 534 AKQLRKKSRDFAKVQDISRATVPIV----------------------------------- 558
Query: 1372 DMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVK---ELTEQFPAS 1428
+ +P CS + DIS+++ + T+L+ L ++ P
Sbjct: 559 KFYDVRRYVNPTCS--------LNRKLSCDISYQNA--LAVHNTNLLASYGSLDDRIPI- 607
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
L LVLK + + G LSSY L++ FLQH
Sbjct: 608 --LVLVLKLIAKACDIGDASRGSLSSYAHTLMMIYFLQH 644
>gi|346972692|gb|EGY16144.1| Poly(A) RNA polymerase cid14 [Verticillium dahliae VdLs.17]
Length = 726
Score = 47.4 bits (111), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 40/81 (49%), Gaps = 7/81 (8%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I A T +R+DISF G EQ+PA L ++K FLA R L++ +GG
Sbjct: 525 IDAVTGLRVDISFDR--MDGPAAIKTFLNWKEQYPALPILVTIIKHFLAMRGLNEPVNGG 582
Query: 1452 LSSYCLMLLITRFLQHEHHLG 1472
+ S+ L+ EHHLG
Sbjct: 583 IGSFSSKNLVP-----EHHLG 598
>gi|307107081|gb|EFN55325.1| hypothetical protein CHLNCDRAFT_134313 [Chlorella variabilis]
Length = 1656
Score = 47.4 bits (111), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 13/78 (16%)
Query: 1211 SLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGS---------- 1260
S LH E+ +F + A + AV V+ + + LWP+SRT +FGS
Sbjct: 1118 SPLHHEVAAFAAR-ATPTPTEVAAVQQAVLAVSEAARSLWPQSRTALFGSQASSGGPRRL 1176
Query: 1261 --NATGLSLPSSDVDLVV 1276
+ATGL+LP SD+D+VV
Sbjct: 1177 LASATGLALPGSDLDIVV 1194
Score = 43.9 bits (102), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 2/70 (2%)
Query: 1401 DISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLL 1460
D+S + G Q D V+ P PL L +K FL R ++ ++GGLSSY + +
Sbjct: 1260 DVSLGA--ENGAQAVDFVRRQVLVVPPLRPLCLAVKAFLRQRGSNEVFTGGLSSYSVFNM 1317
Query: 1461 ITRFLQHEHH 1470
+ LQ E +
Sbjct: 1318 VMAHLQLEGY 1327
>gi|344255571|gb|EGW11675.1| PAP-associated domain-containing protein 5 [Cricetulus griseus]
Length = 382
Score = 47.4 bits (111), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 60/135 (44%), Gaps = 33/135 (24%)
Query: 1238 AVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRN 1297
V R+ ++ LWP + IFGS TGL LP+SD+DLVV NL P+
Sbjct: 5 VVSRIESVIKELWPSADVQIFGSFKTGLYLPTSDIDLVV-FGKWENL-PL---------- 52
Query: 1298 GIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV---VEVPHDLIASAASSVQS- 1353
L+ A R + DS+K ++ +PII L EV D+ + + V++
Sbjct: 53 ----WTLEEALR---KHKVADEDSVKVLDKATVPIIKLTDSFTEVKVDISFNVQNGVRAA 105
Query: 1354 ----------PKEDA 1358
P+EDA
Sbjct: 106 DLIKDFTKLHPREDA 120
>gi|224128147|ref|XP_002329093.1| predicted protein [Populus trichocarpa]
gi|222869762|gb|EEF06893.1| predicted protein [Populus trichocarpa]
Length = 543
Score = 47.4 bits (111), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 39/67 (58%), Gaps = 7/67 (10%)
Query: 1213 LHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPS 1269
LH+EI FC ++ E +R AV+ V ++ +WP + +FGS TGL LP+
Sbjct: 125 LHKEIVDFCDFLSPTQEEQASRAE----AVRCVFDVIKYIWPNCKVEVFGSFRTGLYLPT 180
Query: 1270 SDVDLVV 1276
SD+D+V+
Sbjct: 181 SDIDVVI 187
Score = 47.0 bits (110), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 2/64 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + V DISF + G + +K ++P PL L+LK FL R L++ YSGG
Sbjct: 229 VEKRSGVSFDISFDV--NGGPIAAEFIKNAISKWPELRPLCLILKVFLQQRELNEVYSGG 286
Query: 1452 LSSY 1455
+SSY
Sbjct: 287 ISSY 290
>gi|302673443|ref|XP_003026408.1| hypothetical protein SCHCODRAFT_114488 [Schizophyllum commune H4-8]
gi|300100090|gb|EFI91505.1| hypothetical protein SCHCODRAFT_114488, partial [Schizophyllum
commune H4-8]
Length = 466
Score = 47.4 bits (111), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%), Gaps = 1/75 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
D+ + GL ++++ PA PL VLK+FL R L + GL Y LM
Sbjct: 212 FDVDIGVNNVDGLNGVPIMQKYLRTMPALRPLLFVLKRFLGQRDLGNAAKSGLGGYGLMC 271
Query: 1460 LITRFLQHEHHLGRP 1474
LI FL++ GRP
Sbjct: 272 LIVAFLKNNPQ-GRP 285
>gi|453083459|gb|EMF11505.1| PAP/OAS1 substrate-binding domain-containing protein [Mycosphaerella
populorum SO2202]
Length = 632
Score = 47.4 bits (111), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K ++ DI+F +P GL T +++ ++ P P+ L +K + R ++ SYSG L
Sbjct: 238 KDGIGIQCDINFSNP--LGLHNTQMLRCYSKCDPRVRPIVLFMKSWAKQRKINSSYSGTL 295
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 296 SSYGYVLMVLHYL 308
>gi|342873493|gb|EGU75659.1| hypothetical protein FOXB_13808 [Fusarium oxysporum Fo5176]
Length = 340
Score = 47.4 bits (111), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++D+SF++ G+ D E Q+PA L V+K FL R L++ +GG
Sbjct: 124 VDKQTGLKVDVSFENLG--GVNAVDTFLEWKAQYPAMPILVTVIKHFLLMRGLNEPVNGG 181
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ + ++ L+ LQ EHHLG
Sbjct: 182 IGGFTVICLVVSMLQLMPQVQSRNLIPEHHLG 213
>gi|237839809|ref|XP_002369202.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211966866|gb|EEB02062.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
Length = 1196
Score = 47.4 bits (111), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 127/288 (44%), Gaps = 33/288 (11%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P+R+ L H L HEE F + + A ++ + RVT ++LWP FG
Sbjct: 306 PSRE-KLDFFHRL-HEECVDFLQWIDATEEEQRNRAK-VLARVTAVSRLLWPNCVVCPFG 362
Query: 1260 SNATGLSLPSSDVDLVVCL---PPVRNLEPIKEAGILEGRNGIKETCLQHAA----RYLA 1312
S+ T L+LP +D+D+ + + PP+ LE +E ++ I E+ A R+ A
Sbjct: 363 SSYTNLALPHADLDICIFINSDPPL--LEEFRELAAVDSAKEIAESAASAQAASYRRFGA 420
Query: 1313 NQEWVKSDSLKTVENTAIPIIML------VVEVPHDLIASAASSVQSPKEDAAHTTL--- 1363
E V +++ ++ E + +A +V P +DA ++
Sbjct: 421 LAEKVMAEASRSTETFGYGSSFRRKKEEDACHLRRLAAVLSACTVSLPSQDAPSSSAPNR 480
Query: 1364 -----KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
+ D+ + +D+ + ++ P C + T++ +DIS PS + T+ +
Sbjct: 481 RESPRRRDSPLVTDLQLILEARVPICRF-----VDTETNLPVDISLDQPS--AVLTSLYI 533
Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ +FP PL L+ K L L++ + GG+ SY L ++ FLQ
Sbjct: 534 RLQLLRFPLLRPLMLLNKTALKLWRLNEPFKGGVGSYLLFVMTLSFLQ 581
>gi|451992975|gb|EMD85451.1| hypothetical protein COCHEDRAFT_1148848 [Cochliobolus heterostrophus
C5]
Length = 624
Score = 47.4 bits (111), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 42/72 (58%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++DISF++ S G+Q + + +P + +LKQFL L++ ++GG+ +
Sbjct: 429 TQLQVDISFENLS--GVQAQATFAQWKQDYPDMIYMVALLKQFLVMHGLNEVHTGGIGGF 486
Query: 1456 CLMLLITRFLQH 1467
++ LI ++QH
Sbjct: 487 SIICLIVSYIQH 498
>gi|145525609|ref|XP_001448621.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416176|emb|CAK81224.1| unnamed protein product [Paramecium tetraurelia]
Length = 364
Score = 47.4 bits (111), Expect = 0.068, Method: Composition-based stats.
Identities = 62/266 (23%), Positives = 103/266 (38%), Gaps = 79/266 (29%)
Query: 1203 KASLSLMHSLL--HEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGS 1260
K +L + H +L H EI F + ++ + K + A RV + LQ + P ++ FGS
Sbjct: 52 KKTLMIRHPVLRLHNEIAEFYEYISPSDQEHKRRVT-AYLRVEKYLQDIAPEAQIESFGS 110
Query: 1261 NATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSD 1320
T + LP++D+D+V ++E K+ + AAR + K +
Sbjct: 111 FKTRMYLPNADIDIV----------------MIETSCTQKQLFKKVAARMMKQTN--KFE 152
Query: 1321 SLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSA 1380
++ + N +PII VEV ++ H D+
Sbjct: 153 NVNLIANAKVPIIKF-VEV--------------------------ESQYHFDL------- 178
Query: 1381 SPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLA 1440
S D +K Q +L K E +P L + LK L
Sbjct: 179 ----SFNQLDGLK-------------------QIEELEKAF-ELYPELKFLLMTLKCVLR 214
Query: 1441 DRSLDQSYSGGLSSYCLMLLITRFLQ 1466
R L+++YSGG+ S+ L +I FL+
Sbjct: 215 QRDLNETYSGGVGSFLLFQMILAFLR 240
>gi|390597612|gb|EIN07011.1| Nucleotidyltransferase [Punctularia strigosozonata HHB-11173 SS5]
Length = 464
Score = 47.0 bits (110), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
+DIS + G+ ++ + PA L ++ K FL+ RS+++ Y+GGL SY ++
Sbjct: 255 VDISLNQTN--GVAAGKMINRYLRELPALRGLVMITKAFLSQRSMNEVYTGGLGSYSIVC 312
Query: 1460 LITRFLQHEHHLGR 1473
L FLQ + R
Sbjct: 313 LAISFLQMHPKIRR 326
>gi|342320294|gb|EGU12235.1| Trf5 [Rhodotorula glutinis ATCC 204091]
Length = 522
Score = 47.0 bits (110), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 62/144 (43%), Gaps = 27/144 (18%)
Query: 1196 VQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
++ RK + +LL EE+D+F N + ++ A + ++ LWP +
Sbjct: 52 MEGLKRRKDDMRHPLALLDEELDAFVTWTRPNNAEHQLRLH-AFRCFQSLVKALWPFATC 110
Query: 1256 NIFGSNATGLSLPSSDVDLVV------CLPPVRNLEPIKEAGILEGRNGIKETCLQHAAR 1309
+FGS ATGL LP D D+V+ P V L +++A + G
Sbjct: 111 ELFGSMATGLYLPDGDFDVVIFDSRLMTTPSVSLLRTLRDAIVRSG-------------- 156
Query: 1310 YLANQEWVKSDSLKTVENTAIPII 1333
+ ++ ++ VEN +P++
Sbjct: 157 ------FAQASDIRLVENAKVPLV 174
>gi|221504776|gb|EEE30441.1| trf5, putative [Toxoplasma gondii VEG]
Length = 1196
Score = 47.0 bits (110), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 128/288 (44%), Gaps = 33/288 (11%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P+R+ L H L HEE F + + A ++ + RVT ++LWP FG
Sbjct: 306 PSRE-KLDFFHRL-HEECVDFLQWIDATEEEQRNRAK-VLARVTAVSRLLWPNCVVCPFG 362
Query: 1260 SNATGLSLPSSDVDLVVCL---PPVRNLEPIKEAGILEGRNGIKETCLQHAA----RYLA 1312
S+ T L+LP +D+D+ + + PP+ LE +E ++ I E+ A R+ A
Sbjct: 363 SSYTNLALPHADLDICIFINSDPPL--LEEFRELAAVDSAKEIAESAASAQAASYRRFGA 420
Query: 1313 NQEWVKSDSLKTVENTAIPIIML------VVEVPHDLIASAASSVQSPKEDAAHTTL--- 1363
E V +++ ++ E + +A +V P +DA ++
Sbjct: 421 LAEKVMAEASRSTETFGYGSSFRRKKEEDACHLRRLAAVLSACTVSLPSQDAPSSSAPNR 480
Query: 1364 -----KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
+ D+ + +D+ + ++ P C ++ T++ +DIS PS + T+ +
Sbjct: 481 RESPRRRDSPLVTDLQLILEARVPICRFVDTE-----TNLPVDISLDQPS--AVLTSLYI 533
Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ +FP PL L+ K L L++ + GG+ SY L ++ FLQ
Sbjct: 534 RLQLLRFPLLRPLMLLNKTALKLWRLNEPFKGGVGSYLLFVMTLSFLQ 581
>gi|221484582|gb|EEE22876.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 1196
Score = 47.0 bits (110), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 128/288 (44%), Gaps = 33/288 (11%)
Query: 1200 PTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFG 1259
P+R+ L H L HEE F + + A ++ + RVT ++LWP FG
Sbjct: 306 PSRE-KLDFFHRL-HEECVDFLQWIDATEEEQRNRAK-VLARVTAVSRLLWPNCVVCPFG 362
Query: 1260 SNATGLSLPSSDVDLVVCL---PPVRNLEPIKEAGILEGRNGIKETCLQHAA----RYLA 1312
S+ T L+LP +D+D+ + + PP+ LE +E ++ I E+ A R+ A
Sbjct: 363 SSYTNLALPHADLDICIFINSDPPL--LEEFRELAAVDSAKEIAESAASAQAASYRRFGA 420
Query: 1313 NQEWVKSDSLKTVENTAIPIIML------VVEVPHDLIASAASSVQSPKEDAAHTTL--- 1363
E V +++ ++ E + +A +V P +DA ++
Sbjct: 421 LAEKVMAEASRSTETFGYGSSFRRKKEEDACHLRRLAAVLSACTVSLPSQDAPSSSAPNR 480
Query: 1364 -----KHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLV 1418
+ D+ + +D+ + ++ P C ++ T++ +DIS PS + T+ +
Sbjct: 481 RESPRRRDSPLVTDLQLILEARVPICRFVDTE-----TNLPVDISLDQPS--AVLTSLYI 533
Query: 1419 KELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ +FP PL L+ K L L++ + GG+ SY L ++ FLQ
Sbjct: 534 RLQLLRFPLLRPLMLLNKTALKLWRLNEPFKGGVGSYLLFVMTLSFLQ 581
>gi|145516679|ref|XP_001444228.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411639|emb|CAK76831.1| unnamed protein product [Paramecium tetraurelia]
Length = 637
Score = 47.0 bits (110), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 63/282 (22%), Positives = 113/282 (40%), Gaps = 49/282 (17%)
Query: 1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRV--TRSLQVLW 1250
P Q P ++ L+L S ++ID F N K + N KR+ R ++
Sbjct: 297 PYSSQTTPIKRKQLNLNKSKT-QDIDEF-----VNNIQVKSFQNKIRKRICFNRLHYIIS 350
Query: 1251 PRSRTNIF--GSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAA 1308
S NI+ GS TGL L SDVD+ + + I L+ N L
Sbjct: 351 MNSDLNIYAYGSFETGLDLEVSDVDIGIWGTQTLSYNQI--VNFLQSINQ-----LLKQT 403
Query: 1309 RYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNH 1368
+L N + ++S +PI+ L + P + + +Q +
Sbjct: 404 PFLVNSKLIQS---------HMPILKLELN-PKTAFYNDDAYIQQNWQS----------- 442
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
LD S K I +++ + +P H G +T ++K+ +F
Sbjct: 443 -----FHLDQQDSGKIIQVDLTWIYQWSNI-----YNNP-HLGFASTTIIKDWVNRFYWY 491
Query: 1429 TPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
+ L+LK L ++L+ +++GG+SS+CL +++ +H+
Sbjct: 492 RDIMLILKYLLKSKNLNDAHTGGISSFCLSIMLAAIYMCKHY 533
>gi|254573058|ref|XP_002493638.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Komagataella
pastoris GS115]
gi|238033437|emb|CAY71459.1| Catalytic subunit of TRAMP (Trf4/Pap2p-Mtr4p-Air1p/2p) [Komagataella
pastoris GS115]
gi|328354535|emb|CCA40932.1| DNA polymerase sigma subunit [Komagataella pastoris CBS 7435]
Length = 601
Score = 46.6 bits (109), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 36/96 (37%), Positives = 47/96 (48%), Gaps = 12/96 (12%)
Query: 1190 PLQPLEVQNCP---TRKASLSL-MHSLLHEEIDSFCKQVA---AENTARKPYINWAVKRV 1242
PL +Q P R S L + L EI F ++ AE AR N AVKR+
Sbjct: 108 PLDDKPIQQYPWIKNRDHSKQLELSDWLTLEIKDFINYISPSIAEIEAR----NNAVKRL 163
Query: 1243 TRSLQV-LWPRSRTNIFGSNATGLSLPSSDVDLVVC 1277
+ + LWP N+FGS AT L LP SD+D+V+
Sbjct: 164 RKEITTNLWPDCYVNVFGSFATDLYLPGSDIDMVIT 199
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ + + +D+SF+ + GL+ + ++ + P L L++KQFLA R ++ + GG
Sbjct: 241 IEPRSKIHIDVSFEKTN--GLRAAERIQGWLRETPGLRELVLIVKQFLAVRRMNNVHHGG 298
Query: 1452 LSSYCLMLLITRFL 1465
L + ++ L+ FL
Sbjct: 299 LGGFSIICLVHSFL 312
>gi|330917097|ref|XP_003297677.1| hypothetical protein PTT_08167 [Pyrenophora teres f. teres 0-1]
gi|311329513|gb|EFQ94235.1| hypothetical protein PTT_08167 [Pyrenophora teres f. teres 0-1]
Length = 673
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 41/71 (57%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +++DISF++ S G+Q E ++P + ++KQFL L++ ++GG+ Y
Sbjct: 478 TKLQVDISFENLS--GVQAQATFAEWKAKYPDMIYMVALMKQFLVMHGLNEVHTGGIGGY 535
Query: 1456 CLMLLITRFLQ 1466
++ LI +LQ
Sbjct: 536 SIICLIVSYLQ 546
>gi|425766989|gb|EKV05577.1| Topoisomerase family protein TRF4, putative [Penicillium digitatum
Pd1]
Length = 714
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +R+D+SF + S GL + ++ ++P + V+KQFL R L++ +GGL +
Sbjct: 371 TGLRVDLSFDNDS--GLIANETFQKWKTEYPTMPVILAVIKQFLLIRGLNEVPTGGLGGF 428
Query: 1456 CLMLLITRFLQH 1467
+ L+T LQH
Sbjct: 429 SITCLVTSILQH 440
>gi|68074819|ref|XP_679326.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56500049|emb|CAH96511.1| conserved hypothetical protein [Plasmodium berghei]
Length = 390
Score = 46.6 bits (109), Expect = 0.11, Method: Composition-based stats.
Identities = 18/54 (33%), Positives = 37/54 (68%)
Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++TD +K+ +++ PL +++K FL R+L+++Y+GG+ S+ L ++ FLQ
Sbjct: 248 ESTDFIKKKIKEYIYLRPLVILIKFFLNSRNLNETYTGGIGSFMLCCMVLHFLQ 301
>gi|425780140|gb|EKV18158.1| Topoisomerase family protein TRF4, putative [Penicillium digitatum
PHI26]
Length = 714
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +R+D+SF + S GL + ++ ++P + V+KQFL R L++ +GGL +
Sbjct: 371 TGLRVDLSFDNDS--GLIANETFQKWKTEYPTMPVILAVIKQFLLIRGLNEVPTGGLGGF 428
Query: 1456 CLMLLITRFLQH 1467
+ L+T LQH
Sbjct: 429 SITCLVTSILQH 440
>gi|367007982|ref|XP_003688720.1| hypothetical protein TPHA_0P01280 [Tetrapisispora phaffii CBS 4417]
gi|357527030|emb|CCE66286.1| hypothetical protein TPHA_0P01280 [Tetrapisispora phaffii CBS 4417]
Length = 504
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 20/74 (27%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ + + +D+SF+ + GL ++++ P+ L L++KQFL R L+ ++GG
Sbjct: 231 VEPVSGIHIDVSFERDN--GLDAANIIRSWLTSTPSLRELVLIVKQFLNSRRLNDVHTGG 288
Query: 1452 LSSYCLMLLITRFL 1465
L + ++ L+ FL
Sbjct: 289 LGGFSIICLVYSFL 302
>gi|302831043|ref|XP_002947087.1| hypothetical protein VOLCADRAFT_87356 [Volvox carteri f. nagariensis]
gi|300267494|gb|EFJ51677.1| hypothetical protein VOLCADRAFT_87356 [Volvox carteri f. nagariensis]
Length = 609
Score = 46.6 bits (109), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 1391 NIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSG 1450
I + V +DIS S G Q + +++ FPA PL LVLK ++ + L + G
Sbjct: 353 KITTSAGVSVDISIAGLS--GPQAAEYIRQQVSSFPALRPLVLVLKSYMRELDLAEVAKG 410
Query: 1451 GLSSYCLMLLITRFL 1465
G+SSY L ++ L
Sbjct: 411 GISSYGLTYMVMAHL 425
>gi|296088678|emb|CBI38128.3| unnamed protein product [Vitis vinifera]
Length = 178
Score = 46.6 bits (109), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 38/61 (62%), Gaps = 1/61 (1%)
Query: 1004 TTDVATEEEISGSFASLSGDVDSKALDTLPCPILRPIIIPNLSRERSRSDFKRSHEHKSP 1063
+ D +E+++SGS A+L +V+ K+ D LP +L PIIIPN+S+E S D K + P
Sbjct: 2 SADALSEKKVSGSLANLPVNVEGKSGDPLPYSLL-PIIIPNMSKESSLKDVKGHGNDEDP 60
Query: 1064 C 1064
Sbjct: 61 I 61
>gi|396490001|ref|XP_003843230.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
gi|312219809|emb|CBX99751.1| hypothetical protein LEMA_P073400.1 [Leptosphaeria maculans JN3]
Length = 717
Score = 46.6 bits (109), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 47/83 (56%), Gaps = 5/83 (6%)
Query: 1394 AATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLS 1453
+ T +++DISF++ S G+Q + +++P + +LKQFL R L++ ++GG+
Sbjct: 476 SVTRLQVDISFENLS--GMQAQATFAQWKDKYPDMIYMVALLKQFLVMRGLNEVHTGGIG 533
Query: 1454 SYCLMLLITRFLQHEHHLGRPIN 1476
+ ++ LI ++ H G+ N
Sbjct: 534 GFAIICLIVHYI---HQAGKAEN 553
>gi|440493090|gb|ELQ75598.1| DNA polymerase sigma [Trachipleistophora hominis]
Length = 288
Score = 46.2 bits (108), Expect = 0.15, Method: Composition-based stats.
Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 2/67 (2%)
Query: 1399 RLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLM 1458
R D+S S G+ T +K+ +Q P L+L LK FL R L++S GGL SY +
Sbjct: 198 RYDLSLNQES--GIIHTRFIKKALKQQPYIKDLSLFLKYFLKSRGLNESKRGGLCSYAQL 255
Query: 1459 LLITRFL 1465
L++ FL
Sbjct: 256 LMLINFL 262
>gi|395335008|gb|EJF67384.1| hypothetical protein DICSQDRAFT_77074 [Dichomitus squalens LYAD-421
SS1]
Length = 592
Score = 46.2 bits (108), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
S+++D+S + GL+ +++ ++ PA L L LK L+ L+ + GGLSSY
Sbjct: 220 SLKIDVSLNA--TDGLKAVPILRSYFDRMPALRHLVLCLKALLSRHGLNSASFGGLSSYA 277
Query: 1457 LMLLITRFLQHEHHLGRP 1474
L+ L FLQ + +GRP
Sbjct: 278 LICLAISFLQL-NPMGRP 294
>gi|374107870|gb|AEY96777.1| FAEL207Wp [Ashbya gossypii FDAG1]
Length = 626
Score = 46.2 bits (108), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 87/231 (37%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N A+KR+ ++Q WP + + FGS AT L LP SD+D VV ++G +
Sbjct: 217 NDALKRIRDAVQDFWPDANLHCFGSYATDLYLPGSDIDCVVN----------SKSGDKDN 266
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
+N + LA Q + + +PII V
Sbjct: 267 KNALYSLASYLKRNGLATQ-------VSVIAKARVPIIKFV------------------- 300
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
E A+ +H D+ S ++ ++AA +R L T
Sbjct: 301 EPAS--------QIHIDL-----------SFERTNGVEAAKIIR----------GWLHDT 331
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL L++KQFL R L+ + GGL + ++ L FL+
Sbjct: 332 PGLRELV----------LIVKQFLHARRLNDVHIGGLGGFSIICLAYSFLK 372
>gi|146184040|ref|XP_001027646.2| Chitinase class I family protein [Tetrahymena thermophila]
gi|146143378|gb|EAS07404.2| Chitinase class I family protein [Tetrahymena thermophila SB210]
Length = 463
Score = 46.2 bits (108), Expect = 0.16, Method: Composition-based stats.
Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ T + DISF G++ VK+ E +P L +V+K L R L ++YSGG
Sbjct: 264 VEKETQINFDISFNK--EDGVKQLSEVKKGLELYPEMKYLIMVMKCILRQRDLHETYSGG 321
Query: 1452 LSSYCLMLLITRFLQ 1466
+ S+ L +I FL+
Sbjct: 322 IGSFLLFCMILAFLR 336
>gi|195112905|ref|XP_002001012.1| GI22217 [Drosophila mojavensis]
gi|193917606|gb|EDW16473.1| GI22217 [Drosophila mojavensis]
Length = 397
Score = 45.8 bits (107), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
LH+EI+ F + + T + AV R+ R + +WP +FGS TGL+LP SD+
Sbjct: 25 LHQEIEHFYNYMVSTPTEYMMRME-AVHRIERVVLSIWPDCCIEVFGSFRTGLNLPISDI 83
Query: 1273 DLVV 1276
D+ V
Sbjct: 84 DIAV 87
Score = 44.7 bits (104), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 7/78 (8%)
Query: 1369 VHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPAS 1428
V D+ LD ++ P T + + ++ DISF + S G++ +LV++ EQFP
Sbjct: 109 VTDDVNVLDKASVPVVKFT-----EQISRIKFDISFNTTS--GVKAAELVQQFIEQFPEL 161
Query: 1429 TPLALVLKQFLADRSLDQ 1446
L +VLKQ+L + L++
Sbjct: 162 PKLVIVLKQYLLLQGLNE 179
>gi|302917206|ref|XP_003052397.1| hypothetical protein NECHADRAFT_1558 [Nectria haematococca mpVI
77-13-4]
gi|256733337|gb|EEU46684.1| hypothetical protein NECHADRAFT_1558 [Nectria haematococca mpVI
77-13-4]
Length = 630
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+ T +++DISF++ G+Q D E +PA L V+K FL R L++ +GG
Sbjct: 427 VDKQTGLKVDISFEN--LGGVQAVDTFLRWKELYPAMPILVTVIKHFLLMRGLNEPVNGG 484
Query: 1452 LSSYCLMLLITRFLQ-----------HEHHLG 1472
+ + ++ L+ LQ EH+LG
Sbjct: 485 IGGFSVICLVVSMLQLNPQVQSRSLVPEHNLG 516
>gi|159474006|ref|XP_001695120.1| hypothetical protein CHLREDRAFT_206054 [Chlamydomonas reinhardtii]
gi|158276054|gb|EDP01828.1| hypothetical protein CHLREDRAFT_206054 [Chlamydomonas reinhardtii]
Length = 397
Score = 45.8 bits (107), Expect = 0.17, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ V +D+S S S G + +++ +PA PL LVLK ++ L + SGGLSSY
Sbjct: 193 SGVSVDVSIASDS--GPRAAAFIRQQAVAYPALRPLVLVLKSYMRAEGLAEVASGGLSSY 250
Query: 1456 CLMLLITRFLQHEHHLG 1472
L ++ LQ E G
Sbjct: 251 GLTYMLLAHLQEEAKRG 267
>gi|452988962|gb|EME88717.1| hypothetical protein MYCFIDRAFT_87561 [Pseudocercospora fijiensis
CIRAD86]
Length = 848
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D K ++ DI+F +P GL T +++ ++ P P+ L +K + R ++ SYS
Sbjct: 446 DFPKDGVGIQCDINFFNP--LGLHNTQMLRCYSKCDPRVRPMVLFVKSWAKGRKINSSYS 503
Query: 1450 GGLSSYCLMLLITRFL 1465
G LSSY +L++ +L
Sbjct: 504 GTLSSYGYVLMVLHYL 519
>gi|45190400|ref|NP_984654.1| AEL207Wp [Ashbya gossypii ATCC 10895]
gi|50401682|sp|Q9HFW3.1|TRF5_ASHGO RecName: Full=Poly(A) RNA polymerase protein 1; AltName:
Full=Topoisomerase 1-related protein TRF5
gi|10444115|gb|AAG17722.1|AF286114_3 Trf5 [Eremothecium gossypii]
gi|44983296|gb|AAS52478.1| AEL207Wp [Ashbya gossypii ATCC 10895]
Length = 626
Score = 45.8 bits (107), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 87/231 (37%), Gaps = 75/231 (32%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N A+KR+ ++Q WP + + FGS AT L LP SD+D VV ++G +
Sbjct: 217 NDALKRIRDAVQDFWPDANLHCFGSYATDLYLPGSDIDCVVN----------SKSGDKDN 266
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
+N + LA Q + + +PII V
Sbjct: 267 KNALYSLASYLKRNGLATQ-------VSVIAKARVPIIKFV------------------- 300
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
E A+ +H D+ S ++ ++AA +R L T
Sbjct: 301 EPAS--------QIHIDL-----------SFERTNGVEAAKIIR----------GWLHDT 331
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
++EL L++KQFL R L+ + GGL + ++ L FL+
Sbjct: 332 PGLRELV----------LIVKQFLHARRLNDVHIGGLGGFSIICLAYSFLK 372
>gi|380471218|emb|CCF47385.1| Poly(A) RNA polymerase cid14, partial [Colletotrichum higginsianum]
Length = 457
Score = 45.8 bits (107), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 2/65 (3%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I + T +R+DISF G Q E EQFPA L ++K FLA R L++ +GG
Sbjct: 395 IDSITGLRVDISFDR--LDGPQAVKTFAEWKEQFPAMPILVTMIKHFLAMRGLNEPVNGG 452
Query: 1452 LSSYC 1456
+ S+
Sbjct: 453 IGSFT 457
>gi|355709232|gb|AES03523.1| polymerase sigma [Mustela putorius furo]
Length = 417
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 20/36 (55%), Positives = 27/36 (75%)
Query: 1431 LALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L LVLKQFL R L++ ++GG+SSY L+L+ FLQ
Sbjct: 6 LILVLKQFLLQRDLNEVFTGGISSYSLILMAISFLQ 41
>gi|189211747|ref|XP_001942202.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187979401|gb|EDU46027.1| Poly(A) RNA polymerase cid13 [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 636
Score = 45.8 bits (107), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 60/262 (22%), Positives = 104/262 (39%), Gaps = 58/262 (22%)
Query: 1258 FGSNATGLSLPSSDVDLVVCLPPVRN------------LEPIKEAGILEGRNG------- 1298
FGS A+G +P SD+DL V +P R+ + + E +L+ + G
Sbjct: 199 FGSLASGFGMPGSDMDLAV-VPTYRDPARANDFSIDRGIPRLLERAVLDAKLGGRLLTRT 257
Query: 1299 ---IKETCLQ-----HAARYLANQEW--------VKSDSL------KTVENTAIP----- 1331
I + C + A + Q+W SD+ K N +P
Sbjct: 258 RVPILKICQSPTETLYTALFEERQKWDELPDEEKYPSDATRSFVQPKPAANHQVPPTPQV 317
Query: 1332 -IIMLVVEVPH-DLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSS 1389
V P A +S+Q+PK + T + + V +D D + +
Sbjct: 318 DAAKFVTNFPTLGTTAPTKNSLQAPKTEVKSPTTETSS-VTNDKAKKDQNQRSGKQWSRE 376
Query: 1390 ------DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRS 1443
D K ++ DI+F++P G+ T ++K + P P+ L +K + R
Sbjct: 377 KVVGPLDFPKTGCGIQCDINFENP--LGIHNTHMLKCYSLTDPRVRPMVLFVKSWAKRRK 434
Query: 1444 LDQSYSGGLSSYCLMLLITRFL 1465
++ +YSG LSSY +L++ +L
Sbjct: 435 VNSAYSGTLSSYGWVLMVLHYL 456
>gi|407398163|gb|EKF28017.1| DNA polymerase sigma, putative [Trypanosoma cruzi marinkellei]
Length = 286
Score = 45.8 bits (107), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 31/55 (56%), Gaps = 1/55 (1%)
Query: 1421 LTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ-HEHHLGRP 1474
+ FP + PL +V+K FL R + + Y GGL S+ LL+ FLQ H + RP
Sbjct: 1 MLRSFPEAQPLIIVVKYFLQQRGMHEPYHGGLGSFATTLLVISFLQNHPIYTERP 55
>gi|325182222|emb|CCA16676.1| phospholipase putative [Albugo laibachii Nc14]
Length = 1653
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 67/145 (46%), Gaps = 17/145 (11%)
Query: 1196 VQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
V+ PT +A L+ E++ Q A +N AR A + + R +Q LW +
Sbjct: 1279 VREEPTVEAPLT--------ELNQRILQFACQNEARN---EQARQEIERGIQSLWSDASV 1327
Query: 1256 NIFGSNATGLS--LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLAN 1313
+FGS AT LS + ++DVDLVV LP + A ++ E+ + + + A+
Sbjct: 1328 YLFGSAATSLSIDIETADVDLVVILPSQSTVFRKASAHLVIQLKDHLESLVSEKSLFCAS 1387
Query: 1314 QEWVKSDSLKTVENTAIPIIMLVVE 1338
KS + + N IPI L +E
Sbjct: 1388 ----KSVNFSLITNARIPITRLWIE 1408
>gi|452839338|gb|EME41277.1| hypothetical protein DOTSEDRAFT_73629 [Dothistroma septosporum NZE10]
Length = 835
Score = 45.4 bits (106), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D K ++ DI+F +P GL T L++ + P P+ L +K + R ++ SYS
Sbjct: 448 DFPKDGVGIQCDINFFNP--LGLHNTQLLRCYSSCDPRVRPMVLFVKSWAKKRRINSSYS 505
Query: 1450 GGLSSYCLMLLITRFL 1465
G LSSY ++++ +L
Sbjct: 506 GTLSSYGYVMMVLHYL 521
>gi|428181724|gb|EKX50587.1| hypothetical protein GUITHDRAFT_135219 [Guillardia theta CCMP2712]
Length = 186
Score = 45.4 bits (106), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 30/48 (62%), Gaps = 5/48 (10%)
Query: 1249 LWPRSRTNIFGSNATGLSLPSSDVDLVV-----CLPPVRNLEPIKEAG 1291
+WP + +FGS ATG+ LPSSDVD+VV C P V+ + +K G
Sbjct: 65 IWPEAELKVFGSYATGMFLPSSDVDMVVVNCSDCDPLVKLAKVLKSDG 112
>gi|330844415|ref|XP_003294122.1| hypothetical protein DICPUDRAFT_159072 [Dictyostelium purpureum]
gi|325075475|gb|EGC29358.1| hypothetical protein DICPUDRAFT_159072 [Dictyostelium purpureum]
Length = 825
Score = 45.1 bits (105), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 45/74 (60%), Gaps = 2/74 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T +DI F +PS G++ T +VK +Q+ + L LVLK FL ++++Y+GG+ SY
Sbjct: 417 TQYNIDICFDTPS--GIENTQVVKSFLKQYKSMRVLLLVLKFFLQQNHVNETYTGGIGSY 474
Query: 1456 CLMLLITRFLQHEH 1469
L L++ ++Q H
Sbjct: 475 ALALMVVSYIQLRH 488
>gi|213406299|ref|XP_002173921.1| Poly(A) RNA polymerase cid12 [Schizosaccharomyces japonicus yFS275]
gi|212001968|gb|EEB07628.1| Poly(A) RNA polymerase cid12 [Schizosaccharomyces japonicus yFS275]
Length = 357
Score = 44.7 bits (104), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ + +DI+F + S + + +L E + P L ++LK +L +R LD Y GGL S
Sbjct: 124 SGLSVDITFDNAS--AISSANLQNEYADNHPVFRKLFMLLKHWLFERRLDHVYIGGLGSS 181
Query: 1456 CLMLLITRFLQHEHH 1470
L +I +L+ ++H
Sbjct: 182 ALSYMIIGWLEMQYH 196
>gi|380487956|emb|CCF37707.1| hypothetical protein CH063_01756 [Colletotrichum higginsianum]
Length = 1037
Score = 44.7 bits (104), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 53/123 (43%), Gaps = 8/123 (6%)
Query: 1349 SSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTS---SDNI---KAATSVRLDI 1402
S V P+ D KH+ M L D A + D + K+ V+ DI
Sbjct: 648 SIVHDPQYDYVVPVTKHNIETIKRMRGLRDPAKMAVNQPRDPYKDKLEFPKSGVGVQCDI 707
Query: 1403 SFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLIT 1462
+F + H L T L++ + P PL L +K + R ++ Y G LSSY +L++
Sbjct: 708 NFAA--HLALHNTLLLRCYSHSDPRVRPLVLFIKHWAKVRGINTPYRGTLSSYGYVLMML 765
Query: 1463 RFL 1465
+L
Sbjct: 766 HYL 768
>gi|340960188|gb|EGS21369.1| hypothetical protein CTHT_0032240 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1085
Score = 44.3 bits (103), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 41/144 (28%), Positives = 69/144 (47%), Gaps = 13/144 (9%)
Query: 1328 TAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSA--SPKCS 1385
T I I + +++ P ++ S +S Q+ DA K + + S++ L D A +P
Sbjct: 697 TKIRIYIELLKGP--MVPSPSSKTQT---DAVIPLPKDADTLISELRQLGDPAGMAPNQP 751
Query: 1386 HTSSDNI----KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLAD 1441
+N K+ V+ DI+F + H LQ T L++ + P PL L +K +
Sbjct: 752 RDRYNNPLEFPKSGVGVQCDINFSA--HLALQNTLLLRCYSHCDPRVRPLILFVKHWAKV 809
Query: 1442 RSLDQSYSGGLSSYCLMLLITRFL 1465
R ++ Y G LSSY +L++ FL
Sbjct: 810 RRINTPYRGTLSSYGYVLMMLHFL 833
>gi|389600163|ref|XP_001561739.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504209|emb|CAM41532.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 3055
Score = 44.3 bits (103), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT ++DI+F H ++ L + L E FP + PL L LK + + +S GG++S
Sbjct: 1681 ATGTKVDITFAGGEH--WRSMQLTRSLLEVFPHARPLILFLKYCVRSLGVGESEPGGVTS 1738
Query: 1455 YCLMLLITRFLQHE 1468
+ + L++ F HE
Sbjct: 1739 FAIYLMVLHFY-HE 1751
>gi|340939540|gb|EGS20162.1| hypothetical protein CTHT_0046690 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 836
Score = 44.3 bits (103), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 44/92 (47%), Gaps = 13/92 (14%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ T +R+DISF++ +G+ + + +P L V+K FL R L++ +GG
Sbjct: 582 VERKTGLRIDISFEN--RSGIVAINTFHKWRNLYPGMPALVSVIKHFLLMRGLNEPVNGG 639
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLG 1472
+ S+ + L+ LQ HHLG
Sbjct: 640 IGSFSVTCLVVSMLQMMPEVQSGNMDTRHHLG 671
>gi|116201727|ref|XP_001226675.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
gi|88177266|gb|EAQ84734.1| hypothetical protein CHGG_08748 [Chaetomium globosum CBS 148.51]
Length = 780
Score = 44.3 bits (103), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 23/93 (24%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
I+ T +++D+SF++ ++G+ K +++P L ++K FL R L++ +GG
Sbjct: 525 IEHKTGLKVDVSFEN--NSGVTAIATFKAWRDRYPGMPALVTLVKHFLLMRGLNEPVNGG 582
Query: 1452 LSSYCLMLLITRFLQH-----------EHHLGR 1473
+ + ++ L+ LQ HHLG+
Sbjct: 583 IGGFSVICLVVSMLQMMPEVQSGNLDTRHHLGQ 615
>gi|290988093|ref|XP_002676756.1| predicted protein [Naegleria gruberi]
gi|284090360|gb|EFC44012.1| predicted protein [Naegleria gruberi]
Length = 604
Score = 44.3 bits (103), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
+ + ++ +E++PA L LK FL R+L+ GGLSSY L L+I LQ
Sbjct: 375 SVEFIRRSSEKYPAFKYLCYFLKYFLKQRNLNMVLYGGLSSYGLSLMIVSHLQ 427
>gi|354544020|emb|CCE40742.1| hypothetical protein CPAR2_107770 [Candida parapsilosis]
Length = 608
Score = 44.3 bits (103), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 24/124 (19%)
Query: 1216 EIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDV 1272
E+ F + ++ AE R IN K ++ WP + ++FGS AT L LP SD+
Sbjct: 164 EMKDFVRYISPSKAEIITRNNVINTLKKEISS----FWPGTEAHVFGSCATDLYLPGSDI 219
Query: 1273 DLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPI 1332
D+VV I G E R + L + +L + K +++ + N +PI
Sbjct: 220 DMVV----------ISSTGDYENR-----SRLYQLSSFLRVKNLAK--NVEVIANAKVPI 262
Query: 1333 IMLV 1336
I V
Sbjct: 263 IKFV 266
>gi|396491063|ref|XP_003843481.1| hypothetical protein LEMA_P075910.1 [Leptosphaeria maculans JN3]
gi|312220060|emb|CBY00002.1| hypothetical protein LEMA_P075910.1 [Leptosphaeria maculans JN3]
Length = 729
Score = 44.3 bits (103), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D K ++ DI+F++P G+ + +++ + P P+ L +K + R ++ SYS
Sbjct: 447 DFPKTGCGIQCDINFENP--LGIHNSQMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSSYS 504
Query: 1450 GGLSSYCLMLLITRFL 1465
G LSSY +L++ +L
Sbjct: 505 GTLSSYGWVLMVLHYL 520
>gi|330915205|ref|XP_003296937.1| hypothetical protein PTT_07185 [Pyrenophora teres f. teres 0-1]
gi|311330654|gb|EFQ94964.1| hypothetical protein PTT_07185 [Pyrenophora teres f. teres 0-1]
Length = 513
Score = 43.9 bits (102), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D K ++ DI+F++P G+ T ++K + P P+ L +K + R ++ +YS
Sbjct: 246 DFPKTGCGIQCDINFENP--LGIHNTHMLKCYSLTDPRVRPMVLFVKSWAKRRKVNSAYS 303
Query: 1450 GGLSSYCLMLLITRFL 1465
G LSSY +L++ +L
Sbjct: 304 GTLSSYGWVLMVLHYL 319
>gi|378726515|gb|EHY52974.1| DNA polymerase sigma subunit [Exophiala dermatitidis NIH/UT8656]
Length = 684
Score = 43.9 bits (102), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 41/72 (56%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ +++D+ F + S G+ + ++ +++PA + ++KQFL R L+ +GGL +
Sbjct: 356 SGLKVDLCFDNDS--GVAAVETFQKWKKEYPAMPIIVAIVKQFLMIRGLNDVATGGLGGF 413
Query: 1456 CLMLLITRFLQH 1467
+ L+T LQH
Sbjct: 414 STICLVTSLLQH 425
>gi|145533334|ref|XP_001452417.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124420105|emb|CAK85020.1| unnamed protein product [Paramecium tetraurelia]
Length = 361
Score = 43.9 bits (102), Expect = 0.73, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+ + DISF GL+ D +++ +P L ++LK L R L+++YSGG+ S+
Sbjct: 171 SQYQFDISFNQ--MDGLKQIDEIRKAFTIYPEFKYLIMILKCMLKQRELNETYSGGIGSF 228
Query: 1456 CLMLLITRFLQ 1466
L +I FL+
Sbjct: 229 LLFQMILAFLR 239
>gi|145475559|ref|XP_001423802.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124390863|emb|CAK56404.1| unnamed protein product [Paramecium tetraurelia]
Length = 354
Score = 43.9 bits (102), Expect = 0.73, Method: Composition-based stats.
Identities = 23/71 (32%), Positives = 40/71 (56%), Gaps = 2/71 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
+S + DISF G++ D +++ +P L ++LK L R L+++YSGG+ S+
Sbjct: 164 SSYQFDISFNQ--MDGIRQIDEIQKAFTIYPEFKYLIMILKCILKQRDLNETYSGGIGSF 221
Query: 1456 CLMLLITRFLQ 1466
L +I FL+
Sbjct: 222 LLFQMILAFLR 232
>gi|299473006|emb|CBN77407.1| nucleotidyltransferase family protein [Ectocarpus siliculosus]
Length = 434
Score = 43.9 bits (102), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 46/100 (46%), Gaps = 5/100 (5%)
Query: 1213 LHEEIDSFCKQV--AAENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSS 1270
LH EI C + E A+ +KRV + L +R IFGS TGL LPSS
Sbjct: 265 LHNEILDLCDLLMPTPEEKAKTAIAITYIKRVVE--ETLGSEARVEIFGSQLTGLVLPSS 322
Query: 1271 DVDLVVCLPPVRNLEPIKEAGILEGRNG-IKETCLQHAAR 1309
D+D VV P +L + A G ++E + +AR
Sbjct: 323 DIDSVVLGGPRGSLGSLGAAMYRRQNKGEVREVTVIKSAR 362
>gi|145526448|ref|XP_001449035.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416601|emb|CAK81638.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 43.9 bits (102), Expect = 0.77, Method: Compositional matrix adjust.
Identities = 18/62 (29%), Positives = 37/62 (59%)
Query: 1409 HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHE 1468
H G +T ++K+ +F L+LK L ++L+ +++GG+SS+CL +++T +
Sbjct: 462 HLGFASTTIIKDWVNRFYWYRETILILKFLLKSKNLNDAHTGGISSFCLSIMLTAIYMCK 521
Query: 1469 HH 1470
H+
Sbjct: 522 HY 523
>gi|449688102|ref|XP_004211642.1| PREDICTED: terminal uridylyltransferase 4-like, partial [Hydra
magnipapillata]
Length = 196
Score = 43.9 bits (102), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 37/66 (56%)
Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QT+DL+ + T LA++++ + LD + +GGLS YC +L++ +LQH
Sbjct: 29 QTSDLIIKYTNVDERCAKLAMLIRMWSKACGLDNADNGGLSPYCFVLMVINYLQHTEPPV 88
Query: 1473 RPINQV 1478
PI +V
Sbjct: 89 LPIIKV 94
>gi|342885079|gb|EGU85188.1| hypothetical protein FOXB_04303 [Fusarium oxysporum Fo5176]
Length = 1055
Score = 43.5 bits (101), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K V+ DI+F + H L T L++ + P P+ L +K + R ++ Y G L
Sbjct: 730 KTGAGVQCDINFSA--HLALHNTALLRCYSHTDPRVRPMVLFVKHWAKTRGINSGYRGTL 787
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 788 SSYGYVLMVLHYL 800
>gi|389642869|ref|XP_003719067.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|351641620|gb|EHA49483.1| DNA polymerase sigma [Magnaporthe oryzae 70-15]
gi|440474598|gb|ELQ43333.1| DNA polymerase sigma [Magnaporthe oryzae Y34]
gi|440486580|gb|ELQ66430.1| DNA polymerase sigma [Magnaporthe oryzae P131]
Length = 703
Score = 43.5 bits (101), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 58/260 (22%), Positives = 94/260 (36%), Gaps = 79/260 (30%)
Query: 1213 LHEEIDSF---CKQVAAENTARKPYINWAVKRVTRSL---QVLWPRSRTNIFGSNATGLS 1266
LH+EI F K E R+ I+ K + S ++P FGS + L
Sbjct: 374 LHKEIVDFYNYAKPRDFEEKLRQGLIDELAKLIRNSQFRDATVYP------FGSFKSNLY 427
Query: 1267 LPSSDVDLVVCLPPVRNLEPIKEAGILEGRNGIKETCLQHAARYLANQEWVKSDSLKTVE 1326
LP+ D+DLV C + GR AARY + K + +
Sbjct: 428 LPTGDMDLVFC-----------SDSYMSGR----------AARYSSKNHVFKFGAFIERK 466
Query: 1327 NTAIPIIMLVVEVPHDLIASAASSVQSPKEDAAHTTLKHDNHVHSDMVALDDSASPKCSH 1386
A+ DNHV + + P +
Sbjct: 467 QLAV-----------------------------------DNHVEK----ISKARVPLVKY 487
Query: 1387 TSSDNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQ 1446
+ + T +++D+SF++ TG++ + EQFP L +K FLA R L++
Sbjct: 488 -----VDSRTGLKVDVSFEN--ITGIRAIETFLAWREQFPDMPVLVTCIKHFLAMRGLNE 540
Query: 1447 SYSGGLSSYCLMLLITRFLQ 1466
+GG+ ++ L+ LQ
Sbjct: 541 PANGGIGGTTVICLVVSMLQ 560
>gi|398408115|ref|XP_003855523.1| hypothetical protein MYCGRDRAFT_21922, partial [Zymoseptoria tritici
IPO323]
gi|339475407|gb|EGP90499.1| hypothetical protein MYCGRDRAFT_21922 [Zymoseptoria tritici IPO323]
Length = 486
Score = 43.5 bits (101), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
++ DI+F +P GL T +++ + P P+ L +K + R ++ SYSG LSS
Sbjct: 237 GVGIQCDINFFNP--LGLHNTQMLRCYSMCDPRVRPMVLFVKSWAKQRKINSSYSGTLSS 294
Query: 1455 YCLMLLITRFLQHEHHLGRP 1474
Y +L++ +L ++ RP
Sbjct: 295 YGYVLMVLHYLM---NVARP 311
>gi|358378134|gb|EHK15816.1| hypothetical protein TRIVIDRAFT_195758 [Trichoderma virens Gv29-8]
Length = 1020
Score = 43.5 bits (101), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K+ V+ DI+F + H LQ T L++ + P P+ L +K + R ++ Y G L
Sbjct: 714 KSDVGVQCDINFSA--HLALQNTLLLRCYSHTDPRVRPMILFVKHWAKMRGINSGYRGTL 771
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 772 SSYGYVLMVLHYL 784
>gi|344278690|ref|XP_003411126.1| PREDICTED: terminal uridylyltransferase 4 [Loxodonta africana]
Length = 1643
Score = 43.5 bits (101), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 29/54 (53%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L ++ P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 482 TTDLLAALGKREPVFTPLVLAFRYWAKLCHIDSQTDGGIPSYCFALMVMFFLQQ 535
>gi|345569402|gb|EGX52268.1| hypothetical protein AOL_s00043g57 [Arthrobotrys oligospora ATCC
24927]
Length = 747
Score = 43.5 bits (101), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 50/230 (21%), Positives = 89/230 (38%), Gaps = 53/230 (23%)
Query: 1239 VKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEGRNG 1298
++R+T L P SR FGS +G + +D+D++ +
Sbjct: 281 LRRITEICNNLVPGSRIIPFGSLVSGFATKGADMDVIFA------------------HDS 322
Query: 1299 IKETCLQHAARY---LANQEWVKSDSLKTVENTAIPIIMLVVEVPHDLIASAASSVQSPK 1355
+ H + LAN+ + + + T +PI L ++ P + S P
Sbjct: 323 LHPQPFSHESNVPVRLANEFLKRGFEVDLLIRTRVPI--LKIKTPSNDPGS------RPG 374
Query: 1356 EDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQTT 1415
+A LK D LD+ P+ ++ DI FK+ H G+ +
Sbjct: 375 SPSAQDALKED---------LDEEPWPE-------------NISCDIGFKA--HLGITNS 410
Query: 1416 DLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFL 1465
+ ++ + L +KQ+ +R L+ Y G LSSY +L++ FL
Sbjct: 411 YFFRTYSQCDSRFREMVLFVKQWSKNRDLNSPYFGTLSSYGYVLMVAHFL 460
>gi|123449289|ref|XP_001313365.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
G3]
gi|121895246|gb|EAY00436.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
G3]
Length = 346
Score = 43.5 bits (101), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 35/71 (49%), Gaps = 2/71 (2%)
Query: 1397 SVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYC 1456
+ +DISF G ++E+ P P + LK L LDQ + GG+SSY
Sbjct: 153 GISIDISFDE--LHGPLCVQTIREIFRTIPCILPAQIFLKAMLRRNKLDQPFLGGISSYT 210
Query: 1457 LMLLITRFLQH 1467
L L+I ++Q+
Sbjct: 211 LQLMILAYVQY 221
>gi|350586191|ref|XP_003128036.3| PREDICTED: terminal uridylyltransferase 4 [Sus scrofa]
Length = 1376
Score = 43.5 bits (101), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 28/57 (49%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
TTDL+ L + P PL L + + +D GG+ SYC L++ FLQ H
Sbjct: 210 TTDLLAALGKMEPVFIPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQRKH 266
>gi|384488317|gb|EIE80497.1| hypothetical protein RO3G_05202 [Rhizopus delemar RA 99-880]
Length = 403
Score = 43.5 bits (101), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 40/72 (55%), Gaps = 7/72 (9%)
Query: 1210 HSLLHEEIDSFCKQVA---AENTARKPYINWAVKRVTRSLQVLWPRSRTNIFGSNATGLS 1266
+ + +E+D F + + E R+ V+R+++++ WP + ++FGS AT L
Sbjct: 120 YDIFRQEVDCFLRYIEPTEIEMELRES----IVQRISKAISKQWPDAEVSVFGSFATCLF 175
Query: 1267 LPSSDVDLVVCL 1278
LP+SD+DL +
Sbjct: 176 LPNSDIDLDITF 187
>gi|223999357|ref|XP_002289351.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974559|gb|EED92888.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1716
Score = 43.5 bits (101), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 1213 LHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPRS-RTNIFGSNATGLSLPSSD 1271
L EEI++F + + R+ + + + +L +WP S ++GS AT L LPSSD
Sbjct: 926 LSEEINTFMYRRSLALDIRRKERSAVLSALQDTLGRIWPGSCHVEMYGSCATELDLPSSD 985
Query: 1272 VDLVVC 1277
+DLVVC
Sbjct: 986 LDLVVC 991
Score = 41.6 bits (96), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 29/66 (43%), Positives = 40/66 (60%), Gaps = 12/66 (18%)
Query: 1400 LDISFKSPSHTGLQTT--------DLVKELTEQFPASTPL---ALVLKQFLADRSLDQSY 1448
+DI+F+ P H G+ +T D KE T P STP+ A+VLK+ LA R L++ +
Sbjct: 1254 IDITFEGPEHGGIGSTTYSACVVQDACKE-TNLPPESTPVVQVAMVLKELLAQRRLNEPF 1312
Query: 1449 SGGLSS 1454
SGGLSS
Sbjct: 1313 SGGLSS 1318
>gi|123440101|ref|XP_001310815.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
G3]
gi|121892600|gb|EAX97885.1| PAP/25A associated domain containing protein [Trichomonas vaginalis
G3]
Length = 439
Score = 43.5 bits (101), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 7/102 (6%)
Query: 1191 LQPLEVQ-NCP---TRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINW-AVKRVTRS 1245
L P +VQ +CP + S S + +H EI F + + T + ++ + +KR +
Sbjct: 15 LHPCQVQADCPWCNGMEYSHSSTYLCMHREILDFARWILP--TEEEKHLRYLVIKRFRVA 72
Query: 1246 LQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPI 1287
+ LWP ++ GS ATG LP+ D+D V P N E +
Sbjct: 73 INQLWPNAKVICHGSTATGTFLPNGDLDFCVLGAPSGNDEDL 114
>gi|219125772|ref|XP_002183147.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405422|gb|EEC45365.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 707
Score = 43.5 bits (101), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 34/64 (53%)
Query: 1409 HTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHE 1468
H G T+ TE++ + P+ L LK L +LD+ ++GGL SY L +L+ ++
Sbjct: 523 HNGSDTSPYAGSQTEKYRSFAPVVLALKVVLQQTNLDEPFAGGLGSYKLYVLVAYHIEQH 582
Query: 1469 HHLG 1472
LG
Sbjct: 583 LLLG 586
>gi|407405189|gb|EKF30316.1| hypothetical protein MOQ_005876 [Trypanosoma cruzi marinkellei]
Length = 1890
Score = 43.5 bits (101), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT +R DI+F H ++ L L +++PAS L L LK + + S GG++S
Sbjct: 761 ATGMRGDITFAGGEH--WRSMQLTNHLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 818
Query: 1455 YCLMLLITRFL 1465
+ + L++ F
Sbjct: 819 FAIYLMVLHFF 829
>gi|448519050|ref|XP_003868035.1| non-canonical poly(A) polymerase [Candida orthopsilosis Co 90-125]
gi|380352374|emb|CCG22600.1| non-canonical poly(A) polymerase [Candida orthopsilosis]
Length = 604
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 17/101 (16%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEAGILEG 1295
N + + R + WP + ++FGS AT L LP SD+D+VV I G E
Sbjct: 187 NNVINTLKREVSSFWPGTEAHVFGSCATDLYLPGSDIDMVV----------ISSTGDYEN 236
Query: 1296 RNGIKETCLQHAARYLANQEWVKSDSLKTVENTAIPIIMLV 1336
R+ + + A+ LA +++ + + +PII V
Sbjct: 237 RSRLYQLSSFLRAKNLAK-------NVEVIASAKVPIIKFV 270
>gi|303285524|ref|XP_003062052.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456463|gb|EEH53764.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 842
Score = 43.1 bits (100), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 35/78 (44%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T R+D + GL +K E FPA PL L LK+ L +LD + +GG Y
Sbjct: 664 TFARMDPRPSVGAVNGLDAVAWIKRQCETFPALRPLVLTLKRLLKTHALDDASTGGCGGY 723
Query: 1456 CLMLLITRFLQHEHHLGR 1473
L+ L L+ G+
Sbjct: 724 LLVSLAVSHLRLSGDAGK 741
>gi|440467546|gb|ELQ36762.1| caffeine-induced death protein 1 [Magnaporthe oryzae Y34]
gi|440488651|gb|ELQ68366.1| caffeine-induced death protein 1 [Magnaporthe oryzae P131]
Length = 1067
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K+ V+ DI+F + H LQ T L++ + P P+ L +K + R ++ Y G L
Sbjct: 750 KSGVGVQCDINFSA--HLALQNTLLLRCYSYVDPRVRPMVLFVKHWAKARGINTPYRGTL 807
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 808 SSYGYVLMVLHYL 820
>gi|449664468|ref|XP_002160563.2| PREDICTED: terminal uridylyltransferase 7-like [Hydra magnipapillata]
Length = 338
Score = 43.1 bits (100), Expect = 1.2, Method: Composition-based stats.
Identities = 22/66 (33%), Positives = 37/66 (56%)
Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QT+DL+ + T LA++++ + LD + +GGLS YC +L++ +LQH
Sbjct: 212 QTSDLIIKYTNADKRCAKLAMLIRMWSKVCGLDNADNGGLSPYCFVLMVINYLQHTEPPV 271
Query: 1473 RPINQV 1478
PI +V
Sbjct: 272 LPIIKV 277
>gi|408391691|gb|EKJ71060.1| hypothetical protein FPSE_08724 [Fusarium pseudograminearum CS3096]
Length = 1059
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K V+ DI+F + H L T L++ + P P+ L +K + R ++ Y G L
Sbjct: 737 KTGAGVQCDINFSA--HLALHNTALLRCYSHTDPRVRPMVLFVKNWAKIRGINSGYRGTL 794
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 795 SSYGYVLMVLHYL 807
>gi|389634385|ref|XP_003714845.1| caffeine-induced death protein 1 [Magnaporthe oryzae 70-15]
gi|351647178|gb|EHA55038.1| caffeine-induced death protein 1 [Magnaporthe oryzae 70-15]
Length = 1067
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K+ V+ DI+F + H LQ T L++ + P P+ L +K + R ++ Y G L
Sbjct: 750 KSGVGVQCDINFSA--HLALQNTLLLRCYSYVDPRVRPMVLFVKHWAKARGINTPYRGTL 807
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 808 SSYGYVLMVLHYL 820
>gi|407844181|gb|EKG01837.1| hypothetical protein TCSYLVIO_007151 [Trypanosoma cruzi]
Length = 1878
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT +R DI+F H ++ L L +++PAS L L LK + + S GG++S
Sbjct: 761 ATEMRGDITFAGGEH--WRSMQLTNRLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 818
Query: 1455 YCLMLLITRFL 1465
+ + L++ F
Sbjct: 819 FAIYLMVLHFF 829
>gi|169599845|ref|XP_001793345.1| hypothetical protein SNOG_02748 [Phaeosphaeria nodorum SN15]
gi|160705333|gb|EAT89479.2| hypothetical protein SNOG_02748 [Phaeosphaeria nodorum SN15]
Length = 607
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 1/71 (1%)
Query: 1400 LDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLML 1459
LD+ + T L + DL+K EQ+P L LV+K R L + GG SY L +
Sbjct: 303 LDVQVVLSNDTSL-SRDLMKGYMEQYPYLRQLHLVVKTMFDVRGLTDVFRGGFGSYTLFM 361
Query: 1460 LITRFLQHEHH 1470
+I ++H+ H
Sbjct: 362 MIVASIRHKPH 372
>gi|116196370|ref|XP_001223997.1| hypothetical protein CHGG_04783 [Chaetomium globosum CBS 148.51]
gi|88180696|gb|EAQ88164.1| hypothetical protein CHGG_04783 [Chaetomium globosum CBS 148.51]
Length = 1123
Score = 43.1 bits (100), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 40/73 (54%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K+ V+ DI+F + H G+Q T L++ + P PL L +K + R ++ +Y G L
Sbjct: 747 KSGVGVQCDINFSA--HLGVQNTLLLRCYSHCDPRVRPLILFVKHWAKVRRINSAYRGTL 804
Query: 1453 SSYCLMLLITRFL 1465
SY +L++ +L
Sbjct: 805 GSYGYVLMMLHYL 817
>gi|71415610|ref|XP_809866.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70874312|gb|EAN88015.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1887
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT +R DI+F H ++ L L +++PAS L L LK + + S GG++S
Sbjct: 761 ATEMRGDITFAGGEH--WRSMQLTNRLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 818
Query: 1455 YCLMLLITRFL 1465
+ + L++ F
Sbjct: 819 FAIYLMVLHFF 829
>gi|340521600|gb|EGR51834.1| predicted protein [Trichoderma reesei QM6a]
Length = 841
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K V+ DI+F + H LQ T L++ + P P+ L +K + R ++ Y G L
Sbjct: 534 KTDVGVQCDINFSA--HLALQNTLLLRCYSHTDPRVRPMILFVKHWAKMRGINSGYRGTL 591
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 592 SSYGYVLMVLHYL 604
>gi|395855062|ref|XP_003799990.1| PREDICTED: terminal uridylyltransferase 4 [Otolemur garnettii]
Length = 1620
Score = 43.1 bits (100), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 45/113 (39%), Gaps = 17/113 (15%)
Query: 1355 KEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQT 1414
K+ A++ ++ D H +V D S S+ N A T
Sbjct: 438 KKTASYVDVESDFHAKVPVVVCKDRKSGLLCRVSAGNDMACL-----------------T 480
Query: 1415 TDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 481 TDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 533
>gi|451847669|gb|EMD60976.1| hypothetical protein COCSADRAFT_149352 [Cochliobolus sativus ND90Pr]
Length = 723
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D K ++ DI+F +P G+ T +++ + P P+ L +K + R ++ +YS
Sbjct: 446 DFPKDGCGIQCDINFANP--LGIHNTHMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSAYS 503
Query: 1450 GGLSSYCLMLLITRFL 1465
G LSSY +L++ +L
Sbjct: 504 GTLSSYGWVLMVLHYL 519
>gi|46116842|ref|XP_384439.1| hypothetical protein FG04263.1 [Gibberella zeae PH-1]
Length = 1052
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 2/73 (2%)
Query: 1393 KAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGL 1452
K V+ DI+F + H L T L++ + P P+ L +K + R ++ Y G L
Sbjct: 730 KTGAGVQCDINFSA--HLALHNTALLRCYSHTDPRVRPMVLFVKNWAKIRGINSGYRGTL 787
Query: 1453 SSYCLMLLITRFL 1465
SSY +L++ +L
Sbjct: 788 SSYGYVLMVLHYL 800
>gi|154308271|ref|XP_001553472.1| hypothetical protein BC1G_07881 [Botryotinia fuckeliana B05.10]
Length = 1246
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 1412 LQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L+ T ++K E P PLA+++K + R+++ + G LSSY + +I FLQ
Sbjct: 395 LENTRMIKTYIEIDPRVRPLAMIIKHWTKSRAINDAVGGTLSSYTWICMIINFLQ 449
>gi|71406967|ref|XP_805982.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70869592|gb|EAN84131.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 1625
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT +R DI+F H ++ L L +++PAS L L LK + + S GG++S
Sbjct: 488 ATEMRGDITFAGGEH--WRSMQLTNHLLKRYPASRGLILFLKHCVRQMGIGDSQPGGVTS 545
Query: 1455 YCLMLLITRFL 1465
+ + L++ F
Sbjct: 546 FAIYLMVLHFF 556
>gi|332635009|ref|NP_001193859.1| terminal uridylyltransferase 4 [Bos taurus]
gi|296489128|tpg|DAA31241.1| TPA: Caffeine Induced Death homolog family member (cid-1)-like [Bos
taurus]
Length = 1639
Score = 42.7 bits (99), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 479 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532
>gi|348554637|ref|XP_003463132.1| PREDICTED: LOW QUALITY PROTEIN: terminal uridylyltransferase 4-like
[Cavia porcellus]
Length = 1620
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%)
Query: 1410 TGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
T TTDL+ L + P TPL L + + +D GG+ SYC +L+ FLQ
Sbjct: 462 TACLTTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFVLMTMFFLQQ 519
>gi|451996743|gb|EMD89209.1| hypothetical protein COCHEDRAFT_1196132 [Cochliobolus heterostrophus
C5]
Length = 718
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 2/76 (2%)
Query: 1390 DNIKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYS 1449
D K ++ DI+F +P G+ T +++ + P P+ L +K + R ++ +YS
Sbjct: 441 DFPKDGCGIQCDINFANP--LGIHNTHMLRCYSLTDPRVRPIVLFVKSWAKRRKVNSAYS 498
Query: 1450 GGLSSYCLMLLITRFL 1465
G LSSY +L++ +L
Sbjct: 499 GTLSSYGWVLMVLHYL 514
>gi|347441079|emb|CCD34000.1| similar to zinc finger protein [Botryotinia fuckeliana]
Length = 1243
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 32/55 (58%)
Query: 1412 LQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQ 1466
L+ T ++K E P PLA+++K + R+++ + G LSSY + +I FLQ
Sbjct: 392 LENTRMIKTYIEIDPRVRPLAMIIKHWTKSRAINDAVGGTLSSYTWICMIINFLQ 446
>gi|338721674|ref|XP_003364417.1| PREDICTED: terminal uridylyltransferase 4 [Equus caballus]
Length = 1642
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 482 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 535
>gi|440906885|gb|ELR57101.1| Terminal uridylyltransferase 4 [Bos grunniens mutus]
Length = 1665
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 500 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 553
>gi|426215544|ref|XP_004002031.1| PREDICTED: terminal uridylyltransferase 4 isoform 2 [Ovis aries]
Length = 1643
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 479 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532
>gi|426215542|ref|XP_004002030.1| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Ovis aries]
Length = 1643
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 479 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 532
>gi|345480897|ref|XP_003424241.1| PREDICTED: hypothetical protein LOC100679579 [Nasonia vitripennis]
Length = 537
Score = 42.7 bits (99), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 55/242 (22%), Positives = 93/242 (38%), Gaps = 39/242 (16%)
Query: 972 SPFDPLGPGHQAFSYVVPGN-EVPGKVLHSSSTTTDVATEEEISGSFASLSGDV-DSKAL 1029
+PF PG + + N + K++H+ ++T TEE++ FA + + D K L
Sbjct: 135 TPFKNNRPGQHWYRAFMKRNPNLSNKIVHNHASTRAPVTEEDLRDWFAKIRLHLEDKKLL 194
Query: 1030 DTLPCPILRP-----IIIPNLSRERSRSDFKRSHEHKSPCVPPSRREQPRIKRPPSPVVL 1084
+ P I ++ P ++ + K HE S ++ S V+
Sbjct: 195 NIEPHRIFNAGESTFMLTPKYNKVLTEEGVKAPHEIVSSN-----------EKDTSTVLF 243
Query: 1085 CVPRAPRPPPPSPVSDSRKTRGFPTVRSGSSSPRHWGVRGWYHEGTTSEEGCVRMDGSEV 1144
+ + PPP + D +KT P P+ WGV TTSE +
Sbjct: 244 SISASGVMPPPMILFDFKKT---PKKSVIDKIPKGWGVVNTERGLTTSESFYSYITN--- 297
Query: 1145 VWPSWRNKNLSAHPMIQPLSG----------ALLQDHLIAISQLARDQEHPDVAFPLQPL 1194
V+ W +N P+I + G +DHLI + L +P+ + PL
Sbjct: 298 VFYKWLKENNYVFPVILYVDGRSSQITLPLLKFCKDHLIELIIL-----YPNATHIIHPL 352
Query: 1195 EV 1196
+V
Sbjct: 353 DV 354
>gi|149693612|ref|XP_001490500.1| PREDICTED: terminal uridylyltransferase 4 isoform 2 [Equus caballus]
Length = 1647
Score = 42.7 bits (99), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 482 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 535
>gi|402079697|gb|EJT74962.1| caffeine-induced death protein 1 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 1114
Score = 42.7 bits (99), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
V+ DI+F + H LQ T L++ + P PL L +K + R ++ +Y G LSS
Sbjct: 783 GAGVQCDINFSA--HLALQNTLLLRCYSHTDPRVRPLVLFVKHWAKVRGINTAYRGTLSS 840
Query: 1455 YCLMLLITRFL 1465
Y +L++ +L
Sbjct: 841 YGYVLMMLHYL 851
>gi|345780735|ref|XP_532573.3| PREDICTED: terminal uridylyltransferase 4 isoform 1 [Canis lupus
familiaris]
Length = 1625
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 462 TTDLLAALGKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVLFFLQQ 515
>gi|357623592|gb|EHJ74680.1| hypothetical protein KGM_08898 [Danaus plexippus]
Length = 443
Score = 42.4 bits (98), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 2/98 (2%)
Query: 1193 PLEVQNCPTRKASLSLMHSLLHEEIDSFCKQVAAENTARKPYINWAVKRVTRSLQVLWPR 1252
P +V+ P + S +L E D F K E T RK W + + S++ ++PR
Sbjct: 120 PFQVKFTPDNLLNGSEWDTLSQEIWDKFVKSQQTEETFRKKMNLW--RYLYISIKSMFPR 177
Query: 1253 SRTNIFGSNATGLSLPSSDVDLVVCLPPVRNLEPIKEA 1290
+ GS +G L SSD+DL + + + LEP A
Sbjct: 178 YGLYVVGSTMSGFGLDSSDMDLCLHVRALAELEPRAHA 215
>gi|330796515|ref|XP_003286312.1| hypothetical protein DICPUDRAFT_150267 [Dictyostelium purpureum]
gi|325083739|gb|EGC37184.1| hypothetical protein DICPUDRAFT_150267 [Dictyostelium purpureum]
Length = 2271
Score = 42.0 bits (97), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 40/72 (55%), Gaps = 2/72 (2%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T + D+ F G+ ++L+KE ++ P L L++K + + ++ + G SS+
Sbjct: 2048 TGTQFDMCFNH--EIGIYKSNLIKEYSDLDPRCKELILLVKYWAQQKDINDASKGTFSSF 2105
Query: 1456 CLMLLITRFLQH 1467
CL+L++ FLQ+
Sbjct: 2106 CLVLMVIHFLQY 2117
>gi|422296106|gb|EKU23405.1| nucleotidyltransferase-like protein [Nannochloropsis gaditana
CCMP526]
Length = 432
Score = 42.0 bits (97), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 2/72 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
T + DIS ++P L+ T L+ E ++ P LA ++K + R ++ + G LSS
Sbjct: 113 GTGLDCDISVENP--LALRNTQLLHEYSQVDPRVRALAYIVKHWARARKINNASGGTLSS 170
Query: 1455 YCLMLLITRFLQ 1466
Y +L++ FLQ
Sbjct: 171 YAYILMVLHFLQ 182
>gi|363751202|ref|XP_003645818.1| hypothetical protein Ecym_3523 [Eremothecium cymbalariae DBVPG#7215]
gi|356889452|gb|AET39001.1| Hypothetical protein Ecym_3523 [Eremothecium cymbalariae DBVPG#7215]
Length = 683
Score = 42.0 bits (97), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%)
Query: 1236 NWAVKRVTRSLQVLWPRSRTNIFGSNATGLSLPSSDVDLVV 1276
N A+ ++ ++++ WP S + FGS AT L LP SD+D VV
Sbjct: 210 NDAITKIRKAVKSFWPDSDLHCFGSYATDLYLPGSDIDCVV 250
>gi|351697778|gb|EHB00697.1| Terminal uridylyltransferase 4 [Heterocephalus glaber]
Length = 1668
Score = 42.0 bits (97), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L++ P TPL L + + +D GG+ SYC L+ FLQ
Sbjct: 504 TTDLLAALSKMEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMAMFFLQQ 557
>gi|187956551|gb|AAI50792.1| Zinc finger, CCHC domain containing 11 [Mus musculus]
Length = 1644
Score = 42.0 bits (97), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 499 TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 552
>gi|83977461|ref|NP_780681.2| terminal uridylyltransferase 4 [Mus musculus]
gi|259554115|sp|B2RX14.2|TUT4_MOUSE RecName: Full=Terminal uridylyltransferase 4; Short=TUTase 4;
AltName: Full=Zinc finger CCHC domain-containing protein
11
Length = 1644
Score = 42.0 bits (97), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 499 TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 552
>gi|60360306|dbj|BAD90397.1| mKIAA0191 protein [Mus musculus]
Length = 1556
Score = 42.0 bits (97), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 44/113 (38%), Gaps = 17/113 (15%)
Query: 1355 KEDAAHTTLKHDNHVHSDMVALDDSASPKCSHTSSDNIKAATSVRLDISFKSPSHTGLQT 1414
K+ A + ++ D H +V D S S+ N A T
Sbjct: 369 KKSALYIDVESDFHAKVPVVVCKDRKSALLCRVSAGNDMACL-----------------T 411
Query: 1415 TDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 412 TDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 464
>gi|340506956|gb|EGR32991.1| hypothetical protein IMG5_064460 [Ichthyophthirius multifiliis]
Length = 347
Score = 41.6 bits (96), Expect = 3.3, Method: Composition-based stats.
Identities = 21/75 (28%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
++ +T DISF G++ V++ + +P L ++K L R L+++Y+GG
Sbjct: 152 VEKSTQTNFDISFNK--EDGVKQLPEVQKCLQIYPEIKYLIFIMKCILRQRDLNETYTGG 209
Query: 1452 LSSYCLMLLITRFLQ 1466
+ S+ L +I FL+
Sbjct: 210 IGSFLLFCMILAFLR 224
>gi|449688600|ref|XP_002162478.2| PREDICTED: terminal uridylyltransferase 4-like, partial [Hydra
magnipapillata]
Length = 215
Score = 41.6 bits (96), Expect = 3.5, Method: Composition-based stats.
Identities = 21/66 (31%), Positives = 37/66 (56%)
Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHHLG 1472
QT++L+ + T LA++++ + LD + +GGLS YC +L++ +LQH
Sbjct: 68 QTSNLIIKYTNADERCIKLAMLIRMWSKACGLDNADNGGLSPYCFVLMVINYLQHTEPPV 127
Query: 1473 RPINQV 1478
PI +V
Sbjct: 128 LPIIRV 133
>gi|396499098|ref|XP_003845390.1| hypothetical protein LEMA_P006980.1 [Leptosphaeria maculans JN3]
gi|312221971|emb|CBY01911.1| hypothetical protein LEMA_P006980.1 [Leptosphaeria maculans JN3]
Length = 617
Score = 41.6 bits (96), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 17/58 (29%), Positives = 30/58 (51%)
Query: 1413 QTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQHEHH 1470
Q+ D+++ +++P L V+K R L + GG SY L ++I +QH+ H
Sbjct: 326 QSRDIIQRYLKEYPYLLTLFTVIKTIFEIRGLSDVFRGGFGSYSLFMMIVASIQHQPH 383
>gi|330914987|ref|XP_003296862.1| hypothetical protein PTT_07070 [Pyrenophora teres f. teres 0-1]
gi|311330792|gb|EFQ95034.1| hypothetical protein PTT_07070 [Pyrenophora teres f. teres 0-1]
Length = 646
Score = 41.6 bits (96), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 37/79 (46%), Gaps = 8/79 (10%)
Query: 1199 CPTRKASLSLMHSLLHEEIDSFCKQ---VAAENTARKPYINWAVKRVTRSLQVLWPRSRT 1255
C R + S +H L EID F K E ARK I+ V + P+
Sbjct: 189 CVNRDRATSGLHRL-DMEIDKFYKYSKPTPHEQNARKQVIDQVRNHVLEQV----PKHHI 243
Query: 1256 NIFGSNATGLSLPSSDVDL 1274
+FGS TGL+LP+SD+D
Sbjct: 244 EVFGSQITGLALPTSDIDF 262
>gi|291385635|ref|XP_002709312.1| PREDICTED: death domain containing 1 [Oryctolagus cuniculus]
Length = 901
Score = 41.2 bits (95), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 47/187 (25%), Positives = 74/187 (39%), Gaps = 30/187 (16%)
Query: 528 VAGKGRTNARKTKTVKNKN-KNCTYNNPVPVKDPK-------------------VSVLET 567
+A +G NA +T TV N + Y N V K V+ E
Sbjct: 142 IAARGELNAIETATVSPTNGEESHYTNQVQFSTNKAHINSELLEEENNASLNGNVTEKED 201
Query: 568 SSSISLQDEVEKYDKLSAQNVSVDNSTCS-----NVLASNQSSCTSASVPAREGIATQST 622
S + L D E D ++++++N S +V+ + + S RE I T ST
Sbjct: 202 SENKMLSDNAENEDDKQIEHMTIENLNGSRENIHDVMQATEREIHETSESQREEITTSST 261
Query: 623 QEDCVVNSVN-----SECRRFSNGRIDNQTQHFLQETTDSKVECNIISPDMPARDLDNAF 677
D N VN SE + N ++ + L + TD++V C I +P + L
Sbjct: 262 TCDISSNDVNRLPKDSESLKQKNNTMEKEYLEVLNDGTDAQVSCYITAPSNVLQQLVCRI 321
Query: 678 GNSISGI 684
NS+S +
Sbjct: 322 INSMSSL 328
>gi|76363699|ref|XP_888584.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|12311885|emb|CAC22701.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 3012
Score = 40.8 bits (94), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
Query: 1395 ATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSS 1454
AT ++DI+F H ++ L + L E FP + L L LK + S +S GG++S
Sbjct: 1665 ATGTKVDITFAGGEH--WRSMQLTRSLLEVFPHARTLILFLKYCVRSLSAGESEPGGVTS 1722
Query: 1455 YCLMLLITRF 1464
+ + L++ F
Sbjct: 1723 FAIYLMVLHF 1732
>gi|26339984|dbj|BAC33655.1| unnamed protein product [Mus musculus]
Length = 435
Score = 40.8 bits (94), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 28/54 (51%)
Query: 1414 TTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSYCLMLLITRFLQH 1467
TTDL+ L + P TPL L + + +D GG+ SYC L++ FLQ
Sbjct: 289 TTDLLAALGKVEPVFTPLVLAFRYWAKLCYIDSQTDGGIPSYCFALMVMFFLQQ 342
>gi|308805280|ref|XP_003079952.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
gi|116058409|emb|CAL53598.1| DNA polymerase sigma (ISS) [Ostreococcus tauri]
Length = 322
Score = 40.4 bits (93), Expect = 7.6, Method: Composition-based stats.
Identities = 22/81 (27%), Positives = 39/81 (48%)
Query: 1392 IKAATSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGG 1451
+K+A+ + + + GL+ ++E E F A PL LK+ LA L+ + +GG
Sbjct: 1 MKSASPPYIAVDLSLGTTNGLEAVYWIREQVETFAALKPLVFYLKRLLATHHLNDASTGG 60
Query: 1452 LSSYCLMLLITRFLQHEHHLG 1472
Y L+ L+ L+ +G
Sbjct: 61 CGGYLLVSLVVSHLKQTGSVG 81
>gi|440296452|gb|ELP89279.1| PAP-associated domain containing protein, putative [Entamoeba
invadens IP1]
Length = 344
Score = 40.0 bits (92), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 43/82 (52%), Gaps = 4/82 (4%)
Query: 1396 TSVRLDISFKSPSHTGLQTTDLVKELTEQFPASTPLALVLKQFLADRSLDQSYSGGLSSY 1455
T++ +D+S G+ ++ L L + +AL +K + +L++ Y GG+ SY
Sbjct: 125 TTISIDLSINCEG--GIDSSALTHSLLTSSQFTQEIALFVKYLVFQNNLNEPYHGGIGSY 182
Query: 1456 CLMLLITRFLQH--EHHLGRPI 1475
++LL FL+ +H LGR +
Sbjct: 183 AIVLLTATFLKFYPQHSLGRAL 204
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,432,144,111
Number of Sequences: 23463169
Number of extensions: 1012528019
Number of successful extensions: 3155370
Number of sequences better than 100.0: 882
Number of HSP's better than 100.0 without gapping: 568
Number of HSP's successfully gapped in prelim test: 314
Number of HSP's that attempted gapping in prelim test: 3151594
Number of HSP's gapped (non-prelim): 3528
length of query: 1478
length of database: 8,064,228,071
effective HSP length: 156
effective length of query: 1322
effective length of database: 8,698,941,003
effective search space: 11500000005966
effective search space used: 11500000005966
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)