BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy4937
(356 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|332028068|gb|EGI68119.1| Alkylated DNA repair protein alkB-like protein 4 [Acromyrmex
echinatior]
Length = 300
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 164/364 (45%), Positives = 216/364 (59%), Gaps = 72/364 (19%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+R+C+ CE E+ S PN
Sbjct: 1 METVRPCGCKGIRSCLICETEYEI--------------------------------SKPN 28
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
+K + Q ++Y+YCP+CNK G D+ Y+ HPNHSG + GV
Sbjct: 29 ------------LKNEL---QKSKSYIYCPYCNKLLSGWDIDEYKKHPNHSGNLINYPGV 73
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YIK DFL E E E++MK LDDLPW SQSGRRKQNFGPKCNFKK+K++LG F GFP T+
Sbjct: 74 YIKLDFLKEDEAEELMKALDDLPWQPSQSGRRKQNFGPKCNFKKRKLQLGSFNGFPKTTQ 133
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+Q +F + +L F+T+EQCTLEYN E G+SID+H+DDCWIWGERI+TVN++GDSVLT+
Sbjct: 134 FVQKKFEEISLLHGFETVEQCTLEYNPERGASIDSHVDDCWIWGERIITVNVLGDSVLTM 193
Query: 241 TYYNGDSNKYNL----NSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRI 296
T Y G +YNL N + Y D S ++D ++V+R+
Sbjct: 194 TPYRGPITRYNLDYVSNYDFILKKDIYDDTHSLGIDD-----------------NIVVRL 236
Query: 297 PMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKA 352
PMPARSL+VL+G ARY+W+H VLR DV RRVC+AYRE T PY+ N + T+ + +A
Sbjct: 237 PMPARSLMVLYGPARYKWEHAVLRQDVTSRRVCLAYRELTPPYLKNGQHQKETSEILRQA 296
Query: 353 KCFF 356
F+
Sbjct: 297 SLFW 300
>gi|383858126|ref|XP_003704553.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Megachile rotundata]
Length = 300
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/360 (45%), Positives = 218/360 (60%), Gaps = 66/360 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+R+C+ CE E+ NIV
Sbjct: 1 METVRPCGCKGIRSCLICEAEY-----NIV------------------------------ 25
Query: 61 HSGKSFDIGGVYIKEDFLTSQTK-ETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGG 119
K DF + K +YVYCP C+K+W G D Y++HPNH GK+ + G
Sbjct: 26 -------------KPDFKSCLEKCSSYVYCPLCDKSWPGWDADLYKTHPNHQGKAIEYPG 72
Query: 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACT 179
VYI+ DFL+ E ++K+LD++PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP T
Sbjct: 73 VYIQLDFLNSCEITSLVKELDNIPWETSQSGRRKQNFGPKCNFKKRKLRLGSFNGFPKPT 132
Query: 180 KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
KF+Q++ + +L+ FQTIEQCTLEY+ G+SID HIDDCWIWGERIVTVN++G+SVLT
Sbjct: 133 KFVQEKLSKVPILADFQTIEQCTLEYDPLRGASIDPHIDDCWIWGERIVTVNVMGNSVLT 192
Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMP 299
+T Y+G KYNL S + ST DL + +S D + +V+R+PMP
Sbjct: 193 MTPYHGLDTKYNLKS---------ITEYSTTCKDLKFDVESED----DNNDDIVVRLPMP 239
Query: 300 ARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN----NEATASVFEKAKCF 355
SL+VL+G ARY W+HCVLR D+ RR+C+AYREFT PY+N NE + ++A F
Sbjct: 240 EGSLMVLYGLARYSWEHCVLREDITSRRICLAYREFTPPYLNDTLKNEEVREILKRALIF 299
>gi|345485379|ref|XP_001605092.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Nasonia vitripennis]
Length = 294
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 157/362 (43%), Positives = 209/362 (57%), Gaps = 77/362 (21%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+RTC+ CE E+ ++ ++ CP+
Sbjct: 4 METVRPCGCKGIRTCLICENEYSISKPDLAADLKK----CPN------------------ 41
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
YVYCP CNKAW G D+ Y+ HP HSG + G
Sbjct: 42 -------------------------YVYCPECNKAWPGWDMNLYKQHPRHSGIPIEFPGT 76
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YIK DFL++ E + ++ LD LPW+ SQSGRRKQNFGPK NFKK+++++G+F GFP TK
Sbjct: 77 YIKLDFLNQDELKSLVNSLDLLPWEISQSGRRKQNFGPKTNFKKKRLRMGNFIGFPESTK 136
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+QD+F+ L +FQTIEQC+LEY+S+ G+SID HIDDCWIWGERIVTVN++GDSVLT+
Sbjct: 137 FVQDKFNDVKELKNFQTIEQCSLEYDSQRGASIDPHIDDCWIWGERIVTVNILGDSVLTM 196
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
T Y G SNKYNL+ + P+Q ++V+R+PMP
Sbjct: 197 TPYRGSSNKYNLDCVESLQNQAAPEQ------------------------NIVVRLPMPE 232
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN------NEATASVFEKAKC 354
SL+VL G ARY W+H VLR D+ RR+C+AYREFT PY+ NE + + E+A
Sbjct: 233 GSLMVLHGAARYSWEHAVLREDINLRRICLAYREFTPPYLPDDKGNCNEESREILERALN 292
Query: 355 FF 356
F+
Sbjct: 293 FW 294
>gi|66554392|ref|XP_623503.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Apis mellifera]
Length = 296
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 209/359 (58%), Gaps = 67/359 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+R+C+ CE E+ N+ E+
Sbjct: 1 METIRPCGCKGIRSCLLCETEYKIVKPNLKACFEKC------------------------ 36
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
+YVYCP+C KAW G + Y+SHPNH G + + GV
Sbjct: 37 -----------------------SSYVYCPYCEKAWPGWNFDLYKSHPNHQGTAIEFPGV 73
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YIK DFL+ E ++ L+ +PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP T+
Sbjct: 74 YIKLDFLTSWEINSLINILEKIPWEASQSGRRKQNFGPKCNFKKKKLQLGSFNGFPKSTQ 133
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+Q +F + +L++FQT+EQCTLEY+ G+SID HIDDCWIWGERIVTVN+IGDSVLT+
Sbjct: 134 FVQQKFSEIPILNNFQTVEQCTLEYDPIRGASIDPHIDDCWIWGERIVTVNVIGDSVLTM 193
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
+ Y+G KYNL S + S+ + + K+ + +++R+PMP
Sbjct: 194 SSYHGPDTKYNLKSII---------EYSSTIEGFDVKNKNEN-------EDIIVRLPMPE 237
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM----NNEATASVFEKAKCF 355
SL+VL+G ARY+W+HCVLR D+ RR+C+AYRE T PY+ NNE + ++A F
Sbjct: 238 GSLMVLYGIARYKWEHCVLREDINSRRICLAYRELTPPYLCNSTNNEVVKELLKRASVF 296
>gi|357617084|gb|EHJ70575.1| hypothetical protein KGM_10717 [Danaus plexippus]
Length = 324
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 156/358 (43%), Positives = 213/358 (59%), Gaps = 61/358 (17%)
Query: 7 CGCKGVRTCIKCEQEFGYEN-KNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
CGCKG RTC+ CE +G E KN+V+ ++
Sbjct: 6 CGCKGCRTCLICETYYGAEELKNLVKLDKD------------------------------ 35
Query: 66 FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKED 125
+ YV+CP CNKAW G D+ Y+ HP H G+S D GVYIK D
Sbjct: 36 ------------------KGYVFCPFCNKAWSGWDIDVYKQHPYHEGESIDYPGVYIKLD 77
Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
F+SE E ++M+ +D++PWD SQSGRRKQN+GPK NFKK+KI G F GFP ++++QDR
Sbjct: 78 FISEYEETELMRNIDEVPWDISQSGRRKQNYGPKTNFKKKKIVPGQFNGFPKFSQYLQDR 137
Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
F D+L ++ IEQC+LEY+ G+SID HIDDCW+WGERI+TVN + DSVLT+T + G
Sbjct: 138 FKTFDILKGYEVIEQCSLEYDPMKGASIDPHIDDCWVWGERILTVNCLSDSVLTMTPFKG 197
Query: 246 DSNKYNLNSPCFSYQKKYP---DQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARS 302
D+ KYNL K+YP T+ D + + ++ + V+IRIP+ RS
Sbjct: 198 DTIKYNLYC-----AKEYPPVVQDDGTVDMDFTSKQNMFEASKPKEDLDVIIRIPLIRRS 252
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEA----TASVFEKAKCFF 356
LL+++GE+RY W+H VLR D++ RRVCIAYREFT P+MNN + ++AK F+
Sbjct: 253 LLIIYGESRYHWEHRVLREDIVSRRVCIAYREFTPPFMNNGVHEILGREIRDRAKLFW 310
>gi|340727539|ref|XP_003402099.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Bombus terrestris]
Length = 300
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 151/359 (42%), Positives = 207/359 (57%), Gaps = 64/359 (17%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+R+C+ CE E N+ E+
Sbjct: 1 METVRSCGCKGIRSCLLCETEHKIVKPNLKTHFEKC------------------------ 36
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
+YVYCP+C+K+W G D Y++HP H G + + GV
Sbjct: 37 -----------------------SSYVYCPNCDKSWPGWDADLYKNHPYHRGTAIEFPGV 73
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YIK DFL+ E + L+ +PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP T+
Sbjct: 74 YIKLDFLNPWEVNSLKNALEKIPWEVSQSGRRKQNFGPKCNFKKRKLRLGSFDGFPKSTQ 133
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+Q +F Q +L++FQTIEQCTLEY+ G+SID HIDDCWIWGERIVTVN++G+SVLT+
Sbjct: 134 FVQQKFCQVPLLNNFQTIEQCTLEYDPLRGASIDPHIDDCWIWGERIVTVNVMGNSVLTM 193
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
+ Y+G KYNL S + STI+ + NE+ +V+R+PMP
Sbjct: 194 SPYHGPDTKYNLESVA---------EYSTILEGFEFDTNDK----NESNKDIVVRLPMPE 240
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM----NNEATASVFEKAKCF 355
SL++L+ ARY+W+HCVLR D+ +R+C+AYRE T PY+ NNE + ++A F
Sbjct: 241 GSLMILYDSARYKWEHCVLREDITSKRICLAYRELTPPYLCTSTNNEIVKELLKRASVF 299
>gi|350423117|ref|XP_003493390.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Bombus impatiens]
Length = 300
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 150/359 (41%), Positives = 207/359 (57%), Gaps = 64/359 (17%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+R+C+ CE E N+ E+
Sbjct: 1 METVRSCGCKGIRSCLLCETEHKIVKPNLKTHFEKC------------------------ 36
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
+YVYCP+C+K+W G D Y++HP H G + + GV
Sbjct: 37 -----------------------SSYVYCPNCDKSWPGWDADLYKNHPYHQGTAIEFPGV 73
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YIK DFL+ E + L+ +PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP T+
Sbjct: 74 YIKLDFLNPWEINSLKNALEKIPWEVSQSGRRKQNFGPKCNFKKRKLQLGSFDGFPKSTQ 133
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+Q +F Q +L++FQT+EQCTLEY+ G+SID HIDDCWIWGERIVTVN++G+SVLT+
Sbjct: 134 FVQQKFCQVPMLNNFQTVEQCTLEYDPLRGASIDPHIDDCWIWGERIVTVNVMGNSVLTM 193
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
+ Y+G KYNL S + STI+ + NE+ +V+R+PMP
Sbjct: 194 SPYHGPDTKYNLESVA---------EYSTILEGFEFDTNDK----NESNKDIVVRLPMPE 240
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM----NNEATASVFEKAKCF 355
SL++L+ ARY+W+HCVLR D+ +R+C+AYRE T PY+ NNE + ++A F
Sbjct: 241 GSLMILYDSARYKWEHCVLREDITSKRICLAYRELTPPYLCTSTNNEIVKELLKRASVF 299
>gi|307183142|gb|EFN70059.1| Alkylated DNA repair protein alkB-like protein 4 [Camponotus
floridanus]
Length = 298
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 208/360 (57%), Gaps = 66/360 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+R+C+ CE ++ ++ E
Sbjct: 1 METVRPCGCKGIRSCLICEAKYQILKPDLKEVL--------------------------- 33
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
Q ++YVYCP+C+K+ G D+ Y+ HP H G D GV
Sbjct: 34 --------------------QKSKSYVYCPYCDKSSPGWDIDEYKKHPYHDGNLIDYPGV 73
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YIK +FL+E E + +M+ LD LPW SQSGRRKQNFGPKCNFKK+K++LG F GFP T+
Sbjct: 74 YIKLNFLNEDEAKKLMEALDYLPWQASQSGRRKQNFGPKCNFKKRKLQLGTFNGFPKTTQ 133
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
FIQ++F + +L F+T+EQC+LEY+ + G+SID HIDDCWIWGERIVTVN++GDSVLT+
Sbjct: 134 FIQNKFAEIPLLHGFRTVEQCSLEYDPKRGASIDPHIDDCWIWGERIVTVNILGDSVLTM 193
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
T Y G +YNL+ C S S+ EK + + ++R+PMPA
Sbjct: 194 TPYRGSIMRYNLD--CVSSYN-------------SILEKDFCNNTQINIDDNIVRLPMPA 238
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356
RSL++L G ARY+W+H +LR D+ RRVC+AYRE T PY+ N + + + +A F+
Sbjct: 239 RSLMILHGPARYQWEHSILRQDITSRRVCLAYRELTPPYLENGEYHKEASEILRQALSFW 298
>gi|170051906|ref|XP_001861979.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872935|gb|EDS36318.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 318
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 157/365 (43%), Positives = 214/365 (58%), Gaps = 70/365 (19%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
MD CGCKG RTC++CE++F E
Sbjct: 1 MDHPRPCGCKGRRTCLQCEKDFAIER---------------------------------- 26
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL-YAYQSHPN-HSGKSFD-I 117
D V+ QT + Y YCPHCN+ + G D ++H H G+ +
Sbjct: 27 -----IDYRDVF--------QTCKAYSYCPHCNRIYPGWDTDRVMEAHAGGHDGQPGEEY 73
Query: 118 GGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPA 177
GV + D++SE E +MK +D++PWD SQSGRRKQNFGPK NFKK K++LG F+GFP
Sbjct: 74 PGVLVALDWVSEDEERALMKGIDEMPWDGSQSGRRKQNFGPKTNFKKNKLQLGQFRGFPQ 133
Query: 178 CTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
+KF+QDRF Q ++++FQTIEQC+LEY E G+SID HIDDCWIWGERIVTVNL+ DSV
Sbjct: 134 FSKFVQDRFEQVPLMANFQTIEQCSLEYTPERGASIDPHIDDCWIWGERIVTVNLLSDSV 193
Query: 238 LTLTYYN--GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
LT++ Y+ G++ +YNL+ +Y D+ ++ +L+ D E + V+R
Sbjct: 194 LTMSRYSAEGNATRYNLH-----LVDRYQDK---LITELA------DDASMERFENAVVR 239
Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEA----TASVFEK 351
IPMP RSLLV++G RY+W+HCVLR D+ RRVC+AYREFT Y+ + + VF K
Sbjct: 240 IPMPRRSLLVMYGPPRYQWEHCVLREDIRERRVCLAYREFTPMYLQEGSEYAKSEEVFCK 299
Query: 352 AKCFF 356
A+ F+
Sbjct: 300 AQTFW 304
>gi|91087937|ref|XP_971924.1| PREDICTED: similar to CG4036 CG4036-PA [Tribolium castaneum]
gi|270011936|gb|EFA08384.1| hypothetical protein TcasGA2_TC006028 [Tribolium castaneum]
Length = 306
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 157/365 (43%), Positives = 205/365 (56%), Gaps = 72/365 (19%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
MD+ CGCKG RTC+ CE FG
Sbjct: 1 MDKPRPCGCKGCRTCLICESTFGL------------------------------------ 24
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
S K F + ++TYVYCP CN A+ G D HP+HSG G+
Sbjct: 25 -SAKKF------------ARRREDTYVYCPQCNVAFPGWDP---PKHPDHSGAPITFPGI 68
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YI+ DFL+ E +M+ +DD+ WD SQSGRRKQNFGPKCNFKK K++ GDF GFP T+
Sbjct: 69 YIQLDFLTPEEEAKLMQGIDDMSWDSSQSGRRKQNFGPKCNFKKNKLREGDFTGFPKFTE 128
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+Q++F +LS FQTIEQC+LEY+ E G+SID H+DDCW+WGER+VTVNL+ DSVLT+
Sbjct: 129 FVQEKFRNVPILSDFQTIEQCSLEYDPEKGASIDPHVDDCWVWGERVVTVNLLSDSVLTM 188
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYP---DQKSTIVNDLSLNEK--STDSIVNETLSSVVIR 295
T YN +N YNL+ KYP D+K D + E N+ V R
Sbjct: 189 T-YNRKNNYYNLH-----LVGKYPAVLDEKG----DRNFTESCYKVKEFENDRFYPTV-R 237
Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEK 351
+PMP RSLLV++ ARY+W+H +LR D+ RRVC+AYREFT Y+ + A + +
Sbjct: 238 VPMPRRSLLVMFNSARYDWEHQILREDIRERRVCLAYREFTPIYLKGGDKEQEGAPILRQ 297
Query: 352 AKCFF 356
A+ F+
Sbjct: 298 ARNFW 302
>gi|158298439|ref|XP_318611.3| AGAP009588-PA [Anopheles gambiae str. PEST]
gi|157013881|gb|EAA14576.4| AGAP009588-PA [Anopheles gambiae str. PEST]
Length = 315
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 155/365 (42%), Positives = 207/365 (56%), Gaps = 68/365 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
MD CGCKG RTC+ CE EFG D Y+
Sbjct: 1 MDHPRPCGCKGRRTCLSCEAEFGIAP-----------------------IDFYS------ 31
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSD----LYAYQSHPNHSG-KSF 115
T Q ++YVYCP C+K + G D L + P H G +
Sbjct: 32 ------------------TFQNNDSYVYCPRCSKIYPGWDWEKVLAEHSDTPQHGGVQGE 73
Query: 116 DIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGF 175
D GVYI DFL+ E ++++ LD+LPWD SQSGRRKQNFGPK NFKK +++ F GF
Sbjct: 74 DYPGVYIDLDFLTTTEEAELLQGLDELPWDVSQSGRRKQNFGPKTNFKKTRLRPAQFAGF 133
Query: 176 PACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGD 235
P ++F+Q RF Q +L+ FQTIEQC+LEY E G+SID HIDDCWIWGERIVTVNL+ D
Sbjct: 134 PQLSEFVQRRFEQVPLLATFQTIEQCSLEYCPERGASIDPHIDDCWIWGERIVTVNLLSD 193
Query: 236 SVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
SVLT++ Y G+ K N + ++Y + ++ +L ++E++ N ++R
Sbjct: 194 SVLTMSPYRGEEGKTKYN---LHFLEQY---REHLLGEL-MDEEALAGYENR-----IVR 241
Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEA----TASVFEK 351
IPMP RSLLVL+G RY+W+H VLR D+ RRVC+AYREFT Y+ + + +FE+
Sbjct: 242 IPMPRRSLLVLYGPPRYQWEHSVLREDIKERRVCLAYREFTPMYLQGGSEFGQSEEIFER 301
Query: 352 AKCFF 356
AK F+
Sbjct: 302 AKQFW 306
>gi|157127235|ref|XP_001654880.1| hypothetical protein AaeL_AAEL010768 [Aedes aegypti]
gi|108872983|gb|EAT37208.1| AAEL010768-PA [Aedes aegypti]
Length = 313
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 157/366 (42%), Positives = 214/366 (58%), Gaps = 72/366 (19%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
MD CGCKG RTC+ CE++F E
Sbjct: 1 MDHPRPCGCKGRRTCLSCEKDFEIE----------------------------------- 25
Query: 61 HSGKSFDIGGVYIKEDFLTS-QTKETYVYCPHCNKAWQGSDLYAYQSH--PNHSGKSF-D 116
+ DFL ++ + Y YCP C+K + G D + NH G + D
Sbjct: 26 -------------RVDFLKIFKSSKAYSYCPRCHKIYPGWDTAKVMAEHDRNHGGDAGED 72
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
GVYI DFLSE+E ++MK +D++PWD SQSGRRKQNFGPK NFKK KIK+G+F+GFP
Sbjct: 73 YPGVYIDLDFLSESEEAELMKGIDEMPWDMSQSGRRKQNFGPKTNFKKTKIKVGNFEGFP 132
Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
++F+Q +F Q ++ FQ+IEQC+LEYN E G+SID HIDDCWIWGERIVTVN + DS
Sbjct: 133 KFSEFVQRKFDQVPLMKGFQSIEQCSLEYNPEKGASIDPHIDDCWIWGERIVTVNFLSDS 192
Query: 237 VLTLTYYNGD--SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVI 294
VLT++ Y + N+YNL+ + +Y D+ ++ +L ++E++ D ++
Sbjct: 193 VLTMSRYRKEDGKNRYNLD-----FVDRYKDK---LIGEL-VDEETMDK-----FDDRIV 238
Query: 295 RIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM--NNEATAS--VFE 350
RIPMP RSLL+L+G RY+W+H VLR D+ RRVC+AYREFT Y+ NE + S +F
Sbjct: 239 RIPMPRRSLLILYGPPRYQWEHSVLREDITERRVCLAYREFTPMYLEDGNEYSKSEEIFR 298
Query: 351 KAKCFF 356
AK F+
Sbjct: 299 LAKNFW 304
>gi|321472550|gb|EFX83520.1| hypothetical protein DAPPUDRAFT_48015 [Daphnia pulex]
Length = 293
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 197/335 (58%), Gaps = 66/335 (19%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG R+C+ CE+EF N++ F ++++ + +S
Sbjct: 8 CGCKGYRSCLLCEKEF-----NLLPFKDDSF------------------------TAESL 38
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
+ G+ ++YV+CP CN AW G D + HPNH G+ + G+YI++DF
Sbjct: 39 EEKGI------------QSYVFCPACNLAWPGWD-FDPTHHPNHCGEPIEFPGIYIQQDF 85
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
SE E +M D +PW+ S SGRRKQNFGPK NFKK+K++LG F+G P+ ++ DR
Sbjct: 86 FSEEEMVKLMSSFDSIPWEYSVSGRRKQNFGPKTNFKKRKLQLGSFKGVPSFSQLFHDRI 145
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L ++TIEQC+LEY+ E G+SI+ HIDDCW+WGERIVT+NL GDSVLT+T Y+G
Sbjct: 146 ESVPLLKDYRTIEQCSLEYDPERGASIEPHIDDCWVWGERIVTLNLAGDSVLTMTKYHGG 205
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
KYNL D+ L T+ I + + +RIP PARSL+++
Sbjct: 206 VEKYNL-------------------QDVDLELPKTEDIPD-----ICVRIPQPARSLVMI 241
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN 341
+G ARY+W HCVLR D+ RRVCIA REFT ++N
Sbjct: 242 YGSARYQWLHCVLRRDIKERRVCIALREFTKQFLN 276
>gi|332373008|gb|AEE61645.1| unknown [Dendroctonus ponderosae]
Length = 306
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 153/360 (42%), Positives = 205/360 (56%), Gaps = 61/360 (16%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ + CGCKG RTC+ CE FG
Sbjct: 1 MNTTRPCGCKGCRTCLHCEALFGI------------------------------------ 24
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
SFD KE+ + T+ VYCP CN + G + +Y+SHPNH G GV
Sbjct: 25 ----SFD------KEN-IKKGTQNKKVYCPFCNLCYPGKSVDSYKSHPNHEGTPTRYEGV 73
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
YI+ +FLS++E + +M+ +D + WD SQSGRRKQNFGPKCNFK +K++LG FQGFP T+
Sbjct: 74 YIEMNFLSDSEHDQLMQGIDSMAWDLSQSGRRKQNFGPKCNFKARKLRLGKFQGFPKFTR 133
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
F+Q++F +L FQTIEQC+LEY+ GSSID H+DDCW+WGERIVTVNL+ DSVLTL
Sbjct: 134 FVQEKFEDVPILQGFQTIEQCSLEYHPARGSSIDPHVDDCWVWGERIVTVNLLSDSVLTL 193
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
T N +YNL+ YP + + + L +D V + V+ +PMP
Sbjct: 194 T-PNQYPTRYNLD-----LVGTYP---KVVQHSVKLETNRSDQSVEACVEDTVL-VPMPK 243
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356
SLLV++G ARY+++H +LR D+ RRVC+AYREFT Y+ E V + +K FF
Sbjct: 244 NSLLVMYGSARYDYEHRILREDITERRVCLAYREFTPIYLEGGQFYEEGKLVTKMSKEFF 303
>gi|195118911|ref|XP_002003975.1| GI20151 [Drosophila mojavensis]
gi|193914550|gb|EDW13417.1| GI20151 [Drosophila mojavensis]
Length = 304
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 196/344 (56%), Gaps = 64/344 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ F E ++
Sbjct: 1 MNTLRPCGCKGVRTCLTCEQNFNIEKPSV------------------------------- 29
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIG 118
++ F Q E++ YC C + G D A Q+ H +H+ S +
Sbjct: 30 -------------QQQF---QQLESWSYCTKCEAFYPGWDTQAVQAAHSDHNLTDSLALP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+Y++E FLS AE + ++ LD+LPWD SQSGRRKQN+GPK NFKK++++LG FQGFPA
Sbjct: 74 GIYVQEQFLSAAEGKQLIVDLDNLPWDISQSGRRKQNYGPKTNFKKRRLQLGSFQGFPAA 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TRYVQMRFESVPLLRDFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193
Query: 239 TLTYYNGD-SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TL Y+ + KYNL+ PD +S ++ L E+ V+RIP
Sbjct: 194 TLNLYDQKLAGKYNLDL--------VPDYESHLLEPLLTQEQLA------AYKGKVLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN 341
M SL+VL+G ARY+++H VLR DV RRVCIAYREFT Y++
Sbjct: 240 MNNLSLIVLYGPARYQFEHSVLREDVTERRVCIAYREFTPMYID 283
>gi|195473383|ref|XP_002088975.1| GE18871 [Drosophila yakuba]
gi|194175076|gb|EDW88687.1| GE18871 [Drosophila yakuba]
Length = 304
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 152/362 (41%), Positives = 195/362 (53%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ+F
Sbjct: 1 MNTIRPCGCKGVRTCLGCEQDF-------------------------------------- 22
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK--SFDIG 118
I ++E F Q E + YC C+ G D Q H K +
Sbjct: 23 ------QIAKPSLQEQF---QQLEAWSYCIQCDLLQPGWDTTKVQKDHEHHKKDEGLPLP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E+FLSE E ++ LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP
Sbjct: 74 GILVQENFLSEEEGSQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF + +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 134 TEYVQRRFEEVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 193
Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y KYNL D ++ N+L D +V + V+RIP
Sbjct: 194 TLTPYEVQQPGKYNL------------DLVASYENELVAPLLIEDQLV--SFEGKVLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA---SVFEKAKC 354
MP SL+ L+G ARY+++H VLR DV RRVC+AYREFT Y+N E V EK++
Sbjct: 240 MPNLSLIALYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGEDIQQGDPVREKSQI 299
Query: 355 FF 356
F+
Sbjct: 300 FW 301
>gi|195339545|ref|XP_002036380.1| GM17708 [Drosophila sechellia]
gi|194130260|gb|EDW52303.1| GM17708 [Drosophila sechellia]
Length = 389
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 197/362 (54%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ
Sbjct: 86 MNTIRPCGCKGVRTCLSCEQ---------------------------------------- 105
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIG 118
+F I +KE F Q E + YC C+ G D Q H NH + +
Sbjct: 106 ----NFQIAKTSLKEQF---QQLEAWSYCIQCDLLQPGWDTNQVQKDHENHKKDEGLPLP 158
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E+FLS E ++ LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP
Sbjct: 159 GILVQEEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 218
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 219 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 278
Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y KYNL+ SY+ + + ++ D L T V+RIP
Sbjct: 279 TLTPYEVQQQGKYNLDL-VASYEDEL---LAPLLTDDQL----------ATFEGKVLRIP 324
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---EATASVFEKAKC 354
MP SL+VL+G ARY+++H VLR DV RRVC+AYREFT Y+N + V EK++
Sbjct: 325 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGADIQKGDPVREKSQI 384
Query: 355 FF 356
F+
Sbjct: 385 FW 386
>gi|24583140|ref|NP_609314.1| CG4036, isoform A [Drosophila melanogaster]
gi|320544830|ref|NP_001188762.1| CG4036, isoform B [Drosophila melanogaster]
gi|7297561|gb|AAF52815.1| CG4036, isoform A [Drosophila melanogaster]
gi|220944368|gb|ACL84727.1| CG4036-PA [synthetic construct]
gi|220954238|gb|ACL89662.1| CG4036-PA [synthetic construct]
gi|318068393|gb|ADV37012.1| CG4036, isoform B [Drosophila melanogaster]
Length = 304
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 151/362 (41%), Positives = 199/362 (54%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ+F
Sbjct: 1 MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 22
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL-YAYQSHPNHS-GKSFDIG 118
I ++E F Q E + YC C+ +G D + + H NH + +
Sbjct: 23 ------HIAKTSLREQF---QQLEAWSYCIQCDLLQRGWDTNHVQKDHENHKKDEGLPLP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E+FLS E ++ LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP
Sbjct: 74 GILVQEEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 134 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 193
Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y S KYNL D ++ ++L + D + T V+RIP
Sbjct: 194 TLTPYEVQQSGKYNL------------DLVASYEDELLAPLLTDDQLA--TFEGKVLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---EATASVFEKAKC 354
MP SL+VL+G ARY+++H VLR DV RRVC+AYREFT Y+N + V EK++
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGVDIQKGDPVREKSQI 299
Query: 355 FF 356
F+
Sbjct: 300 FW 301
>gi|21483444|gb|AAM52697.1| LD42289p [Drosophila melanogaster]
Length = 310
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 151/362 (41%), Positives = 199/362 (54%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ+F
Sbjct: 7 MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 28
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL-YAYQSHPNHS-GKSFDIG 118
I ++E F Q E + YC C+ +G D + + H NH + +
Sbjct: 29 ------HIAKTSLREQF---QQLEAWSYCIQCDLLQRGWDTNHVQKDHENHKKDEGLPLP 79
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E+FLS E ++ LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP
Sbjct: 80 GILVQEEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 139
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 140 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 199
Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y S KYNL D ++ ++L + D + T V+RIP
Sbjct: 200 TLTPYEVQQSGKYNL------------DLVASYEDELLAPLLTDDQLA--TFEGKVLRIP 245
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---EATASVFEKAKC 354
MP SL+VL+G ARY+++H VLR DV RRVC+AYREFT Y+N + V EK++
Sbjct: 246 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGVDIQKGDPVREKSQI 305
Query: 355 FF 356
F+
Sbjct: 306 FW 307
>gi|307200597|gb|EFN80738.1| Alkylated DNA repair protein alkB-like protein 4 [Harpegnathos
saltator]
Length = 240
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 122/236 (51%), Positives = 169/236 (71%), Gaps = 16/236 (6%)
Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQD 184
DFL++ E E++MK LD LPW SQSGRRKQNFGPKCNFKK+K++LG F GFP T+F+Q
Sbjct: 17 DFLNKDEAEELMKSLDHLPWQASQSGRRKQNFGPKCNFKKRKLRLGTFNGFPKITQFVQT 76
Query: 185 RFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYN 244
+F + +L +F+T+EQC+LEY+ G+SID HIDDCWIWGERIVTVN++GD+VLT+T Y
Sbjct: 77 KFEEVPLLHNFKTVEQCSLEYDPARGASIDPHIDDCWIWGERIVTVNVLGDTVLTMTPYR 136
Query: 245 GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLL 304
G N+YNL+ P+ ++I+ D T +I ++T V+R+PMPARSL+
Sbjct: 137 GPINRYNLDC--------VPNYNNSILKDDVCKNTQTVNIGDDT----VVRLPMPARSLM 184
Query: 305 VLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356
+L+G RY+W+H VLR DV RRVC+AYRE T PY+ N + + + ++A+ F+
Sbjct: 185 ILYGPTRYKWEHSVLRQDVASRRVCLAYRELTPPYLENGENYKEASEILKQAEFFW 240
>gi|312381587|gb|EFR27301.1| hypothetical protein AND_06082 [Anopheles darlingi]
Length = 517
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 132/260 (50%), Positives = 172/260 (66%), Gaps = 20/260 (7%)
Query: 103 AYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNF 162
A + P+ + GGV+I FLSE E D+++ LD++PWD SQSGRRKQNFGPK NF
Sbjct: 263 ASPTAPHGGAPGEEYGGVFIDLHFLSEEEETDLIRTLDEIPWDVSQSGRRKQNFGPKTNF 322
Query: 163 KKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWI 222
KK ++K G F GFPA T+F+Q RF +L+ FQTIEQC+LEY+ E G+SID HIDDCWI
Sbjct: 323 KKTRLKEGQFVGFPAATEFVQRRFESVPLLTTFQTIEQCSLEYDPERGASIDPHIDDCWI 382
Query: 223 WGERIVTVNLIGDSVLTLTYYNGDSN--KYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKS 280
WGERIVTVNL+ DSVLT++ Y G KYNL ++ DQ + L +E+S
Sbjct: 383 WGERIVTVNLLSDSVLTMSLYRGADGKPKYNL---------QFVDQYRDRLLGLLADEQS 433
Query: 281 TDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
N ++RIPMP RSLLVL+G RY+W+H VLR D+ RRVC+AYREFT Y+
Sbjct: 434 LACYENR-----IVRIPMPRRSLLVLYGSPRYQWEHSVLREDISDRRVCLAYREFTPMYL 488
Query: 341 NNEATAS----VFEKAKCFF 356
+ + S +F++AK F+
Sbjct: 489 QDGSEFSQSEEIFKRAKLFW 508
>gi|195053089|ref|XP_001993463.1| GH13824 [Drosophila grimshawi]
gi|193900522|gb|EDV99388.1| GH13824 [Drosophila grimshawi]
Length = 307
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 142/346 (41%), Positives = 196/346 (56%), Gaps = 65/346 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CE+++ E P + +Q D ++Y
Sbjct: 1 MNTLRPCGCKGVRTCLTCEKDYEIEK--------------PSLQQQFQQLDAWSY----- 41
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHSG-KSFDIG 118
C C + + G D A Q+ HP H+G +S +
Sbjct: 42 ----------------------------CTQCKRLFAGWDTKAVQAAHPAHNGAESLPLP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E FLS E +M LD+LPW SQSGRRKQN+GPK NFKK+++++G F+GFP
Sbjct: 74 GILVQEKFLSSTEGVQLMSDLDNLPWVISQSGRRKQNYGPKTNFKKRRLQVGAFEGFPIS 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q+RF +L +FQTIEQC+LEY+ G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TQYVQERFESVPLLHNFQTIEQCSLEYDPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193
Query: 239 TLTYY--NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRI 296
TL Y S KYNL+ P + ++ L LN + T V+RI
Sbjct: 194 TLNLYEQQQQSGKYNLDL--------VPQYEEKLLEPL-LNPQQL-----ATYEGKVLRI 239
Query: 297 PMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
PMP SL+VL+G AR++++H VLR DV RRVCIAYREFT Y++
Sbjct: 240 PMPNLSLIVLYGPARFQFEHAVLREDVSERRVCIAYREFTPMYIDG 285
>gi|195385022|ref|XP_002051207.1| GJ13536 [Drosophila virilis]
gi|194147664|gb|EDW63362.1| GJ13536 [Drosophila virilis]
Length = 306
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/364 (40%), Positives = 203/364 (55%), Gaps = 71/364 (19%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKN--IVEFTEETYIYCPHCNKAWQGSDLYAYQ-S 57
M+ CGCKGVRTC+ CEQ++ + + E++ YC C + + G D A Q +
Sbjct: 1 MNTLRPCGCKGVRTCLTCEQDYQIDKPSLQQQLQQLESWSYCAQCERLYAGWDTQAVQAA 60
Query: 58 HPNHS-GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFD 116
HP+H+ +S + G+ ++E F+T
Sbjct: 61 HPDHNLAESLPLPGILVQEQFIT------------------------------------- 83
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
AE +M LD LPW SQSGRRKQN+GPK NFKK++++LG F+GFP
Sbjct: 84 ------------RAEGAQLMSDLDALPWAISQSGRRKQNYGPKTNFKKRRLQLGAFEGFP 131
Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
A T+F+Q RF +L +FQTIEQC+LEY+ G+SID H+DDCWIWGER+VTVN +GD+
Sbjct: 132 AATRFVQQRFESVPLLRNFQTIEQCSLEYDPSKGASIDPHVDDCWIWGERVVTVNCLGDA 191
Query: 237 VLTLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
VLTL Y +NKYNL+ ++Y K+ ++ L S D + V+R
Sbjct: 192 VLTLNLYEEQQANKYNLD-----LVQQY---KAQLLAPLL----SPDQLA--AYKGKVLR 237
Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY---MNNEATASVFEKA 352
IPMP SL+VL+G ARY+++H VLR DV RRVCIAYREFT Y ++ + V EK+
Sbjct: 238 IPMPNLSLIVLYGPARYQFEHAVLREDVDERRVCIAYREFTPMYIDGLDKQQGDVVREKS 297
Query: 353 KCFF 356
F+
Sbjct: 298 HLFW 301
>gi|194859295|ref|XP_001969349.1| GG10057 [Drosophila erecta]
gi|190661216|gb|EDV58408.1| GG10057 [Drosophila erecta]
Length = 304
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 151/362 (41%), Positives = 192/362 (53%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ+F
Sbjct: 1 MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 22
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK--SFDIG 118
I ++E F Q E + YC C+ G D Q H K +
Sbjct: 23 ------QIAKPSLQEQF---QQLEAWSYCIQCDLLQPGWDTTKVQKDHEHHKKDEGLPLP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E FLS E ++ LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP
Sbjct: 74 GILVQEKFLSAEEGSRLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF + +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFEEVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193
Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT + KYNL+ Y D+ + LNE S V+RIP
Sbjct: 194 TLTPFEVQPPGKYNLD-----LVASYEDELLAPL----LNEDQLAS-----FEGKVLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA---SVFEKAKC 354
MP SL+VL+G ARY+++H VLR DV RRVC+AYREFT Y+N E V EK+
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGEDIQKGDPVREKSHI 299
Query: 355 FF 356
F+
Sbjct: 300 FW 301
>gi|194765625|ref|XP_001964927.1| GF22814 [Drosophila ananassae]
gi|190617537|gb|EDV33061.1| GF22814 [Drosophila ananassae]
Length = 304
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 147/346 (42%), Positives = 190/346 (54%), Gaps = 64/346 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CEQ+F
Sbjct: 1 MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 22
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQ-SHPNHSGKS-FDIG 118
I ++E F + E + YC HC G D Q SH NH ++ +
Sbjct: 23 ------QIAKPSLQEQF---RQLEAWSYCVHCELFQPGWDTKEVQESHSNHDQENGLPLP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ ++E F+S E ++ LDDLPWD SQSGRRKQNFGPK NFKK+K+++G+F GFP+
Sbjct: 74 GILVQEKFISAEEGAQLINDLDDLPWDISQSGRRKQNFGPKTNFKKRKLQVGNFAGFPST 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF +L FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193
Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y + KYNL+ YQK+ + K V+RIP
Sbjct: 194 TLTPYELQQAGKYNLDL-VEEYQKELLAPLLEEEQLVRYKGK-------------VLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNE 343
MP SL+VL+G ARY+++H VLR DV RRVCIAYREFT Y+N E
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVEERRVCIAYREFTPMYINGE 285
>gi|195147010|ref|XP_002014473.1| GL18936 [Drosophila persimilis]
gi|194106426|gb|EDW28469.1| GL18936 [Drosophila persimilis]
Length = 304
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 149/362 (41%), Positives = 194/362 (53%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CE
Sbjct: 1 MNTIRPCGCKGVRTCLSCE----------------------------------------- 19
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNH-SGKSFDIG 118
K F I ++E F Q E + YC C + + G D A Q+ H H K +
Sbjct: 20 ---KDFQIAKPSLQEQF---QQLEPWSYCVQCERLYSGWDTTAVQADHQQHDKAKGLLMP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
GV ++E FLS+ E +M +LD L WD SQSGRRKQNFGPK NFKK+K+++G F GFP
Sbjct: 74 GVLVQEQFLSKDEGSQLMTELDALRWDISQSGRRKQNFGPKTNFKKRKLQVGTFAGFPKS 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF + +L +FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFDEVPLLRNFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193
Query: 239 TLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y + KYNL+ + L E V+RIP
Sbjct: 194 TLTVYEEQQAGKYNLD--------------LVAAYENELLEPLLPEERLSAYKGKVLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM---NNEATASVFEKAKC 354
MP SL+VL+G ARY+++H VLR DV+ RRVCIAYREFT Y+ + E V EK++
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVVERRVCIAYREFTPMYIAGADQEKGDPVREKSQI 299
Query: 355 FF 356
F+
Sbjct: 300 FW 301
>gi|125985173|ref|XP_001356350.1| GA25863 [Drosophila pseudoobscura pseudoobscura]
gi|54644673|gb|EAL33413.1| GA25863 [Drosophila pseudoobscura pseudoobscura]
Length = 304
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 148/362 (40%), Positives = 192/362 (53%), Gaps = 67/362 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CE
Sbjct: 1 MNTIRPCGCKGVRTCLSCE----------------------------------------- 19
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNH--SGKSFDIG 118
K F I ++E F Q E + YC C + + G D A Q+ H K +
Sbjct: 20 ---KDFQIAKPSLQEQF---QQLEPWSYCVQCERLYSGWDTTAVQADHQHHDKAKGLPMP 73
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
GV ++E FLS+ E +M LD L WD SQSGRRKQNFGPK NFKK+K+++G F GFP
Sbjct: 74 GVLVQEQFLSKDEGSQLMADLDALRWDISQSGRRKQNFGPKTNFKKRKLQVGTFAGFPNS 133
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
T+++Q RF + +L +FQTIEQC+LEY G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFDEVPLLRNFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193
Query: 239 TLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
TLT Y + KYNL+ + L E V+RIP
Sbjct: 194 TLTVYEEQQAGKYNLD--------------LVAAYENELLEPLLPEERLSAYKGKVLRIP 239
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM---NNEATASVFEKAKC 354
MP SL+VL+G ARY+++H VLR DV+ RRVCIAYREFT Y+ + E V EK++
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVVERRVCIAYREFTPMYIAGADQEKGDPVREKSQI 299
Query: 355 FF 356
F+
Sbjct: 300 FW 301
>gi|195438547|ref|XP_002067198.1| GK24867 [Drosophila willistoni]
gi|194163283|gb|EDW78184.1| GK24867 [Drosophila willistoni]
Length = 307
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 147/364 (40%), Positives = 194/364 (53%), Gaps = 68/364 (18%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKGVRTC+ CE++F I P K +Q
Sbjct: 1 MNTIRPCGCKGVRTCLTCEEDF--------------QIAKPSLCKQFQ------------ 34
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL----YAYQSHPNHSGKSFD 116
Q E + YC C + + G D +++H + K
Sbjct: 35 --------------------QELEPWTYCIKCQRFYAGWDTDDTDREHENHQHAPDKGLH 74
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
+ G+ ++E FLS E + +M LDD+PW SQSGRRKQN+GPK NFKK+K+++G F GFP
Sbjct: 75 LPGLLVQEQFLSHDEGKTLMAHLDDMPWTISQSGRRKQNYGPKTNFKKRKLQVGTFAGFP 134
Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
T+++Q RF +L FQTIEQC+LEY+ G+SID H+DDCWIWGER+VTVN +GD+
Sbjct: 135 QGTEYVQQRFEDVPLLRDFQTIEQCSLEYDPTKGASIDPHVDDCWIWGERVVTVNCLGDA 194
Query: 237 VLTLTYYNG-DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
VLTL Y+ KYNL D N+L S D + N V+R
Sbjct: 195 VLTLNVYDKQQPGKYNL------------DLVPEYENELLAPMLSDDQLAN--FKGKVLR 240
Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM---NNEATASVFEKA 352
IPMP SL+VL+G RY+++H VLR DV RRVCIAYREFT Y+ + E V E A
Sbjct: 241 IPMPNLSLIVLYGPVRYQFEHSVLREDVTERRVCIAYREFTPMYIHGKDREKGDGVREMA 300
Query: 353 KCFF 356
K F+
Sbjct: 301 KKFW 304
>gi|242018131|ref|XP_002429534.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212514482|gb|EEB16796.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 279
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 109/249 (43%), Positives = 168/249 (67%), Gaps = 4/249 (1%)
Query: 92 CNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGR 151
C +W+ ++ PNH GKS D+ G+++ DFL+ E + ++ ++ WD SQSGR
Sbjct: 15 CLISWENEYDLIEKNCPNHRGKSLDVNGIFVLPDFLTAEEKDFILDGINKEKWDSSQSGR 74
Query: 152 RKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGS 211
RKQN+GPKCNFKK KI++G+F+GFP+ +K I ++F +L F+ IEQCTLEY+ G+
Sbjct: 75 RKQNYGPKCNFKKNKIQVGNFKGFPSYSKIIHEKFKTLPILKDFRVIEQCTLEYDPTRGA 134
Query: 212 SIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271
SID H+DDCWIWGERIVTVN++G++ L+++ + +NK+NLN SY + I
Sbjct: 135 SIDPHVDDCWIWGERIVTVNILGETYLSMSRHK-QNNKFNLNK-VESYPCVLNNNGEFIY 192
Query: 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIA 331
N+ + T ++ + +V ++I MP RS+++L+G ARYEW+H + R+D+ R+C+A
Sbjct: 193 NNNLRPFEFTHNVPQD--ENVELKILMPERSIIILYGSARYEWEHYIPRNDIKNLRICLA 250
Query: 332 YREFTAPYM 340
+REFT PY+
Sbjct: 251 FREFTPPYL 259
>gi|380015391|ref|XP_003691686.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like, partial [Apis florea]
Length = 194
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 107/208 (51%), Positives = 146/208 (70%), Gaps = 18/208 (8%)
Query: 152 RKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGS 211
RKQNFGPKCNFKK+K++LG F GFP T+F+Q +F + +L++FQT+EQCTLEY+ G+
Sbjct: 1 RKQNFGPKCNFKKRKLQLGSFDGFPKSTQFVQQKFSEIPILNNFQTVEQCTLEYDPIRGA 60
Query: 212 SIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271
SID HIDDCWIWGERIVTVN++GDSVLT++ Y+G KYNL S + S+ +
Sbjct: 61 SIDPHIDDCWIWGERIVTVNVMGDSVLTMSSYHGPDTKYNLKSII---------EYSSTI 111
Query: 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIA 331
L + K+ + NE +++R+PMP SL+VL+G ARY+W+HCVLR D+ RR+C+A
Sbjct: 112 EGLKFDIKNKNE--NE---DIIVRLPMPEGSLMVLYGIARYKWEHCVLREDINSRRICLA 166
Query: 332 YREFTAPYM----NNEATASVFEKAKCF 355
YRE T PY+ NNE + ++A F
Sbjct: 167 YRELTPPYLFNSTNNEVVKELLKRASVF 194
>gi|225718066|gb|ACO14879.1| Alkylated DNA repair protein alkB homolog 4 [Caligus clemensi]
Length = 324
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 133/371 (35%), Positives = 189/371 (50%), Gaps = 62/371 (16%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ + CGCK RTC KCE ++ T
Sbjct: 1 MNTNRPCGCKSFRTCYKCEASISPDSNPYASLTR-------------------------- 34
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHSGKSFDIGG 119
+ ++ ++ V+C C + G +L A S HP +SF G
Sbjct: 35 ------------------SLRSSQSGVFCVECESIYPGWELAATCSDHPPPILQSF--SG 74
Query: 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACT 179
+ I +F++ E ED++K LD LPWD S SGRRKQNFGP+ F K+K K G F GFPACT
Sbjct: 75 IQIIPNFINHQEQEDLLKGLDSLPWDLSVSGRRKQNFGPRAKFNKRKAKNGPFSGFPACT 134
Query: 180 KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
+F+Q RF + D L +QT+EQC++EY G+ I+ H+DDCWIWGERIV ++++ SVLT
Sbjct: 135 EFVQKRFEEVDSLEGYQTVEQCSIEYQKNTGAWIEPHVDDCWIWGERIVQLHVLSSSVLT 194
Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKS--------TDSIVNETLSS 291
LT ++KYNL + D + + N S ++ + +
Sbjct: 195 LTKCR-KAHKYNLKDVSLYPKVLSTDGQRVLYNPFKSVSSSSPSEPFEISEEVADGRTPL 253
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-RRVCIAYREFTAPYMN-----NEAT 345
VIRIP+P +SLLV+ G RY+W+H VLR D+ RRV IAYREFT P++ +
Sbjct: 254 DVIRIPLPEQSLLVMSGGPRYQWEHSVLREDIHDERRVIIAYREFTPPFLPGGDKFGDVG 313
Query: 346 ASVFEKAKCFF 356
+V E A+ F+
Sbjct: 314 RAVIENARSFW 324
>gi|351698749|gb|EHB01668.1| Alkylated DNA repair protein alkB-like protein 4 [Heterocephalus
glaber]
Length = 351
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 110/339 (32%), Positives = 175/339 (51%), Gaps = 35/339 (10%)
Query: 7 CGCKGVRTCIKCEQEFGYENK-NIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
CGCKG+RTC+ CE++ + + ++YC A D + G +
Sbjct: 17 CGCKGIRTCLICERQRQVDPPWQLSPKKTHRFLYCVDTGWAVGAED-------SDFEGWA 69
Query: 66 FDIGGVYIKEDFLTSQTK-----ETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
F GV + EDF+T + + ++YC A D + G +F GV
Sbjct: 70 FPFPGVTLIEDFVTREEEAEMKTHRFLYCVDTGWAVGAED-------SDFEGWAFPFPGV 122
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
+ EDF++ E ++++ +D PW SQSGRRKQ++GP+ NF+KQK+K F G P+ ++
Sbjct: 123 TLIEDFVTREEEAEMVRLMDCDPWKLSQSGRRKQDYGPRVNFRKQKLKTAGFLGLPSFSR 182
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
+ R L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V+++
Sbjct: 183 EVVQRMGLYPSLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVMSM 242
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
+ G N ++P S + + +T + V + IP+P
Sbjct: 243 S-QKGPGNLLLSSAPPASPKDSV--------------DSATAPCRSVPCQDVEVAIPVPR 287
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
RSLLVL G AR++W H + R + RRVC +RE +A +
Sbjct: 288 RSLLVLAGAARHQWTHGIHRRHIKARRVCATFRELSAEF 326
>gi|291242331|ref|XP_002741061.1| PREDICTED: alkB, alkylation repair homolog 4-like [Saccoglossus
kowalevskii]
Length = 270
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 173/359 (48%), Gaps = 99/359 (27%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
M+ CGCKG+RTC+ CE+ S +
Sbjct: 1 MNTVKLCGCKGIRTCLVCEK------------------------------------SKID 24
Query: 61 HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
S +S G + K D +Y +C CN AW S+ + HP+H GK GV
Sbjct: 25 LSCRS----GRFEKPD-------ASYSFCWLCNIAWLESN---TEQHPSHQGKFIRFPGV 70
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
+ E+ +SE E E +++ +D PW SQSGRRKQ++GPK NFKK+K+ F G PA +
Sbjct: 71 TLIENVVSEEEEEAIIQAVDATPWKVSQSGRRKQDYGPKVNFKKRKVNSKCFSGLPAFIR 130
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
+ +R Q D L+ FQ +EQC LEY + GSSID H DD W+WGER+VT+NL ++ LT+
Sbjct: 131 PLTERLVQMDGLADFQVVEQCNLEYVPDRGSSIDPHFDDVWLWGERLVTLNLNSETTLTM 190
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
T D + + IP+P
Sbjct: 191 TQKEKD---------------------------------------------ICVSIPLPK 205
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN---NEATAS-VFEKAKCF 355
RS++VL+G ARYEW H + R D+I RR+ + +RE +A +++ NE T S + E A+ +
Sbjct: 206 RSVIVLYGPARYEWMHAIHREDIINRRIAVTFRELSAEFLDGGVNEDTGSRLLEIARTY 264
>gi|68372246|ref|XP_687075.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Danio rerio]
Length = 315
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 109/334 (32%), Positives = 164/334 (49%), Gaps = 95/334 (28%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC++CE + E K+++ Q +DL Y
Sbjct: 53 CGCKGIRTCLRCETD---ETKHLL-----------------QKNDLIHYD---------- 82
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
+I + L S +E P C F+ GV + E+F
Sbjct: 83 -----FIYDPVLKSAVREEEGSTPQC----------------------FEFPGVLLWENF 115
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+SE E +++ ++D W +SQSGRRKQ+FGPK NFKK+++ +G F G PA ++ + R
Sbjct: 116 VSEDEERELVSRMDQDVWRESQSGRRKQDFGPKVNFKKRRVHVGSFSGLPAISRRLLVRM 175
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
LS F+ +EQC L+Y+S GS+ID H+DD W+WGE +VTVNL+ D+VLTL+ G
Sbjct: 176 SDLPQLSSFKPVEQCNLDYDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLDQG- 234
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
D+ E V + + +P RSL+VL
Sbjct: 235 ------------------------WGDMEQGE-------------VRVAVRLPRRSLVVL 257
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
+G+AR+ W+H + R D+ GRRVC +RE +A ++
Sbjct: 258 YGDARHRWKHAIHRKDIHGRRVCSTFRELSAEFL 291
>gi|344289767|ref|XP_003416612.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Loxodonta africana]
Length = 358
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 106/333 (31%), Positives = 165/333 (49%), Gaps = 70/333 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G N +WQ
Sbjct: 71 CGCKGIRTCLICERQRGG-------------------NLSWQ------------------ 93
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
L Q + + YC H G + A +S + G +F GV + EDF
Sbjct: 94 -----------LPPQNPDRFTYCSHT-----GWAVGAEES--DFEGWAFPFPGVTLIEDF 135
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+K+K+K F+G P+ ++ + R
Sbjct: 136 VTRDEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKRKLKAAGFRGLPSFSREVVRRM 195
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++++
Sbjct: 196 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDTWLWGERLVSLNLLSPTVLSMSHEAPG 255
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S SP +V + +S V + +P+P RSLLVL
Sbjct: 256 SLLLCPASP---------GSLEPMVEGVVAFSRSV------LCQDVEVAVPVPRRSLLVL 300
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++WQH + R + RRVC+ +RE +A +
Sbjct: 301 SGAARHQWQHAIHRKHITARRVCVTFRELSAEF 333
>gi|405951338|gb|EKC19259.1| Alkylated DNA repair protein alkB-like protein 4 [Crassostrea
gigas]
Length = 270
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 156/333 (46%), Gaps = 97/333 (29%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+R C+ CE+ VE ET C
Sbjct: 4 CGCKGIRNCLVCEK---------VERQNETEKKC-------------------------- 28
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
+++ + +C CN A++G HPNH G + + G++I+ DF
Sbjct: 29 ---------------SRKIHSFCDLCNMAYEGG---TADDHPNHHGNTLEFPGIFIQRDF 70
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+S E D+++ + P+ +SQSGRRKQ++GPK NFK++KIK F G P ++ + DR
Sbjct: 71 ISPEEESDILEHIYKSPFVESQSGRRKQDYGPKVNFKRKKIKSDVFTGLPDFSRVLYDRM 130
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
D+L F+ +EQC LEY E GSSID H DD W+WGER+VT+NL D++L +T
Sbjct: 131 KSTDILKDFEPVEQCNLEYLPERGSSIDPHFDDFWLWGERLVTLNLGSDTILCMT----- 185
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
YPD + + IP+ SL+V+
Sbjct: 186 -------------SSDYPD--------------------------ISVHIPLFRNSLIVV 206
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
+G AR+ W H + R D+ +R+ I YRE +A +
Sbjct: 207 YGPARHAWMHGIHRCDITEKRLAITYRELSAEF 239
>gi|432901333|ref|XP_004076835.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Oryzias latipes]
Length = 272
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 137/235 (58%), Gaps = 37/235 (15%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
SF G+++ +DF+SE E + ++ +D W+ SQSGRRKQ+FGPK NFKK+K+++G F
Sbjct: 59 SFSFPGIFLWDDFISEEEEKQLINSMDQDAWNPSQSGRRKQDFGPKVNFKKRKVRVGGFS 118
Query: 174 GFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLI 233
G P+ +K + R QE LS FQ +EQC L+YN + GS+ID H DD W+WGER+VT+N++
Sbjct: 119 GLPSLSKELVLRMQQEPCLSGFQPVEQCNLDYNPQRGSAIDPHFDDSWLWGERLVTINML 178
Query: 234 GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV 293
D+ LT++ G + +L L+E+ +
Sbjct: 179 SDTTLTMSLEQG-------------------------LPELGLSEE------------IH 201
Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASV 348
+ + +P R L+V++GEAR+ W+H + R D+ RRVC YRE +A ++ A +
Sbjct: 202 VAVRLPRRCLVVVYGEARHRWKHAIRRQDIHKRRVCSTYRELSAEFLPGGQQAEL 256
>gi|260803607|ref|XP_002596681.1| hypothetical protein BRAFLDRAFT_219277 [Branchiostoma floridae]
gi|229281940|gb|EEN52693.1| hypothetical protein BRAFLDRAFT_219277 [Branchiostoma floridae]
Length = 303
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/338 (34%), Positives = 165/338 (48%), Gaps = 67/338 (19%)
Query: 2 DRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNH 61
D + CGCKG+RTC+ CE E T + P QG +
Sbjct: 7 DFTQICGCKGIRTCLLCE-------------GERTTVKAP------QGHAV--------- 38
Query: 62 SGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVY 121
TY+YC C+KAW+ + Q+ + D GV
Sbjct: 39 -----------------------TYMYCNQCHKAWRHMEGSRCQT------EEMDFPGVL 69
Query: 122 IKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKF 181
+ EDF++ E D++ ++ W SQSGR+KQ+FGPK NFKK+K+KLG F G P +
Sbjct: 70 VVEDFITPEEEADIVTVIEGTEWKVSQSGRKKQDFGPKVNFKKKKLKLGGFTGLPKFSAP 129
Query: 182 IQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLT 241
+ DR D+L F +EQC L+Y+ GS+ID H DD W+WGER+VTVNL+ DSVLT++
Sbjct: 130 LLDRMKSHDLLKDFIPVEQCHLDYDPSRGSAIDPHFDDFWLWGERLVTVNLLADSVLTMS 189
Query: 242 YYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPAR 301
D + + K P K++ ++ L D + IPMP R
Sbjct: 190 CEEKDEVEVTIQQTSNCEHSKKPVTKTSSLDSDILVPSDEDKVA----------IPMPRR 239
Query: 302 SLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
SLLV+ G AR++W H + R D+ RR+ + RE +A +
Sbjct: 240 SLLVVHGNARHKWMHAIHREDITSRRIAVTLRELSAEF 277
>gi|449265929|gb|EMC77056.1| Alkylated DNA repair protein alkB like protein 4, partial [Columba
livia]
Length = 280
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 96/266 (36%), Positives = 153/266 (57%), Gaps = 18/266 (6%)
Query: 81 QTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLD 140
Q ++ + YCP + L H +G +F GV++ E+F+S+ E ++++ +D
Sbjct: 1 QGEDNFTYCP-------ATGLAKGNEHSEFAGWAFPFPGVFLTEEFISKDEESEIVELMD 53
Query: 141 DLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQ 200
W SQSGR+KQ++GPK NFKKQ++K G F G P+ +K I + VL F +EQ
Sbjct: 54 RDDWKPSQSGRKKQDYGPKVNFKKQRLKAGSFTGLPSFSKKIVAQMKACSVLGGFLPVEQ 113
Query: 201 CTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQ 260
C L+Y E GS+ID H DD W+WGER+V++NL+ +VL+++ + DS + P FS +
Sbjct: 114 CNLDYMPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDSIQL---FPTFSKE 170
Query: 261 KKYPDQKSTIVNDLSLNEKSTDSIVNETLSS-------VVIRIPMPARSLLVLWGEARYE 313
++ S + S + VN LS V + I +P RSL+VL+G+ARY+
Sbjct: 171 NGELSPPGSLTQT-SACKNSGEETVNCVLSPRLVPSKEVTVAIHLPRRSLVVLYGDARYK 229
Query: 314 WQHCVLRSDVIGRRVCIAYREFTAPY 339
W+H + R + RR+C+ +RE +A +
Sbjct: 230 WKHAIYRKHIEHRRICVTFRELSAEF 255
>gi|348532550|ref|XP_003453769.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Oreochromis niloticus]
Length = 274
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 137/235 (58%), Gaps = 37/235 (15%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
SF GV++ E+F+SE E ++++ +D W +SQSGRRKQ+FGPK NFKK+K+++G F
Sbjct: 61 SFPFSGVFLWENFISEEEEKELICAMDQDVWKESQSGRRKQDFGPKVNFKKRKVRVGGFS 120
Query: 174 GFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLI 233
G PA + + R +QE L+ FQ +EQC L+Y+ E G++ID H+DD W+WGER+VT+N++
Sbjct: 121 GLPALSHKLVLRMYQESALAGFQPVEQCNLDYHPERGAAIDPHLDDSWLWGERLVTINML 180
Query: 234 GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV 293
D+ LT++ G + +L L V
Sbjct: 181 SDTTLTMSLEQG-------------------------LPELGLT------------GEVH 203
Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASV 348
+ + +P R L+VL+GEAR++W+H + R D+ RRVC YRE +A ++ A +
Sbjct: 204 VAVNLPRRCLVVLYGEARHKWKHAIHRKDIHVRRVCSTYRELSAEFLPGGQQADL 258
>gi|326931240|ref|XP_003211741.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Meleagris gallopavo]
Length = 289
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/263 (35%), Positives = 153/263 (58%), Gaps = 16/263 (6%)
Query: 83 KETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDL 142
++ + YCP + L H + +G +F GV++ E+F+S+ E V++ +D
Sbjct: 12 EDNFTYCP-------ATGLAEGNEHSSFAGWAFPFPGVFLAEEFISKDEEVGVVELMDRD 64
Query: 143 PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCT 202
W SQSGR+KQ++GPK NFKKQ++K G F G P+ +K I + VL F +EQC
Sbjct: 65 EWKPSQSGRKKQDYGPKVNFKKQRLKAGGFTGLPSFSKKIVAQMEACSVLGGFMPVEQCN 124
Query: 203 LEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKK 262
L+Y E GS+ID H DD W+WGER+V++NL+ +VL+++ + DS + P FS +
Sbjct: 125 LDYRPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDSIQL---FPTFSNENG 181
Query: 263 YPDQKSTIVNDLSL---NEKSTDSIVNETL---SSVVIRIPMPARSLLVLWGEARYEWQH 316
+ ++ S ++ + +++ L V + I +P RSL+VL+GEARY+W+H
Sbjct: 182 ELNPPGSLTQTSSCENSGKQGSSCLLSPRLVPSKEVTVAIHLPRRSLVVLYGEARYKWKH 241
Query: 317 CVLRSDVIGRRVCIAYREFTAPY 339
+ R + RRVC+ +RE +A +
Sbjct: 242 AIHRRHIEHRRVCVTFRELSAEF 264
>gi|63102529|gb|AAH95774.1| LOC553475 protein, partial [Danio rerio]
Length = 276
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 107/334 (32%), Positives = 163/334 (48%), Gaps = 94/334 (28%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC++CE + T++ + Q ++L Y
Sbjct: 13 CGCKGIRTCLRCE----------TDGTKQHLL---------QNNELIHYD---------- 43
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
+I + L S +E P C F+ GV + E+F
Sbjct: 44 -----FIYDPVLKSAVREEEGSTPQC----------------------FEFPGVLLWENF 76
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+SE E +++ ++D W +SQSGRRKQ+FGPK NFKK+++ +G F G PA ++ + R
Sbjct: 77 VSEDEERELVSRMDQDVWRESQSGRRKQDFGPKVNFKKRRVHVGSFSGLPAISRRLLVRM 136
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
LS F+ +EQC L+Y+S GS+ID H+DD W+WGE +VTVNL+ D+VLTL+ G
Sbjct: 137 SDLPQLSSFKPVEQCNLDYDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLDQG- 195
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
D+ E V + + +P RSL+VL
Sbjct: 196 ------------------------WGDMEQGE-------------VRVAVRLPRRSLVVL 218
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
+G+AR+ W+H + R D+ GRRVC +RE +A ++
Sbjct: 219 YGDARHRWKHAIHRKDIHGRRVCSTFRELSAEFL 252
>gi|147900037|ref|NP_001084791.1| alkB, alkylation repair homolog 4 [Xenopus laevis]
gi|47124930|gb|AAH70778.1| MGC83815 protein [Xenopus laevis]
Length = 311
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/230 (40%), Positives = 145/230 (63%), Gaps = 9/230 (3%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
F GV++ +DF++E E + +++ +D W SQSGRRKQ+FGPK NFKKQK+++G+F G
Sbjct: 61 FHFPGVHLIKDFVTEDEEQRLVQFMDQGEWRHSQSGRRKQDFGPKVNFKKQKLRVGNFSG 120
Query: 175 FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIG 234
P ++ + +R Q VL F +EQC L+Y+SE GS+ID H DD W+WGER+V++NL+
Sbjct: 121 LPNISQVLWERMKQHAVLEGFLPVEQCNLDYHSERGSAIDPHFDDSWLWGERLVSLNLLS 180
Query: 235 DSVLTLTYYNGDSNKYNLNSPCFSYQKKY----PDQKSTIVNDLSLNEKSTDSIVNET-L 289
D+VLT+T + D + L P + D ++ VN + N S +S +
Sbjct: 181 DTVLTMT--SDDVSSLQL-MPVLQKGAELNMTGSDNNASDVN-IGQNPVSNNSQAKQVPC 236
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
S+V + I +P RSL+VL+G+ARY+W+H + R + RRVC +RE + +
Sbjct: 237 SNVEVAICLPRRSLVVLYGDARYKWKHAIHREHIKHRRVCSTFRELSVEF 286
>gi|301626260|ref|XP_002942310.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Xenopus (Silurana) tropicalis]
Length = 308
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 93/230 (40%), Positives = 137/230 (59%), Gaps = 8/230 (3%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
F GVY+ +DF++E E + ++ +D W SQSGRRKQ+FGPK NFKKQK+++G+F G
Sbjct: 57 FHFPGVYLIQDFVTEDEEQHLVNFMDQEEWRLSQSGRRKQDFGPKVNFKKQKLRVGNFSG 116
Query: 175 FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIG 234
P ++ I +R Q VL F +EQC L+Y+SE GSSID H DD W+WGER+V++NL+
Sbjct: 117 LPNVSQVIWERMKQHTVLEGFLPVEQCNLDYHSERGSSIDPHFDDSWLWGERLVSLNLLS 176
Query: 235 DSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEK-----STDSIVNETL 289
D+V T+T + D + L P + S N +N S +
Sbjct: 177 DTVFTMT--SDDVSSLQL-MPVLQEGSELNSTSSDGNNASDMNHTQNPGYSKSQAKHVPY 233
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
V + I +P RSL+VL+G+ARY+W+H + R ++ RRVC +RE + +
Sbjct: 234 CDVEVAIRLPRRSLVVLYGDARYKWKHAIHRENIEHRRVCSTFRELSKEF 283
>gi|115495653|ref|NP_001069021.1| probable alpha-ketoglutarate-dependent dioxygenase ABH4 [Bos
taurus]
gi|109939932|gb|AAI18458.1| AlkB, alkylation repair homolog 4 (E. coli) [Bos taurus]
Length = 303
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 165/333 (49%), Gaps = 70/333 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 15 CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
SQ ++Y P A + A +S + G +F GV + EDF
Sbjct: 40 -------------SQKTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 80 VTPEEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKRAGFRGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 140 GLYPILEDFRPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +L + + S + V + +P+P RSL+VL
Sbjct: 200 SLLLCLAPSGFPE---------------ALADSAMASSRSVPCQEVEVAVPLPRRSLVVL 244
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++W+H + R + RRV + +RE +A +
Sbjct: 245 RGGARHQWKHAIHRRHIGARRVGVTFRELSAEF 277
>gi|296472939|tpg|DAA15054.1| TPA: alkB, alkylation repair homolog 4 [Bos taurus]
Length = 303
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 166/337 (49%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 15 CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
SQ ++Y P A + A +S + G +F GV + EDF
Sbjct: 40 -------------SQRTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 80 VTPEEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKRAGFRGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 140 GLYPILEDFRPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMS----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS + + S V + +P+P RS
Sbjct: 195 --------------REAPGSLLLCLAPSGFPEALADSAMAPSRSVPCQEVEVAVPLPRRS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
L+VL G AR++W+H + R + RRV + +RE +A +
Sbjct: 241 LVVLRGGARHQWKHAIHRRHIGARRVGVTFRELSAEF 277
>gi|332867922|ref|XP_003318746.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Pan troglodytes]
gi|410206532|gb|JAA00485.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
Length = 302
Score = 180 bits (457), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 105/337 (31%), Positives = 164/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC N W + A +S + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SNTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|149755992|ref|XP_001504522.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Equus caballus]
Length = 302
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 163/333 (48%), Gaps = 70/333 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWQ---------HP------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
Q ++Y + + W + A +S + G +F GV + EDF
Sbjct: 40 -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTVIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 80 VTREEEAEMVQLMDREPWKPSQSGRRKQDYGPKVNFRKQKLKTAGFRGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 140 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +V + +S V + +P+P RSLLVL
Sbjct: 200 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAVPLPCRSLLVL 244
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++W+H + R + RRVC +RE +A +
Sbjct: 245 TGAARHQWKHAIHRRHIQARRVCATFRELSAEF 277
>gi|397465108|ref|XP_003804369.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Pan paniscus]
Length = 302
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW +SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKQSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|410984598|ref|XP_003998614.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Felis catus]
Length = 308
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 105/333 (31%), Positives = 163/333 (48%), Gaps = 70/333 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 21 CGCKGIRTCLICERQRGG-------------------DPPWQ---------HP------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
Q ++Y + + W + A +S + G +F GV + EDF
Sbjct: 46 -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 85
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 86 VTREEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAGFRGLPSFSREVVRRM 145
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 146 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 205
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +V + +S V + IP+P RSLLVL
Sbjct: 206 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAIPLPRRSLLVL 250
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++W+H + R + RRVC +RE +A +
Sbjct: 251 TGAARHQWKHAIHRRHIEARRVCATFRELSAEF 283
>gi|302564965|ref|NP_001181358.1| probable alpha-ketoglutarate-dependent dioxygenase ABH4 [Macaca
mulatta]
gi|355560537|gb|EHH17223.1| hypothetical protein EGK_13569 [Macaca mulatta]
gi|355747566|gb|EHH52063.1| hypothetical protein EGM_12431 [Macaca fascicularis]
Length = 302
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTQEEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 SLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|402863131|ref|XP_003895885.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Papio anubis]
Length = 302
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTQEEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 SLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|440908430|gb|ELR58444.1| Putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
[Bos grunniens mutus]
Length = 299
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 164/335 (48%), Gaps = 78/335 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 11 CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 35
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
SQ ++Y P A + A +S + G +F GV + EDF
Sbjct: 36 -------------SQKTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 75
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 76 VTPEEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKRAGFRGLPSFSREVVRRM 135
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 136 GLYPILEDFRPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMS----- 190
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS + + S V + +P+P RS
Sbjct: 191 --------------REAPGSLLLCLAPSGFPEALADSAMAPSRSVPCQEVEVAVPLPRRS 236
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337
L+VL G AR +W+H + R + RRV + +RE +A
Sbjct: 237 LVVLRGGARRQWKHAIHRRHIGARRVGVTFRELSA 271
>gi|332255155|ref|XP_003276698.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH4 [Nomascus
leucogenys]
Length = 311
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 105/337 (31%), Positives = 165/337 (48%), Gaps = 69/337 (20%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + E C C K ++
Sbjct: 15 CGCKGIRTCLICERQRGGDPP--WELPPAPVDLCFFCRKTYR------------------ 54
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 55 -------------------FMYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 88
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 89 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 148
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 149 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 203
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+P RS
Sbjct: 204 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPTRS 249
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 250 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 286
>gi|297680122|ref|XP_002817855.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Pongo abelii]
Length = 302
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 105/337 (31%), Positives = 164/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
+VYC + W + A +S + G +F GV + EDF
Sbjct: 46 -------------------FVYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRL 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|380795719|gb|AFE69735.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
[Macaca mulatta]
gi|380795721|gb|AFE69736.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
[Macaca mulatta]
gi|380795723|gb|AFE69737.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
[Macaca mulatta]
Length = 301
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 14 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 44
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 45 -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 78
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 79 VTQEEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 138
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 139 SLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 193
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 194 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 239
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 240 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 276
>gi|62896937|dbj|BAD96409.1| hypothetical protein FLJ20013 variant [Homo sapiens]
Length = 302
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 102/337 (30%), Positives = 161/337 (47%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGS-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SDTGWA-----VGTEESDFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P ++ E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAVPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|345801177|ref|XP_850090.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Canis lupus familiaris]
Length = 302
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 164/333 (49%), Gaps = 70/333 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 15 CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
Q ++Y + + W + A +S + G +F GV + EDF
Sbjct: 40 -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
+++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 80 VTQEEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTTGFRGLPSFSRDVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 140 ALYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +V + +S V + +P+P RSLLVL
Sbjct: 200 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAVPLPRRSLLVL 244
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++W+H + R + RRVC +RE +A +
Sbjct: 245 TGAARHQWKHAIHRRHIEARRVCATFRELSAEF 277
>gi|426357351|ref|XP_004046007.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Gorilla gorilla gorilla]
Length = 302
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 164/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWE------------------ 37
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
L ++YC + +W + A +S + G +F GV + EDF
Sbjct: 38 -----------LPPAKTSRFIYC--SDTSWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTESFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|410914972|ref|XP_003970961.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Takifugu rubripes]
Length = 246
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/230 (38%), Positives = 134/230 (58%), Gaps = 42/230 (18%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S SF GV++ E+F+SE E E ++ +D W++SQSGRRKQ+FGPK NFKK++++LG
Sbjct: 35 SAASFPFHGVFLWENFISEEEEEHLISSIDQNLWNESQSGRRKQDFGPKVNFKKRRVRLG 94
Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
F G P ++ + +R QE LS F+ +EQC L+Y+ + GS+ID H+DD W+WGER+VT+
Sbjct: 95 GFNGLPVSSRKLLERMQQEPFLSDFRPVEQCNLDYHPQRGSAIDPHLDDSWLWGERLVTI 154
Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS 290
N++ D+++T +SL E T I
Sbjct: 155 NMLSDTIIT----------------------------------MSLQEAPTGEI------ 174
Query: 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
+ +P P R LLVL+ +AR++W+H V R DV RRVC +RE +A ++
Sbjct: 175 --QVAVPFPRRCLLVLYHDARHKWKHAVYRQDVEDRRVCSTFRELSAEFL 222
>gi|410249988|gb|JAA12961.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
gi|410249990|gb|JAA12962.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
gi|410290248|gb|JAA23724.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
gi|410290250|gb|JAA23725.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
gi|410354469|gb|JAA43838.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
gi|410354471|gb|JAA43839.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
gi|410354473|gb|JAA43840.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
Length = 302
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 164/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|431898192|gb|ELK06887.1| Alkylated DNA repair protein alkB like protein 4 [Pteropus alecto]
Length = 302
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/333 (31%), Positives = 162/333 (48%), Gaps = 70/333 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKGVRTC+ CE++ G + WQ HP
Sbjct: 15 CGCKGVRTCLICERQRG-------------------GDPPWQ---------HP------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
Q ++Y + + W + A +S + G +F GV + EDF
Sbjct: 40 -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLLEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K FQG P+ ++ + R
Sbjct: 80 VTREEETEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTTGFQGLPSYSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 140 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDTWLWGERLVSLNLLSPTVLSMSREAPG 199
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +V + +S V + IP+P RSL VL
Sbjct: 200 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAIPLPRRSLFVL 244
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++W+H + R + RRVC +RE +A +
Sbjct: 245 SGAARHQWKHAIHRRHIEARRVCATFRELSAEF 277
>gi|8923019|ref|NP_060091.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Homo
sapiens]
gi|74734701|sp|Q9NXW9.1|ALKB4_HUMAN RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 4; AltName: Full=Alkylated DNA repair protein
alkB homolog 4
gi|7019832|dbj|BAA90888.1| unnamed protein product [Homo sapiens]
gi|12803947|gb|AAH02820.1| AlkB, alkylation repair homolog 4 (E. coli) [Homo sapiens]
gi|16877714|gb|AAH17096.1| AlkB, alkylation repair homolog 4 (E. coli) [Homo sapiens]
gi|119570634|gb|EAW50249.1| alkB, alkylation repair homolog 4 (E. coli) [Homo sapiens]
gi|312150638|gb|ADQ31831.1| alkB, alkylation repair homolog 4 (E. coli) [synthetic construct]
Length = 302
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 102/337 (30%), Positives = 160/337 (47%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + W+ Y+
Sbjct: 15 CGCKGIRTCLICERQRGS-------------------DPPWELPPAKTYR---------- 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + G +F GV + EDF
Sbjct: 46 -------------------FIYC--SDTGWA-----VGTEESDFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 80 VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ P + E DS++ + S V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277
>gi|334324980|ref|XP_001378736.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Monodelphis domestica]
Length = 367
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 139/230 (60%), Gaps = 9/230 (3%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
+G +F GV + +DF+S E ++++ +D W SQSGRRKQ++GPK NF+KQK+K G
Sbjct: 121 AGWAFPFPGVAMIKDFVSADEETELVRLMDQDDWKLSQSGRRKQDYGPKVNFRKQKLKTG 180
Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
F G P+ ++ I R Q VL F +EQC L+Y+ E GS+ID H+DD W+WGER+V++
Sbjct: 181 GFDGLPSFSREIVHRMGQHPVLERFLPVEQCNLDYHPERGSAIDPHLDDSWLWGERLVSL 240
Query: 231 NLIGDSVLTLTYYNGDSN-KYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
NL+ +VL+++ DSN + L S + P + + + + T S +
Sbjct: 241 NLLSPTVLSMSR---DSNERLQLLSVAQQGTRNSPPN-----DPVPRDPEDTGSRRSVPC 292
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
V + I +PARSLLVL+G ARY+W+H + R + R+C +RE +A +
Sbjct: 293 DQVEVAIHLPARSLLVLFGAARYQWKHAIHRQHIESHRICATFRELSAEF 342
>gi|348568782|ref|XP_003470177.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Cavia porcellus]
Length = 304
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 103/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ + WQ
Sbjct: 17 CGCKGIRTCLICERQL-------------------QVDPPWQ------------------ 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
L+ Q ++Y + + W + A S N G F GV + EDF
Sbjct: 40 -----------LSPQKTHRFLY--NVDTGWA---VGAEDS--NCEGWGFPFPGVTLIEDF 81
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGR+KQ++GP+ NF+KQK+K+ DFQG P+ ++ + R
Sbjct: 82 VTREEEAEMVRLMDLDPWKLSQSGRKKQDYGPRVNFRKQKLKVADFQGLPSFSREVVRRM 141
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++
Sbjct: 142 GLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDTWLWGERLVSLNLLSPTVMSMS----- 196
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
++ PD + + S D+++ + S V + IP+P S
Sbjct: 197 --------------REAPDTLLLSSAPTTGPKDSVDNLITPSWSVLCQDVEVAIPIPRCS 242
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W H + R + RRVC +RE +A +
Sbjct: 243 LLVLTGAARHQWTHAIHRRHITARRVCATFRELSAEF 279
>gi|81905645|sp|Q9D8F1.1|ALKB4_MOUSE RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 4; AltName: Full=Alkylated DNA repair protein
alkB homolog 4
gi|12842046|dbj|BAB25449.1| unnamed protein product [Mus musculus]
Length = 300
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 107/340 (31%), Positives = 164/340 (48%), Gaps = 78/340 (22%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ H + WQ
Sbjct: 13 CGCKGIRTCLICERQR-------------------HRDPPWQ------------------ 35
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAW----QGSDLYAYQSHPNHSGKSFDIGGVYI 122
+ Q K ++YCP + W +GSDL G +F GV +
Sbjct: 36 -----------ICLQKKCCFLYCP--DTGWAAGAEGSDL---------EGWAFPFPGVTL 73
Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
+DF++ E ++++ +D PW SQSGR+KQ++GPK NF+KQK+K+ FQG P ++ +
Sbjct: 74 IQDFVTPEEEAEMVRLMDCDPWKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKV 133
Query: 183 QDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTY 242
R L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++
Sbjct: 134 VQRMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP 193
Query: 243 YNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARS 302
+P P + D SL S E V + I +P RS
Sbjct: 194 ----------EAPGSLLLCSAPSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRS 238
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
LLVL G AR++W H + R + RRVC +RE ++ ++
Sbjct: 239 LLVLTGAARHQWTHAIHRRHIKARRVCATFRELSSEFLPG 278
>gi|148687373|gb|EDL19320.1| mCG1878, isoform CRA_b [Mus musculus]
Length = 304
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 107/340 (31%), Positives = 164/340 (48%), Gaps = 78/340 (22%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ H + WQ
Sbjct: 17 CGCKGIRTCLICERQR-------------------HRDPPWQ------------------ 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAW----QGSDLYAYQSHPNHSGKSFDIGGVYI 122
+ Q K ++YCP + W +GSDL G +F GV +
Sbjct: 40 -----------ICLQKKCCFLYCP--DTGWAAGAEGSDL---------EGWAFPFPGVTL 77
Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
+DF++ E ++++ +D PW SQSGR+KQ++GPK NF+KQK+K+ FQG P ++ +
Sbjct: 78 IQDFVTPEEEAEMVRLMDCDPWKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKV 137
Query: 183 QDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTY 242
R L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++
Sbjct: 138 VQRMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP 197
Query: 243 YNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARS 302
+P P + D SL S E V + I +P RS
Sbjct: 198 ----------EAPGSLLLCSAPSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRS 242
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
LLVL G AR++W H + R + RRVC +RE ++ ++
Sbjct: 243 LLVLTGAARHQWTHAIHRRHIKARRVCATFRELSSEFLPG 282
>gi|239791449|dbj|BAH72189.1| ACYPI008664 [Acyrthosiphon pisum]
Length = 174
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 122/194 (62%), Gaps = 26/194 (13%)
Query: 169 LGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV 228
+G+F GFP TKF+QDRF + ++ ++ TIEQC+LEY E G+SID H+DDCWIWGERI+
Sbjct: 1 MGNFNGFPLSTKFVQDRFDKIPIMKNYFTIEQCSLEYRPEMGASIDPHVDDCWIWGERII 60
Query: 229 TVNLIGDSVLTLTYYNGD-SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNE 287
T++L D+VLTLT +N SN+YN+ D N K VN
Sbjct: 61 TLSLQSDTVLTLTPHNIKYSNQYNI--------------------DYIPNVKCLPIPVNR 100
Query: 288 TLSSV-VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---- 342
+ V+RI MP RSLLVL+G+ RY W+HC+LR D+ RRVCIAYREFT PY+
Sbjct: 101 VNYPIDVVRILMPRRSLLVLYGKPRYLWEHCILREDIHERRVCIAYREFTPPYLKGGEKY 160
Query: 343 EATASVFEKAKCFF 356
+ + +KAKCF+
Sbjct: 161 TESCDILQKAKCFW 174
>gi|449480049|ref|XP_002192991.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Taeniopygia guttata]
Length = 406
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 97/264 (36%), Positives = 153/264 (57%), Gaps = 20/264 (7%)
Query: 83 KETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDL 142
++ + YCP + L H +G +F GV++ E+F+SE E ++++ +D
Sbjct: 131 EDNFTYCP-------ATGLAKGNEHSEFAGWAFPFPGVFLVEEFISEDEECEIVELMDRD 183
Query: 143 PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCT 202
W SQSGR+KQ++GPK NFKKQ++K G F G P+ ++ I + VLS F +EQC
Sbjct: 184 DWKPSQSGRKKQDYGPKVNFKKQRLKAGSFTGLPSFSRKIVAQMKACAVLSGFLPVEQCN 243
Query: 203 LEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKK 262
L+Y+ E GS+ID H DD W+WGER+V++NL+ +VL+++ + D+ + P S ++
Sbjct: 244 LDYSPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDTIQLF---PISSKEEL 300
Query: 263 YPD----QKSTIVNDLSLNEKSTDSIVNETL---SSVVIRIPMPARSLLVLWGEARYEWQ 315
P Q S N E+ T ++ L V + I +P RSL+VL G+ARY+W+
Sbjct: 301 SPPSPFMQTSACRNS---GEEGTQCFLSPRLVPGKEVSVAILLPQRSLVVLQGDARYKWK 357
Query: 316 HCVLRSDVIGRRVCIAYREFTAPY 339
H + R + RRVCI +RE +A +
Sbjct: 358 HGIHRRHIEHRRVCITFRELSAEF 381
>gi|157786898|ref|NP_001099390.1| alkylated DNA repair protein alkB homolog 4 [Rattus norvegicus]
gi|149062997|gb|EDM13320.1| alkB, alkylation repair homolog 4 (E. coli) (predicted), isoform
CRA_a [Rattus norvegicus]
gi|183986288|gb|AAI66568.1| AlkB, alkylation repair homolog 4 (E. coli) [Rattus norvegicus]
Length = 301
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 104/336 (30%), Positives = 160/336 (47%), Gaps = 74/336 (22%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE+ + WQ
Sbjct: 14 CGCKGIRTCLICERRL-------------------QRDPPWQ------------------ 36
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAW--QGSDLYAYQSHPNHSGKSFDIGGVYIKE 124
++ Q K ++YCP A +GSD G +F GV + +
Sbjct: 37 -----------ISLQKKCCFLYCPDTGWAMGAEGSDF---------EGWAFPFPGVTLTQ 76
Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQD 184
DF++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K+ FQG P ++ +
Sbjct: 77 DFVTPEEEAEMVRLMDCDPWKPSQSGRRKQDYGPKVNFRKQKLKMAGFQGLPGFSQKVVQ 136
Query: 185 RFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYN 244
R L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++
Sbjct: 137 RMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDSWLWGERLVSLNLLSATVVSMSR-- 194
Query: 245 GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLL 304
++ L P S V + S + V + I +P RSLL
Sbjct: 195 -EAPGSLLLCPVPS------------VGPDAFEGNSVAPSRSVPCQEVEVAISVPRRSLL 241
Query: 305 VLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
VL G AR++W H + R + RRVC +RE + ++
Sbjct: 242 VLTGAARHQWTHAIHRRHIKARRVCATFRELASEFL 277
>gi|395842978|ref|XP_003794283.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Otolemur garnettii]
Length = 307
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 163/337 (48%), Gaps = 78/337 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CEQ+ + WQ HP GK+
Sbjct: 20 CGCKGIRTCLICEQQRSS-------------------DPPWQ---------HP--LGKTH 49
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
++YC + W + A +S + G +F GV + EDF
Sbjct: 50 H------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMVIEDF 84
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E +++ +D PW SQSGRRKQ++GPK NF+KQK+K FQG P+ ++ + R
Sbjct: 85 VTREEEAEMVHLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAGFQGLPSFSREVVRRM 144
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
L FQ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 145 GLYPGLEGFQPVEQCNLDYCPEWGSAIDPHLDDTWLWGERLVSLNLLSPTVLSMC----- 199
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
+K P + E DS++ + S V + I +P RS
Sbjct: 200 --------------RKAPGSLLLCSAPSAAPEVLVDSVIAPSRSVLCQEVEVAISLPRRS 245
Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
LLVL G AR++W+H + R + RRVC +RE +A +
Sbjct: 246 LLVLTGAARHQWKHAIHRRHIKARRVCATFRELSAEF 282
>gi|311251081|ref|XP_003124433.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Sus scrofa]
gi|350581471|ref|XP_003481043.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Sus scrofa]
Length = 302
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 160/331 (48%), Gaps = 70/331 (21%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ S
Sbjct: 15 CGCKGIRTCLICERQRG-------------------GDPPWQHS---------------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
Q ++Y + + W + A +S + G +F GV + EDF
Sbjct: 40 -------------PQKTHRFIY--YTDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 80 VTREEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTASFRGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
VL F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL+++
Sbjct: 140 GLYPVLEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +L E + + V + +P+P RSLLVL
Sbjct: 200 SLLLCLAPSGFPE---------------ALVEGAVAPSRSVLCQEVEVAVPLPRRSLLVL 244
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337
G AR++W+H + R + RRV +RE +A
Sbjct: 245 TGAARHQWKHAIHRRHIEARRVSATFRELSA 275
>gi|395536498|ref|XP_003770252.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4,
partial [Sarcophilus harrisii]
Length = 269
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 140/230 (60%), Gaps = 9/230 (3%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
+G +F GV + DF+S E ++++ +D W SQSGRRKQ++GP+ NF+KQK+K G
Sbjct: 23 AGWAFPFPGVTLIRDFVSVEEEAELVRLMDQDDWKLSQSGRRKQDYGPRVNFRKQKLKPG 82
Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
F G P+ ++ I R +L F +EQC L+Y+ E GS+ID H+DD W+WGER+V++
Sbjct: 83 SFDGLPSFSREIVQRMGTYPILERFLPVEQCNLDYHPERGSAIDPHLDDSWLWGERLVSL 142
Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKS-TDSIVNETL 289
NL+ +VL+++ +G+ L + Q+ +S+ ND + + S + S + +
Sbjct: 143 NLLSPTVLSMSQDSGE----RLQLLSVAQQRT----RSSGPNDPEMGDLSGSPSTRSVSC 194
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
V + I +PARSLLVL G ARY+W+H + R + R+C +RE +A +
Sbjct: 195 DQVEVAIHLPARSLLVLSGAARYQWKHAIHRQHIESHRICATFRELSAEF 244
>gi|198434600|ref|XP_002128162.1| PREDICTED: similar to alkB, alkylation repair homolog 4 [Ciona
intestinalis]
Length = 303
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/237 (39%), Positives = 140/237 (59%), Gaps = 15/237 (6%)
Query: 105 QSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKK 164
Q PN F GV + DF+SE E +D++ ++ W SQSGRRKQ+FGPK NFK+
Sbjct: 54 QLKPNGEFVGFSFPGVAVFPDFISENEEKDLVNAINSGQWKDSQSGRRKQDFGPKVNFKR 113
Query: 165 QKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWG 224
QK+KL F G P +K I +R L F+T+EQC LEY+ GSSID H DD W+WG
Sbjct: 114 QKVKLASFTGLPKYSKCIDERIKTLKGLESFETVEQCNLEYDPNRGSSIDPHFDDEWLWG 173
Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQK--STIVNDLSLNEKSTD 282
ER++++NL+ DS T+T DS + L+ K PD + T V + +++ D
Sbjct: 174 ERLLSLNLLADSWFTMT--TDDSTECQLSVI------KTPDVQISRTGVKQIKMDK---D 222
Query: 283 SIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
I + ++ ++IP+ RSL++L G+AR++W+H + R D+ +R+C RE + +
Sbjct: 223 DIFH--MNDFQVKIPLKRRSLVMLSGDARFKWKHSIQRQDIKSKRMCCTLRELSPEF 277
>gi|355668036|gb|AER94059.1| alkB, alkylation repair-like protein 4 [Mustela putorius furo]
Length = 261
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 130/228 (57%), Gaps = 15/228 (6%)
Query: 112 GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
G +F GV + EDF++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K
Sbjct: 24 GWAFPFPGVTVIEDFVTREEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAG 83
Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
F G P+ ++ + R +L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++N
Sbjct: 84 FSGLPSFSRDVVRRMGLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLN 143
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
L+ +VL+++ S L F +V + +S
Sbjct: 144 LLSPTVLSMSREAPGSLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQE 188
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
V + +P+P RSLLVL G AR++W+H + R + RRVC +RE +A +
Sbjct: 189 VEVAVPLPRRSLLVLSGAARHQWKHAIHRRHIEARRVCATFRELSAEF 236
>gi|213515534|ref|NP_001134079.1| Alkylated DNA repair protein alkB homolog 4 [Salmo salar]
gi|209730564|gb|ACI66151.1| Alkylated DNA repair protein alkB homolog 4 [Salmo salar]
Length = 231
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/236 (38%), Positives = 136/236 (57%), Gaps = 40/236 (16%)
Query: 109 NHSGK--SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQK 166
N+ G+ SF G+++ +F+SE E +++ +D W +SQSGRRKQ+FGPK NFKK K
Sbjct: 12 NYGGQQLSFPFPGIFLWNNFISEEEESELITDMDLNAWRESQSGRRKQDFGPKVNFKKHK 71
Query: 167 IKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGER 226
++LG F G P ++ + R QE VL+ FQ +EQC L+Y+ + GS+ID H+DD W+WGER
Sbjct: 72 VRLGGFCGLPPISRKLVLRMSQEPVLAGFQPVEQCNLDYHPQRGSAIDPHLDDSWLWGER 131
Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
+VTVNL+ D+ LT++ G + +L E
Sbjct: 132 LVTVNLLSDTTLTMSLEEG-------------------------LTELGQGE-------- 158
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
V + + +P RSL+V++ EAR+ W+H + R D+ RRVC YRE +A +M+
Sbjct: 159 -----VRVSVHLPCRSLVVVYSEARHRWKHAIHRHDIHERRVCSTYRELSAEFMSG 209
>gi|221106579|ref|XP_002170457.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Hydra magnipapillata]
Length = 272
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 94/288 (32%), Positives = 141/288 (48%), Gaps = 53/288 (18%)
Query: 76 DFLTSQTKETYVYCPHCNKAWQGS--------DLYAYQSHPNHSGKSFDIGGVYIKEDFL 127
DF+ + K+ CP C ++G+ ++ H H+ + I V I +F+
Sbjct: 24 DFIQNSVKDICWLCPECEVVFEGNVNEILLKDNINWCSLHLQHNISNLHIDKVIIINNFV 83
Query: 128 SEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH 187
SE E ++ +++ PW SQSGRRKQ+FGPK NFK++K+K F GFP +KF+ ++
Sbjct: 84 SETEESYLLTEINKDPWKMSQSGRRKQDFGPKVNFKRKKLKTAVFTGFPGYSKFVVEKMR 143
Query: 188 QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
Q + L F +E C LEY+ E GSSID HIDD W+WGE++VT+NL ++LTLT
Sbjct: 144 QVESLKDFFPVELCNLEYSPERGSSIDPHIDDTWLWGEQLVTLNLNSATILTLT------ 197
Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
TL I+IPM RSL+++
Sbjct: 198 ---------------------------------------STLFQQEIQIPMMPRSLVIIE 218
Query: 308 GEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASVFEKAKCF 355
ARY W H + SD+ RV + +R + P++ + + AK F
Sbjct: 219 KNARYNWMHGIKSSDITDLRVAMTFRNLSQPFVKDNIGQQILHIAKSF 266
>gi|148687372|gb|EDL19319.1| mCG1878, isoform CRA_a [Mus musculus]
Length = 261
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 92/257 (35%), Positives = 142/257 (55%), Gaps = 26/257 (10%)
Query: 86 YVYCPHCNKAW--QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLP 143
++YCP A +GSDL G +F GV + +DF++ E ++++ +D P
Sbjct: 5 FLYCPDTGWAAGAEGSDL---------EGWAFPFPGVTLIQDFVTPEEEAEMVRLMDCDP 55
Query: 144 WDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTL 203
W SQSGR+KQ++GPK NF+KQK+K+ FQG P ++ + R L FQ +EQC L
Sbjct: 56 WKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKVVQRMGLYPGLEDFQPVEQCNL 115
Query: 204 EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKY 263
+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++ +P
Sbjct: 116 DYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP----------EAPGSLLLCSA 165
Query: 264 PDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDV 323
P + D SL S E V + I +P RSLLVL G AR++W H + R +
Sbjct: 166 PSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRSLLVLTGAARHQWTHAIHRRHI 220
Query: 324 IGRRVCIAYREFTAPYM 340
RRVC +RE ++ ++
Sbjct: 221 KARRVCATFRELSSEFL 237
>gi|296192301|ref|XP_002744023.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Callithrix jacchus]
Length = 306
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 132/232 (56%), Gaps = 23/232 (9%)
Query: 112 GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
G +F GV + EDF++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K
Sbjct: 69 GWAFPFPGVMLIEDFVTREEEAELVRLMDCDPWKLSQSGRRKQDYGPKVNFRKQKLKTEG 128
Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
F G P+ ++ + R L F+ +EQC L+Y E GSSID H+DD W+WGER+V++N
Sbjct: 129 FCGLPSFSREVVRRMGLYPGLEGFRPVEQCNLDYCPERGSSIDPHLDDAWLWGERLVSLN 188
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS- 290
L+ +VL++ ++ P + E DS++ + S
Sbjct: 189 LLSPTVLSMC-------------------REAPGSLLLCSAPSAAPEVLVDSVIAPSRSV 229
Query: 291 ---SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
V + +P+P RSLLVL G AR++W+H + R V RRVC+ +RE +A +
Sbjct: 230 LCQEVEVAVPLPRRSLLVLRGAARHQWKHAIHRGHVEARRVCVTFRELSAEF 281
>gi|240995541|ref|XP_002404613.1| conserved hypothetical protein [Ixodes scapularis]
gi|215491615|gb|EEC01256.1| conserved hypothetical protein [Ixodes scapularis]
Length = 319
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 147/281 (52%), Gaps = 23/281 (8%)
Query: 81 QTKETYVYCPHCNKAWQG-SDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQL 139
+ + +V+CP C W G P S + G+ + E+FL+ E + +
Sbjct: 31 KVRHPFVFCPKCCLCWPGWLSTDTGGGKPCGSVPPLRVPGLEVHEEFLTPDEETRLAADI 90
Query: 140 DDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIE 199
D W SQSGR KQ++GPK NFKK+++K F+G PA I+++ + + +S F+ +E
Sbjct: 91 DTANWAGSQSGRLKQDYGPKVNFKKKRVKSDGFRGLPAYYTRIEEKMRENETVSDFEAVE 150
Query: 200 QCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLT-------------YYNGD 246
C L+Y+ + GS ID H DD W+WG R+VT NL+ D+VLTL+ +GD
Sbjct: 151 LCNLDYHPKRGSCIDPHYDDWWLWGPRLVTFNLLSDTVLTLSRPEEHWMLTDEDLIESGD 210
Query: 247 SNKYNLNSPCFSYQKKY-PDQKSTIVNDLSLNEKSTDSIVNETLSSV-VIRIPMPARSLL 304
++ + ++ PD S ++ DSI + V+++ P RS+L
Sbjct: 211 RGADDVERLAAAAPLQHGPDCG-------SWSDAFKDSISTRVPPRLCVVQVSCPRRSML 263
Query: 305 VLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEAT 345
VLW +ARY+W H +LR DV RRV + RE + +++ A+
Sbjct: 264 VLWSDARYKWHHSILREDVKSRRVAMTVRELSEEFLDGGAS 304
>gi|149062998|gb|EDM13321.1| alkB, alkylation repair homolog 4 (E. coli) (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 247
Score = 167 bits (423), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 130/229 (56%), Gaps = 15/229 (6%)
Query: 112 GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
G +F GV + +DF++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K+
Sbjct: 10 GWAFPFPGVTLTQDFVTPEEEAEMVRLMDCDPWKPSQSGRRKQDYGPKVNFRKQKLKMAG 69
Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
FQG P ++ + R L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++N
Sbjct: 70 FQGLPGFSQKVVQRMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDSWLWGERLVSLN 129
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
L+ +V++++ ++ L P S V + S +
Sbjct: 130 LLSATVVSMSR---EAPGSLLLCPVPS------------VGPDAFEGNSVAPSRSVPCQE 174
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
V + I +P RSLLVL G AR++W H + R + RRVC +RE + ++
Sbjct: 175 VEVAISVPRRSLLVLTGAARHQWTHAIHRRHIKARRVCATFRELASEFL 223
>gi|10438837|dbj|BAB15358.1| unnamed protein product [Homo sapiens]
Length = 229
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 128/223 (57%), Gaps = 23/223 (10%)
Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
+ EDF++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++
Sbjct: 1 MLIEDFVTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSR 60
Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
+ R L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 61 EVVRRMGLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSM 120
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRI 296
++ P + E DS++ + S V + I
Sbjct: 121 C-------------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAI 161
Query: 297 PMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
P+PARSLLVL G AR++W+H + R + RRVC+ +RE +A +
Sbjct: 162 PLPARSLLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 204
>gi|363741354|ref|XP_415758.3| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Gallus gallus]
Length = 317
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 158/266 (59%), Gaps = 16/266 (6%)
Query: 80 SQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQL 139
+Q + + YCP A +G++ H +G +F GV++ E+F+SE E ++++ +
Sbjct: 37 TQGDDNFTYCPSTGLA-EGNE------HTPFAGWAFPFPGVFLAEEFVSEEEEAEMVELM 89
Query: 140 DDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIE 199
D W SQSGR+KQ++GPK NFKKQ++K G F G P+ +K I + VL F +E
Sbjct: 90 DRDEWKPSQSGRKKQDYGPKVNFKKQRLKAGGFTGLPSFSKKIVAQMEACSVLGGFLPVE 149
Query: 200 QCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSY 259
QC L+Y E GS+ID H DD W+WGER+V++NL+ +VL+++ + D+ + P F
Sbjct: 150 QCNLDYRPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDTIQL---FPTFGN 206
Query: 260 QKKYPDQKSTIVNDLSL---NEKSTDSIVNETL---SSVVIRIPMPARSLLVLWGEARYE 313
+ + ++++ + ++ + +++ L V + I +P RSL+VL+G+ARY+
Sbjct: 207 ENGELNPRASLTQTSACENSGKQGSSCLLSPRLVPSKEVTVAIHLPRRSLVVLYGDARYK 266
Query: 314 WQHCVLRSDVIGRRVCIAYREFTAPY 339
W+H + R + RRVC+ +RE +A +
Sbjct: 267 WKHAIYRRHIEHRRVCVTFRELSAEF 292
>gi|226479122|emb|CAX73056.1| Alkylated repair protein alkB homolog 4 [Schistosoma japonicum]
Length = 305
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 104/338 (30%), Positives = 164/338 (48%), Gaps = 87/338 (25%)
Query: 7 CGCKGVRTCIKCEQE--FGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGK 64
CGCKG+R+C C + E + + EFT +CP C+ A G++
Sbjct: 16 CGCKGIRSCAICRNDKFPSDETQQLFEFT-----FCPFCDMAVVGAN------------- 57
Query: 65 SFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKE 124
+ LT D++A+QS F + I +
Sbjct: 58 -----------NILT--------------------DIHAHQS-------CFPFLNIEIVQ 79
Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQD 184
+F+ E+E +++++D W SQSGRRKQ++GPK NFK+QK++LG+F G PA +KF+
Sbjct: 80 NFIDESEEAILIEEIDKQTWVLSQSGRRKQDYGPKVNFKRQKVQLGNFTGLPAYSKFLIT 139
Query: 185 RFHQE-----DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
R+++ + F +E C LEY S+ G+ I H DD W+WG+R+VTVNL G + LT
Sbjct: 140 RYNERIKNHTSLHPEFLPVELCNLEYKSDRGACIVPHYDDSWLWGDRLVTVNLCGSTYLT 199
Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDS-IVNETLSSVVIRIPM 298
T D +V+ + + S I + + + +R+P+
Sbjct: 200 FTLPTDD-----------------------VVDGVKHEFQRIHSCIPSVSRGTFCVRVPL 236
Query: 299 PARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
RSL+V+ G ARY WQH + RSD+ RR+ + +RE +
Sbjct: 237 DRRSLVVVSGPARYIWQHEIRRSDIPTRRIAMTFRELS 274
>gi|443717035|gb|ELU08273.1| hypothetical protein CAPTEDRAFT_150134 [Capitella teleta]
Length = 318
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 102/348 (29%), Positives = 159/348 (45%), Gaps = 90/348 (25%)
Query: 4 SNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSG 63
S C CKG+R+C+ CE++ +++ +W
Sbjct: 22 SKPCACKGIRSCLLCERDSPSTPRSV------------SVTSSW---------------- 53
Query: 64 KSFDIGGVYIKEDFLTSQTKETYVYCPHCNKA----WQGSDLYAYQSHPNH-------SG 112
S D V +++ TK+ YVYC C +A W + + + NH S
Sbjct: 54 -SADTHDVKDRKE----ATKQIYVYCHRCRRAHIAPWTATP--SLKDVINHAELNCSPSD 106
Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
+ G+++ E+F+S E +++ +++ W SQSGRRKQ++GPK NFK++K+KL F
Sbjct: 107 SDLPLQGIHLFENFISADEEKELCDRINCTAWVVSQSGRRKQDYGPKVNFKRKKVKLASF 166
Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNL 232
G P+ ++ R Q LS F +E C LEY+ E GS+ID H DD W+WGER++T+N+
Sbjct: 167 SGLPSYSEPFIQRMLQLPQLSDFTPVELCNLEYSRERGSAIDAHFDDFWLWGERLLTINM 226
Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
+ D+V T+T NE L
Sbjct: 227 VSDTVYTMT--------------------------------------------NEGLPRT 242
Query: 293 VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
I I P + +V+ GEARYEW+H + R+ V RR+ RE T ++
Sbjct: 243 EILIHFPRFAFIVMMGEARYEWKHAIQRTHVAQRRMATTIRELTPEFL 290
>gi|156402493|ref|XP_001639625.1| predicted protein [Nematostella vectensis]
gi|156226754|gb|EDO47562.1| predicted protein [Nematostella vectensis]
Length = 267
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 103/339 (30%), Positives = 156/339 (46%), Gaps = 101/339 (29%)
Query: 4 SNFCGCKGVRTCIKCEQEFGY-ENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHS 62
S CGC G+R+C+ C+ ++ V+ + Y++C C++
Sbjct: 2 STKCGCTGIRSCLFCKDNTKTSQSTTSVDEAKTKYLFCHLCSQ----------------- 44
Query: 63 GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYI 122
+ +GG C H + D Y G S D G+ +
Sbjct: 45 --TLPLGGT-----------------CSH-----ELGDCGRY-------GPSLD--GITL 71
Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
EDF+S+ E +++ +D+ W SQSGRRKQ++GP+ NFKK+K+K+ F G PA ++F+
Sbjct: 72 IEDFVSQREEARIVQVIDETVWKPSQSGRRKQDYGPQVNFKKKKVKMSHFNGLPAFSEFL 131
Query: 183 QDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
R + +DV L F +E C LEY+ GSSID H DD W+WGER+VT+NL+ + LT+
Sbjct: 132 VRRMN-DDVPGLKDFVPVELCNLEYDEARGSSIDAHFDDFWLWGERLVTLNLLSATRLTM 190
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
T + Y I +PMP
Sbjct: 191 T-----KDTYE------------------------------------------ISVPMPR 203
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
RSL+++ G AR+ WQH V R D+ GRR+ I RE + +
Sbjct: 204 RSLIIVSGAARHLWQHAVKREDISGRRIAITLRELSEEF 242
>gi|391346930|ref|XP_003747718.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Metaseiulus occidentalis]
Length = 313
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 153/303 (50%), Gaps = 50/303 (16%)
Query: 79 TSQTKETYVYCPHCNKAW----QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETED 134
S+ +V+CP C Q +L A SF + G+ + E+ ++E +
Sbjct: 28 VSRIGRAFVFCPKCKLCIELDPQNGELGA---------SSFRLEGIEVAENAITEDHEKF 78
Query: 135 VMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH 194
++ ++ PW SQSGR KQ+FGPK NFKK+K+K F GFPA +Q F + L+
Sbjct: 79 LVDAIESAPWKDSQSGRLKQDFGPKVNFKKRKVKCDSFLGFPAYMDVVQMSFSRIPGLND 138
Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL-------------T 241
F +E C L+Y+ E GS ID H DD W+WGER+VT+NL+G +VLT T
Sbjct: 139 FVPVELCNLKYSPERGSCIDPHFDDSWLWGERLVTLNLLGQTVLTFSQAQDFDDLTQKQT 198
Query: 242 YYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSL----NEKSTDSI--VNETLSSVVIR 295
+Y+G S Y L SP +++ L +E+ ++++ N+ V+
Sbjct: 199 HYDGASAVYAL-SP------------EALMDALQAHGCGSEQGSEALGTRNKCQPMDVVH 245
Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN-----NEATASVFE 350
+ +P RSL+ + AR EW H +LR V R+ + RE + +++ +E + E
Sbjct: 246 VLLPRRSLVTVARAARSEWHHSILREHVTETRLAMTCRELSETFLDVNNAESEQGKILLE 305
Query: 351 KAK 353
KA+
Sbjct: 306 KAR 308
>gi|344245743|gb|EGW01847.1| Alkylated DNA repair protein alkB-like 4 [Cricetulus griseus]
Length = 224
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 125/220 (56%), Gaps = 25/220 (11%)
Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
++ E +++ +D PW SQSGRRKQ++GPK NF+KQK+K+ FQG P+ ++ + R
Sbjct: 1 MVTPEEEAEMVCMMDSDPWKLSQSGRRKQDYGPKVNFRKQKLKMAGFQGLPSFSQKVVQR 60
Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
L FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++
Sbjct: 61 MGVYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSCEAP 120
Query: 246 DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD-SIVNETLS----SVVIRIPMPA 300
S + + LS+ D SIV + S V + I +P
Sbjct: 121 GS--------------------LLLCSALSIGPDVFDGSIVAPSRSVPCQEVEVAICVPC 160
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
RSLLVL G AR++W H + R ++ RRVC +RE + ++
Sbjct: 161 RSLLVLTGAARHQWTHAIHRRHIVARRVCATFRELSNEFL 200
>gi|301780878|ref|XP_002925858.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Ailuropoda melanoleuca]
gi|281337677|gb|EFB13261.1| hypothetical protein PANDA_015429 [Ailuropoda melanoleuca]
Length = 294
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 156/333 (46%), Gaps = 78/333 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ HP
Sbjct: 15 CGCKGIRTCLICERQRGG-------------------DPPWQ---------HP------- 39
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
Q ++Y + + W + A +S + G +F GV + EDF
Sbjct: 40 -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F+G P+ ++ + R
Sbjct: 80 VTRDEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAGFRGLPSFSRDVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
+L F+ +EQC L+Y E GS+ID H+ +V++NL+ +VL+++
Sbjct: 140 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLG--------LVSLNLLSPTVLSMSREAPG 191
Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
S L F +V + +S V + +P+P RSLLVL
Sbjct: 192 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAVPLPRRSLLVL 236
Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
G AR++W+H + R + RRVCI +RE +A +
Sbjct: 237 TGAARHQWKHAIHRRHIEARRVCITFRELSAEF 269
>gi|47214665|emb|CAG00901.1| unnamed protein product [Tetraodon nigroviridis]
Length = 917
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 66/142 (46%), Positives = 103/142 (72%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S SF GV++ E+F+SE E E ++ +D W++SQSGRRKQ+FGPK NFKK+++ LG
Sbjct: 38 SAASFPFHGVFLWENFISEEEEEHLISSIDQNLWNESQSGRRKQDFGPKVNFKKRRVCLG 97
Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
F G P ++ + +R HQE +LS F+ +EQC L+Y+ + GS+ID H+DD W+WGER+VT+
Sbjct: 98 GFSGLPVSSRKLLERMHQEPLLSDFRPVEQCNLDYHPQRGSAIDPHLDDSWLWGERLVTI 157
Query: 231 NLIGDSVLTLTYYNGDSNKYNL 252
N++ D+ LT++ ++G S + +
Sbjct: 158 NMLSDTTLTMSLHDGLSGEVRV 179
>gi|110625894|ref|NP_082346.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Mus
musculus]
gi|18490963|gb|AAH22729.1| AlkB, alkylation repair homolog 4 (E. coli) [Mus musculus]
Length = 215
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 118/206 (57%), Gaps = 15/206 (7%)
Query: 135 VMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH 194
+++ +D PW SQSGR+KQ++GPK NF+KQK+K+ FQG P ++ + R L
Sbjct: 1 MVRLMDCDPWKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKVVQRMGLYPGLED 60
Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+ +V++++ +
Sbjct: 61 FQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP----------EA 110
Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
P P + D SL S E V + I +P RSLLVL G AR++W
Sbjct: 111 PGSLLLCSAPSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRSLLVLTGAARHQW 165
Query: 315 QHCVLRSDVIGRRVCIAYREFTAPYM 340
H + R + RRVC +RE ++ ++
Sbjct: 166 THAIHRRHIKARRVCATFRELSSEFL 191
>gi|326433577|gb|EGD79147.1| hypothetical protein PTSG_09879 [Salpingoeca sp. ATCC 50818]
Length = 270
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 117/228 (51%), Gaps = 44/228 (19%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
GV++ +F++ E + ++ Q+D+ PW SQSGR KQ+FGPK NFK++K+K+G+F G P+
Sbjct: 68 GVHVFREFVTADEEQALLSQMDEWPWKLSQSGRWKQDFGPKVNFKRKKVKVGNFTGLPSS 127
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
++ + R L F+ +EQC L+Y E G++ID H DD WIWGER+VTVNL+ ++ L
Sbjct: 128 SRDLVARMQALPCLQDFEPVEQCHLDYRPERGAAIDMHFDDDWIWGERLVTVNLLSETRL 187
Query: 239 TLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPM 298
+ + + + + + +
Sbjct: 188 SFEHPQHEGAQ--------------------------------------------VYVVL 203
Query: 299 PARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA 346
PARSL+ + G AR +W+H V R + RR+ + RE ++ A
Sbjct: 204 PARSLVAVQGSARTQWKHAVHRGAITDRRIAVTLRELGPDFVAGAGRA 251
>gi|386783769|gb|AFJ24779.1| alpha ketoglutarate dependent dioxygenase ABH4 [Schmidtea
mediterranea]
Length = 287
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 100/342 (29%), Positives = 153/342 (44%), Gaps = 73/342 (21%)
Query: 6 FCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
C CKG+RTC C N N ++ + I C C K
Sbjct: 3 LCTCKGIRTCSSC-------NPNKIKIENQNCIVCYFCPK-------------------- 35
Query: 66 FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKED 125
++ KE + SDL + H + ++ G+ + E+
Sbjct: 36 ------------ISKIVKENCI-----------SDLKCEEFH----KVNIELNGIILIEN 68
Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
FL+E + ++ + W SQSGRRKQ+FGPK NFKK+KI L FQG P + +R
Sbjct: 69 FLTEDDKNYLLGGICSNSWVDSQSGRRKQDFGPKVNFKKRKINLTKFQGLPEYIERFVNR 128
Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
F L F +E C LEYN G+SID H DD W+WGER+VT+N+ + LT T G
Sbjct: 129 FSDIPELKDFNPVELCNLEYNPTRGASIDPHFDDFWLWGERLVTINVQSSTYLTFTP--G 186
Query: 246 DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLV 305
+ ++ +S F + + + N+ +V I++P+P SL++
Sbjct: 187 FPDMFDESSQAF-FSSVCSENHENMGNN----------------CAVSIKVPLPEGSLVI 229
Query: 306 LWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATAS 347
+ G+AR++W H V +DV R+ RE + + N+ S
Sbjct: 230 VSGDARHKWMHAVSAADVQSTRIASTLRELSMEFTENDRELS 271
>gi|268563819|ref|XP_002647020.1| Hypothetical protein CBG24043 [Caenorhabditis briggsae]
Length = 291
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 117/230 (50%), Gaps = 47/230 (20%)
Query: 107 HPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQK 166
H + + I G+ + DFL+EAE DV++ +D + W +SQSGRRKQ++GPK NFK +K
Sbjct: 73 HSTSAENAIQIDGLTLIHDFLTEAEESDVLEMIDTVEWVQSQSGRRKQDYGPKVNFKHKK 132
Query: 167 IKLGDFQGFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWG 224
+K F G P + ++ + DV L ++Q E C LEY S+I+ H DD WIWG
Sbjct: 133 VKTDSFIGMPEYADMLLNKMSEYDVTKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWG 192
Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSI 284
R++++NLI SV+TL+ DS + S C+ Y
Sbjct: 193 NRLISINLINGSVMTLS---NDSKE----SLCYVY------------------------- 220
Query: 285 VNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
MP RSL+ + E RYEW+H VL + GRR+ + RE
Sbjct: 221 -------------MPHRSLICMADECRYEWKHGVLAHHIRGRRIALTMRE 257
>gi|170579486|ref|XP_001894850.1| LD42289p [Brugia malayi]
gi|158598401|gb|EDP36302.1| LD42289p, putative [Brugia malayi]
Length = 297
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/339 (29%), Positives = 147/339 (43%), Gaps = 87/339 (25%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVE---FTE-ETYIYCPHCNKAWQGSDLYAYQSHPNHS 62
CGCKGVR C C+ + + E F + + Y+Y H + A + L PN
Sbjct: 18 CGCKGVRFCAACKDTLRVKKLRLCEEYPFAKYKKYVYSSHHHIAIHDNSL------PNRP 71
Query: 63 GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYI 122
+ DI V NKA N+ + G+Y+
Sbjct: 72 SLA-DIHDV-----------------ASRINKA------------ENNFEDDLAVPGLYV 101
Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
DFLSEAE D++ +D W SQSGRRKQ++GP+ NFK +KIK+ F G P+ T I
Sbjct: 102 VIDFLSEAEEVDLVNAIDKTDWVLSQSGRRKQDYGPRINFKHKKIKMDRFFGMPSYTDVI 161
Query: 183 QDRFH--QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
+R + D+ + E C LEY S+I+ H DD WIWGER++ +NL+ SVLT
Sbjct: 162 LNRMNSISSDLFGSYHPFELCNLEYRDSRWSTIEMHYDDTWIWGERLICINLLTKSVLT- 220
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
Y D + ++I +P+P+
Sbjct: 221 --YANDEKQ------------------------------------------LIIYVPLPS 236
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
R+++ + E RY W+H V + GRR+ + RE + +
Sbjct: 237 RTMVCMSDEIRYSWRHAVFPEHIRGRRIALTMREPSTAF 275
>gi|393907552|gb|EJD74689.1| hypothetical protein LOAG_18028 [Loa loa]
Length = 293
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 150/336 (44%), Gaps = 81/336 (24%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKGVR C C K+ + T+ C + Y Y + + +
Sbjct: 14 CGCKGVRFCAAC--------KDTLRVTKLAL-----CGE-------YPYGEYKKYVYSTR 53
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYA-YQSHPNHSGKSFDIGGVYIKED 125
+ ++ K LT + T ++ DL N I G+++ D
Sbjct: 54 HLAAIHDKS--LTGRPTLTEIH-----------DLAGKINGTENKPEDYLIIPGLHVITD 100
Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
FL+EAE D++ +D W SQSGRRKQ++GP+ NFK +KIK+ F G P+ I +R
Sbjct: 101 FLTEAEEMDLINVIDRTDWVLSQSGRRKQDYGPRVNFKHKKIKMERFLGMPSYIDVILNR 160
Query: 186 FH--QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYY 243
+ D+ +Q E C LEY+ + S+I+ H DD WIWGER++ VNL+ SVLT Y
Sbjct: 161 MNSISPDLFGSYQPFELCNLEYSDDRWSNIEMHYDDTWIWGERLICVNLLSKSVLT---Y 217
Query: 244 NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSL 303
D+ + ++I +P+P+R++
Sbjct: 218 TNDAKQ------------------------------------------LIIYVPLPSRTM 235
Query: 304 LVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
+ + E RY W+H V + GRR+ + RE +A +
Sbjct: 236 VCMSDEIRYSWRHAVFPEHIRGRRIAVTMREPSAAF 271
>gi|402593158|gb|EJW87085.1| hypothetical protein WUBG_02002 [Wuchereria bancrofti]
Length = 296
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 150/339 (44%), Gaps = 87/339 (25%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKGVR C C+ + + E Y + + Y Y +H +
Sbjct: 17 CGCKGVRFCAACKDTLRVKELRLCE----EYPFAKYKK--------YVYSTHHH------ 58
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFD----IGGVYI 122
+ I ++ L+++ +D++ + N + F+ + G+++
Sbjct: 59 ----IAIHDNSLSNRP--------------SLADIHDVANRINKAENKFEDYLVVPGLHV 100
Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
+FLSEAE D++ +D W SQSGRRKQ++GP+ NFK +K+K+ F G P+ T I
Sbjct: 101 VTNFLSEAEEVDLVNAIDKTDWVLSQSGRRKQDYGPRINFKHKKVKMDRFFGMPSYTDVI 160
Query: 183 QDRFH--QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
+R + D+ +Q E C LEY S+I+ H DD WIWGER++ VNL+ SVLT
Sbjct: 161 LNRMNSISSDLFGSYQPFELCNLEYRDSRWSTIEMHYDDTWIWGERLICVNLLSKSVLT- 219
Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
Y D + ++I +P+P
Sbjct: 220 --YANDEKQ------------------------------------------LIIYVPLPC 235
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
R+++ + E RY W+H V + GRR+ + RE + +
Sbjct: 236 RTMVCMSDEIRYSWRHAVFPEHIRGRRIALTMREPSTAF 274
>gi|340057250|emb|CCC51593.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 299
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 131/262 (50%), Gaps = 57/262 (21%)
Query: 85 TYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFD----IGGVYIKEDFLSEAETEDVMKQLD 140
T+ +C +C K + + A ++ +H+G + D + G+ + DF+S+ E + +++ D
Sbjct: 64 THSFCSNCGKIFVFPE-RAIRACADHNGLTADSETTMRGLMVSPDFVSDIEEDYLLRFFD 122
Query: 141 DL---PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQ---EDVLSH 194
W SQSGRRKQ++GP+ NFKK+K+K GD G P K I R +Q + +
Sbjct: 123 GAHHSRWKVSQSGRRKQDYGPRANFKKRKLKKGDGNGMPIQLKDIITRVNQFISNETMKR 182
Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
+QTIE LEY+++ GSSID HIDD W+WG+RI +NL+ D VLTL
Sbjct: 183 YQTIEVSVLEYSTKCGSSIDTHIDDTWLWGDRIGGLNLLEDVVLTLV------------- 229
Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
D K T+ + +P RS +L GE+RY W
Sbjct: 230 ----------DSKGTVAT-----------------------VFVPRRSFFLLSGESRYNW 256
Query: 315 QHCVLRSDVIGRRVCIAYREFT 336
H + D+ RR+ + +REF
Sbjct: 257 MHGIRSEDIKSRRISMTFREFA 278
>gi|324523341|gb|ADY48232.1| Alpha-ketoglutarate-dependent dioxygenase ABH4, partial [Ascaris
suum]
Length = 296
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 129/272 (47%), Gaps = 61/272 (22%)
Query: 84 ETYVYCPHCNKAWQGS---------DLYAYQSHPNH-----SGKSFDIGGVYIKEDFLSE 129
E YVY KA S D++A + N K ++ G+ + +DFLSE
Sbjct: 43 ERYVYSSRHAKAIYDSRLPADPSLDDVHAAAARVNSLSTLDPDKCLNLDGILLIDDFLSE 102
Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH-- 187
E + +M+++D+ W SQSGRRKQ++GP+ NFK++K+K+ F G P T FI R
Sbjct: 103 QEEKTIMRKIDNKEWIISQSGRRKQDYGPRVNFKQKKVKMDRFFGMPDYTDFILQRMEAL 162
Query: 188 QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
+ L+ FQ E C LEY S+I+ H DD WIWG R++++NL+ D V+T T
Sbjct: 163 SANKLNAFQPFELCNLEYVESRQSAIELHFDDWWIWGNRLISLNLLNDCVMTFT------ 216
Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
ND +V+ +P R+LL +
Sbjct: 217 ------------------------ND---------------ERQLVVYAALPRRTLLCMC 237
Query: 308 GEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
E RYEW+H VL V GRR+ + RE + +
Sbjct: 238 DEVRYEWKHGVLPQHVRGRRIALTMREPSPAF 269
>gi|340380835|ref|XP_003388927.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Amphimedon queenslandica]
Length = 240
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 113/226 (50%), Gaps = 45/226 (19%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
+F GV I +F+S E ++++ +D W +SQSGR KQ++GPK NFKK+K+K F
Sbjct: 39 TFKFDGVKIIRNFVSFEEEKELVSFIDSSQWVESQSGRYKQDYGPKVNFKKRKLKTDTFY 98
Query: 174 GFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLI 233
G P T + +R V F+ +E C L+Y+ GS+ID H DD W+WG+R++T+NL+
Sbjct: 99 GLPKYTHTLLERLESLSVTPSFKPVELCNLDYHPTRGSAIDPHFDDNWLWGDRLITINLL 158
Query: 234 GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV 293
++L+ N + +V
Sbjct: 159 SSTLLSF---------------------------------------------NHPVDPLV 173
Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
+ +P P S++++ G ARY W H + R +V GRRV + +RE + +
Sbjct: 174 VHVPHPRLSIVIIEGSARYNWLHGIRRENVTGRRVAMTFRELSREF 219
>gi|403343479|gb|EJY71069.1| hypothetical protein OXYTRI_08063 [Oxytricha trifallax]
Length = 342
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 167/391 (42%), Gaps = 103/391 (26%)
Query: 2 DRSNFCGCKGVRTCIKC-EQEFGYENKN------IVEFTEE--TYIYCPHCNKAWQGSDL 52
DR C C GVR C C + EF + K+ I+E ++ TY+ C CN+ ++L
Sbjct: 8 DRERKCACTGVRYCKDCNDPEFRKQFKDLYPIDDILEAQQKVLTYVTCELCNRFKLKNEL 67
Query: 53 YAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSG 112
QS N+ K+D Q+ + Q + +
Sbjct: 68 NISQSDQNNGNND--------KQDDQNYQS-------------------FIDQCIGHLTS 100
Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
+ D GG+Y DF+SE +D++ +L D W SQSGR+K +FGP+ NFKKQK+K F
Sbjct: 101 EQLDFGGLYTINDFISEEFEQDIVNKLQDYKWVDSQSGRKKIDFGPQVNFKKQKLKYTKF 160
Query: 173 QGFPACTK------------FIQDRFHQE-------------DVLSHFQTIEQCTLEYNS 207
GFP K +Q++ ++ VL F IE LEY+
Sbjct: 161 TGFPLFIKPILDLIQTLNDEILQEKIEEQKNQQKPQLKENLPSVLKDFIPIEVNVLEYDE 220
Query: 208 ENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQK 267
+ GS+I H DD W+WGERI+ +NL+ D+ +T +Q+ +Q
Sbjct: 221 QRGSNIAPHKDDFWLWGERIIGINLLQDTFMT-------------------FQRDSQNQI 261
Query: 268 STIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRR 327
IV I +P+ R L ++ G++R+EW H + + G+R
Sbjct: 262 GQIVE---------------------IEVPVKRRMLYIISGKSRFEWMHGIKSEHIKGKR 300
Query: 328 VCIAYREFTAPY--MNNEATASVFEKAKCFF 356
+ REF+ + +NE + AK +
Sbjct: 301 IVCTIREFSDEFKSQDNEDANKIRAIAKTYI 331
>gi|25148697|ref|NP_741141.1| Protein F09F7.7, isoform a [Caenorhabditis elegans]
gi|351060673|emb|CCD68390.1| Protein F09F7.7, isoform a [Caenorhabditis elegans]
Length = 291
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 114/223 (51%), Gaps = 47/223 (21%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
S +I G+ + +FLSE+E ++ +D + W +SQSGRRKQ++GPK NFK +K+K F
Sbjct: 80 SIEIDGLTLIHNFLSESEESKILNMIDTVEWAQSQSGRRKQDYGPKVNFKHKKVKTDTFV 139
Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
G P + ++ + DV L ++Q E C LEY S+I+ H DD WIWG R++++N
Sbjct: 140 GMPEYADMLLNKMSEYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLISIN 199
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
LI SV+TL+ + NK L C+
Sbjct: 200 LINGSVMTLS----NDNKSFL---CY---------------------------------- 218
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
+ MP RSLL + E RY+W+H VL + GRR+ + RE
Sbjct: 219 ----VHMPHRSLLCMADECRYDWKHGVLAHHIRGRRIALTMRE 257
>gi|403286064|ref|XP_003934327.1| PREDICTED: uncharacterized protein LOC101032010 [Saimiri
boliviensis boliviensis]
Length = 517
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/235 (32%), Positives = 118/235 (50%), Gaps = 57/235 (24%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ
Sbjct: 23 CGCKGIRTCLICERQRGG-------------------DPPWQ------------------ 45
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQ-GSDLYAYQSHPNHSGKSFDIGGVYIKED 125
L ++YC + W GS+ ++ G +F GV + ED
Sbjct: 46 -----------LPPAKTHHFIYC--SDTGWAVGSEESDFE------GWAFPFPGVMLIED 86
Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
F+++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K F G P+ ++ + R
Sbjct: 87 FVTQEEEAELVRLMDCDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRR 146
Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
L FQ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 147 MGLYPGLEGFQPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSM 201
>gi|444715583|gb|ELW56448.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4 [Tupaia
chinensis]
Length = 421
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/234 (32%), Positives = 121/234 (51%), Gaps = 55/234 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE + G + WQ A ++H
Sbjct: 15 CGCKGIRTCLICELQRGG-------------------DPPWQP---LARKTH-------- 44
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
+VYC + W + A +S + G +F GV + EDF
Sbjct: 45 ------------------RFVYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+K+K+K+ F+G P+ ++ + R
Sbjct: 80 VTPEEEAEMVRLMDLDPWKLSQSGRRKQDYGPKVNFRKRKLKMAGFRGLPSFSRELVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
L F+ +EQC L+Y E GS+ID H+DD W+WGER+V++NL+ +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSM 193
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 29/39 (74%)
Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
RSLL+L G AR++W+H + R D+ RRVC+ +RE +A +
Sbjct: 358 RSLLLLPGAARHQWKHAIHRRDIEARRVCVTFRELSAEF 396
>gi|341883863|gb|EGT39798.1| hypothetical protein CAEBREN_17559 [Caenorhabditis brenneri]
Length = 292
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 116/223 (52%), Gaps = 47/223 (21%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
S +I G+ + +FLSE+E +++ +D++ W +SQSGRRKQ++GPK NFK +K+K F
Sbjct: 80 SIEIDGLTLIHNFLSESEETNILNMIDNVEWVQSQSGRRKQDYGPKVNFKHKKVKTDSFI 139
Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
G P + ++ Q DV L ++Q E C LEY S+I+ H DD WIWG R++++N
Sbjct: 140 GMPEYADMLLEKMSQYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLISIN 199
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
LI SV+TL+ +S ++ C+
Sbjct: 200 LINGSVMTLS---NESRQF----LCY---------------------------------- 218
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
+ MP RSL+ + + RY+W+H VL + GRR+ + RE
Sbjct: 219 ----VHMPHRSLICMADDCRYDWKHGVLAHHIRGRRIALTMRE 257
>gi|403359506|gb|EJY79416.1| hypothetical protein OXYTRI_23312 [Oxytricha trifallax]
Length = 335
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 107/395 (27%), Positives = 164/395 (41%), Gaps = 107/395 (27%)
Query: 2 DRSNFCGCKGVRTCIKC-EQEFG------YENKNIVEFTEE--TYIYCPHCNKAWQGSDL 52
DR C C GVR C C + EF Y +I+E ++ TY+ C CN+ ++L
Sbjct: 8 DRERKCACTGVRYCKDCNDPEFRKQFKDLYPIDDILEAQQKVLTYVTCGLCNRFKLKNEL 67
Query: 53 YAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSG 112
S N K+D Q+ + Q + +
Sbjct: 68 NISISDQNDGNND--------KQDNQNDQS-------------------FIDQCIGHLTA 100
Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
+ D GG+Y ++ +SE +++ +L D W SQSGR+K +FGP+ NFKKQK+K F
Sbjct: 101 EQLDFGGLYTIKEIISEDFEYNIVNKLQDYKWVDSQSGRKKIDFGPQVNFKKQKLKYTKF 160
Query: 173 QGFPACTK-----------------------------FIQDRFHQEDVLSHFQTIEQCTL 203
GFP K +Q + H VL FQ IE L
Sbjct: 161 TGFPLFIKPILDLISTLNDQNLQQKEEELKEEQKDQALLQLKQHLPSVLKDFQPIEVNVL 220
Query: 204 EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKY 263
EY+ + GS+I H DD W+WGERI+ +NL+ D+ +T +Q+
Sbjct: 221 EYDEQRGSNIAPHKDDFWLWGERIIGINLLKDTFMT-------------------FQRDS 261
Query: 264 PDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDV 323
+Q IV I +P+ R + V+ G++R+EW H + +
Sbjct: 262 ENQLGQIVE---------------------IEVPVKRRMMYVISGKSRFEWMHGIKSEHI 300
Query: 324 IGRRVCIAYREFTAPY--MNNEATASVFEKAKCFF 356
G+R+ +REF+ + +NE + E AK F
Sbjct: 301 KGKRIVCTFREFSDEFKSQDNEDANKIREIAKNFI 335
>gi|209489362|gb|ACI49132.1| hypothetical protein Cbre_JD16.004 [Caenorhabditis brenneri]
Length = 314
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 115/223 (51%), Gaps = 47/223 (21%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
S +I G+ + +FLSE+E +++ +D++ W +SQSGRRKQ++GPK NFK +K+K F
Sbjct: 102 SIEIDGLTLIHNFLSESEESNILNMIDNVEWVQSQSGRRKQDYGPKVNFKHKKVKTDSFI 161
Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
G P + ++ Q DV L ++Q E C LEY S+I+ H DD WIWG R++++N
Sbjct: 162 GMPEYADMLLEKMSQYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLISIN 221
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
LI SV+TL+ +S ++ C+
Sbjct: 222 LINGSVMTLS---NESRQF----LCY---------------------------------- 240
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
+ MP RSL+ + RY+W+H VL + GRR+ + RE
Sbjct: 241 ----VHMPHRSLICMADACRYDWKHGVLAHHIRGRRIALTMRE 279
>gi|308497524|ref|XP_003110949.1| hypothetical protein CRE_04550 [Caenorhabditis remanei]
gi|308242829|gb|EFO86781.1| hypothetical protein CRE_04550 [Caenorhabditis remanei]
Length = 291
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/226 (33%), Positives = 113/226 (50%), Gaps = 47/226 (20%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S + +I G+ + DFL+E E +++ +D++ W +SQSGRRKQ++GPK NFK +K+K
Sbjct: 77 SDDAIEIDGLTLIHDFLTEREESSILEMIDNVEWVQSQSGRRKQDYGPKVNFKHKKVKTD 136
Query: 171 DFQGFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV 228
F G P + ++ D+ L ++Q E C LEY S+I+ H DD WIWG R++
Sbjct: 137 SFIGMPEYADMLLNKMSDYDLKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLI 196
Query: 229 TVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNET 288
++NLI SV+TL S DS
Sbjct: 197 SINLISGSVMTL---------------------------------------SNDS----- 212
Query: 289 LSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
S + + MP RSL+ + E RY+W+H VL + GRR+ + RE
Sbjct: 213 -KSFLCYVHMPHRSLICMADECRYDWKHGVLAHHIRGRRIALTMRE 257
>gi|426255358|ref|XP_004021316.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
[Ovis aries]
Length = 252
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 115/235 (48%), Gaps = 55/235 (23%)
Query: 7 CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
CGCKG+RTC+ CE++ G + WQ
Sbjct: 15 CGCKGIRTCLICERQRG-------------------GDPPWQH----------------- 38
Query: 67 DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
SQ ++Y P A + A +S + G +F GV + EDF
Sbjct: 39 ------------LSQKTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 79
Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
++ E ++++ +D PW SQSGRRKQ++GPK NF+KQK+K+ DF+G P+ ++ + R
Sbjct: 80 VTREEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKMADFRGLPSFSREVVRRM 139
Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLT 241
+L FQ +EQC L+Y E GS+ID H+DD W+WGER+ S++ L+
Sbjct: 140 GLYPILEDFQPVEQCNLDYCPERGSAIDPHLDDAWLWGERLGGPQPPRRSLVVLS 194
>gi|7498760|pir||T16009 hypothetical protein F09F7.7 - Caenorhabditis elegans
Length = 329
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 119/223 (53%), Gaps = 9/223 (4%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
S +I G+ + +FLSE+E ++ +D + W +SQSGRRKQ++GPK NFK +K+K F
Sbjct: 80 SIEIDGLTLIHNFLSESEESKILNMIDTVEWAQSQSGRRKQDYGPKVNFKHKKVKTDTFV 139
Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
G P + ++ + DV L ++Q E C LEY S+I+ H DD WIWG R++
Sbjct: 140 GMPEYADMLLNKMSEYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLIR-- 197
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
++ + +K L S + + ++S ++L S ++ N+ S
Sbjct: 198 --SENFRVFDFSQKLCSKKCLASRIINLNTNF--EESNQFFSINLINGSVMTLSNDN-KS 252
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
+ + MP RSLL + E RY+W+H VL + GRR+ + RE
Sbjct: 253 FLCYVHMPHRSLLCMADECRYDWKHGVLAHHIRGRRIALTMRE 295
>gi|167524358|ref|XP_001746515.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775277|gb|EDQ88902.1| predicted protein [Monosiga brevicollis MX1]
Length = 207
Score = 130 bits (327), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 117/229 (51%), Gaps = 44/229 (19%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
GV + DF++EAE ++ Q+D PW SQSGRRKQ+FGPK NFKK+K+K +F G PA
Sbjct: 5 GVRVWLDFVTEAEETALVAQMDAWPWTDSQSGRRKQDFGPKANFKKKKVKAANFTGLPAL 64
Query: 179 TKFIQDRFHQEDV-LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
+ + R L F +EQC L+Y G++I H DD W+WGER++T+NL+ +
Sbjct: 65 ARPLIPRLQALTADLKDFVPVEQCHLDYEPSRGAAIVPHFDDFWLWGERLITLNLLSTTF 124
Query: 238 LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
L CF + PD+ ++ I +P
Sbjct: 125 L-----------------CF----QLPDEP----------------------DALEIAVP 141
Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA 346
+P RSLL + G AR W H V R DV RR+ + +RE T +++ E A
Sbjct: 142 LPPRSLLEVKGVARMAWHHAVHRQDVSQRRIAMTWRELTPEFLSGERQA 190
>gi|312072236|ref|XP_003138973.1| hypothetical protein LOAG_03388 [Loa loa]
Length = 186
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 104/208 (50%), Gaps = 47/208 (22%)
Query: 134 DVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQ--EDV 191
D++ +D W SQSGRRKQ++GP+ NFK +KIK+ F G P+ I +R + D+
Sbjct: 2 DLINVIDRTDWVLSQSGRRKQDYGPRVNFKHKKIKMERFLGMPSYIDVILNRMNSISPDL 61
Query: 192 LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYN 251
+Q E C LEY+ + S+I+ H DD WIWGER++ VNL+ SVLT Y D+ +
Sbjct: 62 FGSYQPFELCNLEYSDDRWSNIEMHYDDTWIWGERLICVNLLSKSVLT---YTNDAKQ-- 116
Query: 252 LNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEAR 311
++I +P+P+R+++ + E R
Sbjct: 117 ----------------------------------------LIIYVPLPSRTMVCMSDEIR 136
Query: 312 YEWQHCVLRSDVIGRRVCIAYREFTAPY 339
Y W+H V + GRR+ + RE +A +
Sbjct: 137 YSWRHAVFPEHIRGRRIAVTMREPSAAF 164
>gi|261332706|emb|CBH15701.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 304
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 109/229 (47%), Gaps = 54/229 (23%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
++ G+ I+ F+S E E ++ D+ W SQSGRRKQ +GPK NFKK+K+K+G
Sbjct: 102 IELSGLTIQPGFVSPNEEEYLVAFFDNPAPFAAWKVSQSGRRKQEYGPKANFKKRKLKVG 161
Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERI 227
DF+ P K DR F E + + +E LEY +E S +D HIDD W+WG+RI
Sbjct: 162 DFRALPHQMKTTLDRVRSFVAEQTMREYCIVEVSVLEYTAEC-SCLDPHIDDTWLWGDRI 220
Query: 228 VTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNE 287
+NL+ D +TLT+ + D
Sbjct: 221 GGLNLLDD--VTLTFVSAD----------------------------------------- 237
Query: 288 TLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
V + + +P + +L G +RYEW H + R DV RRV + +REF
Sbjct: 238 ---EVAVTVFVPRGAFFLLTGVSRYEWMHGIRREDVKNRRVSVTFREFA 283
>gi|71747664|ref|XP_822887.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832555|gb|EAN78059.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 304
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 109/229 (47%), Gaps = 54/229 (23%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
++ G+ I+ F+S E E ++ D+ W SQSGRRKQ +GPK NFKK+K+K+G
Sbjct: 102 IELSGLTIQPGFVSPNEEEYLVAFFDNPAPFAAWKVSQSGRRKQEYGPKANFKKRKLKVG 161
Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERI 227
DF+ P K DR F E + + +E LEY +E S +D HIDD W+WG+RI
Sbjct: 162 DFRALPHQMKTTLDRVRSFVAEQTMREYCIVEVSVLEYTAEC-SCLDPHIDDTWLWGDRI 220
Query: 228 VTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNE 287
+NL+ D +TLT+ + D
Sbjct: 221 GGLNLLDD--VTLTFVSAD----------------------------------------- 237
Query: 288 TLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
V + + +P + +L G +RYEW H + R DV RRV + +REF
Sbjct: 238 ---EVAVTVFVPRGAFFLLTGVSRYEWMHGIRREDVKNRRVSVTFREFA 283
>gi|407409697|gb|EKF32423.1| hypothetical protein MOQ_003722 [Trypanosoma cruzi marinkellei]
Length = 304
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/263 (28%), Positives = 121/263 (46%), Gaps = 58/263 (22%)
Query: 86 YVYCPHCNKAWQGSDLYAYQSHPNHSGKS----FDIGGVYIKEDFLSEAETEDVMKQLDD 141
Y YC CN+ + A++S +H G + I G+++ DFLS + E ++ LD+
Sbjct: 69 YGYCWQCNRIFLMHH-GAFKSCADHEGATPNLDIRIEGLFVIPDFLSLLDEEKLVSFLDE 127
Query: 142 LP----WDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR---FHQEDVLSH 194
W SQSGRRKQ+FGP+ NFKK+K+ +G P + + ++ F +E
Sbjct: 128 PSSFSGWKHSQSGRRKQDFGPRANFKKRKLNTSGMRGMPKQLESVMEKVKSFVREITSKE 187
Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
+ +E LEY SEN SSID HIDD W+WG+R+ +NL+ D+V+T G
Sbjct: 188 YHIVEVSALEYTSENSSSIDPHIDDTWVWGDRVGGLNLLEDTVMTFVNNEG--------- 238
Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
+ + +P + +L +RY+W
Sbjct: 239 -------------------------------------TAVDVFLPRGAFFLLSQGSRYDW 261
Query: 315 QHCVLRSDVIGRRVCIAYREFTA 337
H + ++ RR+ +REF++
Sbjct: 262 LHGIRLENIKHRRISFTFREFSS 284
>gi|407849132|gb|EKG03971.1| hypothetical protein TCSYLVIO_004974 [Trypanosoma cruzi]
Length = 304
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 147/355 (41%), Gaps = 97/355 (27%)
Query: 1 MDRSNFCGCKGVRTCIKC---EQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQS 57
M++ C C G+R C +C E+ G ++N++ + SD+ + Q
Sbjct: 7 MEKEEGCCCSGIRFCGRCIESERAQGIIHQNVL---------------LVKSSDVISQQY 51
Query: 58 HPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLY-----AYQSHPNHSG 112
TS T V H WQ + ++ A++S +H G
Sbjct: 52 SAGR-----------------TSSCSFTCVELSHYGLCWQCNRIFLMHHGAFKSCADHEG 94
Query: 113 KS----FDIGGVYIKEDFLSEAETEDVMKQLDDLP----WDKSQSGRRKQNFGPKCNFKK 164
+ I G+++ DFLS + E ++ LD+ W SQSGRRKQ+FGP+ NFKK
Sbjct: 95 ATPNLDIRIEGLFVIPDFLSLLDEEKLVSFLDEPSSFSGWKHSQSGRRKQDFGPRANFKK 154
Query: 165 QKIKLGDFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
+K+ +G P + + + F ++ + +E LEY SEN SSID HIDD W
Sbjct: 155 RKLNTSGIRGMPKQLESVVKKVKSFVRDITSKEYHIVEVSALEYTSENSSSIDPHIDDTW 214
Query: 222 IWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKST 281
+WG RI +NL+ D+V+T G +
Sbjct: 215 VWGNRIGGLNLLEDTVMTFVNNEGTA---------------------------------- 240
Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
+ + +P + +L +RY+W H + ++ RR+ +REF+
Sbjct: 241 ------------VDVFLPRGAFFLLSNGSRYDWLHGIRLENIKHRRISFTFREFS 283
>gi|71659461|ref|XP_821452.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886833|gb|EAN99601.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 304
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 88/355 (24%), Positives = 147/355 (41%), Gaps = 97/355 (27%)
Query: 1 MDRSNFCGCKGVRTCIKC---EQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQS 57
M++ C C G+R C +C E+ G ++N++ + SD+ + Q
Sbjct: 7 MEKEEGCCCSGIRFCGRCIESERAQGITHQNVL---------------LVKSSDVISQQY 51
Query: 58 HPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLY-----AYQSHPNHSG 112
TS T V H WQ + ++ ++S +H G
Sbjct: 52 GAGR-----------------TSSCSFTCVELSHYGLCWQCNRIFLMHHGVFKSCADHEG 94
Query: 113 KS----FDIGGVYIKEDFLSEAETEDVMKQLDDLP----WDKSQSGRRKQNFGPKCNFKK 164
+ I G+++ DFLS + E ++ LD+ W SQSGRRKQ+FGP+ NFKK
Sbjct: 95 TTPNLDIRIEGLFVIPDFLSLLDEEKLVSFLDEPSSLSGWKHSQSGRRKQDFGPRANFKK 154
Query: 165 QKIKLGDFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
+K+ +G P + + ++ F ++ + +E LEY SEN SSID HIDD W
Sbjct: 155 RKLNTSGIRGMPKQLESVMEKVKSFVRDITSKEYHIVEVSALEYTSENSSSIDPHIDDTW 214
Query: 222 IWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKST 281
+WG R+ +NL+ D+V+T G +
Sbjct: 215 VWGNRVGGLNLLEDTVMTFVNNEGTA---------------------------------- 240
Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
+ + +P + +L +RY+W H + ++ RR+ +REF+
Sbjct: 241 ------------VDVFLPRGAFFLLSNGSRYDWLHGIRLENIKHRRISFTFREFS 283
>gi|339242841|ref|XP_003377346.1| alkylated DNA repair protein AlkB protein [Trichinella spiralis]
gi|316973860|gb|EFV57409.1| alkylated DNA repair protein AlkB protein [Trichinella spiralis]
Length = 273
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 136/335 (40%), Gaps = 85/335 (25%)
Query: 6 FCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
CGCKG+R C CE +N E + W+ ++ Y + SG S
Sbjct: 11 ICGCKGIRFCRLCENSDRVKN-----LQRERCL-------DWKKYRIFVYCRKCDSSGHS 58
Query: 66 FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYA-YQSHPNHSGKSFDIGGVYIKE 124
LTS+ + Q +LY + + SF V + E
Sbjct: 59 ------------LTSEM----------HSVEQLLELYNNFHDLVQGNFDSFACETVKVVE 96
Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-----FPACT 179
+FL+E E ++K + W SQSGRRKQ++GPK NFKK+KIK P
Sbjct: 97 NFLTEDEEASLLKSISQSAWQVSQSGRRKQDYGPKVNFKKRKIKYHTPNNQRDNTLPEYF 156
Query: 180 KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
F+ DR Q L F ++ LEY SE GS I+ H DD W WGERIVT +L D+V++
Sbjct: 157 SFLFDRMKQLKFLQDFVAVQVTNLEYCSERGSWIEFHTDDQWAWGERIVTCSLCSDAVMS 216
Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMP 299
+S +I +P+
Sbjct: 217 FLEEENNS---------------------------------------------LIHVPLL 231
Query: 300 ARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
RSL+ + G R++ +H +L ++G R+ I RE
Sbjct: 232 KRSLICISGPMRHKLKHAILPGHIVGTRIAITVRE 266
>gi|154345804|ref|XP_001568839.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066181|emb|CAM43971.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 297
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 141/364 (38%), Gaps = 79/364 (21%)
Query: 2 DRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNH 61
+ S+ C C G+R C KC + F+ ++ S + Q H +
Sbjct: 3 EHSSPCVCSGIRFCAKCRDTLRVQQL----FSGSVFLS--------SASSVIEKQWHNDR 50
Query: 62 -SGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQG-SDLYAYQSHPNHSGKSFDIGG 119
S SF I G K T YC C ++ + + + H S I G
Sbjct: 51 LSSCSFAIIG------------KSTLSYCIECMTIFKSEAPIKSCVDHQGAMSTSIVISG 98
Query: 120 VYIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGF 175
+ + +D L+E E ++ LD+ PW +SQSGRRKQ++GPK NFKK+K++ +
Sbjct: 99 LVVFQDVLTEEEETALIYYLDNSHPFPPWKESQSGRRKQDYGPKRNFKKKKVRPAEIPAM 158
Query: 176 PACTKFIQDRFHQED---VLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNL 232
P + + ++ E LEY S+ D HIDD W+WG+RI +NL
Sbjct: 159 PLALEPVCATISSTTENFTGRAYRIAEVSALEYVEGKMSNFDPHIDDTWLWGDRIAGLNL 218
Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
V+T +G V
Sbjct: 219 NEPCVVTFVEPDG----------------------------------------------V 232
Query: 293 VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASVFEKA 352
+ +P RS ++ G RY+W H + V GRR+ + +RE + + + T+ V A
Sbjct: 233 CCDVYLPRRSFFLMSGNCRYKWMHGIRPEHVRGRRISLTFRELSDEILADAETSEVVLSA 292
Query: 353 KCFF 356
F
Sbjct: 293 ASTF 296
>gi|342184304|emb|CCC93785.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 306
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 105/227 (46%), Gaps = 54/227 (23%)
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDL----PWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
+ G + DF+S E + ++ DD PW SQSGRRKQ++GPK NFKK+K+KLG+F
Sbjct: 106 MDGFCLMTDFISPLEEQKLVTFFDDPAPFPPWKDSQSGRRKQDYGPKANFKKKKLKLGNF 165
Query: 173 QGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVT 229
QG P + + R F F E LEY ++ SS+D H+DD W+WG+RI
Sbjct: 166 QGMPQQMEELLGRVTSFVSRYTNKEFSVAEVSALEYTTKC-SSLDPHVDDTWLWGDRIGG 224
Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
+NL+ D VLT +G
Sbjct: 225 LNLLVDVVLTFVNASG-------------------------------------------- 240
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
+ + +P RS +L RYEW H + R D++GRR+ + +RE
Sbjct: 241 --IAVAAHIPRRSFFMLSKVCRYEWMHGIRREDIVGRRISVTFRELA 285
>gi|401420114|ref|XP_003874546.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490782|emb|CBZ26046.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 297
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 118/282 (41%), Gaps = 55/282 (19%)
Query: 83 KETYVYCPHCNKAWQGS-DLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDD 141
K + +C C ++ + + H S I G+ + +D L E + V++ LDD
Sbjct: 61 KSSLSFCIECMSVFKSEVPIKSCADHHGAMATSITISGLAVFQDALPEEDETAVIRFLDD 120
Query: 142 ----LPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI---QDRFHQEDVLSH 194
PW +SQSGRRKQ++GP+ NFKK+K+K+ D P + I +
Sbjct: 121 SHPFPPWKESQSGRRKQDYGPRRNFKKKKVKVADIPAMPLVFEPIFSVISSMTETFTGKA 180
Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
++ E LEY S+ D H+DD W+WG+RI +N
Sbjct: 181 YRIAEVSALEYMEGKMSNFDPHVDDTWLWGDRIAGLN----------------------- 217
Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
LNE + VN V + +P R+ ++ G+ RY+W
Sbjct: 218 ---------------------LNEACAVTFVNS--EGVCCDVYLPRRTFFLMSGDCRYKW 254
Query: 315 QHCVLRSDVIGRRVCIAYREFTAPYM-NNEATASVFEKAKCF 355
H + V GRR+ + +RE + + + EA+ V A F
Sbjct: 255 MHGIRPEHVRGRRISLTFRELSDEILADTEASEMVLSAASNF 296
>gi|25148702|ref|NP_741142.1| Protein F09F7.7, isoform b [Caenorhabditis elegans]
gi|351060674|emb|CCD68391.1| Protein F09F7.7, isoform b [Caenorhabditis elegans]
Length = 208
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 75/127 (59%), Gaps = 2/127 (1%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
S +I G+ + +FLSE+E ++ +D + W +SQSGRRKQ++GPK NFK +K+K F
Sbjct: 80 SIEIDGLTLIHNFLSESEESKILNMIDTVEWAQSQSGRRKQDYGPKVNFKHKKVKTDTFV 139
Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
G P + ++ + DV L ++Q E C LEY S+I+ H DD WIWG R++
Sbjct: 140 GMPEYADMLLNKMSEYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLIRYE 199
Query: 232 LIGDSVL 238
+ VL
Sbjct: 200 FLFKKVL 206
>gi|146104297|ref|XP_001469788.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398024324|ref|XP_003865323.1| hypothetical protein, conserved [Leishmania donovani]
gi|134074158|emb|CAM72900.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322503560|emb|CBZ38646.1| hypothetical protein, conserved [Leishmania donovani]
Length = 297
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/370 (23%), Positives = 144/370 (38%), Gaps = 92/370 (24%)
Query: 2 DRSNFCGCKGVRTCIKCEQEFGYE---NKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSH 58
+ S+ C C G+R C KC + + ++ + T I H ++
Sbjct: 3 EHSSACSCAGIRFCAKCRDSSRVQQLFSGSVPLSSARTVIEKQHNDERL----------- 51
Query: 59 PNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGS-DLYAYQSHPNHSGKSFDI 117
S SF I G K + +C C ++ + + H S I
Sbjct: 52 ---SSCSFAIIG------------KSSLSFCIECASVFKSEVPIKSCADHQGAMATSITI 96
Query: 118 GGVYIKEDFLSEAETEDVMKQLDDL----PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
G+ + + L+E + V++ LDD PW +SQSGRRKQ++GP+ NFKK+K+K+ +
Sbjct: 97 SGLAVLQHALTEEDEAAVIRFLDDSHPFPPWKESQSGRRKQDYGPRRNFKKKKVKVAEIP 156
Query: 174 GFP-------ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGER 226
P A + + F + ++ E LEY S+ D H+DD W+WG+R
Sbjct: 157 SMPLVFESVFAVISSMTETFTGK----AYRIAEVSALEYMEGKMSNFDPHVDDTWLWGDR 212
Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
I +NL V+T G
Sbjct: 213 IAGLNLNEACVVTFVNPEG----------------------------------------- 231
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM-NNEAT 345
V + +P R+ ++ G RY+W H + V GRR+ + +RE + + + EA+
Sbjct: 232 -----VCCDVYLPRRAFFLMSGNCRYKWMHGIRHEHVRGRRISLTFRELSDEILADTEAS 286
Query: 346 ASVFEKAKCF 355
V A F
Sbjct: 287 EMVLSAASNF 296
>gi|195577921|ref|XP_002078817.1| GD23631 [Drosophila simulans]
gi|194190826|gb|EDX04402.1| GD23631 [Drosophila simulans]
Length = 349
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 140/320 (43%), Gaps = 48/320 (15%)
Query: 66 FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIGGVYIK 123
F I ++E F Q E + YC C+ G D Q H NH + + G+ ++
Sbjct: 46 FQIVKTSLQEQF---QELEAWSYCIQCDHLQPGWDTNQVQKDHENHKKDEGLPLPGILVQ 102
Query: 124 EDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ-GFPACTKFI 182
E+FLS E ++ LDDLPWD SQSGRRKQNFGPK N KI + + PA +
Sbjct: 103 EEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTN----KISVNALELSAPATPHLL 158
Query: 183 QDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNH----------IDDCWIWGERIV---- 228
Q HQ+ +H Q Q ++ + S + ++D + R++
Sbjct: 159 Q---HQQ---AHKQAKPQGWMQLSGHPESIVPTSTGIVRKRISGLEDSEVHAYRLICKEP 212
Query: 229 -TVNLIG-----DSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD 282
T ++ + + + PC K ++ + +++S K
Sbjct: 213 QTAQIVPAYFGIQEMQSQHFIELQDLLAGFRDPCVMDIKM--GSRTFLESEVSSMAKGRG 270
Query: 283 SIVNETLSSVVIRIPMPARSLLVL---WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
T + P + L L +G ARY+++H VLR DV RRVC+AYREFT Y
Sbjct: 271 RPTAGTKKPTL----QPTQRSLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMY 326
Query: 340 MNN---EATASVFEKAKCFF 356
+N + V EK++ F+
Sbjct: 327 INGADIQKGDPVREKSQIFW 346
>gi|157876860|ref|XP_001686772.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129847|emb|CAJ09153.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 297
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 142/367 (38%), Gaps = 86/367 (23%)
Query: 2 DRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNH 61
+ S+ C C G+R C KC + ++ + S + Q +
Sbjct: 3 EHSSACSCAGIRFCAKCRGS-----------SRVQQLFNGSVPLSSARSVIEKQQDNERL 51
Query: 62 SGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGS-DLYAYQSHPNHSGKSFDIGGV 120
S SF + G K + +C C ++ + + H I G+
Sbjct: 52 SSCSFAVIG------------KSSLSFCIECASVFKSEVPIKSCADHQGAVATGITISGL 99
Query: 121 YIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
+ D L+E + V++ LDD PW +SQSGRRKQ++GPK NFKK+KIK+ + G P
Sbjct: 100 AVFRDTLTEEDETAVIRFLDDSRPFPPWKESQSGRRKQDYGPKRNFKKKKIKVAEIPGMP 159
Query: 177 -------ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVT 229
A + + F + ++ E LEY S+ D H+DD W+WG+RI
Sbjct: 160 LVFESVFAVISSMVETFTGK----AYRIAEVSALEYMEGKMSNFDPHVDDTWLWGDRIAG 215
Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
+N LNE + VN
Sbjct: 216 LN--------------------------------------------LNEACAVTFVNP-- 229
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM-NNEATASV 348
V + +P R+ ++ G RY W H + V GRR+ + +RE + + + +A+ V
Sbjct: 230 EGVCCDVYLPRRAFFLMSGNCRYRWMHGIRPEHVRGRRISLTFRELSDEILVDTKASEMV 289
Query: 349 FEKAKCF 355
A F
Sbjct: 290 LSAASHF 296
>gi|358337469|dbj|GAA55822.1| alkylated DNA repair protein alkB homolog 4 [Clonorchis sinensis]
Length = 183
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 62/99 (62%), Gaps = 4/99 (4%)
Query: 153 KQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH----FQTIEQCTLEYNSE 208
+Q++GPK NFK+Q++ G F G P+ ++F+ DR +Q SH F +E C LEY E
Sbjct: 12 RQDYGPKVNFKRQRVVPGRFTGLPSYSRFLVDRINQTIQTSHLRQTFIPVELCNLEYCPE 71
Query: 209 NGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
G+SI H+DD W+WG+R+VT+NL + L T N S
Sbjct: 72 RGASIVPHLDDAWLWGDRLVTLNLCSSTTLIFTLPNPGS 110
>gi|294944511|ref|XP_002784292.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239897326|gb|EER16088.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 252
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 68/123 (55%), Gaps = 2/123 (1%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD-FQGFPA 177
G+ + D ++EAE ++ + PW SQSGRRKQ++GP+ NFKK+K+K D FQG P
Sbjct: 55 GLVVLADAITEAEEATLLGDIYARPWKLSQSGRRKQDYGPQVNFKKRKLKCPDNFQGLPH 114
Query: 178 CTKFIQDRFHQE-DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
+ R H +L E EY GSSID H+D W+W + I+ ++L D
Sbjct: 115 SIDLVLPRIHTGLGLLLDHAWHEMVVQEYAVSRGSSIDLHVDHSWVWADGILDLSLAADC 174
Query: 237 VLT 239
++
Sbjct: 175 IMA 177
>gi|345316254|ref|XP_003429720.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH4-like
[Ornithorhynchus anatinus]
Length = 181
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 63/107 (58%), Gaps = 7/107 (6%)
Query: 70 GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
G + ++ L T +VYCP A QG++ Q G++F GV + EDF+S
Sbjct: 3 GPRLGQEPLPDPTVARFVYCPETGLA-QGAEATEVQ------GRAFPFPGVTLLEDFVSA 55
Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
E +++ +D PW SQSGRRKQ++GPK NF+K+K+K+G F G P
Sbjct: 56 EEEARMVRLMDRDPWKPSQSGRRKQDYGPKVNFRKRKLKVGAFDGLP 102
Score = 38.1 bits (87), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 36/73 (49%)
Query: 276 LNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREF 335
L + D + + + +P RSLLV+ G R+ W+H + R V RRVC +RE
Sbjct: 93 LKVGAFDGLPXXXXXXXXVAVGLPRRSLLVVHGPGRHRWRHGIRRRHVRARRVCATFREL 152
Query: 336 TAPYMNNEATASV 348
+A + A A +
Sbjct: 153 SAEFGPGGARAPL 165
>gi|380023187|ref|XP_003695407.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH4-like [Apis florea]
Length = 103
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 54/82 (65%), Gaps = 2/82 (2%)
Query: 1 MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEE--TYIYCPHCNKAWQGSDLYAYQSH 58
M+ CGCKG+R+C+ CE E+ N+ E+ +Y+YCP+C KAW G +L Y++H
Sbjct: 1 METIRPCGCKGIRSCLLCETEYKIVKPNLKACFEKCSSYVYCPYCEKAWPGWNLDLYKNH 60
Query: 59 PNHSGKSFDIGGVYIKEDFLTS 80
PNH G + + GVYIK DFLTS
Sbjct: 61 PNHQGTAIEFPGVYIKLDFLTS 82
>gi|428170226|gb|EKX39153.1| hypothetical protein GUITHDRAFT_143755 [Guillardia theta CCMP2712]
Length = 156
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 69/118 (58%), Gaps = 8/118 (6%)
Query: 109 NHSGKSFDIGGVYIKEDFLSEAETEDVMKQLD--DLPWDKSQSGRRKQNFGPKCNFKKQK 166
H G SF GV + E+F++E ++++D D W SQSGRRKQ FGPK NFKK++
Sbjct: 42 GHMGCSFQ--GVTVVEEFITEEVEGCWVREMDRDDRLWILSQSGRRKQEFGPKVNFKKEE 99
Query: 167 IKLGDFQG--FPACTK-FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
+K+ D G FP+ ++ F+Q +LS F+ +E LEY GS I HIDD W
Sbjct: 100 VKI-DPSGTIFPSWSQEFLQLCSSSSSLLSDFEAVELGLLEYEPLRGSCIAPHIDDSW 156
>gi|256052601|ref|XP_002569851.1| hypothetical protein [Schistosoma mansoni]
Length = 103
Score = 68.9 bits (167), Expect = 3e-09, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 50/78 (64%), Gaps = 1/78 (1%)
Query: 110 HSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKL 169
H G F + + +F+ E E +++++D W SQSGRRKQ++GPK NFK+QK+ +
Sbjct: 16 HQG-GFPFLDIEVIPNFIDENEEAMLVEEIDKQTWVLSQSGRRKQDYGPKVNFKRQKVHI 74
Query: 170 GDFQGFPACTKFIQDRFH 187
G F G PA ++F+ R++
Sbjct: 75 GGFYGLPAYSRFLITRYN 92
>gi|255557783|ref|XP_002519921.1| nucleic acid binding protein, putative [Ricinus communis]
gi|223540967|gb|EEF42525.1| nucleic acid binding protein, putative [Ricinus communis]
Length = 421
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 63/241 (26%), Positives = 109/241 (45%), Gaps = 47/241 (19%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
+I G+Y+ DF+S E ++++ +D PW KS S RR Q++G + ++ + + +LG
Sbjct: 153 LNIPGLYLLPDFVSAEEEQELLAAVDARPW-KSLSKRRVQHYGYEFCYQTRNVDTKQQLG 211
Query: 171 DFQGFPACTKFIQDRFHQEDVL--SHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERI 227
+ P + +R L S ++Q T+ EY G + HID + I
Sbjct: 212 EL---PPFISLVLERILSFPELGESASSILDQLTVNEY--PRGVGLSPHIDTHSAFEGSI 266
Query: 228 VTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSL-NEKSTDSIVN 286
+++L G ++ Y+ DS K+T N++ + N + ++V
Sbjct: 267 FSLSLAGPCIMEFRRYSDDSRA----------------PKATTNNEMIIENPDNGTNLVR 310
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYREFT 336
I +P RS+L+L GEARY WQH + + +VI RRV +R+
Sbjct: 311 RA-------IYLPPRSMLLLSGEARYAWQHYIPHHKIDTVNGNVITRGSRRVSFTFRKVR 363
Query: 337 A 337
A
Sbjct: 364 A 364
>gi|428169970|gb|EKX38899.1| hypothetical protein GUITHDRAFT_115004 [Guillardia theta CCMP2712]
Length = 568
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 6/142 (4%)
Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
K + G+ +KEDFLS E+ED + QLD+L W+ S S RR Q+FG ++ +++
Sbjct: 147 KDLVVEGLILKEDFLSPEESEDTLNQLDELEWESSLS-RRVQHFGFTFDYATRRVNTLKT 205
Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVN 231
+ FP I R +E +L +QCT+ EY + G I +H+D + + I++V+
Sbjct: 206 RAFPPFLLRIAQRALREKLL--LFEPDQCTVNEY--QPGQGIRSHVDTHSAFEDGILSVS 261
Query: 232 LIGDSVLTLTYYNGDSNKYNLN 253
L V+ +G S L
Sbjct: 262 LGSSVVMEFRSPDGLSKNVQLK 283
>gi|225427651|ref|XP_002270503.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Vitis
vinifera]
Length = 349
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 36/232 (15%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
+I G+Y+ DF+S E E+++ +D + W KS S RR Q++G + ++ + + + G
Sbjct: 124 LNIPGIYLLHDFVSAKEEEELLAAVDKMSW-KSLSKRRVQHYGYEFCYETRNVNTKQYLG 182
Query: 175 -FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTVNL 232
P+ I +R L I L N G + HID + I +++L
Sbjct: 183 KLPSFVSAIVERISSFPNLESAADIVLDQLTVNEYPPGVGLSPHIDTHSAFEGFIFSLSL 242
Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
G ++ Y + +S S + YPD+ S+ +
Sbjct: 243 AGPCIMDFRRYTEGVWPKSASSSDMSVE--YPDKSSSFLRR------------------- 281
Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYRE 334
I +P RS+L+L GEARY W H + ++ VI RRV +R+
Sbjct: 282 --AIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRK 331
>gi|449455667|ref|XP_004145573.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Cucumis sativus]
gi|449523025|ref|XP_004168525.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Cucumis sativus]
Length = 344
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 98/217 (45%), Gaps = 40/217 (18%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S DI G+++ DF++ E ED+++++D PW+ + + RR Q++G + ++ + +
Sbjct: 115 SASELDIPGLFLLHDFVNAKEEEDLLREVDARPWN-NLAKRRVQHYGYEFCYQTRNVNTK 173
Query: 171 DFQG-FPACTKFIQDRF----HQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWG 224
G P+ + DR + ED+ +++Q T+ EY G + HID +
Sbjct: 174 HQLGELPSFVSHVVDRISMFPNTEDIAD--ASLDQLTVNEY--PPGVGLSPHIDTHSAFE 229
Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPD---QKSTIVNDLSLNEKST 281
I +++L G PC ++YP+ K DL +
Sbjct: 230 GLIFSLSLAG--------------------PCIMEFRRYPEGTWHKFPSSIDLKMENSVN 269
Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
DS + + I +P RS+L+L GEARY W H +
Sbjct: 270 DS------NYLRKAIYLPPRSMLLLSGEARYVWHHYI 300
>gi|440796420|gb|ELR17529.1| alkB, alkylation repair protein, putative [Acanthamoeba castellanii
str. Neff]
Length = 172
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 64/124 (51%), Gaps = 5/124 (4%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ + D+L+ E ++ LD+LPW + RR Q +G +++KQ++ PAC
Sbjct: 19 GLTLVVDYLTVDEETALVNSLDELPW-LGEIQRRTQQYGYHYSYRKQRVDDTPVPPLPAC 77
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
+F+ DRF Q+D H + + + G I HID WG+ +V+++L+ D +
Sbjct: 78 VRFLLDRF-QDD---HVPLVAHQLIINEYQPGEQIKPHIDSTTDWGDCVVSLSLLDDWDM 133
Query: 239 TLTY 242
T+
Sbjct: 134 IFTH 137
>gi|297603304|ref|NP_001053775.2| Os04g0602700 [Oryza sativa Japonica Group]
gi|38344149|emb|CAD41869.2| OSJNBa0041A02.16 [Oryza sativa Japonica Group]
gi|116310931|emb|CAH67869.1| B0403H10-OSIGBa0105A11.21 [Oryza sativa Indica Group]
gi|125549609|gb|EAY95431.1| hypothetical protein OsI_17273 [Oryza sativa Indica Group]
gi|125591534|gb|EAZ31884.1| hypothetical protein OsJ_16049 [Oryza sativa Japonica Group]
gi|255675751|dbj|BAF15689.2| Os04g0602700 [Oryza sativa Japonica Group]
Length = 348
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 103/243 (42%), Gaps = 51/243 (20%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S + G+Y+ DF++ AE ++++ +D+ PW KS + RR Q++G + ++ + +
Sbjct: 119 SASELGVPGIYLVPDFVTAAEEQELLAAVDNRPW-KSLAKRRVQHYGFEFLYETRNVDSK 177
Query: 171 DFQG--FPACTKFIQDRFHQEDVLSHFQTIEQCT------LEYNS-ENGSSIDNHIDDCW 221
F G P +K I D + F +CT L N G + HID
Sbjct: 178 QFLGELPPFVSKII-------DKIMSFPGANKCTSKLVDQLTVNEYPCGVGLSPHIDTHS 230
Query: 222 IWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKST 281
+ E I +++L G PC +KYP + +S +K
Sbjct: 231 AFEEMIFSLSLAG--------------------PCIMEFRKYPKGSWRAPSMVSGTDK-- 268
Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLR---SDVIG-------RRVCIA 331
DSI E + + +P RS+L++ GE RY W H + DV G RRV
Sbjct: 269 DSI--EEPQCIRKAVFLPPRSMLLMSGEGRYAWHHYIPHHKIDDVGGQVIKRNTRRVSFT 326
Query: 332 YRE 334
+R+
Sbjct: 327 FRK 329
>gi|168033740|ref|XP_001769372.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679292|gb|EDQ65741.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 334
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 97/214 (45%), Gaps = 41/214 (19%)
Query: 114 SFDIG--GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
S D+G G+ + DF++ +E +++++++D+ W ++ + RR Q++G + +K + + L
Sbjct: 110 SADVGVPGLTLYTDFVTLSEEQELLREVDERSW-QTLAKRRVQHYGYEFLYKTRNVDLSQ 168
Query: 172 FQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS------ENGSSIDNHIDDCWIWG 224
G PA + + ++ ++ + E+ TL ++ G + HID
Sbjct: 169 KLGELPASLEPLLNKI---SLVPEIASAEEPTLPFDQLTVNEYPRGVGLSPHID------ 219
Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSI 284
T + + G +L PC +KY + + ND DS+
Sbjct: 220 --------------THSAFEGSILSLSLGGPCIMEFRKYIKRAIVVRND--------DSV 257
Query: 285 VNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
++ + + +P RSLL+L GEARY W H +
Sbjct: 258 PTGSVYIIRKALFLPPRSLLILSGEARYAWHHYI 291
>gi|296085482|emb|CBI29214.3| unnamed protein product [Vitis vinifera]
Length = 912
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 100/237 (42%), Gaps = 37/237 (15%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
+I G+Y+ DF+S E E+++ +D + W KS S RR Q++G + ++ + + + G
Sbjct: 687 LNIPGIYLLHDFVSAKEEEELLAAVDKMSW-KSLSKRRVQHYGYEFCYETRNVNTKQYLG 745
Query: 175 -FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTVNL 232
P+ I +R L I L N G + HID + I +++L
Sbjct: 746 KLPSFVSAIVERISSFPNLESAADIVLDQLTVNEYPPGVGLSPHIDTHSAFEGFIFSLSL 805
Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
G ++ Y + +S S + YPD+ S+ +
Sbjct: 806 AGPCIMDFRRYTEGVWPKSASSSDMSVE--YPDKSSSFLRRA------------------ 845
Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYREF-TAP 338
I +P RS+L+L GEARY W H + ++ VI RRV +R+ T P
Sbjct: 846 ---IYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRKVRTGP 899
>gi|147834541|emb|CAN76404.1| hypothetical protein VITISV_021238 [Vitis vinifera]
Length = 1036
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 36/232 (15%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
+I G+Y+ DF+S E E+++ +D + W KS S RR Q++G + ++ + + + G
Sbjct: 766 LNIPGIYLLHDFVSAKEEEELLAAVDKMSW-KSLSKRRVQHYGYEFCYETRNVNTKQYLG 824
Query: 175 -FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTVNL 232
P+ I +R L I L N G + HID + I +++L
Sbjct: 825 KLPSFVSAIVERISSFPNLESAADIVLDQLTVNEYPPGVGLSPHIDTHSAFEGFIFSLSL 884
Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
G ++ Y + +S S + YPD+ S+ +
Sbjct: 885 AGPCIMDFRRYTEGVWPKSASSSDMSVE--YPDKSSSFLRRA------------------ 924
Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYRE 334
I +P RS+L+L GEARY W H + ++ VI RRV +R+
Sbjct: 925 ---IYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRK 973
>gi|221485447|gb|EEE23728.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 1030
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 35/51 (68%)
Query: 190 DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
++L++FQ +E C LEY + GS I+ H DD W+WG R+VT NL ++L+
Sbjct: 775 ELLANFQPVELCLLEYVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSF 825
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 51/113 (45%), Gaps = 15/113 (13%)
Query: 70 GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
GVY+ D L Q + + N G+ P G + + GG +
Sbjct: 589 GVYLLPDALERQEEAAILAWADGNME-TGAQSREATHKPQPRGDTREYGG---------D 638
Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
AETE W SQSGRRK +FGP+ NFKK+++K G F GFP K +
Sbjct: 639 AETEPRAGL-----WVLSQSGRRKIDFGPQVNFKKKRLKPGRFNGFPPFAKAL 686
Score = 44.7 bits (104), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA---PYMNNEATASV 348
V IR+ +P RSL+V G RY W H V + RRV + RE P NE
Sbjct: 926 VEIRVLLPRRSLVVFEGHCRYTWTHAVRSHHIFSRRVAVTLRELAPEFLPGGENEEVGQK 985
Query: 349 FEKAKCFF 356
FF
Sbjct: 986 LLSLSAFF 993
>gi|237835449|ref|XP_002367022.1| hypothetical protein TGME49_046140 [Toxoplasma gondii ME49]
gi|211964686|gb|EEA99881.1| hypothetical protein TGME49_046140 [Toxoplasma gondii ME49]
Length = 1033
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 35/51 (68%)
Query: 190 DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
++L++FQ +E C LEY + GS I+ H DD W+WG R+VT NL ++L+
Sbjct: 778 ELLANFQPVELCLLEYVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSF 828
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 51/113 (45%), Gaps = 15/113 (13%)
Query: 70 GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
GVY+ D L Q + + N G+ P G + + GG +
Sbjct: 592 GVYLLPDALERQEEAAILAWADGNME-TGAQSREATHKPQPRGDTREYGG---------D 641
Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
A+TE W SQSGRRK +FGP+ NFKK+++K G F GFP K +
Sbjct: 642 AKTEPRAGL-----WVLSQSGRRKIDFGPQVNFKKKRLKPGRFNGFPPFAKAL 689
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA---PYMNNEATASV 348
V IR+ +P RSL+V G RY W H V + RRV + RE P NE
Sbjct: 929 VEIRVLLPRRSLVVFEGHCRYTWTHAVRSHHIFSRRVAVTLRELAPEFLPGGENEEVGQK 988
Query: 349 FEKAKCFF 356
FF
Sbjct: 989 LLSLSAFF 996
>gi|221506304|gb|EEE31939.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 1033
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 35/51 (68%)
Query: 190 DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
++L++FQ +E C LEY + GS I+ H DD W+WG R+VT NL ++L+
Sbjct: 778 ELLANFQPVELCLLEYVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSF 828
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 51/113 (45%), Gaps = 15/113 (13%)
Query: 70 GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
GVY+ D L Q + + N G+ P G + + GG +
Sbjct: 592 GVYLLPDALERQEEAAILAWADGNME-TGAQSREATHKPQPRGDTREYGG---------D 641
Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
AETE W SQSGRRK +FGP+ NFKK+++K G F GFP K +
Sbjct: 642 AETEPRAGL-----WVLSQSGRRKIDFGPQVNFKKKRLKPGRFNGFPPFAKAL 689
Score = 44.7 bits (104), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA---PYMNNEATASV 348
V IR+ +P RSL+V G RY W H V + RRV + RE P NE
Sbjct: 929 VEIRVLLPRRSLVVFEGHCRYTWTHAVRSHHIFSRRVAVTLRELAPEFLPGGENEEVGQK 988
Query: 349 FEKAKCFF 356
FF
Sbjct: 989 LLSLSAFF 996
>gi|357165655|ref|XP_003580453.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Brachypodium distachyon]
Length = 349
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 107/263 (40%), Gaps = 52/263 (19%)
Query: 90 PHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQS 149
P KA G+ + S I GV++ ++F++ AE ++++ +D PW K +
Sbjct: 102 PAAPKARPGAPVLPVAL----SSSELGIPGVHLVQEFVTAAEEQELLAAVDSRPW-KRLA 156
Query: 150 GRRKQNFGPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCT------ 202
RR Q++G + ++ + + F G P+ + D+ + F ++ CT
Sbjct: 157 KRRVQHYGYEFLYETRNVDSKQFLGELPSFVSKVLDK------IGAFPGVKNCTSKLVDQ 210
Query: 203 LEYNS-ENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQK 261
L N G + HID + E I++++L G ++ Y S +
Sbjct: 211 LTVNEYPCGVGLSPHIDTHSAFEEMILSLSLAGSCIMEFRQYPKGSWR------------ 258
Query: 262 KYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLR- 320
+I N TD + + + I +P RS+L++ GE RY W H +
Sbjct: 259 ----APPSIGN-------GTDEVSIQDPQCIRKAIFLPPRSMLLMSGEGRYAWHHYIPHH 307
Query: 321 --SDVIG-------RRVCIAYRE 334
DV G RRV +R+
Sbjct: 308 KIDDVAGQVIKRNSRRVSFTFRK 330
>gi|401412938|ref|XP_003885916.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325120336|emb|CBZ55890.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 951
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 33/52 (63%)
Query: 188 QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
++ +LS FQ +E C LEY GS I+ H DD W+WG R+VT L ++L+
Sbjct: 691 RKKLLSTFQPVELCLLEYVPSRGSHIEEHFDDFWLWGPRLVTFTLASSTILS 742
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/39 (64%), Positives = 30/39 (76%)
Query: 144 WDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
W SQSGRRK +FGPK NFKK+++KLG F GFP TK +
Sbjct: 582 WALSQSGRRKIDFGPKVNFKKKRLKLGLFNGFPPFTKRL 620
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 31/60 (51%)
Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASVFEK 351
V IR+ +P RSL+V G RY W H + + RRV + RE ++ +A V +K
Sbjct: 848 VEIRVLLPRRSLVVCEGPCRYTWTHAIRSHHIFSRRVAVTLRELAPEFLPGGESAEVGQK 907
>gi|302758364|ref|XP_002962605.1| hypothetical protein SELMODRAFT_78643 [Selaginella moellendorffii]
gi|300169466|gb|EFJ36068.1| hypothetical protein SELMODRAFT_78643 [Selaginella moellendorffii]
Length = 315
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 86/215 (40%), Gaps = 50/215 (23%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S + I G+ + +F+S E E +++++D PW ++ + RR Q++G + + + +
Sbjct: 101 SAQELGIPGLSLLTEFISSREEERLLQEVDARPW-QALAKRRVQHYGYEFLYNARNVDTS 159
Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS------ENGSSIDNHIDDCWIW 223
F G FP F+Q + ++ Q + T ++ G + HID +
Sbjct: 160 KFLGEFP---DFLQPLLEKISSIAELQETSEATFPFDQLTVNEYPRGVGLSPHIDTHSAF 216
Query: 224 GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDS 283
I++++L G V+ Y + SP F + +
Sbjct: 217 QGSIISLSLAGPCVMEFRKYASEG-----VSPEFERKALF-------------------- 251
Query: 284 IVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
+P RSLL+L GE+RY W H +
Sbjct: 252 --------------LPQRSLLILSGESRYGWHHYI 272
>gi|302797440|ref|XP_002980481.1| hypothetical protein SELMODRAFT_112315 [Selaginella moellendorffii]
gi|300152097|gb|EFJ18741.1| hypothetical protein SELMODRAFT_112315 [Selaginella moellendorffii]
Length = 315
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 86/215 (40%), Gaps = 50/215 (23%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
S + I G+ + +F+S E E +++++D PW ++ + RR Q++G + + + +
Sbjct: 101 SAQELGIPGLSLLTEFISSREEERLLQEVDARPW-QALAKRRVQHYGYEFLYNARNVDTS 159
Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS------ENGSSIDNHIDDCWIW 223
F G FP F+Q + ++ Q + T ++ G + HID +
Sbjct: 160 KFLGEFP---DFLQPLLEKISSIAELQETSEATFPFDQLTVNEYPRGVGLSPHIDTHSAF 216
Query: 224 GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDS 283
I++++L G V+ Y + SP F + +
Sbjct: 217 QGSIISLSLAGPCVMEFRKYASEG-----VSPEFERKALF-------------------- 251
Query: 284 IVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
+P RSLL+L GE+RY W H +
Sbjct: 252 --------------LPQRSLLILSGESRYGWHHYI 272
>gi|224074611|ref|XP_002304398.1| predicted protein [Populus trichocarpa]
gi|222841830|gb|EEE79377.1| predicted protein [Populus trichocarpa]
Length = 344
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 99/234 (42%), Gaps = 41/234 (17%)
Query: 116 DIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQ----KIKLGD 171
+I G+Y+ DFLS E ++++ +D+ PW S RR Q++G + + + K+ LG
Sbjct: 119 NIPGLYLVHDFLSPQEEKELLAAVDERPW-IGLSKRRVQHYGYEFCYGTRNVDTKLHLGK 177
Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTV 230
P+ + +R L+ +I L N G + HID + I ++
Sbjct: 178 L---PSFVSSVLERISLFPGLNSSTSILLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSL 234
Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS 290
+L G ++ Y S PD S+ + N + ++V +L
Sbjct: 235 SLAGPCIMEFRRYLDGS--------------WVPDAASSAYTKVE-NVDNCSNLVRRSLY 279
Query: 291 SVVIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYRE 334
+P RS+L+L GEARY WQH + + VI RRV +R+
Sbjct: 280 -------LPPRSILLLSGEARYAWQHYIPHHKIDMVNQSVIRRGARRVSFTFRK 326
>gi|290982964|ref|XP_002674199.1| hypothetical protein NAEGRDRAFT_58773 [Naegleria gruberi]
gi|284087788|gb|EFC41455.1| hypothetical protein NAEGRDRAFT_58773 [Naegleria gruberi]
Length = 259
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 117/285 (41%), Gaps = 56/285 (19%)
Query: 76 DFL--TSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETE 133
DFL + + K++ + N G+ + S+ S +IGG+YI E+ + AE
Sbjct: 7 DFLQTSDKKKQSTLKKIKSNSEVSGAPVKQISSYKTDS----EIGGLYIIENIIDVAEER 62
Query: 134 DVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKF------IQDRFH 187
++K +D W+ S RR Q++G N+ + +K + P ++F I+++
Sbjct: 63 KLVKFIDSQKWNDEIS-RRTQHYGVSYNYGARGVKEA-LKVPPVPSEFSDLLEEIKNKEG 120
Query: 188 QEDVLSHFQTIEQCTLEYNSENGSS--IDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
+ + + + I+ + N G+ I H+D C +G I+ ++L + V+
Sbjct: 121 LDSIRNLMEGIDFKQVIINEYKGAKQGISKHVDHCQDFGPLILILSLGDECVMKF----- 175
Query: 246 DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLV 305
K V + L +K E S MP RSL++
Sbjct: 176 --------------------HKLEQVKEEDLKKKKVKR--TEVSPSECYDRRMPRRSLII 213
Query: 306 LWGEARYEWQHCVLRSDVIG-------------RRVCIAYREFTA 337
L G+ARY++QH + ++ V RRV I YR T
Sbjct: 214 LSGDARYQYQHEIPKTMVFKIDGKQFLKRSESYRRVSITYRSLTT 258
>gi|242077058|ref|XP_002448465.1| hypothetical protein SORBIDRAFT_06g027580 [Sorghum bicolor]
gi|241939648|gb|EES12793.1| hypothetical protein SORBIDRAFT_06g027580 [Sorghum bicolor]
Length = 349
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/242 (19%), Positives = 99/242 (40%), Gaps = 49/242 (20%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
+ I G+Y+ ++F++ AE ++++ +D W K + RR Q++G + ++ + +
Sbjct: 118 AASELGIPGIYMVQEFVTAAEEQELLSAVDSKTW-KRLAKRRVQHYGYEFLYETRNVDSK 176
Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSEN-------GSSIDNHIDDCWI 222
F G P + ++ ++ F ++ CT + G + HID
Sbjct: 177 QFLGELPTFVSTVLEK------IASFPGVKDCTTSLVDQLTVNEYPCGVGLSPHIDTHSA 230
Query: 223 WGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD 282
+ E I +++L G ++ Y S + ++VN + +
Sbjct: 231 FEEMIFSLSLAGPCIMEFRKYTKGSWR-----------------APSVVNGVDEDSSQEP 273
Query: 283 SIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVL--RSDVIG--------RRVCIAY 332
+ + + +P RS+L++ GE RY W H + + D +G RRV +
Sbjct: 274 GCIRKAIF-------LPPRSMLLMSGEGRYAWHHYIPHHKIDAVGGQVIKRNSRRVSFTF 326
Query: 333 RE 334
R+
Sbjct: 327 RK 328
>gi|297846510|ref|XP_002891136.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297336978|gb|EFH67395.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 344
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 91/207 (43%), Gaps = 36/207 (17%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
+I G+++ DF+S E + ++ +D PW + RR Q++G + + + + +LG
Sbjct: 121 LNIPGLFLLPDFVSVEEEQQLLAAVDARPW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 179
Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVT 229
+ F + F D+ S ++Q T+ EY S G + HID + + I +
Sbjct: 180 ELPSFVSPILERMSLFPNFDINSASLDLDQLTVNEYPS--GVGLSPHIDTHSAFEDCIFS 237
Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
++L G PC ++Y ST + EKS DS
Sbjct: 238 LSLAG--------------------PCIMEFRRY--SVSTWKASSADAEKSGDS------ 269
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQH 316
S + + +P RS+L+L GEARY W H
Sbjct: 270 SCIRKALYLPPRSMLLLSGEARYAWNH 296
>gi|226498774|ref|NP_001148410.1| nucleic acid binding protein [Zea mays]
gi|195619080|gb|ACG31370.1| nucleic acid binding protein [Zea mays]
Length = 349
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 54/277 (19%), Positives = 108/277 (38%), Gaps = 54/277 (19%)
Query: 76 DFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDV 135
D L + P KA G + H+ I G+Y+ ++F++ AE +++
Sbjct: 88 DLLAGRVLHIRYSVPVKPKARPGGSVPV-----AHAASELGIPGIYMVQEFVTAAEEQEL 142
Query: 136 MKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSH 194
+ +D W K + RR Q++G + ++ + + F G P + ++ ++
Sbjct: 143 LSAVDSKTW-KRLAKRRVQHYGYEFLYETRNVDSKQFLGELPTFVSTVLEK------IAS 195
Query: 195 FQTIEQCTLEYNSEN-------GSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
F ++ C + G + HID + E I +++L G ++ Y
Sbjct: 196 FPGVKDCATRLVDQLTVNEYPCGVGLSPHIDTHSAFEEMIFSLSLAGTCIMEFRKY---- 251
Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
K +P ++V+ + + + + I +P RS+L++
Sbjct: 252 TKGTWRAP-------------SVVDGVDEDSSQEPECIRKA-------IFLPPRSMLLMS 291
Query: 308 GEARYEWQHCVL--RSDVIG--------RRVCIAYRE 334
GE RY W H + + D +G RRV +R+
Sbjct: 292 GEGRYAWHHYIPHHKIDAVGGQVIKRNSRRVSFTFRK 328
>gi|171676605|ref|XP_001903255.1| hypothetical protein [Podospora anserina S mat+]
gi|170936369|emb|CAP61027.1| unnamed protein product [Podospora anserina S mat+]
Length = 298
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 94/206 (45%), Gaps = 17/206 (8%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDL-PWDKSQSGRR-KQNFGPKCNFKKQKIKLGDFQGFP 176
G+ + +F+S AE ++++ + P + S RR Q+FG ++ I P
Sbjct: 39 GLTLVHEFISPAEEQEMISAFHAISPLSPADSKRRISQHFGHHFDYTTFGIDESKHSPVP 98
Query: 177 ACTKFIQDRFHQE-DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGD 235
A DR + D + +Q T++Y G+ I H+D ++GE + +++ G
Sbjct: 99 AYITNFLDRLPVDTDGKEAGRKPDQFTVQYYPP-GAGIPPHVDTHSMFGEALYSLSF-GS 156
Query: 236 SVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVI- 294
V + +G++ L P S Q+ +D ++N K I+++ VV
Sbjct: 157 GVPMIFRMSGENEARKLRLPKRSLQES---------SDGNVNGKVGGEILDKAEGVVVHP 207
Query: 295 --RIPMPARSLLVLWGEARYEWQHCV 318
+ +PARSLLV+ G +RY + H +
Sbjct: 208 AWELMLPARSLLVMRGASRYGYTHGI 233
>gi|414585597|tpg|DAA36168.1| TPA: nucleic acid binding protein [Zea mays]
Length = 536
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 54/277 (19%), Positives = 108/277 (38%), Gaps = 54/277 (19%)
Query: 76 DFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDV 135
D L + P KA G + H+ I G+Y+ ++F++ AE +++
Sbjct: 275 DLLAGRVLHIRYSVPVKPKARPGGSVPV-----AHAASELGIPGIYMVQEFVTAAEEQEL 329
Query: 136 MKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSH 194
+ +D W K + RR Q++G + ++ + + F G P + ++ ++
Sbjct: 330 LSAVDSKTW-KRLAKRRVQHYGYEFLYETRNVDSKQFLGELPTFVSTVLEK------IAS 382
Query: 195 FQTIEQCTLEYNSEN-------GSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
F ++ C + G + HID + E I +++L G ++ Y
Sbjct: 383 FPGVKDCATRLVDQLTVNEYPCGVGLSPHIDTHSAFEEMIFSLSLAGTCIMEFRKY---- 438
Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
K +P ++V+ + + + + I +P RS+L++
Sbjct: 439 TKGTWRAP-------------SVVDGVDEDSSQEPECIRKA-------IFLPPRSMLLMS 478
Query: 308 GEARYEWQHCVL--RSDVIG--------RRVCIAYRE 334
GE RY W H + + D +G RRV +R+
Sbjct: 479 GEGRYAWHHYIPHHKIDAVGGQVIKRNSRRVSFTFRK 515
>gi|302847355|ref|XP_002955212.1| hypothetical protein VOLCADRAFT_119009 [Volvox carteri f.
nagariensis]
gi|300259504|gb|EFJ43731.1| hypothetical protein VOLCADRAFT_119009 [Volvox carteri f.
nagariensis]
Length = 753
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/250 (21%), Positives = 102/250 (40%), Gaps = 45/250 (18%)
Query: 100 DLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDL-------PW------DK 146
++ A SHP G+ Y+ +F++ AE +++ DD PW +
Sbjct: 523 EMRAEVSHPALEGQ-------YLVLEFVTPAEEAELLAMCDDPVLKPSWSPWIGQMYGNA 575
Query: 147 SQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH-QEDVLSHFQTIEQCTLEY 205
+ R + +G ++ ++ + + P + + +R Q +L FQ E ++Y
Sbjct: 576 TAQKTRGKRWGVLPDYHRRGVAPVEHP-LPPLLRILTERMRVQVGLLRCFQPNEANAIDY 634
Query: 206 NSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPD 265
GS + H+DD + G+ IV ++L G +V+T + + +K P ++
Sbjct: 635 WRSRGSWLRPHVDDRILSGDLIVNLSLGGAAVMT---FARERDKDEGGHPGLQQHQQQQQ 691
Query: 266 QKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVI- 324
D +R+ + RSL +L ARY + H + SD++
Sbjct: 692 HPRQSAGD-------------------EVRVRLAPRSLQILSRAARYSYTHAIAASDLLD 732
Query: 325 GRRVCIAYRE 334
RRV I +R
Sbjct: 733 ARRVSITFRR 742
>gi|445456942|ref|ZP_21446201.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
baumannii OIFC047]
gi|444777446|gb|ELX01476.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
baumannii OIFC047]
Length = 200
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 98/246 (39%), Gaps = 73/246 (29%)
Query: 108 PNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI 167
P + K DI GV DF+S E +++K +D PW + GRR Q++G + N+KK
Sbjct: 10 PKPTIKVPDILGVEYLSDFVSVVEETEILKNIDSQPW-ITDLGRRVQHYGYRYNYKKA-- 66
Query: 168 KLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS--------ENGSSIDNHIDD 219
KL P ++ Q+D++ ++C+L+ E G I +HID
Sbjct: 67 KLDRHITLPPIPSWLIR--MQKDLM------DECSLDLPPNQLIINEYEPGQGITDHIDA 118
Query: 220 CWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEK 279
+GE I+ V+L V+ T S + N F Q
Sbjct: 119 PDEFGETIIMVSLGSSCVMDFT-----STESNQKEAIFLEQ------------------- 154
Query: 280 STDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVL--RSDV-------IGRRVCI 330
RSLL++ +ARY+W+H + ++D+ RRV +
Sbjct: 155 ---------------------RSLLMIRNDARYKWKHGIAKRKTDIWEGIQYPRSRRVSL 193
Query: 331 AYREFT 336
+R +
Sbjct: 194 TFRHVS 199
>gi|440789981|gb|ELR11271.1| DNA repair protein alkB, putative [Acanthamoeba castellanii str.
Neff]
Length = 253
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 88/219 (40%), Gaps = 60/219 (27%)
Query: 124 EDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQ 183
EDF++ E +++ +D W + RR Q++G + ++ +Q I P + I
Sbjct: 77 EDFITADEERALVQAIDAQEWSEKLH-RRTQHYGYEFDYSRQDINTSVPIELPVFAQQII 135
Query: 184 DRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYY 243
++ Q + Q I EY G I+ HID +G +V+++L+ V+T T
Sbjct: 136 EKMRQRGLPQFDQLIIN---EYTP--GQGINPHIDKTHCFGPCVVSLSLLSTCVMTFT-- 188
Query: 244 NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSL 303
L EK VV+R RSL
Sbjct: 189 -----------------------------SLETGEK----------IPVVLR----PRSL 205
Query: 304 LVLWGEARYEWQHCV--LRSDVI-------GRRVCIAYR 333
+VL G+ARY WQH + R+D++ RRV + YR
Sbjct: 206 VVLRGQARYGWQHGIEPKRADIVAGKHTPRARRVSLTYR 244
>gi|281212158|gb|EFA86318.1| 2-oxoglutarate and Fe(II)-dependent oxygenase family protein
[Polysphondylium pallidum PN500]
Length = 168
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 49/228 (21%), Positives = 86/228 (37%), Gaps = 71/228 (31%)
Query: 116 DIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGF 175
D+ G+ + E+F++E E +++ +D W S+ RR Q +G ++ + + + +G
Sbjct: 6 DVDGLRLIENFITEQEENSLIENIDRYDW-SSEIARRTQQYGYHYCYRLRGVDELNDEGA 64
Query: 176 PACT------KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVT 229
P F+ DR +
Sbjct: 65 PTTPPIPEFLNFLIDRLAK----------------------------------------- 83
Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
+ D V+ T+ N+YN N + D +ST+
Sbjct: 84 ---LSDPVIPFTFDQVIINEYNKNQKIQPHIDSVTDWESTV------------------- 121
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337
V I P+P RSLLVL +ARY W+H + +S GRRV + +R++ +
Sbjct: 122 DEVEIETPLPRRSLLVLQDDARYVWKHGI-KSQNNGRRVSLTFRKYKS 168
>gi|42571709|ref|NP_973945.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|332193254|gb|AEE31375.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
Length = 344
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 93/212 (43%), Gaps = 42/212 (19%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
+I G+++ DF++ AE + ++ +D W + RR Q++G + + + + +LG
Sbjct: 121 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 179
Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
+ F I +R F D S ++Q T+ EY S G + HID + +
Sbjct: 180 ELPSF---VSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHIDTHSAFEDC 234
Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
I +++L G PC ++Y ST + EKS DS
Sbjct: 235 IFSLSLAG--------------------PCIMEFRRY--SVSTWKASTTDAEKSGDS--- 269
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
S + + +P RS+L+L GEARY W H +
Sbjct: 270 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 298
>gi|12597837|gb|AAG60147.1|AC074360_12 hypothetical protein [Arabidopsis thaliana]
Length = 410
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 94/212 (44%), Gaps = 42/212 (19%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
+I G+++ DF++ AE + ++ +D W + RR Q++G + + + + +LG
Sbjct: 196 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 254
Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
+ P+ I +R F D S ++Q T+ EY S G + HID + +
Sbjct: 255 EL---PSFVSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHIDTHSAFEDC 309
Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
I +++L G PC ++Y ST + EKS DS
Sbjct: 310 IFSLSLAG--------------------PCIMEFRRY--SVSTWKASTTDAEKSGDS--- 344
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
S + + +P RS+L+L GEARY W H +
Sbjct: 345 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 373
>gi|256069452|ref|XP_002571149.1| oxygenase-related [Schistosoma mansoni]
gi|238652196|emb|CAZ38834.1| oxygenase-related [Schistosoma mansoni]
Length = 75
Score = 48.9 bits (115), Expect = 0.004, Method: Composition-based stats.
Identities = 18/46 (39%), Positives = 32/46 (69%)
Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
+R+ + RSL+V+ G ARY WQH + R+D+ RR+ + +RE ++ +
Sbjct: 3 VRVLLDRRSLVVVSGPARYIWQHEIRRTDITTRRIAMTFRELSSTF 48
>gi|62319760|dbj|BAD93744.1| hypothetical protein [Arabidopsis thaliana]
Length = 431
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 57/212 (26%), Positives = 94/212 (44%), Gaps = 42/212 (19%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
+I G+++ DF++ AE + ++ +D W + RR Q++G + + + + +LG
Sbjct: 208 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 266
Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
+ P+ I +R F D S ++Q T+ EY S G + HID
Sbjct: 267 EL---PSFVSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHID-------- 313
Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
T + D V +L+ L PC ++Y ST + EKS DS
Sbjct: 314 --THSAFEDCVFSLS----------LAGPCIMEFRRY--SVSTWKASTTDAEKSGDS--- 356
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
S + + +P RS+L+L GEARY W H +
Sbjct: 357 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 385
>gi|324504403|gb|ADY41902.1| Alkylated DNA repair protein alkB 8 [Ascaris suum]
Length = 578
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/202 (20%), Positives = 83/202 (41%), Gaps = 48/202 (23%)
Query: 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQ-NFGPKCNFKKQKIKLGDFQGFPAC 178
+ + ++F++E E + +M L + + K++ RK +FG K N+ + + P
Sbjct: 150 LRVLKNFINEDEEQQLMASLKECIYAKTKLKSRKVVHFGYKFNYDTNEADVPAENAIPQS 209
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
+ DR + + + + +Q T+ E G+ I +H+D +G+ IV+++LI D V+
Sbjct: 210 CDVVIDRMIEAGIFT--ERPDQLTVNI-YEPGNGIPSHVDSHSPFGDTIVSLSLISDLVM 266
Query: 239 TLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPM 298
+ S+ YN + +
Sbjct: 267 EFRDFANTSSIYN--------------------------------------------VLL 282
Query: 299 PARSLLVLWGEARYEWQHCVLR 320
P SL V+ GE+RY W+H + +
Sbjct: 283 PRYSLAVMQGESRYRWKHGIAK 304
>gi|30692572|ref|NP_174442.2| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|42571711|ref|NP_973946.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|20259468|gb|AAM13854.1| unknown protein [Arabidopsis thaliana]
gi|22136680|gb|AAM91659.1| unknown protein [Arabidopsis thaliana]
gi|332193253|gb|AEE31374.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
gi|332193255|gb|AEE31376.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
Length = 431
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 94/212 (44%), Gaps = 42/212 (19%)
Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
+I G+++ DF++ AE + ++ +D W + RR Q++G + + + + +LG
Sbjct: 208 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 266
Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
+ P+ I +R F D S ++Q T+ EY S G + HID + +
Sbjct: 267 EL---PSFVSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHIDTHSAFEDC 321
Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
I +++L G PC ++Y ST + EKS DS
Sbjct: 322 IFSLSLAG--------------------PCIMEFRRY--SVSTWKASTTDAEKSGDS--- 356
Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
S + + +P RS+L+L GEARY W H +
Sbjct: 357 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 385
>gi|170044145|ref|XP_001849718.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867399|gb|EDS30782.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 153
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 184 DRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDD 219
DRF Q ++++FQTIEQ +LEY E +SID HIDD
Sbjct: 118 DRFEQVPLMANFQTIEQYSLEYTPERRASIDPHIDD 153
>gi|145345998|ref|XP_001417484.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577711|gb|ABO95777.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 347
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/244 (20%), Positives = 93/244 (38%), Gaps = 52/244 (21%)
Query: 99 SDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQ--SGRR 152
SD +A + + G YI E+F++E E ++ LDD PW S +
Sbjct: 131 SDAFAALARGAKLQAKVKLPGHYILENFITEDEERRIVDWLDDDIAAGPWRDSSFNGAHQ 190
Query: 153 KQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQ-EDVLSHFQTIEQCTLEYNSENGS 211
+ +G + N K+ ++ + +F + L HF E + Y + GS
Sbjct: 191 GKKYGVEPNLLKRCVEPARVPMPKILRDLVVAKFAAAHETLKHFTPNECNAINYRKDLGS 250
Query: 212 SIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271
+ H DD + + +V ++L D +T +S++K
Sbjct: 251 VLTPHCDDRQLSSDILVNLSLCSDCTMT-----------------YSHEK---------- 283
Query: 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-RRVCI 330
+S + + +P RSL + G RY++ H + ++ G RRV +
Sbjct: 284 -----------------FASKRVDVRLPRRSLQIQSGSTRYDYMHSIANENLHGNRRVSV 326
Query: 331 AYRE 334
+RE
Sbjct: 327 TFRE 330
>gi|380025248|ref|XP_003696389.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Apis
florea]
Length = 584
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 97/235 (41%), Gaps = 71/235 (30%)
Query: 118 GGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSG----RRKQNFGPKCNFKKQKIKLGD-F 172
G+ + EDF++E E + + L + W+ +S R+ ++FG + + K+ L
Sbjct: 126 SGIKLIEDFITEEEEKML---LSTITWNNEESSDLKHRKVKHFGYEFQYDTNKVDLDKPI 182
Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNL 232
P +F+Q F Q +S+ +Q T+ + G I HID T ++
Sbjct: 183 IPIPKNYQFLQTLFKQYHDVSY--EYDQLTINHYLP-GQGIPPHID----------THSV 229
Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
DS+L+L+ L S C I+N NEK++
Sbjct: 230 FEDSILSLS----------LGSAC-------------IMNFKKENEKAS----------- 255
Query: 293 VIRIPMPARSLLVLWGEARYEWQH--CVLRSDVI-----------GRRVCIAYRE 334
+ +P RSLL++ GEARY W H C +D++ G RV +R+
Sbjct: 256 ---LFLPPRSLLIMSGEARYAWSHGICPRHNDIVQTSNGITTQSRGTRVSFTFRK 307
>gi|115345714|ref|YP_771891.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
gi|115293033|gb|ABI93483.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
Length = 195
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 31/121 (25%), Positives = 62/121 (51%), Gaps = 5/121 (4%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-FPA 177
G+ E+++SE E +++++D W ++ RR Q++G + ++K ++ D+ G P
Sbjct: 16 GLTYLENYISEDEAGRLVQEIDAALW-RTDLKRRVQHYGYRYDYKARQAWREDYLGPLPE 74
Query: 178 CTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
+ + +R E HFQT+ + + G I HID +GE I +++L+ V
Sbjct: 75 LFQSLAERLTAE---GHFQTVPDQVIVNEYQPGQGISAHIDCQPCFGETIASLSLLSACV 131
Query: 238 L 238
+
Sbjct: 132 M 132
>gi|255085018|ref|XP_002504940.1| predicted protein [Micromonas sp. RCC299]
gi|226520209|gb|ACO66198.1| predicted protein [Micromonas sp. RCC299]
Length = 353
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 50/231 (21%), Positives = 94/231 (40%), Gaps = 41/231 (17%)
Query: 117 IGGVYIKEDFLSEAETEDVMKQLD-----DLPWDKSQ-SGR-RKQNFGPKCNFKKQKIKL 169
+ G ++ DF++ E +++ LD PW S +G+ R + +G + + K++ +
Sbjct: 146 LPGHHLIPDFITPDEESRLIEYLDRDESDTNPWRPSNFNGKHRGKKWGVEVDLKRRTVAP 205
Query: 170 GDFQGFPACTKFIQDRF-HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV 228
+ + PA + + DR L F E ++Y+ G+ + H+DD + + IV
Sbjct: 206 -ERRPLPALVRAVADRMPAAHPALRGFVPNEANAIDYDRRGGAELLPHVDDRQMSTDLIV 264
Query: 229 TVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNET 288
++L GD V+T Y D+ + D + + + D
Sbjct: 265 NLSLAGDCVMT---YVEDAGR---------------DGRRGGWEGVPAGARRVDVF---- 302
Query: 289 LSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-RRVCIAYREFTAP 338
+P RSL V G R+ + H + ++ RRV I +R P
Sbjct: 303 ---------LPRRSLQVQSGPCRFNFAHSIRNENLRAPRRVSITFRRSQMP 344
>gi|255086679|ref|XP_002509306.1| predicted protein [Micromonas sp. RCC299]
gi|226524584|gb|ACO70564.1| predicted protein [Micromonas sp. RCC299]
Length = 418
Score = 44.7 bits (104), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 52/228 (22%), Positives = 91/228 (39%), Gaps = 51/228 (22%)
Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLD-DLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
+ + GV + DF++E E +++ +D D W + + RR ++G ++ + +
Sbjct: 192 TLGVPGVTLITDFVTEEEEREMLACVDSDERW-QGLAKRRVLHYGYAFDYGTRDAR-DKT 249
Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTI--EQCTL-EYNSENGSSIDNHIDDCWIWGERIVT 229
PA + R +++ +Q T+ EY + G I H+D +G I++
Sbjct: 250 SPMPAFVAGLLGRAASCGAPGACESVHCDQLTVNEYVA--GVGIAPHVDTHSAFGPTILS 307
Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
++L G +V+ + G +K P ++ I
Sbjct: 308 LSLAGRAVMEFRLHEGG--------------EKEPRERRAIS------------------ 335
Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVL---RSDVIGRRVCIAYRE 334
MP RSLLVL GEARY W H + R ++G C A E
Sbjct: 336 --------MPPRSLLVLHGEARYRWLHYIPHRKRDAIVGEDECEAREE 375
>gi|356531539|ref|XP_003534335.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Glycine max]
Length = 342
Score = 44.3 bits (103), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 94/225 (41%), Gaps = 31/225 (13%)
Query: 97 QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNF 156
Q SDL + + I G+++ DF+S E E++++ +D PW+ S S RR Q++
Sbjct: 102 QASDLVPV----SITASDVSIPGLFLVHDFISAKEEEELLQAVDCRPWN-SLSKRRVQHY 156
Query: 157 GPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSID 214
G + + + + G P+ I DR + + I L N G +
Sbjct: 157 GYEFRYDIRNVNTRHCLGELPSFVSPILDRISSCPTFENVKNIVLDQLTVNEYPPGVGLS 216
Query: 215 NHIDDCWIWGERIVTVNLIGDSVLTLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVND 273
HID + + I +++L G ++ Y NGD P S+ V
Sbjct: 217 PHIDTHSAFEDLIFSLSLSGPCIMEFRRYENGD---------------WLPRVASSAVAK 261
Query: 274 LSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
+ E ++ I I +P RSLL+L GEARY W H +
Sbjct: 262 IENPEDQSNFIRRA--------IYLPPRSLLLLSGEARYAWHHYI 298
>gi|256822586|ref|YP_003146549.1| type 12 methyltransferase [Kangiella koreensis DSM 16069]
gi|256796125|gb|ACV26781.1| Methyltransferase type 12 [Kangiella koreensis DSM 16069]
Length = 260
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 73/160 (45%), Gaps = 19/160 (11%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
GV + EDF+ +A+ + + +++ KSQ+ +QN +F KQ +L D FPAC
Sbjct: 81 GVDVVEDFIQQAKIDAISQRV------KSQAEFVRQNM---ADFLKQNTELFDVIVFPAC 131
Query: 179 TKFIQDRFHQEDVLSH------FQTIEQ--CTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
T + DR D+LS F +E + +N +G + ++DD + T
Sbjct: 132 TDVLGDRVESLDILSKHINDDGFILVESDYKSHSFNIGSGVELTKNMDDVIAESKLQSTR 191
Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTI 270
+I + + + Y D+ + L + + K P+ K +
Sbjct: 192 KIIWEQKILIDIYRRDTQR--LTTRVNELKVKQPEIKREL 229
>gi|350855225|emb|CCD58057.1| hypothetical protein Smp_197790 [Schistosoma mansoni]
Length = 44
Score = 42.4 bits (98), Expect = 0.33, Method: Composition-based stats.
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 155 NFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH 187
++GPK NFK+QK+ +G F G PA ++F+ R++
Sbjct: 1 DYGPKVNFKRQKVHIGGFYGLPAYSRFLITRYN 33
>gi|307180204|gb|EFN68237.1| Alkylated DNA repair protein alkB-like protein 8 [Camponotus
floridanus]
Length = 604
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 91/216 (42%), Gaps = 64/216 (29%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQS-------GRRKQNFGPKCNFKKQKIKLGD 171
G+ + E+F+++ + E +++ L+ WD+ +S R+ ++FG + + + D
Sbjct: 140 GLTLIENFITKKQEETLLRTLN---WDECESVTSLQLKHRQVKHFGYEFEYGTNIVNPDD 196
Query: 172 -FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
P +F+Q F++ + H +Q T+ G I HID T
Sbjct: 197 PIAPIPQDYEFLQTLFNKHN---HKYKYDQLTIN-KYLPGQGIPPHID----------TH 242
Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS 290
++ D++L+L+ L S C +N K + +N
Sbjct: 243 SVFEDTILSLS----------LGSACI------------------MNFKKENQNIN---- 270
Query: 291 SVVIRIPMPARSLLVLWGEARYEWQH--CVLRSDVI 324
+ +PARSLL++ GEARY W H C +DVI
Sbjct: 271 -----VFLPARSLLIMTGEARYAWTHGICSRHNDVI 301
>gi|326435968|gb|EGD81538.1| hypothetical protein PTSG_11856 [Salpingoeca sp. ATCC 50818]
Length = 656
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV 318
+ + +PARSLLV+ GEARY W HC+
Sbjct: 597 AVELHLPARSLLVVRGEARYRWTHCI 622
>gi|312081720|ref|XP_003143146.1| hypothetical protein LOAG_07565 [Loa loa]
Length = 129
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 26/38 (68%)
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQ 154
I G+++ DFL+EAE D++ +D W SQSGRRKQ
Sbjct: 92 IPGLHVITDFLTEAEEMDLINVIDRTDWVLSQSGRRKQ 129
>gi|67609385|ref|XP_666959.1| 2OG-Fe oxygenase superfamily (67.4 kD) (3E801) [Cryptosporidium
hominis TU502]
gi|54658038|gb|EAL36728.1| 2OG-Fe oxygenase superfamily (67.4 kD) (3E801) [Cryptosporidium
hominis]
Length = 350
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 54/233 (23%), Positives = 96/233 (41%), Gaps = 40/233 (17%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ + EDF+++ E +++ +D+ +++ R+ Q++G ++ + I + P
Sbjct: 123 GLVLIEDFINKLEAIELLNWIDNNGQWETKLNRKVQHYGYSFDYNNKTISSVWERDIPPI 182
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
+ +R +++ + +Q T+ EY E G I HID GE I ++L G +
Sbjct: 183 LNRLIERMLSLKIIT--EVPDQITINEY--EVGKGIGPHIDSHHTIGENISVISL-GSGI 237
Query: 238 LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
L +N S + NL D S E S N V
Sbjct: 238 LF--EFNELSKRKNL--------------------DFSSKEGSGSRKYNRISKRTVY--- 272
Query: 298 MPARSLLVLWGEARYEWQHCV--LRSDVI-------GRRVCIAYREFTAPYMN 341
+P SL ++ E RY W+H + + D I RRV I R++ + N
Sbjct: 273 IPENSLYIMKNEIRYAWEHGIKSRKYDKIQGKFQQRKRRVSITIRKYIESHYN 325
>gi|330819033|ref|XP_003291570.1| hypothetical protein DICPUDRAFT_99071 [Dictyostelium purpureum]
gi|325078238|gb|EGC31900.1| hypothetical protein DICPUDRAFT_99071 [Dictyostelium purpureum]
Length = 256
Score = 41.2 bits (95), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 96/224 (42%), Gaps = 32/224 (14%)
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ-GF 175
I G+ I E+ ++ + + +++ WD RR Q++G K N+K +++K D F
Sbjct: 36 IKGLKIIENIFNDEYHDKLWAEVNKCEWDDKSLLRRTQHYGYKYNYKSRQLKDEDIAPPF 95
Query: 176 PACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIG 234
P + ++ +++ T +Q + EY G I H+ D I+G I ++
Sbjct: 96 PQWANDLIMLLLEKQIIN--DTPKQLIVNEYKDNQG--ISAHV-DSKIFGNLIFSI---- 146
Query: 235 DSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLN--EKSTDSIVNETLSSV 292
+L S C KK + KS +D + N +KS + N T + V
Sbjct: 147 ----------------SLGSKCKMVFKKKIENKSNNTDDNTSNSKDKSKATSTNNTKNKV 190
Query: 293 ---VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYR 333
I + S+++L E+RY W H + + R+ + +R
Sbjct: 191 EYEKIEKELLPNSIVLLQDESRYLWTHEIPKLKKGDHRISLTFR 234
>gi|322799222|gb|EFZ20637.1| hypothetical protein SINV_06026 [Solenopsis invicta]
Length = 597
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 53/236 (22%), Positives = 92/236 (38%), Gaps = 74/236 (31%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQS------GRRKQNFGPKCNFKKQKIKLGDF 172
G+ + E+F+++ + E ++ L+ WD+ +S R+ ++FG + + +
Sbjct: 137 GLMLIENFITQEQEETLLGTLN---WDECESVSSQLKHRQVKHFGYEFEYGTNMVDPDKP 193
Query: 173 QGFPACT-KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
KF+Q F D H T +Q T+ +G I +HID ++ + I++++
Sbjct: 194 IPPIPEDYKFLQTLF---DKHGHKYTYDQLTIN-KYLSGQGIPSHIDTHSVFEDTILSLS 249
Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
L V+ K DQK
Sbjct: 250 LGSAYVMNF---------------------KKEDQK------------------------ 264
Query: 292 VVIRIPMPARSLLVLWGEARYEWQH--CVLRSDVI-----------GRRVCIAYRE 334
I + +PARSLL++ GEARY W H C SD++ G RV +R+
Sbjct: 265 --IDVLLPARSLLIMTGEARYAWTHGICPRHSDIVKTENGTTTRERGTRVSFTFRK 318
>gi|428311031|ref|YP_007122008.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
gi|428252643|gb|AFZ18602.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
Length = 206
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 17/156 (10%)
Query: 196 QTIEQCTLEYNSENGSSI-DNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
+ I++ +Y+ ++GS + H+ + W + V V L D + N+Y
Sbjct: 55 RRIQEYGYKYDYKDGSFVASTHLGNLPDWAQN-VAVRLAEDGFMVNVPEQVIVNEYQPGQ 113
Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV--IRIPMPARSLLVLWGEARY 312
S+ P +TI+ LSL ++ ++N T S + I + A SLL+ GEARY
Sbjct: 114 GIVSHTDCIPCFGNTIIT-LSL---GSECVMNFTHSQTQKEVGILLQAGSLLIFKGEARY 169
Query: 313 EWQHCVL---------RSDVIGRRVCIAYREFTAPY 339
W+H ++ R + RR+ + +RE PY
Sbjct: 170 IWKHGIVPRKRDNYKGRIFMRTRRISMTFREVLFPY 205
>gi|170079593|ref|YP_001736226.1| 2OG-Fe(II) oxygenase family oxidoreductase [Synechococcus sp. PCC
7002]
gi|169887262|gb|ACB00971.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Synechococcus sp. PCC
7002]
Length = 204
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 64/126 (50%), Gaps = 30/126 (23%)
Query: 225 ERIVTVNLIG---DSVLTLTYYNGDSNKYNLN-SPCFSYQKKYPDQKSTIVNDLSLNEK- 279
E++V++NLI D + Y G +++ PCF+ TI++ LSLN
Sbjct: 90 EQMVSLNLISELPDQAIVNEYLPGQGITSHVDCKPCFT---------DTIIS-LSLNAPC 139
Query: 280 --STDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV--LRSD-------VIGRRV 328
+ DSIVN S +++ RSL++L GE+RY W+H + +SD + RR+
Sbjct: 140 IMNFDSIVNNERQSKLLK----PRSLVILQGESRYLWKHGIPPRKSDQWNGQKIMRDRRI 195
Query: 329 CIAYRE 334
I +R+
Sbjct: 196 SITFRK 201
>gi|255084479|ref|XP_002508814.1| chromodomain-containing protein [Micromonas sp. RCC299]
gi|226524091|gb|ACO70072.1| chromodomain-containing protein [Micromonas sp. RCC299]
Length = 742
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 57/131 (43%), Gaps = 15/131 (11%)
Query: 215 NHIDDCWIWGERIVTVNLIGDSVLTLTYY----NGDSNKYNLNSPCFSYQKKYPDQKSTI 270
+H+D WGE +V V+L D VL T G N+ N+ + YP +
Sbjct: 441 SHLDSSERWGEVVVGVSLGADGVLFFTPNCRSDRGIGNRIR-NAALENDFADYPTESGVT 499
Query: 271 VNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCI 330
+D N + S VN+ S + + +P RS+ V+ G AR + +H + R
Sbjct: 500 ESDSWRNGRG--SRVNKGQKSWALELEIPRRSVYVMSGAARTDLEHGI--------RAVK 549
Query: 331 AYREFTAPYMN 341
A + +AP N
Sbjct: 550 APSQESAPAWN 560
>gi|332027277|gb|EGI67361.1| Alkylated DNA repair protein alkB-like protein 8 [Acromyrmex
echinatior]
Length = 583
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 23/33 (69%), Gaps = 2/33 (6%)
Query: 294 IRIPMPARSLLVLWGEARYEWQH--CVLRSDVI 324
I + +PARSLL++ GEARY W H C SDVI
Sbjct: 283 IDVLLPARSLLIMTGEARYAWTHGICPRYSDVI 315
>gi|116197553|ref|XP_001224588.1| hypothetical protein CHGG_06932 [Chaetomium globosum CBS 148.51]
gi|88178211|gb|EAQ85679.1| hypothetical protein CHGG_06932 [Chaetomium globosum CBS 148.51]
Length = 307
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/144 (26%), Positives = 66/144 (45%), Gaps = 19/144 (13%)
Query: 199 EQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFS 258
+Q T++Y GS I H+D ++GE + +++ G +V +G ++ + P S
Sbjct: 107 DQFTVQYYPP-GSGIPPHVDTHSMFGEALYSLSF-GSAVPMQFRMSGANDARKMRLPKRS 164
Query: 259 YQKKYPDQKSTIVNDLSLNEKSTDSIVN--------ETLSS-------VVIRIPMPARSL 303
Q++ Q+ + + N S S +TL I +PARSL
Sbjct: 165 VQQQQKQQQQQQQQNTTTNPPSPPSATTTSQEPPQPQTLPPSDPSDHHPTWEILLPARSL 224
Query: 304 LVLWGEARYEWQHCV--LRSDVIG 325
L++ G ARY + H + ++DVIG
Sbjct: 225 LLMTGAARYGYTHAIRPRKTDVIG 248
>gi|164423854|ref|XP_962305.2| hypothetical protein NCU07663 [Neurospora crassa OR74A]
gi|157070260|gb|EAA33069.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 510
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 8/67 (11%)
Query: 277 NEKSTDSIVN-ETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-------RRV 328
N ST S N ++L+ I I +P SLLV+ + + EW+HCV + I RR+
Sbjct: 330 NSSSTTSNANLDSLNEGQISIHLPHNSLLVMHADMQEEWKHCVTPAQAIDPHPIAGKRRI 389
Query: 329 CIAYREF 335
+ YR++
Sbjct: 390 NVTYRDY 396
>gi|357484481|ref|XP_003612528.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|355513863|gb|AES95486.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 344
Score = 39.7 bits (91), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 89/212 (41%), Gaps = 31/212 (14%)
Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
+ +I G+Y+ DF+S E E++++ +D PW+ S + RR Q++G + + + +
Sbjct: 112 TASDLNIPGLYLLHDFVSATEEEELLQSVDSRPWN-SLAKRRVQHYGYEFCYDIRNVNTK 170
Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQ----TIEQCTL-EYNSENGSSIDNHIDDCWIWG 224
G P+ I +R + ++Q T+ EY G + HID +
Sbjct: 171 RCLGELPSFLSPILERISSCPTFKNADPDRIVLDQLTVNEYPP--GVGLSPHIDTHSAFE 228
Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSI 284
+ I +++L G ++ Y GD + P S+I + E ++ I
Sbjct: 229 DLIFSLSLAGPCIMEFRRY-GDGDWR-------------PRVASSIDTKVDCPEDGSNCI 274
Query: 285 VNETLSSVVIRIPMPARSLLVLWGEARYEWQH 316
RSLL+L GEARY W H
Sbjct: 275 KRAIYLP--------PRSLLLLSGEARYAWHH 298
>gi|47222373|emb|CAG05122.1| unnamed protein product [Tetraodon nigroviridis]
Length = 491
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 36/69 (52%), Gaps = 6/69 (8%)
Query: 34 EETYIYCPHCNKAW-QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHC 92
+E C +C K++ S LY +Q HSGKS GG+ + L S +E CPHC
Sbjct: 127 DERQYRCSYCGKSYSHASSLYRHQQ--MHSGKS---GGLAQSKQVLESTPQEMRYTCPHC 181
Query: 93 NKAWQGSDL 101
+++GS +
Sbjct: 182 GMSFKGSRM 190
>gi|303274827|ref|XP_003056728.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461080|gb|EEH58373.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 372
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Query: 211 SSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTI 270
+S +H D WGE +V V+L SV+ Y+ D + + + + + Q ++
Sbjct: 101 ASFKDHFDGRERWGETVVGVSLGQSSVI---YFCPDCRTQSGKALSDAVRSRRNTQYTS- 156
Query: 271 VNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
S T+ ++ ET S I I +P RS+ ++ ARYE +H +
Sbjct: 157 ----SDGPVKTELVIKETNVSFAIEIELPRRSIYIMSENARYELKHGI 200
>gi|440793749|gb|ELR14924.1| hypothetical protein ACA1_051460 [Acanthamoeba castellanii str.
Neff]
Length = 70
Score = 38.9 bits (89), Expect = 3.8, Method: Composition-based stats.
Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 295 RIPMPARSLLVLWGEARYEWQHCVLR--SDVIGRRV 328
++ + RSLLVL G+ARYEW H V R DV+G V
Sbjct: 13 QVYLERRSLLVLEGDARYEWMHAVARRAEDVVGDTV 48
>gi|449469096|ref|XP_004152257.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Cucumis sativus]
gi|449484295|ref|XP_004156842.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Cucumis sativus]
Length = 260
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 55/239 (23%), Positives = 91/239 (38%), Gaps = 52/239 (21%)
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
I G+++ FLS + ++ + + W S + FG N ++L D
Sbjct: 50 IKGLWLCRFFLSPQQQSSLLSAIRNEGWFMEASQNQAMRFG---NLPTWAVELSD----- 101
Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGE-----RIVTVN 231
+ + + H D TL + NG ID + +W E I V
Sbjct: 102 SVYEAVLSSNHMTD-----------TLIVDRYNGDKIDCPLPSDILWREPLFDQMIANVY 150
Query: 232 LIGDSV---LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD-SIVNE 287
G+ + + L + +L SPC I++ +++ S D SI E
Sbjct: 151 QPGEGICAHVDLMRFEDGIAIVSLESPC-------------IMHFTQVDKTSCDPSIKGE 197
Query: 288 T-LSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSD----------VIGRRVCIAYREF 335
LS+ + + + SL++LWGEARY W+H + R GRR I R+
Sbjct: 198 VNLSTSKVPVYLNPGSLVILWGEARYHWKHEINRKPGFQIWEGQELAQGRRTSITLRKL 256
>gi|118368517|ref|XP_001017465.1| hypothetical protein TTHERM_00483520 [Tetrahymena thermophila]
gi|89299232|gb|EAR97220.1| hypothetical protein TTHERM_00483520 [Tetrahymena thermophila
SB210]
Length = 199
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 65/139 (46%), Gaps = 16/139 (11%)
Query: 111 SGKSFD-IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKL 169
S ++FD + G+ + L+E E + K++ W+ +Q RR Q++G K ++ + I
Sbjct: 11 SAQTFDQVQGLRYIDSILTEEEEVFIFKEIYQNEWN-TQLKRRTQHYGYKYDYSIKSIDK 69
Query: 170 GDFQG-FPA-----CTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIW 223
F G P C + I D+ + V+ I + G I+ HID I+
Sbjct: 70 NMFLGVLPKYAINFCQRLIDDKVIK--VMPDQMIINEYL------PGQGINPHIDKTDIF 121
Query: 224 GERIVTVNLIGDSVLTLTY 242
GE I +V+L ++ LTY
Sbjct: 122 GETIFSVSLGSGCIMKLTY 140
>gi|66362644|ref|XP_628288.1| F27M3_19 plant like RRM plus AlkB domain containing protein
[Cryptosporidium parvum Iowa II]
gi|46229757|gb|EAK90575.1| F27M3_19 plant like RRM plus AlkB domain containing protein
[Cryptosporidium parvum Iowa II]
Length = 350
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 51/233 (21%), Positives = 97/233 (41%), Gaps = 40/233 (17%)
Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
G+ + EDF+++ E +++ +D+ +++ R+ Q++G ++ + I + P
Sbjct: 123 GLVLVEDFINKLEAIELLDWIDNNGQWETKLNRKVQHYGYSFDYNNKTISSVWERDIPPI 182
Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
+ +R +++ + +Q T+ EY E G I HID GE I ++L G +
Sbjct: 183 LNRLIERMLSLKIIT--EVPDQITINEY--EVGKGIGPHIDSHHTIGENISVISL-GSGI 237
Query: 238 LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
L + N ++K PD S + D I T +
Sbjct: 238 L-----------FEFNELS---KRKNPDCSSK----EGSGSRKYDRISKRT-------VY 272
Query: 298 MPARSLLVLWGEARYEWQHCV---LRSDVIG------RRVCIAYREFTAPYMN 341
+P SL ++ E RY W+H + + + G RRV I R++ + N
Sbjct: 273 IPENSLYIMKNEIRYAWEHGIKSRMYDKIQGKFQQRKRRVSITIRKYIESHYN 325
>gi|336470928|gb|EGO59089.1| hypothetical protein NEUTE1DRAFT_145169 [Neurospora tetrasperma
FGSC 2508]
gi|350291999|gb|EGZ73194.1| hypothetical protein NEUTE2DRAFT_165345 [Neurospora tetrasperma
FGSC 2509]
Length = 494
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 7/66 (10%)
Query: 277 NEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-------RRVC 329
N +T + ++L+ I I +P SLLV+ + + EW+HCV + I RR+
Sbjct: 336 NSSTTSNANLDSLNQGQISIHLPHNSLLVMHADMQEEWKHCVTPAQAIDPHPIAGKRRIN 395
Query: 330 IAYREF 335
+ YR++
Sbjct: 396 VTYRDY 401
>gi|402079391|gb|EJT74656.1| GRF zinc finger domain-containing protein [Gaeumannomyces graminis
var. tritici R3-111a-1]
Length = 465
Score = 38.5 bits (88), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 7/53 (13%)
Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVI-------GRRVCIAYREFTAPY 339
I I +P SLLV+ E + EW+HC+ + I RR+ + YR++ A +
Sbjct: 329 ISIHLPHNSLLVMHAEMQEEWKHCIAPAQAIQPHPVAGNRRINVTYRDYKAAF 381
>gi|290993472|ref|XP_002679357.1| predicted protein [Naegleria gruberi]
gi|284092973|gb|EFC46613.1| predicted protein [Naegleria gruberi]
Length = 191
Score = 38.5 bits (88), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 59/140 (42%), Gaps = 18/140 (12%)
Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRK-QNFGPKCNFKKQKIKLGDFQGF 175
+ G+Y+ EDFLS E ++M +++ W K++S R Q +GPK + + I D
Sbjct: 5 VPGLYLIEDFLSSTEATNLMNEINQQTWIKNRSQTRNIQIYGPKHD-QSYTIIPNDITPL 63
Query: 176 PACTKFIQDRFHQ--------------EDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
P K + R + E L + E EY E+ +D H D
Sbjct: 64 PEFLKELSKRILETTRNKLPQIDLKDYEPYLGIDKYTEIFINEYKPED--KLDQHFDHRS 121
Query: 222 IWGERIVTVNLIGDSVLTLT 241
+ E I ++L S LT T
Sbjct: 122 TYKEIIFGLSLECTSTLTFT 141
>gi|357612338|gb|EHJ67931.1| hypothetical protein KGM_21226 [Danaus plexippus]
Length = 585
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 3/71 (4%)
Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
NKY+ S+ K+ TI LSL+ S+ + + S + + +P+RSLLV+
Sbjct: 210 NKYSPGQGIPSHVDKHSPFGDTI---LSLSLNSSVVMDWKHHSKAYVPVVVPSRSLLVMQ 266
Query: 308 GEARYEWQHCV 318
EARY+WQH +
Sbjct: 267 AEARYDWQHGI 277
>gi|355755380|gb|EHH59127.1| Zinc finger protein 358 [Macaca fascicularis]
Length = 451
Score = 37.7 bits (86), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 46/107 (42%), Gaps = 9/107 (8%)
Query: 15 CIKCEQEFGYEN---KNIVEFTEETYIYCPHCNKAW-QGSDLYAYQSHPNHSGKSFDIGG 70
C C + FG + K++ T E CP C KA+ Q S L +Q + D G
Sbjct: 163 CPVCAKGFGQGSALLKHLRTHTGERPYPCPQCGKAFGQSSALLQHQRTHTAARPCPDCGK 222
Query: 71 VYIKEDFLTSQ----TKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113
+ ++ L S T E CPHC+KA+ S H HSG+
Sbjct: 223 AFSQKSNLVSHRRIHTGERPYACPHCSKAFGQSSALLQHLH-VHSGE 268
>gi|417403069|gb|JAA48358.1| Putative c2h2-type zn-finger protein [Desmodus rotundus]
Length = 589
Score = 37.7 bits (86), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 47/111 (42%), Gaps = 15/111 (13%)
Query: 15 CIKCEQEFGYEN---KNIVEFTEETYIYCPHCNKAW-QGSDLYAYQ-----SHP---NHS 62
C C + FG + K++ T E CPHC KA+ Q S L +Q P H
Sbjct: 310 CPVCAKGFGQGSALLKHLRTHTGERPYPCPHCAKAFGQSSALLQHQRTHTAERPYRCPHC 369
Query: 63 GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113
GK+F G + L T E CPHC KA+ S H HSG+
Sbjct: 370 GKAF--GQSSNLQHHLRIHTGERPYPCPHCAKAFGQSSALLQHLH-VHSGE 417
>gi|157123521|ref|XP_001660184.1| hypothetical protein AaeL_AAEL009522 [Aedes aegypti]
gi|108874376|gb|EAT38601.1| AAEL009522-PA [Aedes aegypti]
Length = 354
Score = 37.7 bits (86), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 29/121 (23%), Positives = 50/121 (41%), Gaps = 14/121 (11%)
Query: 12 VRTCIKCEQEFGYE---NKNIVEFTEETYIYCPHCNKAWQGS----DLYAYQSHPN---- 60
V C C ++F Y+ +K+ T+E CP CN ++ L+++ P
Sbjct: 186 VFVCASCPKQFLYKSLLDKHARVHTQEKPFKCPQCNTSFSQKVSRMILFSFHVQPKMFIC 245
Query: 61 -HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGG 119
+ KSFD + + + TKE C C K++ + A NH G+S +
Sbjct: 246 EYCQKSFDRPSTL--QIHIRTHTKERPFACMECTKSFTSNSALAMHIKTNHRGESILLQS 303
Query: 120 V 120
+
Sbjct: 304 I 304
>gi|76621956|ref|XP_582193.2| PREDICTED: zinc finger protein 358 isoform 1 [Bos taurus]
gi|297476686|ref|XP_002688899.1| PREDICTED: zinc finger protein 358 [Bos taurus]
gi|296485863|tpg|DAA27978.1| TPA: zinc finger protein 358 [Bos taurus]
Length = 596
Score = 37.7 bits (86), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 48/111 (43%), Gaps = 15/111 (13%)
Query: 15 CIKCEQEFGYEN---KNIVEFTEETYIYCPHCNKAW-QGSDLYAYQ-----SHP---NHS 62
C C + FG + K++ T E CP C KA+ Q S L +Q P H
Sbjct: 308 CPVCAKGFGQGSALLKHLRTHTGERPYPCPQCGKAFGQSSALLQHQRTHTAERPYRCPHC 367
Query: 63 GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113
GK+F G + L + T E CPHC+KA+ S H HSG+
Sbjct: 368 GKAF--GQSSNLQHHLRTHTGERPYACPHCSKAFGQSSALLQHLH-VHSGE 415
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.430
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,777,312,506
Number of Sequences: 23463169
Number of extensions: 246053373
Number of successful extensions: 668310
Number of sequences better than 100.0: 218
Number of HSP's better than 100.0 without gapping: 142
Number of HSP's successfully gapped in prelim test: 76
Number of HSP's that attempted gapping in prelim test: 667769
Number of HSP's gapped (non-prelim): 456
length of query: 356
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 213
effective length of database: 9,003,962,200
effective search space: 1917843948600
effective search space used: 1917843948600
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)