BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy7738
(307 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|327287260|ref|XP_003228347.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Anolis carolinensis]
Length = 379
Score = 313 bits (802), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 150/238 (63%), Positives = 179/238 (75%), Gaps = 1/238 (0%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + S P D + + E QK+ G+ Q+ LFS EEC +E IDEV S AE+
Sbjct: 42 KRKYHEDSESEPSDYEERQREEEEA-QKVKSGIRQLRLFSPEECAKIEARIDEVVSRAEK 100
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G YK TVDRAPLRNKYFFGEGYTYGSQL R+G G ERLYP GEVD IP W+ LVI KL
Sbjct: 101 GLYKEHTVDRAPLRNKYFFGEGYTYGSQLQRRGPGQERLYPRGEVDTIPEWVHDLVIRKL 160
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 161 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 220
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTL 258
IRVSEPV +LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL +
Sbjct: 221 IRVSEPVFFLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRLEM 278
>gi|118097886|ref|XP_001233463.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Gallus gallus]
Length = 374
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 143/210 (68%), Positives = 169/210 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS EEC +E ID+V S AE+G YK TVDRAPLRNKYFFGEGYTYG
Sbjct: 61 RKVKSGIRQLRLFSAEECAKIEARIDDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 120
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
SQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 121 SQLQRRGPGQERLYPRGEVDAIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 180
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L +AA+
Sbjct: 181 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLSGYAAD 240
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 241 EITHCIRPQDIKERRAVIILRKTRLDAPRL 270
>gi|56118909|ref|NP_001008038.1| RNA demethylase ALKBH5 [Xenopus (Silurana) tropicalis]
gi|82181400|sp|Q66JG8.1|ALKB5_XENTR RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|51704081|gb|AAH80920.1| MGC79570 protein [Xenopus (Silurana) tropicalis]
Length = 358
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 140/212 (66%), Positives = 171/212 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS +EC +E IDEV S AE+G Y+ TVDRAPLRNKYFFGEGYTYG
Sbjct: 49 RKVRSGIRQVRLFSPDECARIEAKIDEVVSRAEKGLYREHTVDRAPLRNKYFFGEGYTYG 108
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL R+G G ERLYP+GEVD IPAW+ LVI +L + V+P ++NSAVINDYQPGGCIV
Sbjct: 109 AQLQRRGPGQERLYPKGEVDEIPAWVNELVIRRLVEHRVIPEGFVNSAVINDYQPGGCIV 168
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPVQRG VT+L +AA+
Sbjct: 169 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVQRGSVTVLSGYAAD 228
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRLTL 258
ITHC+RPQD + RRAV++LR+ APRL +
Sbjct: 229 EITHCIRPQDIKERRAVVILRKTRTEAPRLEM 260
>gi|148231788|ref|NP_001085704.1| RNA demethylase ALKBH5 [Xenopus laevis]
gi|82201093|sp|Q6GPB5.1|ALKB5_XENLA RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|49118231|gb|AAH73226.1| MGC80552 protein [Xenopus laevis]
Length = 360
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 138/212 (65%), Positives = 171/212 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC A+E IDEV S A++G Y+ TVDRAPLRNKYFFGEGYTYG
Sbjct: 51 RKVRSGIRQMRLFSPDECAAIESKIDEVVSRADKGLYQEHTVDRAPLRNKYFFGEGYTYG 110
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL R+G G ERLYP+GEVD IP W+ LVI +L + ++P ++NSAVINDYQPGGCIV
Sbjct: 111 AQLQRRGPGQERLYPKGEVDEIPGWVHELVIRRLVERRIIPEGFVNSAVINDYQPGGCIV 170
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L +AA+
Sbjct: 171 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLSGYAAD 230
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRLTL 258
ITHC+RPQD + RRAV++LR+ APRL +
Sbjct: 231 EITHCIRPQDIKERRAVVILRKTRTEAPRLEM 262
>gi|387015598|gb|AFJ49918.1| alkylated DNA repair protein alkB-like 5 protein [Crotalus
adamanteus]
Length = 389
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 142/210 (67%), Positives = 169/210 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS EEC+ +E ID+V S AE+G YK TVDRAPLRNKYFFGEGYTYG
Sbjct: 75 RKVKSGIRQLRLFSPEECVKIEARIDDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 134
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
SQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 135 SQLQRRGPGQERLYPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 194
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L +AA+
Sbjct: 195 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLSGYAAD 254
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 255 EITHCIRPQDIKERRAVIILRKTRVDAPRL 284
>gi|224070277|ref|XP_002188303.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Taeniopygia guttata]
Length = 383
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 142/210 (67%), Positives = 169/210 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS EEC +E I++V S AE+G YK TVDRAPLRNKYFFGEGYTYG
Sbjct: 70 RKVKSGIRQLRLFSAEECAKIEARIEDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 129
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
SQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 130 SQLQRRGPGQERLYPRGEVDAIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 189
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L +AA+
Sbjct: 190 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLSGYAAD 249
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 250 EITHCIRPQDIKERRAVIILRKTRLDAPRL 279
>gi|348537052|ref|XP_003456009.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Oreochromis niloticus]
Length = 357
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 150/266 (56%), Positives = 191/266 (71%), Gaps = 13/266 (4%)
Query: 9 YKCQVVEQNLKLKNRYSRNQRSSPYDIGQSKHHARELRQ----KILQGVNQMSLFSHEEC 64
YK + V++ K+R R R S D + + +LR+ ++ + Q S+F+ EEC
Sbjct: 23 YKNKCVDETSNGKHR-KRKYRESDDDEYEHSDDSTDLREQEARRVRSSIQQKSIFTPEEC 81
Query: 65 LALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGE 124
+E+ IDEV + AE G Y+ TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLY +GE
Sbjct: 82 ARIEKKIDEVVAHAEAGLYREHTVDRAPLRNKYFFGEGYTYGAQLEKRGPGQERLYRKGE 141
Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
VD IP+W+ LVI +L GV+P ++NSAVINDYQPGGCIVSH+DP HIFDRPI+S+S
Sbjct: 142 VDEIPSWVYELVINRLVTNGVIPEGFVNSAVINDYQPGGCIVSHVDPLHIFDRPIVSVSF 201
Query: 185 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVI 244
FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI
Sbjct: 202 FSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLSGYAADDITHCIRPQDIKERRAVI 261
Query: 245 LLRRVLPHAPRL--------TLSQTP 262
+LR+ P APRL +L+Q P
Sbjct: 262 ILRKTRPDAPRLGSDSSLRASLAQKP 287
>gi|344298054|ref|XP_003420709.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Loxodonta africana]
Length = 394
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 141/210 (67%), Positives = 168/210 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + GV+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHGVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|395514234|ref|XP_003761324.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Sarcophilus harrisii]
Length = 384
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 146/236 (61%), Positives = 177/236 (75%), Gaps = 3/236 (1%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + S D + K E +K+ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 50 KRKYHEDSDSERSDYEEQKE---EEARKVKSGIRQIRLFSQDECAKIEARIDEVVSRAEK 106
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYGSQL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 107 GLYNEHTVDRAPLRNKYFFGEGYTYGSQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKL 166
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 167 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 226
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL+LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 227 IRVSEPVLFLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 282
>gi|126333790|ref|XP_001364359.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Monodelphis domestica]
Length = 384
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 146/236 (61%), Positives = 177/236 (75%), Gaps = 3/236 (1%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + S D + K E +K+ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 50 KRKYHEDSDSDRSDYEEQKE---EEARKVKSGIRQIRLFSQDECAKIEARIDEVVSRAEK 106
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYGSQL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 107 GLYNEHTVDRAPLRNKYFFGEGYTYGSQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKL 166
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 167 VENRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 226
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL+LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 227 IRVSEPVLFLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 282
>gi|73956209|ref|XP_851719.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Canis lupus familiaris]
Length = 394
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|148539642|ref|NP_060228.3| RNA demethylase ALKBH5 [Homo sapiens]
gi|302565646|ref|NP_001180917.1| probable alpha-ketoglutarate-dependent dioxygenase ABH5 [Macaca
mulatta]
gi|397476905|ref|XP_003809831.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Pan paniscus]
gi|402898955|ref|XP_003912472.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Papio anubis]
gi|408359959|sp|Q6P6C2.2|ALKB5_HUMAN RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|119576065|gb|EAW55661.1| alkB, alkylation repair homolog 5 (E. coli), isoform CRA_a [Homo
sapiens]
gi|119576066|gb|EAW55662.1| alkB, alkylation repair homolog 5 (E. coli), isoform CRA_a [Homo
sapiens]
gi|387542580|gb|AFJ71917.1| putative alpha-ketoglutarate-dependent dioxygenase ABH5 [Macaca
mulatta]
gi|410221108|gb|JAA07773.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
gi|410251666|gb|JAA13800.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
gi|410300496|gb|JAA28848.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
gi|410335931|gb|JAA36912.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
Length = 394
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|410980043|ref|XP_003996390.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Felis catus]
Length = 394
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|395748658|ref|XP_003778808.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Pongo abelii]
Length = 394
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|390462888|ref|XP_003732930.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5
[Callithrix jacchus]
Length = 392
Score = 306 bits (783), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 81 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 140
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 141 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEYRVIPEGFVNSAVINDYQPGGCIV 200
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 201 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 260
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 261 EITHCIRPQDIKERRAVIILRKTRLDAPRL 290
>gi|30354744|gb|AAH52076.1| AlkB, alkylation repair homolog 5 (E. coli) [Mus musculus]
Length = 395
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/236 (60%), Positives = 175/236 (74%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + D + + E +K+ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 58 KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPKGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|441642261|ref|XP_004090430.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5 [Nomascus
leucogenys]
Length = 394
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSAYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|31044423|ref|NP_766531.2| RNA demethylase ALKBH5 [Mus musculus]
gi|114151682|sp|Q3TSG4.2|ALKB5_MOUSE RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|26343643|dbj|BAC35478.1| unnamed protein product [Mus musculus]
gi|74194466|dbj|BAE37281.1| unnamed protein product [Mus musculus]
Length = 395
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/236 (60%), Positives = 175/236 (74%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + D + + E +K+ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 58 KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|347300461|ref|NP_001178572.1| RNA demethylase ALKBH5 [Rattus norvegicus]
gi|449061781|sp|D3ZKD3.1|ALKB5_RAT RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
Length = 395
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/236 (60%), Positives = 175/236 (74%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + D + + E +K+ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 58 KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKL 177
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|332848357|ref|XP_001158112.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Pan troglodytes]
Length = 394
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|329664868|ref|NP_001192446.1| RNA demethylase ALKBH5 [Bos taurus]
gi|449061780|sp|E1BH29.1|ALKB5_BOVIN RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|296476634|tpg|DAA18749.1| TPA: hypothetical protein BOS_18839 [Bos taurus]
Length = 394
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|449281378|gb|EMC88458.1| Alkylated DNA repair protein alkB like protein 5, partial [Columba
livia]
Length = 349
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 141/210 (67%), Positives = 168/210 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS EEC +E I++V S AE+G YK TVDRAPLRNKYFFGEGYTYG
Sbjct: 36 RKVKSGIRQLRLFSAEECAKIEARIEDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 95
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
SQL R+G G ERL P GEVD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 96 SQLQRRGPGQERLLPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 155
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L +AA+
Sbjct: 156 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLSGYAAD 215
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 216 EITHCIRPQDIKERRAVIILRKTRLDAPRL 245
>gi|426349222|ref|XP_004042212.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Gorilla gorilla gorilla]
gi|7020317|dbj|BAA91078.1| unnamed protein product [Homo sapiens]
Length = 378
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 67 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 126
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 127 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 186
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 187 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 246
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 247 EITHCIRPQDIKERRAVIILRKTRLDAPRL 276
>gi|26343649|dbj|BAC35481.1| unnamed protein product [Mus musculus]
Length = 395
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 142/236 (60%), Positives = 176/236 (74%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + S D + + E +++ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 58 KRKYQEDSDSERSDYEEHQLQKEEEARQVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCI+SH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCILSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|355668039|gb|AER94060.1| alkB, alkylation repair-like protein 5 [Mustela putorius furo]
Length = 340
Score = 305 bits (780), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 29 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 88
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 89 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 148
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 149 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 208
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 209 EITHCIRPQDIKERRAVIILRKTRLDAPRL 238
>gi|380800137|gb|AFE71944.1| putative alpha-ketoglutarate-dependent dioxygenase ABH5, partial
[Macaca mulatta]
Length = 358
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 47 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 106
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 107 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 166
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 167 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 226
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 227 EITHCIRPQDIKERRAVIILRKTRLDAPRL 256
>gi|116517268|ref|NP_001070855.1| RNA demethylase ALKBH5 [Danio rerio]
gi|123913930|sp|Q08BA6.1|ALKB5_DANRE RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|115527853|gb|AAI24805.1| AlkB, alkylation repair homolog 5 (E. coli) [Danio rerio]
Length = 352
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 141/242 (58%), Positives = 179/242 (73%)
Query: 15 EQNLKLKNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEV 74
++ + N R R S D + + +++ G+ Q S+F+ EEC +E IDEV
Sbjct: 19 DKVFEYSNGEKRKYRESDDDESEYEERRDAEARRVKSGIKQASIFTLEECARIEAKIDEV 78
Query: 75 SSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQS 134
+ A++G Y+ TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLY +GEVD IP W+
Sbjct: 79 VAKADKGLYREHTVDRAPLRNKYFFGEGYTYGAQLEKRGPGQERLYSKGEVDDIPDWVHE 138
Query: 135 LVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGC 194
LVI +L GV+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGC
Sbjct: 139 LVIDRLVTHGVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGC 198
Query: 195 KFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAP 254
KF FKPIRVSEPVL+LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ AP
Sbjct: 199 KFLFKPIRVSEPVLHLPVRRGSVTVLSGYAADDITHCIRPQDIKERRAVIILRKTRADAP 258
Query: 255 RL 256
RL
Sbjct: 259 RL 260
>gi|417410040|gb|JAA51501.1| Putative alkylated dna repair protein alkb log, partial [Desmodus
rotundus]
Length = 359
Score = 304 bits (778), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 48 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 107
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 108 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 167
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 168 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 227
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 228 EITHCIRPQDIKERRAVIILRKTRLDAPRL 257
>gi|348560293|ref|XP_003465948.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Cavia porcellus]
Length = 394
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQIRLFSQDECTKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|194375187|dbj|BAG62706.1| unnamed protein product [Homo sapiens]
Length = 383
Score = 303 bits (776), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 72 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 131
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 132 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 191
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 192 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 251
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 252 EITHCIRPQDIKERRAVIILRKTRLDAPRL 281
>gi|351707716|gb|EHB10635.1| Alkylated DNA repair protein alkB-like protein 5, partial
[Heterocephalus glaber]
Length = 349
Score = 303 bits (776), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 139/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ Q+ LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 39 RKVKSGIRQIRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 98
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 99 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 158
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 159 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 218
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 219 EITHCIRPQDIKERRAVIILRKTRLDAPRL 248
>gi|390345597|ref|XP_784548.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Strongylocentrotus purpuratus]
Length = 462
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 140/220 (63%), Positives = 168/220 (76%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+KI GV QM LF+ EEC +E I+EV +AE+G Y+ TVDRAPLRNKYFFGEGYTYG
Sbjct: 65 KKIHSGVAQMQLFTEEECSEIEDKINEVVDIAEKGLYRDHTVDRAPLRNKYFFGEGYTYG 124
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
SQL +KG G ERLYP GEVD IP W+Q LV++++ + G+VP +INSAVINDYQPGGCIV
Sbjct: 125 SQLSKKGPGQERLYPPGEVDDIPQWVQDLVVSRVVEAGLVPHGFINSAVINDYQPGGCIV 184
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIFDRPIIS+S SD AL FGCKF+F+PIRVS+P +P+ RGCV L +AA+
Sbjct: 185 SHVDPIHIFDRPIISVSFLSDCALSFGCKFSFRPIRVSKPKACIPLPRGCVMTLSGYAAD 244
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKP 266
+THC+RPQD RRAVI+LRRV P APRL P P
Sbjct: 245 EVTHCIRPQDVTARRAVIILRRVFPEAPRLEPVGLPSPAP 284
>gi|38383101|gb|AAH62339.1| AlkB, alkylation repair homolog 5 (E. coli) [Homo sapiens]
Length = 458
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 83 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|355568306|gb|EHH24587.1| hypothetical protein EGK_08266, partial [Macaca mulatta]
Length = 384
Score = 301 bits (771), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 28 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 87
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 88 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 147
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 148 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 207
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 208 EITHCIRPQDIKERRAVIILRKTRLDAPRL 237
>gi|355753817|gb|EHH57782.1| hypothetical protein EGM_07489, partial [Macaca fascicularis]
Length = 383
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 140/210 (66%), Positives = 167/210 (79%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 28 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 87
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 88 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 147
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 148 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 207
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 208 EITHCIRPQDIKERRAVIILRKTRLDAPRL 237
>gi|47218956|emb|CAF98154.1| unnamed protein product [Tetraodon nigroviridis]
Length = 355
Score = 300 bits (769), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 140/215 (65%), Positives = 168/215 (78%), Gaps = 4/215 (1%)
Query: 42 ARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
AR +R ILQ S+F+ E+C A+E ID+V + AE G Y+ TVDRAPLRNKYFFGE
Sbjct: 60 ARLVRNSILQ----RSVFTPEQCAAIEEKIDDVVAQAEAGLYREHTVDRAPLRNKYFFGE 115
Query: 102 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 161
GYTYGSQL ++G G ERLY +GEVD IP+W+ LVI L GV+P ++NSAVINDYQP
Sbjct: 116 GYTYGSQLEKRGPGQERLYRKGEVDEIPSWVHELVIKPLVSSGVIPEGFVNSAVINDYQP 175
Query: 162 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 221
GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L
Sbjct: 176 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLS 235
Query: 222 DFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+AA+ ITHC+RPQD + RRAVI+LR+ P APRL
Sbjct: 236 GYAADSITHCIRPQDIKERRAVIILRKTRPDAPRL 270
>gi|74187512|dbj|BAE36711.1| unnamed protein product [Mus musculus]
Length = 395
Score = 300 bits (769), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 141/236 (59%), Positives = 174/236 (73%)
Query: 21 KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
K +Y + D + + E +K+ G+ Q+ LFS +EC +E IDEV S AE+
Sbjct: 58 KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117
Query: 81 GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
G Y TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
+ V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFG +F FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGWQFQFKP 237
Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
IRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|410917752|ref|XP_003972350.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Takifugu rubripes]
Length = 361
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 142/215 (66%), Positives = 167/215 (77%), Gaps = 4/215 (1%)
Query: 42 ARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
AR +R ILQ S+FS E+C A+E ID+V + AE G Y+ TVDRAPLRNKYFFGE
Sbjct: 65 ARLVRDSILQ----KSIFSPEQCAAIEEKIDDVVAQAEAGLYREHTVDRAPLRNKYFFGE 120
Query: 102 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 161
GYTYGSQL ++G G ERLY +GEVD IP W+ LVI L GVVP ++NSAVINDYQP
Sbjct: 121 GYTYGSQLEKRGPGQERLYRKGEVDEIPNWVHELVIKPLVSNGVVPDGFVNSAVINDYQP 180
Query: 162 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 221
GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L
Sbjct: 181 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVKRGSVTVLS 240
Query: 222 DFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+AA+ ITHC+RPQD + RRAVI+LR+ P APRL
Sbjct: 241 GYAADDITHCIRPQDIKERRAVIILRKTRPDAPRL 275
>gi|432847440|ref|XP_004066024.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Oryzias latipes]
Length = 361
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 134/210 (63%), Positives = 164/210 (78%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+++ + Q S+FS EEC +E IDEV + E G Y+ TVDRAPLRNKYFFGEGYTYG
Sbjct: 67 RRVSSSIQQRSIFSSEECARIEEKIDEVVANGEAGLYREHTVDRAPLRNKYFFGEGYTYG 126
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLY +GEVD IP+W+ LVI L GV+P ++NSAVINDYQPGGCIV
Sbjct: 127 AQLEKRGPGQERLYRKGEVDDIPSWVHELVIKPLVSNGVIPEGFVNSAVINDYQPGGCIV 186
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
SH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L +AA+
Sbjct: 187 SHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLSGYAAD 246
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ P APR+
Sbjct: 247 EITHCIRPQDIKQRRAVIILRKTRPDAPRV 276
>gi|431914502|gb|ELK15752.1| Alkylated DNA repair protein alkB like protein 5 [Pteropus alecto]
Length = 303
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 137/201 (68%), Positives = 161/201 (80%)
Query: 56 MSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLG 115
M LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG+QL ++G G
Sbjct: 1 MRLFSQDECTKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPG 60
Query: 116 NERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIF 175
ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIVSH+DP HIF
Sbjct: 61 QERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIF 120
Query: 176 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQ 235
+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQ
Sbjct: 121 ERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQ 180
Query: 236 DTQHRRAVILLRRVLPHAPRL 256
D + RRAVI+LR+ APRL
Sbjct: 181 DIKERRAVIILRKTRLDAPRL 201
>gi|291224533|ref|XP_002732258.1| PREDICTED: alkB, alkylation repair homolog 5-like [Saccoglossus
kowalevskii]
Length = 362
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 135/214 (63%), Positives = 163/214 (76%), Gaps = 2/214 (0%)
Query: 44 ELRQKILQGVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEG 102
ELR + G+ Q +FS EEC +E ID V + A++ +K TVDRAPLRNKYFFGEG
Sbjct: 57 ELRL-VHSGIKQRRDVFSAEECRIIEEKIDSVVADADRSIFKKHTVDRAPLRNKYFFGEG 115
Query: 103 YTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPG 162
YTYG+QL +KG G ERLYP GEVD IP W+Q LVI +L ++P ++NSAVINDY+PG
Sbjct: 116 YTYGTQLDKKGPGQERLYPRGEVDDIPGWVQELVIKRLVSSKMIPDGFVNSAVINDYKPG 175
Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD 222
GCIVSH+DP HIF+RPI S+S SD ALCFGCKF+FKPIR S P+L LP+ RGCVTLL
Sbjct: 176 GCIVSHVDPIHIFERPIASVSFMSDCALCFGCKFSFKPIRTSRPLLSLPLTRGCVTLLSG 235
Query: 223 FAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+AAN ITHC+RPQD + RRAVI+LR+V APRL
Sbjct: 236 YAANDITHCIRPQDVKARRAVIILRKVRDDAPRL 269
>gi|260817481|ref|XP_002603615.1| hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]
gi|229288935|gb|EEN59626.1| hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]
Length = 314
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 151/254 (59%), Positives = 188/254 (74%), Gaps = 3/254 (1%)
Query: 39 KHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF 98
K ELR +I QG+ Q LFSHEEC +E I+E+ A++G YK TVDRAPLR KYF
Sbjct: 63 KDRQDELR-RIHQGIRQRILFSHEECEKIEEKIEEIILRADRGYYKEHTVDRAPLRVKYF 121
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
FGEGYTYGSQL RKG G ERLY +G+VD IP WI +V+++L + +VP ++NSAVIN
Sbjct: 122 FGEGYTYGSQLERKGPGMERLYSKGDVDDIPDWIFKMVVSRLVEAKIVPEGFVNSAVINC 181
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVT 218
Y PGGCIVSHIDPPHIF RPI+S+S FSDSALCFGCKF+FKPIRVS+PVL LPV RGC T
Sbjct: 182 YLPGGCIVSHIDPPHIFARPIVSVSFFSDSALCFGCKFSFKPIRVSKPVLCLPVARGCCT 241
Query: 219 LLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARES 278
+L D+AA+ ITHCVRPQD +HRRAVI+LRRV APR L+ T V+ H +D + ++
Sbjct: 242 VLSDYAADEITHCVRPQDIKHRRAVIILRRVRDDAPR--LAPTTEVQMVHVNHDRREEDN 299
Query: 279 GGYFRSSIETYNNN 292
+ T++++
Sbjct: 300 RTSTDDRLSTFHSD 313
>gi|156381190|ref|XP_001632149.1| predicted protein [Nematostella vectensis]
gi|156219200|gb|EDO40086.1| predicted protein [Nematostella vectensis]
Length = 256
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 139/240 (57%), Positives = 173/240 (72%), Gaps = 4/240 (1%)
Query: 12 QVVEQNLKLKNRYSRNQRSSP---YDIGQSKHHARELRQKILQGVNQMSLFSHEECLALE 68
Q ++N KL+ + + +RS+ Y+ S EL Q++ QG+ Q LF EC +E
Sbjct: 18 QNTDRNRKLQQQNRKRRRSASKVVYEKHSSYEQGLEL-QRVHQGIWQRVLFDKVECEEIE 76
Query: 69 RNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPI 128
I++V A++G YK TVDRAPLR KYFFGEGYTYG QL +G G+ERLY G+VD I
Sbjct: 77 EKINDVVVEADRGVYKDRTVDRAPLRVKYFFGEGYTYGKQLEERGPGSERLYARGDVDDI 136
Query: 129 PAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDS 188
P WI LVI++LE+ G+VP ++INSAVINDYQPGGCIVSHIDP HIFDRPI+S S FS
Sbjct: 137 PEWIYELVISRLEKAGIVPKDFINSAVINDYQPGGCIVSHIDPMHIFDRPIVSCSFFSSC 196
Query: 189 ALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRR 248
L FGCKF+FKPIR ++P+L LP+ RGCVT L +AA+ ITHCVRPQD RRAVI+LRR
Sbjct: 197 TLSFGCKFSFKPIRTTDPILSLPLPRGCVTALSGYAADEITHCVRPQDVVTRRAVIILRR 256
>gi|198418993|ref|XP_002122576.1| PREDICTED: similar to alkB, alkylation repair homolog 5, partial
[Ciona intestinalis]
Length = 308
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 126/247 (51%), Positives = 172/247 (69%), Gaps = 1/247 (0%)
Query: 53 VNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRK 112
++Q LF+ EEC +ER I++V E+ Y TVDRAPLR KYFFGEGYTYG Q+ +K
Sbjct: 1 ISQKRLFNDEECDEIERCINDVVRNGEKNMYNEHTVDRAPLRVKYFFGEGYTYGKQMNQK 60
Query: 113 GLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPP 172
G G ERLY +G VDPIP+W+ ++ LE G++ +I SAVINDY PGGCIVSH+DP
Sbjct: 61 GPGQERLYQKGVVDPIPSWVFKKLVKPLEDCGMIKKKWITSAVINDYYPGGCIVSHVDPA 120
Query: 173 HIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
HIF+RPI+S+S FS +AL FGCKF+FKPIRVS+PV+ LP+ RG T + +AA+ ITHC+
Sbjct: 121 HIFERPIVSVSFFSTAALSFGCKFSFKPIRVSDPVVCLPIPRGVATSISGYAADSITHCI 180
Query: 233 RPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARESGGYFRSSIETYNNN 292
RPQD + RRAVI+LRR +APR + V + +L + + R S R S++++ +
Sbjct: 181 RPQDIRQRRAVIILRRTFDNAPRQG-EDSSLVSHHRTLNNHEDRRSSNRHRLSLDSHRDR 239
Query: 293 ENHSNKN 299
H+ ++
Sbjct: 240 RKHTRRS 246
>gi|194217772|ref|XP_001918245.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like, partial [Equus caballus]
Length = 277
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 122/174 (70%), Positives = 143/174 (82%)
Query: 83 YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQ 142
Y TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL +
Sbjct: 2 YNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVE 61
Query: 143 MGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIR 202
V+P ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIR
Sbjct: 62 HRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIR 121
Query: 203 VSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
VSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 122 VSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 175
>gi|405958373|gb|EKC24507.1| Alkylated DNA repair protein alkB-like protein 5 [Crassostrea
gigas]
Length = 347
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 122/190 (64%), Positives = 147/190 (77%), Gaps = 1/190 (0%)
Query: 34 DIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPL 93
D+ S +A EL K+ G+ Q LF+ ++C +E I+E +A++G+YK TVDRAPL
Sbjct: 49 DLDASWSNADEL-SKVHAGIKQRRLFNDDQCEKIEEKINETVKIADKGEYKSHTVDRAPL 107
Query: 94 RNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINS 153
RNKYFFGEGYTYGSQL +KG G ERLYP+GEVD IP WI LVI L +VP ++NS
Sbjct: 108 RNKYFFGEGYTYGSQLEKKGPGMERLYPKGEVDEIPDWIHELVIKPLYDAKIVPEGFVNS 167
Query: 154 AVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQ 213
AVINDY PGGCIVSHIDPPHIFDRPI+S+S FSDSALCFGCKF+F+PIRVS PVL LP+
Sbjct: 168 AVINDYMPGGCIVSHIDPPHIFDRPIVSVSFFSDSALCFGCKFSFRPIRVSTPVLCLPID 227
Query: 214 RGCVTLLRDF 223
RGCVT++R F
Sbjct: 228 RGCVTMIRVF 237
>gi|403275437|ref|XP_003945296.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5 [Saimiri
boliviensis boliviensis]
Length = 386
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 119/174 (68%), Positives = 140/174 (80%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 72 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 131
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQPGGCIV
Sbjct: 132 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 191
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 192 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVL 245
>gi|221119302|ref|XP_002156519.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Hydra magnipapillata]
Length = 329
Score = 256 bits (655), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 123/211 (58%), Positives = 149/211 (70%), Gaps = 1/211 (0%)
Query: 47 QKILQGVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTY 105
Q I +G+ Q + F C +E+ ID + A++G Y P T D APLRNKYFFGEGYTY
Sbjct: 77 QLIHEGIFQKRNYFDDSMCRKIEKKIDGIVEKAKRGLYLPNTFDSAPLRNKYFFGEGYTY 136
Query: 106 GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
G + KG G ERL+ +GEVD IP WIQ VI KL VVP +INS VIN+Y PGGCI
Sbjct: 137 GKHMENKGPGQERLFKKGEVDKIPNWIQKHVIKKLYDDKVVPEGFINSVVINEYFPGGCI 196
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAA 225
VSHIDP HIFDRPIIS++ + S L FGCKF F PIR SEPVL LP+ RGC+T++ +AA
Sbjct: 197 VSHIDPIHIFDRPIISINFNTRSFLSFGCKFTFNPIRTSEPVLSLPMDRGCLTMISGYAA 256
Query: 226 NGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+ ITHC+RP D RR VI+LRRV P+APR+
Sbjct: 257 DNITHCIRPCDITERRCVIILRRVFPNAPRV 287
>gi|443714465|gb|ELU06866.1| hypothetical protein CAPTEDRAFT_35874, partial [Capitella teleta]
Length = 230
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 129/204 (63%), Positives = 154/204 (75%), Gaps = 3/204 (1%)
Query: 48 KILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGS 107
KI G+ Q LF+ C +E I+EV A G+YK TVD APLRNKYFFGEGYTYGS
Sbjct: 27 KIRTGIKQRCLFNDSACKEIEAKIEEVVRKAANGEYKKNTVDTAPLRNKYFFGEGYTYGS 86
Query: 108 QLVRKGLGNERLYP-EGE--VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 164
+ +G G ERLYP EGE VDPIP WI +V+ L + ++PP++INSAVINDYQPGGC
Sbjct: 87 HMEARGPGMERLYPKEGEESVDPIPEWIHEMVVQPLLRAQLIPPDFINSAVINDYQPGGC 146
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
IVSHIDP HIFDRPI+++S FS SAL FGCKF F+PIRVS PVL + + RG VTL+ FA
Sbjct: 147 IVSHIDPYHIFDRPIVTVSFFSSSALSFGCKFEFRPIRVSNPVLSVKLPRGGVTLISGFA 206
Query: 225 ANGITHCVRPQDTQHRRAVILLRR 248
A+ ITHC+RPQDT HRRAV++LRR
Sbjct: 207 ADEITHCIRPQDTPHRRAVVILRR 230
>gi|335310671|ref|XP_003362141.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Sus scrofa]
Length = 396
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 119/200 (59%), Positives = 139/200 (69%), Gaps = 14/200 (7%)
Query: 59 FSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
+ +EC +E IDEV S AE+G Y TVDRAPLRNKYF YG+QL ++G G ER
Sbjct: 106 LAADECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFXXXXXXYGAQLQKRGPGQER 165
Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHI--DPPHIFD 176
LYP G+VD IP W SAVINDYQPGGCIVSH+ DP HIF+
Sbjct: 166 LYPPGDVDEIPEWXXXXXXXX------------XSAVINDYQPGGCIVSHVHVDPIHIFE 213
Query: 177 RPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQD 236
RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD
Sbjct: 214 RPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQD 273
Query: 237 TQHRRAVILLRRVLPHAPRL 256
+ RRAVI+LR+ APRL
Sbjct: 274 IKERRAVIILRKTRLDAPRL 293
>gi|426238911|ref|XP_004023593.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5, partial
[Ovis aries]
Length = 312
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 114/210 (54%), Positives = 136/210 (64%), Gaps = 38/210 (18%)
Query: 47 QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
+K+ G+ QM LFS +EC +E IDEV S AE+G Y TVDRAPLRNKYFFGEGYTYG
Sbjct: 39 RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 98
Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
+QL ++G G ERLYP G+VD IP W+ LVI
Sbjct: 99 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVI----------------------------- 129
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+
Sbjct: 130 ---------ERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 180
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 181 EITHCIRPQDIKERRAVIILRKTRLDAPRL 210
>gi|351709131|gb|EHB12050.1| Alkylated DNA repair protein alkB-like protein 5 [Heterocephalus
glaber]
Length = 464
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 113/214 (52%), Positives = 135/214 (63%), Gaps = 34/214 (15%)
Query: 43 RELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEG 102
+E +K+ + Q+ LFS EEC + IDEV SLAE+G Y TVDRAPLRNKYFFG+G
Sbjct: 104 KEEARKVKSNIRQIRLFSQEECAKIAARIDEVVSLAEKGLYNEHTVDRAPLRNKYFFGDG 163
Query: 103 YTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPG 162
YTYG+QL ++ G ERLYP G+VD IP W LVI KL + V+P ++NSAVINDYQPG
Sbjct: 164 YTYGAQLQKRRPGQERLYPPGDVDEIPEWGPRLVIQKLVEHRVIPEGFVNSAVINDYQPG 223
Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD 222
GCI+S +S FSDSALCFGCK FKPIRVSEPVL LPV+R
Sbjct: 224 GCIMS------------VS---FSDSALCFGCKLQFKPIRVSEPVLSLPVRRR------- 261
Query: 223 FAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+ Q RRAVI+LR+ APRL
Sbjct: 262 ------------KRHQERRAVIILRKTRLDAPRL 283
>gi|313240619|emb|CBY32944.1| unnamed protein product [Oikopleura dioica]
Length = 462
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 130/217 (59%), Gaps = 11/217 (5%)
Query: 46 RQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTY 105
R++IL G+ S+F ++ LE + + +G+++ ++D + LR KY +
Sbjct: 31 RREILDGIKGFSIFDADDVEDLESDCRSMIERGGKGKFRVKSIDISRLRMKY-----FFG 85
Query: 106 GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
G G+E+ + ++ PIP WI +I K+E+ G+V N+INS VINDY+PGG I
Sbjct: 86 YGYEYGGGKGSEKFFNPKDISPIPDWIYKNIIEKMEKAGIVEKNWINSVVINDYEPGGFI 145
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF----KPIRV--SEPVLYLPVQRGCVTL 219
V H DPPH+F RPI L+LFSDSAL FGC F +P+ V S+PVL LP+QRG +T
Sbjct: 146 VQHQDPPHLFQRPIFILTLFSDSALSFGCNLRFDRSVEPVEVTASDPVLRLPMQRGIITS 205
Query: 220 LRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
N I HCVRP+DT+ RR I+LRRV + P L
Sbjct: 206 FEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 242
>gi|313225990|emb|CBY21133.1| unnamed protein product [Oikopleura dioica]
Length = 462
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 130/217 (59%), Gaps = 11/217 (5%)
Query: 46 RQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTY 105
R++IL G+ S+F ++ LE + + +G+++ ++D + LR KY +
Sbjct: 31 RREILDGIKGFSIFDADDVEDLESDCRSMIERGGKGKFRVKSIDISRLRMKY-----FFG 85
Query: 106 GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
G G+E+ + ++ PIP WI +I K+E+ G+V N+INS VINDY+PGG I
Sbjct: 86 YGYEYGGGKGSEKFFNPKDISPIPDWIYKNIIGKMEKAGIVEKNWINSVVINDYEPGGFI 145
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF----KPIRV--SEPVLYLPVQRGCVTL 219
V H DPPH+F RPI L+LFSDSAL FGC F +P+ V S+PVL LP+QRG +T
Sbjct: 146 VQHQDPPHLFQRPIFILTLFSDSALSFGCNLRFDRSVEPVEVTASDPVLRLPMQRGIITS 205
Query: 220 LRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
N I HCVRP+DT+ RR I+LRRV + P L
Sbjct: 206 FEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 242
>gi|354467887|ref|XP_003496399.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like, partial [Cricetulus griseus]
Length = 203
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/101 (73%), Positives = 86/101 (85%)
Query: 156 INDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRG 215
INDYQPGGCIVSH+D HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG
Sbjct: 1 INDYQPGGCIVSHVDQIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRG 60
Query: 216 CVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 61 SVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 101
>gi|296216728|ref|XP_002754700.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Callithrix jacchus]
Length = 231
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 53/74 (71%), Positives = 61/74 (82%)
Query: 183 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRA 242
S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRA
Sbjct: 56 SFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRA 115
Query: 243 VILLRRVLPHAPRL 256
VI+LR+ APRL
Sbjct: 116 VIILRKTRLDAPRL 129
>gi|118352562|ref|XP_001009552.1| hypothetical protein TTHERM_00371220 [Tetrahymena thermophila]
gi|89291319|gb|EAR89307.1| hypothetical protein TTHERM_00371220 [Tetrahymena thermophila SB210]
Length = 1999
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 102/206 (49%), Gaps = 24/206 (11%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKP------CTVDRAPLRNKYFFGEGYTYGSQLVR 111
F+H+E +++E + E + Q Y P CT + R K+FFG Y + Q +
Sbjct: 1751 FFTHQELMSIENSCYETEIKSFQRHYFPMTAQMTCTAQKI-RRTKFFFGSRYIWTRQQLA 1809
Query: 112 K-----GLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CI 165
+ G G + +V P+W++ V+ LEQ G++PP +INS +N Y G +
Sbjct: 1810 EPHSFVGAGVRQ-----DVSQPPSWMRRKVVGPLEQAGILPPGFINSLALNVYHDGKEGL 1864
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--F 223
H D F +PI + LFSD+ L FG + +P+ RG V L + +
Sbjct: 1865 AQHFDDATRFRQPIFTFKLFSDARLSFGSQL----YGFCNSAFVIPMPRGSVLKLEENSY 1920
Query: 224 AANGITHCVRPQDTQHRRAVILLRRV 249
AANGI HCVRP D + A I++R++
Sbjct: 1921 AANGIKHCVRPCDMAGKNATIIMRQM 1946
>gi|403365523|gb|EJY82547.1| hypothetical protein OXYTRI_19840 [Oxytricha trifallax]
Length = 1315
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 112/228 (49%), Gaps = 13/228 (5%)
Query: 35 IGQSKHHARELRQKILQGVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRA-- 91
+G SK + +K+ G+ + +++EE +E I++ + Q Y P T +
Sbjct: 844 VGISKDEEIQDVEKLRVGITTFENFYTNEEMKNMEHMIEQTEKKSLQNAYLPMTAQKTFS 903
Query: 92 ---PLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP 148
R K+FFG Y + + + L N +V P W+++++ L G++
Sbjct: 904 GNTLKRTKFFFGFRYMWTRTQLLEPLSNVAAGVRADVSQPPVWMKTILEEPLVNAGIIEK 963
Query: 149 NYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
++INS +N Y G + H D F +PI +L LFSDS L FG ++
Sbjct: 964 DFINSVALNVYHDGSEGLAQHFDDATRFKQPIYTLRLFSDSRLSFGSQY----YGFCNGA 1019
Query: 208 LYLPVQRGCVTLLRD--FAANGITHCVRPQDTQHRRAVILLRRVLPHA 253
+P+ RGC+ ++ + ++ANG+ HC+RP D + + ++LR++ P+
Sbjct: 1020 FTIPLPRGCICVMEEGSYSANGVKHCIRPCDMTGKSSAVILRQMHPNV 1067
>gi|340508382|gb|EGR34097.1| hypothetical protein IMG5_024190 [Ichthyophthirius multifiliis]
Length = 798
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 94/201 (46%), Gaps = 14/201 (6%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKP------CTVDRAPLRNKYFFGEGYTYGSQLVR 111
F+HEE + +E + Q Y P CT ++ R K+FFG Y + +
Sbjct: 557 FFTHEEMIDIENETYKTEIKCFQRHYFPMTAQVTCTAEKIR-RTKFFFGSRYMWTKMQLA 615
Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHID 170
+ +V P WI+ V+ KLE ++P +INS +N Y G + H D
Sbjct: 616 EPHSYVGAGVRQDVSETPLWIKKKVVQKLENANILPKKFINSLALNVYHDGNEGLAQHFD 675
Query: 171 PPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGI 228
F +PI + LFSD+ L FG +F +P+ RG + L + +AANGI
Sbjct: 676 DATRFMQPIFTFKLFSDARLSFGSQF----YGFCNSAFVIPMPRGSILKLEENSYAANGI 731
Query: 229 THCVRPQDTQHRRAVILLRRV 249
HCVRP D + A I++R++
Sbjct: 732 KHCVRPCDMTGKNATIIMRQM 752
>gi|145483981|ref|XP_001428013.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395096|emb|CAK60615.1| unnamed protein product [Paramecium tetraurelia]
Length = 1283
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 96/203 (47%), Gaps = 17/203 (8%)
Query: 59 FSHEECLALERNIDEVSSLAEQGQYKPCTVD---------RAPLRNKYFFGEGYTYGSQL 109
F++EE L +E+ I E+ G Y P T + R K+FFG Y +
Sbjct: 1041 FTNEELLQMEKEIYEMELKGFDGHYLPMTAQISHAQSNHHSSVRRTKFFFGYRYMWTKCQ 1100
Query: 110 VRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSH 168
+ + +V P W+++L ITKLE VVP +INS N Y G + H
Sbjct: 1101 LAEPHSKVAAGVRRDVSAPPLWMRNL-ITKLENDNVVPKKFINSIACNIYHDGKEGLAQH 1159
Query: 169 IDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAAN 226
D F +PI ++ +FSD L FG +F +P+ RGC+ + + +AAN
Sbjct: 1160 FDDAVRFKQPIFTIRVFSDCRLSFGSQF----YGFCNGAFAVPLPRGCILCMEEGSYAAN 1215
Query: 227 GITHCVRPQDTQHRRAVILLRRV 249
I HCVRP D + A ++LR++
Sbjct: 1216 AIKHCVRPCDMTGKSAALILRQM 1238
>gi|145520431|ref|XP_001446071.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413548|emb|CAK78674.1| unnamed protein product [Paramecium tetraurelia]
Length = 941
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 97/203 (47%), Gaps = 17/203 (8%)
Query: 59 FSHEECLALERNIDEVSSLAEQGQYKPCTVD---------RAPLRNKYFFGEGYTYGSQL 109
F++EE L +E++I E+ G Y P T + R K+FFG Y +
Sbjct: 699 FTNEELLQMEKDIYEMELKGFDGHYLPMTAQISHAQSNHHSSVRRTKFFFGYRYMWTKCQ 758
Query: 110 VRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSH 168
+ + +V P W+++L ITKLE VVP +INS N Y G + H
Sbjct: 759 LAEPHSMVAAGVRRDVSAPPLWMRNL-ITKLENDNVVPKKFINSIACNVYHDGKEGLAQH 817
Query: 169 IDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAAN 226
D F +PI ++ +FSD L FG +F +P+ RGC+ + + +AAN
Sbjct: 818 FDDAVRFKQPIYTIRVFSDCRLSFGSQF----YGFCNGAFAVPLPRGCILCMEEGSYAAN 873
Query: 227 GITHCVRPQDTQHRRAVILLRRV 249
I HCVRP D + A ++LR++
Sbjct: 874 AIKHCVRPCDMTGKSAALILRQM 896
>gi|440799034|gb|ELR20095.1| SAM domain (Sterile alpha motif) domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 1394
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 101/206 (49%), Gaps = 16/206 (7%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKP------CTV--DRAPLRNKYFFGEGYTYGSQL 109
F+ E +E++ D + A G+Y CT DR R K+FF Y + ++
Sbjct: 664 FFTDAELRQIEKSADATDADARNGKYDEVPNTYQCTRKGDRL-CRTKFFFSARYLWTAEQ 722
Query: 110 VRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSH 168
+ + +V P W++ L+ L + +V ++N +N Y G + H
Sbjct: 723 MASPNSTQAQGIRADVAEPPFWMRQLIERPLVESRIVQEAWVNEFAMNIYHDGTEGLGQH 782
Query: 169 IDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL--RDFAAN 226
D F RPI +L +FSDS L FG K +F ++ V ++P+ RGC+T+L + +A +
Sbjct: 783 YDDKKRFKRPIHTLRIFSDSRLSFGSK-DFS---MTNSVFFIPMPRGCITVLEKKGYAVD 838
Query: 227 GITHCVRPQDTQHRRAVILLRRVLPH 252
GI HCVR D + V+LLR + PH
Sbjct: 839 GIKHCVRAADMGGKSGVVLLRHIHPH 864
>gi|403336135|gb|EJY67254.1| hypothetical protein OXYTRI_12242 [Oxytricha trifallax]
Length = 943
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 92/206 (44%), Gaps = 28/206 (13%)
Query: 57 SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRA-----PLRNKYFFGEGYTYGSQLVR 111
+ F+H E L +E N+ E + P T + R K+FFG Y + + +
Sbjct: 683 NFFTHGELLNIEENVMETERQCSNSAFLPDTAQQTFSGSRLTRTKFFFGYRYMWTKEQLS 742
Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDP 171
+ +V P W+ +++K+E+ G+V N+INS +N Y G
Sbjct: 743 EPFSQVAAGVRKDVSRPPVWVNEFMVSKMEKCGIVEKNFINSIAMNVYHDG--------- 793
Query: 172 PHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGIT 229
L+ D A F + F ++P+ RGC+ ++ + +AANG+
Sbjct: 794 -------TEGLAQHYDDATRFKQFYGF-----VNGAFFIPLARGCICVMEENGYAANGVK 841
Query: 230 HCVRPQDTQHRRAVILLRRVLPHAPR 255
HC+RPQD + A ++LR++ P+ +
Sbjct: 842 HCIRPQDMTGKSAALILRQMHPNVVK 867
>gi|303274614|ref|XP_003056625.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462709|gb|EEH60001.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 897
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 104/232 (44%), Gaps = 19/232 (8%)
Query: 29 RSSPYDIGQSKHHARELRQKI---LQGVNQ-MSLFSHEECLALERNIDE-VSSLAEQGQY 83
R+ D G+ K R R + GV Q + F +E E+ I E + G
Sbjct: 641 RADDGDAGERKSTKRVWRFRSNGQYFGVTQYLHAFDDDELEMTEKAITELIDRERASGDA 700
Query: 84 KPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQM 143
V R K FFG YTY S K + + PIP+W+ S + + + +
Sbjct: 701 PHVHVGLNGGRIKVFFGYRYTYDSVTAPKLIAD--------APPIPSWVHSTLGNRAKII 752
Query: 144 GVVPPNY-INSAVINDYQPGGCIVS-HIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 201
G + N I++AV+N Y +S H+DP +F RPI+S F D L FG K ++
Sbjct: 753 GAIDKNVVIDNAVVNLYARSKARLSVHMDPSALFKRPIVSARFFGDGVLSFGAKGQYEGQ 812
Query: 202 RVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHA 253
R+ +P+ RG + ++ +AAN +TH V +D R +LR P A
Sbjct: 813 RIHS----VPLTRGSIAVMEGYAANMVTHAVNERDVVGRGCSFMLRSCQPDA 860
>gi|302841282|ref|XP_002952186.1| hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]
gi|300262451|gb|EFJ46657.1| hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]
Length = 2654
Score = 88.2 bits (217), Expect = 5e-15, Method: Composition-based stats.
Identities = 74/229 (32%), Positives = 100/229 (43%), Gaps = 39/229 (17%)
Query: 58 LFSHEECLALERNIDEVSSLAEQG-----------------QYKPCTVDRAPLRNKYFFG 100
LF +E +E +D +LA +G A R K FFG
Sbjct: 1604 LFRQDELYGMEAQVDATDTLARRGLLPSDCYHHSFGAGGGGSGGIAAAAAALKRTKMFFG 1663
Query: 101 EGYTYG-SQLVRKGL---GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVI 156
Y + QL G G R+ +V P W++ V L + GVVP +++S +
Sbjct: 1664 ARYLWTREQLAAPGARIAGGVRV----DVPPPLRWMRHQVEAPLVRCGVVPRGFVDSIAL 1719
Query: 157 NDYQPGG--------CIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVL 208
N Y G SH D F RPI S+ LFSDS L FG +
Sbjct: 1720 NLYHDGSEGIQVGGGGEGSHYDDAVRFRRPIYSVRLFSDSRLSFGGHLYGN----TNNDF 1775
Query: 209 YLPVQRGCVTLLRD--FAANGITHCVRPQDTQHRRAVILLRRVLPHAPR 255
++P+ RGCVT+L D +AANGI HCVRP D + A ++LR + P A R
Sbjct: 1776 FVPMPRGCVTVLEDESYAANGIKHCVRPADMAGKSAALILRGIDPGAMR 1824
>gi|384246841|gb|EIE20330.1| hypothetical protein COCSUDRAFT_67586 [Coccomyxa subellipsoidea
C-169]
Length = 988
Score = 81.3 bits (199), Expect = 5e-13, Method: Composition-based stats.
Identities = 58/189 (30%), Positives = 90/189 (47%), Gaps = 13/189 (6%)
Query: 57 SLFSHEECLALERNIDEVSSLAEQGQY-KPC---TVDRAP--LRNKYFFGEGYTYGSQLV 110
+ ++ EE +E D V + A +G+ + C T R R K+FFG Y + ++
Sbjct: 778 NFYTAEELRQMEAGADVVHAKAREGRMPQTCFHETRGRGGGLKRTKFFFGARYLWTAEQA 837
Query: 111 RKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHI 169
+V P P W+Q+L L + G+VP ++ ++ +N Y G I SH
Sbjct: 838 ADPASRLAAGVRLDVPPAPTWMQALAEKPLVETGLVPKDFFDAWALNLYHDGSEGIQSHF 897
Query: 170 DPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR--DFAANG 227
D F RPI S+ LFSDS L FG + + + + RGC+T++ +A G
Sbjct: 898 DDGTRFSRPIFSVRLFSDSRLSFGTQL----YGYTNGAFTVDMPRGCITVMEAGGYAVEG 953
Query: 228 ITHCVRPQD 236
+ HCVRP D
Sbjct: 954 VKHCVRPAD 962
>gi|356552441|ref|XP_003544576.1| PREDICTED: uncharacterized protein LOC100797674 [Glycine max]
Length = 522
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 62/214 (28%), Positives = 95/214 (44%), Gaps = 25/214 (11%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVD------RAPLRNKYFFGE 101
IL+G+ +FS E + + + + +G+ K T R R FG
Sbjct: 194 ILEGLELHTGIFSAAEQKRIVNYVASLQEMGRKGELKEQTFSAPQKWMRGKGRQTIQFGC 253
Query: 102 GYTYGSQLVRKG-----LGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAV 155
Y Y + R G LGN G VDPIPA + ++I +L + V+PP + +S +
Sbjct: 254 CYNYA--VDRDGNPPGILGN------GMVDPIPALFK-VIIRRLIKWHVLPPTCVPDSCI 304
Query: 156 INDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRG 215
+N Y+ G CI HID H F RP ++S S+ + FG + + +P+ G
Sbjct: 305 VNIYEEGDCIPPHID-NHDFVRPFCTVSFLSECNIVFGSNLKIVGPGEFDGSIAIPLPMG 363
Query: 216 CVTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
V +L A+ HCV T +R I RR+
Sbjct: 364 SVLVLNGNGADVAKHCVPAVPT--KRISITFRRM 395
>gi|307105467|gb|EFN53716.1| hypothetical protein CHLNCDRAFT_136563 [Chlorella variabilis]
Length = 521
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 59/129 (45%), Gaps = 30/129 (23%)
Query: 129 PAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISLSLFSD 187
P W++ V L G+V P + +S +N Y G I SH D F +PI+SL LFSD
Sbjct: 319 PRWMRQKVEAPLVGAGLVAPGFCDSIALNIYHDGSEGIQSHYDDAARFCQPILSLRLFSD 378
Query: 188 SALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLR 247
++ G +AANGI HCVRP D + A ++LR
Sbjct: 379 M-----------------------MEEG------GYAANGIKHCVRPMDMSGKSAGMILR 409
Query: 248 RVLPHAPRL 256
R+ P AP L
Sbjct: 410 RINPPAPAL 418
>gi|356562125|ref|XP_003549324.1| PREDICTED: uncharacterized protein LOC100787321 [Glycine max]
Length = 520
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 58/210 (27%), Positives = 93/210 (44%), Gaps = 18/210 (8%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVD------RAPLRNKYFFGE 101
IL+G+ +FS E + + + + ++G+ K T R R FG
Sbjct: 193 ILEGLELHTGIFSAAEQKRIVNYVASLQEMGKKGELKERTFSAPQKWMRGKGRQTIQFGC 252
Query: 102 GYTYGSQLVRKGLGNE-RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
Y Y + GN + G VDPIP + ++I +L + V+PP + +S ++N Y
Sbjct: 253 CYNYADRD-----GNPPGILTNGMVDPIPDLFK-VIIRRLVKWHVLPPTCVPDSCIVNIY 306
Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTL 219
+ G CI HID H F RP ++S S+ + FG + + +P+ G V +
Sbjct: 307 EEGDCIPPHID-NHDFVRPFCTVSFLSECNIVFGSNLKIVGPGEFDGSIAIPLPMGSVLV 365
Query: 220 LRDFAANGITHCVRPQDTQHRRAVILLRRV 249
L A+ HCV T +R I RR+
Sbjct: 366 LNGNGADVAKHCVPAVPT--KRISITFRRM 393
>gi|384245374|gb|EIE18868.1| hypothetical protein COCSUDRAFT_20284, partial [Coccomyxa
subellipsoidea C-169]
Length = 246
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 77/155 (49%), Gaps = 9/155 (5%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y + R+G + E V+P+P +Q+LV +L + GV+P + +SA+IN
Sbjct: 84 FGCCYNYA--IDREGR-EPGIIAEEVVEPMPPMLQALV-HRLVRWGVMPRSKAPDSAIIN 139
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKF-NFKPIRVSEPVLYLPVQRGC 216
Y CI HID H F RP ++SL S+ A+ FG K P + +P+ G
Sbjct: 140 IYDQEDCIPPHIDH-HDFSRPFCTISLLSEQAIMFGAKLIPLGPGKFGGNHCTIPLPVGS 198
Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLP 251
+L+ A+ HCV P RR I LR+ P
Sbjct: 199 CLVLKGNGADVAMHCVPP--VSQRRMSITLRKCAP 231
>gi|343172812|gb|AEL99109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein,
partial [Silene latifolia]
gi|343172814|gb|AEL99110.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein,
partial [Silene latifolia]
Length = 261
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 95/213 (44%), Gaps = 23/213 (10%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK---------PCTVDRAPLRNKYF 98
+LQG+ +FS EE ++ ++ V + GQ K P R R
Sbjct: 39 VLQGLELHTGVFSPEE---QKKIVESVYNFQRMGQKKQLMQRTYTEPAKWMRGKGRVTIQ 95
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y V K + + EVDP+P + + I+++ + V+PP + NS ++N
Sbjct: 96 FGCCYNYA---VDKDGNPPGIIRDEEVDPLPPVFKQM-ISRMVKWHVLPPTRVPNSCIVN 151
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGC 216
Y G CI HID H F RP ++S ++ + FG + P S PV +P+ G
Sbjct: 152 IYDEGDCIPPHID-HHDFLRPFCTVSFLTECNIMFGTNLKVEGPGEFSGPVT-IPLPVGS 209
Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
V +L A+ HC+ +R + RR+
Sbjct: 210 VLVLNGNGADVAKHCI--PAVPAKRISVTFRRM 240
>gi|79326344|ref|NP_001031794.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|55819794|gb|AAV66092.1| At4g36090 [Arabidopsis thaliana]
gi|59958356|gb|AAX12888.1| At4g36090 [Arabidopsis thaliana]
gi|332661217|gb|AEE86617.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 520
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 116/282 (41%), Gaps = 43/282 (15%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
IL+G+ +FS + ++ +D V L E+G+ R LR + F
Sbjct: 216 ILEGLELHTGVFS---AVEQKKIVDFVYELQEKGR-------RGELRERTFTAPHKWMRG 265
Query: 99 -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
FG Y Y G + G+VDP+P+ I ++I +L V+PP +
Sbjct: 266 KGRVTIQFGCCYNYAPDKAGNPPG---ILQRGDVDPMPS-IFKVIIKRLVGWHVLPPTCV 321
Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
+S ++N Y+ CI HID H F RP ++S S+ + FG +
Sbjct: 322 PDSCIVNIYEEDDCIPPHID-NHDFLRPFCTVSFLSECNILFGSNLKVLGPGEFSGSYSI 380
Query: 211 PVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVK----- 265
P+ G V +L+ A+ HCV T +R I R++ + + P ++
Sbjct: 381 PLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLEEIKPL 438
Query: 266 PY-HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSNGLDK 306
PY H+ A RSSI+ N NH+N+ G K
Sbjct: 439 PYEHTTPSTPADAVISSSRSSID--QNGSNHNNRTAHGGGSK 478
>gi|449434166|ref|XP_004134867.1| PREDICTED: uncharacterized protein LOC101203292 [Cucumis sativus]
gi|449531418|ref|XP_004172683.1| PREDICTED: uncharacterized protein LOC101225118 [Cucumis sativus]
Length = 499
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 63/217 (29%), Positives = 98/217 (45%), Gaps = 31/217 (14%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFG-EGYTYG 106
IL+G+ +FS E R +D V +L E G+ R LR + F + + G
Sbjct: 218 ILEGLELHTCIFSAAE---QTRIVDHVYALQEMGK-------RGELRERTFSAPKKWMKG 267
Query: 107 SQLVRKGLG-------NERLYPEGE-----VDPIPAWIQSLVITKLEQMGVVPPNYI-NS 153
V G ++ P G VDP+P+ + ++I +L + V+PP + +S
Sbjct: 268 KGRVTMQFGCCYNYAPDKNGNPPGILRSEIVDPLPSLFK-VIIRRLVRWHVLPPTCVPDS 326
Query: 154 AVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF-KPIRVSEPVLYLPV 212
++N Y G CI HID H F RP ++S S+ + FG + P S P+ +P+
Sbjct: 327 CIVNIYDEGDCIPPHID-NHDFVRPFCTVSFLSECNIVFGTNLSIVGPGEFSGPIA-IPL 384
Query: 213 QRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
G V +L A+ HCV T +R I RR+
Sbjct: 385 PVGSVLVLNGNGADVAKHCVPAVPT--KRISITFRRI 419
>gi|79326336|ref|NP_001031793.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|51971445|dbj|BAD44387.1| hypothetical protein [Arabidopsis thaliana]
gi|332661216|gb|AEE86616.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 452
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 116/282 (41%), Gaps = 43/282 (15%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
IL+G+ +FS + ++ +D V L E+G+ R LR + F
Sbjct: 148 ILEGLELHTGVFS---AVEQKKIVDFVYELQEKGR-------RGELRERTFTAPHKWMRG 197
Query: 99 -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
FG Y Y G + G+VDP+P+ I ++I +L V+PP +
Sbjct: 198 KGRVTIQFGCCYNYAPDKAGNPPG---ILQRGDVDPMPS-IFKVIIKRLVGWHVLPPTCV 253
Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
+S ++N Y+ CI HID H F RP ++S S+ + FG +
Sbjct: 254 PDSCIVNIYEEDDCIPPHID-NHDFLRPFCTVSFLSECNILFGSNLKVLGPGEFSGSYSI 312
Query: 211 PVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVK----- 265
P+ G V +L+ A+ HCV T +R I R++ + + P ++
Sbjct: 313 PLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLEEIKPL 370
Query: 266 PY-HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSNGLDK 306
PY H+ A RSSI+ N NH+N+ G K
Sbjct: 371 PYEHTTPSTPADAVISSSRSSID--QNGSNHNNRTAHGGGSK 410
>gi|119638440|gb|ABL85031.1| oxidoreductase [Brachypodium sylvaticum]
Length = 598
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 29/264 (10%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYF 98
IL G+ +FS E R +D V L EQG+ +P R R
Sbjct: 197 ILAGLELHAGVFSAAE---QRRIVDCVYGLQEQGKRGELGDRTYTEPEKWMRGKGRVTIQ 253
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y + G R + DPIP + ++I +L + ++PP+ + +S ++N
Sbjct: 254 FGCCYNYATDKNGNPPGIIRTFAS---DPIPDLFK-VMIKRLVRWCILPPDCVPDSCIVN 309
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
Y PG CI HID H F RP ++S S+ + FG +P+ G V
Sbjct: 310 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSFAIPLPVGSV 368
Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARE 277
++ A+ HCV +R I R++ P P L D+ E
Sbjct: 369 LVINGNGADVAKHCV--PAVPAKRISITFRKMDPAKRPFGFKDDP------ELLDITPME 420
Query: 278 SGG--YFRSSIETYNNNENHSNKN 299
+ RSS E + NKN
Sbjct: 421 AASEETSRSSDEGKGKQLDVQNKN 444
>gi|147857702|emb|CAN80815.1| hypothetical protein VITISV_020466 [Vitis vinifera]
Length = 849
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 19/195 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
IL G+ + +FS E + I E+ + GQ K P R R FG
Sbjct: 561 ILDGLELHVGIFSAAEQKRIVDFIYELQEMGRNGQLKERTYSAPQKWMRGKGRVTIQFGC 620
Query: 102 GYTYGSQLVRKGLGNERLYPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
Y Y + GN + EV DPIP + ++I +L + V+PP+ + +S ++N Y
Sbjct: 621 CYNYATDKN----GNPPGILQNEVVDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIY 675
Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCV 217
+ G CI HID H F RP ++S S+ + FG N K + E V +P+ G V
Sbjct: 676 EEGDCIPPHID-NHDFVRPFCTVSFLSECBIVFGT--NLKXLGAGEFVGPFAIPLPVGSV 732
Query: 218 TLLRDFAANGITHCV 232
+L A+ HCV
Sbjct: 733 LVLNGNGADVAKHCV 747
>gi|302757365|ref|XP_002962106.1| hypothetical protein SELMODRAFT_64626 [Selaginella moellendorffii]
gi|300170765|gb|EFJ37366.1| hypothetical protein SELMODRAFT_64626 [Selaginella moellendorffii]
Length = 289
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 89/202 (44%), Gaps = 21/202 (10%)
Query: 43 RELRQKILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
+ +R ILQG+ ++FS E R +D V L +GQ K+ G+
Sbjct: 64 KGMRVNILQGLELHTNIFSAAE---QSRLLDMVFELELKGQRNELKERTYSAPRKWMQGK 120
Query: 102 G---------YTYGSQLVRKGLGNE-RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
G Y Y GN + + EVDP+P ++S I +L + V+PP +
Sbjct: 121 GRVTLQFGCCYNYS----HDKFGNTPGILQDEEVDPLPPLLKS-TIKRLVRWHVLPPTCV 175
Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
+S ++N Y+ G CI HID H F RP ++SL S ++ FG + +
Sbjct: 176 PDSCIVNIYETGDCIPPHID-HHDFLRPFCTVSLLSQCSIVFGSSLSVAAPGEFDGSFST 234
Query: 211 PVQRGCVTLLRDFAANGITHCV 232
+ G V +L D A+ HC+
Sbjct: 235 QLPVGSVLVLNDNGADVAKHCI 256
>gi|297798302|ref|XP_002867035.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312871|gb|EFH43294.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 490
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 73/271 (26%), Positives = 114/271 (42%), Gaps = 38/271 (14%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
IL+G+ +FS E ++ +D V L E+GQ R LR++ F
Sbjct: 186 ILEGLELHTGVFSAVE---QKKIVDFVYELQEKGQ-------RGELRDRTFTAPHKWMRG 235
Query: 99 -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
FG Y Y G + VDPIP+ + ++I +L V+PP I
Sbjct: 236 KGRVTIQFGCCYNYAPDKAGNPPG---ILQREAVDPIPSLFK-VIIKRLVGWHVLPPTCI 291
Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPVLY 209
+S ++N Y G CI HID H F RP ++S S+ + FG P S P
Sbjct: 292 PDSCIVNIYDEGDCIPPHIDN-HDFVRPFCTVSFLSECNILFGSNLKVVGPGEFSGP-YS 349
Query: 210 LPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHS 269
+P+ G V +L+ A+ HCV T +R I R++ + + P ++
Sbjct: 350 MPLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLQGIKP 407
Query: 270 L-YDVQARESGGYFRSSIETYNNNENHSNKN 299
L Y+ + + S +N++N SN N
Sbjct: 408 LPYEQTTQSTPAAAAISSSRSSNDQNGSNHN 438
>gi|297735021|emb|CBI17383.3| unnamed protein product [Vitis vinifera]
Length = 547
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 19/195 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
IL G+ + +FS E + I E+ + GQ K P R R FG
Sbjct: 259 ILDGLELHVGIFSAAEQKRIVDFIYELQEMGRNGQLKERTYSAPQKWMRGKGRVTIQFGC 318
Query: 102 GYTYGSQLVRKGLGNERLYPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
Y Y + GN + EV DPIP + ++I +L + V+PP+ + +S ++N Y
Sbjct: 319 CYNYATD----KNGNPPGILQNEVVDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIY 373
Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCV 217
+ G CI HID H F RP ++S S+ + FG N K + E V +P+ G V
Sbjct: 374 EEGDCIPPHID-NHDFVRPFCTVSFLSECDIVFGT--NLKILGAGEFVGPFAIPLPVGSV 430
Query: 218 TLLRDFAANGITHCV 232
+L A+ HCV
Sbjct: 431 LVLNGNGADVAKHCV 445
>gi|225431153|ref|XP_002268446.1| PREDICTED: uncharacterized protein LOC100250563 [Vitis vinifera]
Length = 510
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 19/195 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
IL G+ + +FS E + I E+ + GQ K P R R FG
Sbjct: 222 ILDGLELHVGIFSAAEQKRIVDFIYELQEMGRNGQLKERTYSAPQKWMRGKGRVTIQFGC 281
Query: 102 GYTYGSQLVRKGLGNERLYPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
Y Y + GN + EV DPIP + ++I +L + V+PP+ + +S ++N Y
Sbjct: 282 CYNYATD----KNGNPPGILQNEVVDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIY 336
Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCV 217
+ G CI HID H F RP ++S S+ + FG N K + E V +P+ G V
Sbjct: 337 EEGDCIPPHID-NHDFVRPFCTVSFLSECDIVFGT--NLKILGAGEFVGPFAIPLPVGSV 393
Query: 218 TLLRDFAANGITHCV 232
+L A+ HCV
Sbjct: 394 LVLNGNGADVAKHCV 408
>gi|357118936|ref|XP_003561203.1| PREDICTED: uncharacterized protein LOC100826719 [Brachypodium
distachyon]
Length = 594
Score = 60.8 bits (146), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 102/258 (39%), Gaps = 24/258 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYF 98
IL G+ +FS E R +D V L EQG+ +P R R
Sbjct: 193 ILAGLELHAGVFSAAE---QRRIVDCVYGLQEQGKRGELGDRTYTEPEKWMRGKGRVTIQ 249
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y + G R + DPIP + ++I +L + ++PP+ + +S ++N
Sbjct: 250 FGCCYNYATDKNGNPPGIIRTFAS---DPIPDLFK-VMIKRLVRWCILPPDCVPDSCIVN 305
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
Y PG CI HID H F RP ++S S+ + FG +P+ G V
Sbjct: 306 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSFAIPLPAGSV 364
Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
++ A+ HCV +R I R++ P P + P + +
Sbjct: 365 LVINGNGADVAKHCV--PAVPAKRISITFRKMDPAKRPFGFKDDPELLNITPMEAASEET 422
Query: 275 ARESGGYFRSSIETYNNN 292
+R S ++ N N
Sbjct: 423 SRSSDEGKGKQLDVQNKN 440
>gi|297847176|ref|XP_002891469.1| hypothetical protein ARALYDRAFT_314327 [Arabidopsis lyrata subsp.
lyrata]
gi|297337311|gb|EFH67728.1| hypothetical protein ARALYDRAFT_314327 [Arabidopsis lyrata subsp.
lyrata]
Length = 335
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 85/191 (44%), Gaps = 14/191 (7%)
Query: 66 ALERN--IDEVSSLAEQGQ---YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLY 120
A E+N +D+V L E+GQ K R R+ FG + Y + G +
Sbjct: 114 AAEQNMIVDKVCELQEKGQKGELKRAFTARGKGRSTIQFGCCFNYRTSKAGNPAG---IL 170
Query: 121 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPI 179
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP
Sbjct: 171 KHETVDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-NHDFLRPF 228
Query: 180 ISLSLFSDSALCFGCKFNFKPIRVSEPVLY-LPVQRGCVTLLRDFAANGITHCVRPQDTQ 238
++S S+ + FG K Y LP+ G V +L A+ HCV T
Sbjct: 229 CTVSFLSECNILFGSNLKVKETGEYTGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT- 287
Query: 239 HRRAVILLRRV 249
+R I R++
Sbjct: 288 -KRISITFRKM 297
>gi|224094961|ref|XP_002310305.1| predicted protein [Populus trichocarpa]
gi|222853208|gb|EEE90755.1| predicted protein [Populus trichocarpa]
Length = 245
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
VDPIP + ++I +L + V+PP + +S ++N Y+ G CI HID H F RP ++S
Sbjct: 34 VDPIPVLFK-VIIRRLVRWHVIPPTCVPDSCIVNIYEEGDCIPPHIDN-HDFVRPFCTVS 91
Query: 184 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
S+ + FG + + + +P+ G +L AA+ HCV + H+R +
Sbjct: 92 FLSECNMLFGSHLKIVGVGDFDGPIAIPLPVGSALVLNGNAADVAKHCV---PSVHKRRI 148
Query: 244 ILLRRVLPHAPR 255
+ R + A R
Sbjct: 149 SITFRKMDEAKR 160
>gi|218197526|gb|EEC79953.1| hypothetical protein OsI_21553 [Oryza sativa Indica Group]
Length = 600
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 26/268 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQY---------KPCTVDRAPLRNKYF 98
IL G+ +FS E +R +D V L E G++ +P R R
Sbjct: 186 ILAGLELHCGVFSAAE---QKRIVDYVYDLQEMGKHGELGDRTYTEPQRWMRGKGRVTIQ 242
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y + G R DP+P+ + ++I +L + V+P I +S ++N
Sbjct: 243 FGCCYNYATDKNGNPPGIIRTI---ASDPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVN 298
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
Y PG CI HID H F RP ++S S+ + FG L +P+ G V
Sbjct: 299 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSV 357
Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
+L A+ HCV T +R I R++ P P + P +
Sbjct: 358 LILNGNGADVAKHCVPAVPT--KRISITFRKMDPAKRPFNFRDDPELLNIIPLETAVQET 415
Query: 275 ARES--GGYFRSSIETYNNNENHSNKNT 300
R S G + I+ N ++ H NK +
Sbjct: 416 GRSSDEGKGKQPDIQIRNPSKAHRNKKS 443
>gi|222634922|gb|EEE65054.1| hypothetical protein OsJ_20056 [Oryza sativa Japonica Group]
Length = 602
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 26/268 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQY---------KPCTVDRAPLRNKYF 98
IL G+ +FS E +R +D V L E G++ +P R R
Sbjct: 187 ILAGLELHCGVFSAAE---QKRIVDYVYDLQEMGKHGELGDRTYTEPQRWMRGKGRVTIQ 243
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y + G R DP+P+ + ++I +L + V+P I +S ++N
Sbjct: 244 FGCCYNYATDKNGNPPGIIRTIAS---DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVN 299
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
Y PG CI HID H F RP ++S S+ + FG L +P+ G V
Sbjct: 300 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSV 358
Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
+L A+ HCV T +R I R++ P P + P +
Sbjct: 359 LILNGNGADVAKHCVPAVPT--KRISITFRKMDPAKRPFNFRDDPELLNIIPLETAVQET 416
Query: 275 ARES--GGYFRSSIETYNNNENHSNKNT 300
R S G + I+ N ++ H NK +
Sbjct: 417 GRSSDEGKGKQPDIQIRNPSKAHRNKKS 444
>gi|15227938|ref|NP_179387.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|4406821|gb|AAD20129.1| hypothetical protein [Arabidopsis thaliana]
gi|34365749|gb|AAQ65186.1| At2g17970 [Arabidopsis thaliana]
gi|51970124|dbj|BAD43754.1| hypothetical protein [Arabidopsis thaliana]
gi|330251612|gb|AEC06706.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 507
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 17/192 (8%)
Query: 68 ERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
+R +D+V L E+G+ P R R FG Y Y G
Sbjct: 229 KRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQFGCCYNYAPDRAGNPPG--- 285
Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 177
+ EVDP+P + ++I KL + V+PP + +S ++N Y G CI HID H F R
Sbjct: 286 ILQREEVDPLPHLFK-VIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLR 343
Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
P ++S S+ + FG + +P+ G V +L A+ HCV T
Sbjct: 344 PFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT 403
Query: 238 QHRRAVILLRRV 249
+R I R++
Sbjct: 404 --KRISITFRKM 413
>gi|357445515|ref|XP_003593035.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|124360268|gb|ABN08281.1| 2OG-Fe(II) oxygenase [Medicago truncatula]
gi|355482083|gb|AES63286.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 391
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 74/161 (45%), Gaps = 14/161 (8%)
Query: 77 LAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLV 136
L E+ KP R R FG Y Y V K + EVDP+P + +
Sbjct: 182 LRERTYSKPRKWMRGKGRETLQFGCCYNYA---VDKNGNPPGISRTEEVDPLPRMFKQM- 237
Query: 137 ITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 195
I ++ + ++PP + +S ++N Y G CI HID H F RP S+S + + + FG
Sbjct: 238 IKRMVRWKIIPPTCVPDSCIVNIYDVGDCIPPHID-HHDFVRPFYSVSFLNKAKILFGS- 295
Query: 196 FNFKPIRVSE----PVLYLPVQRGCVTLLRDFAANGITHCV 232
N K +R E + LPV G V +L+ A+ HC+
Sbjct: 296 -NLKAVRPGEFFGPASISLPV--GSVFVLKGNGADIAKHCI 333
>gi|115466278|ref|NP_001056738.1| Os06g0138200 [Oryza sativa Japonica Group]
gi|55296462|dbj|BAD68658.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein-like [Oryza
sativa Japonica Group]
gi|113594778|dbj|BAF18652.1| Os06g0138200 [Oryza sativa Japonica Group]
Length = 616
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 26/268 (9%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQY---------KPCTVDRAPLRNKYF 98
IL G+ +FS E +R +D V L E G++ +P R R
Sbjct: 201 ILAGLELHCGVFSAAE---QKRIVDYVYDLQEMGKHGELGDRTYTEPQRWMRGKGRVTIQ 257
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y + G R DP+P+ + ++I +L + V+P I +S ++N
Sbjct: 258 FGCCYNYATDKNGNPPGIIRTIAS---DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVN 313
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
Y PG CI HID H F RP ++S S+ + FG L +P+ G V
Sbjct: 314 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSV 372
Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
+L A+ HCV T +R I R++ P P + P +
Sbjct: 373 LILNGNGADVAKHCVPAVPT--KRISITFRKMDPAKRPFNFRDDPELLNIIPLETAVQET 430
Query: 275 ARES--GGYFRSSIETYNNNENHSNKNT 300
R S G + I+ N ++ H NK +
Sbjct: 431 GRSSDEGKGKQPDIQIRNPSKAHRNKKS 458
>gi|145328752|ref|NP_001077911.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|334184285|ref|NP_001189546.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330251613|gb|AEC06707.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330251614|gb|AEC06708.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 433
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 17/192 (8%)
Query: 68 ERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
+R +D+V L E+G+ P R R FG Y Y G
Sbjct: 155 KRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQFGCCYNYAPDRAGNPPG--- 211
Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 177
+ EVDP+P + ++I KL + V+PP + +S ++N Y G CI HID H F R
Sbjct: 212 ILQREEVDPLPHLFK-VIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLR 269
Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
P ++S S+ + FG + +P+ G V +L A+ HCV T
Sbjct: 270 PFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT 329
Query: 238 QHRRAVILLRRV 249
+R I R++
Sbjct: 330 --KRISITFRKM 339
>gi|255579797|ref|XP_002530736.1| conserved hypothetical protein [Ricinus communis]
gi|223529700|gb|EEF31642.1| conserved hypothetical protein [Ricinus communis]
Length = 511
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 92/209 (44%), Gaps = 15/209 (7%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
IL+G+ +FS E + + E+ + ++G+ K P R R FG
Sbjct: 228 ILEGLELHTCVFSAVEQNRIVNYVHELEEMGKRGELKERTYTAPQKWMRGKGRVTIQFGC 287
Query: 102 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQ 160
Y Y + K + VDPIP + ++I +L + V+PP + +S ++N Y
Sbjct: 288 CYNYATD---KNGNPPGILQSETVDPIPHLFK-VIIRRLIKWHVLPPTCVPDSCIVNIYD 343
Query: 161 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
G CI HID H F+RP ++S S+ + FG + +P+ G V +L
Sbjct: 344 EGDCIPPHID-NHDFNRPFCTVSFVSECNIVFGSNLKVLGPGEFSGSIAIPLPVGSVLVL 402
Query: 221 RDFAANGITHCVRPQDTQHRRAVILLRRV 249
AA+ HCV T +R I R++
Sbjct: 403 DGNAADVSKHCVPSVPT--KRISITFRKM 429
>gi|297836514|ref|XP_002886139.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297331979|gb|EFH62398.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 17/192 (8%)
Query: 68 ERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
+R +D+V L E+G+ P R R FG Y Y G
Sbjct: 224 KRIVDQVYQLQEKGRKGELKKRTFTAPHKWMRGKGRETIQFGCCYNYAPDRAGNPPG--- 280
Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 177
+ EVDP+P + ++I +L + V+PP + +S ++N Y CI HID H F R
Sbjct: 281 ILQREEVDPLPPLFK-VIIRRLIKWHVLPPTCVPDSCIVNIYDESDCIPPHID-NHDFLR 338
Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
P ++S S+ + FG N + +P+ G V +L A+ HCV T
Sbjct: 339 PFCTISFLSECDILFGSNLNVEGPGEFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT 398
Query: 238 QHRRAVILLRRV 249
+R I R++
Sbjct: 399 --KRISITFRKM 408
>gi|298204538|emb|CBI23813.3| unnamed protein product [Vitis vinifera]
Length = 439
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 84/192 (43%), Gaps = 13/192 (6%)
Query: 44 ELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGY 103
EL ++ Q + EC+ + + + L E+ +P R R FG Y
Sbjct: 162 ELHTRVFNSEEQKKIV---ECVYNLQRMGQKGMLRERTYSEPKKWMRGKGRVTIQFGCCY 218
Query: 104 TYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPG 162
Y V K + E EVDP+P + + I ++ + ++PP + NS ++N Y G
Sbjct: 219 NYA---VDKNGNPPGIIREEEVDPLPPLFKQM-IKRMVRWHILPPTCVPNSCIVNIYDEG 274
Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLL 220
CI HID H F RP ++S ++ + FG S PV + LP +G V +L
Sbjct: 275 DCIPPHID-HHDFLRPFCTVSFLTECNILFGSSLKILDAGEFSGPVSISLP--KGSVLIL 331
Query: 221 RDFAANGITHCV 232
A+ HCV
Sbjct: 332 NGNGADVAKHCV 343
>gi|225447996|ref|XP_002270229.1| PREDICTED: uncharacterized protein LOC100256702 [Vitis vinifera]
Length = 457
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/192 (28%), Positives = 84/192 (43%), Gaps = 13/192 (6%)
Query: 44 ELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGY 103
EL ++ Q + EC+ + + + L E+ +P R R FG Y
Sbjct: 180 ELHTRVFNSEEQKKIV---ECVYNLQRMGQKGMLRERTYSEPKKWMRGKGRVTIQFGCCY 236
Query: 104 TYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPG 162
Y V K + E EVDP+P + + I ++ + ++PP + NS ++N Y G
Sbjct: 237 NYA---VDKNGNPPGIIREEEVDPLPPLFKQM-IKRMVRWHILPPTCVPNSCIVNIYDEG 292
Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLL 220
CI HID H F RP ++S ++ + FG S PV + LP +G V +L
Sbjct: 293 DCIPPHID-HHDFLRPFCTVSFLTECNILFGSSLKILDAGEFSGPVSISLP--KGSVLIL 349
Query: 221 RDFAANGITHCV 232
A+ HCV
Sbjct: 350 NGNGADVAKHCV 361
>gi|302775126|ref|XP_002970980.1| hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]
gi|300160962|gb|EFJ27578.1| hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]
Length = 307
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 93/209 (44%), Gaps = 35/209 (16%)
Query: 43 RELRQKILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
+ +R ILQG+ ++FS E R +D V L +GQ K+ G+
Sbjct: 78 KGMRVNILQGLELHTNIFSAAE---QSRLLDMVFELELKGQRNELKERTYSAPRKWMQGK 134
Query: 102 G---------YTYGSQLVRKGLGNE-RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
G Y Y GN + + EVDP+P ++S I +L + V+ P +
Sbjct: 135 GRVTLQFGCCYNYS----HDKFGNTPGILQDEEVDPLPPLLKS-TIKRLVRWHVLSPTCV 189
Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLY- 209
+S ++N Y+ G CI HID H F RP ++SL S ++ FG + V+ P +
Sbjct: 190 PDSCIVNIYEIGDCIPPHID-HHDFLRPFCTVSLLSQCSIVFGSSLS-----VAAPGEFD 243
Query: 210 ------LPVQRGCVTLLRDFAANGITHCV 232
LPV G V +L D A+ HC+
Sbjct: 244 GSFSTQLPV--GSVLVLNDNGADVAKHCI 270
>gi|357445523|ref|XP_003593039.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|124360274|gb|ABN08287.1| 2OG-Fe(II) oxygenase [Medicago truncatula]
gi|355482087|gb|AES63290.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 497
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 65/129 (50%), Gaps = 9/129 (6%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
EVDP+P + + I ++ + ++PP + +S ++N Y G CI HID H F RP S+
Sbjct: 285 EVDPLPDVFKQM-IKRMVRWNIIPPTCVPDSCIVNIYDVGDCIPPHID-HHDFVRPFYSV 342
Query: 183 SLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHR 240
S +++ + FG +P S P + LP+ G V +L A+ HC+ +
Sbjct: 343 SFLNEAKILFGSNLKEIQPGEFSGPASISLPL--GSVFVLNGNGADIAKHCI--PSVSSK 398
Query: 241 RAVILLRRV 249
R I R++
Sbjct: 399 RISITFRKM 407
>gi|79319564|ref|NP_001031159.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|52354159|gb|AAU44400.1| hypothetical protein AT1G48980 [Arabidopsis thaliana]
gi|332194254|gb|AEE32375.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 327
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 82/186 (44%), Gaps = 12/186 (6%)
Query: 69 RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
R +D+V L E+ G+ K + R+ FG + Y + G + V
Sbjct: 114 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 170
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 171 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 228
Query: 185 FSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
S+ + FG + S LP+ G V +L A+ HCV T +R
Sbjct: 229 LSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT--KRIS 286
Query: 244 ILLRRV 249
I R++
Sbjct: 287 ITFRKM 292
>gi|79361742|ref|NP_175328.2| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|52354161|gb|AAU44401.1| hypothetical protein AT1G48980 [Arabidopsis thaliana]
gi|60547627|gb|AAX23777.1| hypothetical protein At1g48980 [Arabidopsis thaliana]
gi|332194253|gb|AEE32374.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 331
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 82/186 (44%), Gaps = 12/186 (6%)
Query: 69 RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
R +D+V L E+ G+ K + R+ FG + Y + G + V
Sbjct: 118 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 174
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 175 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 232
Query: 185 FSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
S+ + FG + S LP+ G V +L A+ HCV T +R
Sbjct: 233 LSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT--KRIS 290
Query: 244 ILLRRV 249
I R++
Sbjct: 291 ITFRKM 296
>gi|186489643|ref|NP_001117455.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|332194255|gb|AEE32376.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 325
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 75/169 (44%), Gaps = 10/169 (5%)
Query: 69 RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
R +D+V L E+ G+ K + R+ FG + Y + G + V
Sbjct: 118 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 174
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 175 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 232
Query: 185 FSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
S+ + FG + S LP+ G V +L A+ HCV
Sbjct: 233 LSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCV 281
>gi|242074876|ref|XP_002447374.1| hypothetical protein SORBIDRAFT_06g033940 [Sorghum bicolor]
gi|241938557|gb|EES11702.1| hypothetical protein SORBIDRAFT_06g033940 [Sorghum bicolor]
Length = 345
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 92/198 (46%), Gaps = 24/198 (12%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQ----------YKPCTVDRAPLRNKY 97
ILQG+ +FS E +R +D V L ++G+ +P R R
Sbjct: 33 ILQGLELHTGVFSPAE---QQRIVDLVHDLQDKGRRGLLRGERTYSEPRKWMRGKGRATI 89
Query: 98 FFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVI 156
FG Y Y + R G + E E DP+P ++S+V +L V+PP+ + +S ++
Sbjct: 90 QFGCCYNYATD--RDGNPPGIIRRE-EADPLPPPLKSMV-RRLVSWRVLPPDCVPDSCIV 145
Query: 157 NDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQR 214
N Y G CI H+D H F RP ++S ++ A+ FG + P + PV + LP
Sbjct: 146 NVYDVGDCIPPHVD-HHDFARPFCTVSFLAECAILFGRELRVVGPGEFAGPVSISLPP-- 202
Query: 215 GCVTLLRDFAANGITHCV 232
G V +L A+ HCV
Sbjct: 203 GSVLVLAGNGADVAKHCV 220
>gi|413953301|gb|AFW85950.1| hypothetical protein ZEAMMB73_605385 [Zea mays]
Length = 614
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 15/132 (11%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DP+P +++ I +L + V+P + +S ++N Y+PG CI HID H F RP ++S
Sbjct: 281 DPMPDLFKTM-IKRLVRWQVLPATCVPDSCIVNIYEPGDCIPPHID-SHDFVRPFCTVSF 338
Query: 185 FSDSALCFGCKFNFKPIRVSEP-----VLYLPVQRGCVTLLRDFAANGITHCVRPQDTQH 239
S+ + FG ++V+ P + +P+ G V +L A+ HCV T
Sbjct: 339 LSECNILFGPN-----LKVAAPGEFIGSIAIPLPVGSVLVLNGNGADVAKHCVPAVPT-- 391
Query: 240 RRAVILLRRVLP 251
+R I R++ P
Sbjct: 392 KRISITFRKMDP 403
>gi|224134110|ref|XP_002327758.1| predicted protein [Populus trichocarpa]
gi|222836843|gb|EEE75236.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 61/126 (48%), Gaps = 5/126 (3%)
Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
VDPIP + ++I +L Q V+PP + +S ++N Y+ CI HID H F RP ++S
Sbjct: 2 VDPIPDLFK-VIIRRLVQWHVIPPTCVPDSCIVNIYEEWDCIPPHID-SHDFVRPFCTVS 59
Query: 184 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
S + FG + + +P+ G V +L A+ HCV T +R
Sbjct: 60 FLSQCNILFGSNLKVVGAGDFDGPIAIPLPVGSVLVLNGNGADVAKHCVPSVPT--KRIS 117
Query: 244 ILLRRV 249
I R++
Sbjct: 118 ITFRKM 123
>gi|440792230|gb|ELR13458.1| 2OG-Fe(II) oxygenase family oxidoreductase [Acanthamoeba
castellanii str. Neff]
Length = 214
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 9/100 (9%)
Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 211
+ +IN+Y PG I +H+D P +FD I S+SL S + +F K + + PVL
Sbjct: 110 DQVIINEYLPGQGISAHVDKPSLFDNEIASISLGSTCVM----EFKHKATKTTHPVL--- 162
Query: 212 VQRGCVTLLRDFAANGITHCV--RPQDTQHRRAVILLRRV 249
+ R + L++ A THC+ R +D R + RRV
Sbjct: 163 LGRRSLVLMKGAARYEWTHCIPARKKDKVDGRQIPRQRRV 202
>gi|356525956|ref|XP_003531587.1| PREDICTED: uncharacterized protein LOC100306211 [Glycine max]
Length = 423
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 10/153 (6%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
FG Y Y V K + + EVDP+P + + I ++ + ++P I +S ++N
Sbjct: 229 FGCCYNYA---VDKNGSPPGIMRDEEVDPLPPVFKQM-IKRMVRWNIIPSTCIPDSCIVN 284
Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGC 216
Y+ G CI HID H F RP ++S ++ + FG P + PV +P+ G
Sbjct: 285 IYEEGDCIPPHID-HHDFVRPFYTVSFLNECKILFGSNLQVVCPGEFAGPV-SIPLPVGS 342
Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
V +L A+ HC+ +R I R++
Sbjct: 343 VFVLNGNGADIAKHCI--PSVSSKRISITFRKM 373
>gi|148906930|gb|ABR16610.1| unknown [Picea sitchensis]
Length = 496
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 3/110 (2%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
EVDP+P ++ I ++ + V+PP+ I +S ++N Y G CI HID H F RP ++
Sbjct: 290 EVDPLPPLFKA-AIRRMVRWHVLPPSCIPDSCIVNIYDEGDCIPPHID-HHDFVRPFCTV 347
Query: 183 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
SL S+ + FG +P+ G V +L A+ H V
Sbjct: 348 SLLSECNIIFGSNLKILGPGEFAGSTAIPLPMGSVLILNGNGADVAKHSV 397
>gi|297793433|ref|XP_002864601.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310436|gb|EFH40860.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 302
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 99/218 (45%), Gaps = 22/218 (10%)
Query: 45 LRQKILQGVNQMSL----FSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFG 100
+++KI+ ++ + L FS +E +R ++EV L E+G+ K+ G
Sbjct: 68 VKEKIVNVLDGLELHTGVFSAKE---QKRIVNEVYQLQEKGRKGELKEHTFNAPRKWMRG 124
Query: 101 EGYT-------YGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-N 152
+G Y + R G L E EVDP+P + ++I +L + V+PP + +
Sbjct: 125 KGRVTIQFGCCYNYAIDRSGNPPGILQRE-EVDPLPPLFK-VMIRRLIKWHVLPPTCVPD 182
Query: 153 SAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFK-PIRVSEPVLYLP 211
S ++N Y G CI HID H F RP ++S S+ + FG + P S P P
Sbjct: 183 SCIVNIYDEGDCIPPHID-NHDFLRPFCTISFLSECDILFGSNLKTEGPGEFSGP-YSTP 240
Query: 212 VQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
+ G V +L A+ H V T +R I R++
Sbjct: 241 LPVGSVLVLNGNGADVAKHYVPAVPT--KRISITFRKM 276
>gi|308810579|ref|XP_003082598.1| unnamed protein product [Ostreococcus tauri]
gi|116061067|emb|CAL56455.1| unnamed protein product [Ostreococcus tauri]
Length = 544
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 85/189 (44%), Gaps = 13/189 (6%)
Query: 52 GVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKP---CTVDRAPLRNKYFFGEGYTYGS 107
GVN M + +E ER I E++ A QG T R K FFG Y S
Sbjct: 327 GVNVMEGALTDDEIHEAERGILELTRKALQGDLGGDPFVTKAGTTSRIKLFFGFAYEKRS 386
Query: 108 QLVRKGLGNERLYPE--GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY-QPGG 163
K +RL G DP+ A + L T L++ GV P+++ N V+N Y + G
Sbjct: 387 DSRNK---PQRLIKNVPGIDDPLAAPLHKLAET-LQKRGVFAPDFVPNQYVLNIYGRAGA 442
Query: 164 CIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDF 223
+++H D H+F+ PI + LF+ L F + + I V ++ + G VT + F
Sbjct: 443 YLMAHKDALHLFEGPIYGVRLFNPRILSFAPDGHMR-INVDRGMIDVMQSVGSVTSMDGF 501
Query: 224 AANGITHCV 232
A + H +
Sbjct: 502 AKEDVQHSI 510
>gi|313240624|emb|CBY32949.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%)
Query: 212 VQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+QRG +T N I HCVRP+DT+ RR I+LRRV + P L
Sbjct: 1 MQRGIITSFEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 45
>gi|7770341|gb|AAF69711.1|AC016041_16 F27J15.23 [Arabidopsis thaliana]
Length = 264
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 61/129 (47%), Gaps = 9/129 (6%)
Query: 69 RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
R +D+V L E+ G+ K + R+ FG + Y + G + V
Sbjct: 118 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 174
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 175 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 232
Query: 185 FSDSALCFG 193
S+ + FG
Sbjct: 233 LSECNILFG 241
>gi|3036813|emb|CAA18503.1| hypothetical protein [Arabidopsis thaliana]
gi|7270561|emb|CAB81518.1| hypothetical protein [Arabidopsis thaliana]
Length = 505
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 111/283 (39%), Gaps = 60/283 (21%)
Query: 49 ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
IL+G+ +FS + ++ +D V L E+G+ R LR + F
Sbjct: 216 ILEGLELHTGVFS---AVEQKKIVDFVYELQEKGR-------RGELRERTFTAPHKWMRG 265
Query: 99 -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
FG Y Y G + G+VDP+P+ + + I
Sbjct: 266 KGRVTIQFGCCYNYAPDKAGNPPG---ILQRGDVDPMPSIFKVI---------------I 307
Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSE--PVLY 209
S ++N Y+ CI HID H F RP ++S S+ + FG N K + E
Sbjct: 308 KSCIVNIYEEDDCIPPHID-NHDFLRPFCTVSFLSECNILFGS--NLKVLGPGEFSGSYS 364
Query: 210 LPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVK---- 265
+P+ G V +L+ A+ HCV T +R I R++ + + P ++
Sbjct: 365 IPLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLEEIKP 422
Query: 266 -PY-HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSNGLDK 306
PY H+ A RSSI+ N NH+N+ G K
Sbjct: 423 LPYEHTTPSTPADAVISSSRSSID--QNGSNHNNRTAHGGGSK 463
>gi|395836690|ref|XP_003791285.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Otolemur garnettii]
Length = 174
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 219 LLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 35 LFSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 72
>gi|440789981|gb|ELR11271.1| DNA repair protein alkB, putative [Acanthamoeba castellanii str.
Neff]
Length = 253
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 20/134 (14%)
Query: 128 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 187
+P + Q +I K+ Q G+ P + + +IN+Y PG I HID H F ++SLSL S
Sbjct: 127 LPVFAQQ-IIEKMRQRGL--PQF-DQLIINEYTPGQGINPHIDKTHCFGPCVVSLSLLST 182
Query: 188 SALCFGCKFNFKPIRVSE--PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVIL 245
C F + E PV+ P + +LR A G H + P +RA I+
Sbjct: 183 ------CVMTFTSLETGEKIPVVLRPR---SLVVLRGQARYGWQHGIEP-----KRADIV 228
Query: 246 LRRVLPHAPRLTLS 259
+ P A R++L+
Sbjct: 229 AGKHTPRARRVSLT 242
>gi|397614206|gb|EJK62658.1| hypothetical protein THAOC_16720 [Thalassiosira oceanica]
Length = 396
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKF------NFKPIRVSE 205
+S +IN Y G CI H+D ++RPI +LSL + A+ G KF +KPI V +
Sbjct: 208 DSCIINQYGEGDCIPPHVDH-ESYERPISTLSLLGEEAMLLGTKFKTVKNCTWKPI-VGQ 265
Query: 206 PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLP 251
VL LP R + +L + N HC+ RR I LR+ P
Sbjct: 266 SVL-LP--RRSLLVLGGNSGNISKHCI--SACTRRRISITLRKQPP 306
>gi|345305229|ref|XP_001510391.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Ornithorhynchus anatinus]
Length = 150
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 27/34 (79%)
Query: 223 FAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 13 YAADEITHCIRPQDIRERRAVIILRKTRLDAPRL 46
>gi|71425222|ref|XP_813050.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877899|gb|EAN91199.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 305
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 21/219 (9%)
Query: 88 VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
V++ LR Y + YT S +R L + R ++ P+P +I+++EQ G+VP
Sbjct: 45 VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 96
Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
++N+ N Y+PG I +HID ++D +SL S++ L F N + + P
Sbjct: 97 DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 155
Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
G + ++ A H V P + Q R +++ R +L TP +
Sbjct: 156 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 208
Query: 266 PY---HSLYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
PY H L + A++ GG + + Y E +T+
Sbjct: 209 PYRSTHILNTLYAKQIGGVRVTVDDNYLEKEGIGAFDTA 247
>gi|42567443|ref|NP_195332.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661215|gb|AEE86615.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 385
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 3/72 (4%)
Query: 123 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 181
G+VDP+P+ I ++I +L V+PP + +S ++N Y+ CI HID H F RP +
Sbjct: 294 GDVDPMPS-IFKVIIKRLVGWHVLPPTCVPDSCIVNIYEEDDCIPPHID-NHDFLRPFCT 351
Query: 182 LSLFSDSALCFG 193
+S S+ + FG
Sbjct: 352 VSFLSECNILFG 363
>gi|11094804|gb|AAG29733.1|AC084414_1 hypothetical protein, 3' partial [Arabidopsis thaliana]
Length = 235
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 9/123 (7%)
Query: 69 RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
R +D+V L E+ G+ K + R+ FG + Y + G + V
Sbjct: 117 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 173
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 174 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 231
Query: 185 FSD 187
S+
Sbjct: 232 LSE 234
>gi|407408875|gb|EKF32135.1| hypothetical protein MOQ_004021 [Trypanosoma cruzi marinkellei]
Length = 305
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 21/219 (9%)
Query: 88 VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
V++ LR Y + YT S +R L + R ++ P+P +I+++EQ G+VP
Sbjct: 45 VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 96
Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
++N+ N Y+PG I +HID ++D +SL S++ L F N + + P
Sbjct: 97 DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 155
Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
G + ++ A H V P + Q R +++ R +L TP +
Sbjct: 156 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 208
Query: 266 PYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
PY S L + A++ GG + + Y E +T+
Sbjct: 209 PYRSTQILNTLYAKQIGGVRVTVDDNYLEKEEIGAFDTA 247
>gi|407843604|gb|EKG01501.1| hypothetical protein TCSYLVIO_007502 [Trypanosoma cruzi]
Length = 354
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 21/219 (9%)
Query: 88 VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
V++ LR Y + YT S +R L + R ++ P+P +I+++EQ G+VP
Sbjct: 94 VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 145
Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
++N+ N Y+PG I +HID ++D +SL S++ L F N + + P
Sbjct: 146 DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 204
Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
G + ++ A H V P + Q R +++ R +L TP +
Sbjct: 205 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 257
Query: 266 PYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
PY S L + A++ GG + + Y E +T+
Sbjct: 258 PYRSTQILNTLYAKQIGGVRVTVDDNYLEKEGIGAFDTA 296
>gi|428311031|ref|YP_007122008.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
gi|428252643|gb|AFZ18602.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
Length = 206
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 66/151 (43%), Gaps = 22/151 (14%)
Query: 99 FGEGYTY--GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVI 156
+G Y Y GS + LGN +P W Q++ + E +V N ++
Sbjct: 60 YGYKYDYKDGSFVASTHLGN-----------LPDWAQNVAVRLAEDGFMV--NVPEQVIV 106
Query: 157 NDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGC 216
N+YQPG IVSH D F II+LSL S+ C NF + + V L +Q G
Sbjct: 107 NEYQPGQGIVSHTDCIPCFGNTIITLSLGSE------CVMNFTHSQTQKEVGIL-LQAGS 159
Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLR 247
+ + + A H + P+ + + I +R
Sbjct: 160 LLIFKGEARYIWKHGIVPRKRDNYKGRIFMR 190
>gi|357442687|ref|XP_003591621.1| hypothetical protein MTR_1g089860 [Medicago truncatula]
gi|355480669|gb|AES61872.1| hypothetical protein MTR_1g089860 [Medicago truncatula]
Length = 513
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 101/218 (46%), Gaps = 17/218 (7%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGL--- 114
+F+ E L +DE+ + + G+ T L NK G +L++ G+
Sbjct: 202 VFTDSELCKLSDFVDEIHTAGQNGELSGETFI---LFNKQMKGNK----RELIQLGVPIF 254
Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
G + + ++PIPA +Q VI Q ++P N +IN ++ G + PPH
Sbjct: 255 GQIKEDTKSNIEPIPALLQR-VIDHFIQWHLLPEYKRPNGCIINFFEEGEFSQPFLKPPH 313
Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
+ D+P+ +L L S+S + FG + + L L +++G + ++R +A+ H +
Sbjct: 314 L-DQPVTTL-LLSESTMAFGRILMSENDGNYKGPLMLSLKKGSLLVMRGNSADMARHVMC 371
Query: 234 PQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLY 271
P + +RR I RV P + + S TP + +++
Sbjct: 372 P--SPNRRVSITFFRVRPDSNQCQ-SPTPTMTSAMTMW 406
>gi|328772232|gb|EGF82271.1| hypothetical protein BATDEDRAFT_87049 [Batrachochytrium
dendrobatidis JAM81]
Length = 225
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 19/137 (13%)
Query: 128 IPAWIQSLVITK--LEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+PA+ S VI + L + V P + N ++N+YQPG I+ H+D F + SLSL+
Sbjct: 95 LPAF-SSFVIDRMLLPEFNVFPNDPPNHVLVNEYQPGQGIMPHVDSQDTFGDVVTSLSLW 153
Query: 186 SDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT-------- 237
S + FG K + + + P R + +L A TH + +D
Sbjct: 154 SSCVMSFGNKMTGEKVHLELP-------RRSLLILTGDARTHYTHAIPKEDMLFAGNECV 206
Query: 238 -QHRRAVILLRRVLPHA 253
+ RR + +R +L A
Sbjct: 207 DRGRRVSLTIRSILKSA 223
>gi|343474281|emb|CCD14041.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 305
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 13/180 (7%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
P+P + ++ ++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S
Sbjct: 76 PLPTGSFTPLLDRIEQDGIVPRGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAVVSLGS 135
Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
++ L F N + + V P G + ++ + H V P + Q R +V+
Sbjct: 136 NALLRFVHVQNGEELDVVIP-------DGSLYIMSGPSRYVYFHMVLPVEAQ-RVSVVFR 187
Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
R +L TP + PY S L + A++ GG ++Y E +T+
Sbjct: 188 RSILNSDGGFRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRIVMDDSYLEKEEIGAFDTA 247
>gi|342180903|emb|CCC90378.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 305
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 13/180 (7%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
P+P + ++ ++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S
Sbjct: 76 PLPTGSFTPLLGRIEQDGIVPRGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAVVSLGS 135
Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
++ L F N + + V P G + ++ + H V P + Q R +V+
Sbjct: 136 NALLRFVHVQNGEELDVVIP-------DGSLYIMSGPSRYVYFHMVLPVEAQ-RVSVVFR 187
Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
R +L TP + PY S L + A++ GG ++Y E +T+
Sbjct: 188 RSILNSDGGFRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRIVMDDSYLEKEEIGAFDTA 247
>gi|219883161|ref|YP_002478323.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
gi|219867286|gb|ACL47624.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
Length = 204
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 13/117 (11%)
Query: 76 SLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSL 135
+L E+ +P D R +G Y Y ++ + PE + +P W++ L
Sbjct: 41 TLLEEIDQQPWITDLK--RRVQHYGYRYDYKARAIS---------PEAYLGTLPEWLKPL 89
Query: 136 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
+L Q G +P + + ++N+Y PG I +HID F I+SLSL SD + F
Sbjct: 90 T-NRLWQEGYIP-DLPDQVIVNEYIPGQGITAHIDCIDCFSDTILSLSLGSDCIMRF 144
>gi|30690892|ref|NP_182329.2| oxidoreductase, 2OG-Fe(II) oxygenase-like protein [Arabidopsis
thaliana]
gi|330255838|gb|AEC10932.1| oxidoreductase, 2OG-Fe(II) oxygenase-like protein [Arabidopsis
thaliana]
Length = 438
Score = 47.0 bits (110), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 58/229 (25%), Positives = 105/229 (45%), Gaps = 39/229 (17%)
Query: 38 SKHHARELRQKILQGVNQ-MSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNK 96
+K H R +++G+ +F+ + L + +D ++ L E G+ + + L NK
Sbjct: 153 AKEHVRGHTANVVKGLKLYQDVFTRPQ---LSKLLDSINQLREAGRNHQLSGETFVLFNK 209
Query: 97 YFFGEGYTYGSQLVRKGL---GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINS 153
G +L++ G+ GN E V+PIP +QS VI L Q ++P
Sbjct: 210 NTKGTKR----ELLQLGVPIFGNT--TDEHSVEPIPTLVQS-VIDHLLQWRLIP------ 256
Query: 154 AVINDYQ-PGGCIVSHID----------PPHIFDRPIISLSLFSDSALCFGCKFNFKPIR 202
+Y+ P GC+++ D PPH+ D+PI +L L S+S + FG +
Sbjct: 257 ----EYKRPNGCVINFFDEDEHSQPFQKPPHV-DQPISTLVL-SESTMVFGHRLGVDNDG 310
Query: 203 VSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLP 251
L LP++ G + ++R +A+ H + P + ++R I ++ P
Sbjct: 311 NFRGSLTLPLKEGSLLVMRGNSADMARHVMCP--SPNKRVAITFFKLKP 357
>gi|115345714|ref|YP_771891.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
gi|115293033|gb|ABI93483.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
Length = 195
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 13/120 (10%)
Query: 120 YPEGEVDPIPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 177
+ E + P+P QSL +T VP + ++N+YQPG I +HID F
Sbjct: 64 WREDYLGPLPELFQSLAERLTAEGHFQTVP----DQVIVNEYQPGQGISAHIDCQPCFGE 119
Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
I SLSL S + F R+ + L +Q + +L+ A + TH + P+ T
Sbjct: 120 TIASLSLLSACVMRFAS-------RIYSQQMELHLQPSSLLVLQSDARHLWTHAIPPRKT 172
>gi|71423866|ref|XP_812599.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877399|gb|EAN90748.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 305
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 96/219 (43%), Gaps = 21/219 (9%)
Query: 88 VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
V++ LR Y + YT S +R L + R ++ P+P +I+++EQ G+VP
Sbjct: 45 VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 96
Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
++N+ N Y+PG I +HID ++D +SL S++ L F N + + P
Sbjct: 97 DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 155
Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
G + ++ A H V P + Q R +++ R +L TP +
Sbjct: 156 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 208
Query: 266 PYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
PY S L + ++ GG + + Y E +T+
Sbjct: 209 PYRSTQILNTLYTKQIGGVRVTVDDNYLEKEGIGAFDTA 247
>gi|297828529|ref|XP_002882147.1| hypothetical protein ARALYDRAFT_322448 [Arabidopsis lyrata subsp.
lyrata]
gi|297327986|gb|EFH58406.1| hypothetical protein ARALYDRAFT_322448 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 46.2 bits (108), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 88/196 (44%), Gaps = 29/196 (14%)
Query: 67 LERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVD 126
L + +D ++ L E G+ + + L NK G QL GN E V+
Sbjct: 187 LSKLLDSINQLREAGRNHQLSGETFVLFNKNTKGTKREL-LQLGVPIFGNT--TDEHSVE 243
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID----------PPHIF 175
PIP +QS VI L Q V+P +Y+ P GC+++ D PPH+
Sbjct: 244 PIPTLVQS-VIDHLLQWRVIP----------EYKRPNGCVINFFDEDEHSQPFQKPPHV- 291
Query: 176 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQ 235
D+PI +L L S+S + FG + L LP++ G + ++R +A+ H + P
Sbjct: 292 DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARHVMCP- 349
Query: 236 DTQHRRAVILLRRVLP 251
+ ++R I ++ P
Sbjct: 350 -SPNKRVAITFFKLKP 364
>gi|4249414|gb|AAD13711.1| unknown protein [Arabidopsis thaliana]
Length = 433
Score = 45.8 bits (107), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 88/196 (44%), Gaps = 29/196 (14%)
Query: 67 LERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVD 126
L + +D ++ L E G+ + + L NK G QL GN E V+
Sbjct: 175 LSKLLDSINQLREAGRNHQLSGETFVLFNKNTKGTKREL-LQLGVPIFGNT--TDEHSVE 231
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID----------PPHIF 175
PIP +QS VI L Q ++P +Y+ P GC+++ D PPH+
Sbjct: 232 PIPTLVQS-VIDHLLQWRLIP----------EYKRPNGCVINFFDEDEHSQPFQKPPHV- 279
Query: 176 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQ 235
D+PI +L L S+S + FG + L LP++ G + ++R +A+ H + P
Sbjct: 280 DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARHVMCP- 337
Query: 236 DTQHRRAVILLRRVLP 251
+ ++R I ++ P
Sbjct: 338 -SPNKRVAITFFKLKP 352
>gi|356502827|ref|XP_003520217.1| PREDICTED: uncharacterized protein LOC100791586 [Glycine max]
Length = 514
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 94/200 (47%), Gaps = 16/200 (8%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGL--- 114
+F+ E L ++E+ + + G+ T L NK G +L++ G+
Sbjct: 201 IFTDSELCKLTDFVNEIHAAGQNGELSGETFI---LFNKQMKGNKR----ELIQLGVPIF 253
Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
G + + ++PIP +Q VI L Q ++P N +IN ++ G + PPH
Sbjct: 254 GQIKEDAKSNIEPIPVLLQG-VIDHLIQWQLLPEYKRPNGCIINFFEKGEFSQPFLKPPH 312
Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
+ D+P+ +L L S+SA+ FG + + L L +++G + ++R +A+ H +
Sbjct: 313 L-DQPVSTL-LLSESAMAFGRILMSENDGNYKGPLTLSLKQGSLLVMRGNSADMARHVMC 370
Query: 234 PQDTQHRRAVILLRRVLPHA 253
P + +RR I RV P +
Sbjct: 371 P--SPNRRVSITFFRVRPDS 388
>gi|146100771|ref|XP_001468941.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398023105|ref|XP_003864714.1| hypothetical protein, conserved [Leishmania donovani]
gi|134073310|emb|CAM72036.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322502950|emb|CBZ38034.1| hypothetical protein, conserved [Leishmania donovani]
Length = 318
Score = 45.4 bits (106), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 83/185 (44%), Gaps = 13/185 (7%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
P+P ++ +LEQ VVP N++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPPASFMPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
R ++ +TP + PY + L + +++ GG S + + + N +TS
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESANIGAFDTS 250
Query: 302 NGLDK 306
+ +
Sbjct: 251 RWVKR 255
>gi|445456942|ref|ZP_21446201.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
baumannii OIFC047]
gi|444777446|gb|ELX01476.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
baumannii OIFC047]
Length = 200
Score = 45.1 bits (105), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 4/67 (5%)
Query: 127 PIPAWIQSLVITKLEQMGV-VPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
PIP+W+ + +++ + +PPN + +IN+Y+PG I HID P F II +SL
Sbjct: 76 PIPSWLIRMQKDLMDECSLDLPPNQL---IINEYEPGQGITDHIDAPDEFGETIIMVSLG 132
Query: 186 SDSALCF 192
S + F
Sbjct: 133 SSCVMDF 139
>gi|356570554|ref|XP_003553450.1| PREDICTED: uncharacterized protein LOC100776159 [Glycine max]
Length = 525
Score = 45.1 bits (105), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 89/198 (44%), Gaps = 16/198 (8%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFG---EGYTYGSQLVRKGL 114
+FS E L + E+ + A+ G+ T L NK G E G + R+
Sbjct: 216 VFSESEICKLTDFVKEIHAAAQNGELSGETFI---LFNKQMKGNKRELIQLGVPIFRQIK 272
Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
+ + ++PIPA + VI L Q ++P N +IN ++ G + PPH
Sbjct: 273 DDNK----SNIEPIPALLHD-VIDHLIQWKLIPEYKRPNGCIINFFEEGEFSQPFLKPPH 327
Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
+ D+P+ +L L S+S + FG + + L L ++ G + ++R +A+ H +
Sbjct: 328 L-DQPLSTL-LLSESTMAFGRILMSENDGNYKGPLMLSLKEGSLLVMRGNSADMARHVMC 385
Query: 234 PQDTQHRRAVILLRRVLP 251
P + +RR I RV P
Sbjct: 386 P--SPNRRVSITFFRVRP 401
>gi|154344577|ref|XP_001568230.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065567|emb|CAM43337.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 318
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 73/159 (45%), Gaps = 13/159 (8%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
P+P + ++ +LEQ VVP N++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPPGSFTPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 VCSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGG 280
R ++ TP V PY + L + +++ GG
Sbjct: 191 RSIMESDGGFRPVTTPLKEVMPYRATQILNALYSKQVGG 229
>gi|401429284|ref|XP_003879124.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322495374|emb|CBZ30678.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 318
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 43/185 (23%), Positives = 84/185 (45%), Gaps = 13/185 (7%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
P+P+ ++ +LEQ VVP N++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPSASFLPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
R ++ +TP + PY + L + +++ GG S + + + + +TS
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESADIGAFDTS 250
Query: 302 NGLDK 306
+ +
Sbjct: 251 RWVKR 255
>gi|356536752|ref|XP_003536899.1| PREDICTED: uncharacterized protein LOC100802371 [Glycine max]
Length = 466
Score = 44.7 bits (104), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 94/200 (47%), Gaps = 16/200 (8%)
Query: 58 LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGL--- 114
+F+ E L ++E+ + + G+ T L NK G +L++ G+
Sbjct: 187 IFTDSELCKLTDFVNEIHAAGQNGELSGETFI---LFNKQMKGNK----RELIQLGVPIF 239
Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
G + + ++PIPA +Q VI L Q ++P N +IN ++ G + PPH
Sbjct: 240 GQIKEDAKCNIEPIPALLQG-VIDHLIQWQLLPEYKRPNGCIINFFEEGEFSQPFLKPPH 298
Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
+ D+P+ +L L S+S + FG + + L L +++G + ++R +A+ H +
Sbjct: 299 L-DQPVSTL-LLSESTMAFGRILMSENDGNYKGPLTLSLKQGSLLVMRGNSADMARHVMC 356
Query: 234 PQDTQHRRAVILLRRVLPHA 253
P + +RR I RV P +
Sbjct: 357 P--SPNRRVSITFFRVRPDS 374
>gi|195027836|ref|XP_001986788.1| GH20337 [Drosophila grimshawi]
gi|193902788|gb|EDW01655.1| GH20337 [Drosophila grimshawi]
Length = 616
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 19/112 (16%)
Query: 86 CTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPA-----WIQSLVITKL 140
CT D R+ FG + YGS V E+ PIPA W Q L TK+
Sbjct: 162 CTEDTLKHRHVKHFGYEFLYGSNNVDAAQPLEQ--------PIPAACDFLW-QRLDSTKI 212
Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
G P+ + +N+Y+PG I H+D F PI+SLSL +D + F
Sbjct: 213 S--GTFLPDQLT---VNEYEPGQGIPPHVDTHSAFVDPILSLSLQADVVMDF 259
>gi|124002832|ref|ZP_01687684.1| alkylated DNA repair protein [Microscilla marina ATCC 23134]
gi|123992060|gb|EAY31447.1| alkylated DNA repair protein [Microscilla marina ATCC 23134]
Length = 192
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 17/136 (12%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
+V +P W Q +V +++ + Y + +IN+Y PG I H+D FD I+S+S
Sbjct: 62 KVGDLPHWAQKIVQQAVDEQ--LLSEYFDQMIINEYLPGQGIARHVDCEPCFDHTIMSVS 119
Query: 184 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQH---- 239
L + + F + + + PVL P G LL A H +R ++
Sbjct: 120 LGTACVMHFN---SLEDKNLDVPVLLAP---GSAILLSGDARYRWQHSIRANKSEMFEGQ 173
Query: 240 -----RRAVILLRRVL 250
RR + R+V+
Sbjct: 174 KIVRGRRVSLTFRKVI 189
>gi|72388954|ref|XP_844772.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62176055|gb|AAX70176.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801306|gb|AAZ11213.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261328019|emb|CBH10996.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 305
Score = 44.3 bits (103), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 75/172 (43%), Gaps = 13/172 (7%)
Query: 136 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 195
++ +LEQ G+VP ++N+ N Y+P I +HID ++D +SL +++ L F
Sbjct: 85 LVERLEQDGIVPRGWLNNQTANLYEPEDFIRAHIDNLFVYDDIFAVISLGANALLRFVHV 144
Query: 196 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPR 255
N + + V P G V ++ + H V P + Q R +++ R +L
Sbjct: 145 QNGEELDVVIP-------DGSVYIMSGPSRYVYFHMVLPVEAQ-RVSIVFRRSILNSDGG 196
Query: 256 LTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTSN 302
TP + PY S L + A++ GG + + Y E +T
Sbjct: 197 FRPVSTPLGDLMPYRSTQILNTLYAKQIGGVRVAVDDKYLEKEEIGAFDTGK 248
>gi|414077137|ref|YP_006996455.1| alkylated DNA repair protein [Anabaena sp. 90]
gi|413970553|gb|AFW94642.1| alkylated DNA repair protein [Anabaena sp. 90]
Length = 226
Score = 43.9 bits (102), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 36/74 (48%), Gaps = 9/74 (12%)
Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
+ P+P W +L +L + V P + ++NDYQPG I SH D F IISLS
Sbjct: 98 IAPLPDWALTLA-QRLHK--VFSPTVPDQVIVNDYQPGQGISSHTDCVSCFTDVIISLS- 153
Query: 185 FSDSALCFGCKFNF 198
LC C +F
Sbjct: 154 -----LCSSCVMDF 162
>gi|341897299|gb|EGT53234.1| hypothetical protein CAEBREN_14010 [Caenorhabditis brenneri]
Length = 603
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 4/70 (5%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
E +PIP I+ L+ L + P+ I + V Y+PG I SH D FD PI+S+
Sbjct: 193 EAEPIPQIIEDLIDRLLSDKYITERPDQITANV---YEPGHGIPSHYDTHSAFDDPIVSV 249
Query: 183 SLFSDSALCF 192
SL SD + F
Sbjct: 250 SLLSDVVMEF 259
>gi|418296695|ref|ZP_12908538.1| hypothetical protein ATCR1_04209 [Agrobacterium tumefaciens
CCNWGS0286]
gi|355538870|gb|EHH08112.1| hypothetical protein ATCR1_04209 [Agrobacterium tumefaciens
CCNWGS0286]
Length = 187
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 70/168 (41%), Gaps = 29/168 (17%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
FG Y Y ++ V P+ + P+P W + +L G N + + N+
Sbjct: 40 FGYRYDYKARTVT---------PDAYLGPLPQWF-GIFAERLVTEGYCR-NLPDQVIANE 88
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVT 218
Y PG I +HID FD I+S+SL S C+ F +R L R V
Sbjct: 89 YLPGQGISAHIDCVPCFDDTIVSISLLS------ACEMVFHDVRGPAACGVLLQPRSGV- 141
Query: 219 LLRDFAANGITHCV--RPQDT-------QHRRAVILLRRVLPHAPRLT 257
LLRD G TH + R D + RR + R+V+ AP+ +
Sbjct: 142 LLRDSGRYGWTHEIPARKSDIVNGVRTDRSRRISLTFRKVI--APKWS 187
>gi|170079543|ref|YP_001736177.1| 2OG-Fe(II) oxygenase family oxidoreductase [Synechococcus sp. PCC
7002]
gi|169887212|gb|ACB00922.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Synechococcus sp. PCC
7002]
Length = 204
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 6/72 (8%)
Query: 128 IPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+P W+ ++ + L + +P + A+IN+Y PG I SHID F I+SLSL
Sbjct: 81 LPDWLFPIIEQMVSLNLISEIP----DQAIINEYLPGQGITSHIDCKPCFTDTIVSLSLN 136
Query: 186 SDSALCFGCKFN 197
S + F C N
Sbjct: 137 SPCVMNFDCTVN 148
>gi|308810811|ref|XP_003082714.1| unnamed protein product [Ostreococcus tauri]
gi|116061183|emb|CAL56571.1| unnamed protein product [Ostreococcus tauri]
Length = 597
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 9/73 (12%)
Query: 121 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 180
P+ +D + A I VIT+ E+M +S IN Y+ G I HID P +F RP +
Sbjct: 93 PKALIDVVDALIARGVITEDERM--------DSCTINVYERGMWIPPHIDNP-LFARPFV 143
Query: 181 SLSLFSDSALCFG 193
++SL S A+ G
Sbjct: 144 TVSLASAQAMTLG 156
>gi|410897523|ref|XP_003962248.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH6-like [Takifugu rubripes]
Length = 234
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 65/135 (48%), Gaps = 10/135 (7%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P+G + + IP W+Q K+ +G N ++N+Y+PG I+ H D P ++
Sbjct: 66 HPKGMLAEKIPDWLQKYC-EKISSLGAFGGKTANHVLVNEYKPGEGIMPHEDGP-LYYPT 123
Query: 179 IISLSLFSDSALCF----GCKFNFKPIRVSEPVLY-LPVQRGCVTLLRDFAANGITHCVR 233
I ++SL S + L F G P L+ L V+ + +L+D + H +R
Sbjct: 124 ITTISLGSHTLLDFYTPVGSMQGDAPQTEENRFLFSLLVKPRSLLILQDDMYQRLLHGIR 183
Query: 234 P--QDTQHRRAVILL 246
P QDT +AV LL
Sbjct: 184 PCDQDTLTEKAVNLL 198
>gi|195120121|ref|XP_002004577.1| GI20008 [Drosophila mojavensis]
gi|193909645|gb|EDW08512.1| GI20008 [Drosophila mojavensis]
Length = 602
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 59/130 (45%), Gaps = 18/130 (13%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGVVPPNYINSAVINDYQPGGCI 165
K G E LY V+P+ Q + + +LE PP+ + +N+Y+ G I
Sbjct: 170 KHFGYEFLYGSNNVNPLQPLEQGIPAACNFMWERLELPAFEPPDQLT---VNEYEAGQGI 226
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAA 225
H+D F PI+SLSL SD + F R + ++++ + R + ++ +
Sbjct: 227 PPHVDTHSAFKDPILSLSLQSDVVMDF---------RRGDQLVHVLLPRRSLLVMSGESR 277
Query: 226 NGITHCVRPQ 235
TH +RP+
Sbjct: 278 YDWTHGIRPK 287
>gi|443704913|gb|ELU01726.1| hypothetical protein CAPTEDRAFT_156565 [Capitella teleta]
Length = 610
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 25/137 (18%)
Query: 112 KGLGNERLYPEGEVDP-------IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 164
K G E LY VDP IP ++ L Q ++ N + +N YQPG
Sbjct: 171 KHFGFEFLYSTNNVDPDHPLDMGIPPECSPILQRMLSQQIIL--NLPDQLTVNQYQPGQG 228
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF----GCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
I H+D F+ ++SLSL S + F GC + PV +LP QR V ++
Sbjct: 229 IPPHVDTHSAFEEELVSLSLGSQVVMDFKAPGGCHY---------PV-FLP-QRSLV-VM 276
Query: 221 RDFAANGITHCVRPQDT 237
R + +TH + P+ +
Sbjct: 277 RGESRYQLTHAIAPRKS 293
>gi|58394263|ref|XP_320626.2| AGAP011900-PA [Anopheles gambiae str. PEST]
gi|55234784|gb|EAA00113.2| AGAP011900-PA [Anopheles gambiae str. PEST]
Length = 621
Score = 42.0 bits (97), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 39/125 (31%), Positives = 55/125 (44%), Gaps = 14/125 (11%)
Query: 94 RNKYFFGEGYTYGSQLVRKGLGNERLYPE--GEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
R FG + YGS V + ER P GE+ W KL+Q+ ++
Sbjct: 171 RKVKHFGYKFVYGSNNVDRTKPLERKIPSVCGEL-----W------QKLQQLHPHLRWHV 219
Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
+ +N Y+PG I H+D FD PI+SLSL SD + F + K + V P L
Sbjct: 220 PDQLTVNQYEPGQGIPPHVDTHSAFDDPILSLSLGSDVVMEFKQPSSGKVVCVDLPSRSL 279
Query: 211 PVQRG 215
+ G
Sbjct: 280 LIMSG 284
>gi|17552176|ref|NP_497751.1| Protein ALKB-8 [Caenorhabditis elegans]
gi|15718120|emb|CAB63431.2| Protein ALKB-8 [Caenorhabditis elegans]
Length = 591
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 36/70 (51%), Gaps = 4/70 (5%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
E DPIP I SL+ + + P+ + + N Y+ G I SH D FD PI+S+
Sbjct: 193 EADPIPPVINSLIDRLISDKYITERPDQVTA---NVYESGHGIPSHYDTHSAFDDPIVSI 249
Query: 183 SLFSDSALCF 192
SL SD + F
Sbjct: 250 SLLSDVVMEF 259
>gi|434406443|ref|YP_007149328.1| alkylated DNA repair protein [Cylindrospermum stagnale PCC 7417]
gi|428260698|gb|AFZ26648.1| alkylated DNA repair protein [Cylindrospermum stagnale PCC 7417]
Length = 226
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 9/74 (12%)
Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
+ P+P W +L +L + V P + ++N+YQPG I SH D F IISLS
Sbjct: 98 IAPLPNWALTLA-QRLHK--VFSPTLPDQVIVNEYQPGQGISSHTDCVSCFTDVIISLS- 153
Query: 185 FSDSALCFGCKFNF 198
LC C +F
Sbjct: 154 -----LCSSCVMDF 162
>gi|389594597|ref|XP_003722521.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|11022582|emb|CAC14240.1| hypothetical protein L7845.08 [Leishmania major]
gi|323363749|emb|CBZ12755.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 318
Score = 42.0 bits (97), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 42/185 (22%), Positives = 83/185 (44%), Gaps = 13/185 (7%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
P+P ++ +LEQ VVP +++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPPASFMPLLERLEQDNVVPKSWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
R ++ +TP + PY + L + +++ GG S + + + + +TS
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESADIGAFDTS 250
Query: 302 NGLDK 306
+ +
Sbjct: 251 RWVKR 255
>gi|255081432|ref|XP_002507938.1| predicted protein [Micromonas sp. RCC299]
gi|226523214|gb|ACO69196.1| predicted protein [Micromonas sp. RCC299]
Length = 684
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 14/137 (10%)
Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
V P+P + ++V + + + +S IN Y PG I HID P FDRP +++SL
Sbjct: 126 VAPLPPELDAVVDALIARGALTELQRPDSCTINLYGPGQWIPPHIDNP-AFDRPFVTVSL 184
Query: 185 FSDSALCFGCKFNF---------KPIRVSEP-VLYLPVQRGCVTLLRDFAANGITHCVRP 234
S+ + G + R++E L LPV G ++ AA+ H V P
Sbjct: 185 CSEQPMVLGRGMVWPEGGRGPCGDDERLNEEHALSLPV--GSAVVVEGEAADEYEHAVPP 242
Query: 235 QDTQHRRAVILLRRVLP 251
T R ++ RR P
Sbjct: 243 V-TAERISLTFRRRGRP 258
>gi|392871202|gb|EAS33085.2| Alkbh6 protein [Coccidioides immitis RS]
Length = 265
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 86 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 134
>gi|308502209|ref|XP_003113289.1| hypothetical protein CRE_25438 [Caenorhabditis remanei]
gi|308265590|gb|EFP09543.1| hypothetical protein CRE_25438 [Caenorhabditis remanei]
Length = 570
Score = 41.6 bits (96), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 4/68 (5%)
Query: 126 DPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
DPIP I +L+ + + + P+ I + V Y+PG I SH D F+ PI+S+SL
Sbjct: 173 DPIPPVINTLIDKLMSEKYITERPDQITANV---YEPGHGIPSHYDTHSAFEDPIVSISL 229
Query: 185 FSDSALCF 192
SD + F
Sbjct: 230 LSDVVMEF 237
>gi|303316794|ref|XP_003068399.1| hypothetical protein CPC735_004250 [Coccidioides posadasii C735
delta SOWgp]
gi|240108080|gb|EER26254.1| hypothetical protein CPC735_004250 [Coccidioides posadasii C735
delta SOWgp]
Length = 271
Score = 41.6 bits (96), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 92 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 140
>gi|320038239|gb|EFW20175.1| hypothetical protein CPSG_03351 [Coccidioides posadasii str.
Silveira]
Length = 248
Score = 41.6 bits (96), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 69 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 117
>gi|291383953|ref|XP_002708552.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Oryctolagus cuniculus]
Length = 664
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 27/162 (16%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN YQPG
Sbjct: 177 KHFGYEFRYDNNNVDKDRPLPGGLPDICDSILEKW--LQEGYIKHKPDQLTINQYQPGHG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
I +HID F+ IISLSL S+ + F P ++ PV+ LP R + ++ +
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDFK-----HPDGITVPVM-LP--RRSLLVMTGES 286
Query: 225 ANGITHCVRP------QDTQHRRAVILLRRVLPHAPRLTLSQ 260
TH + P Q ++H ++ I+ V LTLS+
Sbjct: 287 RYLWTHGITPRKFDTIQASEHHKSGIITSDV----GDLTLSK 324
>gi|297804104|ref|XP_002869936.1| hypothetical protein ARALYDRAFT_329536 [Arabidopsis lyrata subsp.
lyrata]
gi|297315772|gb|EFH46195.1| hypothetical protein ARALYDRAFT_329536 [Arabidopsis lyrata subsp.
lyrata]
Length = 241
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 66/159 (41%), Gaps = 34/159 (21%)
Query: 105 YGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 164
+G + KGL + L P+W+ + E G+ P + IN +IN+Y P
Sbjct: 52 WGGMVHEKGLVPQEL---------PSWLTKITAKICESSGLFP-SAINHVLINEYHPNQG 101
Query: 165 IVSHIDPPHIFDRPIIS-LSLFSDSALCFG------------------CKFNFKPIRVSE 205
I++H D P F P+++ LSL S + F C +++P R S
Sbjct: 102 IMAHQDGPAYF--PVVAILSLGSPVVMDFSPHLRLRSGDGYISKDQSPCAESYEPERDSF 159
Query: 206 PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVI 244
VL +P + + +D A + H + TQ V+
Sbjct: 160 SVLMMP---RSLLIFKDDAYSDFLHGISDSPTQCYNQVV 195
>gi|194880701|ref|XP_001974503.1| GG21781 [Drosophila erecta]
gi|190657690|gb|EDV54903.1| GG21781 [Drosophila erecta]
Length = 615
Score = 41.2 bits (95), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQSL---VITKLEQMGVVPPNYINSA----VINDYQPGGC 164
K G E LY VDP QS+ ++ P N+ S+ +N+Y+PG
Sbjct: 175 KHFGFEFLYGSNNVDPFKPLEQSIPSACDILWPRLDSFPSNWDWSSPDQLTVNEYEPGHG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I H+D F PI+SLSL SD + F
Sbjct: 235 IPPHVDTHSAFLDPILSLSLQSDVVMDF 262
>gi|198474666|ref|XP_001356776.2| GA19388 [Drosophila pseudoobscura pseudoobscura]
gi|198138491|gb|EAL33842.2| GA19388 [Drosophila pseudoobscura pseudoobscura]
Length = 237
Score = 41.2 bits (95), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+QS V K+ +GV N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS 181
IIS
Sbjct: 115 IIS 117
>gi|195148282|ref|XP_002015103.1| GL18598 [Drosophila persimilis]
gi|194107056|gb|EDW29099.1| GL18598 [Drosophila persimilis]
Length = 237
Score = 41.2 bits (95), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+QS V K+ +GV N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS 181
IIS
Sbjct: 115 IIS 117
>gi|421597443|ref|ZP_16041063.1| hypothetical protein BCCGELA001_08731 [Bradyrhizobium sp.
CCGE-LA001]
gi|404270447|gb|EJZ34507.1| hypothetical protein BCCGELA001_08731 [Bradyrhizobium sp.
CCGE-LA001]
Length = 190
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 9/77 (11%)
Query: 123 GEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
GE +PIP W+ + ++E +P + + +Y+ G I H D PH FD+ ++ L
Sbjct: 71 GEAEPIPDWVLP-IARQVEAWAGLPEASVRQVLCTEYEAGVGIGWHRDKPH-FDK-VLGL 127
Query: 183 SLFSDSALCFGCKFNFK 199
SL + CKF F+
Sbjct: 128 SLGAP------CKFRFR 138
>gi|195439545|ref|XP_002067665.1| GK21248 [Drosophila willistoni]
gi|194163750|gb|EDW78651.1| GK21248 [Drosophila willistoni]
Length = 228
Score = 40.8 bits (94), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS 181
IIS
Sbjct: 115 IIS 117
>gi|449305107|gb|EMD01114.1| hypothetical protein BAUCODRAFT_196023 [Baudoinia compniacensis
UAMH 10762]
Length = 248
Score = 40.8 bits (94), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 127 PIPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPP 172
P+PAW+ V+ ++ +GV P+ IN +IN+Y PG I+ H D P
Sbjct: 71 PLPAWLTEPVVGRIHDLGVFADAPHGINHCLINEYLPGQGIMPHEDGP 118
>gi|326914414|ref|XP_003203520.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Meleagris gallopavo]
Length = 846
Score = 40.8 bits (94), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y +VD P+P + + I+ LE+ + YI + +N Y+PG
Sbjct: 351 KHFGYEFRYDNNDVDKDKPLPGGLPEICISFLEK--CLKQGYIKHKPDQLTVNQYEPGQG 408
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I HID F+ IISLSL S+ + F
Sbjct: 409 IPPHIDTHSAFEDEIISLSLGSEIVMDF 436
>gi|194761924|ref|XP_001963141.1| GF15798 [Drosophila ananassae]
gi|190616838|gb|EDV32362.1| GF15798 [Drosophila ananassae]
Length = 231
Score = 40.8 bits (94), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 5/76 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS-LSLFSDSALCFG 193
IIS +S S + L F
Sbjct: 115 IISTISCGSHTVLEFA 130
>gi|301104152|ref|XP_002901161.1| alkylated DNA repair protein alkB [Phytophthora infestans T30-4]
gi|262101095|gb|EEY59147.1| alkylated DNA repair protein alkB [Phytophthora infestans T30-4]
Length = 231
Score = 40.8 bits (94), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
E +P+P W+Q + T L+ N A+IN+Y G CI+ H D P F P++ S
Sbjct: 94 EPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDCILPHEDGPAYF--PLV--S 149
Query: 184 LFSDSALCFGCKFNFKPIR 202
+ S A C+ F+P R
Sbjct: 150 IISTGA---ECRVTFEPHR 165
>gi|321463990|gb|EFX75001.1| hypothetical protein DAPPUDRAFT_306906 [Daphnia pulex]
Length = 574
Score = 40.4 bits (93), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 112 KGLGNERLYPEGEVDP------IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
K G E LY +VD IP ++ L I K + G+ PN + +N Y PG I
Sbjct: 145 KHYGFEFLYGSNKVDKNPLSSGIPVEMEPL-IEKFMEFGLARPNQL---TVNHYVPGQGI 200
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL---RD 222
H D F+ IIS+SL SD + +FK + E V +R C+ + R
Sbjct: 201 PLHTDTHSSFEDGIISVSLGSDIVM------DFKNNKGGECVSITLPRRSCLIMTNESRY 254
Query: 223 FAANGIT 229
++GIT
Sbjct: 255 MWSHGIT 261
>gi|195386178|ref|XP_002051781.1| GJ17179 [Drosophila virilis]
gi|194148238|gb|EDW63936.1| GJ17179 [Drosophila virilis]
Length = 221
Score = 40.4 bits (93), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS 181
IIS
Sbjct: 115 IIS 117
>gi|119187755|ref|XP_001244484.1| hypothetical protein CIMG_03925 [Coccidioides immitis RS]
Length = 119
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 51 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 99
>gi|170579523|ref|XP_001894867.1| hypothetical protein [Brugia malayi]
gi|158598386|gb|EDP36290.1| conserved hypothetical protein [Brugia malayi]
Length = 576
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 3/91 (3%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
DPIP+ Q VI ++ G+ + +N Y+PG I SH+D F I SLSL
Sbjct: 183 DPIPSVCQP-VIDRMLGAGIFKEK-PDQVTVNIYEPGNGIPSHVDTHSAFSDTIASLSLL 240
Query: 186 SDSALCFGCKFNFKPI-RVSEPVLYLPVQRG 215
SD + F N I V P L V RG
Sbjct: 241 SDLVMEFRDFANTSTIYDVLLPRFSLTVMRG 271
>gi|47196062|emb|CAF89344.1| unnamed protein product [Tetraodon nigroviridis]
Length = 234
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 8/121 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P+G + + IP W+Q+ ++ +G N ++N+Y+PG I+ H D P ++
Sbjct: 66 HPKGMLAETIPEWLQTYC-QRISSLGAFGGKVANHVLVNEYKPGEGIMPHEDGP-LYHPT 123
Query: 179 IISLSLFSDSALCF-----GCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
+ +LSL S + L F G + + + + L V+ + +L+D + H +R
Sbjct: 124 VTTLSLGSHTLLDFYTPVGGVQGDAPQTEENRFLFSLLVEPRSLLILQDEMYQKLLHGIR 183
Query: 234 P 234
P
Sbjct: 184 P 184
>gi|301118833|ref|XP_002907144.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105656|gb|EEY63708.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 134
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
E +P+P W+Q + T L+ N A+IN+Y G CI+ H D P F P++ S
Sbjct: 11 EPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDCILPHEDGPAYF--PLV--S 66
Query: 184 LFSDSALCFGCKFNFKPIR 202
+ S A C+ F+P R
Sbjct: 67 IISTGA---ECRVTFEPHR 82
>gi|195114110|ref|XP_002001610.1| GI16672 [Drosophila mojavensis]
gi|193912185|gb|EDW11052.1| GI16672 [Drosophila mojavensis]
Length = 222
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS 181
IIS
Sbjct: 115 IIS 117
>gi|395861438|ref|XP_003802993.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Otolemur
garnettii]
Length = 986
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 114 LGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGCIV 166
G E LY VD P+P + + + LE+ + YI + IN Y+PG I
Sbjct: 501 FGYEFLYENNNVDKDKPLPGGLPDICDSILEKW--LKEGYIRQKPDQVTINQYEPGHGIP 558
Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVL 208
+HID F+ I+SLSL S+ + F P V+ PV+
Sbjct: 559 AHIDTHSAFEDEIVSLSLGSEIVMDFK-----HPDGVTVPVM 595
>gi|291238544|ref|XP_002739188.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 742
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 58/129 (44%), Gaps = 13/129 (10%)
Query: 126 DPIPAWIQSLVITKLEQMGVV-PPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
D IPA ++ ++ + + PN I +N YQPG I HID F+ IISLSL
Sbjct: 267 DGIPAALRKIIDDIMATQNIQHGPNQI---TVNQYQPGQGIPPHIDTHSAFEGEIISLSL 323
Query: 185 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGIT---HCVRPQDTQHRR 241
S+ + F N + I V P L V G R +GIT H V P DT
Sbjct: 324 GSNVIMDFK-HPNGQHIPVLVPQRSLLVMTG---ESRYLWTHGITPRKHDVAPADTD--T 377
Query: 242 AVILLRRVL 250
+ L +R L
Sbjct: 378 GLTLTKRTL 386
>gi|363729209|ref|XP_417166.3| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Gallus
gallus]
Length = 695
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y +VD P+P + + I LE+ + YI + +N Y+PG
Sbjct: 201 KHFGYEFRYDNNDVDKDNPLPGGLPEICIPFLEK--CLKQGYIKHKPDQLTVNQYEPGQG 258
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I HID F+ IISLSL S+ + F
Sbjct: 259 IPPHIDTHSAFEDEIISLSLGSEIVMDF 286
>gi|291301566|ref|YP_003512844.1| 2OG-Fe(II) oxygenase [Stackebrandtia nassauensis DSM 44728]
gi|290570786|gb|ADD43751.1| 2OG-Fe(II) oxygenase [Stackebrandtia nassauensis DSM 44728]
Length = 197
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 17/105 (16%)
Query: 94 RNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINS 153
R +G Y YG R+ +G P PIPAW+ +V +++ + P +
Sbjct: 42 RRVQHYGHRYDYG----RRSVGQTDAAP-----PIPAWLDEVVARLVDEKVMDQPA--DQ 90
Query: 154 AVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF 198
++N+Y PG I +H+D F + ++SL S GC +F
Sbjct: 91 VIVNEYLPGQGISAHVDCVPCFGPVVAAISLES------GCVMDF 129
>gi|195033105|ref|XP_001988620.1| GH11262 [Drosophila grimshawi]
gi|193904620|gb|EDW03487.1| GH11262 [Drosophila grimshawi]
Length = 221
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + +P W+QS V K+ +GV N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEMPEWLQSYV-DKVNNLGVFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 179 IIS 181
IIS
Sbjct: 115 IIS 117
>gi|348681591|gb|EGZ21407.1| hypothetical protein PHYSODRAFT_491312 [Phytophthora sojae]
Length = 688
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 55/126 (43%), Gaps = 3/126 (2%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+PIP +Q +I K+ + G++ + + +N+Y PG I H+D F I SLS+
Sbjct: 208 EPIPEVLQP-IIEKIARCGIMDGDEPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSIC 266
Query: 186 SDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV--RPQDTQHRRAV 243
S+ + F + V P L V G + + A T V Q + RR
Sbjct: 267 SEVVMDFRHPDGVRNEGVLLPARSLAVMSGASRYMWEHAIVPRTFDVIDGKQVNRQRRVS 326
Query: 244 ILLRRV 249
I R++
Sbjct: 327 ITFRKI 332
>gi|195151073|ref|XP_002016472.1| GL10469 [Drosophila persimilis]
gi|194110319|gb|EDW32362.1| GL10469 [Drosophila persimilis]
Length = 614
Score = 39.7 bits (91), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 16/131 (12%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGV-VPPNYINSAVINDYQPGGC 164
K G E LY VDP QS+ + +LE + + +N+Y+PG
Sbjct: 173 KHFGYEFLYGSNNVDPTKPLEQSVPTACGFLWPRLESFSSSWDWSTPDQLTVNEYEPGHG 232
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
I H+D F PI+SLSL SD + F R E + + + R + ++ A
Sbjct: 233 IPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEAQVQVKLPRRSLLIMSGEA 283
Query: 225 ANGITHCVRPQ 235
TH ++P+
Sbjct: 284 RYDWTHGIKPK 294
>gi|195339853|ref|XP_002036531.1| GM18491 [Drosophila sechellia]
gi|194130411|gb|EDW52454.1| GM18491 [Drosophila sechellia]
Length = 228
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 179 IISLSLFSDSALCFG 193
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|340053555|emb|CCC47848.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 305
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 86/198 (43%), Gaps = 21/198 (10%)
Query: 88 VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
+++ LR Y + YT S +R G +L P + +P ++ ++EQ G+VP
Sbjct: 45 MEKIALRKGYV--DVYTQRSASIRLSDGRFQLPPLPQHSFLP------LVERMEQDGLVP 96
Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
++N+ N Y+ I +HID I+D +SL +++ L F N + + V P
Sbjct: 97 CGWLNNQTANFYETNDFIRAHIDNLFIYDDIFAIVSLGANALLRFVHVQNGEELDVVIP- 155
Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
G V ++ + H V P + Q R +++ R +L TP +
Sbjct: 156 ------DGSVYIMSGPSRYVYFHMVLPVEAQ-RVSIVFRRSILNSDGSFRPVSTPLGDLM 208
Query: 266 PYHSLYDVQ---ARESGG 280
PY S V A++ GG
Sbjct: 209 PYRSTQIVNTLYAKQVGG 226
>gi|198457852|ref|XP_001360812.2| GA14676 [Drosophila pseudoobscura pseudoobscura]
gi|198136129|gb|EAL25387.2| GA14676 [Drosophila pseudoobscura pseudoobscura]
Length = 615
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 16/131 (12%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGV-VPPNYINSAVINDYQPGGC 164
K G E LY VDP QS+ + +LE + + +N+Y+PG
Sbjct: 173 KHFGYEFLYGSNNVDPTKPLEQSVPTACDFLWPRLESFSSSWDWSTPDQLTVNEYKPGHG 232
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
I H+D F PI+SLSL SD + F R E + + + R + ++ A
Sbjct: 233 IPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEAQVQVKLPRRSLLIMSGEA 283
Query: 225 ANGITHCVRPQ 235
TH ++P+
Sbjct: 284 RYDWTHGIKPK 294
>gi|258576637|ref|XP_002542500.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902766|gb|EEP77167.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 259
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHID 170
P+P W++S V+++ ++G+ P N +IN+YQPG I+ H D
Sbjct: 80 PLPDWLRSPVVSRFTELGIFFDSPHRAPNHVLINEYQPGQGIMPHED 126
>gi|357498515|ref|XP_003619546.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|355494561|gb|AES75764.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 266
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 12/90 (13%)
Query: 112 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 163
K L N RL G V P+P+W+ + E+ G+ P + IN +IN+YQP
Sbjct: 50 KLLKNRRLQNWGGVVHEKGLLPQPLPSWLTNFTQKISEESGLFP-SPINHVLINEYQPNQ 108
Query: 164 CIVSHIDPPHIFDRPIIS-LSLFSDSALCF 192
I+ H D P F P+++ LSL S + F
Sbjct: 109 GIMPHQDGPSYF--PVVAILSLGSPVVMDF 136
>gi|67458989|ref|YP_246613.1| alkylated DNA repair protein [Rickettsia felis URRWXCal2]
gi|67004522|gb|AAY61448.1| Alkylated DNA repair protein [Rickettsia felis URRWXCal2]
Length = 188
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 63/139 (45%), Gaps = 21/139 (15%)
Query: 53 VNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPL-----RNKYFFGEGYTYGS 107
++Q+SLF+ + ++ + I+E + AEQ +D +P R +G Y Y S
Sbjct: 1 MSQLSLFNDQIIISGLKYIEEYIT-AEQEDRLIKLIDSSPWITDLKRRVQHYGYKYDYKS 59
Query: 108 QLVRKGLGNERLYPEGEVDPIPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCI 165
+ + E+ Y G +P W+Q + K +P N ++N+Y PG I
Sbjct: 60 RSI------EQSYYLG---LLPKWLQIVADEFYKKNIFNEIP----NQVIVNEYMPGQGI 106
Query: 166 VSHIDPPHIFDRPIISLSL 184
SHID F I SLSL
Sbjct: 107 ASHIDCIPCFSDTICSLSL 125
>gi|440791732|gb|ELR12970.1| hypothetical protein ACA1_096410 [Acanthamoeba castellanii str.
Neff]
Length = 259
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 49/112 (43%), Gaps = 13/112 (11%)
Query: 123 GEVDPIPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 180
G + P+P + Q V +L G++P P+ + +IN Y PG I H+D H F+ +
Sbjct: 103 GYLGPLPDFAQQ-VTARLVSTGLMPYEPDQM---IINHYTPGQGIHPHVDKTHCFEGVVG 158
Query: 181 SLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
SL L S C FK I V +R +L A G TH +
Sbjct: 159 SLGLGSS------CIMEFKHIETGRRVDVF-FERRTALMLTGEARYGWTHGI 203
>gi|195578245|ref|XP_002078976.1| GD23711 [Drosophila simulans]
gi|194190985|gb|EDX04561.1| GD23711 [Drosophila simulans]
Length = 228
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 179 IISLSLFSDSALCFG 193
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|358395346|gb|EHK44733.1| hypothetical protein TRIATDRAFT_184325, partial [Trichoderma
atroviride IMI 206040]
Length = 345
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 24/122 (19%)
Query: 83 YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQ 142
+KP T+ +A R ++ G Y R+YPE PA + + + T
Sbjct: 159 HKPLTIKQALQRRLHWVTLGGQYD--------WTNRIYPEAPPPSFPADLSNFLTT---- 206
Query: 143 MGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK--FNFKP 200
+ P +A++N Y PG ++ H D D+ ++SLS FGC+ F P
Sbjct: 207 --LFPETLAQAAIVNFYTPGDTMMMHRDVSEEIDKGLVSLS--------FGCECLFMIAP 256
Query: 201 IR 202
R
Sbjct: 257 SR 258
>gi|298707663|emb|CBJ25980.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 329
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+P+P W+ +L + + + N ++N+YQPG I++H D P ++ +LSL
Sbjct: 139 EPLPPWVDALCESLVVRGVFSEETRPNHVLLNEYQPGQGIMAHTDGP-FYEPRTATLSLG 197
Query: 186 SDSALCF 192
SD+ + F
Sbjct: 198 SDAVMHF 204
>gi|85724924|ref|NP_001033895.1| CG6144, isoform B [Drosophila melanogaster]
gi|85726418|ref|NP_609414.3| CG6144, isoform C [Drosophila melanogaster]
gi|16076870|gb|AAL13362.1| LD37206p [Drosophila melanogaster]
gi|84795296|gb|AAF52964.3| CG6144, isoform C [Drosophila melanogaster]
gi|84795297|gb|ABC65892.1| CG6144, isoform B [Drosophila melanogaster]
gi|220946136|gb|ACL85611.1| CG6144-PA [synthetic construct]
gi|220960392|gb|ACL92732.1| CG6144-PA [synthetic construct]
Length = 228
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 179 IISLSLFSDSALCF 192
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEF 129
>gi|328715721|ref|XP_001945058.2| PREDICTED: hypothetical protein LOC100163323 [Acyrthosiphon pisum]
Length = 312
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
DPIP + ++ ++ P N +N Y PG I SHID +FD I+SLSL
Sbjct: 40 DPIPKEFEFILNAIYLRLKWQP----NQITVNKYLPGQGIPSHIDTHGVFDEYILSLSLN 95
Query: 186 SDSALCF 192
SD + F
Sbjct: 96 SDIIMEF 102
>gi|312086366|ref|XP_003145047.1| hypothetical protein LOAG_09472 [Loa loa]
gi|307759789|gb|EFO19023.1| hypothetical protein LOAG_09472 [Loa loa]
Length = 576
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 3/91 (3%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+PIPA Q +I ++ G+ + +N Y+PG I SH+D F I SLSL
Sbjct: 183 NPIPAACQP-IIDRMLDAGIFKEE-PDQLTVNIYEPGNGIPSHVDTHSAFSDTIASLSLL 240
Query: 186 SDSALCFGCKFNFKPI-RVSEPVLYLPVQRG 215
SD + F N I V P L L V +G
Sbjct: 241 SDLVMEFRDFANTSTIYDVLLPRLSLAVMQG 271
>gi|301121774|ref|XP_002908614.1| alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]
gi|262103645|gb|EEY61697.1| alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]
Length = 640
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+PIP +Q VI K+ + G++ + + +N+Y PG I H+D F I SLS+
Sbjct: 203 EPIPEVLQP-VIDKIVECGIMDGDRPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSIC 261
Query: 186 SDSALCF 192
S+ + F
Sbjct: 262 SEVVMDF 268
>gi|390349756|ref|XP_790566.3| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH6-like [Strongylocentrotus purpuratus]
Length = 245
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 114 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
L N RL +P+G + + +P W+ + K+ +GV + N ++N+YQPG I
Sbjct: 55 LSNRRLQNWGGLPHPKGMIAEGLPKWL-DVYAKKIAGLGVFGDHIPNHVLVNEYQPGQGI 113
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCF 192
+ H D P +F + ++SL S + L F
Sbjct: 114 MPHEDGP-LFHPVVTTISLGSHTFLDF 139
>gi|91080367|ref|XP_974935.1| PREDICTED: similar to CG17807 CG17807-PA [Tribolium castaneum]
gi|270005595|gb|EFA02043.1| hypothetical protein TcasGA2_TC007671 [Tribolium castaneum]
Length = 582
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%)
Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
N IN Y PG I SH+D F PI+SLSL SD + F
Sbjct: 203 NQLTINRYNPGQGIPSHVDTHSAFGDPILSLSLSSDVVMEF 243
>gi|195471890|ref|XP_002088235.1| GE18466 [Drosophila yakuba]
gi|194174336|gb|EDW87947.1| GE18466 [Drosophila yakuba]
Length = 234
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 179 IISLSLFSDSALCFG 193
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|443725258|gb|ELU12938.1| hypothetical protein CAPTEDRAFT_112863, partial [Capitella teleta]
Length = 214
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 12/93 (12%)
Query: 114 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
L N RL +P+G V + IP W+ S ++ Q+GV N ++N+Y PG I
Sbjct: 36 LSNRRLQNWGGLPHPKGMVPEKIPEWLDSFG-QRIGQLGVFDGQMPNHVLVNEYLPGQGI 94
Query: 166 VSHIDPPHIFDRPIIS-LSLFSDSALCFGCKFN 197
+ H D P F P +S ++L S + L F N
Sbjct: 95 MPHTDGPLYF--PTVSTITLGSHTLLDFYTPLN 125
>gi|168057031|ref|XP_001780520.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667998|gb|EDQ54614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 258
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIIS-LSLF 185
PIPAW+ S + K+ + + P IN ++N+Y PG I SH D P + P+++ LSL
Sbjct: 59 PIPAWLSS-ITEKIAKETNLFPAPINHVLVNEYLPGQGITSHQDGPVYY--PVVAILSLG 115
Query: 186 SDSALCF 192
+ + + F
Sbjct: 116 APTLMHF 122
>gi|440902148|gb|ELR52976.1| Alkylated DNA repair protein alkB-like protein 8, partial [Bos
grunniens mutus]
Length = 349
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + +I N IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICESILEKW--LKEGFIKHKPNQLTINQYEPGHG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ IISLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDF 262
>gi|351710212|gb|EHB13131.1| Alkylated DNA repair protein alkB-like protein 8, partial
[Heterocephalus glaber]
Length = 667
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 179 KHFGYEFHYENNNVDKDKPLPEGLPDICDSFLEKW--LAEGYIKCRPDQLTINQYEPGQG 236
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ IISLSL S+ + F
Sbjct: 237 IPAHIDTHSAFEDEIISLSLGSEVVMDF 264
>gi|340507083|gb|EGR33100.1| hypothetical protein IMG5_061460 [Ichthyophthirius multifiliis]
Length = 392
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 40/175 (22%), Positives = 72/175 (41%), Gaps = 23/175 (13%)
Query: 79 EQGQYKPCTVDRAPL-----RNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQ 133
EQ Y +D P R +G + YG + K E + +PIP ++
Sbjct: 189 EQENYIIQQIDEKPWIKLSKRQVQHYGYEFIYGQNTINK---------EKKTNPIPDFLS 239
Query: 134 SLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFG 193
++V +L + ++ INDY+PG I SH+D F+ I+ +SL S +
Sbjct: 240 NMV-QQLNEYIKNKQKPLDQLTINDYKPGNGISSHVDAHSPFEECIVVVSLVSGVVIT-- 296
Query: 194 CKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRR 248
FK + E L+L + + ++ THC+ + + ++ R+
Sbjct: 297 ----FKSYKNEEKSLFL--ENRSLLIMSGEGRYAWTHCISSRKVDRVQDEVIFRK 345
>gi|348541845|ref|XP_003458397.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH6-like [Oreochromis niloticus]
Length = 234
Score = 38.9 bits (89), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 65/146 (44%), Gaps = 28/146 (19%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDP 171
KG+ ERL P W+Q+ K+ +G N ++N+Y+PG I+ H D
Sbjct: 68 KGMLGERL---------PDWLQTYC-EKISSLGAFSGKTANHVLVNEYKPGEGIMPHEDG 117
Query: 172 PHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEP----------VLYLPVQRGCVTLLR 221
P ++ + ++SL S + L F + P+ +E + L V+ + +L+
Sbjct: 118 P-LYHPTVTTISLGSHTLLDF-----YTPLSSAEGATPQTEESRFLFSLLVRPRSLLILQ 171
Query: 222 DFAANGITHCV--RPQDTQHRRAVIL 245
D + H + R QDT ++ V L
Sbjct: 172 DEMYQRLLHGIHGREQDTLTKKVVNL 197
>gi|220908820|ref|YP_002484131.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
gi|219865431|gb|ACL45770.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
Length = 207
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 45/103 (43%), Gaps = 18/103 (17%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
+G Y Y S+ V K + + +P W + L ++ P+ + ++N+
Sbjct: 63 YGYKYDYKSRGVDKSM---------YIASLPIWAKELAHKIRKKYTTDLPDQV---IVNE 110
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 201
Y PG I +HID + F I+SLS LC C +F I
Sbjct: 111 YMPGQGIANHIDCVNCFTDTIVSLS------LCSSCVMDFVHI 147
>gi|300793742|ref|NP_001178838.1| alkylated DNA repair protein alkB homolog 8 [Rattus norvegicus]
Length = 664
Score = 38.9 bits (89), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNTVDKDKPLPGGLPEICSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S + F
Sbjct: 235 IPAHIDSHSAFEHEIVSLSLGSAIVMDF 262
>gi|194754858|ref|XP_001959711.1| GF13009 [Drosophila ananassae]
gi|190621009|gb|EDV36533.1| GF13009 [Drosophila ananassae]
Length = 609
Score = 38.9 bits (89), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 59/135 (43%), Gaps = 24/135 (17%)
Query: 112 KGLGNERLYPEGEVDP-------IPAWIQSLVITKLE----QMGVVPPNYINSAVINDYQ 160
K G E LY VDP IP+ ++ +LE + P+ + +N+Y+
Sbjct: 174 KHFGYEFLYGSNNVDPSKPLEQPIPSACD-ILWPRLEGNSTTWDWITPDQL---TVNEYE 229
Query: 161 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
PG I H+D F PI+SLSL SD + F R E + + + R + ++
Sbjct: 230 PGNGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEEHVQVRLPRRSLLVM 280
Query: 221 RDFAANGITHCVRPQ 235
A TH +RP+
Sbjct: 281 SGEARYDWTHGIRPK 295
>gi|327269144|ref|XP_003219355.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Anolis
carolinensis]
Length = 666
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 13/131 (9%)
Query: 69 RNIDEVSSLAEQGQYKPC-------TVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYP 121
R I+EV S E+GQ C + A K+ + + Y Q + +R P
Sbjct: 141 RVIEEVVSPEEEGQLLECIDWGKDEVIPNAQKSLKHRRVKHFGYEFQYDNNNVDRDRPLP 200
Query: 122 EGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIIS 181
G D I + ++ K +MG + + + IN Y+PG I HID F+ IIS
Sbjct: 201 GGLPD-----ICNTLLAKWLKMGYIKESS-DQLTINQYEPGQGIPPHIDTHSAFEDEIIS 254
Query: 182 LSLFSDSALCF 192
LSL + + F
Sbjct: 255 LSLGAGIVMDF 265
>gi|145522424|ref|XP_001447056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414556|emb|CAK79659.1| unnamed protein product [Paramecium tetraurelia]
Length = 636
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 50/118 (42%), Gaps = 17/118 (14%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
+G + YG V E+ IPA+++ V K+ + V P IN IN+
Sbjct: 192 YGYEFIYGDNTVNVDQPAEK--------KIPAFLED-VRAKVSDL-VKPQAEINQLTINE 241
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGC 216
Y PG I H D F +S+SL S G +FK + E LYLP R C
Sbjct: 242 YLPGMGIPPHFDVHPPFHEKFVSISLLS------GLVMSFKSYKGEEQHLYLP-PRSC 292
>gi|408785541|ref|ZP_11197285.1| hypothetical protein C241_04278 [Rhizobium lupini HPC(L)]
gi|408488554|gb|EKJ96864.1| hypothetical protein C241_04278 [Rhizobium lupini HPC(L)]
Length = 196
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 49/106 (46%), Gaps = 17/106 (16%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
FG Y Y ++ V P+ + P+P W+ + +L + G N + + N+
Sbjct: 49 FGYRYDYKARAVA---------PDAYLGPLPPWL-GVFARRLVRNGHCE-NLPDQVIANE 97
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVS 204
Y PG I +H+D FD I+S+SL S C+ F+ +R S
Sbjct: 98 YFPGQGISAHVDCVPCFDDTIVSISLLS------ACEMVFRDLRGS 137
>gi|195380141|ref|XP_002048829.1| GJ21257 [Drosophila virilis]
gi|194143626|gb|EDW60022.1| GJ21257 [Drosophila virilis]
Length = 616
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 6/87 (6%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQ------SLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
K G E LY VDP+ Q + +L + + +N+Y+PG I
Sbjct: 174 KHFGYEFLYGSNNVDPLQPLEQPIPTACDFLWQRLACCEAPALDTPDQLTVNEYEPGQGI 233
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCF 192
H+D F PI+SLSL SD + F
Sbjct: 234 PPHVDTHSAFVDPILSLSLQSDVVMDF 260
>gi|194862285|ref|XP_001969966.1| GG23646 [Drosophila erecta]
gi|190661833|gb|EDV59025.1| GG23646 [Drosophila erecta]
Length = 232
Score = 38.5 bits (88), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P G + + IP W+Q + + K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQ-IYVDKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 179 IISLSLFSDSALCFG 193
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|348553212|ref|XP_003462421.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Cavia
porcellus]
Length = 664
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + +N Y+PG
Sbjct: 177 KHFGYEFYYENNNVDKDKPLPEGLPDICDSFLEKW--LAEGYIKHKPDQLTVNQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +H+D F+ IISLSL S+ + F
Sbjct: 235 IPAHVDTHSAFEDEIISLSLGSEVVMDF 262
>gi|26334747|dbj|BAC31074.1| unnamed protein product [Mus musculus]
Length = 447
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ IISLSL S + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSAIVMDF 262
>gi|61675696|ref|NP_080579.1| alkylated DNA repair protein alkB homolog 8 [Mus musculus]
gi|81895388|sp|Q80Y20.1|ALKB8_MOUSE RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
Full=Probable alpha-ketoglutarate-dependent dioxygenase
ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
tRNA methyltransferase ABH8; AltName: Full=tRNA
(carboxymethyluridine(34)-5-O)-methyltransferase ABH8
gi|30046964|gb|AAH50863.1| AlkB, alkylation repair homolog 8 (E. coli) [Mus musculus]
Length = 664
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ IISLSL S + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSAIVMDF 262
>gi|355752600|gb|EHH56720.1| hypothetical protein EGM_06185 [Macaca fascicularis]
Length = 732
Score = 38.5 bits (88), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 265 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 322
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 323 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 350
>gi|302854777|ref|XP_002958893.1| hypothetical protein VOLCADRAFT_108374 [Volvox carteri f.
nagariensis]
gi|300255737|gb|EFJ40024.1| hypothetical protein VOLCADRAFT_108374 [Volvox carteri f.
nagariensis]
Length = 536
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 128 IPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
+P W+ LV E++ +P ++ +N+Y PG + H+D F PIISLSL
Sbjct: 190 LPPWVSDLV----ERISALPEVSQPLDQLTVNEYDPGVGLAPHVDTHSAFTGPIISLSLG 245
Query: 186 SDSAL 190
S + +
Sbjct: 246 STAVM 250
>gi|346466049|gb|AEO32869.1| hypothetical protein [Amblyomma maculatum]
Length = 241
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
+P+G V +P+P W++ + K+ +GV N ++N+Y+PG I+ H D P + P
Sbjct: 67 HPKGMVPEPLPQWLKG-ICAKVASLGVFGEKLPNHVLVNEYKPGEGILPHEDGPLYY--P 123
Query: 179 IIS-LSLFSDSALCF 192
+++ ++L S + + F
Sbjct: 124 VVTNVTLNSSTVIDF 138
>gi|332208092|ref|XP_003253130.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
[Nomascus leucogenys]
Length = 664
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|26349167|dbj|BAC38223.1| unnamed protein product [Mus musculus]
Length = 537
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 50 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 107
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ IISLSL S + F
Sbjct: 108 IPAHIDTHSAFEDEIISLSLGSAIVMDF 135
>gi|441644875|ref|XP_004090625.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Nomascus
leucogenys]
Length = 667
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|380789693|gb|AFE66722.1| alkylated DNA repair protein alkB homolog 8 [Macaca mulatta]
gi|380789695|gb|AFE66723.1| alkylated DNA repair protein alkB homolog 8 [Macaca mulatta]
Length = 664
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|297690113|ref|XP_002822470.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
[Pongo abelii]
Length = 667
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|402895130|ref|XP_003910687.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 2
[Papio anubis]
Length = 667
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|402895128|ref|XP_003910686.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
[Papio anubis]
Length = 664
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|297690115|ref|XP_002822471.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 2
[Pongo abelii]
Length = 664
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|195487923|ref|XP_002092096.1| GE11857 [Drosophila yakuba]
gi|194178197|gb|EDW91808.1| GE11857 [Drosophila yakuba]
Length = 615
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 55/135 (40%), Gaps = 24/135 (17%)
Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSA-----------VINDYQ 160
K G E LY VDP QS+ ++ P N A +N+Y+
Sbjct: 175 KHFGFEFLYGSNNVDPSKPLEQSIP----SACDILWPRLDNFASTWDWSSPDQLTVNEYE 230
Query: 161 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
PG I H+D F PI+SLSL SD + F R + + + + R + ++
Sbjct: 231 PGHGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGDDQVQVRLPRRSLLIM 281
Query: 221 RDFAANGITHCVRPQ 235
A TH +RP+
Sbjct: 282 SGEARYDWTHGIRPK 296
>gi|75048494|sp|Q95K79.1|ALKB8_MACFA RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
Full=Probable alpha-ketoglutarate-dependent dioxygenase
ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
tRNA methyltransferase ABH8; AltName: Full=tRNA
(carboxymethyluridine(34)-5-O)-methyltransferase ABH8
gi|14388555|dbj|BAB60797.1| hypothetical protein [Macaca fascicularis]
Length = 664
Score = 38.1 bits (87), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|109108545|ref|XP_001102763.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
1 [Macaca mulatta]
gi|109108547|ref|XP_001102947.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
3 [Macaca mulatta]
Length = 664
Score = 38.1 bits (87), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|297269101|ref|XP_001102856.2| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
2 [Macaca mulatta]
Length = 667
Score = 38.1 bits (87), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|169764793|ref|XP_001816868.1| calpain [Aspergillus oryzae RIB40]
gi|238503944|ref|XP_002383204.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83764722|dbj|BAE54866.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220690675|gb|EED47024.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|391863454|gb|EIT72765.1| calpain [Aspergillus oryzae 3.042]
Length = 236
Score = 38.1 bits (87), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 127 PIPAWIQSLVITKLEQMGV---VPPNYINSAVINDYQPGGCIVSHID 170
P+P+W+ S VI + E +G+ P N ++N+Y+PG I+ H D
Sbjct: 69 PLPSWLVSPVIPRFESLGIFADAPHGAPNHVLVNEYRPGQGIMPHED 115
>gi|340939079|gb|EGS19701.1| hypothetical protein CTHT_0041830 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 353
Score = 38.1 bits (87), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 15/82 (18%)
Query: 118 RLYPEGEVDP-IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFD 176
R+YPEGE P PA + + T V P +A++N Y PG ++ H D D
Sbjct: 194 RVYPEGEEPPAFPADLAGFLET------VFPETLAQAAIVNFYTPGDTMMMHRDVSEETD 247
Query: 177 RPIISLSLFSDSALCFGCKFNF 198
+ ++SLS FGC+ F
Sbjct: 248 KGLVSLS--------FGCEGLF 261
>gi|302803799|ref|XP_002983652.1| hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]
gi|300148489|gb|EFJ15148.1| hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]
Length = 556
Score = 37.7 bits (86), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 13/118 (11%)
Query: 122 EGEVDPIPAWIQSLVITKLEQMGVVPPNY-INSAVINDYQPGGCIVSHIDPPHI---FDR 177
E +P+P +++S +I +L + VVP + +S I+ +PG + PPH F++
Sbjct: 279 EETAEPMPTFLES-IIDRLVKCQVVPASKRPDSCSISVLEPGDYM-----PPHKHNNFEQ 332
Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCVRP 234
P+ LSL S S L FGC E + LP G V +L +A + V+P
Sbjct: 333 PLFILSLGSQSELAFGCNLKANSSSTDEKYKVGLPA--GSVLVLEGNSAQMVQCAVQP 388
>gi|383858166|ref|XP_003704573.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Megachile rotundata]
Length = 582
Score = 37.7 bits (86), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 11/94 (11%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
FG + YGS V + + PIP Q L + ++ VP +Y + IN
Sbjct: 166 FGYEFQYGSNKVN---------LDKPITPIPEEYQFLQVL-FKKYHNVPYDY-DQLTINH 214
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
Y PG I HID +F+ I+SLSL S + F
Sbjct: 215 YLPGQGIPPHIDTHSVFEDSILSLSLGSAYVMNF 248
>gi|444723561|gb|ELW64212.1| Alkylated DNA repair protein alkB like protein 8 [Tupaia chinensis]
Length = 583
Score = 37.7 bits (86), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 9/88 (10%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 98 KHFGYEFCYENNNVDRDKPLPGGLPDICDSILEKW--LKEGYIKYKPDQLTINQYEPGQG 155
Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
I HID F+ I+SLSL S+ + F
Sbjct: 156 IPPHIDTHSAFEDEIVSLSLGSEVVMDF 183
>gi|53292605|ref|NP_001005390.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Danio
rerio]
gi|82184989|sp|Q6IQE9.1|ALKB6_DANRE RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 6; AltName: Full=Alkylated DNA repair protein
alkB homolog 6
gi|47939433|gb|AAH71457.1| AlkB, alkylation repair homolog 6 (E. coli) [Danio rerio]
Length = 234
Score = 37.7 bits (86), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 59/124 (47%), Gaps = 18/124 (14%)
Query: 121 PEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPI 179
P+G + + +P W+ K+ +G N ++N+Y+PG I+ H D P ++ +
Sbjct: 67 PKGMLAEKLPDWLLEYT-EKISALGAFAGKTANHVLVNEYKPGEGIMPHEDGP-LYHPTV 124
Query: 180 ISLSLFSDSALCFGCKFNFKPIRVSEP----------VLYLPVQRGCVTLLRDFAANGIT 229
++++ S + L F ++P+ +EP +L L VQR + +L+D
Sbjct: 125 TTITVGSHTLLDF-----YRPVCQAEPDAPQTEESRYMLSLLVQRKSLLILQDDMYKCYL 179
Query: 230 HCVR 233
H +R
Sbjct: 180 HGIR 183
>gi|417403669|gb|JAA48633.1| Putative alkylated dna repair protein alkb log 8 [Desmodus
rotundus]
Length = 656
Score = 37.4 bits (85), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 60/130 (46%), Gaps = 23/130 (17%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQM---GVVPPNYINSAVINDYQPGGCI 165
K G E Y VD P+P + + + LE+ G++ + + +N Y+PG I
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDIYDSILEKWLKEGIIR-HKPDQLTVNQYEPGHGI 235
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL----- 220
+HID F+ IISLSL S+ + F P V+ PV+ LP C +LL
Sbjct: 236 PAHIDTHSAFEDEIISLSLGSEIVMDFK-----HPDGVTVPVM-LP----CRSLLVMTGE 285
Query: 221 -RDFAANGIT 229
R +GIT
Sbjct: 286 SRYLWTHGIT 295
>gi|424909400|ref|ZP_18332777.1| alkylated DNA repair protein [Rhizobium leguminosarum bv. viciae
USDA 2370]
gi|392845431|gb|EJA97953.1| alkylated DNA repair protein [Rhizobium leguminosarum bv. viciae
USDA 2370]
Length = 215
Score = 37.4 bits (85), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)
Query: 99 FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
FG Y Y ++ V P+ + P+P W+ + +L + G + + + N+
Sbjct: 68 FGYRYDYKARAVA---------PDAYLGPLPPWL-GVFARRLVRDGHCE-SLPDQVIANE 116
Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVS 204
Y PG I +H+D FD I+S+SL S C+ F+ +R S
Sbjct: 117 YFPGQGISAHVDCVPCFDDTIVSISLLS------ACEMVFRDLRGS 156
>gi|302823387|ref|XP_002993346.1| hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]
gi|300138777|gb|EFJ05531.1| hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]
Length = 229
Score = 37.4 bits (85), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 11/79 (13%)
Query: 112 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 163
K L N RL G + P+P+W+ + K+ + + P+ IN ++N+Y PG
Sbjct: 42 KTLTNRRLQNWGGIVESNGLVPQPLPSWLTK-ITEKISSVTGLFPSPINHVLVNEYLPGQ 100
Query: 164 CIVSHIDPPHIFDRPIISL 182
I+ H D P F P++++
Sbjct: 101 GIMLHQDGPSYF--PVVAI 117
>gi|260831338|ref|XP_002610616.1| hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]
gi|229295983|gb|EEN66626.1| hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]
Length = 231
Score = 37.4 bits (85), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 15/96 (15%)
Query: 114 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
L N RL +P+G V + +P W+Q + ++ +GV N ++N+YQ G I
Sbjct: 43 LSNRRLQNWGGLPHPKGMVVEKLPEWLQ-MCADRIHSLGVFQDKMPNHVLVNEYQQGQGI 101
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 201
+ H D P +F I +++L S + L F +KP+
Sbjct: 102 MPHEDGP-LFYPTITTVNLGSHTMLDF-----YKPL 131
>gi|156361264|ref|XP_001625438.1| predicted protein [Nematostella vectensis]
gi|156212272|gb|EDO33338.1| predicted protein [Nematostella vectensis]
Length = 648
Score = 37.4 bits (85), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 7/87 (8%)
Query: 112 KGLGNERLYPEGEVD---PIPAWIQSL---VITKLEQMGVVPPNYINSAVINDYQPGGCI 165
K G E LY ++D P+P + ++ ++T++ G V N + +N+Y PG I
Sbjct: 176 KHYGYEFLYGTNDIDRAKPLPGGMPAVCNDILTRMVSQGAVQ-NTPDQLTVNEYLPGQGI 234
Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCF 192
H+D F+ I SLSL + ++ F
Sbjct: 235 PPHVDTHSAFEDGICSLSLGAKISMDF 261
>gi|402078397|gb|EJT73662.1| oxidoreductase [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 368
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 12/105 (11%)
Query: 83 YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQ 142
+KP TV R R ++ G Y R+YP+G P PA+ + V LE
Sbjct: 177 HKPLTVRRVLDRRLHWVTLGGQYD--------WTNRVYPDGGTSP-PAFPRD-VAAFLET 226
Query: 143 MGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 187
V P +A++N Y PG ++ H D D+ ++S+SL D
Sbjct: 227 --VFPDTLAQAAIVNFYSPGDTMMMHRDVSEETDKGLVSISLGCD 269
>gi|145491391|ref|XP_001431695.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398800|emb|CAK64297.1| unnamed protein product [Paramecium tetraurelia]
Length = 634
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 9/92 (9%)
Query: 128 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 187
IPA+++ V K+ + + P IN IN+Y PG I H D F +++SL S
Sbjct: 211 IPAFLED-VRAKVSDL-IKPQAEINQLTINEYLPGMGIPPHFDVHPPFHEKFVTISLLS- 267
Query: 188 SALCFGCKFNFKPIRVSEPVLYLPVQRGCVTL 219
G +FK + E LYLP R C
Sbjct: 268 -----GLVMSFKSFKGEEHHLYLP-PRSCALF 293
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,952,568,633
Number of Sequences: 23463169
Number of extensions: 210736851
Number of successful extensions: 638449
Number of sequences better than 100.0: 263
Number of HSP's better than 100.0 without gapping: 86
Number of HSP's successfully gapped in prelim test: 177
Number of HSP's that attempted gapping in prelim test: 638215
Number of HSP's gapped (non-prelim): 310
length of query: 307
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 165
effective length of database: 9,027,425,369
effective search space: 1489525185885
effective search space used: 1489525185885
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)