BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy11248
(227 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|56118909|ref|NP_001008038.1| RNA demethylase ALKBH5 [Xenopus (Silurana) tropicalis]
gi|82181400|sp|Q66JG8.1|ALKB5_XENTR RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|51704081|gb|AAH80920.1| MGC79570 protein [Xenopus (Silurana) tropicalis]
Length = 358
Score = 243 bits (621), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 107/158 (67%), Positives = 130/158 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL R+G G ERLYP+GEVD IPAW+ LVI +L + V+P ++NSAVINDYQP
Sbjct: 104 GYTYGAQLQRRGPGQERLYPKGEVDEIPAWVNELVIRRLVEHRVIPEGFVNSAVINDYQP 163
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPVQRG VT+L
Sbjct: 164 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVQRGSVTVLS 223
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRLTLS 158
+AA+ ITHC+RPQD + RRAV++LR+ APRL +
Sbjct: 224 GYAADEITHCIRPQDIKERRAVVILRKTRTEAPRLEMK 261
>gi|118097886|ref|XP_001233463.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Gallus gallus]
Length = 374
Score = 241 bits (616), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 109/155 (70%), Positives = 128/155 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 116 GYTYGSQLQRRGPGQERLYPRGEVDAIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQP 175
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L
Sbjct: 176 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLS 235
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 236 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 270
>gi|148231788|ref|NP_001085704.1| RNA demethylase ALKBH5 [Xenopus laevis]
gi|82201093|sp|Q6GPB5.1|ALKB5_XENLA RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|49118231|gb|AAH73226.1| MGC80552 protein [Xenopus laevis]
Length = 360
Score = 241 bits (614), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 104/158 (65%), Positives = 129/158 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL R+G G ERLYP+GEVD IP W+ LVI +L + ++P ++NSAVINDYQP
Sbjct: 106 GYTYGAQLQRRGPGQERLYPKGEVDEIPGWVHELVIRRLVERRIIPEGFVNSAVINDYQP 165
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L
Sbjct: 166 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLS 225
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRLTLS 158
+AA+ ITHC+RPQD + RRAV++LR+ APRL +
Sbjct: 226 GYAADEITHCIRPQDIKERRAVVILRKTRTEAPRLEMK 263
>gi|224070277|ref|XP_002188303.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Taeniopygia guttata]
Length = 383
Score = 240 bits (613), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 109/155 (70%), Positives = 128/155 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 125 GYTYGSQLQRRGPGQERLYPRGEVDAIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQP 184
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L
Sbjct: 185 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLS 244
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 245 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 279
>gi|348537052|ref|XP_003456009.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Oreochromis niloticus]
Length = 357
Score = 240 bits (613), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 109/169 (64%), Positives = 133/169 (78%), Gaps = 8/169 (4%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLY +GEVD IP+W+ LVI +L GV+P ++NSAVINDYQP
Sbjct: 119 GYTYGAQLEKRGPGQERLYRKGEVDEIPSWVYELVINRLVTNGVIPEGFVNSAVINDYQP 178
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIFDRPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L
Sbjct: 179 GGCIVSHVDPLHIFDRPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLS 238
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL--------TLSQTP 161
+AA+ ITHC+RPQD + RRAVI+LR+ P APRL +L+Q P
Sbjct: 239 GYAADDITHCIRPQDIKERRAVIILRKTRPDAPRLGSDSSLRASLAQKP 287
>gi|327287260|ref|XP_003228347.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Anolis carolinensis]
Length = 379
Score = 239 bits (611), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 108/157 (68%), Positives = 128/157 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 122 GYTYGSQLQRRGPGQERLYPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQP 181
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L
Sbjct: 182 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLS 241
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRLTL 157
+AA+ ITHC+RPQD + RRAVI+LR+ APRL +
Sbjct: 242 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRLEM 278
>gi|387015598|gb|AFJ49918.1| alkylated DNA repair protein alkB-like 5 protein [Crotalus
adamanteus]
Length = 389
Score = 239 bits (609), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 108/155 (69%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL R+G G ERLYP GEVD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 130 GYTYGSQLQRRGPGQERLYPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQP 189
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L
Sbjct: 190 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLS 249
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 250 GYAADEITHCIRPQDIKERRAVIILRKTRVDAPRL 284
>gi|344298054|ref|XP_003420709.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Loxodonta africana]
Length = 394
Score = 238 bits (608), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 107/155 (69%), Positives = 128/155 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + GV+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHGVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|395514234|ref|XP_003761324.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Sarcophilus harrisii]
Length = 384
Score = 238 bits (608), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 107/155 (69%), Positives = 128/155 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 128 GYTYGSQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 187
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L
Sbjct: 188 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVRRGSVTVLS 247
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 248 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 282
>gi|126333790|ref|XP_001364359.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Monodelphis domestica]
Length = 384
Score = 238 bits (608), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 107/155 (69%), Positives = 128/155 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 128 GYTYGSQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVENRVIPEGFVNSAVINDYQP 187
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L
Sbjct: 188 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVRRGSVTVLS 247
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 248 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 282
>gi|116517268|ref|NP_001070855.1| RNA demethylase ALKBH5 [Danio rerio]
gi|123913930|sp|Q08BA6.1|ALKB5_DANRE RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|115527853|gb|AAI24805.1| AlkB, alkylation repair homolog 5 (E. coli) [Danio rerio]
Length = 352
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 128/155 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLY +GEVD IP W+ LVI +L GV+P ++NSAVINDYQP
Sbjct: 106 GYTYGAQLEKRGPGQERLYSKGEVDDIPDWVHELVIDRLVTHGVIPEGFVNSAVINDYQP 165
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L
Sbjct: 166 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFLFKPIRVSEPVLHLPVRRGSVTVLS 225
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 226 GYAADDITHCIRPQDIKERRAVIILRKTRADAPRL 260
>gi|260817481|ref|XP_002603615.1| hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]
gi|229288935|gb|EEN59626.1| hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]
Length = 314
Score = 238 bits (606), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 114/191 (59%), Positives = 145/191 (75%), Gaps = 2/191 (1%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL RKG G ERLY +G+VD IP WI +V+++L + +VP ++NSAVIN Y P
Sbjct: 125 GYTYGSQLERKGPGMERLYSKGDVDDIPDWIFKMVVSRLVEAKIVPEGFVNSAVINCYLP 184
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSHIDPPHIF RPI+S+S FSDSALCFGCKF+FKPIRVS+PVL LPV RGC T+L
Sbjct: 185 GGCIVSHIDPPHIFARPIVSVSFFSDSALCFGCKFSFKPIRVSKPVLCLPVARGCCTVLS 244
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARESGGY 180
D+AA+ ITHCVRPQD ++RRAVI+LRRV APR L+ T V+ H +D + ++
Sbjct: 245 DYAADEITHCVRPQDIKHRRAVIILRRVRDDAPR--LAPTTEVQMVHVNHDRREEDNRTS 302
Query: 181 FRSSIETYNNN 191
+ T++++
Sbjct: 303 TDDRLSTFHSD 313
>gi|47218956|emb|CAF98154.1| unnamed protein product [Tetraodon nigroviridis]
Length = 355
Score = 237 bits (605), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 126/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL ++G G ERLY +GEVD IP+W+ LVI L GV+P ++NSAVINDYQP
Sbjct: 116 GYTYGSQLEKRGPGQERLYRKGEVDEIPSWVHELVIKPLVSSGVIPEGFVNSAVINDYQP 175
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L
Sbjct: 176 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLS 235
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ P APRL
Sbjct: 236 GYAADSITHCIRPQDIKERRAVIILRKTRPDAPRL 270
>gi|449281378|gb|EMC88458.1| Alkylated DNA repair protein alkB like protein 5, partial [Columba
livia]
Length = 349
Score = 236 bits (603), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 108/155 (69%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL R+G G ERL P GEVD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 91 GYTYGSQLQRRGPGQERLLPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQP 150
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L
Sbjct: 151 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLS 210
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 211 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 245
>gi|347300461|ref|NP_001178572.1| RNA demethylase ALKBH5 [Rattus norvegicus]
gi|449061781|sp|D3ZKD3.1|ALKB5_RAT RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
Length = 395
Score = 236 bits (602), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 139 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 198
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 199 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 258
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 259 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|30354744|gb|AAH52076.1| AlkB, alkylation repair homolog 5 (E. coli) [Mus musculus]
Length = 395
Score = 236 bits (602), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 139 GYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKLVEHRVIPKGFVNSAVINDYQP 198
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 199 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 258
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 259 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|31044423|ref|NP_766531.2| RNA demethylase ALKBH5 [Mus musculus]
gi|114151682|sp|Q3TSG4.2|ALKB5_MOUSE RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|26343643|dbj|BAC35478.1| unnamed protein product [Mus musculus]
gi|74194466|dbj|BAE37281.1| unnamed protein product [Mus musculus]
Length = 395
Score = 236 bits (601), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 139 GYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 198
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 199 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 258
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 259 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|410980043|ref|XP_003996390.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Felis catus]
Length = 394
Score = 236 bits (601), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|73956209|ref|XP_851719.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Canis lupus familiaris]
Length = 394
Score = 236 bits (601), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|148539642|ref|NP_060228.3| RNA demethylase ALKBH5 [Homo sapiens]
gi|302565646|ref|NP_001180917.1| probable alpha-ketoglutarate-dependent dioxygenase ABH5 [Macaca
mulatta]
gi|397476905|ref|XP_003809831.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Pan paniscus]
gi|402898955|ref|XP_003912472.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Papio anubis]
gi|408359959|sp|Q6P6C2.2|ALKB5_HUMAN RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|119576065|gb|EAW55661.1| alkB, alkylation repair homolog 5 (E. coli), isoform CRA_a [Homo
sapiens]
gi|119576066|gb|EAW55662.1| alkB, alkylation repair homolog 5 (E. coli), isoform CRA_a [Homo
sapiens]
gi|387542580|gb|AFJ71917.1| putative alpha-ketoglutarate-dependent dioxygenase ABH5 [Macaca
mulatta]
gi|410221108|gb|JAA07773.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
gi|410251666|gb|JAA13800.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
gi|410300496|gb|JAA28848.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
gi|410335931|gb|JAA36912.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
Length = 394
Score = 236 bits (601), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|390462888|ref|XP_003732930.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5
[Callithrix jacchus]
Length = 392
Score = 236 bits (601), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 136 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEYRVIPEGFVNSAVINDYQP 195
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 196 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 255
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 256 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 290
>gi|395748658|ref|XP_003778808.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Pongo abelii]
Length = 394
Score = 236 bits (601), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|410917752|ref|XP_003972350.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Takifugu rubripes]
Length = 361
Score = 235 bits (600), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 107/155 (69%), Positives = 125/155 (80%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL ++G G ERLY +GEVD IP W+ LVI L GVVP ++NSAVINDYQP
Sbjct: 121 GYTYGSQLEKRGPGQERLYRKGEVDEIPNWVHELVIKPLVSNGVVPDGFVNSAVINDYQP 180
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L
Sbjct: 181 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVKRGSVTVLS 240
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ P APRL
Sbjct: 241 GYAADDITHCIRPQDIKERRAVIILRKTRPDAPRL 275
>gi|348560293|ref|XP_003465948.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Cavia porcellus]
Length = 394
Score = 235 bits (600), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|441642261|ref|XP_004090430.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5 [Nomascus
leucogenys]
Length = 394
Score = 235 bits (600), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 AYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|332848357|ref|XP_001158112.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Pan troglodytes]
Length = 394
Score = 235 bits (600), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|329664868|ref|NP_001192446.1| RNA demethylase ALKBH5 [Bos taurus]
gi|449061780|sp|E1BH29.1|ALKB5_BOVIN RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
repair protein alkB homolog 5; AltName:
Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 5
gi|296476634|tpg|DAA18749.1| TPA: hypothetical protein BOS_18839 [Bos taurus]
Length = 394
Score = 235 bits (599), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|426349222|ref|XP_004042212.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Gorilla gorilla gorilla]
gi|7020317|dbj|BAA91078.1| unnamed protein product [Homo sapiens]
Length = 378
Score = 235 bits (599), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 122 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 181
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 182 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 241
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 242 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 276
>gi|432847440|ref|XP_004066024.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Oryzias latipes]
Length = 361
Score = 234 bits (598), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 104/155 (67%), Positives = 126/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLY +GEVD IP+W+ LVI L GV+P ++NSAVINDYQP
Sbjct: 122 GYTYGAQLEKRGPGQERLYRKGEVDDIPSWVHELVIKPLVSNGVIPEGFVNSAVINDYQP 181
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV LPV+RG VT+L
Sbjct: 182 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLS 241
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ P APR+
Sbjct: 242 GYAADEITHCIRPQDIKQRRAVIILRKTRPDAPRV 276
>gi|26343649|dbj|BAC35481.1| unnamed protein product [Mus musculus]
Length = 395
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 105/155 (67%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 139 GYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 198
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCI+SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 199 GGCILSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 258
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 259 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|355668039|gb|AER94060.1| alkB, alkylation repair-like protein 5 [Mustela putorius furo]
Length = 340
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 84 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 143
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 144 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 203
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 204 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 238
>gi|351707716|gb|EHB10635.1| Alkylated DNA repair protein alkB-like protein 5, partial
[Heterocephalus glaber]
Length = 349
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 94 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 153
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 154 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 213
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 214 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 248
>gi|380800137|gb|AFE71944.1| putative alpha-ketoglutarate-dependent dioxygenase ABH5, partial
[Macaca mulatta]
Length = 358
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 102 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 161
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 162 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 221
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 222 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 256
>gi|417410040|gb|JAA51501.1| Putative alkylated dna repair protein alkb log, partial [Desmodus
rotundus]
Length = 359
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 103 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 162
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 163 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 222
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 223 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 257
>gi|291224533|ref|XP_002732258.1| PREDICTED: alkB, alkylation repair homolog 5-like [Saccoglossus
kowalevskii]
Length = 362
Score = 234 bits (596), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 104/155 (67%), Positives = 124/155 (80%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL +KG G ERLYP GEVD IP W+Q LVI +L ++P ++NSAVINDY+P
Sbjct: 115 GYTYGTQLDKKGPGQERLYPRGEVDDIPGWVQELVIKRLVSSKMIPDGFVNSAVINDYKP 174
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI S+S SD ALCFGCKF+FKPIR S P+L LP+ RGCVTLL
Sbjct: 175 GGCIVSHVDPIHIFERPIASVSFMSDCALCFGCKFSFKPIRTSRPLLSLPLTRGCVTLLS 234
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AAN ITHC+RPQD + RRAVI+LR+V APRL
Sbjct: 235 GYAANDITHCIRPQDVKARRAVIILRKVRDDAPRL 269
>gi|431914502|gb|ELK15752.1| Alkylated DNA repair protein alkB like protein 5 [Pteropus alecto]
Length = 303
Score = 233 bits (595), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 47 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 106
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 107 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 166
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 167 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 201
>gi|194375187|dbj|BAG62706.1| unnamed protein product [Homo sapiens]
Length = 383
Score = 233 bits (594), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 127 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 186
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 187 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 246
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 247 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 281
>gi|390345597|ref|XP_784548.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Strongylocentrotus purpuratus]
Length = 462
Score = 233 bits (594), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 106/165 (64%), Positives = 127/165 (76%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL +KG G ERLYP GEVD IP W+Q LV++++ + G+VP +INSAVINDYQP
Sbjct: 120 GYTYGSQLSKKGPGQERLYPPGEVDDIPQWVQDLVVSRVVEAGLVPHGFINSAVINDYQP 179
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIFDRPIIS+S SD AL FGCKF+F+PIRVS+P +P+ RGCV L
Sbjct: 180 GGCIVSHVDPIHIFDRPIISVSFLSDCALSFGCKFSFRPIRVSKPKACIPLPRGCVMTLS 239
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRLTLSQTPRVKP 165
+AA+ +THC+RPQD RRAVI+LRRV P APRL P P
Sbjct: 240 GYAADEVTHCIRPQDVTARRAVIILRRVFPEAPRLEPVGLPSPAP 284
>gi|194217772|ref|XP_001918245.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like, partial [Equus caballus]
Length = 277
Score = 232 bits (592), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 21 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 80
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 81 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 140
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 141 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 175
>gi|355568306|gb|EHH24587.1| hypothetical protein EGK_08266, partial [Macaca mulatta]
Length = 384
Score = 232 bits (591), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 83 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 142
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 143 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 202
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 203 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 237
>gi|38383101|gb|AAH62339.1| AlkB, alkylation repair homolog 5 (E. coli) [Homo sapiens]
Length = 458
Score = 232 bits (591), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 138 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 197
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 198 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 257
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 258 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 292
>gi|355753817|gb|EHH57782.1| hypothetical protein EGM_07489, partial [Macaca fascicularis]
Length = 383
Score = 232 bits (591), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 106/155 (68%), Positives = 127/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 83 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 142
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 143 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 202
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 203 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 237
>gi|74187512|dbj|BAE36711.1| unnamed protein product [Mus musculus]
Length = 395
Score = 230 bits (587), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 104/155 (67%), Positives = 126/155 (81%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 139 GYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 198
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FSDSALCFG +F FKPIRVSEPVL LPV+RG VT+L
Sbjct: 199 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGWQFQFKPIRVSEPVLSLPVRRGSVTVLS 258
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 259 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|156381190|ref|XP_001632149.1| predicted protein [Nematostella vectensis]
gi|156219200|gb|EDO40086.1| predicted protein [Nematostella vectensis]
Length = 256
Score = 220 bits (560), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 100/147 (68%), Positives = 118/147 (80%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG QL +G G+ERLY G+VD IP WI LVI++LE+ G+VP ++INSAVINDYQP
Sbjct: 110 GYTYGKQLEERGPGSERLYARGDVDDIPEWIYELVISRLEKAGIVPKDFINSAVINDYQP 169
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSHIDP HIFDRPI+S S FS L FGCKF+FKPIR ++P+L LP+ RGCVT L
Sbjct: 170 GGCIVSHIDPMHIFDRPIVSCSFFSSCTLSFGCKFSFKPIRTTDPILSLPLPRGCVTALS 229
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRR 147
+AA+ ITHCVRPQD RRAVI+LRR
Sbjct: 230 GYAADEITHCVRPQDVVTRRAVIILRR 256
>gi|198418993|ref|XP_002122576.1| PREDICTED: similar to alkB, alkylation repair homolog 5, partial
[Ciona intestinalis]
Length = 308
Score = 215 bits (548), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 100/198 (50%), Positives = 139/198 (70%), Gaps = 1/198 (0%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG Q+ +KG G ERLY +G VDPIP+W+ ++ LE G++ +I SAVINDY P
Sbjct: 50 GYTYGKQMNQKGPGQERLYQKGVVDPIPSWVFKKLVKPLEDCGMIKKKWITSAVINDYYP 109
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSH+DP HIF+RPI+S+S FS +AL FGCKF+FKPIRVS+PV+ LP+ RG T +
Sbjct: 110 GGCIVSHVDPAHIFERPIVSVSFFSTAALSFGCKFSFKPIRVSDPVVCLPIPRGVATSIS 169
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARESGGY 180
+AA+ ITHC+RPQD + RRAVI+LRR +APR + V + +L + + R S
Sbjct: 170 GYAADSITHCIRPQDIRQRRAVIILRRTFDNAPRQG-EDSSLVSHHRTLNNHEDRRSSNR 228
Query: 181 FRSSIETYNNNENHSNKN 198
R S++++ + H+ ++
Sbjct: 229 HRLSLDSHRDRRKHTRRS 246
>gi|221119302|ref|XP_002156519.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Hydra magnipapillata]
Length = 329
Score = 208 bits (530), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 97/155 (62%), Positives = 115/155 (74%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG + KG G ERL+ +GEVD IP WIQ VI KL VVP +INS VIN+Y P
Sbjct: 133 GYTYGKHMENKGPGQERLFKKGEVDKIPNWIQKHVIKKLYDDKVVPEGFINSVVINEYFP 192
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSHIDP HIFDRPIIS++ + S L FGCKF F PIR SEPVL LP+ RGC+T++
Sbjct: 193 GGCIVSHIDPIHIFDRPIISINFNTRSFLSFGCKFTFNPIRTSEPVLSLPMDRGCLTMIS 252
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RP D RR VI+LRRV P+APR+
Sbjct: 253 GYAADNITHCIRPCDITERRCVIILRRVFPNAPRV 287
>gi|405958373|gb|EKC24507.1| Alkylated DNA repair protein alkB-like protein 5 [Crassostrea
gigas]
Length = 347
Score = 199 bits (507), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 89/122 (72%), Positives = 101/122 (82%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYGSQL +KG G ERLYP+GEVD IP WI LVI L +VP ++NSAVINDY P
Sbjct: 116 GYTYGSQLEKKGPGMERLYPKGEVDEIPDWIHELVIKPLYDAKIVPEGFVNSAVINDYMP 175
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCIVSHIDPPHIFDRPI+S+S FSDSALCFGCKF+F+PIRVS PVL LP+ RGCVT++R
Sbjct: 176 GGCIVSHIDPPHIFDRPIVSVSFFSDSALCFGCKFSFRPIRVSTPVLCLPIDRGCVTMIR 235
Query: 121 DF 122
F
Sbjct: 236 VF 237
>gi|335310671|ref|XP_003362141.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Sus scrofa]
Length = 396
Score = 193 bits (490), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 95/154 (61%), Positives = 111/154 (72%), Gaps = 14/154 (9%)
Query: 4 YGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 63
YG+QL ++G G ERLYP G+VD IP W SAVINDYQPGGC
Sbjct: 152 YGAQLQKRGPGQERLYPPGDVDEIPEWXXXXXXXX------------XSAVINDYQPGGC 199
Query: 64 IVSHI--DPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD 121
IVSH+ DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 200 IVSHVHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSG 259
Query: 122 FAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 260 YAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293
>gi|443714465|gb|ELU06866.1| hypothetical protein CAPTEDRAFT_35874, partial [Capitella teleta]
Length = 230
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/150 (66%), Positives = 120/150 (80%), Gaps = 3/150 (2%)
Query: 1 GYTYGSQLVRKGLGNERLYP-EGE--VDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 57
GYTYGS + +G G ERLYP EGE VDPIP WI +V+ L + ++PP++INSAVIND
Sbjct: 81 GYTYGSHMEARGPGMERLYPKEGEESVDPIPEWIHEMVVQPLLRAQLIPPDFINSAVIND 140
Query: 58 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVT 117
YQPGGCIVSHIDP HIFDRPI+++S FS SAL FGCKF F+PIRVS PVL + + RG VT
Sbjct: 141 YQPGGCIVSHIDPYHIFDRPIVTVSFFSSSALSFGCKFEFRPIRVSNPVLSVKLPRGGVT 200
Query: 118 LLRDFAANGITHCVRPQDTQYRRAVILLRR 147
L+ FAA+ ITHC+RPQDT +RRAV++LRR
Sbjct: 201 LISGFAADEITHCIRPQDTPHRRAVVILRR 230
>gi|403275437|ref|XP_003945296.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5 [Saimiri
boliviensis boliviensis]
Length = 386
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 85/119 (71%), Positives = 100/119 (84%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI KL + V+P ++NSAVINDYQP
Sbjct: 127 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQP 186
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 119
GGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 187 GGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVL 245
>gi|354467887|ref|XP_003496399.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like, partial [Cricetulus griseus]
Length = 203
Score = 162 bits (411), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 74/101 (73%), Positives = 86/101 (85%)
Query: 55 INDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRG 114
INDYQPGGCIVSH+D HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG
Sbjct: 1 INDYQPGGCIVSHVDQIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRG 60
Query: 115 CVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
VT+L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 61 SVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 101
>gi|313240619|emb|CBY32944.1| unnamed protein product [Oikopleura dioica]
Length = 462
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 98/147 (66%), Gaps = 6/147 (4%)
Query: 15 NERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIF 74
+E+ + ++ PIP WI +I K+E+ G+V N+INS VINDY+PGG IV H DPPH+F
Sbjct: 96 SEKFFNPKDISPIPDWIYKNIIEKMEKAGIVEKNWINSVVINDYEPGGFIVQHQDPPHLF 155
Query: 75 DRPIISLSLFSDSALCFGCKFNF----KPIRV--SEPVLYLPVQRGCVTLLRDFAANGIT 128
RPI L+LFSDSAL FGC F +P+ V S+PVL LP+QRG +T N I
Sbjct: 156 QRPIFILTLFSDSALSFGCNLRFDRSVEPVEVTASDPVLRLPMQRGIITSFEGMCMNEIK 215
Query: 129 HCVRPQDTQYRRAVILLRRVLPHAPRL 155
HCVRP+DT+ RR I+LRRV + P L
Sbjct: 216 HCVRPEDTKERRVAIILRRVCNNGPVL 242
>gi|313225990|emb|CBY21133.1| unnamed protein product [Oikopleura dioica]
Length = 462
Score = 160 bits (406), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 98/147 (66%), Gaps = 6/147 (4%)
Query: 15 NERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIF 74
+E+ + ++ PIP WI +I K+E+ G+V N+INS VINDY+PGG IV H DPPH+F
Sbjct: 96 SEKFFNPKDISPIPDWIYKNIIGKMEKAGIVEKNWINSVVINDYEPGGFIVQHQDPPHLF 155
Query: 75 DRPIISLSLFSDSALCFGCKFNF----KPIRV--SEPVLYLPVQRGCVTLLRDFAANGIT 128
RPI L+LFSDSAL FGC F +P+ V S+PVL LP+QRG +T N I
Sbjct: 156 QRPIFILTLFSDSALSFGCNLRFDRSVEPVEVTASDPVLRLPMQRGIITSFEGMCMNEIK 215
Query: 129 HCVRPQDTQYRRAVILLRRVLPHAPRL 155
HCVRP+DT+ RR I+LRRV + P L
Sbjct: 216 HCVRPEDTKERRVAIILRRVCNNGPVL 242
>gi|426238911|ref|XP_004023593.1| PREDICTED: LOW QUALITY PROTEIN: probable
alpha-ketoglutarate-dependent dioxygenase ABH5, partial
[Ovis aries]
Length = 312
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 96/155 (61%), Gaps = 38/155 (24%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++G G ERLYP G+VD IP W+ LVI
Sbjct: 94 GYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVI------------------------ 129
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 130 --------------ERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLS 175
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 176 GYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 210
>gi|351709131|gb|EHB12050.1| Alkylated DNA repair protein alkB-like protein 5 [Heterocephalus
glaber]
Length = 464
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 94/155 (60%), Gaps = 34/155 (21%)
Query: 1 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 60
GYTYG+QL ++ G ERLYP G+VD IP W LVI KL + V+P ++NSAVINDYQP
Sbjct: 163 GYTYGAQLQKRRPGQERLYPPGDVDEIPEWGPRLVIQKLVEHRVIPEGFVNSAVINDYQP 222
Query: 61 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 120
GGCI+S +S FSDSALCFGCK FKPIRVSEPVL LPV+R
Sbjct: 223 GGCIMS------------VS---FSDSALCFGCKLQFKPIRVSEPVLSLPVRRR------ 261
Query: 121 DFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+ Q RRAVI+LR+ APRL
Sbjct: 262 -------------KRHQERRAVIILRKTRLDAPRL 283
>gi|296216728|ref|XP_002754700.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Callithrix jacchus]
Length = 231
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 53/74 (71%), Positives = 61/74 (82%)
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L +AA+ ITHC+RPQD + RRA
Sbjct: 56 SFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRA 115
Query: 142 VILLRRVLPHAPRL 155
VI+LR+ APRL
Sbjct: 116 VIILRKTRLDAPRL 129
>gi|118352562|ref|XP_001009552.1| hypothetical protein TTHERM_00371220 [Tetrahymena thermophila]
gi|89291319|gb|EAR89307.1| hypothetical protein TTHERM_00371220 [Tetrahymena thermophila SB210]
Length = 1999
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 7/129 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISL 81
+V P+W++ V+ LEQ G++PP +INS +N Y G + H D F +PI +
Sbjct: 1822 DVSQPPSWMRRKVVGPLEQAGILPPGFINSLALNVYHDGKEGLAQHFDDATRFRQPIFTF 1881
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVRPQDTQYR 139
LFSD+ L FG + +P+ RG V L + +AANGI HCVRP D +
Sbjct: 1882 KLFSDARLSFGSQL----YGFCNSAFVIPMPRGSVLKLEENSYAANGIKHCVRPCDMAGK 1937
Query: 140 RAVILLRRV 148
A I++R++
Sbjct: 1938 NATIIMRQM 1946
>gi|340508382|gb|EGR34097.1| hypothetical protein IMG5_024190 [Ichthyophthirius multifiliis]
Length = 798
Score = 84.0 bits (206), Expect = 5e-14, Method: Composition-based stats.
Identities = 46/129 (35%), Positives = 68/129 (52%), Gaps = 7/129 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISL 81
+V P WI+ V+ KLE ++P +INS +N Y G + H D F +PI +
Sbjct: 628 DVSETPLWIKKKVVQKLENANILPKKFINSLALNVYHDGNEGLAQHFDDATRFMQPIFTF 687
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVRPQDTQYR 139
LFSD+ L FG +F +P+ RG + L + +AANGI HCVRP D +
Sbjct: 688 KLFSDARLSFGSQF----YGFCNSAFVIPMPRGSILKLEENSYAANGIKHCVRPCDMTGK 743
Query: 140 RAVILLRRV 148
A I++R++
Sbjct: 744 NATIIMRQM 752
>gi|302841282|ref|XP_002952186.1| hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]
gi|300262451|gb|EFJ46657.1| hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]
Length = 2654
Score = 83.6 bits (205), Expect = 6e-14, Method: Composition-based stats.
Identities = 54/142 (38%), Positives = 73/142 (51%), Gaps = 14/142 (9%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG--------CIVSHIDPPHIF 74
+V P W++ V L + GVVP +++S +N Y G SH D F
Sbjct: 1687 DVPPPLRWMRHQVEAPLVRCGVVPRGFVDSIALNLYHDGSEGIQVGGGGEGSHYDDAVRF 1746
Query: 75 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVR 132
RPI S+ LFSDS L FG + ++P+ RGCVT+L D +AANGI HCVR
Sbjct: 1747 RRPIYSVRLFSDSRLSFGGHLYGN----TNNDFFVPMPRGCVTVLEDESYAANGIKHCVR 1802
Query: 133 PQDTQYRRAVILLRRVLPHAPR 154
P D + A ++LR + P A R
Sbjct: 1803 PADMAGKSAALILRGIDPGAMR 1824
>gi|145483981|ref|XP_001428013.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395096|emb|CAK60615.1| unnamed protein product [Paramecium tetraurelia]
Length = 1283
Score = 83.2 bits (204), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 69/129 (53%), Gaps = 8/129 (6%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISL 81
+V P W+++L ITKLE VVP +INS N Y G + H D F +PI ++
Sbjct: 1115 DVSAPPLWMRNL-ITKLENDNVVPKKFINSIACNIYHDGKEGLAQHFDDAVRFKQPIFTI 1173
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVRPQDTQYR 139
+FSD L FG +F +P+ RGC+ + + +AAN I HCVRP D +
Sbjct: 1174 RVFSDCRLSFGSQF----YGFCNGAFAVPLPRGCILCMEEGSYAANAIKHCVRPCDMTGK 1229
Query: 140 RAVILLRRV 148
A ++LR++
Sbjct: 1230 SAALILRQM 1238
>gi|145520431|ref|XP_001446071.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124413548|emb|CAK78674.1| unnamed protein product [Paramecium tetraurelia]
Length = 941
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 69/129 (53%), Gaps = 8/129 (6%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISL 81
+V P W+++L ITKLE VVP +INS N Y G + H D F +PI ++
Sbjct: 773 DVSAPPLWMRNL-ITKLENDNVVPKKFINSIACNVYHDGKEGLAQHFDDAVRFKQPIYTI 831
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVRPQDTQYR 139
+FSD L FG +F +P+ RGC+ + + +AAN I HCVRP D +
Sbjct: 832 RVFSDCRLSFGSQF----YGFCNGAFAVPLPRGCILCMEEGSYAANAIKHCVRPCDMTGK 887
Query: 140 RAVILLRRV 148
A ++LR++
Sbjct: 888 SAALILRQM 896
>gi|403365523|gb|EJY82547.1| hypothetical protein OXYTRI_19840 [Oxytricha trifallax]
Length = 1315
Score = 80.5 bits (197), Expect = 4e-13, Method: Composition-based stats.
Identities = 40/133 (30%), Positives = 73/133 (54%), Gaps = 7/133 (5%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIIS 80
+V P W+++++ L G++ ++INS +N Y G + H D F +PI +
Sbjct: 938 ADVSQPPVWMKTILEEPLVNAGIIEKDFINSVALNVYHDGSEGLAQHFDDATRFKQPIYT 997
Query: 81 LSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVRPQDTQY 138
L LFSDS L FG ++ +P+ RGC+ ++ + ++ANG+ HC+RP D
Sbjct: 998 LRLFSDSRLSFGSQY----YGFCNGAFTIPLPRGCICVMEEGSYSANGVKHCIRPCDMTG 1053
Query: 139 RRAVILLRRVLPH 151
+ + ++LR++ P+
Sbjct: 1054 KSSAVILRQMHPN 1066
>gi|303274614|ref|XP_003056625.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462709|gb|EEH60001.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 897
Score = 80.5 bits (197), Expect = 5e-13, Method: Composition-based stats.
Identities = 43/129 (33%), Positives = 69/129 (53%), Gaps = 6/129 (4%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNY-INSAVINDYQPGGCIVS-HIDPPHIFDRPIISLSL 83
PIP+W+ S + + + +G + N I++AV+N Y +S H+DP +F RPI+S
Sbjct: 736 PIPSWVHSTLGNRAKIIGAIDKNVVIDNAVVNLYARSKARLSVHMDPSALFKRPIVSARF 795
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
F D L FG K ++ R+ +P+ RG + ++ +AAN +TH V +D R
Sbjct: 796 FGDGVLSFGAKGQYEGQRIHS----VPLTRGSIAVMEGYAANMVTHAVNERDVVGRGCSF 851
Query: 144 LLRRVLPHA 152
+LR P A
Sbjct: 852 MLRSCQPDA 860
>gi|440799034|gb|ELR20095.1| SAM domain (Sterile alpha motif) domain containing protein
[Acanthamoeba castellanii str. Neff]
Length = 1394
Score = 79.3 bits (194), Expect = 1e-12, Method: Composition-based stats.
Identities = 45/127 (35%), Positives = 69/127 (54%), Gaps = 7/127 (5%)
Query: 28 PAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISLSLFSD 86
P W++ L+ L + +V ++N +N Y G + H D F RPI +L +FSD
Sbjct: 742 PFWMRQLIERPLVESRIVQEAWVNEFAMNIYHDGTEGLGQHYDDKKRFKRPIHTLRIFSD 801
Query: 87 SALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL--RDFAANGITHCVRPQDTQYRRAVIL 144
S L FG K ++ V ++P+ RGC+T+L + +A +GI HCVR D + V+L
Sbjct: 802 SRLSFGSK----DFSMTNSVFFIPMPRGCITVLEKKGYAVDGIKHCVRAADMGGKSGVVL 857
Query: 145 LRRVLPH 151
LR + PH
Sbjct: 858 LRHIHPH 864
>gi|384246841|gb|EIE20330.1| hypothetical protein COCSUDRAFT_67586 [Coccomyxa subellipsoidea
C-169]
Length = 988
Score = 75.5 bits (184), Expect = 2e-11, Method: Composition-based stats.
Identities = 42/116 (36%), Positives = 62/116 (53%), Gaps = 7/116 (6%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISL 81
+V P P W+Q+L L + G+VP ++ ++ +N Y G I SH D F RPI S+
Sbjct: 851 DVPPAPTWMQALAEKPLVETGLVPKDFFDAWALNLYHDGSEGIQSHFDDGTRFSRPIFSV 910
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR--DFAANGITHCVRPQD 135
LFSDS L FG + + + + RGC+T++ +A G+ HCVRP D
Sbjct: 911 RLFSDSRLSFGTQL----YGYTNGAFTVDMPRGCITVMEAGGYAVEGVKHCVRPAD 962
>gi|403336135|gb|EJY67254.1| hypothetical protein OXYTRI_12242 [Oxytricha trifallax]
Length = 943
Score = 68.2 bits (165), Expect = 3e-09, Method: Composition-based stats.
Identities = 37/131 (28%), Positives = 65/131 (49%), Gaps = 23/131 (17%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
+V P W+ +++K+E+ G+V N+INS +N Y G L+
Sbjct: 755 DVSRPPVWVNEFMVSKMEKCGIVEKNFINSIAMNVYHDG----------------TEGLA 798
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGITHCVRPQDTQYRR 140
D A F + F ++P+ RGC+ ++ + +AANG+ HC+RPQD +
Sbjct: 799 QHYDDATRFKQFYGF-----VNGAFFIPLARGCICVMEENGYAANGVKHCIRPQDMTGKS 853
Query: 141 AVILLRRVLPH 151
A ++LR++ P+
Sbjct: 854 AALILRQMHPN 864
>gi|307105467|gb|EFN53716.1| hypothetical protein CHLNCDRAFT_136563 [Chlorella variabilis]
Length = 521
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 59/129 (45%), Gaps = 30/129 (23%)
Query: 28 PAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISLSLFSD 86
P W++ V L G+V P + +S +N Y G I SH D F +PI+SL LFSD
Sbjct: 319 PRWMRQKVEAPLVGAGLVAPGFCDSIALNIYHDGSEGIQSHYDDAARFCQPILSLRLFSD 378
Query: 87 SALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLR 146
++ G +AANGI HCVRP D + A ++LR
Sbjct: 379 M-----------------------MEEG------GYAANGIKHCVRPMDMSGKSAGMILR 409
Query: 147 RVLPHAPRL 155
R+ P AP L
Sbjct: 410 RINPPAPAL 418
>gi|384245374|gb|EIE18868.1| hypothetical protein COCSUDRAFT_20284, partial [Coccomyxa
subellipsoidea C-169]
Length = 246
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 69/135 (51%), Gaps = 6/135 (4%)
Query: 18 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 76
+ E V+P+P +Q+LV +L + GV+P + +SA+IN Y CI HID H F R
Sbjct: 101 IIAEEVVEPMPPMLQALV-HRLVRWGVMPRSKAPDSAIINIYDQEDCIPPHIDH-HDFSR 158
Query: 77 PIISLSLFSDSALCFGCKF-NFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQD 135
P ++SL S+ A+ FG K P + +P+ G +L+ A+ HCV P
Sbjct: 159 PFCTISLLSEQAIMFGAKLIPLGPGKFGGNHCTIPLPVGSCLVLKGNGADVAMHCVPP-- 216
Query: 136 TQYRRAVILLRRVLP 150
RR I LR+ P
Sbjct: 217 VSQRRMSITLRKCAP 231
>gi|356552441|ref|XP_003544576.1| PREDICTED: uncharacterized protein LOC100797674 [Glycine max]
Length = 522
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 65/128 (50%), Gaps = 5/128 (3%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 80
G VDPIPA + ++I +L + V+PP + +S ++N Y+ G CI HID H F RP +
Sbjct: 272 GMVDPIPALFK-VIIRRLIKWHVLPPTCVPDSCIVNIYEEGDCIPPHID-NHDFVRPFCT 329
Query: 81 LSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
+S S+ + FG + + +P+ G V +L A+ HCV T +R
Sbjct: 330 VSFLSECNIVFGSNLKIVGPGEFDGSIAIPLPMGSVLVLNGNGADVAKHCVPAVPT--KR 387
Query: 141 AVILLRRV 148
I RR+
Sbjct: 388 ISITFRRM 395
>gi|356562125|ref|XP_003549324.1| PREDICTED: uncharacterized protein LOC100787321 [Glycine max]
Length = 520
Score = 59.7 bits (143), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 64/128 (50%), Gaps = 5/128 (3%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 80
G VDPIP + ++I +L + V+PP + +S ++N Y+ G CI HID H F RP +
Sbjct: 270 GMVDPIPDLFK-VIIRRLVKWHVLPPTCVPDSCIVNIYEEGDCIPPHID-NHDFVRPFCT 327
Query: 81 LSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
+S S+ + FG + + +P+ G V +L A+ HCV T +R
Sbjct: 328 VSFLSECNIVFGSNLKIVGPGEFDGSIAIPLPMGSVLVLNGNGADVAKHCVPAVPT--KR 385
Query: 141 AVILLRRV 148
I RR+
Sbjct: 386 ISITFRRM 393
>gi|449434166|ref|XP_004134867.1| PREDICTED: uncharacterized protein LOC101203292 [Cucumis sativus]
gi|449531418|ref|XP_004172683.1| PREDICTED: uncharacterized protein LOC101225118 [Cucumis sativus]
Length = 499
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 66/127 (51%), Gaps = 7/127 (5%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P+ + ++I +L + V+PP + +S ++N Y G CI HID H F RP ++S
Sbjct: 298 VDPLPSLFK-VIIRRLVRWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFVRPFCTVS 355
Query: 83 LFSDSALCFGCKFNF-KPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S+ + FG + P S P+ +P+ G V +L A+ HCV T +R
Sbjct: 356 FLSECNIVFGTNLSIVGPGEFSGPIA-IPLPVGSVLVLNGNGADVAKHCVPAVPT--KRI 412
Query: 142 VILLRRV 148
I RR+
Sbjct: 413 SITFRRI 419
>gi|224094961|ref|XP_002310305.1| predicted protein [Populus trichocarpa]
gi|222853208|gb|EEE90755.1| predicted protein [Populus trichocarpa]
Length = 245
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 5/126 (3%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP + ++I +L + V+PP + +S ++N Y+ G CI HID H F RP ++S
Sbjct: 34 VDPIPVLFK-VIIRRLVRWHVIPPTCVPDSCIVNIYEEGDCIPPHIDN-HDFVRPFCTVS 91
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
S+ + FG + + + +P+ G +L AA+ HCV RR
Sbjct: 92 FLSECNMLFGSHLKIVGVGDFDGPIAIPLPVGSALVLNGNAADVAKHCV--PSVHKRRIS 149
Query: 143 ILLRRV 148
I R++
Sbjct: 150 ITFRKM 155
>gi|115466278|ref|NP_001056738.1| Os06g0138200 [Oryza sativa Japonica Group]
gi|55296462|dbj|BAD68658.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein-like [Oryza
sativa Japonica Group]
gi|113594778|dbj|BAF18652.1| Os06g0138200 [Oryza sativa Japonica Group]
Length = 616
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 79/181 (43%), Gaps = 10/181 (5%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DP+P+ + ++I +L + V+P I +S ++N Y PG CI HID H F RP ++S
Sbjct: 282 DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVNIYDPGDCIPPHID-SHDFVRPFCTVSF 339
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
S+ + FG L +P+ G V +L A+ HCV T +R I
Sbjct: 340 LSECNILFGSTLKIAGPGEFTGSLPIPLPVGSVLILNGNGADVAKHCVPAVPT--KRISI 397
Query: 144 LLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQARES--GGYFRSSIETYNNNENHSNKN 198
R++ P P + P + R S G + I+ N ++ H NK
Sbjct: 398 TFRKMDPAKRPFNFRDDPELLNIIPLETAVQETGRSSDEGKGKQPDIQIRNPSKAHRNKK 457
Query: 199 T 199
+
Sbjct: 458 S 458
>gi|218197526|gb|EEC79953.1| hypothetical protein OsI_21553 [Oryza sativa Indica Group]
Length = 600
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 79/181 (43%), Gaps = 10/181 (5%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DP+P+ + ++I +L + V+P I +S ++N Y PG CI HID H F RP ++S
Sbjct: 267 DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVNIYDPGDCIPPHID-SHDFVRPFCTVSF 324
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
S+ + FG L +P+ G V +L A+ HCV T +R I
Sbjct: 325 LSECNILFGSTLKIAGPGEFTGSLPIPLPVGSVLILNGNGADVAKHCVPAVPT--KRISI 382
Query: 144 LLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQARES--GGYFRSSIETYNNNENHSNKN 198
R++ P P + P + R S G + I+ N ++ H NK
Sbjct: 383 TFRKMDPAKRPFNFRDDPELLNIIPLETAVQETGRSSDEGKGKQPDIQIRNPSKAHRNKK 442
Query: 199 T 199
+
Sbjct: 443 S 443
>gi|222634922|gb|EEE65054.1| hypothetical protein OsJ_20056 [Oryza sativa Japonica Group]
Length = 602
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 79/181 (43%), Gaps = 10/181 (5%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DP+P+ + ++I +L + V+P I +S ++N Y PG CI HID H F RP ++S
Sbjct: 268 DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVNIYDPGDCIPPHID-SHDFVRPFCTVSF 325
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
S+ + FG L +P+ G V +L A+ HCV T +R I
Sbjct: 326 LSECNILFGSTLKIAGPGEFTGSLPIPLPVGSVLILNGNGADVAKHCVPAVPT--KRISI 383
Query: 144 LLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQARES--GGYFRSSIETYNNNENHSNKN 198
R++ P P + P + R S G + I+ N ++ H NK
Sbjct: 384 TFRKMDPAKRPFNFRDDPELLNIIPLETAVQETGRSSDEGKGKQPDIQIRNPSKAHRNKK 443
Query: 199 T 199
+
Sbjct: 444 S 444
>gi|79326336|ref|NP_001031793.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|51971445|dbj|BAD44387.1| hypothetical protein [Arabidopsis thaliana]
gi|332661216|gb|AEE86616.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 452
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 85/188 (45%), Gaps = 13/188 (6%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 80
G+VDP+P+ I ++I +L V+PP + +S ++N Y+ CI HID H F RP +
Sbjct: 226 GDVDPMPS-IFKVIIKRLVGWHVLPPTCVPDSCIVNIYEEDDCIPPHID-NHDFLRPFCT 283
Query: 81 LSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
+S S+ + FG +P+ G V +L+ A+ HCV T +R
Sbjct: 284 VSFLSECNILFGSNLKVLGPGEFSGSYSIPLPVGSVLVLKGNGADVAKHCVPAVPT--KR 341
Query: 141 AVILLRRVLPHAPRLTLSQTPRVK-----PY-HSLYDVQARESGGYFRSSIETYNNNENH 194
I R++ + + P ++ PY H+ A RSSI+ N NH
Sbjct: 342 ISITFRKMDESKRPVGFTPEPDLEEIKPLPYEHTTPSTPADAVISSSRSSID--QNGSNH 399
Query: 195 SNKNTSNG 202
+N+ G
Sbjct: 400 NNRTAHGG 407
>gi|79326344|ref|NP_001031794.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|55819794|gb|AAV66092.1| At4g36090 [Arabidopsis thaliana]
gi|59958356|gb|AAX12888.1| At4g36090 [Arabidopsis thaliana]
gi|332661217|gb|AEE86617.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 520
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 85/188 (45%), Gaps = 13/188 (6%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 80
G+VDP+P+ I ++I +L V+PP + +S ++N Y+ CI HID H F RP +
Sbjct: 294 GDVDPMPS-IFKVIIKRLVGWHVLPPTCVPDSCIVNIYEEDDCIPPHID-NHDFLRPFCT 351
Query: 81 LSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
+S S+ + FG +P+ G V +L+ A+ HCV T +R
Sbjct: 352 VSFLSECNILFGSNLKVLGPGEFSGSYSIPLPVGSVLVLKGNGADVAKHCVPAVPT--KR 409
Query: 141 AVILLRRVLPHAPRLTLSQTPRVK-----PY-HSLYDVQARESGGYFRSSIETYNNNENH 194
I R++ + + P ++ PY H+ A RSSI+ N NH
Sbjct: 410 ISITFRKMDESKRPVGFTPEPDLEEIKPLPYEHTTPSTPADAVISSSRSSID--QNGSNH 467
Query: 195 SNKNTSNG 202
+N+ G
Sbjct: 468 NNRTAHGG 475
>gi|147857702|emb|CAN80815.1| hypothetical protein VITISV_020466 [Vitis vinifera]
Length = 849
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 7/111 (6%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP + ++I +L + V+PP+ + +S ++N Y+ G CI HID H F RP ++S
Sbjct: 641 VDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIYEEGDCIPPHID-NHDFVRPFCTVS 698
Query: 83 LFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCVTLLRDFAANGITHCV 131
S+ + FG N K + E V +P+ G V +L A+ HCV
Sbjct: 699 FLSECBIVFGT--NLKXLGAGEFVGPFAIPLPVGSVLVLNGNGADVAKHCV 747
>gi|119638440|gb|ABL85031.1| oxidoreductase [Brachypodium sylvaticum]
Length = 598
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 13/177 (7%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DPIP + ++I +L + ++PP+ + +S ++N Y PG CI HID H F RP ++S
Sbjct: 278 DPIPDLFK-VMIKRLVRWCILPPDCVPDSCIVNIYDPGDCIPPHID-SHDFVRPFCTVSF 335
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
S+ + FG +P+ G V ++ A+ HCV +R I
Sbjct: 336 LSECNILFGSTLKIAGPGEFTGSFAIPLPVGSVLVINGNGADVAKHCV--PAVPAKRISI 393
Query: 144 LLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARESGG--YFRSSIETYNNNENHSNKN 198
R++ P P L D+ E+ RSS E + NKN
Sbjct: 394 TFRKMDPAKRPFGFKDDP------ELLDITPMEAASEETSRSSDEGKGKQLDVQNKN 444
>gi|343172812|gb|AEL99109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein,
partial [Silene latifolia]
gi|343172814|gb|AEL99110.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein,
partial [Silene latifolia]
Length = 261
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 65/128 (50%), Gaps = 7/128 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + + I+++ + V+PP + NS ++N Y G CI HID H F RP ++
Sbjct: 118 EVDPLPPVFKQM-ISRMVKWHVLPPTRVPNSCIVNIYDEGDCIPPHID-HHDFLRPFCTV 175
Query: 82 SLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
S ++ + FG + P S PV +P+ G V +L A+ HC+ +R
Sbjct: 176 SFLTECNIMFGTNLKVEGPGEFSGPVT-IPLPVGSVLVLNGNGADVAKHCI--PAVPAKR 232
Query: 141 AVILLRRV 148
+ RR+
Sbjct: 233 ISVTFRRM 240
>gi|297798302|ref|XP_002867035.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312871|gb|EFH43294.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 490
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/178 (29%), Positives = 83/178 (46%), Gaps = 8/178 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP+ + ++I +L V+PP I +S ++N Y G CI HID H F RP ++S
Sbjct: 266 VDPIPSLFK-VIIKRLVGWHVLPPTCIPDSCIVNIYDEGDCIPPHIDN-HDFVRPFCTVS 323
Query: 83 LFSDSALCFGCKFN-FKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S+ + FG P S P +P+ G V +L+ A+ HCV T +R
Sbjct: 324 FLSECNILFGSNLKVVGPGEFSGP-YSMPLPVGSVLVLKGNGADVAKHCVPAVPT--KRI 380
Query: 142 VILLRRVLPHAPRLTLSQTPRVKPYHSL-YDVQARESGGYFRSSIETYNNNENHSNKN 198
I R++ + + P ++ L Y+ + + S +N++N SN N
Sbjct: 381 SITFRKMDESKRPVGFTPEPDLQGIKPLPYEQTTQSTPAAAAISSSRSSNDQNGSNHN 438
>gi|357118936|ref|XP_003561203.1| PREDICTED: uncharacterized protein LOC100826719 [Brachypodium
distachyon]
Length = 594
Score = 57.4 bits (137), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 8/171 (4%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DPIP + ++I +L + ++PP+ + +S ++N Y PG CI HID H F RP ++S
Sbjct: 274 DPIPDLFK-VMIKRLVRWCILPPDCVPDSCIVNIYDPGDCIPPHID-SHDFVRPFCTVSF 331
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
S+ + FG +P+ G V ++ A+ HCV +R I
Sbjct: 332 LSECNILFGSTLKIAGPGEFTGSFAIPLPAGSVLVINGNGADVAKHCV--PAVPAKRISI 389
Query: 144 LLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQARESGGYFRSSIETYNNN 191
R++ P P + P + + +R S ++ N N
Sbjct: 390 TFRKMDPAKRPFGFKDDPELLNITPMEAASEETSRSSDEGKGKQLDVQNKN 440
>gi|302757365|ref|XP_002962106.1| hypothetical protein SELMODRAFT_64626 [Selaginella moellendorffii]
gi|300170765|gb|EFJ37366.1| hypothetical protein SELMODRAFT_64626 [Selaginella moellendorffii]
Length = 289
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 62/117 (52%), Gaps = 17/117 (14%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P ++S I +L + V+PP + +S ++N Y+ G CI HID H F RP ++
Sbjct: 149 EVDPLPPLLKS-TIKRLVRWHVLPPTCVPDSCIVNIYETGDCIPPHID-HHDFLRPFCTV 206
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLY-------LPVQRGCVTLLRDFAANGITHCV 131
SL S ++ FG + V+ P + LPV G V +L D A+ HC+
Sbjct: 207 SLLSQCSIVFGSSLS-----VAAPGEFDGSFSTQLPV--GSVLVLNDNGADVAKHCI 256
>gi|357445515|ref|XP_003593035.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|124360268|gb|ABN08281.1| 2OG-Fe(II) oxygenase [Medicago truncatula]
gi|355482083|gb|AES63286.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 391
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 60/114 (52%), Gaps = 11/114 (9%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + + I ++ + ++PP + +S ++N Y G CI HID H F RP S+
Sbjct: 226 EVDPLPRMFKQM-IKRMVRWKIIPPTCVPDSCIVNIYDVGDCIPPHID-HHDFVRPFYSV 283
Query: 82 SLFSDSALCFGCKFNFKPIRVSE----PVLYLPVQRGCVTLLRDFAANGITHCV 131
S + + + FG N K +R E + LPV G V +L+ A+ HC+
Sbjct: 284 SFLNKAKILFGS--NLKAVRPGEFFGPASISLPV--GSVFVLKGNGADIAKHCI 333
>gi|225431153|ref|XP_002268446.1| PREDICTED: uncharacterized protein LOC100250563 [Vitis vinifera]
Length = 510
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 7/111 (6%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP + ++I +L + V+PP+ + +S ++N Y+ G CI HID H F RP ++S
Sbjct: 302 VDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIYEEGDCIPPHID-NHDFVRPFCTVS 359
Query: 83 LFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCVTLLRDFAANGITHCV 131
S+ + FG N K + E V +P+ G V +L A+ HCV
Sbjct: 360 FLSECDIVFGT--NLKILGAGEFVGPFAIPLPVGSVLVLNGNGADVAKHCV 408
>gi|297735021|emb|CBI17383.3| unnamed protein product [Vitis vinifera]
Length = 547
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 60/111 (54%), Gaps = 7/111 (6%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP + ++I +L + V+PP+ + +S ++N Y+ G CI HID H F RP ++S
Sbjct: 339 VDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIYEEGDCIPPHID-NHDFVRPFCTVS 396
Query: 83 LFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCVTLLRDFAANGITHCV 131
S+ + FG N K + E V +P+ G V +L A+ HCV
Sbjct: 397 FLSECDIVFGT--NLKILGAGEFVGPFAIPLPVGSVLVLNGNGADVAKHCV 445
>gi|145328752|ref|NP_001077911.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|334184285|ref|NP_001189546.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330251613|gb|AEC06707.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330251614|gb|AEC06708.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 433
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 62/127 (48%), Gaps = 5/127 (3%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + ++I KL + V+PP + +S ++N Y G CI HID H F RP ++
Sbjct: 217 EVDPLPHLFK-VIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLRPFCTI 274
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S S+ + FG + +P+ G V +L A+ HCV T +R
Sbjct: 275 SFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT--KRI 332
Query: 142 VILLRRV 148
I R++
Sbjct: 333 SITFRKM 339
>gi|15227938|ref|NP_179387.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|4406821|gb|AAD20129.1| hypothetical protein [Arabidopsis thaliana]
gi|34365749|gb|AAQ65186.1| At2g17970 [Arabidopsis thaliana]
gi|51970124|dbj|BAD43754.1| hypothetical protein [Arabidopsis thaliana]
gi|330251612|gb|AEC06706.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 507
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 62/127 (48%), Gaps = 5/127 (3%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + ++I KL + V+PP + +S ++N Y G CI HID H F RP ++
Sbjct: 291 EVDPLPHLFK-VIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLRPFCTI 348
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S S+ + FG + +P+ G V +L A+ HCV T +R
Sbjct: 349 SFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT--KRI 406
Query: 142 VILLRRV 148
I R++
Sbjct: 407 SITFRKM 413
>gi|357445523|ref|XP_003593039.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|124360274|gb|ABN08287.1| 2OG-Fe(II) oxygenase [Medicago truncatula]
gi|355482087|gb|AES63290.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 497
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 65/129 (50%), Gaps = 9/129 (6%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + + I ++ + ++PP + +S ++N Y G CI HID H F RP S+
Sbjct: 285 EVDPLPDVFKQM-IKRMVRWNIIPPTCVPDSCIVNIYDVGDCIPPHID-HHDFVRPFYSV 342
Query: 82 SLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCVRPQDTQYR 139
S +++ + FG +P S P + LP+ G V +L A+ HC+ +
Sbjct: 343 SFLNEAKILFGSNLKEIQPGEFSGPASISLPL--GSVFVLNGNGADIAKHCI--PSVSSK 398
Query: 140 RAVILLRRV 148
R I R++
Sbjct: 399 RISITFRKM 407
>gi|255579797|ref|XP_002530736.1| conserved hypothetical protein [Ricinus communis]
gi|223529700|gb|EEF31642.1| conserved hypothetical protein [Ricinus communis]
Length = 511
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 63/126 (50%), Gaps = 5/126 (3%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP + ++I +L + V+PP + +S ++N Y G CI HID H F+RP ++S
Sbjct: 308 VDPIPHLFK-VIIRRLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFNRPFCTVS 365
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
S+ + FG + +P+ G V +L AA+ HCV T +R
Sbjct: 366 FVSECNIVFGSNLKVLGPGEFSGSIAIPLPVGSVLVLDGNAADVSKHCVPSVPT--KRIS 423
Query: 143 ILLRRV 148
I R++
Sbjct: 424 ITFRKM 429
>gi|297836514|ref|XP_002886139.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297331979|gb|EFH62398.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 62/127 (48%), Gaps = 5/127 (3%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + ++I +L + V+PP + +S ++N Y CI HID H F RP ++
Sbjct: 286 EVDPLPPLFK-VIIRRLIKWHVLPPTCVPDSCIVNIYDESDCIPPHID-NHDFLRPFCTI 343
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S S+ + FG N + +P+ G V +L A+ HCV T +R
Sbjct: 344 SFLSECDILFGSNLNVEGPGEFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT--KRI 401
Query: 142 VILLRRV 148
I R++
Sbjct: 402 SITFRKM 408
>gi|298204538|emb|CBI23813.3| unnamed protein product [Vitis vinifera]
Length = 439
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 7/114 (6%)
Query: 21 EGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPII 79
E EVDP+P + + I ++ + ++PP + NS ++N Y G CI HID H F RP
Sbjct: 234 EEEVDPLPPLFKQM-IKRMVRWHILPPTCVPNSCIVNIYDEGDCIPPHID-HHDFLRPFC 291
Query: 80 SLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCV 131
++S ++ + FG S PV + LP +G V +L A+ HCV
Sbjct: 292 TVSFLTECNILFGSSLKILDAGEFSGPVSISLP--KGSVLILNGNGADVAKHCV 343
>gi|225447996|ref|XP_002270229.1| PREDICTED: uncharacterized protein LOC100256702 [Vitis vinifera]
Length = 457
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 58/114 (50%), Gaps = 7/114 (6%)
Query: 21 EGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPII 79
E EVDP+P + + I ++ + ++PP + NS ++N Y G CI HID H F RP
Sbjct: 252 EEEVDPLPPLFKQM-IKRMVRWHILPPTCVPNSCIVNIYDEGDCIPPHID-HHDFLRPFC 309
Query: 80 SLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCV 131
++S ++ + FG S PV + LP +G V +L A+ HCV
Sbjct: 310 TVSFLTECNILFGSSLKILDAGEFSGPVSISLP--KGSVLILNGNGADVAKHCV 361
>gi|302775126|ref|XP_002970980.1| hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]
gi|300160962|gb|EFJ27578.1| hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]
Length = 307
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/117 (34%), Positives = 61/117 (52%), Gaps = 17/117 (14%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P ++S I +L + V+ P + +S ++N Y+ G CI HID H F RP ++
Sbjct: 163 EVDPLPPLLKS-TIKRLVRWHVLSPTCVPDSCIVNIYEIGDCIPPHID-HHDFLRPFCTV 220
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLY-------LPVQRGCVTLLRDFAANGITHCV 131
SL S ++ FG + V+ P + LPV G V +L D A+ HC+
Sbjct: 221 SLLSQCSIVFGSSLS-----VAAPGEFDGSFSTQLPV--GSVLVLNDNGADVAKHCI 270
>gi|224134110|ref|XP_002327758.1| predicted protein [Populus trichocarpa]
gi|222836843|gb|EEE75236.1| predicted protein [Populus trichocarpa]
Length = 213
Score = 53.5 bits (127), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 61/126 (48%), Gaps = 5/126 (3%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDPIP + ++I +L Q V+PP + +S ++N Y+ CI HID H F RP ++S
Sbjct: 2 VDPIPDLFK-VIIRRLVQWHVIPPTCVPDSCIVNIYEEWDCIPPHID-SHDFVRPFCTVS 59
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
S + FG + + +P+ G V +L A+ HCV T +R
Sbjct: 60 FLSQCNILFGSNLKVVGAGDFDGPIAIPLPVGSVLVLNGNGADVAKHCVPSVPT--KRIS 117
Query: 143 ILLRRV 148
I R++
Sbjct: 118 ITFRKM 123
>gi|413953301|gb|AFW85950.1| hypothetical protein ZEAMMB73_605385 [Zea mays]
Length = 614
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 15/132 (11%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DP+P +++ I +L + V+P + +S ++N Y+PG CI HID H F RP ++S
Sbjct: 281 DPMPDLFKTM-IKRLVRWQVLPATCVPDSCIVNIYEPGDCIPPHID-SHDFVRPFCTVSF 338
Query: 84 FSDSALCFGCKFNFKPIRVSEP-----VLYLPVQRGCVTLLRDFAANGITHCVRPQDTQY 138
S+ + FG ++V+ P + +P+ G V +L A+ HCV T
Sbjct: 339 LSECNILFGPN-----LKVAAPGEFIGSIAIPLPVGSVLVLNGNGADVAKHCVPAVPT-- 391
Query: 139 RRAVILLRRVLP 150
+R I R++ P
Sbjct: 392 KRISITFRKMDP 403
>gi|440792230|gb|ELR13458.1| 2OG-Fe(II) oxygenase family oxidoreductase [Acanthamoeba
castellanii str. Neff]
Length = 214
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 9/100 (9%)
Query: 51 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 110
+ +IN+Y PG I +H+D P +FD I S+SL S + +F K + + PVL
Sbjct: 110 DQVIINEYLPGQGISAHVDKPSLFDNEIASISLGSTCVM----EFKHKATKTTHPVL--- 162
Query: 111 VQRGCVTLLRDFAANGITHCV--RPQDTQYRRAVILLRRV 148
+ R + L++ A THC+ R +D R + RRV
Sbjct: 163 LGRRSLVLMKGAARYEWTHCIPARKKDKVDGRQIPRQRRV 202
>gi|297847176|ref|XP_002891469.1| hypothetical protein ARALYDRAFT_314327 [Arabidopsis lyrata subsp.
lyrata]
gi|297337311|gb|EFH67728.1| hypothetical protein ARALYDRAFT_314327 [Arabidopsis lyrata subsp.
lyrata]
Length = 335
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 61/127 (48%), Gaps = 6/127 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 175 VDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-NHDFLRPFCTVS 232
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLY-LPVQRGCVTLLRDFAANGITHCVRPQDTQYRRA 141
S+ + FG K Y LP+ G V +L A+ HCV + +R
Sbjct: 233 FLSECNILFGSNLKVKETGEYTGGSYSLPLPVGSVLVLNGNGADVAKHCV--PEVPTKRI 290
Query: 142 VILLRRV 148
I R++
Sbjct: 291 SITFRKM 297
>gi|242074876|ref|XP_002447374.1| hypothetical protein SORBIDRAFT_06g033940 [Sorghum bicolor]
gi|241938557|gb|EES11702.1| hypothetical protein SORBIDRAFT_06g033940 [Sorghum bicolor]
Length = 345
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 60/112 (53%), Gaps = 7/112 (6%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
E DP+P ++S+V +L V+PP+ + +S ++N Y G CI H+D H F RP ++
Sbjct: 113 EADPLPPPLKSMV-RRLVSWRVLPPDCVPDSCIVNVYDVGDCIPPHVD-HHDFARPFCTV 170
Query: 82 SLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCV 131
S ++ A+ FG + P + PV + LP G V +L A+ HCV
Sbjct: 171 SFLAECAILFGRELRVVGPGEFAGPVSISLPP--GSVLVLAGNGADVAKHCV 220
>gi|148906930|gb|ABR16610.1| unknown [Picea sitchensis]
Length = 496
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 3/110 (2%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P ++ I ++ + V+PP+ I +S ++N Y G CI HID H F RP ++
Sbjct: 290 EVDPLPPLFKA-AIRRMVRWHVLPPSCIPDSCIVNIYDEGDCIPPHID-HHDFVRPFCTV 347
Query: 82 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 131
SL S+ + FG +P+ G V +L A+ H V
Sbjct: 348 SLLSECNIIFGSNLKILGPGEFAGSTAIPLPMGSVLILNGNGADVAKHSV 397
>gi|79319564|ref|NP_001031159.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|52354159|gb|AAU44400.1| hypothetical protein AT1G48980 [Arabidopsis thaliana]
gi|332194254|gb|AEE32375.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 327
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 16/132 (12%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 170 VDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVS 227
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLY------LPVQRGCVTLLRDFAANGITHCVRPQDT 136
S+ + FG ++V E Y LP+ G V +L A+ HCV T
Sbjct: 228 FLSECNILFGSN-----LKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT 282
Query: 137 QYRRAVILLRRV 148
+R I R++
Sbjct: 283 --KRISITFRKM 292
>gi|79361742|ref|NP_175328.2| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|52354161|gb|AAU44401.1| hypothetical protein AT1G48980 [Arabidopsis thaliana]
gi|60547627|gb|AAX23777.1| hypothetical protein At1g48980 [Arabidopsis thaliana]
gi|332194253|gb|AEE32374.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 331
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 16/132 (12%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 174 VDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVS 231
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLY------LPVQRGCVTLLRDFAANGITHCVRPQDT 136
S+ + FG ++V E Y LP+ G V +L A+ HCV T
Sbjct: 232 FLSECNILFGSN-----LKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT 286
Query: 137 QYRRAVILLRRV 148
+R I R++
Sbjct: 287 --KRISITFRKM 296
>gi|186489643|ref|NP_001117455.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|332194255|gb|AEE32376.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 325
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 174 VDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVS 231
Query: 83 LFSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCV 131
S+ + FG + S LP+ G V +L A+ HCV
Sbjct: 232 FLSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCV 281
>gi|356525956|ref|XP_003531587.1| PREDICTED: uncharacterized protein LOC100306211 [Glycine max]
Length = 423
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 63/128 (49%), Gaps = 7/128 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + + I ++ + ++P I +S ++N Y+ G CI HID H F RP ++
Sbjct: 251 EVDPLPPVFKQM-IKRMVRWNIIPSTCIPDSCIVNIYEEGDCIPPHID-HHDFVRPFYTV 308
Query: 82 SLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
S ++ + FG P + PV +P+ G V +L A+ HC+ +R
Sbjct: 309 SFLNECKILFGSNLQVVCPGEFAGPV-SIPLPVGSVFVLNGNGADIAKHCI--PSVSSKR 365
Query: 141 AVILLRRV 148
I R++
Sbjct: 366 ISITFRKM 373
>gi|313240624|emb|CBY32949.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 28/45 (62%)
Query: 111 VQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+QRG +T N I HCVRP+DT+ RR I+LRRV + P L
Sbjct: 1 MQRGIITSFEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 45
>gi|440789981|gb|ELR11271.1| DNA repair protein alkB, putative [Acanthamoeba castellanii str.
Neff]
Length = 253
Score = 50.1 bits (118), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 20/134 (14%)
Query: 27 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 86
+P + Q +I K+ Q G+ P + + +IN+Y PG I HID H F ++SLSL S
Sbjct: 127 LPVFAQQ-IIEKMRQRGL--PQF-DQLIINEYTPGQGINPHIDKTHCFGPCVVSLSLLST 182
Query: 87 SALCFGCKFNFKPIRVSE--PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVIL 144
C F + E PV+ P + +LR A G H + P +RA I+
Sbjct: 183 ------CVMTFTSLETGEKIPVVLRPR---SLVVLRGQARYGWQHGIEP-----KRADIV 228
Query: 145 LRRVLPHAPRLTLS 158
+ P A R++L+
Sbjct: 229 AGKHTPRARRVSLT 242
>gi|395836690|ref|XP_003791285.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
[Otolemur garnettii]
Length = 174
Score = 50.1 bits (118), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/38 (57%), Positives = 28/38 (73%)
Query: 118 LLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
L +AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 35 LFSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 72
>gi|345305229|ref|XP_001510391.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH5-like [Ornithorhynchus anatinus]
Length = 150
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 27/34 (79%)
Query: 122 FAANGITHCVRPQDTQYRRAVILLRRVLPHAPRL 155
+AA+ ITHC+RPQD + RRAVI+LR+ APRL
Sbjct: 13 YAADEITHCIRPQDIRERRAVIILRKTRLDAPRL 46
>gi|328772232|gb|EGF82271.1| hypothetical protein BATDEDRAFT_87049 [Batrachochytrium
dendrobatidis JAM81]
Length = 225
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 19/137 (13%)
Query: 27 IPAWIQSLVITK--LEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+PA+ S VI + L + V P + N ++N+YQPG I+ H+D F + SLSL+
Sbjct: 95 LPAF-SSFVIDRMLLPEFNVFPNDPPNHVLVNEYQPGQGIMPHVDSQDTFGDVVTSLSLW 153
Query: 85 SDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQY------ 138
S + FG K + + + P R + +L A TH + +D +
Sbjct: 154 SSCVMSFGNKMTGEKVHLELP-------RRSLLILTGDARTHYTHAIPKEDMLFAGNECV 206
Query: 139 ---RRAVILLRRVLPHA 152
RR + +R +L A
Sbjct: 207 DRGRRVSLTIRSILKSA 223
>gi|297793433|ref|XP_002864601.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310436|gb|EFH40860.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 302
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 63/128 (49%), Gaps = 7/128 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
EVDP+P + ++I +L + V+PP + +S ++N Y G CI HID H F RP ++
Sbjct: 154 EVDPLPPLFK-VMIRRLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLRPFCTI 211
Query: 82 SLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRR 140
S S+ + FG + P S P P+ G V +L A+ H V T +R
Sbjct: 212 SFLSECDILFGSNLKTEGPGEFSGP-YSTPLPVGSVLVLNGNGADVAKHYVPAVPT--KR 268
Query: 141 AVILLRRV 148
I R++
Sbjct: 269 ISITFRKM 276
>gi|397614206|gb|EJK62658.1| hypothetical protein THAOC_16720 [Thalassiosira oceanica]
Length = 396
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 13/106 (12%)
Query: 51 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN------FKPIRVSE 104
+S +IN Y G CI H+D ++RPI +LSL + A+ G KF +KPI V +
Sbjct: 208 DSCIINQYGEGDCIPPHVDH-ESYERPISTLSLLGEEAMLLGTKFKTVKNCTWKPI-VGQ 265
Query: 105 PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLP 150
VL LP R + +L + N HC+ RR I LR+ P
Sbjct: 266 SVL-LP--RRSLLVLGGNSGNISKHCI--SACTRRRISITLRKQPP 306
>gi|42567443|ref|NP_195332.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661215|gb|AEE86615.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 385
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 3/72 (4%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 80
G+VDP+P+ I ++I +L V+PP + +S ++N Y+ CI HID H F RP +
Sbjct: 294 GDVDPMPS-IFKVIIKRLVGWHVLPPTCVPDSCIVNIYEEDDCIPPHID-NHDFLRPFCT 351
Query: 81 LSLFSDSALCFG 92
+S S+ + FG
Sbjct: 352 VSFLSECNILFG 363
>gi|3036813|emb|CAA18503.1| hypothetical protein [Arabidopsis thaliana]
gi|7270561|emb|CAB81518.1| hypothetical protein [Arabidopsis thaliana]
Length = 505
Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 80/189 (42%), Gaps = 30/189 (15%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
G+VDP+P+ + + I S ++N Y+ CI HID H F RP ++
Sbjct: 294 GDVDPMPSIFKVI---------------IKSCIVNIYEEDDCIPPHID-NHDFLRPFCTV 337
Query: 82 SLFSDSALCFGCKFNFKPIRVSE--PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYR 139
S S+ + FG N K + E +P+ G V +L+ A+ HCV T +
Sbjct: 338 SFLSECNILFGS--NLKVLGPGEFSGSYSIPLPVGSVLVLKGNGADVAKHCVPAVPT--K 393
Query: 140 RAVILLRRVLPHAPRLTLSQTPRVK-----PY-HSLYDVQARESGGYFRSSIETYNNNEN 193
R I R++ + + P ++ PY H+ A RSSI+ N N
Sbjct: 394 RISITFRKMDESKRPVGFTPEPDLEEIKPLPYEHTTPSTPADAVISSSRSSID--QNGSN 451
Query: 194 HSNKNTSNG 202
H+N+ G
Sbjct: 452 HNNRTAHGG 460
>gi|71425222|ref|XP_813050.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877899|gb|EAN91199.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 305
Score = 47.4 bits (111), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 77/172 (44%), Gaps = 13/172 (7%)
Query: 35 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 94
+I+++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S++ L F
Sbjct: 85 LISRIEQDGIVPDGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHV 144
Query: 95 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPR 154
N + + P G + ++ A H V P + Q R +++ R +L
Sbjct: 145 QNGEELDAVVP-------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGG 196
Query: 155 LTLSQTP--RVKPY---HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSN 201
TP + PY H L + A++ GG + + Y E +T+
Sbjct: 197 FRPVTTPLGDLMPYRSTHILNTLYAKQIGGVRVTVDDNYLEKEGIGAFDTAK 248
>gi|407408875|gb|EKF32135.1| hypothetical protein MOQ_004021 [Trypanosoma cruzi marinkellei]
Length = 305
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 77/172 (44%), Gaps = 13/172 (7%)
Query: 35 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 94
+I+++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S++ L F
Sbjct: 85 LISRIEQDGIVPDGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHV 144
Query: 95 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPR 154
N + + P G + ++ A H V P + Q R +++ R +L
Sbjct: 145 QNGEELDAVVP-------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGG 196
Query: 155 LTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTSN 201
TP + PY S L + A++ GG + + Y E +T+
Sbjct: 197 FRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRVTVDDNYLEKEEIGAFDTAK 248
>gi|7770341|gb|AAF69711.1|AC016041_16 F27J15.23 [Arabidopsis thaliana]
Length = 264
Score = 46.6 bits (109), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 3/70 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 174 VDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVS 231
Query: 83 LFSDSALCFG 92
S+ + FG
Sbjct: 232 FLSECNILFG 241
>gi|407843604|gb|EKG01501.1| hypothetical protein TCSYLVIO_007502 [Trypanosoma cruzi]
Length = 354
Score = 46.6 bits (109), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 77/172 (44%), Gaps = 13/172 (7%)
Query: 35 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 94
+I+++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S++ L F
Sbjct: 134 LISRIEQDGIVPDGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHV 193
Query: 95 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPR 154
N + + P G + ++ A H V P + Q R +++ R +L
Sbjct: 194 QNGEELDAVVP-------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGG 245
Query: 155 LTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTSN 201
TP + PY S L + A++ GG + + Y E +T+
Sbjct: 246 FRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRVTVDDNYLEKEGIGAFDTAK 297
>gi|115345714|ref|YP_771891.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
gi|115293033|gb|ABI93483.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
Length = 195
Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 13/120 (10%)
Query: 19 YPEGEVDPIPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 76
+ E + P+P QSL +T VP + ++N+YQPG I +HID F
Sbjct: 64 WREDYLGPLPELFQSLAERLTAEGHFQTVP----DQVIVNEYQPGQGISAHIDCQPCFGE 119
Query: 77 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 136
I SLSL S + F R+ + L +Q + +L+ A + TH + P+ T
Sbjct: 120 TIASLSLLSACVMRFAS-------RIYSQQMELHLQPSSLLVLQSDARHLWTHAIPPRKT 172
>gi|219883161|ref|YP_002478323.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
gi|219867286|gb|ACL47624.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
Length = 204
Score = 46.6 bits (109), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 2/72 (2%)
Query: 20 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 79
PE + +P W++ L +L Q G +P + + ++N+Y PG I +HID F I+
Sbjct: 75 PEAYLGTLPEWLKPLT-NRLWQEGYIP-DLPDQVIVNEYIPGQGITAHIDCIDCFSDTIL 132
Query: 80 SLSLFSDSALCF 91
SLSL SD + F
Sbjct: 133 SLSLGSDCIMRF 144
>gi|343474281|emb|CCD14041.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 305
Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 13/180 (7%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 85
P+P + ++ ++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S
Sbjct: 76 PLPTGSFTPLLDRIEQDGIVPRGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAVVSLGS 135
Query: 86 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILL 145
++ L F N + + V P G + ++ + H V P + Q R +V+
Sbjct: 136 NALLRFVHVQNGEELDVVIP-------DGSLYIMSGPSRYVYFHMVLPVEAQ-RVSVVFR 187
Query: 146 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 200
R +L TP + PY S L + A++ GG ++Y E +T+
Sbjct: 188 RSILNSDGGFRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRIVMDDSYLEKEEIGAFDTA 247
>gi|342180903|emb|CCC90378.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 305
Score = 45.8 bits (107), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 13/180 (7%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 85
P+P + ++ ++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S
Sbjct: 76 PLPTGSFTPLLGRIEQDGIVPRGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAVVSLGS 135
Query: 86 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILL 145
++ L F N + + V P G + ++ + H V P + Q R +V+
Sbjct: 136 NALLRFVHVQNGEELDVVIP-------DGSLYIMSGPSRYVYFHMVLPVEAQ-RVSVVFR 187
Query: 146 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 200
R +L TP + PY S L + A++ GG ++Y E +T+
Sbjct: 188 RSILNSDGGFRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRIVMDDSYLEKEEIGAFDTA 247
>gi|428311031|ref|YP_007122008.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
gi|428252643|gb|AFZ18602.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
Length = 206
Score = 45.8 bits (107), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 64/150 (42%), Gaps = 24/150 (16%)
Query: 1 GYTY----GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIN 56
GY Y GS + LGN +P W Q++ + E +V N ++N
Sbjct: 61 GYKYDYKDGSFVASTHLGN-----------LPDWAQNVAVRLAEDGFMV--NVPEQVIVN 107
Query: 57 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 116
+YQPG IVSH D F II+LSL S+ C NF + + V L +Q G +
Sbjct: 108 EYQPGQGIVSHTDCIPCFGNTIITLSLGSE------CVMNFTHSQTQKEVGIL-LQAGSL 160
Query: 117 TLLRDFAANGITHCVRPQDTQYRRAVILLR 146
+ + A H + P+ + I +R
Sbjct: 161 LIFKGEARYIWKHGIVPRKRDNYKGRIFMR 190
>gi|71423866|ref|XP_812599.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877399|gb|EAN90748.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 305
Score = 45.4 bits (106), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 21/57 (36%), Positives = 36/57 (63%)
Query: 35 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 91
+I+++EQ G+VP ++N+ N Y+PG I +HID ++D +SL S++ L F
Sbjct: 85 LISRIEQDGIVPDGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRF 141
>gi|445456942|ref|ZP_21446201.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
baumannii OIFC047]
gi|444777446|gb|ELX01476.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
baumannii OIFC047]
Length = 200
Score = 45.1 bits (105), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 4/67 (5%)
Query: 26 PIPAWIQSLVITKLEQMGV-VPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
PIP+W+ + +++ + +PPN + +IN+Y+PG I HID P F II +SL
Sbjct: 76 PIPSWLIRMQKDLMDECSLDLPPNQL---IINEYEPGQGITDHIDAPDEFGETIIMVSLG 132
Query: 85 SDSALCF 91
S + F
Sbjct: 133 SSCVMDF 139
>gi|124002832|ref|ZP_01687684.1| alkylated DNA repair protein [Microscilla marina ATCC 23134]
gi|123992060|gb|EAY31447.1| alkylated DNA repair protein [Microscilla marina ATCC 23134]
Length = 192
Score = 45.1 bits (105), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 17/136 (12%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
+V +P W Q +V +++ + Y + +IN+Y PG I H+D FD I+S+S
Sbjct: 62 KVGDLPHWAQKIVQQAVDEQ--LLSEYFDQMIINEYLPGQGIARHVDCEPCFDHTIMSVS 119
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQY---- 138
L + + F + + + PVL P G LL A H +R ++
Sbjct: 120 LGTACVMHFN---SLEDKNLDVPVLLAP---GSAILLSGDARYRWQHSIRANKSEMFEGQ 173
Query: 139 -----RRAVILLRRVL 149
RR + R+V+
Sbjct: 174 KIVRGRRVSLTFRKVI 189
>gi|146100771|ref|XP_001468941.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398023105|ref|XP_003864714.1| hypothetical protein, conserved [Leishmania donovani]
gi|134073310|emb|CAM72036.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322502950|emb|CBZ38034.1| hypothetical protein, conserved [Leishmania donovani]
Length = 318
Score = 44.3 bits (103), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 83/183 (45%), Gaps = 13/183 (7%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
++ P+P ++ +LEQ VVP N++N+ N Y+PG I +HID ++D
Sbjct: 76 QLPPLPPASFMPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------D 129
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
+F+ +L C F ++ E + + R V ++ A H V P + Q R ++
Sbjct: 130 IFAICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSL 187
Query: 143 ILLRRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNK 197
+ R ++ +TP + PY + L + +++ GG S + + + N
Sbjct: 188 VFRRSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESANIGAF 247
Query: 198 NTS 200
+TS
Sbjct: 248 DTS 250
>gi|308810579|ref|XP_003082598.1| unnamed protein product [Ostreococcus tauri]
gi|116061067|emb|CAL56455.1| unnamed protein product [Ostreococcus tauri]
Length = 544
Score = 44.3 bits (103), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY-QPGGCIVSHIDPPHIFDRPIISLS 82
DP+ A + L T L++ GV P+++ N V+N Y + G +++H D H+F+ PI +
Sbjct: 404 DPLAAPLHKLAET-LQKRGVFAPDFVPNQYVLNIYGRAGAYLMAHKDALHLFEGPIYGVR 462
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 131
LF+ L F + + I V ++ + G VT + FA + H +
Sbjct: 463 LFNPRILSFAPDGHMR-INVDRGMIDVMQSVGSVTSMDGFAKEDVQHSI 510
>gi|297828529|ref|XP_002882147.1| hypothetical protein ARALYDRAFT_322448 [Arabidopsis lyrata subsp.
lyrata]
gi|297327986|gb|EFH58406.1| hypothetical protein ARALYDRAFT_322448 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 44.3 bits (103), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 26/141 (18%)
Query: 21 EGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID---------- 69
E V+PIP +QS VI L Q V+P +Y+ P GC+++ D
Sbjct: 239 EHSVEPIPTLVQS-VIDHLLQWRVIP----------EYKRPNGCVINFFDEDEHSQPFQK 287
Query: 70 PPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITH 129
PPH+ D+PI +L L S+S + FG + L LP++ G + ++R +A+ H
Sbjct: 288 PPHV-DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARH 345
Query: 130 CVRPQDTQYRRAVILLRRVLP 150
+ P + +R I ++ P
Sbjct: 346 VMCP--SPNKRVAITFFKLKP 364
>gi|401429284|ref|XP_003879124.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322495374|emb|CBZ30678.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 318
Score = 44.3 bits (103), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 74/159 (46%), Gaps = 13/159 (8%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 85
P+P+ ++ +LEQ VVP N++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPSASFLPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 86 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILL 145
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 146 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGG 179
R ++ +TP + PY + L + +++ GG
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGG 229
>gi|30690892|ref|NP_182329.2| oxidoreductase, 2OG-Fe(II) oxygenase-like protein [Arabidopsis
thaliana]
gi|330255838|gb|AEC10932.1| oxidoreductase, 2OG-Fe(II) oxygenase-like protein [Arabidopsis
thaliana]
Length = 438
Score = 43.9 bits (102), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 26/141 (18%)
Query: 21 EGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID---------- 69
E V+PIP +QS VI L Q ++P +Y+ P GC+++ D
Sbjct: 232 EHSVEPIPTLVQS-VIDHLLQWRLIP----------EYKRPNGCVINFFDEDEHSQPFQK 280
Query: 70 PPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITH 129
PPH+ D+PI +L L S+S + FG + L LP++ G + ++R +A+ H
Sbjct: 281 PPHV-DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARH 338
Query: 130 CVRPQDTQYRRAVILLRRVLP 150
+ P + +R I ++ P
Sbjct: 339 VMCP--SPNKRVAITFFKLKP 357
>gi|154344577|ref|XP_001568230.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065567|emb|CAM43337.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 318
Score = 43.5 bits (101), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 73/159 (45%), Gaps = 13/159 (8%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 85
P+P + ++ +LEQ VVP N++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPPGSFTPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 86 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILL 145
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 VCSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 146 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGG 179
R ++ TP V PY + L + +++ GG
Sbjct: 191 RSIMESDGGFRPVTTPLKEVMPYRATQILNALYSKQVGG 229
>gi|4249414|gb|AAD13711.1| unknown protein [Arabidopsis thaliana]
Length = 433
Score = 43.5 bits (101), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 26/141 (18%)
Query: 21 EGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID---------- 69
E V+PIP +QS VI L Q ++P +Y+ P GC+++ D
Sbjct: 227 EHSVEPIPTLVQS-VIDHLLQWRLIP----------EYKRPNGCVINFFDEDEHSQPFQK 275
Query: 70 PPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITH 129
PPH+ D+PI +L L S+S + FG + L LP++ G + ++R +A+ H
Sbjct: 276 PPHV-DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARH 333
Query: 130 CVRPQDTQYRRAVILLRRVLP 150
+ P + +R I ++ P
Sbjct: 334 VMCP--SPNKRVAITFFKLKP 352
>gi|414077137|ref|YP_006996455.1| alkylated DNA repair protein [Anabaena sp. 90]
gi|413970553|gb|AFW94642.1| alkylated DNA repair protein [Anabaena sp. 90]
Length = 226
Score = 43.5 bits (101), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 36/74 (48%), Gaps = 9/74 (12%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
+ P+P W +L +L + V P + ++NDYQPG I SH D F IISLS
Sbjct: 98 IAPLPDWALTLA-QRLHK--VFSPTVPDQVIVNDYQPGQGISSHTDCVSCFTDVIISLS- 153
Query: 84 FSDSALCFGCKFNF 97
LC C +F
Sbjct: 154 -----LCSSCVMDF 162
>gi|356502827|ref|XP_003520217.1| PREDICTED: uncharacterized protein LOC100791586 [Glycine max]
Length = 514
Score = 43.5 bits (101), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 65/128 (50%), Gaps = 6/128 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
++PIP +Q VI L Q ++P N +IN ++ G + PPH+ D+P+ +L
Sbjct: 264 IEPIPVLLQG-VIDHLIQWQLLPEYKRPNGCIINFFEKGEFSQPFLKPPHL-DQPVSTL- 320
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
L S+SA+ FG + + L L +++G + ++R +A+ H + P + RR
Sbjct: 321 LLSESAMAFGRILMSENDGNYKGPLTLSLKQGSLLVMRGNSADMARHVMCP--SPNRRVS 378
Query: 143 ILLRRVLP 150
I RV P
Sbjct: 379 ITFFRVRP 386
>gi|357442687|ref|XP_003591621.1| hypothetical protein MTR_1g089860 [Medicago truncatula]
gi|355480669|gb|AES61872.1| hypothetical protein MTR_1g089860 [Medicago truncatula]
Length = 513
Score = 43.5 bits (101), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 73/148 (49%), Gaps = 7/148 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
++PIPA +Q VI Q ++P N +IN ++ G + PPH+ D+P+ +L
Sbjct: 265 IEPIPALLQR-VIDHFIQWHLLPEYKRPNGCIINFFEEGEFSQPFLKPPHL-DQPVTTL- 321
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
L S+S + FG + + L L +++G + ++R +A+ H + P + RR
Sbjct: 322 LLSESTMAFGRILMSENDGNYKGPLMLSLKKGSLLVMRGNSADMARHVMCP--SPNRRVS 379
Query: 143 ILLRRVLPHAPRLTLSQTPRVKPYHSLY 170
I RV P + + S TP + +++
Sbjct: 380 ITFFRVRPDSNQCQ-SPTPTMTSAMTMW 406
>gi|356536752|ref|XP_003536899.1| PREDICTED: uncharacterized protein LOC100802371 [Glycine max]
Length = 466
Score = 43.1 bits (100), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 65/128 (50%), Gaps = 6/128 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
++PIPA +Q VI L Q ++P N +IN ++ G + PPH+ D+P+ +L
Sbjct: 250 IEPIPALLQG-VIDHLIQWQLLPEYKRPNGCIINFFEEGEFSQPFLKPPHL-DQPVSTL- 306
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
L S+S + FG + + L L +++G + ++R +A+ H + P + RR
Sbjct: 307 LLSESTMAFGRILMSENDGNYKGPLTLSLKQGSLLVMRGNSADMARHVMCP--SPNRRVS 364
Query: 143 ILLRRVLP 150
I RV P
Sbjct: 365 ITFFRVRP 372
>gi|72388954|ref|XP_844772.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62176055|gb|AAX70176.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801306|gb|AAZ11213.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261328019|emb|CBH10996.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 305
Score = 43.1 bits (100), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 75/172 (43%), Gaps = 13/172 (7%)
Query: 35 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 94
++ +LEQ G+VP ++N+ N Y+P I +HID ++D +SL +++ L F
Sbjct: 85 LVERLEQDGIVPRGWLNNQTANLYEPEDFIRAHIDNLFVYDDIFAVISLGANALLRFVHV 144
Query: 95 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPR 154
N + + V P G V ++ + H V P + Q R +++ R +L
Sbjct: 145 QNGEELDVVIP-------DGSVYIMSGPSRYVYFHMVLPVEAQ-RVSIVFRRSILNSDGG 196
Query: 155 LTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTSN 201
TP + PY S L + A++ GG + + Y E +T
Sbjct: 197 FRPVSTPLGDLMPYRSTQILNTLYAKQIGGVRVAVDDKYLEKEEIGAFDTGK 248
>gi|341897299|gb|EGT53234.1| hypothetical protein CAEBREN_14010 [Caenorhabditis brenneri]
Length = 603
Score = 43.1 bits (100), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 4/70 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
E +PIP I+ L+ L + P+ I + V Y+PG I SH D FD PI+S+
Sbjct: 193 EAEPIPQIIEDLIDRLLSDKYITERPDQITANV---YEPGHGIPSHYDTHSAFDDPIVSV 249
Query: 82 SLFSDSALCF 91
SL SD + F
Sbjct: 250 SLLSDVVMEF 259
>gi|410897523|ref|XP_003962248.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH6-like [Takifugu rubripes]
Length = 234
Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 65/135 (48%), Gaps = 10/135 (7%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P+G + + IP W+Q K+ +G N ++N+Y+PG I+ H D P ++
Sbjct: 66 HPKGMLAEKIPDWLQKYC-EKISSLGAFGGKTANHVLVNEYKPGEGIMPHEDGP-LYYPT 123
Query: 78 IISLSLFSDSALCF----GCKFNFKPIRVSEPVLY-LPVQRGCVTLLRDFAANGITHCVR 132
I ++SL S + L F G P L+ L V+ + +L+D + H +R
Sbjct: 124 ITTISLGSHTLLDFYTPVGSMQGDAPQTEENRFLFSLLVKPRSLLILQDDMYQRLLHGIR 183
Query: 133 P--QDTQYRRAVILL 145
P QDT +AV LL
Sbjct: 184 PCDQDTLTEKAVNLL 198
>gi|11094804|gb|AAG29733.1|AC084414_1 hypothetical protein, 3' partial [Arabidopsis thaliana]
Length = 235
Score = 42.7 bits (99), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 3/64 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
VDP+P + ++I +L + V+PP + + V+N Y G CI HID H F RP ++S
Sbjct: 173 VDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVS 230
Query: 83 LFSD 86
S+
Sbjct: 231 FLSE 234
>gi|356570554|ref|XP_003553450.1| PREDICTED: uncharacterized protein LOC100776159 [Glycine max]
Length = 525
Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 63/128 (49%), Gaps = 6/128 (4%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
++PIPA + VI L Q ++P N +IN ++ G + PPH+ D+P+ +L
Sbjct: 279 IEPIPALLHD-VIDHLIQWKLIPEYKRPNGCIINFFEEGEFSQPFLKPPHL-DQPLSTL- 335
Query: 83 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAV 142
L S+S + FG + + L L ++ G + ++R +A+ H + P + RR
Sbjct: 336 LLSESTMAFGRILMSENDGNYKGPLMLSLKEGSLLVMRGNSADMARHVMCP--SPNRRVS 393
Query: 143 ILLRRVLP 150
I RV P
Sbjct: 394 ITFFRVRP 401
>gi|418296695|ref|ZP_12908538.1| hypothetical protein ATCR1_04209 [Agrobacterium tumefaciens
CCNWGS0286]
gi|355538870|gb|EHH08112.1| hypothetical protein ATCR1_04209 [Agrobacterium tumefaciens
CCNWGS0286]
Length = 187
Score = 42.4 bits (98), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 48/110 (43%), Gaps = 9/110 (8%)
Query: 20 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 79
P+ + P+P W + +L G N + + N+Y PG I +HID FD I+
Sbjct: 53 PDAYLGPLPQWF-GIFAERLVTEGYCR-NLPDQVIANEYLPGQGISAHIDCVPCFDDTIV 110
Query: 80 SLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITH 129
S+SL S C+ F +R L R V LLRD G TH
Sbjct: 111 SISLLS------ACEMVFHDVRGPAACGVLLQPRSGV-LLRDSGRYGWTH 153
>gi|308810811|ref|XP_003082714.1| unnamed protein product [Ostreococcus tauri]
gi|116061183|emb|CAL56571.1| unnamed protein product [Ostreococcus tauri]
Length = 597
Score = 42.4 bits (98), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 9/73 (12%)
Query: 20 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 79
P+ +D + A I VIT+ E+M +S IN Y+ G I HID P +F RP +
Sbjct: 93 PKALIDVVDALIARGVITEDERM--------DSCTINVYERGMWIPPHIDNP-LFARPFV 143
Query: 80 SLSLFSDSALCFG 92
++SL S A+ G
Sbjct: 144 TVSLASAQAMTLG 156
>gi|170079543|ref|YP_001736177.1| 2OG-Fe(II) oxygenase family oxidoreductase [Synechococcus sp. PCC
7002]
gi|169887212|gb|ACB00922.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Synechococcus sp. PCC
7002]
Length = 204
Score = 42.4 bits (98), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 6/72 (8%)
Query: 27 IPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+P W+ ++ + L + +P + A+IN+Y PG I SHID F I+SLSL
Sbjct: 81 LPDWLFPIIEQMVSLNLISEIP----DQAIINEYLPGQGITSHIDCKPCFTDTIVSLSLN 136
Query: 85 SDSALCFGCKFN 96
S + F C N
Sbjct: 137 SPCVMNFDCTVN 148
>gi|195120121|ref|XP_002004577.1| GI20008 [Drosophila mojavensis]
gi|193909645|gb|EDW08512.1| GI20008 [Drosophila mojavensis]
Length = 602
Score = 41.6 bits (96), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 59/130 (45%), Gaps = 18/130 (13%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGVVPPNYINSAVINDYQPGGCI 64
K G E LY V+P+ Q + + +LE PP+ + +N+Y+ G I
Sbjct: 170 KHFGYEFLYGSNNVNPLQPLEQGIPAACNFMWERLELPAFEPPDQLT---VNEYEAGQGI 226
Query: 65 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAA 124
H+D F PI+SLSL SD + F R + ++++ + R + ++ +
Sbjct: 227 PPHVDTHSAFKDPILSLSLQSDVVMDF---------RRGDQLVHVLLPRRSLLVMSGESR 277
Query: 125 NGITHCVRPQ 134
TH +RP+
Sbjct: 278 YDWTHGIRPK 287
>gi|443704913|gb|ELU01726.1| hypothetical protein CAPTEDRAFT_156565 [Capitella teleta]
Length = 610
Score = 41.6 bits (96), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 25/137 (18%)
Query: 11 KGLGNERLYPEGEVDP-------IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 63
K G E LY VDP IP ++ L Q ++ N + +N YQPG
Sbjct: 171 KHFGFEFLYSTNNVDPDHPLDMGIPPECSPILQRMLSQQIIL--NLPDQLTVNQYQPGQG 228
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF----GCKFNFKPIRVSEPVLYLPVQRGCVTLL 119
I H+D F+ ++SLSL S + F GC + PV +LP QR V ++
Sbjct: 229 IPPHVDTHSAFEEELVSLSLGSQVVMDFKAPGGCHY---------PV-FLP-QRSLV-VM 276
Query: 120 RDFAANGITHCVRPQDT 136
R + +TH + P+ +
Sbjct: 277 RGESRYQLTHAIAPRKS 293
>gi|434406443|ref|YP_007149328.1| alkylated DNA repair protein [Cylindrospermum stagnale PCC 7417]
gi|428260698|gb|AFZ26648.1| alkylated DNA repair protein [Cylindrospermum stagnale PCC 7417]
Length = 226
Score = 41.6 bits (96), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 9/74 (12%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
+ P+P W +L +L + V P + ++N+YQPG I SH D F IISLS
Sbjct: 98 IAPLPNWALTLA-QRLHK--VFSPTLPDQVIVNEYQPGQGISSHTDCVSCFTDVIISLS- 153
Query: 84 FSDSALCFGCKFNF 97
LC C +F
Sbjct: 154 -----LCSSCVMDF 162
>gi|392871202|gb|EAS33085.2| Alkbh6 protein [Coccidioides immitis RS]
Length = 265
Score = 41.6 bits (96), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 26 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 71
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 86 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 134
>gi|320038239|gb|EFW20175.1| hypothetical protein CPSG_03351 [Coccidioides posadasii str.
Silveira]
Length = 248
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 26 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 71
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 69 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 117
>gi|303316794|ref|XP_003068399.1| hypothetical protein CPC735_004250 [Coccidioides posadasii C735
delta SOWgp]
gi|240108080|gb|EER26254.1| hypothetical protein CPC735_004250 [Coccidioides posadasii C735
delta SOWgp]
Length = 271
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 26 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 71
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 92 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 140
>gi|421597443|ref|ZP_16041063.1| hypothetical protein BCCGELA001_08731 [Bradyrhizobium sp.
CCGE-LA001]
gi|404270447|gb|EJZ34507.1| hypothetical protein BCCGELA001_08731 [Bradyrhizobium sp.
CCGE-LA001]
Length = 190
Score = 41.2 bits (95), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 9/77 (11%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
GE +PIP W+ + ++E +P + + +Y+ G I H D PH FD+ ++ L
Sbjct: 71 GEAEPIPDWVLP-IARQVEAWAGLPEASVRQVLCTEYEAGVGIGWHRDKPH-FDK-VLGL 127
Query: 82 SLFSDSALCFGCKFNFK 98
SL + CKF F+
Sbjct: 128 SLGAP------CKFRFR 138
>gi|389594597|ref|XP_003722521.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|11022582|emb|CAC14240.1| hypothetical protein L7845.08 [Leishmania major]
gi|323363749|emb|CBZ12755.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 318
Score = 40.8 bits (94), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 73/159 (45%), Gaps = 13/159 (8%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 85
P+P ++ +LEQ VVP +++N+ N Y+PG I +HID ++D +F+
Sbjct: 79 PLPPASFMPLLERLEQDNVVPKSWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132
Query: 86 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILL 145
+L C F ++ E + + R V ++ A H V P + Q R +++
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190
Query: 146 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGG 179
R ++ +TP + PY + L + +++ GG
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGG 229
>gi|297804104|ref|XP_002869936.1| hypothetical protein ARALYDRAFT_329536 [Arabidopsis lyrata subsp.
lyrata]
gi|297315772|gb|EFH46195.1| hypothetical protein ARALYDRAFT_329536 [Arabidopsis lyrata subsp.
lyrata]
Length = 241
Score = 40.8 bits (94), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 66/159 (41%), Gaps = 34/159 (21%)
Query: 4 YGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 63
+G + KGL + L P+W+ + E G+ P + IN +IN+Y P
Sbjct: 52 WGGMVHEKGLVPQEL---------PSWLTKITAKICESSGLFP-SAINHVLINEYHPNQG 101
Query: 64 IVSHIDPPHIFDRPIIS-LSLFSDSALCFG------------------CKFNFKPIRVSE 104
I++H D P F P+++ LSL S + F C +++P R S
Sbjct: 102 IMAHQDGPAYF--PVVAILSLGSPVVMDFSPHLRLRSGDGYISKDQSPCAESYEPERDSF 159
Query: 105 PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
VL +P + + +D A + H + TQ V+
Sbjct: 160 SVLMMP---RSLLIFKDDAYSDFLHGISDSPTQCYNQVV 195
>gi|255081432|ref|XP_002507938.1| predicted protein [Micromonas sp. RCC299]
gi|226523214|gb|ACO69196.1| predicted protein [Micromonas sp. RCC299]
Length = 684
Score = 40.8 bits (94), Expect = 0.40, Method: Composition-based stats.
Identities = 23/69 (33%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 24 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
V P+P + ++V + + + +S IN Y PG I HID P FDRP +++SL
Sbjct: 126 VAPLPPELDAVVDALIARGALTELQRPDSCTINLYGPGQWIPPHIDNP-AFDRPFVTVSL 184
Query: 84 FSDSALCFG 92
S+ + G
Sbjct: 185 CSEQPMVLG 193
>gi|301104152|ref|XP_002901161.1| alkylated DNA repair protein alkB [Phytophthora infestans T30-4]
gi|262101095|gb|EEY59147.1| alkylated DNA repair protein alkB [Phytophthora infestans T30-4]
Length = 231
Score = 40.8 bits (94), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 7/80 (8%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
E +P+P W+Q + T L+ N A+IN+Y G CI+ H D P F P++ S
Sbjct: 94 EPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDCILPHEDGPAYF--PLV--S 149
Query: 83 LFSDSALCFGCKFNFKPIRV 102
+ S A C+ F+P R
Sbjct: 150 IISTGA---ECRVTFEPHRA 166
>gi|308502209|ref|XP_003113289.1| hypothetical protein CRE_25438 [Caenorhabditis remanei]
gi|308265590|gb|EFP09543.1| hypothetical protein CRE_25438 [Caenorhabditis remanei]
Length = 570
Score = 40.8 bits (94), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 4/68 (5%)
Query: 25 DPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
DPIP I +L+ + + + P+ I + V Y+PG I SH D F+ PI+S+SL
Sbjct: 173 DPIPPVINTLIDKLMSEKYITERPDQITANV---YEPGHGIPSHYDTHSAFEDPIVSISL 229
Query: 84 FSDSALCF 91
SD + F
Sbjct: 230 LSDVVMEF 237
>gi|17552176|ref|NP_497751.1| Protein ALKB-8 [Caenorhabditis elegans]
gi|15718120|emb|CAB63431.2| Protein ALKB-8 [Caenorhabditis elegans]
Length = 591
Score = 40.4 bits (93), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 36/70 (51%), Gaps = 4/70 (5%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 81
E DPIP I SL+ + + P+ + + N Y+ G I SH D FD PI+S+
Sbjct: 193 EADPIPPVINSLIDRLISDKYITERPDQVTA---NVYESGHGIPSHYDTHSAFDDPIVSI 249
Query: 82 SLFSDSALCF 91
SL SD + F
Sbjct: 250 SLLSDVVMEF 259
>gi|195027836|ref|XP_001986788.1| GH20337 [Drosophila grimshawi]
gi|193902788|gb|EDW01655.1| GH20337 [Drosophila grimshawi]
Length = 616
Score = 40.4 bits (93), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 44/93 (47%), Gaps = 18/93 (19%)
Query: 11 KGLGNERLYPEGEVD-------PIPA-----WIQSLVITKLEQMGVVPPNYINSAVINDY 58
K G E LY VD PIPA W Q L TK+ G P+ + +N+Y
Sbjct: 173 KHFGYEFLYGSNNVDAAQPLEQPIPAACDFLW-QRLDSTKIS--GTFLPDQLT---VNEY 226
Query: 59 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 91
+PG I H+D F PI+SLSL +D + F
Sbjct: 227 EPGQGIPPHVDTHSAFVDPILSLSLQADVVMDF 259
>gi|449305107|gb|EMD01114.1| hypothetical protein BAUCODRAFT_196023 [Baudoinia compniacensis
UAMH 10762]
Length = 248
Score = 40.4 bits (93), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 2/48 (4%)
Query: 26 PIPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPP 71
P+PAW+ V+ ++ +GV P+ IN +IN+Y PG I+ H D P
Sbjct: 71 PLPAWLTEPVVGRIHDLGVFADAPHGINHCLINEYLPGQGIMPHEDGP 118
>gi|301118833|ref|XP_002907144.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105656|gb|EEY63708.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 134
Score = 40.4 bits (93), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)
Query: 23 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 82
E +P+P W+Q + T L+ N A+IN+Y G CI+ H D P F P++ S
Sbjct: 11 EPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDCILPHEDGPAYF--PLV--S 66
Query: 83 LFSDSALCFGCKFNFKPIR 101
+ S A C+ F+P R
Sbjct: 67 IISTGA---ECRVTFEPHR 82
>gi|119187755|ref|XP_001244484.1| hypothetical protein CIMG_03925 [Coccidioides immitis RS]
Length = 119
Score = 40.4 bits (93), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 26 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 71
P+PAW+ + VIT+ +++ + P N +IN+YQPG I+ H D P
Sbjct: 51 PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 99
>gi|198474666|ref|XP_001356776.2| GA19388 [Drosophila pseudoobscura pseudoobscura]
gi|198138491|gb|EAL33842.2| GA19388 [Drosophila pseudoobscura pseudoobscura]
Length = 237
Score = 40.4 bits (93), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+QS V K+ +GV N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS 80
IIS
Sbjct: 115 IIS 117
>gi|195148282|ref|XP_002015103.1| GL18598 [Drosophila persimilis]
gi|194107056|gb|EDW29099.1| GL18598 [Drosophila persimilis]
Length = 237
Score = 40.4 bits (93), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+QS V K+ +GV N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS 80
IIS
Sbjct: 115 IIS 117
>gi|194761924|ref|XP_001963141.1| GF15798 [Drosophila ananassae]
gi|190616838|gb|EDV32362.1| GF15798 [Drosophila ananassae]
Length = 231
Score = 40.4 bits (93), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 5/76 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS-LSLFSDSALCFG 92
IIS +S S + L F
Sbjct: 115 IISTISCGSHTVLEFA 130
>gi|326914414|ref|XP_003203520.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Meleagris gallopavo]
Length = 846
Score = 40.4 bits (93), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y +VD P+P + + I+ LE+ + YI + +N Y+PG
Sbjct: 351 KHFGYEFRYDNNDVDKDKPLPGGLPEICISFLEK--CLKQGYIKHKPDQLTVNQYEPGQG 408
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I HID F+ IISLSL S+ + F
Sbjct: 409 IPPHIDTHSAFEDEIISLSLGSEIVMDF 436
>gi|47196062|emb|CAF89344.1| unnamed protein product [Tetraodon nigroviridis]
Length = 234
Score = 40.4 bits (93), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 8/121 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P+G + + IP W+Q+ ++ +G N ++N+Y+PG I+ H D P ++
Sbjct: 66 HPKGMLAETIPEWLQTYC-QRISSLGAFGGKVANHVLVNEYKPGEGIMPHEDGP-LYHPT 123
Query: 78 IISLSLFSDSALCF-----GCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 132
+ +LSL S + L F G + + + + L V+ + +L+D + H +R
Sbjct: 124 VTTLSLGSHTLLDFYTPVGGVQGDAPQTEENRFLFSLLVEPRSLLILQDEMYQKLLHGIR 183
Query: 133 P 133
P
Sbjct: 184 P 184
>gi|195439545|ref|XP_002067665.1| GK21248 [Drosophila willistoni]
gi|194163750|gb|EDW78651.1| GK21248 [Drosophila willistoni]
Length = 228
Score = 40.0 bits (92), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS 80
IIS
Sbjct: 115 IIS 117
>gi|195114110|ref|XP_002001610.1| GI16672 [Drosophila mojavensis]
gi|193912185|gb|EDW11052.1| GI16672 [Drosophila mojavensis]
Length = 222
Score = 39.7 bits (91), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS 80
IIS
Sbjct: 115 IIS 117
>gi|195386178|ref|XP_002051781.1| GJ17179 [Drosophila virilis]
gi|194148238|gb|EDW63936.1| GJ17179 [Drosophila virilis]
Length = 221
Score = 39.7 bits (91), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+QS V K+ +G+ N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS 80
IIS
Sbjct: 115 IIS 117
>gi|321463990|gb|EFX75001.1| hypothetical protein DAPPUDRAFT_306906 [Daphnia pulex]
Length = 574
Score = 39.7 bits (91), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 19/127 (14%)
Query: 11 KGLGNERLYPEGEVDP------IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 64
K G E LY +VD IP ++ L I K + G+ PN + +N Y PG I
Sbjct: 145 KHYGFEFLYGSNKVDKNPLSSGIPVEMEPL-IEKFMEFGLARPNQL---TVNHYVPGQGI 200
Query: 65 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL---RD 121
H D F+ IIS+SL SD + +FK + E V +R C+ + R
Sbjct: 201 PLHTDTHSSFEDGIISVSLGSDIVM------DFKNNKGGECVSITLPRRSCLIMTNESRY 254
Query: 122 FAANGIT 128
++GIT
Sbjct: 255 MWSHGIT 261
>gi|194880701|ref|XP_001974503.1| GG21781 [Drosophila erecta]
gi|190657690|gb|EDV54903.1| GG21781 [Drosophila erecta]
Length = 615
Score = 39.7 bits (91), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 7/88 (7%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQSL---VITKLEQMGVVPPNYINSA----VINDYQPGGC 63
K G E LY VDP QS+ ++ P N+ S+ +N+Y+PG
Sbjct: 175 KHFGFEFLYGSNNVDPFKPLEQSIPSACDILWPRLDSFPSNWDWSSPDQLTVNEYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I H+D F PI+SLSL SD + F
Sbjct: 235 IPPHVDTHSAFLDPILSLSLQSDVVMDF 262
>gi|195033105|ref|XP_001988620.1| GH11262 [Drosophila grimshawi]
gi|193904620|gb|EDW03487.1| GH11262 [Drosophila grimshawi]
Length = 221
Score = 39.7 bits (91), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + +P W+QS V K+ +GV N ++N+Y PG I+ H D P + P
Sbjct: 58 HPNGMIAEEMPEWLQSYV-DKVNNLGVFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114
Query: 78 IIS 80
IIS
Sbjct: 115 IIS 117
>gi|291383953|ref|XP_002708552.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Oryctolagus cuniculus]
Length = 664
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN YQPG
Sbjct: 177 KHFGYEFRYDNNNVDKDRPLPGGLPDICDSILEKW--LQEGYIKHKPDQLTINQYQPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDF 262
>gi|258576637|ref|XP_002542500.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237902766|gb|EEP77167.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 259
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 26 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHID 69
P+P W++S V+++ ++G+ P N +IN+YQPG I+ H D
Sbjct: 80 PLPDWLRSPVVSRFTELGIFFDSPHRAPNHVLINEYQPGQGIMPHED 126
>gi|170579523|ref|XP_001894867.1| hypothetical protein [Brugia malayi]
gi|158598386|gb|EDP36290.1| conserved hypothetical protein [Brugia malayi]
Length = 576
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 3/91 (3%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
DPIP+ Q VI ++ G+ + +N Y+PG I SH+D F I SLSL
Sbjct: 183 DPIPSVCQP-VIDRMLGAGIFKEK-PDQVTVNIYEPGNGIPSHVDTHSAFSDTIASLSLL 240
Query: 85 SDSALCFGCKFNFKPI-RVSEPVLYLPVQRG 114
SD + F N I V P L V RG
Sbjct: 241 SDLVMEFRDFANTSTIYDVLLPRFSLTVMRG 271
>gi|363729209|ref|XP_417166.3| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Gallus
gallus]
Length = 695
Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y +VD P+P + + I LE+ + YI + +N Y+PG
Sbjct: 201 KHFGYEFRYDNNDVDKDNPLPGGLPEICIPFLEK--CLKQGYIKHKPDQLTVNQYEPGQG 258
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I HID F+ IISLSL S+ + F
Sbjct: 259 IPPHIDTHSAFEDEIISLSLGSEIVMDF 286
>gi|58394263|ref|XP_320626.2| AGAP011900-PA [Anopheles gambiae str. PEST]
gi|55234784|gb|EAA00113.2| AGAP011900-PA [Anopheles gambiae str. PEST]
Length = 621
Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 32/64 (50%)
Query: 51 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 110
+ +N Y+PG I H+D FD PI+SLSL SD + F + K + V P L
Sbjct: 221 DQLTVNQYEPGQGIPPHVDTHSAFDDPILSLSLGSDVVMEFKQPSSGKVVCVDLPSRSLL 280
Query: 111 VQRG 114
+ G
Sbjct: 281 IMSG 284
>gi|440791732|gb|ELR12970.1| hypothetical protein ACA1_096410 [Acanthamoeba castellanii str.
Neff]
Length = 259
Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 49/112 (43%), Gaps = 13/112 (11%)
Query: 22 GEVDPIPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 79
G + P+P + Q V +L G++P P+ + +IN Y PG I H+D H F+ +
Sbjct: 103 GYLGPLPDFAQQ-VTARLVSTGLMPYEPDQM---IINHYTPGQGIHPHVDKTHCFEGVVG 158
Query: 80 SLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 131
SL L S C FK I V +R +L A G TH +
Sbjct: 159 SLGLGSS------CIMEFKHIETGRRVDVF-FERRTALMLTGEARYGWTHGI 203
>gi|395861438|ref|XP_003802993.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Otolemur
garnettii]
Length = 986
Score = 39.3 bits (90), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 42/86 (48%), Gaps = 9/86 (10%)
Query: 13 LGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGCIV 65
G E LY VD P+P + + + LE+ + YI + IN Y+PG I
Sbjct: 501 FGYEFLYENNNVDKDKPLPGGLPDICDSILEKW--LKEGYIRQKPDQVTINQYEPGHGIP 558
Query: 66 SHIDPPHIFDRPIISLSLFSDSALCF 91
+HID F+ I+SLSL S+ + F
Sbjct: 559 AHIDTHSAFEDEIVSLSLGSEIVMDF 584
>gi|298707663|emb|CBJ25980.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 329
Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+P+P W+ +L + + + N ++N+YQPG I++H D P ++ +LSL
Sbjct: 139 EPLPPWVDALCESLVVRGVFSEETRPNHVLLNEYQPGQGIMAHTDGP-FYEPRTATLSLG 197
Query: 85 SDSALCF 91
SD+ + F
Sbjct: 198 SDAVMHF 204
>gi|357498515|ref|XP_003619546.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
gi|355494561|gb|AES75764.1| Alkylated DNA repair protein alkB-like protein [Medicago
truncatula]
Length = 266
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 12/90 (13%)
Query: 11 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 62
K L N RL G V P+P+W+ + E+ G+ P + IN +IN+YQP
Sbjct: 50 KLLKNRRLQNWGGVVHEKGLLPQPLPSWLTNFTQKISEESGLFP-SPINHVLINEYQPNQ 108
Query: 63 CIVSHIDPPHIFDRPIIS-LSLFSDSALCF 91
I+ H D P F P+++ LSL S + F
Sbjct: 109 GIMPHQDGPSYF--PVVAILSLGSPVVMDF 136
>gi|443725258|gb|ELU12938.1| hypothetical protein CAPTEDRAFT_112863, partial [Capitella teleta]
Length = 214
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 13/134 (9%)
Query: 13 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 64
L N RL +P+G V + IP W+ S ++ Q+GV N ++N+Y PG I
Sbjct: 36 LSNRRLQNWGGLPHPKGMVPEKIPEWLDSFG-QRIGQLGVFDGQMPNHVLVNEYLPGQGI 94
Query: 65 VSHIDPPHIFDRPIIS-LSLFSDSALCFGCKFNFKPIRVSEP-VLYLPVQRGCVTLLRDF 122
+ H D P F P +S ++L S + L F N + + ++R + L+R+
Sbjct: 95 MPHTDGPLYF--PTVSTITLGSHTLLDFYTPLNDRSSSFDDRHFASFLLERRSLVLVREE 152
Query: 123 AANGITHCVRPQDT 136
+ + H ++ +T
Sbjct: 153 MYSRMLHGIKEVET 166
>gi|301121774|ref|XP_002908614.1| alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]
gi|262103645|gb|EEY61697.1| alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]
Length = 640
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+PIP +Q VI K+ + G++ + + +N+Y PG I H+D F I SLS+
Sbjct: 203 EPIPEVLQP-VIDKIVECGIMDGDRPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSIC 261
Query: 85 SDSALCF 91
S+ + F
Sbjct: 262 SEVVMDF 268
>gi|302803799|ref|XP_002983652.1| hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]
gi|300148489|gb|EFJ15148.1| hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]
Length = 556
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 13/118 (11%)
Query: 21 EGEVDPIPAWIQSLVITKLEQMGVVPPNY-INSAVINDYQPGGCIVSHIDPPHI---FDR 76
E +P+P +++S +I +L + VVP + +S I+ +PG + PPH F++
Sbjct: 279 EETAEPMPTFLES-IIDRLVKCQVVPASKRPDSCSISVLEPGDYM-----PPHKHNNFEQ 332
Query: 77 PIISLSLFSDSALCFGCKFNFKPIRVSEP-VLYLPVQRGCVTLLRDFAANGITHCVRP 133
P+ LSL S S L FGC E + LP G V +L +A + V+P
Sbjct: 333 PLFILSLGSQSELAFGCNLKANSSSTDEKYKVGLPA--GSVLVLEGNSAQMVQCAVQP 388
>gi|328715721|ref|XP_001945058.2| PREDICTED: hypothetical protein LOC100163323 [Acyrthosiphon pisum]
Length = 312
Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
DPIP + ++ ++ P N +N Y PG I SHID +FD I+SLSL
Sbjct: 40 DPIPKEFEFILNAIYLRLKWQP----NQITVNKYLPGQGIPSHIDTHGVFDEYILSLSLN 95
Query: 85 SDSALCF 91
SD + F
Sbjct: 96 SDIIMEF 102
>gi|195339853|ref|XP_002036531.1| GM18491 [Drosophila sechellia]
gi|194130411|gb|EDW52454.1| GM18491 [Drosophila sechellia]
Length = 228
Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 78 IISLSLFSDSALCFG 92
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|195151073|ref|XP_002016472.1| GL10469 [Drosophila persimilis]
gi|194110319|gb|EDW32362.1| GL10469 [Drosophila persimilis]
Length = 614
Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 16/131 (12%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGVVPP-NYINSAVINDYQPGGC 63
K G E LY VDP QS+ + +LE + + +N+Y+PG
Sbjct: 173 KHFGYEFLYGSNNVDPTKPLEQSVPTACGFLWPRLESFSSSWDWSTPDQLTVNEYEPGHG 232
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 123
I H+D F PI+SLSL SD + F R E + + + R + ++ A
Sbjct: 233 IPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEAQVQVKLPRRSLLIMSGEA 283
Query: 124 ANGITHCVRPQ 134
TH ++P+
Sbjct: 284 RYDWTHGIKPK 294
>gi|198457852|ref|XP_001360812.2| GA14676 [Drosophila pseudoobscura pseudoobscura]
gi|198136129|gb|EAL25387.2| GA14676 [Drosophila pseudoobscura pseudoobscura]
Length = 615
Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 16/131 (12%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGVVPP-NYINSAVINDYQPGGC 63
K G E LY VDP QS+ + +LE + + +N+Y+PG
Sbjct: 173 KHFGYEFLYGSNNVDPTKPLEQSVPTACDFLWPRLESFSSSWDWSTPDQLTVNEYKPGHG 232
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 123
I H+D F PI+SLSL SD + F R E + + + R + ++ A
Sbjct: 233 IPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEAQVQVKLPRRSLLIMSGEA 283
Query: 124 ANGITHCVRPQ 134
TH ++P+
Sbjct: 284 RYDWTHGIKPK 294
>gi|195578245|ref|XP_002078976.1| GD23711 [Drosophila simulans]
gi|194190985|gb|EDX04561.1| GD23711 [Drosophila simulans]
Length = 228
Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 78 IISLSLFSDSALCFG 92
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|348541845|ref|XP_003458397.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH6-like [Oreochromis niloticus]
Length = 234
Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 65/146 (44%), Gaps = 28/146 (19%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDP 70
KG+ ERL P W+Q+ K+ +G N ++N+Y+PG I+ H D
Sbjct: 68 KGMLGERL---------PDWLQTYC-EKISSLGAFSGKTANHVLVNEYKPGEGIMPHEDG 117
Query: 71 PHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEP----------VLYLPVQRGCVTLLR 120
P ++ + ++SL S + L F + P+ +E + L V+ + +L+
Sbjct: 118 P-LYHPTVTTISLGSHTLLDF-----YTPLSSAEGATPQTEESRFLFSLLVRPRSLLILQ 171
Query: 121 DFAANGITHCV--RPQDTQYRRAVIL 144
D + H + R QDT ++ V L
Sbjct: 172 DEMYQRLLHGIHGREQDTLTKKVVNL 197
>gi|312086366|ref|XP_003145047.1| hypothetical protein LOAG_09472 [Loa loa]
gi|307759789|gb|EFO19023.1| hypothetical protein LOAG_09472 [Loa loa]
Length = 576
Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 3/91 (3%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+PIPA Q +I ++ G+ + +N Y+PG I SH+D F I SLSL
Sbjct: 183 NPIPAACQP-IIDRMLDAGIFKEE-PDQLTVNIYEPGNGIPSHVDTHSAFSDTIASLSLL 240
Query: 85 SDSALCFGCKFNFKPI-RVSEPVLYLPVQRG 114
SD + F N I V P L L V +G
Sbjct: 241 SDLVMEFRDFANTSTIYDVLLPRLSLAVMQG 271
>gi|91080367|ref|XP_974935.1| PREDICTED: similar to CG17807 CG17807-PA [Tribolium castaneum]
gi|270005595|gb|EFA02043.1| hypothetical protein TcasGA2_TC007671 [Tribolium castaneum]
Length = 582
Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 23/41 (56%)
Query: 51 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 91
N IN Y PG I SH+D F PI+SLSL SD + F
Sbjct: 203 NQLTINRYNPGQGIPSHVDTHSAFGDPILSLSLSSDVVMEF 243
>gi|291238544|ref|XP_002739188.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 742
Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 54/126 (42%), Gaps = 7/126 (5%)
Query: 25 DPIPAWIQSLVITKLEQMGVV-PPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 83
D IPA ++ ++ + + PN I +N YQPG I HID F+ IISLSL
Sbjct: 267 DGIPAALRKIIDDIMATQNIQHGPNQI---TVNQYQPGQGIPPHIDTHSAFEGEIISLSL 323
Query: 84 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVI 143
S+ + F N + I V P L V G L H V P DT +
Sbjct: 324 GSNVIMDFK-HPNGQHIPVLVPQRSLLVMTGESRYLWTHGITPRKHDVAPADTD--TGLT 380
Query: 144 LLRRVL 149
L +R L
Sbjct: 381 LTKRTL 386
>gi|168057031|ref|XP_001780520.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667998|gb|EDQ54614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 258
Score = 38.5 bits (88), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIIS-LSLF 84
PIPAW+ S + K+ + + P IN ++N+Y PG I SH D P + P+++ LSL
Sbjct: 59 PIPAWLSS-ITEKIAKETNLFPAPINHVLVNEYLPGQGITSHQDGPVYY--PVVAILSLG 115
Query: 85 SDSALCF 91
+ + + F
Sbjct: 116 APTLMHF 122
>gi|390349756|ref|XP_790566.3| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
ABH6-like [Strongylocentrotus purpuratus]
Length = 245
Score = 38.5 bits (88), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)
Query: 13 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 64
L N RL +P+G + + +P W+ + K+ +GV + N ++N+YQPG I
Sbjct: 55 LSNRRLQNWGGLPHPKGMIAEGLPKWL-DVYAKKIAGLGVFGDHIPNHVLVNEYQPGQGI 113
Query: 65 VSHIDPPHIFDRPIISLSLFSDSALCF 91
+ H D P +F + ++SL S + L F
Sbjct: 114 MPHEDGP-LFHPVVTTISLGSHTFLDF 139
>gi|351710212|gb|EHB13131.1| Alkylated DNA repair protein alkB-like protein 8, partial
[Heterocephalus glaber]
Length = 667
Score = 38.5 bits (88), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 179 KHFGYEFHYENNNVDKDKPLPEGLPDICDSFLEKW--LAEGYIKCRPDQLTINQYEPGQG 236
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 237 IPAHIDTHSAFEDEIISLSLGSEVVMDF 264
>gi|348681591|gb|EGZ21407.1| hypothetical protein PHYSODRAFT_491312 [Phytophthora sojae]
Length = 688
Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats.
Identities = 21/67 (31%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 25 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+PIP +Q +I K+ + G++ + + +N+Y PG I H+D F I SLS+
Sbjct: 208 EPIPEVLQP-IIEKIARCGIMDGDEPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSIC 266
Query: 85 SDSALCF 91
S+ + F
Sbjct: 267 SEVVMDF 273
>gi|340939079|gb|EGS19701.1| hypothetical protein CTHT_0041830 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 353
Score = 38.5 bits (88), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 15/82 (18%)
Query: 17 RLYPEGEVDP-IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFD 75
R+YPEGE P PA + + T V P +A++N Y PG ++ H D D
Sbjct: 194 RVYPEGEEPPAFPADLAGFLET------VFPETLAQAAIVNFYTPGDTMMMHRDVSEETD 247
Query: 76 RPIISLSLFSDSALCFGCKFNF 97
+ ++SLS FGC+ F
Sbjct: 248 KGLVSLS--------FGCEGLF 261
>gi|85724924|ref|NP_001033895.1| CG6144, isoform B [Drosophila melanogaster]
gi|85726418|ref|NP_609414.3| CG6144, isoform C [Drosophila melanogaster]
gi|16076870|gb|AAL13362.1| LD37206p [Drosophila melanogaster]
gi|84795296|gb|AAF52964.3| CG6144, isoform C [Drosophila melanogaster]
gi|84795297|gb|ABC65892.1| CG6144, isoform B [Drosophila melanogaster]
gi|220946136|gb|ACL85611.1| CG6144-PA [synthetic construct]
gi|220960392|gb|ACL92732.1| CG6144-PA [synthetic construct]
Length = 228
Score = 38.5 bits (88), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 78 IISLSLFSDSALCF 91
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEF 129
>gi|195471890|ref|XP_002088235.1| GE18466 [Drosophila yakuba]
gi|194174336|gb|EDW87947.1| GE18466 [Drosophila yakuba]
Length = 234
Score = 38.5 bits (88), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+Q+ V K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 78 IISLSLFSDSALCFG 92
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|440902148|gb|ELR52976.1| Alkylated DNA repair protein alkB-like protein 8, partial [Bos
grunniens mutus]
Length = 349
Score = 38.5 bits (88), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + +I N IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICESILEKW--LKEGFIKHKPNQLTINQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDF 262
>gi|395329787|gb|EJF62172.1| hypothetical protein DICSQDRAFT_104425 [Dichomitus squalens
LYAD-421 SS1]
Length = 233
Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 50/122 (40%), Gaps = 34/122 (27%)
Query: 47 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 106
P A+IN Y PG I H+D PH + I+ +SL S GC+
Sbjct: 123 PPRARQAIINLYWPGEGITPHVDLPHRYGDGIMGVSLGS------GCQC----------A 166
Query: 107 LYLPVQRGCVTLLRDFAANGITHCVR-------------PQDTQYRRAV---ILLRRVLP 150
LYLP +G V ++ + A G TH + PQ+ R + I R +LP
Sbjct: 167 LYLP--KGSVLIMTEEARYGWTHGIEKRFDDWVEESPGSPQNVLLERDIRLSITFRWLLP 224
Query: 151 HA 152
A
Sbjct: 225 GA 226
>gi|408785541|ref|ZP_11197285.1| hypothetical protein C241_04278 [Rhizobium lupini HPC(L)]
gi|408488554|gb|EKJ96864.1| hypothetical protein C241_04278 [Rhizobium lupini HPC(L)]
Length = 196
Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 8/84 (9%)
Query: 20 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 79
P+ + P+P W+ + +L + G N + + N+Y PG I +H+D FD I+
Sbjct: 62 PDAYLGPLPPWL-GVFARRLVRNGHCE-NLPDQVIANEYFPGQGISAHVDCVPCFDDTIV 119
Query: 80 SLSLFSDSALCFGCKFNFKPIRVS 103
S+SL S C+ F+ +R S
Sbjct: 120 SISLLS------ACEMVFRDLRGS 137
>gi|340053555|emb|CCC47848.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 305
Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 13/150 (8%)
Query: 35 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 94
++ ++EQ G+VP ++N+ N Y+ I +HID I+D +SL +++ L F
Sbjct: 85 LVERMEQDGLVPCGWLNNQTANFYETNDFIRAHIDNLFIYDDIFAIVSLGANALLRFVHV 144
Query: 95 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQYRRAVILLRRVLPHAPR 154
N + + V P G V ++ + H V P + Q R +++ R +L
Sbjct: 145 QNGEELDVVIP-------DGSVYIMSGPSRYVYFHMVLPVEAQ-RVSIVFRRSILNSDGS 196
Query: 155 LTLSQTP--RVKPYHSLYDVQ---ARESGG 179
TP + PY S V A++ GG
Sbjct: 197 FRPVSTPLGDLMPYRSTQIVNTLYAKQVGG 226
>gi|383191518|ref|YP_005201646.1| alkylated DNA repair protein [Rahnella aquatilis CIP 78.65 = ATCC
33071]
gi|371589776|gb|AEX53506.1| alkylated DNA repair protein [Rahnella aquatilis CIP 78.65 = ATCC
33071]
Length = 217
Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 2/68 (2%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDP-PHIFDRPIISLSLF 84
P+PA + L ++ E+ G ++ +IN YQPG + H D H F +PI+S+SL
Sbjct: 95 PMPALFRRLAVSAAEEAGFAG-FAPDACLINRYQPGAKMSLHQDKDEHDFGQPIVSVSLG 153
Query: 85 SDSALCFG 92
+ FG
Sbjct: 154 LPAVFQFG 161
>gi|195380141|ref|XP_002048829.1| GJ21257 [Drosophila virilis]
gi|194143626|gb|EDW60022.1| GJ21257 [Drosophila virilis]
Length = 616
Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 6/87 (6%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQ------SLVITKLEQMGVVPPNYINSAVINDYQPGGCI 64
K G E LY VDP+ Q + +L + + +N+Y+PG I
Sbjct: 174 KHFGYEFLYGSNNVDPLQPLEQPIPTACDFLWQRLACCEAPALDTPDQLTVNEYEPGQGI 233
Query: 65 VSHIDPPHIFDRPIISLSLFSDSALCF 91
H+D F PI+SLSL SD + F
Sbjct: 234 PPHVDTHSAFVDPILSLSLQSDVVMDF 260
>gi|145522424|ref|XP_001447056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414556|emb|CAK79659.1| unnamed protein product [Paramecium tetraurelia]
Length = 636
Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 42/90 (46%), Gaps = 9/90 (10%)
Query: 27 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 86
IPA+++ V K+ + V P IN IN+Y PG I H D F +S+SL S
Sbjct: 213 IPAFLED-VRAKVSDL-VKPQAEINQLTINEYLPGMGIPPHFDVHPPFHEKFVSISLLS- 269
Query: 87 SALCFGCKFNFKPIRVSEPVLYLPVQRGCV 116
G +FK + E LYLP R C
Sbjct: 270 -----GLVMSFKSYKGEEQHLYLP-PRSCA 293
>gi|26334747|dbj|BAC31074.1| unnamed protein product [Mus musculus]
Length = 447
Score = 38.1 bits (87), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSAIVMDF 262
>gi|300793742|ref|NP_001178838.1| alkylated DNA repair protein alkB homolog 8 [Rattus norvegicus]
Length = 664
Score = 38.1 bits (87), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNTVDKDKPLPGGLPEICSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S + F
Sbjct: 235 IPAHIDSHSAFEHEIVSLSLGSAIVMDF 262
>gi|380025248|ref|XP_003696389.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Apis
florea]
Length = 584
Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 46/106 (43%), Gaps = 15/106 (14%)
Query: 11 KGLGNERLYPEGEVD------PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 64
K G E Y +VD PIP Q L T +Q V Y + IN Y PG I
Sbjct: 163 KHFGYEFQYDTNKVDLDKPIIPIPKNYQFLQ-TLFKQYHDVSYEY-DQLTINHYLPGQGI 220
Query: 65 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 110
HID +F+ I+SLSL S C NFK + L+LP
Sbjct: 221 PPHIDTHSVFEDSILSLSLGS------ACIMNFKK-ENEKASLFLP 259
>gi|194754858|ref|XP_001959711.1| GF13009 [Drosophila ananassae]
gi|190621009|gb|EDV36533.1| GF13009 [Drosophila ananassae]
Length = 609
Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 59/135 (43%), Gaps = 24/135 (17%)
Query: 11 KGLGNERLYPEGEVDP-------IPAWIQSLVITKLE----QMGVVPPNYINSAVINDYQ 59
K G E LY VDP IP+ ++ +LE + P+ + +N+Y+
Sbjct: 174 KHFGYEFLYGSNNVDPSKPLEQPIPSAC-DILWPRLEGNSTTWDWITPDQLT---VNEYE 229
Query: 60 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 119
PG I H+D F PI+SLSL SD + F R E + + + R + ++
Sbjct: 230 PGNGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEEHVQVRLPRRSLLVM 280
Query: 120 RDFAANGITHCVRPQ 134
A TH +RP+
Sbjct: 281 SGEARYDWTHGIRPK 295
>gi|348553212|ref|XP_003462421.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Cavia
porcellus]
Length = 664
Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + +N Y+PG
Sbjct: 177 KHFGYEFYYENNNVDKDKPLPEGLPDICDSFLEKW--LAEGYIKHKPDQLTVNQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +H+D F+ IISLSL S+ + F
Sbjct: 235 IPAHVDTHSAFEDEIISLSLGSEVVMDF 262
>gi|61675696|ref|NP_080579.1| alkylated DNA repair protein alkB homolog 8 [Mus musculus]
gi|81895388|sp|Q80Y20.1|ALKB8_MOUSE RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
Full=Probable alpha-ketoglutarate-dependent dioxygenase
ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
tRNA methyltransferase ABH8; AltName: Full=tRNA
(carboxymethyluridine(34)-5-O)-methyltransferase ABH8
gi|30046964|gb|AAH50863.1| AlkB, alkylation repair homolog 8 (E. coli) [Mus musculus]
Length = 664
Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSAIVMDF 262
>gi|302854777|ref|XP_002958893.1| hypothetical protein VOLCADRAFT_108374 [Volvox carteri f.
nagariensis]
gi|300255737|gb|EFJ40024.1| hypothetical protein VOLCADRAFT_108374 [Volvox carteri f.
nagariensis]
Length = 536
Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 27 IPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 84
+P W+ LV E++ +P ++ +N+Y PG + H+D F PIISLSL
Sbjct: 190 LPPWVSDLV----ERISALPEVSQPLDQLTVNEYDPGVGLAPHVDTHSAFTGPIISLSLG 245
Query: 85 SDSAL 89
S + +
Sbjct: 246 STAVM 250
>gi|53292605|ref|NP_001005390.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Danio
rerio]
gi|82184989|sp|Q6IQE9.1|ALKB6_DANRE RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
homolog 6; AltName: Full=Alkylated DNA repair protein
alkB homolog 6
gi|47939433|gb|AAH71457.1| AlkB, alkylation repair homolog 6 (E. coli) [Danio rerio]
Length = 234
Score = 37.7 bits (86), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 59/124 (47%), Gaps = 18/124 (14%)
Query: 20 PEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPI 78
P+G + + +P W+ K+ +G N ++N+Y+PG I+ H D P ++ +
Sbjct: 67 PKGMLAEKLPDWLLEYT-EKISALGAFAGKTANHVLVNEYKPGEGIMPHEDGP-LYHPTV 124
Query: 79 ISLSLFSDSALCFGCKFNFKPIRVSEP----------VLYLPVQRGCVTLLRDFAANGIT 128
++++ S + L F ++P+ +EP +L L VQR + +L+D
Sbjct: 125 TTITVGSHTLLDF-----YRPVCQAEPDAPQTEESRYMLSLLVQRKSLLILQDDMYKCYL 179
Query: 129 HCVR 132
H +R
Sbjct: 180 HGIR 183
>gi|346466049|gb|AEO32869.1| hypothetical protein [Amblyomma maculatum]
Length = 241
Score = 37.7 bits (86), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P+G V +P+P W++ + K+ +GV N ++N+Y+PG I+ H D P + P
Sbjct: 67 HPKGMVPEPLPQWLKG-ICAKVASLGVFGEKLPNHVLVNEYKPGEGILPHEDGPLYY--P 123
Query: 78 IIS-LSLFSDSALCF 91
+++ ++L S + + F
Sbjct: 124 VVTNVTLNSSTVIDF 138
>gi|194862285|ref|XP_001969966.1| GG23646 [Drosophila erecta]
gi|190661833|gb|EDV59025.1| GG23646 [Drosophila erecta]
Length = 232
Score = 37.7 bits (86), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P G + + IP W+Q + + K+ +GV N ++N+Y PG I+ H D P +F
Sbjct: 58 HPNGMIAEEIPEWLQ-IYVDKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115
Query: 78 IISLSLFSDSALCFG 92
I ++S + + L F
Sbjct: 116 ISTISTGAHTVLEFA 130
>gi|355752600|gb|EHH56720.1| hypothetical protein EGM_06185 [Macaca fascicularis]
Length = 732
Score = 37.7 bits (86), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 265 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 322
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 323 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 350
>gi|291301566|ref|YP_003512844.1| 2OG-Fe(II) oxygenase [Stackebrandtia nassauensis DSM 44728]
gi|290570786|gb|ADD43751.1| 2OG-Fe(II) oxygenase [Stackebrandtia nassauensis DSM 44728]
Length = 197
Score = 37.7 bits (86), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 17/96 (17%)
Query: 2 YTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPG 61
Y YG R+ +G P PIPAW+ +V +++ + P + ++N+Y PG
Sbjct: 51 YDYG----RRSVGQTDAAP-----PIPAWLDEVVARLVDEKVMDQPA--DQVIVNEYLPG 99
Query: 62 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF 97
I +H+D F + ++SL S GC +F
Sbjct: 100 QGISAHVDCVPCFGPVVAAISLES------GCVMDF 129
>gi|156060317|ref|XP_001596081.1| hypothetical protein SS1G_02297 [Sclerotinia sclerotiorum 1980]
gi|154699705|gb|EDN99443.1| hypothetical protein SS1G_02297 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 1103
Score = 37.7 bits (86), Expect = 3.7, Method: Composition-based stats.
Identities = 22/70 (31%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 17 RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 76
++YPEGE P+ I +L+ G+ P +A++N Y P + H D DR
Sbjct: 227 KIYPEGEPPKFPSDIANLI------EGLFPDMEPQAAIVNFYSPKDTLQLHRDGAESVDR 280
Query: 77 PIISLSLFSD 86
++S+SL D
Sbjct: 281 GLVSISLGCD 290
>gi|26349167|dbj|BAC38223.1| unnamed protein product [Mus musculus]
Length = 537
Score = 37.7 bits (86), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE++ + YI + IN Y+PG
Sbjct: 50 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 107
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S + F
Sbjct: 108 IPAHIDTHSAFEDEIISLSLGSAIVMDF 135
>gi|332208092|ref|XP_003253130.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
[Nomascus leucogenys]
Length = 664
Score = 37.7 bits (86), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|380789693|gb|AFE66722.1| alkylated DNA repair protein alkB homolog 8 [Macaca mulatta]
gi|380789695|gb|AFE66723.1| alkylated DNA repair protein alkB homolog 8 [Macaca mulatta]
Length = 664
Score = 37.7 bits (86), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|441644875|ref|XP_004090625.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Nomascus
leucogenys]
Length = 667
Score = 37.7 bits (86), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|75048494|sp|Q95K79.1|ALKB8_MACFA RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
Full=Probable alpha-ketoglutarate-dependent dioxygenase
ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
tRNA methyltransferase ABH8; AltName: Full=tRNA
(carboxymethyluridine(34)-5-O)-methyltransferase ABH8
gi|14388555|dbj|BAB60797.1| hypothetical protein [Macaca fascicularis]
Length = 664
Score = 37.7 bits (86), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|297690113|ref|XP_002822470.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
[Pongo abelii]
Length = 667
Score = 37.7 bits (86), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|297690115|ref|XP_002822471.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 2
[Pongo abelii]
Length = 664
Score = 37.4 bits (85), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|402895130|ref|XP_003910687.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 2
[Papio anubis]
Length = 667
Score = 37.4 bits (85), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|402895128|ref|XP_003910686.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
[Papio anubis]
Length = 664
Score = 37.4 bits (85), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|109108545|ref|XP_001102763.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
1 [Macaca mulatta]
gi|109108547|ref|XP_001102947.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
3 [Macaca mulatta]
Length = 664
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262
>gi|297269101|ref|XP_001102856.2| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
2 [Macaca mulatta]
Length = 667
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ I+SLSL S+ + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265
>gi|219114389|ref|XP_002176365.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402611|gb|EEC42601.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 345
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 44/105 (41%), Gaps = 16/105 (15%)
Query: 45 VPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSE 104
PP N +IN+YQP I+ H D P ++ +LSL D L F + + R +
Sbjct: 185 TPPRLPNHVLINEYQPADGILPHTDGPS-YEPCTATLSLGGDVVLRFAPRVH----RKED 239
Query: 105 PVLYLPVQ-----------RGCVTLLRDFAANGITHCVRPQDTQY 138
P L + RG + + RD A H +R D +Y
Sbjct: 240 PRARLTTRPTDAHAVYLSGRGSLVVFRDDAYRDYGHAIRYTDHRY 284
>gi|344287968|ref|XP_003415723.1| PREDICTED: LOW QUALITY PROTEIN: alkylated DNA repair protein alkB
homolog 8-like [Loxodonta africana]
Length = 625
Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 176 KHFGYEFHYETNNVDKNKPLPGGLPEICDSILEKW--LKEGYIKHKPDQLTINQYEPGHG 233
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL ++ + F
Sbjct: 234 IPAHIDTHSAFEDEIISLSLGAEIVMDF 261
>gi|169764793|ref|XP_001816868.1| calpain [Aspergillus oryzae RIB40]
gi|238503944|ref|XP_002383204.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|83764722|dbj|BAE54866.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220690675|gb|EED47024.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|391863454|gb|EIT72765.1| calpain [Aspergillus oryzae 3.042]
Length = 236
Score = 37.4 bits (85), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 26 PIPAWIQSLVITKLEQMGV---VPPNYINSAVINDYQPGGCIVSHID 69
P+P+W+ S VI + E +G+ P N ++N+Y+PG I+ H D
Sbjct: 69 PLPSWLVSPVIPRFESLGIFADAPHGAPNHVLVNEYRPGQGIMPHED 115
>gi|260831338|ref|XP_002610616.1| hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]
gi|229295983|gb|EEN66626.1| hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]
Length = 231
Score = 37.4 bits (85), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 15/96 (15%)
Query: 13 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 64
L N RL +P+G V + +P W+Q + ++ +GV N ++N+YQ G I
Sbjct: 43 LSNRRLQNWGGLPHPKGMVVEKLPEWLQ-MCADRIHSLGVFQDKMPNHVLVNEYQQGQGI 101
Query: 65 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 100
+ H D P +F I +++L S + L F +KP+
Sbjct: 102 MPHEDGP-LFYPTITTVNLGSHTMLDF-----YKPL 131
>gi|195487923|ref|XP_002092096.1| GE11857 [Drosophila yakuba]
gi|194178197|gb|EDW91808.1| GE11857 [Drosophila yakuba]
Length = 615
Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 55/135 (40%), Gaps = 24/135 (17%)
Query: 11 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSA-----------VINDYQ 59
K G E LY VDP QS+ ++ P N A +N+Y+
Sbjct: 175 KHFGFEFLYGSNNVDPSKPLEQSIP----SACDILWPRLDNFASTWDWSSPDQLTVNEYE 230
Query: 60 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 119
PG I H+D F PI+SLSL SD + F R + + + + R + ++
Sbjct: 231 PGHGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGDDQVQVRLPRRSLLIM 281
Query: 120 RDFAANGITHCVRPQ 134
A TH +RP+
Sbjct: 282 SGEARYDWTHGIRPK 296
>gi|444723561|gb|ELW64212.1| Alkylated DNA repair protein alkB like protein 8 [Tupaia chinensis]
Length = 583
Score = 37.0 bits (84), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + YI + IN Y+PG
Sbjct: 98 KHFGYEFCYENNNVDRDKPLPGGLPDICDSILEKW--LKEGYIKYKPDQLTINQYEPGQG 155
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I HID F+ I+SLSL S+ + F
Sbjct: 156 IPPHIDTHSAFEDEIVSLSLGSEVVMDF 183
>gi|424909400|ref|ZP_18332777.1| alkylated DNA repair protein [Rhizobium leguminosarum bv. viciae
USDA 2370]
gi|392845431|gb|EJA97953.1| alkylated DNA repair protein [Rhizobium leguminosarum bv. viciae
USDA 2370]
Length = 215
Score = 37.0 bits (84), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 42/84 (50%), Gaps = 8/84 (9%)
Query: 20 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 79
P+ + P+P W+ + +L + G + + + N+Y PG I +H+D FD I+
Sbjct: 81 PDAYLGPLPPWL-GVFARRLVRDGHCE-SLPDQVIANEYFPGQGISAHVDCVPCFDDTIV 138
Query: 80 SLSLFSDSALCFGCKFNFKPIRVS 103
S+SL S C+ F+ +R S
Sbjct: 139 SISLLS------ACEMVFRDLRGS 156
>gi|452846411|gb|EME48343.1| hypothetical protein DOTSEDRAFT_118289 [Dothistroma septosporum
NZE10]
Length = 315
Score = 37.0 bits (84), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 7/78 (8%)
Query: 17 RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 76
++YPEG P I+ LV V P +A++N Y PG + H D +R
Sbjct: 166 KIYPEGAPPAFPQDIKRLVED-------VFPMKAEAAIVNLYSPGDTLSLHRDVSEECNR 218
Query: 77 PIISLSLFSDSALCFGCK 94
P+IS+SL D+ G +
Sbjct: 219 PLISISLGCDAIFICGLE 236
>gi|189027511|sp|A1A4L5.1|ALKB8_BOVIN RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
Full=Probable alpha-ketoglutarate-dependent dioxygenase
ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
tRNA methyltransferase ABH8; AltName: Full=tRNA
(carboxymethyluridine(34)-5-O)-methyltransferase ABH8
gi|119223961|gb|AAI26688.1| Hypothetical protein LOC781788 [Bos taurus]
gi|296480338|tpg|DAA22453.1| TPA: alkylated DNA repair protein alkB homolog 8 [Bos taurus]
Length = 664
Score = 37.0 bits (84), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + +I + IN Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICESILEKW--LKEGFIKHKPDQLTINQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDF 262
>gi|346644780|ref|NP_001073810.2| alkylated DNA repair protein alkB homolog 8 [Bos taurus]
Length = 671
Score = 37.0 bits (84), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + +I + IN Y+PG
Sbjct: 184 KHFGYEFHYENNNVDKDKPLPGGLPDICESILEKW--LKEGFIKHKPDQLTINQYEPGHG 241
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 242 IPAHIDTHSAFEDEIISLSLGSEIVMDF 269
>gi|426244465|ref|XP_004016042.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Ovis aries]
Length = 671
Score = 37.0 bits (84), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + +I + IN Y+PG
Sbjct: 184 KHFGYEFHYENNNVDKDKPLPGGLPDICESILEKW--LKEGFIKHKPDQLTINQYEPGHG 241
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 242 IPAHIDTHSAFEDEIISLSLGSEIVMDF 269
>gi|347833688|emb|CCD49385.1| hypothetical protein [Botryotinia fuckeliana]
Length = 358
Score = 37.0 bits (84), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 17 RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 76
++YPEGE P+ I +LV G+ P +A++N Y P + H D DR
Sbjct: 210 KVYPEGEPPKFPSDIANLV------EGLFPDMEPQAAIVNFYSPKDTLQLHRDGAESVDR 263
Query: 77 PIISLSLFSD 86
++S+SL D
Sbjct: 264 GLVSISLGCD 273
>gi|154292616|ref|XP_001546879.1| hypothetical protein BC1G_14634 [Botryotinia fuckeliana B05.10]
Length = 351
Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 35/70 (50%), Gaps = 6/70 (8%)
Query: 17 RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 76
++YPEGE P+ I +LV G+ P +A++N Y P + H D DR
Sbjct: 203 KVYPEGEPPKFPSDIANLV------EGLFPDMEPQAAIVNFYSPKDTLQLHRDGAESVDR 256
Query: 77 PIISLSLFSD 86
++S+SL D
Sbjct: 257 GLVSISLGCD 266
>gi|145491391|ref|XP_001431695.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398800|emb|CAK64297.1| unnamed protein product [Paramecium tetraurelia]
Length = 634
Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 13/94 (13%)
Query: 27 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHID--PPHIFDRPIISLSLF 84
IPA+++ V K+ + + P IN IN+Y PG I H D PP F +++SL
Sbjct: 211 IPAFLED-VRAKVSDL-IKPQAEINQLTINEYLPGMGIPPHFDVHPP--FHEKFVTISLL 266
Query: 85 SDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTL 118
S G +FK + E LYLP R C
Sbjct: 267 S------GLVMSFKSFKGEEHHLYLP-PRSCALF 293
>gi|24658267|ref|NP_611690.2| CG17807 [Drosophila melanogaster]
gi|7291441|gb|AAF46867.1| CG17807 [Drosophila melanogaster]
Length = 615
Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 9/84 (10%)
Query: 51 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 110
+ +N+Y+PG I H+D F PI+SLSL SD + F R + + +
Sbjct: 222 DQLTVNEYEPGHGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGDDQVQVR 272
Query: 111 VQRGCVTLLRDFAANGITHCVRPQ 134
+ R + ++ A TH +RP+
Sbjct: 273 LPRRSLLIMSGEARYDWTHGIRPK 296
>gi|17863052|gb|AAL40003.1| SD10403p [Drosophila melanogaster]
Length = 615
Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 9/84 (10%)
Query: 51 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 110
+ +N+Y+PG I H+D F PI+SLSL SD + F R + + +
Sbjct: 222 DQLTVNEYEPGHGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGDDQVQVR 272
Query: 111 VQRGCVTLLRDFAANGITHCVRPQ 134
+ R + ++ A TH +RP+
Sbjct: 273 LPRRSLLIMSGEARYDWTHGIRPK 296
>gi|302823387|ref|XP_002993346.1| hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]
gi|300138777|gb|EFJ05531.1| hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]
Length = 229
Score = 36.6 bits (83), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 11/79 (13%)
Query: 11 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 62
K L N RL G + P+P+W+ + K+ + + P+ IN ++N+Y PG
Sbjct: 42 KTLTNRRLQNWGGIVESNGLVPQPLPSWLTK-ITEKISSVTGLFPSPINHVLVNEYLPGQ 100
Query: 63 CIVSHIDPPHIFDRPIISL 81
I+ H D P F P++++
Sbjct: 101 GIMLHQDGPSYF--PVVAI 117
>gi|358395346|gb|EHK44733.1| hypothetical protein TRIATDRAFT_184325, partial [Trichoderma
atroviride IMI 206040]
Length = 345
Score = 36.6 bits (83), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 39/87 (44%), Gaps = 16/87 (18%)
Query: 17 RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 76
R+YPE PA + + + T + P +A++N Y PG ++ H D D+
Sbjct: 186 RIYPEAPPPSFPADLSNFLTT------LFPETLAQAAIVNFYTPGDTMMMHRDVSEEIDK 239
Query: 77 PIISLSLFSDSALCFGCK--FNFKPIR 101
++SLS FGC+ F P R
Sbjct: 240 GLVSLS--------FGCECLFMIAPSR 258
>gi|348684972|gb|EGZ24787.1| hypothetical protein PHYSODRAFT_311607 [Phytophthora sojae]
Length = 224
Score = 36.6 bits (83), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 37/79 (46%), Gaps = 7/79 (8%)
Query: 26 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 85
P+P W+ + T +E N A+IN+Y G CI+ H D P + P + S+ S
Sbjct: 97 PLPRWLTQISQTLVEAGIFSEEKKPNHALINEYGVGDCIMPHEDGPAYY--PFV--SIIS 152
Query: 86 DSALCFGCKFNFKPIRVSE 104
A C+ F+P R E
Sbjct: 153 TGA---ECRVTFEPHRALE 168
>gi|302781915|ref|XP_002972731.1| hypothetical protein SELMODRAFT_98137 [Selaginella moellendorffii]
gi|300159332|gb|EFJ25952.1| hypothetical protein SELMODRAFT_98137 [Selaginella moellendorffii]
Length = 231
Score = 36.6 bits (83), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 12/90 (13%)
Query: 11 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 62
K L N RL G + P+P+W+ V K+ + + P+ IN ++N+Y PG
Sbjct: 42 KTLTNRRLQNWGGIVESNGLVPQPLPSWLIK-VTEKISSVTGLFPSPINHVLVNEYLPGQ 100
Query: 63 CIVSHIDPPHIFDRPIIS-LSLFSDSALCF 91
I+ H D P F P+++ LSL + + + F
Sbjct: 101 GIMLHQDGPSYF--PVVAILSLGAPTLMRF 128
>gi|395526088|ref|XP_003765203.1| PREDICTED: uncharacterized protein LOC100929995 [Sarcophilus
harrisii]
Length = 658
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 88/195 (45%), Gaps = 27/195 (13%)
Query: 19 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 77
+P+G V + +P W+Q V + +G+ + N ++N+Y+PG I+ H D P ++
Sbjct: 333 HPKGMVPEQLPRWLQRWV-DAVSALGLFGGSQANHVLVNEYRPGEGIMPHEDGP-LYYPT 390
Query: 78 IISLSLFSDSALCF----GCKFNFKPIRVSEPVLY----LPVQRGCVTLLRDFAANGITH 129
+ ++SL S + L G + P R +P L+ L ++ + +LR A + H
Sbjct: 391 VSTISLGSHTVLDLYPPRGPELEKDP-REEQPQLHPKFSLLLEPRSLLVLRGEAYTHLLH 449
Query: 130 CVRP------QDTQYRRAVIL----LRRVLPHAPR--LTLSQTPRVKPYHSLYDV---QA 174
+RP DT A L +LP R LT+ + PRV L +A
Sbjct: 450 GIRPILVDSLSDTPVANASACPSARLGALLPRGTRVSLTIRRVPRVLRAGLLLSKKSPKA 509
Query: 175 RESGGYFRSSIETYN 189
ES G ++ T++
Sbjct: 510 WESPGASHGTVPTWD 524
>gi|383858166|ref|XP_003704573.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
[Megachile rotundata]
Length = 582
Score = 36.2 bits (82), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 14 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHI 73
G+ ++ + + PIP Q L + ++ VP +Y + IN Y PG I HID +
Sbjct: 173 GSNKVNLDKPITPIPEEYQFLQVL-FKKYHNVPYDY-DQLTINHYLPGQGIPPHIDTHSV 230
Query: 74 FDRPIISLSLFSDSALCF 91
F+ I+SLSL S + F
Sbjct: 231 FEDSILSLSLGSAYVMNF 248
>gi|432100256|gb|ELK29031.1| Alkylated DNA repair protein alkB like protein 8 [Myotis davidii]
Length = 705
Score = 36.2 bits (82), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEV---DPIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E LY V P+P + + + LE+ + +I + +N Y+PG
Sbjct: 177 KHFGYEFLYENNNVHKDKPLPGGLPDICDSMLEKW--LKEGFIRHKPDQLTVNQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +H+D F+ IISLSL S+ + F
Sbjct: 235 IPAHVDTHSAFEDEIISLSLGSEIVMDF 262
>gi|115389510|ref|XP_001212260.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194656|gb|EAU36356.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 243
Score = 36.2 bits (82), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 17/48 (35%), Positives = 29/48 (60%), Gaps = 4/48 (8%)
Query: 26 PIPAWIQS-LVITKLEQMGV---VPPNYINSAVINDYQPGGCIVSHID 69
P+PAW+ S ++I + + +G+ P N ++N+YQPG I+ H D
Sbjct: 69 PLPAWLVSPIIIPRFDALGIFADAPHGAPNHVLVNEYQPGQGIMPHED 116
>gi|335294821|ref|XP_003357321.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Sus
scrofa]
Length = 665
Score = 36.2 bits (82), Expect = 9.9, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)
Query: 11 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 63
K G E Y VD P+P + + + LE+ + +I + +N Y+PG
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSILEKW--LKEGFIKHKPDQLTVNQYEPGHG 234
Query: 64 IVSHIDPPHIFDRPIISLSLFSDSALCF 91
I +HID F+ IISLSL S+ + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDF 262
>gi|410971941|ref|XP_003992419.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Felis
catus]
Length = 625
Score = 36.2 bits (82), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 48/102 (47%), Gaps = 17/102 (16%)
Query: 5 GSQLVRKGLGNERL--------YPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI--- 50
G+Q V+K L + R+ Y VD P+P + + LE+ + YI
Sbjct: 124 GNQNVQKSLKHRRVKHFGYEFHYENNNVDRGKPLPGGLPDICDGILEKW--LKKGYIKHK 181
Query: 51 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 91
+ +N Y+PG I +HID F+ I+SLSL S+ + F
Sbjct: 182 PDQLTVNQYEPGHGIPAHIDTHSAFEDEIVSLSLGSEVVMDF 223
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.137 0.418
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,850,063,292
Number of Sequences: 23463169
Number of extensions: 162966813
Number of successful extensions: 794801
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 586
Number of HSP's successfully gapped in prelim test: 442
Number of HSP's that attempted gapping in prelim test: 738713
Number of HSP's gapped (non-prelim): 49582
length of query: 227
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 90
effective length of database: 9,144,741,214
effective search space: 823026709260
effective search space used: 823026709260
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)