BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy7738
         (307 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|327287260|ref|XP_003228347.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Anolis carolinensis]
          Length = 379

 Score =  313 bits (802), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 150/238 (63%), Positives = 179/238 (75%), Gaps = 1/238 (0%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +  S P D  + +    E  QK+  G+ Q+ LFS EEC  +E  IDEV S AE+
Sbjct: 42  KRKYHEDSESEPSDYEERQREEEEA-QKVKSGIRQLRLFSPEECAKIEARIDEVVSRAEK 100

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G YK  TVDRAPLRNKYFFGEGYTYGSQL R+G G ERLYP GEVD IP W+  LVI KL
Sbjct: 101 GLYKEHTVDRAPLRNKYFFGEGYTYGSQLQRRGPGQERLYPRGEVDTIPEWVHDLVIRKL 160

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 161 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 220

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTL 258
           IRVSEPV +LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL +
Sbjct: 221 IRVSEPVFFLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRLEM 278


>gi|118097886|ref|XP_001233463.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Gallus gallus]
          Length = 374

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 143/210 (68%), Positives = 169/210 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS EEC  +E  ID+V S AE+G YK  TVDRAPLRNKYFFGEGYTYG
Sbjct: 61  RKVKSGIRQLRLFSAEECAKIEARIDDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 120

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           SQL R+G G ERLYP GEVD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 121 SQLQRRGPGQERLYPRGEVDAIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 180

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L  +AA+
Sbjct: 181 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLSGYAAD 240

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 241 EITHCIRPQDIKERRAVIILRKTRLDAPRL 270


>gi|56118909|ref|NP_001008038.1| RNA demethylase ALKBH5 [Xenopus (Silurana) tropicalis]
 gi|82181400|sp|Q66JG8.1|ALKB5_XENTR RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
 gi|51704081|gb|AAH80920.1| MGC79570 protein [Xenopus (Silurana) tropicalis]
          Length = 358

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 140/212 (66%), Positives = 171/212 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS +EC  +E  IDEV S AE+G Y+  TVDRAPLRNKYFFGEGYTYG
Sbjct: 49  RKVRSGIRQVRLFSPDECARIEAKIDEVVSRAEKGLYREHTVDRAPLRNKYFFGEGYTYG 108

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL R+G G ERLYP+GEVD IPAW+  LVI +L +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 109 AQLQRRGPGQERLYPKGEVDEIPAWVNELVIRRLVEHRVIPEGFVNSAVINDYQPGGCIV 168

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPVQRG VT+L  +AA+
Sbjct: 169 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVQRGSVTVLSGYAAD 228

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRLTL 258
            ITHC+RPQD + RRAV++LR+    APRL +
Sbjct: 229 EITHCIRPQDIKERRAVVILRKTRTEAPRLEM 260


>gi|148231788|ref|NP_001085704.1| RNA demethylase ALKBH5 [Xenopus laevis]
 gi|82201093|sp|Q6GPB5.1|ALKB5_XENLA RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
 gi|49118231|gb|AAH73226.1| MGC80552 protein [Xenopus laevis]
          Length = 360

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 138/212 (65%), Positives = 171/212 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC A+E  IDEV S A++G Y+  TVDRAPLRNKYFFGEGYTYG
Sbjct: 51  RKVRSGIRQMRLFSPDECAAIESKIDEVVSRADKGLYQEHTVDRAPLRNKYFFGEGYTYG 110

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL R+G G ERLYP+GEVD IP W+  LVI +L +  ++P  ++NSAVINDYQPGGCIV
Sbjct: 111 AQLQRRGPGQERLYPKGEVDEIPGWVHELVIRRLVERRIIPEGFVNSAVINDYQPGGCIV 170

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L  +AA+
Sbjct: 171 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLSGYAAD 230

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRLTL 258
            ITHC+RPQD + RRAV++LR+    APRL +
Sbjct: 231 EITHCIRPQDIKERRAVVILRKTRTEAPRLEM 262


>gi|387015598|gb|AFJ49918.1| alkylated DNA repair protein alkB-like 5 protein [Crotalus
           adamanteus]
          Length = 389

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 142/210 (67%), Positives = 169/210 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS EEC+ +E  ID+V S AE+G YK  TVDRAPLRNKYFFGEGYTYG
Sbjct: 75  RKVKSGIRQLRLFSPEECVKIEARIDDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 134

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           SQL R+G G ERLYP GEVD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 135 SQLQRRGPGQERLYPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 194

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPV +LPV+RG VT+L  +AA+
Sbjct: 195 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVFFLPVRRGSVTVLSGYAAD 254

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 255 EITHCIRPQDIKERRAVIILRKTRVDAPRL 284


>gi|224070277|ref|XP_002188303.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Taeniopygia guttata]
          Length = 383

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 142/210 (67%), Positives = 169/210 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS EEC  +E  I++V S AE+G YK  TVDRAPLRNKYFFGEGYTYG
Sbjct: 70  RKVKSGIRQLRLFSAEECAKIEARIEDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 129

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           SQL R+G G ERLYP GEVD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 130 SQLQRRGPGQERLYPRGEVDAIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 189

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L  +AA+
Sbjct: 190 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLSGYAAD 249

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 250 EITHCIRPQDIKERRAVIILRKTRLDAPRL 279


>gi|348537052|ref|XP_003456009.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Oreochromis niloticus]
          Length = 357

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 150/266 (56%), Positives = 191/266 (71%), Gaps = 13/266 (4%)

Query: 9   YKCQVVEQNLKLKNRYSRNQRSSPYDIGQSKHHARELRQ----KILQGVNQMSLFSHEEC 64
           YK + V++    K+R  R  R S  D  +    + +LR+    ++   + Q S+F+ EEC
Sbjct: 23  YKNKCVDETSNGKHR-KRKYRESDDDEYEHSDDSTDLREQEARRVRSSIQQKSIFTPEEC 81

Query: 65  LALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGE 124
             +E+ IDEV + AE G Y+  TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLY +GE
Sbjct: 82  ARIEKKIDEVVAHAEAGLYREHTVDRAPLRNKYFFGEGYTYGAQLEKRGPGQERLYRKGE 141

Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           VD IP+W+  LVI +L   GV+P  ++NSAVINDYQPGGCIVSH+DP HIFDRPI+S+S 
Sbjct: 142 VDEIPSWVYELVINRLVTNGVIPEGFVNSAVINDYQPGGCIVSHVDPLHIFDRPIVSVSF 201

Query: 185 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVI 244
           FSDSALCFGC+F FKPIRVSEPV  LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI
Sbjct: 202 FSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLSGYAADDITHCIRPQDIKERRAVI 261

Query: 245 LLRRVLPHAPRL--------TLSQTP 262
           +LR+  P APRL        +L+Q P
Sbjct: 262 ILRKTRPDAPRLGSDSSLRASLAQKP 287


>gi|344298054|ref|XP_003420709.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Loxodonta africana]
          Length = 394

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 141/210 (67%), Positives = 168/210 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL + GV+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHGVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|395514234|ref|XP_003761324.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Sarcophilus harrisii]
          Length = 384

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 146/236 (61%), Positives = 177/236 (75%), Gaps = 3/236 (1%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +  S   D  + K    E  +K+  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 50  KRKYHEDSDSERSDYEEQKE---EEARKVKSGIRQIRLFSQDECAKIEARIDEVVSRAEK 106

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYGSQL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 107 GLYNEHTVDRAPLRNKYFFGEGYTYGSQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKL 166

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 167 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 226

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL+LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 227 IRVSEPVLFLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 282


>gi|126333790|ref|XP_001364359.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Monodelphis domestica]
          Length = 384

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 146/236 (61%), Positives = 177/236 (75%), Gaps = 3/236 (1%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +  S   D  + K    E  +K+  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 50  KRKYHEDSDSDRSDYEEQKE---EEARKVKSGIRQIRLFSQDECAKIEARIDEVVSRAEK 106

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYGSQL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 107 GLYNEHTVDRAPLRNKYFFGEGYTYGSQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKL 166

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 167 VENRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 226

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL+LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 227 IRVSEPVLFLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 282


>gi|73956209|ref|XP_851719.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Canis lupus familiaris]
          Length = 394

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|148539642|ref|NP_060228.3| RNA demethylase ALKBH5 [Homo sapiens]
 gi|302565646|ref|NP_001180917.1| probable alpha-ketoglutarate-dependent dioxygenase ABH5 [Macaca
           mulatta]
 gi|397476905|ref|XP_003809831.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Pan paniscus]
 gi|402898955|ref|XP_003912472.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Papio anubis]
 gi|408359959|sp|Q6P6C2.2|ALKB5_HUMAN RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
 gi|119576065|gb|EAW55661.1| alkB, alkylation repair homolog 5 (E. coli), isoform CRA_a [Homo
           sapiens]
 gi|119576066|gb|EAW55662.1| alkB, alkylation repair homolog 5 (E. coli), isoform CRA_a [Homo
           sapiens]
 gi|387542580|gb|AFJ71917.1| putative alpha-ketoglutarate-dependent dioxygenase ABH5 [Macaca
           mulatta]
 gi|410221108|gb|JAA07773.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
 gi|410251666|gb|JAA13800.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
 gi|410300496|gb|JAA28848.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
 gi|410335931|gb|JAA36912.1| alkB, alkylation repair homolog 5 [Pan troglodytes]
          Length = 394

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|410980043|ref|XP_003996390.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Felis catus]
          Length = 394

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|395748658|ref|XP_003778808.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Pongo abelii]
          Length = 394

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|390462888|ref|XP_003732930.1| PREDICTED: LOW QUALITY PROTEIN: probable
           alpha-ketoglutarate-dependent dioxygenase ABH5
           [Callithrix jacchus]
          Length = 392

 Score =  306 bits (783), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 81  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 140

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 141 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEYRVIPEGFVNSAVINDYQPGGCIV 200

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 201 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 260

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 261 EITHCIRPQDIKERRAVIILRKTRLDAPRL 290


>gi|30354744|gb|AAH52076.1| AlkB, alkylation repair homolog 5 (E. coli) [Mus musculus]
          Length = 395

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 143/236 (60%), Positives = 175/236 (74%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +      D  + +    E  +K+  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 58  KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPKGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293


>gi|441642261|ref|XP_004090430.1| PREDICTED: LOW QUALITY PROTEIN: probable
           alpha-ketoglutarate-dependent dioxygenase ABH5 [Nomascus
           leucogenys]
          Length = 394

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSAYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|31044423|ref|NP_766531.2| RNA demethylase ALKBH5 [Mus musculus]
 gi|114151682|sp|Q3TSG4.2|ALKB5_MOUSE RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
 gi|26343643|dbj|BAC35478.1| unnamed protein product [Mus musculus]
 gi|74194466|dbj|BAE37281.1| unnamed protein product [Mus musculus]
          Length = 395

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 143/236 (60%), Positives = 175/236 (74%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +      D  + +    E  +K+  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 58  KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293


>gi|347300461|ref|NP_001178572.1| RNA demethylase ALKBH5 [Rattus norvegicus]
 gi|449061781|sp|D3ZKD3.1|ALKB5_RAT RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
          Length = 395

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 143/236 (60%), Positives = 175/236 (74%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +      D  + +    E  +K+  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 58  KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKL 177

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293


>gi|332848357|ref|XP_001158112.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Pan troglodytes]
          Length = 394

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|329664868|ref|NP_001192446.1| RNA demethylase ALKBH5 [Bos taurus]
 gi|449061780|sp|E1BH29.1|ALKB5_BOVIN RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
 gi|296476634|tpg|DAA18749.1| TPA: hypothetical protein BOS_18839 [Bos taurus]
          Length = 394

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|449281378|gb|EMC88458.1| Alkylated DNA repair protein alkB like protein 5, partial [Columba
           livia]
          Length = 349

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 141/210 (67%), Positives = 168/210 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS EEC  +E  I++V S AE+G YK  TVDRAPLRNKYFFGEGYTYG
Sbjct: 36  RKVKSGIRQLRLFSAEECAKIEARIEDVVSRAEKGLYKEHTVDRAPLRNKYFFGEGYTYG 95

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           SQL R+G G ERL P GEVD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 96  SQLQRRGPGQERLLPRGEVDTIPEWVHDLVIRKLVEHRVIPEGFVNSAVINDYQPGGCIV 155

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL+LPV+RG VT+L  +AA+
Sbjct: 156 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLFLPVKRGSVTVLSGYAAD 215

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 216 EITHCIRPQDIKERRAVIILRKTRLDAPRL 245


>gi|426349222|ref|XP_004042212.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Gorilla gorilla gorilla]
 gi|7020317|dbj|BAA91078.1| unnamed protein product [Homo sapiens]
          Length = 378

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 67  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 126

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 127 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 186

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 187 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 246

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 247 EITHCIRPQDIKERRAVIILRKTRLDAPRL 276


>gi|26343649|dbj|BAC35481.1| unnamed protein product [Mus musculus]
          Length = 395

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 142/236 (60%), Positives = 176/236 (74%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +  S   D  + +    E  +++  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 58  KRKYQEDSDSERSDYEEHQLQKEEEARQVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCI+SH+DP HIF+RPI+S+S FSDSALCFGCKF FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCILSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKP 237

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293


>gi|355668039|gb|AER94060.1| alkB, alkylation repair-like protein 5 [Mustela putorius furo]
          Length = 340

 Score =  305 bits (780), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 29  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 88

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 89  AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 148

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 149 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 208

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 209 EITHCIRPQDIKERRAVIILRKTRLDAPRL 238


>gi|380800137|gb|AFE71944.1| putative alpha-ketoglutarate-dependent dioxygenase ABH5, partial
           [Macaca mulatta]
          Length = 358

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 47  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 106

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 107 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 166

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 167 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 226

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 227 EITHCIRPQDIKERRAVIILRKTRLDAPRL 256


>gi|116517268|ref|NP_001070855.1| RNA demethylase ALKBH5 [Danio rerio]
 gi|123913930|sp|Q08BA6.1|ALKB5_DANRE RecName: Full=RNA demethylase ALKBH5; AltName: Full=Alkylated DNA
           repair protein alkB homolog 5; AltName:
           Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 5
 gi|115527853|gb|AAI24805.1| AlkB, alkylation repair homolog 5 (E. coli) [Danio rerio]
          Length = 352

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 141/242 (58%), Positives = 179/242 (73%)

Query: 15  EQNLKLKNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEV 74
           ++  +  N   R  R S  D  + +       +++  G+ Q S+F+ EEC  +E  IDEV
Sbjct: 19  DKVFEYSNGEKRKYRESDDDESEYEERRDAEARRVKSGIKQASIFTLEECARIEAKIDEV 78

Query: 75  SSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQS 134
            + A++G Y+  TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLY +GEVD IP W+  
Sbjct: 79  VAKADKGLYREHTVDRAPLRNKYFFGEGYTYGAQLEKRGPGQERLYSKGEVDDIPDWVHE 138

Query: 135 LVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGC 194
           LVI +L   GV+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGC
Sbjct: 139 LVIDRLVTHGVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGC 198

Query: 195 KFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAP 254
           KF FKPIRVSEPVL+LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    AP
Sbjct: 199 KFLFKPIRVSEPVLHLPVRRGSVTVLSGYAADDITHCIRPQDIKERRAVIILRKTRADAP 258

Query: 255 RL 256
           RL
Sbjct: 259 RL 260


>gi|417410040|gb|JAA51501.1| Putative alkylated dna repair protein alkb log, partial [Desmodus
           rotundus]
          Length = 359

 Score =  304 bits (778), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 48  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 107

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 108 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 167

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 168 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 227

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 228 EITHCIRPQDIKERRAVIILRKTRLDAPRL 257


>gi|348560293|ref|XP_003465948.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Cavia porcellus]
          Length = 394

 Score =  304 bits (778), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 139/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQIRLFSQDECTKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|194375187|dbj|BAG62706.1| unnamed protein product [Homo sapiens]
          Length = 383

 Score =  303 bits (776), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 72  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 131

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 132 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 191

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 192 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 251

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 252 EITHCIRPQDIKERRAVIILRKTRLDAPRL 281


>gi|351707716|gb|EHB10635.1| Alkylated DNA repair protein alkB-like protein 5, partial
           [Heterocephalus glaber]
          Length = 349

 Score =  303 bits (776), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 139/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ Q+ LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 39  RKVKSGIRQIRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 98

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 99  AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 158

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 159 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 218

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 219 EITHCIRPQDIKERRAVIILRKTRLDAPRL 248


>gi|390345597|ref|XP_784548.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Strongylocentrotus purpuratus]
          Length = 462

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 140/220 (63%), Positives = 168/220 (76%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +KI  GV QM LF+ EEC  +E  I+EV  +AE+G Y+  TVDRAPLRNKYFFGEGYTYG
Sbjct: 65  KKIHSGVAQMQLFTEEECSEIEDKINEVVDIAEKGLYRDHTVDRAPLRNKYFFGEGYTYG 124

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           SQL +KG G ERLYP GEVD IP W+Q LV++++ + G+VP  +INSAVINDYQPGGCIV
Sbjct: 125 SQLSKKGPGQERLYPPGEVDDIPQWVQDLVVSRVVEAGLVPHGFINSAVINDYQPGGCIV 184

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIFDRPIIS+S  SD AL FGCKF+F+PIRVS+P   +P+ RGCV  L  +AA+
Sbjct: 185 SHVDPIHIFDRPIISVSFLSDCALSFGCKFSFRPIRVSKPKACIPLPRGCVMTLSGYAAD 244

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKP 266
            +THC+RPQD   RRAVI+LRRV P APRL     P   P
Sbjct: 245 EVTHCIRPQDVTARRAVIILRRVFPEAPRLEPVGLPSPAP 284


>gi|38383101|gb|AAH62339.1| AlkB, alkylation repair homolog 5 (E. coli) [Homo sapiens]
          Length = 458

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 83  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 142

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 143 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 202

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 203 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 262

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 263 EITHCIRPQDIKERRAVIILRKTRLDAPRL 292


>gi|355568306|gb|EHH24587.1| hypothetical protein EGK_08266, partial [Macaca mulatta]
          Length = 384

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 28  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 87

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 88  AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 147

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 148 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 207

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 208 EITHCIRPQDIKERRAVIILRKTRLDAPRL 237


>gi|355753817|gb|EHH57782.1| hypothetical protein EGM_07489, partial [Macaca fascicularis]
          Length = 383

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 140/210 (66%), Positives = 167/210 (79%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 28  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 87

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 88  AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 147

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 148 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 207

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 208 EITHCIRPQDIKERRAVIILRKTRLDAPRL 237


>gi|47218956|emb|CAF98154.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 355

 Score =  300 bits (769), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 140/215 (65%), Positives = 168/215 (78%), Gaps = 4/215 (1%)

Query: 42  ARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
           AR +R  ILQ     S+F+ E+C A+E  ID+V + AE G Y+  TVDRAPLRNKYFFGE
Sbjct: 60  ARLVRNSILQ----RSVFTPEQCAAIEEKIDDVVAQAEAGLYREHTVDRAPLRNKYFFGE 115

Query: 102 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 161
           GYTYGSQL ++G G ERLY +GEVD IP+W+  LVI  L   GV+P  ++NSAVINDYQP
Sbjct: 116 GYTYGSQLEKRGPGQERLYRKGEVDEIPSWVHELVIKPLVSSGVIPEGFVNSAVINDYQP 175

Query: 162 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 221
           GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV  LPV+RG VT+L 
Sbjct: 176 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLS 235

Query: 222 DFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            +AA+ ITHC+RPQD + RRAVI+LR+  P APRL
Sbjct: 236 GYAADSITHCIRPQDIKERRAVIILRKTRPDAPRL 270


>gi|74187512|dbj|BAE36711.1| unnamed protein product [Mus musculus]
          Length = 395

 Score =  300 bits (769), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 141/236 (59%), Positives = 174/236 (73%)

Query: 21  KNRYSRNQRSSPYDIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQ 80
           K +Y  +      D  + +    E  +K+  G+ Q+ LFS +EC  +E  IDEV S AE+
Sbjct: 58  KRKYQEDSDPERSDYEEHQLQKEEEARKVKSGIRQIRLFSQDECSKIEARIDEVVSRAEK 117

Query: 81  GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKL 140
           G Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+  LVI KL
Sbjct: 118 GLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPDWVHQLVIQKL 177

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKP 200
            +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFG +F FKP
Sbjct: 178 VEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGWQFQFKP 237

Query: 201 IRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           IRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 238 IRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 293


>gi|410917752|ref|XP_003972350.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Takifugu rubripes]
          Length = 361

 Score =  300 bits (769), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 142/215 (66%), Positives = 167/215 (77%), Gaps = 4/215 (1%)

Query: 42  ARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
           AR +R  ILQ     S+FS E+C A+E  ID+V + AE G Y+  TVDRAPLRNKYFFGE
Sbjct: 65  ARLVRDSILQ----KSIFSPEQCAAIEEKIDDVVAQAEAGLYREHTVDRAPLRNKYFFGE 120

Query: 102 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQP 161
           GYTYGSQL ++G G ERLY +GEVD IP W+  LVI  L   GVVP  ++NSAVINDYQP
Sbjct: 121 GYTYGSQLEKRGPGQERLYRKGEVDEIPNWVHELVIKPLVSNGVVPDGFVNSAVINDYQP 180

Query: 162 GGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR 221
           GGCIVSH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV  LPV+RG VT+L 
Sbjct: 181 GGCIVSHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVKRGSVTVLS 240

Query: 222 DFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            +AA+ ITHC+RPQD + RRAVI+LR+  P APRL
Sbjct: 241 GYAADDITHCIRPQDIKERRAVIILRKTRPDAPRL 275


>gi|432847440|ref|XP_004066024.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Oryzias latipes]
          Length = 361

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 134/210 (63%), Positives = 164/210 (78%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +++   + Q S+FS EEC  +E  IDEV +  E G Y+  TVDRAPLRNKYFFGEGYTYG
Sbjct: 67  RRVSSSIQQRSIFSSEECARIEEKIDEVVANGEAGLYREHTVDRAPLRNKYFFGEGYTYG 126

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLY +GEVD IP+W+  LVI  L   GV+P  ++NSAVINDYQPGGCIV
Sbjct: 127 AQLEKRGPGQERLYRKGEVDDIPSWVHELVIKPLVSNGVIPEGFVNSAVINDYQPGGCIV 186

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
           SH+DP HIF RPI+S+S FSDSALCFGC+F FKPIRVSEPV  LPV+RG VT+L  +AA+
Sbjct: 187 SHVDPLHIFARPIVSVSFFSDSALCFGCRFQFKPIRVSEPVFVLPVRRGSVTVLSGYAAD 246

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+  P APR+
Sbjct: 247 EITHCIRPQDIKQRRAVIILRKTRPDAPRV 276


>gi|431914502|gb|ELK15752.1| Alkylated DNA repair protein alkB like protein 5 [Pteropus alecto]
          Length = 303

 Score =  295 bits (756), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 137/201 (68%), Positives = 161/201 (80%)

Query: 56  MSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLG 115
           M LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G
Sbjct: 1   MRLFSQDECTKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPG 60

Query: 116 NERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIF 175
            ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIVSH+DP HIF
Sbjct: 61  QERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIF 120

Query: 176 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQ 235
           +RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQ
Sbjct: 121 ERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQ 180

Query: 236 DTQHRRAVILLRRVLPHAPRL 256
           D + RRAVI+LR+    APRL
Sbjct: 181 DIKERRAVIILRKTRLDAPRL 201


>gi|291224533|ref|XP_002732258.1| PREDICTED: alkB, alkylation repair homolog 5-like [Saccoglossus
           kowalevskii]
          Length = 362

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 135/214 (63%), Positives = 163/214 (76%), Gaps = 2/214 (0%)

Query: 44  ELRQKILQGVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEG 102
           ELR  +  G+ Q   +FS EEC  +E  ID V + A++  +K  TVDRAPLRNKYFFGEG
Sbjct: 57  ELRL-VHSGIKQRRDVFSAEECRIIEEKIDSVVADADRSIFKKHTVDRAPLRNKYFFGEG 115

Query: 103 YTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPG 162
           YTYG+QL +KG G ERLYP GEVD IP W+Q LVI +L    ++P  ++NSAVINDY+PG
Sbjct: 116 YTYGTQLDKKGPGQERLYPRGEVDDIPGWVQELVIKRLVSSKMIPDGFVNSAVINDYKPG 175

Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD 222
           GCIVSH+DP HIF+RPI S+S  SD ALCFGCKF+FKPIR S P+L LP+ RGCVTLL  
Sbjct: 176 GCIVSHVDPIHIFERPIASVSFMSDCALCFGCKFSFKPIRTSRPLLSLPLTRGCVTLLSG 235

Query: 223 FAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           +AAN ITHC+RPQD + RRAVI+LR+V   APRL
Sbjct: 236 YAANDITHCIRPQDVKARRAVIILRKVRDDAPRL 269


>gi|260817481|ref|XP_002603615.1| hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]
 gi|229288935|gb|EEN59626.1| hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]
          Length = 314

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 151/254 (59%), Positives = 188/254 (74%), Gaps = 3/254 (1%)

Query: 39  KHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF 98
           K    ELR +I QG+ Q  LFSHEEC  +E  I+E+   A++G YK  TVDRAPLR KYF
Sbjct: 63  KDRQDELR-RIHQGIRQRILFSHEECEKIEEKIEEIILRADRGYYKEHTVDRAPLRVKYF 121

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           FGEGYTYGSQL RKG G ERLY +G+VD IP WI  +V+++L +  +VP  ++NSAVIN 
Sbjct: 122 FGEGYTYGSQLERKGPGMERLYSKGDVDDIPDWIFKMVVSRLVEAKIVPEGFVNSAVINC 181

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVT 218
           Y PGGCIVSHIDPPHIF RPI+S+S FSDSALCFGCKF+FKPIRVS+PVL LPV RGC T
Sbjct: 182 YLPGGCIVSHIDPPHIFARPIVSVSFFSDSALCFGCKFSFKPIRVSKPVLCLPVARGCCT 241

Query: 219 LLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARES 278
           +L D+AA+ ITHCVRPQD +HRRAVI+LRRV   APR  L+ T  V+  H  +D +  ++
Sbjct: 242 VLSDYAADEITHCVRPQDIKHRRAVIILRRVRDDAPR--LAPTTEVQMVHVNHDRREEDN 299

Query: 279 GGYFRSSIETYNNN 292
                  + T++++
Sbjct: 300 RTSTDDRLSTFHSD 313


>gi|156381190|ref|XP_001632149.1| predicted protein [Nematostella vectensis]
 gi|156219200|gb|EDO40086.1| predicted protein [Nematostella vectensis]
          Length = 256

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 139/240 (57%), Positives = 173/240 (72%), Gaps = 4/240 (1%)

Query: 12  QVVEQNLKLKNRYSRNQRSSP---YDIGQSKHHARELRQKILQGVNQMSLFSHEECLALE 68
           Q  ++N KL+ +  + +RS+    Y+   S     EL Q++ QG+ Q  LF   EC  +E
Sbjct: 18  QNTDRNRKLQQQNRKRRRSASKVVYEKHSSYEQGLEL-QRVHQGIWQRVLFDKVECEEIE 76

Query: 69  RNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPI 128
             I++V   A++G YK  TVDRAPLR KYFFGEGYTYG QL  +G G+ERLY  G+VD I
Sbjct: 77  EKINDVVVEADRGVYKDRTVDRAPLRVKYFFGEGYTYGKQLEERGPGSERLYARGDVDDI 136

Query: 129 PAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDS 188
           P WI  LVI++LE+ G+VP ++INSAVINDYQPGGCIVSHIDP HIFDRPI+S S FS  
Sbjct: 137 PEWIYELVISRLEKAGIVPKDFINSAVINDYQPGGCIVSHIDPMHIFDRPIVSCSFFSSC 196

Query: 189 ALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRR 248
            L FGCKF+FKPIR ++P+L LP+ RGCVT L  +AA+ ITHCVRPQD   RRAVI+LRR
Sbjct: 197 TLSFGCKFSFKPIRTTDPILSLPLPRGCVTALSGYAADEITHCVRPQDVVTRRAVIILRR 256


>gi|198418993|ref|XP_002122576.1| PREDICTED: similar to alkB, alkylation repair homolog 5, partial
           [Ciona intestinalis]
          Length = 308

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 126/247 (51%), Positives = 172/247 (69%), Gaps = 1/247 (0%)

Query: 53  VNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRK 112
           ++Q  LF+ EEC  +ER I++V    E+  Y   TVDRAPLR KYFFGEGYTYG Q+ +K
Sbjct: 1   ISQKRLFNDEECDEIERCINDVVRNGEKNMYNEHTVDRAPLRVKYFFGEGYTYGKQMNQK 60

Query: 113 GLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPP 172
           G G ERLY +G VDPIP+W+   ++  LE  G++   +I SAVINDY PGGCIVSH+DP 
Sbjct: 61  GPGQERLYQKGVVDPIPSWVFKKLVKPLEDCGMIKKKWITSAVINDYYPGGCIVSHVDPA 120

Query: 173 HIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
           HIF+RPI+S+S FS +AL FGCKF+FKPIRVS+PV+ LP+ RG  T +  +AA+ ITHC+
Sbjct: 121 HIFERPIVSVSFFSTAALSFGCKFSFKPIRVSDPVVCLPIPRGVATSISGYAADSITHCI 180

Query: 233 RPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARESGGYFRSSIETYNNN 292
           RPQD + RRAVI+LRR   +APR     +  V  + +L + + R S    R S++++ + 
Sbjct: 181 RPQDIRQRRAVIILRRTFDNAPRQG-EDSSLVSHHRTLNNHEDRRSSNRHRLSLDSHRDR 239

Query: 293 ENHSNKN 299
             H+ ++
Sbjct: 240 RKHTRRS 246


>gi|194217772|ref|XP_001918245.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like, partial [Equus caballus]
          Length = 277

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 122/174 (70%), Positives = 143/174 (82%)

Query: 83  YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQ 142
           Y   TVDRAPLRNKYFFGEGYTYG+QL ++G G ERLYP G+VD IP W+  LVI KL +
Sbjct: 2   YNEHTVDRAPLRNKYFFGEGYTYGAQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVE 61

Query: 143 MGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIR 202
             V+P  ++NSAVINDYQPGGCIVSH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIR
Sbjct: 62  HRVIPEGFVNSAVINDYQPGGCIVSHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIR 121

Query: 203 VSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           VSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 122 VSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 175


>gi|405958373|gb|EKC24507.1| Alkylated DNA repair protein alkB-like protein 5 [Crassostrea
           gigas]
          Length = 347

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 122/190 (64%), Positives = 147/190 (77%), Gaps = 1/190 (0%)

Query: 34  DIGQSKHHARELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPL 93
           D+  S  +A EL  K+  G+ Q  LF+ ++C  +E  I+E   +A++G+YK  TVDRAPL
Sbjct: 49  DLDASWSNADEL-SKVHAGIKQRRLFNDDQCEKIEEKINETVKIADKGEYKSHTVDRAPL 107

Query: 94  RNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINS 153
           RNKYFFGEGYTYGSQL +KG G ERLYP+GEVD IP WI  LVI  L    +VP  ++NS
Sbjct: 108 RNKYFFGEGYTYGSQLEKKGPGMERLYPKGEVDEIPDWIHELVIKPLYDAKIVPEGFVNS 167

Query: 154 AVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQ 213
           AVINDY PGGCIVSHIDPPHIFDRPI+S+S FSDSALCFGCKF+F+PIRVS PVL LP+ 
Sbjct: 168 AVINDYMPGGCIVSHIDPPHIFDRPIVSVSFFSDSALCFGCKFSFRPIRVSTPVLCLPID 227

Query: 214 RGCVTLLRDF 223
           RGCVT++R F
Sbjct: 228 RGCVTMIRVF 237


>gi|403275437|ref|XP_003945296.1| PREDICTED: LOW QUALITY PROTEIN: probable
           alpha-ketoglutarate-dependent dioxygenase ABH5 [Saimiri
           boliviensis boliviensis]
          Length = 386

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 119/174 (68%), Positives = 140/174 (80%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 72  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 131

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI KL +  V+P  ++NSAVINDYQPGGCIV
Sbjct: 132 AQLQKRGPGQERLYPPGDVDEIPEWVHQLVIQKLVEHRVIPEGFVNSAVINDYQPGGCIV 191

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
           SH+DP HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L
Sbjct: 192 SHVDPIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVL 245


>gi|221119302|ref|XP_002156519.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Hydra magnipapillata]
          Length = 329

 Score =  256 bits (655), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 123/211 (58%), Positives = 149/211 (70%), Gaps = 1/211 (0%)

Query: 47  QKILQGVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTY 105
           Q I +G+ Q  + F    C  +E+ ID +   A++G Y P T D APLRNKYFFGEGYTY
Sbjct: 77  QLIHEGIFQKRNYFDDSMCRKIEKKIDGIVEKAKRGLYLPNTFDSAPLRNKYFFGEGYTY 136

Query: 106 GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           G  +  KG G ERL+ +GEVD IP WIQ  VI KL    VVP  +INS VIN+Y PGGCI
Sbjct: 137 GKHMENKGPGQERLFKKGEVDKIPNWIQKHVIKKLYDDKVVPEGFINSVVINEYFPGGCI 196

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAA 225
           VSHIDP HIFDRPIIS++  + S L FGCKF F PIR SEPVL LP+ RGC+T++  +AA
Sbjct: 197 VSHIDPIHIFDRPIISINFNTRSFLSFGCKFTFNPIRTSEPVLSLPMDRGCLTMISGYAA 256

Query: 226 NGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           + ITHC+RP D   RR VI+LRRV P+APR+
Sbjct: 257 DNITHCIRPCDITERRCVIILRRVFPNAPRV 287


>gi|443714465|gb|ELU06866.1| hypothetical protein CAPTEDRAFT_35874, partial [Capitella teleta]
          Length = 230

 Score =  250 bits (638), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 129/204 (63%), Positives = 154/204 (75%), Gaps = 3/204 (1%)

Query: 48  KILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGS 107
           KI  G+ Q  LF+   C  +E  I+EV   A  G+YK  TVD APLRNKYFFGEGYTYGS
Sbjct: 27  KIRTGIKQRCLFNDSACKEIEAKIEEVVRKAANGEYKKNTVDTAPLRNKYFFGEGYTYGS 86

Query: 108 QLVRKGLGNERLYP-EGE--VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 164
            +  +G G ERLYP EGE  VDPIP WI  +V+  L +  ++PP++INSAVINDYQPGGC
Sbjct: 87  HMEARGPGMERLYPKEGEESVDPIPEWIHEMVVQPLLRAQLIPPDFINSAVINDYQPGGC 146

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
           IVSHIDP HIFDRPI+++S FS SAL FGCKF F+PIRVS PVL + + RG VTL+  FA
Sbjct: 147 IVSHIDPYHIFDRPIVTVSFFSSSALSFGCKFEFRPIRVSNPVLSVKLPRGGVTLISGFA 206

Query: 225 ANGITHCVRPQDTQHRRAVILLRR 248
           A+ ITHC+RPQDT HRRAV++LRR
Sbjct: 207 ADEITHCIRPQDTPHRRAVVILRR 230


>gi|335310671|ref|XP_003362141.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Sus scrofa]
          Length = 396

 Score =  237 bits (605), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 119/200 (59%), Positives = 139/200 (69%), Gaps = 14/200 (7%)

Query: 59  FSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
            + +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYF      YG+QL ++G G ER
Sbjct: 106 LAADECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFXXXXXXYGAQLQKRGPGQER 165

Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHI--DPPHIFD 176
           LYP G+VD IP W                     SAVINDYQPGGCIVSH+  DP HIF+
Sbjct: 166 LYPPGDVDEIPEWXXXXXXXX------------XSAVINDYQPGGCIVSHVHVDPIHIFE 213

Query: 177 RPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQD 236
           RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD
Sbjct: 214 RPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQD 273

Query: 237 TQHRRAVILLRRVLPHAPRL 256
            + RRAVI+LR+    APRL
Sbjct: 274 IKERRAVIILRKTRLDAPRL 293


>gi|426238911|ref|XP_004023593.1| PREDICTED: LOW QUALITY PROTEIN: probable
           alpha-ketoglutarate-dependent dioxygenase ABH5, partial
           [Ovis aries]
          Length = 312

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 114/210 (54%), Positives = 136/210 (64%), Gaps = 38/210 (18%)

Query: 47  QKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYG 106
           +K+  G+ QM LFS +EC  +E  IDEV S AE+G Y   TVDRAPLRNKYFFGEGYTYG
Sbjct: 39  RKVKSGIRQMRLFSQDECAKIEARIDEVVSRAEKGLYNEHTVDRAPLRNKYFFGEGYTYG 98

Query: 107 SQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIV 166
           +QL ++G G ERLYP G+VD IP W+  LVI                             
Sbjct: 99  AQLQKRGPGQERLYPPGDVDEIPEWVHQLVI----------------------------- 129

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAAN 226
                    +RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+
Sbjct: 130 ---------ERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAAD 180

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 181 EITHCIRPQDIKERRAVIILRKTRLDAPRL 210


>gi|351709131|gb|EHB12050.1| Alkylated DNA repair protein alkB-like protein 5 [Heterocephalus
           glaber]
          Length = 464

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 113/214 (52%), Positives = 135/214 (63%), Gaps = 34/214 (15%)

Query: 43  RELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEG 102
           +E  +K+   + Q+ LFS EEC  +   IDEV SLAE+G Y   TVDRAPLRNKYFFG+G
Sbjct: 104 KEEARKVKSNIRQIRLFSQEECAKIAARIDEVVSLAEKGLYNEHTVDRAPLRNKYFFGDG 163

Query: 103 YTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPG 162
           YTYG+QL ++  G ERLYP G+VD IP W   LVI KL +  V+P  ++NSAVINDYQPG
Sbjct: 164 YTYGAQLQKRRPGQERLYPPGDVDEIPEWGPRLVIQKLVEHRVIPEGFVNSAVINDYQPG 223

Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD 222
           GCI+S            +S   FSDSALCFGCK  FKPIRVSEPVL LPV+R        
Sbjct: 224 GCIMS------------VS---FSDSALCFGCKLQFKPIRVSEPVLSLPVRRR------- 261

Query: 223 FAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
                       +  Q RRAVI+LR+    APRL
Sbjct: 262 ------------KRHQERRAVIILRKTRLDAPRL 283


>gi|313240619|emb|CBY32944.1| unnamed protein product [Oikopleura dioica]
          Length = 462

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 92/217 (42%), Positives = 130/217 (59%), Gaps = 11/217 (5%)

Query: 46  RQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTY 105
           R++IL G+   S+F  ++   LE +   +     +G+++  ++D + LR KY     +  
Sbjct: 31  RREILDGIKGFSIFDADDVEDLESDCRSMIERGGKGKFRVKSIDISRLRMKY-----FFG 85

Query: 106 GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
                  G G+E+ +   ++ PIP WI   +I K+E+ G+V  N+INS VINDY+PGG I
Sbjct: 86  YGYEYGGGKGSEKFFNPKDISPIPDWIYKNIIEKMEKAGIVEKNWINSVVINDYEPGGFI 145

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF----KPIRV--SEPVLYLPVQRGCVTL 219
           V H DPPH+F RPI  L+LFSDSAL FGC   F    +P+ V  S+PVL LP+QRG +T 
Sbjct: 146 VQHQDPPHLFQRPIFILTLFSDSALSFGCNLRFDRSVEPVEVTASDPVLRLPMQRGIITS 205

Query: 220 LRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
                 N I HCVRP+DT+ RR  I+LRRV  + P L
Sbjct: 206 FEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 242


>gi|313225990|emb|CBY21133.1| unnamed protein product [Oikopleura dioica]
          Length = 462

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 92/217 (42%), Positives = 130/217 (59%), Gaps = 11/217 (5%)

Query: 46  RQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTY 105
           R++IL G+   S+F  ++   LE +   +     +G+++  ++D + LR KY     +  
Sbjct: 31  RREILDGIKGFSIFDADDVEDLESDCRSMIERGGKGKFRVKSIDISRLRMKY-----FFG 85

Query: 106 GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
                  G G+E+ +   ++ PIP WI   +I K+E+ G+V  N+INS VINDY+PGG I
Sbjct: 86  YGYEYGGGKGSEKFFNPKDISPIPDWIYKNIIGKMEKAGIVEKNWINSVVINDYEPGGFI 145

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF----KPIRV--SEPVLYLPVQRGCVTL 219
           V H DPPH+F RPI  L+LFSDSAL FGC   F    +P+ V  S+PVL LP+QRG +T 
Sbjct: 146 VQHQDPPHLFQRPIFILTLFSDSALSFGCNLRFDRSVEPVEVTASDPVLRLPMQRGIITS 205

Query: 220 LRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
                 N I HCVRP+DT+ RR  I+LRRV  + P L
Sbjct: 206 FEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 242


>gi|354467887|ref|XP_003496399.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like, partial [Cricetulus griseus]
          Length = 203

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 74/101 (73%), Positives = 86/101 (85%)

Query: 156 INDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRG 215
           INDYQPGGCIVSH+D  HIF+RPI+S+S FSDSALCFGCKF FKPIRVSEPVL LPV+RG
Sbjct: 1   INDYQPGGCIVSHVDQIHIFERPIVSVSFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRG 60

Query: 216 CVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
            VT+L  +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 61  SVTVLSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 101


>gi|296216728|ref|XP_002754700.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Callithrix jacchus]
          Length = 231

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 53/74 (71%), Positives = 61/74 (82%)

Query: 183 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRA 242
           S FSDSALCFGCKF FKPIRVSEPVL LPV+RG VT+L  +AA+ ITHC+RPQD + RRA
Sbjct: 56  SFFSDSALCFGCKFQFKPIRVSEPVLSLPVRRGSVTVLSGYAADEITHCIRPQDIKERRA 115

Query: 243 VILLRRVLPHAPRL 256
           VI+LR+    APRL
Sbjct: 116 VIILRKTRLDAPRL 129


>gi|118352562|ref|XP_001009552.1| hypothetical protein TTHERM_00371220 [Tetrahymena thermophila]
 gi|89291319|gb|EAR89307.1| hypothetical protein TTHERM_00371220 [Tetrahymena thermophila SB210]
          Length = 1999

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 102/206 (49%), Gaps = 24/206 (11%)

Query: 58   LFSHEECLALERNIDEVSSLAEQGQYKP------CTVDRAPLRNKYFFGEGYTYGSQLVR 111
             F+H+E +++E +  E    + Q  Y P      CT  +   R K+FFG  Y +  Q + 
Sbjct: 1751 FFTHQELMSIENSCYETEIKSFQRHYFPMTAQMTCTAQKI-RRTKFFFGSRYIWTRQQLA 1809

Query: 112  K-----GLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CI 165
            +     G G  +     +V   P+W++  V+  LEQ G++PP +INS  +N Y  G   +
Sbjct: 1810 EPHSFVGAGVRQ-----DVSQPPSWMRRKVVGPLEQAGILPPGFINSLALNVYHDGKEGL 1864

Query: 166  VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--F 223
              H D    F +PI +  LFSD+ L FG +              +P+ RG V  L +  +
Sbjct: 1865 AQHFDDATRFRQPIFTFKLFSDARLSFGSQL----YGFCNSAFVIPMPRGSVLKLEENSY 1920

Query: 224  AANGITHCVRPQDTQHRRAVILLRRV 249
            AANGI HCVRP D   + A I++R++
Sbjct: 1921 AANGIKHCVRPCDMAGKNATIIMRQM 1946


>gi|403365523|gb|EJY82547.1| hypothetical protein OXYTRI_19840 [Oxytricha trifallax]
          Length = 1315

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 112/228 (49%), Gaps = 13/228 (5%)

Query: 35   IGQSKHHARELRQKILQGVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRA-- 91
            +G SK    +  +K+  G+    + +++EE   +E  I++    + Q  Y P T  +   
Sbjct: 844  VGISKDEEIQDVEKLRVGITTFENFYTNEEMKNMEHMIEQTEKKSLQNAYLPMTAQKTFS 903

Query: 92   ---PLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP 148
                 R K+FFG  Y +    + + L N       +V   P W+++++   L   G++  
Sbjct: 904  GNTLKRTKFFFGFRYMWTRTQLLEPLSNVAAGVRADVSQPPVWMKTILEEPLVNAGIIEK 963

Query: 149  NYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
            ++INS  +N Y  G   +  H D    F +PI +L LFSDS L FG ++           
Sbjct: 964  DFINSVALNVYHDGSEGLAQHFDDATRFKQPIYTLRLFSDSRLSFGSQY----YGFCNGA 1019

Query: 208  LYLPVQRGCVTLLRD--FAANGITHCVRPQDTQHRRAVILLRRVLPHA 253
              +P+ RGC+ ++ +  ++ANG+ HC+RP D   + + ++LR++ P+ 
Sbjct: 1020 FTIPLPRGCICVMEEGSYSANGVKHCIRPCDMTGKSSAVILRQMHPNV 1067


>gi|340508382|gb|EGR34097.1| hypothetical protein IMG5_024190 [Ichthyophthirius multifiliis]
          Length = 798

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 94/201 (46%), Gaps = 14/201 (6%)

Query: 58  LFSHEECLALERNIDEVSSLAEQGQYKP------CTVDRAPLRNKYFFGEGYTYGSQLVR 111
            F+HEE + +E    +      Q  Y P      CT ++   R K+FFG  Y +    + 
Sbjct: 557 FFTHEEMIDIENETYKTEIKCFQRHYFPMTAQVTCTAEKIR-RTKFFFGSRYMWTKMQLA 615

Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHID 170
           +           +V   P WI+  V+ KLE   ++P  +INS  +N Y  G   +  H D
Sbjct: 616 EPHSYVGAGVRQDVSETPLWIKKKVVQKLENANILPKKFINSLALNVYHDGNEGLAQHFD 675

Query: 171 PPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGI 228
               F +PI +  LFSD+ L FG +F             +P+ RG +  L +  +AANGI
Sbjct: 676 DATRFMQPIFTFKLFSDARLSFGSQF----YGFCNSAFVIPMPRGSILKLEENSYAANGI 731

Query: 229 THCVRPQDTQHRRAVILLRRV 249
            HCVRP D   + A I++R++
Sbjct: 732 KHCVRPCDMTGKNATIIMRQM 752


>gi|145483981|ref|XP_001428013.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395096|emb|CAK60615.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1283

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 96/203 (47%), Gaps = 17/203 (8%)

Query: 59   FSHEECLALERNIDEVSSLAEQGQYKPCTVD---------RAPLRNKYFFGEGYTYGSQL 109
            F++EE L +E+ I E+      G Y P T            +  R K+FFG  Y +    
Sbjct: 1041 FTNEELLQMEKEIYEMELKGFDGHYLPMTAQISHAQSNHHSSVRRTKFFFGYRYMWTKCQ 1100

Query: 110  VRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSH 168
            + +           +V   P W+++L ITKLE   VVP  +INS   N Y  G   +  H
Sbjct: 1101 LAEPHSKVAAGVRRDVSAPPLWMRNL-ITKLENDNVVPKKFINSIACNIYHDGKEGLAQH 1159

Query: 169  IDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAAN 226
             D    F +PI ++ +FSD  L FG +F             +P+ RGC+  + +  +AAN
Sbjct: 1160 FDDAVRFKQPIFTIRVFSDCRLSFGSQF----YGFCNGAFAVPLPRGCILCMEEGSYAAN 1215

Query: 227  GITHCVRPQDTQHRRAVILLRRV 249
             I HCVRP D   + A ++LR++
Sbjct: 1216 AIKHCVRPCDMTGKSAALILRQM 1238


>gi|145520431|ref|XP_001446071.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124413548|emb|CAK78674.1| unnamed protein product [Paramecium tetraurelia]
          Length = 941

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 97/203 (47%), Gaps = 17/203 (8%)

Query: 59  FSHEECLALERNIDEVSSLAEQGQYKPCTVD---------RAPLRNKYFFGEGYTYGSQL 109
           F++EE L +E++I E+      G Y P T            +  R K+FFG  Y +    
Sbjct: 699 FTNEELLQMEKDIYEMELKGFDGHYLPMTAQISHAQSNHHSSVRRTKFFFGYRYMWTKCQ 758

Query: 110 VRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSH 168
           + +           +V   P W+++L ITKLE   VVP  +INS   N Y  G   +  H
Sbjct: 759 LAEPHSMVAAGVRRDVSAPPLWMRNL-ITKLENDNVVPKKFINSIACNVYHDGKEGLAQH 817

Query: 169 IDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAAN 226
            D    F +PI ++ +FSD  L FG +F             +P+ RGC+  + +  +AAN
Sbjct: 818 FDDAVRFKQPIYTIRVFSDCRLSFGSQF----YGFCNGAFAVPLPRGCILCMEEGSYAAN 873

Query: 227 GITHCVRPQDTQHRRAVILLRRV 249
            I HCVRP D   + A ++LR++
Sbjct: 874 AIKHCVRPCDMTGKSAALILRQM 896


>gi|440799034|gb|ELR20095.1| SAM domain (Sterile alpha motif) domain containing protein
           [Acanthamoeba castellanii str. Neff]
          Length = 1394

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 101/206 (49%), Gaps = 16/206 (7%)

Query: 58  LFSHEECLALERNIDEVSSLAEQGQYKP------CTV--DRAPLRNKYFFGEGYTYGSQL 109
            F+  E   +E++ D   + A  G+Y        CT   DR   R K+FF   Y + ++ 
Sbjct: 664 FFTDAELRQIEKSADATDADARNGKYDEVPNTYQCTRKGDRL-CRTKFFFSARYLWTAEQ 722

Query: 110 VRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSH 168
           +      +      +V   P W++ L+   L +  +V   ++N   +N Y  G   +  H
Sbjct: 723 MASPNSTQAQGIRADVAEPPFWMRQLIERPLVESRIVQEAWVNEFAMNIYHDGTEGLGQH 782

Query: 169 IDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL--RDFAAN 226
            D    F RPI +L +FSDS L FG K +F    ++  V ++P+ RGC+T+L  + +A +
Sbjct: 783 YDDKKRFKRPIHTLRIFSDSRLSFGSK-DFS---MTNSVFFIPMPRGCITVLEKKGYAVD 838

Query: 227 GITHCVRPQDTQHRRAVILLRRVLPH 252
           GI HCVR  D   +  V+LLR + PH
Sbjct: 839 GIKHCVRAADMGGKSGVVLLRHIHPH 864


>gi|403336135|gb|EJY67254.1| hypothetical protein OXYTRI_12242 [Oxytricha trifallax]
          Length = 943

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 92/206 (44%), Gaps = 28/206 (13%)

Query: 57  SLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRA-----PLRNKYFFGEGYTYGSQLVR 111
           + F+H E L +E N+ E         + P T  +        R K+FFG  Y +  + + 
Sbjct: 683 NFFTHGELLNIEENVMETERQCSNSAFLPDTAQQTFSGSRLTRTKFFFGYRYMWTKEQLS 742

Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDP 171
           +           +V   P W+   +++K+E+ G+V  N+INS  +N Y  G         
Sbjct: 743 EPFSQVAAGVRKDVSRPPVWVNEFMVSKMEKCGIVEKNFINSIAMNVYHDG--------- 793

Query: 172 PHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRD--FAANGIT 229
                     L+   D A  F   + F          ++P+ RGC+ ++ +  +AANG+ 
Sbjct: 794 -------TEGLAQHYDDATRFKQFYGF-----VNGAFFIPLARGCICVMEENGYAANGVK 841

Query: 230 HCVRPQDTQHRRAVILLRRVLPHAPR 255
           HC+RPQD   + A ++LR++ P+  +
Sbjct: 842 HCIRPQDMTGKSAALILRQMHPNVVK 867


>gi|303274614|ref|XP_003056625.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462709|gb|EEH60001.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 897

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 69/232 (29%), Positives = 104/232 (44%), Gaps = 19/232 (8%)

Query: 29  RSSPYDIGQSKHHARELRQKI---LQGVNQ-MSLFSHEECLALERNIDE-VSSLAEQGQY 83
           R+   D G+ K   R  R +      GV Q +  F  +E    E+ I E +      G  
Sbjct: 641 RADDGDAGERKSTKRVWRFRSNGQYFGVTQYLHAFDDDELEMTEKAITELIDRERASGDA 700

Query: 84  KPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQM 143
               V     R K FFG  YTY S    K + +          PIP+W+ S +  + + +
Sbjct: 701 PHVHVGLNGGRIKVFFGYRYTYDSVTAPKLIAD--------APPIPSWVHSTLGNRAKII 752

Query: 144 GVVPPNY-INSAVINDYQPGGCIVS-HIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 201
           G +  N  I++AV+N Y      +S H+DP  +F RPI+S   F D  L FG K  ++  
Sbjct: 753 GAIDKNVVIDNAVVNLYARSKARLSVHMDPSALFKRPIVSARFFGDGVLSFGAKGQYEGQ 812

Query: 202 RVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHA 253
           R+      +P+ RG + ++  +AAN +TH V  +D   R    +LR   P A
Sbjct: 813 RIHS----VPLTRGSIAVMEGYAANMVTHAVNERDVVGRGCSFMLRSCQPDA 860


>gi|302841282|ref|XP_002952186.1| hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]
 gi|300262451|gb|EFJ46657.1| hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]
          Length = 2654

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 74/229 (32%), Positives = 100/229 (43%), Gaps = 39/229 (17%)

Query: 58   LFSHEECLALERNIDEVSSLAEQG-----------------QYKPCTVDRAPLRNKYFFG 100
            LF  +E   +E  +D   +LA +G                          A  R K FFG
Sbjct: 1604 LFRQDELYGMEAQVDATDTLARRGLLPSDCYHHSFGAGGGGSGGIAAAAAALKRTKMFFG 1663

Query: 101  EGYTYG-SQLVRKGL---GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVI 156
              Y +   QL   G    G  R+    +V P   W++  V   L + GVVP  +++S  +
Sbjct: 1664 ARYLWTREQLAAPGARIAGGVRV----DVPPPLRWMRHQVEAPLVRCGVVPRGFVDSIAL 1719

Query: 157  NDYQPGG--------CIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVL 208
            N Y  G            SH D    F RPI S+ LFSDS L FG          +    
Sbjct: 1720 NLYHDGSEGIQVGGGGEGSHYDDAVRFRRPIYSVRLFSDSRLSFGGHLYGN----TNNDF 1775

Query: 209  YLPVQRGCVTLLRD--FAANGITHCVRPQDTQHRRAVILLRRVLPHAPR 255
            ++P+ RGCVT+L D  +AANGI HCVRP D   + A ++LR + P A R
Sbjct: 1776 FVPMPRGCVTVLEDESYAANGIKHCVRPADMAGKSAALILRGIDPGAMR 1824


>gi|384246841|gb|EIE20330.1| hypothetical protein COCSUDRAFT_67586 [Coccomyxa subellipsoidea
           C-169]
          Length = 988

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 58/189 (30%), Positives = 90/189 (47%), Gaps = 13/189 (6%)

Query: 57  SLFSHEECLALERNIDEVSSLAEQGQY-KPC---TVDRAP--LRNKYFFGEGYTYGSQLV 110
           + ++ EE   +E   D V + A +G+  + C   T  R     R K+FFG  Y + ++  
Sbjct: 778 NFYTAEELRQMEAGADVVHAKAREGRMPQTCFHETRGRGGGLKRTKFFFGARYLWTAEQA 837

Query: 111 RKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHI 169
                        +V P P W+Q+L    L + G+VP ++ ++  +N Y  G   I SH 
Sbjct: 838 ADPASRLAAGVRLDVPPAPTWMQALAEKPLVETGLVPKDFFDAWALNLYHDGSEGIQSHF 897

Query: 170 DPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLR--DFAANG 227
           D    F RPI S+ LFSDS L FG +        +     + + RGC+T++    +A  G
Sbjct: 898 DDGTRFSRPIFSVRLFSDSRLSFGTQL----YGYTNGAFTVDMPRGCITVMEAGGYAVEG 953

Query: 228 ITHCVRPQD 236
           + HCVRP D
Sbjct: 954 VKHCVRPAD 962


>gi|356552441|ref|XP_003544576.1| PREDICTED: uncharacterized protein LOC100797674 [Glycine max]
          Length = 522

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 62/214 (28%), Positives = 95/214 (44%), Gaps = 25/214 (11%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVD------RAPLRNKYFFGE 101
           IL+G+     +FS  E   +   +  +  +  +G+ K  T        R   R    FG 
Sbjct: 194 ILEGLELHTGIFSAAEQKRIVNYVASLQEMGRKGELKEQTFSAPQKWMRGKGRQTIQFGC 253

Query: 102 GYTYGSQLVRKG-----LGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAV 155
            Y Y   + R G     LGN      G VDPIPA  + ++I +L +  V+PP  + +S +
Sbjct: 254 CYNYA--VDRDGNPPGILGN------GMVDPIPALFK-VIIRRLIKWHVLPPTCVPDSCI 304

Query: 156 INDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRG 215
           +N Y+ G CI  HID  H F RP  ++S  S+  + FG           +  + +P+  G
Sbjct: 305 VNIYEEGDCIPPHID-NHDFVRPFCTVSFLSECNIVFGSNLKIVGPGEFDGSIAIPLPMG 363

Query: 216 CVTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
            V +L    A+   HCV    T  +R  I  RR+
Sbjct: 364 SVLVLNGNGADVAKHCVPAVPT--KRISITFRRM 395


>gi|307105467|gb|EFN53716.1| hypothetical protein CHLNCDRAFT_136563 [Chlorella variabilis]
          Length = 521

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 59/129 (45%), Gaps = 30/129 (23%)

Query: 129 PAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG-CIVSHIDPPHIFDRPIISLSLFSD 187
           P W++  V   L   G+V P + +S  +N Y  G   I SH D    F +PI+SL LFSD
Sbjct: 319 PRWMRQKVEAPLVGAGLVAPGFCDSIALNIYHDGSEGIQSHYDDAARFCQPILSLRLFSD 378

Query: 188 SALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLR 247
                                   ++ G       +AANGI HCVRP D   + A ++LR
Sbjct: 379 M-----------------------MEEG------GYAANGIKHCVRPMDMSGKSAGMILR 409

Query: 248 RVLPHAPRL 256
           R+ P AP L
Sbjct: 410 RINPPAPAL 418


>gi|356562125|ref|XP_003549324.1| PREDICTED: uncharacterized protein LOC100787321 [Glycine max]
          Length = 520

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 93/210 (44%), Gaps = 18/210 (8%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVD------RAPLRNKYFFGE 101
           IL+G+     +FS  E   +   +  +  + ++G+ K  T        R   R    FG 
Sbjct: 193 ILEGLELHTGIFSAAEQKRIVNYVASLQEMGKKGELKERTFSAPQKWMRGKGRQTIQFGC 252

Query: 102 GYTYGSQLVRKGLGNE-RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
            Y Y  +      GN   +   G VDPIP   + ++I +L +  V+PP  + +S ++N Y
Sbjct: 253 CYNYADRD-----GNPPGILTNGMVDPIPDLFK-VIIRRLVKWHVLPPTCVPDSCIVNIY 306

Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTL 219
           + G CI  HID  H F RP  ++S  S+  + FG           +  + +P+  G V +
Sbjct: 307 EEGDCIPPHID-NHDFVRPFCTVSFLSECNIVFGSNLKIVGPGEFDGSIAIPLPMGSVLV 365

Query: 220 LRDFAANGITHCVRPQDTQHRRAVILLRRV 249
           L    A+   HCV    T  +R  I  RR+
Sbjct: 366 LNGNGADVAKHCVPAVPT--KRISITFRRM 393


>gi|384245374|gb|EIE18868.1| hypothetical protein COCSUDRAFT_20284, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 246

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 53/155 (34%), Positives = 77/155 (49%), Gaps = 9/155 (5%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y   + R+G     +  E  V+P+P  +Q+LV  +L + GV+P +   +SA+IN
Sbjct: 84  FGCCYNYA--IDREGR-EPGIIAEEVVEPMPPMLQALV-HRLVRWGVMPRSKAPDSAIIN 139

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKF-NFKPIRVSEPVLYLPVQRGC 216
            Y    CI  HID  H F RP  ++SL S+ A+ FG K     P +       +P+  G 
Sbjct: 140 IYDQEDCIPPHIDH-HDFSRPFCTISLLSEQAIMFGAKLIPLGPGKFGGNHCTIPLPVGS 198

Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLP 251
             +L+   A+   HCV P     RR  I LR+  P
Sbjct: 199 CLVLKGNGADVAMHCVPP--VSQRRMSITLRKCAP 231


>gi|343172812|gb|AEL99109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein,
           partial [Silene latifolia]
 gi|343172814|gb|AEL99110.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein,
           partial [Silene latifolia]
          Length = 261

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 95/213 (44%), Gaps = 23/213 (10%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK---------PCTVDRAPLRNKYF 98
           +LQG+     +FS EE    ++ ++ V +    GQ K         P    R   R    
Sbjct: 39  VLQGLELHTGVFSPEE---QKKIVESVYNFQRMGQKKQLMQRTYTEPAKWMRGKGRVTIQ 95

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y    V K      +  + EVDP+P   + + I+++ +  V+PP  + NS ++N
Sbjct: 96  FGCCYNYA---VDKDGNPPGIIRDEEVDPLPPVFKQM-ISRMVKWHVLPPTRVPNSCIVN 151

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGC 216
            Y  G CI  HID  H F RP  ++S  ++  + FG     + P   S PV  +P+  G 
Sbjct: 152 IYDEGDCIPPHID-HHDFLRPFCTVSFLTECNIMFGTNLKVEGPGEFSGPVT-IPLPVGS 209

Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
           V +L    A+   HC+       +R  +  RR+
Sbjct: 210 VLVLNGNGADVAKHCI--PAVPAKRISVTFRRM 240


>gi|79326344|ref|NP_001031794.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|55819794|gb|AAV66092.1| At4g36090 [Arabidopsis thaliana]
 gi|59958356|gb|AAX12888.1| At4g36090 [Arabidopsis thaliana]
 gi|332661217|gb|AEE86617.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 520

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 116/282 (41%), Gaps = 43/282 (15%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
           IL+G+     +FS    +  ++ +D V  L E+G+       R  LR + F         
Sbjct: 216 ILEGLELHTGVFS---AVEQKKIVDFVYELQEKGR-------RGELRERTFTAPHKWMRG 265

Query: 99  -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
                  FG  Y Y         G   +   G+VDP+P+ I  ++I +L    V+PP  +
Sbjct: 266 KGRVTIQFGCCYNYAPDKAGNPPG---ILQRGDVDPMPS-IFKVIIKRLVGWHVLPPTCV 321

Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
            +S ++N Y+   CI  HID  H F RP  ++S  S+  + FG                +
Sbjct: 322 PDSCIVNIYEEDDCIPPHID-NHDFLRPFCTVSFLSECNILFGSNLKVLGPGEFSGSYSI 380

Query: 211 PVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVK----- 265
           P+  G V +L+   A+   HCV    T  +R  I  R++      +  +  P ++     
Sbjct: 381 PLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLEEIKPL 438

Query: 266 PY-HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSNGLDK 306
           PY H+     A       RSSI+   N  NH+N+    G  K
Sbjct: 439 PYEHTTPSTPADAVISSSRSSID--QNGSNHNNRTAHGGGSK 478


>gi|449434166|ref|XP_004134867.1| PREDICTED: uncharacterized protein LOC101203292 [Cucumis sativus]
 gi|449531418|ref|XP_004172683.1| PREDICTED: uncharacterized protein LOC101225118 [Cucumis sativus]
          Length = 499

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 63/217 (29%), Positives = 98/217 (45%), Gaps = 31/217 (14%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFG-EGYTYG 106
           IL+G+     +FS  E     R +D V +L E G+       R  LR + F   + +  G
Sbjct: 218 ILEGLELHTCIFSAAE---QTRIVDHVYALQEMGK-------RGELRERTFSAPKKWMKG 267

Query: 107 SQLVRKGLG-------NERLYPEGE-----VDPIPAWIQSLVITKLEQMGVVPPNYI-NS 153
              V    G       ++   P G      VDP+P+  + ++I +L +  V+PP  + +S
Sbjct: 268 KGRVTMQFGCCYNYAPDKNGNPPGILRSEIVDPLPSLFK-VIIRRLVRWHVLPPTCVPDS 326

Query: 154 AVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF-KPIRVSEPVLYLPV 212
            ++N Y  G CI  HID  H F RP  ++S  S+  + FG   +   P   S P+  +P+
Sbjct: 327 CIVNIYDEGDCIPPHID-NHDFVRPFCTVSFLSECNIVFGTNLSIVGPGEFSGPIA-IPL 384

Query: 213 QRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
             G V +L    A+   HCV    T  +R  I  RR+
Sbjct: 385 PVGSVLVLNGNGADVAKHCVPAVPT--KRISITFRRI 419


>gi|79326336|ref|NP_001031793.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|51971445|dbj|BAD44387.1| hypothetical protein [Arabidopsis thaliana]
 gi|332661216|gb|AEE86616.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 452

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 73/282 (25%), Positives = 116/282 (41%), Gaps = 43/282 (15%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
           IL+G+     +FS    +  ++ +D V  L E+G+       R  LR + F         
Sbjct: 148 ILEGLELHTGVFS---AVEQKKIVDFVYELQEKGR-------RGELRERTFTAPHKWMRG 197

Query: 99  -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
                  FG  Y Y         G   +   G+VDP+P+ I  ++I +L    V+PP  +
Sbjct: 198 KGRVTIQFGCCYNYAPDKAGNPPG---ILQRGDVDPMPS-IFKVIIKRLVGWHVLPPTCV 253

Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
            +S ++N Y+   CI  HID  H F RP  ++S  S+  + FG                +
Sbjct: 254 PDSCIVNIYEEDDCIPPHID-NHDFLRPFCTVSFLSECNILFGSNLKVLGPGEFSGSYSI 312

Query: 211 PVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVK----- 265
           P+  G V +L+   A+   HCV    T  +R  I  R++      +  +  P ++     
Sbjct: 313 PLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLEEIKPL 370

Query: 266 PY-HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSNGLDK 306
           PY H+     A       RSSI+   N  NH+N+    G  K
Sbjct: 371 PYEHTTPSTPADAVISSSRSSID--QNGSNHNNRTAHGGGSK 410


>gi|119638440|gb|ABL85031.1| oxidoreductase [Brachypodium sylvaticum]
          Length = 598

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 29/264 (10%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYF 98
           IL G+     +FS  E     R +D V  L EQG+          +P    R   R    
Sbjct: 197 ILAGLELHAGVFSAAE---QRRIVDCVYGLQEQGKRGELGDRTYTEPEKWMRGKGRVTIQ 253

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y +       G  R +     DPIP   + ++I +L +  ++PP+ + +S ++N
Sbjct: 254 FGCCYNYATDKNGNPPGIIRTFAS---DPIPDLFK-VMIKRLVRWCILPPDCVPDSCIVN 309

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
            Y PG CI  HID  H F RP  ++S  S+  + FG                +P+  G V
Sbjct: 310 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSFAIPLPVGSV 368

Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLYDVQARE 277
            ++    A+   HCV       +R  I  R++ P          P       L D+   E
Sbjct: 369 LVINGNGADVAKHCV--PAVPAKRISITFRKMDPAKRPFGFKDDP------ELLDITPME 420

Query: 278 SGG--YFRSSIETYNNNENHSNKN 299
           +      RSS E      +  NKN
Sbjct: 421 AASEETSRSSDEGKGKQLDVQNKN 444


>gi|147857702|emb|CAN80815.1| hypothetical protein VITISV_020466 [Vitis vinifera]
          Length = 849

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 19/195 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
           IL G+   + +FS  E   +   I E+  +   GQ K      P    R   R    FG 
Sbjct: 561 ILDGLELHVGIFSAAEQKRIVDFIYELQEMGRNGQLKERTYSAPQKWMRGKGRVTIQFGC 620

Query: 102 GYTYGSQLVRKGLGNERLYPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
            Y Y +       GN     + EV DPIP   + ++I +L +  V+PP+ + +S ++N Y
Sbjct: 621 CYNYATDKN----GNPPGILQNEVVDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIY 675

Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCV 217
           + G CI  HID  H F RP  ++S  S+  + FG   N K +   E V    +P+  G V
Sbjct: 676 EEGDCIPPHID-NHDFVRPFCTVSFLSECBIVFGT--NLKXLGAGEFVGPFAIPLPVGSV 732

Query: 218 TLLRDFAANGITHCV 232
            +L    A+   HCV
Sbjct: 733 LVLNGNGADVAKHCV 747


>gi|302757365|ref|XP_002962106.1| hypothetical protein SELMODRAFT_64626 [Selaginella moellendorffii]
 gi|300170765|gb|EFJ37366.1| hypothetical protein SELMODRAFT_64626 [Selaginella moellendorffii]
          Length = 289

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 57/202 (28%), Positives = 89/202 (44%), Gaps = 21/202 (10%)

Query: 43  RELRQKILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
           + +R  ILQG+    ++FS  E     R +D V  L  +GQ             K+  G+
Sbjct: 64  KGMRVNILQGLELHTNIFSAAE---QSRLLDMVFELELKGQRNELKERTYSAPRKWMQGK 120

Query: 102 G---------YTYGSQLVRKGLGNE-RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
           G         Y Y         GN   +  + EVDP+P  ++S  I +L +  V+PP  +
Sbjct: 121 GRVTLQFGCCYNYS----HDKFGNTPGILQDEEVDPLPPLLKS-TIKRLVRWHVLPPTCV 175

Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
            +S ++N Y+ G CI  HID  H F RP  ++SL S  ++ FG   +       +     
Sbjct: 176 PDSCIVNIYETGDCIPPHID-HHDFLRPFCTVSLLSQCSIVFGSSLSVAAPGEFDGSFST 234

Query: 211 PVQRGCVTLLRDFAANGITHCV 232
            +  G V +L D  A+   HC+
Sbjct: 235 QLPVGSVLVLNDNGADVAKHCI 256


>gi|297798302|ref|XP_002867035.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312871|gb|EFH43294.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 490

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 73/271 (26%), Positives = 114/271 (42%), Gaps = 38/271 (14%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
           IL+G+     +FS  E    ++ +D V  L E+GQ       R  LR++ F         
Sbjct: 186 ILEGLELHTGVFSAVE---QKKIVDFVYELQEKGQ-------RGELRDRTFTAPHKWMRG 235

Query: 99  -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
                  FG  Y Y         G   +     VDPIP+  + ++I +L    V+PP  I
Sbjct: 236 KGRVTIQFGCCYNYAPDKAGNPPG---ILQREAVDPIPSLFK-VIIKRLVGWHVLPPTCI 291

Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPVLY 209
            +S ++N Y  G CI  HID  H F RP  ++S  S+  + FG       P   S P   
Sbjct: 292 PDSCIVNIYDEGDCIPPHIDN-HDFVRPFCTVSFLSECNILFGSNLKVVGPGEFSGP-YS 349

Query: 210 LPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHS 269
           +P+  G V +L+   A+   HCV    T  +R  I  R++      +  +  P ++    
Sbjct: 350 MPLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLQGIKP 407

Query: 270 L-YDVQARESGGYFRSSIETYNNNENHSNKN 299
           L Y+   + +      S    +N++N SN N
Sbjct: 408 LPYEQTTQSTPAAAAISSSRSSNDQNGSNHN 438


>gi|297735021|emb|CBI17383.3| unnamed protein product [Vitis vinifera]
          Length = 547

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 19/195 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
           IL G+   + +FS  E   +   I E+  +   GQ K      P    R   R    FG 
Sbjct: 259 ILDGLELHVGIFSAAEQKRIVDFIYELQEMGRNGQLKERTYSAPQKWMRGKGRVTIQFGC 318

Query: 102 GYTYGSQLVRKGLGNERLYPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
            Y Y +       GN     + EV DPIP   + ++I +L +  V+PP+ + +S ++N Y
Sbjct: 319 CYNYATD----KNGNPPGILQNEVVDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIY 373

Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCV 217
           + G CI  HID  H F RP  ++S  S+  + FG   N K +   E V    +P+  G V
Sbjct: 374 EEGDCIPPHID-NHDFVRPFCTVSFLSECDIVFGT--NLKILGAGEFVGPFAIPLPVGSV 430

Query: 218 TLLRDFAANGITHCV 232
            +L    A+   HCV
Sbjct: 431 LVLNGNGADVAKHCV 445


>gi|225431153|ref|XP_002268446.1| PREDICTED: uncharacterized protein LOC100250563 [Vitis vinifera]
          Length = 510

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 89/195 (45%), Gaps = 19/195 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
           IL G+   + +FS  E   +   I E+  +   GQ K      P    R   R    FG 
Sbjct: 222 ILDGLELHVGIFSAAEQKRIVDFIYELQEMGRNGQLKERTYSAPQKWMRGKGRVTIQFGC 281

Query: 102 GYTYGSQLVRKGLGNERLYPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY 159
            Y Y +       GN     + EV DPIP   + ++I +L +  V+PP+ + +S ++N Y
Sbjct: 282 CYNYATD----KNGNPPGILQNEVVDPIPPLFK-VIIRRLVRWHVLPPSCVPDSCIVNIY 336

Query: 160 QPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV--LYLPVQRGCV 217
           + G CI  HID  H F RP  ++S  S+  + FG   N K +   E V    +P+  G V
Sbjct: 337 EEGDCIPPHID-NHDFVRPFCTVSFLSECDIVFGT--NLKILGAGEFVGPFAIPLPVGSV 393

Query: 218 TLLRDFAANGITHCV 232
            +L    A+   HCV
Sbjct: 394 LVLNGNGADVAKHCV 408


>gi|357118936|ref|XP_003561203.1| PREDICTED: uncharacterized protein LOC100826719 [Brachypodium
           distachyon]
          Length = 594

 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 102/258 (39%), Gaps = 24/258 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYF 98
           IL G+     +FS  E     R +D V  L EQG+          +P    R   R    
Sbjct: 193 ILAGLELHAGVFSAAE---QRRIVDCVYGLQEQGKRGELGDRTYTEPEKWMRGKGRVTIQ 249

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y +       G  R +     DPIP   + ++I +L +  ++PP+ + +S ++N
Sbjct: 250 FGCCYNYATDKNGNPPGIIRTFAS---DPIPDLFK-VMIKRLVRWCILPPDCVPDSCIVN 305

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
            Y PG CI  HID  H F RP  ++S  S+  + FG                +P+  G V
Sbjct: 306 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSFAIPLPAGSV 364

Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
            ++    A+   HCV       +R  I  R++ P          P    + P  +  +  
Sbjct: 365 LVINGNGADVAKHCV--PAVPAKRISITFRKMDPAKRPFGFKDDPELLNITPMEAASEET 422

Query: 275 ARESGGYFRSSIETYNNN 292
           +R S       ++  N N
Sbjct: 423 SRSSDEGKGKQLDVQNKN 440


>gi|297847176|ref|XP_002891469.1| hypothetical protein ARALYDRAFT_314327 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337311|gb|EFH67728.1| hypothetical protein ARALYDRAFT_314327 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 335

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/191 (29%), Positives = 85/191 (44%), Gaps = 14/191 (7%)

Query: 66  ALERN--IDEVSSLAEQGQ---YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLY 120
           A E+N  +D+V  L E+GQ    K     R   R+   FG  + Y +       G   + 
Sbjct: 114 AAEQNMIVDKVCELQEKGQKGELKRAFTARGKGRSTIQFGCCFNYRTSKAGNPAG---IL 170

Query: 121 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPI 179
               VDP+P   + ++I +L +  V+PP  + +  V+N Y  G CI  HID  H F RP 
Sbjct: 171 KHETVDPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-NHDFLRPF 228

Query: 180 ISLSLFSDSALCFGCKFNFKPIRVSEPVLY-LPVQRGCVTLLRDFAANGITHCVRPQDTQ 238
            ++S  S+  + FG     K         Y LP+  G V +L    A+   HCV    T 
Sbjct: 229 CTVSFLSECNILFGSNLKVKETGEYTGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT- 287

Query: 239 HRRAVILLRRV 249
            +R  I  R++
Sbjct: 288 -KRISITFRKM 297


>gi|224094961|ref|XP_002310305.1| predicted protein [Populus trichocarpa]
 gi|222853208|gb|EEE90755.1| predicted protein [Populus trichocarpa]
          Length = 245

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 6/132 (4%)

Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
           VDPIP   + ++I +L +  V+PP  + +S ++N Y+ G CI  HID  H F RP  ++S
Sbjct: 34  VDPIPVLFK-VIIRRLVRWHVIPPTCVPDSCIVNIYEEGDCIPPHIDN-HDFVRPFCTVS 91

Query: 184 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
             S+  + FG       +   +  + +P+  G   +L   AA+   HCV    + H+R +
Sbjct: 92  FLSECNMLFGSHLKIVGVGDFDGPIAIPLPVGSALVLNGNAADVAKHCV---PSVHKRRI 148

Query: 244 ILLRRVLPHAPR 255
            +  R +  A R
Sbjct: 149 SITFRKMDEAKR 160


>gi|218197526|gb|EEC79953.1| hypothetical protein OsI_21553 [Oryza sativa Indica Group]
          Length = 600

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 26/268 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQY---------KPCTVDRAPLRNKYF 98
           IL G+     +FS  E    +R +D V  L E G++         +P    R   R    
Sbjct: 186 ILAGLELHCGVFSAAE---QKRIVDYVYDLQEMGKHGELGDRTYTEPQRWMRGKGRVTIQ 242

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y +       G  R       DP+P+  + ++I +L +  V+P   I +S ++N
Sbjct: 243 FGCCYNYATDKNGNPPGIIRTI---ASDPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVN 298

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
            Y PG CI  HID  H F RP  ++S  S+  + FG              L +P+  G V
Sbjct: 299 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSV 357

Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
            +L    A+   HCV    T  +R  I  R++ P          P    + P  +     
Sbjct: 358 LILNGNGADVAKHCVPAVPT--KRISITFRKMDPAKRPFNFRDDPELLNIIPLETAVQET 415

Query: 275 ARES--GGYFRSSIETYNNNENHSNKNT 300
            R S  G   +  I+  N ++ H NK +
Sbjct: 416 GRSSDEGKGKQPDIQIRNPSKAHRNKKS 443


>gi|222634922|gb|EEE65054.1| hypothetical protein OsJ_20056 [Oryza sativa Japonica Group]
          Length = 602

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 26/268 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQY---------KPCTVDRAPLRNKYF 98
           IL G+     +FS  E    +R +D V  L E G++         +P    R   R    
Sbjct: 187 ILAGLELHCGVFSAAE---QKRIVDYVYDLQEMGKHGELGDRTYTEPQRWMRGKGRVTIQ 243

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y +       G  R       DP+P+  + ++I +L +  V+P   I +S ++N
Sbjct: 244 FGCCYNYATDKNGNPPGIIRTIAS---DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVN 299

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
            Y PG CI  HID  H F RP  ++S  S+  + FG              L +P+  G V
Sbjct: 300 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSV 358

Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
            +L    A+   HCV    T  +R  I  R++ P          P    + P  +     
Sbjct: 359 LILNGNGADVAKHCVPAVPT--KRISITFRKMDPAKRPFNFRDDPELLNIIPLETAVQET 416

Query: 275 ARES--GGYFRSSIETYNNNENHSNKNT 300
            R S  G   +  I+  N ++ H NK +
Sbjct: 417 GRSSDEGKGKQPDIQIRNPSKAHRNKKS 444


>gi|15227938|ref|NP_179387.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|4406821|gb|AAD20129.1| hypothetical protein [Arabidopsis thaliana]
 gi|34365749|gb|AAQ65186.1| At2g17970 [Arabidopsis thaliana]
 gi|51970124|dbj|BAD43754.1| hypothetical protein [Arabidopsis thaliana]
 gi|330251612|gb|AEC06706.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 507

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 17/192 (8%)

Query: 68  ERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
           +R +D+V  L E+G+           P    R   R    FG  Y Y         G   
Sbjct: 229 KRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQFGCCYNYAPDRAGNPPG--- 285

Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 177
           +    EVDP+P   + ++I KL +  V+PP  + +S ++N Y  G CI  HID  H F R
Sbjct: 286 ILQREEVDPLPHLFK-VIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLR 343

Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
           P  ++S  S+  + FG     +          +P+  G V +L    A+   HCV    T
Sbjct: 344 PFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT 403

Query: 238 QHRRAVILLRRV 249
             +R  I  R++
Sbjct: 404 --KRISITFRKM 413


>gi|357445515|ref|XP_003593035.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
 gi|124360268|gb|ABN08281.1| 2OG-Fe(II) oxygenase [Medicago truncatula]
 gi|355482083|gb|AES63286.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
          Length = 391

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 74/161 (45%), Gaps = 14/161 (8%)

Query: 77  LAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLV 136
           L E+   KP    R   R    FG  Y Y    V K      +    EVDP+P   + + 
Sbjct: 182 LRERTYSKPRKWMRGKGRETLQFGCCYNYA---VDKNGNPPGISRTEEVDPLPRMFKQM- 237

Query: 137 ITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 195
           I ++ +  ++PP  + +S ++N Y  G CI  HID  H F RP  S+S  + + + FG  
Sbjct: 238 IKRMVRWKIIPPTCVPDSCIVNIYDVGDCIPPHID-HHDFVRPFYSVSFLNKAKILFGS- 295

Query: 196 FNFKPIRVSE----PVLYLPVQRGCVTLLRDFAANGITHCV 232
            N K +R  E      + LPV  G V +L+   A+   HC+
Sbjct: 296 -NLKAVRPGEFFGPASISLPV--GSVFVLKGNGADIAKHCI 333


>gi|115466278|ref|NP_001056738.1| Os06g0138200 [Oryza sativa Japonica Group]
 gi|55296462|dbj|BAD68658.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein-like [Oryza
           sativa Japonica Group]
 gi|113594778|dbj|BAF18652.1| Os06g0138200 [Oryza sativa Japonica Group]
          Length = 616

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 26/268 (9%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQY---------KPCTVDRAPLRNKYF 98
           IL G+     +FS  E    +R +D V  L E G++         +P    R   R    
Sbjct: 201 ILAGLELHCGVFSAAE---QKRIVDYVYDLQEMGKHGELGDRTYTEPQRWMRGKGRVTIQ 257

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y +       G  R       DP+P+  + ++I +L +  V+P   I +S ++N
Sbjct: 258 FGCCYNYATDKNGNPPGIIRTIAS---DPMPSLFK-IMIKRLVRWHVLPKTCIPDSCIVN 313

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCV 217
            Y PG CI  HID  H F RP  ++S  S+  + FG              L +P+  G V
Sbjct: 314 IYDPGDCIPPHID-SHDFVRPFCTVSFLSECNILFGSTLKIAGPGEFTGSLPIPLPVGSV 372

Query: 218 TLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPR---VKPYHSLYDVQ 274
            +L    A+   HCV    T  +R  I  R++ P          P    + P  +     
Sbjct: 373 LILNGNGADVAKHCVPAVPT--KRISITFRKMDPAKRPFNFRDDPELLNIIPLETAVQET 430

Query: 275 ARES--GGYFRSSIETYNNNENHSNKNT 300
            R S  G   +  I+  N ++ H NK +
Sbjct: 431 GRSSDEGKGKQPDIQIRNPSKAHRNKKS 458


>gi|145328752|ref|NP_001077911.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|334184285|ref|NP_001189546.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330251613|gb|AEC06707.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330251614|gb|AEC06708.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 433

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 82/192 (42%), Gaps = 17/192 (8%)

Query: 68  ERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
           +R +D+V  L E+G+           P    R   R    FG  Y Y         G   
Sbjct: 155 KRIVDQVYQLQEKGRRGELKKRTFTAPHKWMRGKGRETIQFGCCYNYAPDRAGNPPG--- 211

Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 177
           +    EVDP+P   + ++I KL +  V+PP  + +S ++N Y  G CI  HID  H F R
Sbjct: 212 ILQREEVDPLPHLFK-VIIRKLIKWHVLPPTCVPDSCIVNIYDEGDCIPPHID-NHDFLR 269

Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
           P  ++S  S+  + FG     +          +P+  G V +L    A+   HCV    T
Sbjct: 270 PFCTISFLSECDILFGSNLKVEGPGDFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT 329

Query: 238 QHRRAVILLRRV 249
             +R  I  R++
Sbjct: 330 --KRISITFRKM 339


>gi|255579797|ref|XP_002530736.1| conserved hypothetical protein [Ricinus communis]
 gi|223529700|gb|EEF31642.1| conserved hypothetical protein [Ricinus communis]
          Length = 511

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 92/209 (44%), Gaps = 15/209 (7%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYK------PCTVDRAPLRNKYFFGE 101
           IL+G+     +FS  E   +   + E+  + ++G+ K      P    R   R    FG 
Sbjct: 228 ILEGLELHTCVFSAVEQNRIVNYVHELEEMGKRGELKERTYTAPQKWMRGKGRVTIQFGC 287

Query: 102 GYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQ 160
            Y Y +    K      +     VDPIP   + ++I +L +  V+PP  + +S ++N Y 
Sbjct: 288 CYNYATD---KNGNPPGILQSETVDPIPHLFK-VIIRRLIKWHVLPPTCVPDSCIVNIYD 343

Query: 161 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
            G CI  HID  H F+RP  ++S  S+  + FG              + +P+  G V +L
Sbjct: 344 EGDCIPPHID-NHDFNRPFCTVSFVSECNIVFGSNLKVLGPGEFSGSIAIPLPVGSVLVL 402

Query: 221 RDFAANGITHCVRPQDTQHRRAVILLRRV 249
              AA+   HCV    T  +R  I  R++
Sbjct: 403 DGNAADVSKHCVPSVPT--KRISITFRKM 429


>gi|297836514|ref|XP_002886139.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297331979|gb|EFH62398.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 17/192 (8%)

Query: 68  ERNIDEVSSLAEQGQ---------YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNER 118
           +R +D+V  L E+G+           P    R   R    FG  Y Y         G   
Sbjct: 224 KRIVDQVYQLQEKGRKGELKKRTFTAPHKWMRGKGRETIQFGCCYNYAPDRAGNPPG--- 280

Query: 119 LYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDR 177
           +    EVDP+P   + ++I +L +  V+PP  + +S ++N Y    CI  HID  H F R
Sbjct: 281 ILQREEVDPLPPLFK-VIIRRLIKWHVLPPTCVPDSCIVNIYDESDCIPPHID-NHDFLR 338

Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
           P  ++S  S+  + FG   N +          +P+  G V +L    A+   HCV    T
Sbjct: 339 PFCTISFLSECDILFGSNLNVEGPGEFSGSYSIPLPVGSVLVLNGNGADVAKHCVPAVPT 398

Query: 238 QHRRAVILLRRV 249
             +R  I  R++
Sbjct: 399 --KRISITFRKM 408


>gi|298204538|emb|CBI23813.3| unnamed protein product [Vitis vinifera]
          Length = 439

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 84/192 (43%), Gaps = 13/192 (6%)

Query: 44  ELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGY 103
           EL  ++     Q  +    EC+   + + +   L E+   +P    R   R    FG  Y
Sbjct: 162 ELHTRVFNSEEQKKIV---ECVYNLQRMGQKGMLRERTYSEPKKWMRGKGRVTIQFGCCY 218

Query: 104 TYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPG 162
            Y    V K      +  E EVDP+P   + + I ++ +  ++PP  + NS ++N Y  G
Sbjct: 219 NYA---VDKNGNPPGIIREEEVDPLPPLFKQM-IKRMVRWHILPPTCVPNSCIVNIYDEG 274

Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLL 220
            CI  HID  H F RP  ++S  ++  + FG           S PV + LP  +G V +L
Sbjct: 275 DCIPPHID-HHDFLRPFCTVSFLTECNILFGSSLKILDAGEFSGPVSISLP--KGSVLIL 331

Query: 221 RDFAANGITHCV 232
               A+   HCV
Sbjct: 332 NGNGADVAKHCV 343


>gi|225447996|ref|XP_002270229.1| PREDICTED: uncharacterized protein LOC100256702 [Vitis vinifera]
          Length = 457

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 84/192 (43%), Gaps = 13/192 (6%)

Query: 44  ELRQKILQGVNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGY 103
           EL  ++     Q  +    EC+   + + +   L E+   +P    R   R    FG  Y
Sbjct: 180 ELHTRVFNSEEQKKIV---ECVYNLQRMGQKGMLRERTYSEPKKWMRGKGRVTIQFGCCY 236

Query: 104 TYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPG 162
            Y    V K      +  E EVDP+P   + + I ++ +  ++PP  + NS ++N Y  G
Sbjct: 237 NYA---VDKNGNPPGIIREEEVDPLPPLFKQM-IKRMVRWHILPPTCVPNSCIVNIYDEG 292

Query: 163 GCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLL 220
            CI  HID  H F RP  ++S  ++  + FG           S PV + LP  +G V +L
Sbjct: 293 DCIPPHID-HHDFLRPFCTVSFLTECNILFGSSLKILDAGEFSGPVSISLP--KGSVLIL 349

Query: 221 RDFAANGITHCV 232
               A+   HCV
Sbjct: 350 NGNGADVAKHCV 361


>gi|302775126|ref|XP_002970980.1| hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]
 gi|300160962|gb|EFJ27578.1| hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]
          Length = 307

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 93/209 (44%), Gaps = 35/209 (16%)

Query: 43  RELRQKILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGE 101
           + +R  ILQG+    ++FS  E     R +D V  L  +GQ             K+  G+
Sbjct: 78  KGMRVNILQGLELHTNIFSAAE---QSRLLDMVFELELKGQRNELKERTYSAPRKWMQGK 134

Query: 102 G---------YTYGSQLVRKGLGNE-RLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
           G         Y Y         GN   +  + EVDP+P  ++S  I +L +  V+ P  +
Sbjct: 135 GRVTLQFGCCYNYS----HDKFGNTPGILQDEEVDPLPPLLKS-TIKRLVRWHVLSPTCV 189

Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLY- 209
            +S ++N Y+ G CI  HID  H F RP  ++SL S  ++ FG   +     V+ P  + 
Sbjct: 190 PDSCIVNIYEIGDCIPPHID-HHDFLRPFCTVSLLSQCSIVFGSSLS-----VAAPGEFD 243

Query: 210 ------LPVQRGCVTLLRDFAANGITHCV 232
                 LPV  G V +L D  A+   HC+
Sbjct: 244 GSFSTQLPV--GSVLVLNDNGADVAKHCI 270


>gi|357445523|ref|XP_003593039.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
 gi|124360274|gb|ABN08287.1| 2OG-Fe(II) oxygenase [Medicago truncatula]
 gi|355482087|gb|AES63290.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
          Length = 497

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 65/129 (50%), Gaps = 9/129 (6%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
           EVDP+P   + + I ++ +  ++PP  + +S ++N Y  G CI  HID  H F RP  S+
Sbjct: 285 EVDPLPDVFKQM-IKRMVRWNIIPPTCVPDSCIVNIYDVGDCIPPHID-HHDFVRPFYSV 342

Query: 183 SLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHR 240
           S  +++ + FG      +P   S P  + LP+  G V +L    A+   HC+       +
Sbjct: 343 SFLNEAKILFGSNLKEIQPGEFSGPASISLPL--GSVFVLNGNGADIAKHCI--PSVSSK 398

Query: 241 RAVILLRRV 249
           R  I  R++
Sbjct: 399 RISITFRKM 407


>gi|79319564|ref|NP_001031159.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|52354159|gb|AAU44400.1| hypothetical protein AT1G48980 [Arabidopsis thaliana]
 gi|332194254|gb|AEE32375.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 327

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 82/186 (44%), Gaps = 12/186 (6%)

Query: 69  RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
           R +D+V  L E+   G+ K     +   R+   FG  + Y +       G   +     V
Sbjct: 114 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 170

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DP+P   + ++I +L +  V+PP  + +  V+N Y  G CI  HID  H F RP  ++S 
Sbjct: 171 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 228

Query: 185 FSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
            S+  + FG     +     S     LP+  G V +L    A+   HCV    T  +R  
Sbjct: 229 LSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT--KRIS 286

Query: 244 ILLRRV 249
           I  R++
Sbjct: 287 ITFRKM 292


>gi|79361742|ref|NP_175328.2| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|52354161|gb|AAU44401.1| hypothetical protein AT1G48980 [Arabidopsis thaliana]
 gi|60547627|gb|AAX23777.1| hypothetical protein At1g48980 [Arabidopsis thaliana]
 gi|332194253|gb|AEE32374.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 331

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 82/186 (44%), Gaps = 12/186 (6%)

Query: 69  RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
           R +D+V  L E+   G+ K     +   R+   FG  + Y +       G   +     V
Sbjct: 118 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 174

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DP+P   + ++I +L +  V+PP  + +  V+N Y  G CI  HID  H F RP  ++S 
Sbjct: 175 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 232

Query: 185 FSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
            S+  + FG     +     S     LP+  G V +L    A+   HCV    T  +R  
Sbjct: 233 LSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCVPEVPT--KRIS 290

Query: 244 ILLRRV 249
           I  R++
Sbjct: 291 ITFRKM 296


>gi|186489643|ref|NP_001117455.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|332194255|gb|AEE32376.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 325

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 75/169 (44%), Gaps = 10/169 (5%)

Query: 69  RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
           R +D+V  L E+   G+ K     +   R+   FG  + Y +       G   +     V
Sbjct: 118 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 174

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DP+P   + ++I +L +  V+PP  + +  V+N Y  G CI  HID  H F RP  ++S 
Sbjct: 175 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 232

Query: 185 FSDSALCFGCKFNFKPI-RVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
            S+  + FG     +     S     LP+  G V +L    A+   HCV
Sbjct: 233 LSECNILFGSNLKVEETGEYSGGSYSLPLPVGSVLVLNGNGADVAKHCV 281


>gi|242074876|ref|XP_002447374.1| hypothetical protein SORBIDRAFT_06g033940 [Sorghum bicolor]
 gi|241938557|gb|EES11702.1| hypothetical protein SORBIDRAFT_06g033940 [Sorghum bicolor]
          Length = 345

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 60/198 (30%), Positives = 92/198 (46%), Gaps = 24/198 (12%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQ----------YKPCTVDRAPLRNKY 97
           ILQG+     +FS  E    +R +D V  L ++G+           +P    R   R   
Sbjct: 33  ILQGLELHTGVFSPAE---QQRIVDLVHDLQDKGRRGLLRGERTYSEPRKWMRGKGRATI 89

Query: 98  FFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVI 156
            FG  Y Y +   R G     +  E E DP+P  ++S+V  +L    V+PP+ + +S ++
Sbjct: 90  QFGCCYNYATD--RDGNPPGIIRRE-EADPLPPPLKSMV-RRLVSWRVLPPDCVPDSCIV 145

Query: 157 NDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFN-FKPIRVSEPV-LYLPVQR 214
           N Y  G CI  H+D  H F RP  ++S  ++ A+ FG +     P   + PV + LP   
Sbjct: 146 NVYDVGDCIPPHVD-HHDFARPFCTVSFLAECAILFGRELRVVGPGEFAGPVSISLPP-- 202

Query: 215 GCVTLLRDFAANGITHCV 232
           G V +L    A+   HCV
Sbjct: 203 GSVLVLAGNGADVAKHCV 220


>gi|413953301|gb|AFW85950.1| hypothetical protein ZEAMMB73_605385 [Zea mays]
          Length = 614

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 67/132 (50%), Gaps = 15/132 (11%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DP+P   +++ I +L +  V+P   + +S ++N Y+PG CI  HID  H F RP  ++S 
Sbjct: 281 DPMPDLFKTM-IKRLVRWQVLPATCVPDSCIVNIYEPGDCIPPHID-SHDFVRPFCTVSF 338

Query: 185 FSDSALCFGCKFNFKPIRVSEP-----VLYLPVQRGCVTLLRDFAANGITHCVRPQDTQH 239
            S+  + FG       ++V+ P      + +P+  G V +L    A+   HCV    T  
Sbjct: 339 LSECNILFGPN-----LKVAAPGEFIGSIAIPLPVGSVLVLNGNGADVAKHCVPAVPT-- 391

Query: 240 RRAVILLRRVLP 251
           +R  I  R++ P
Sbjct: 392 KRISITFRKMDP 403


>gi|224134110|ref|XP_002327758.1| predicted protein [Populus trichocarpa]
 gi|222836843|gb|EEE75236.1| predicted protein [Populus trichocarpa]
          Length = 213

 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 61/126 (48%), Gaps = 5/126 (3%)

Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
           VDPIP   + ++I +L Q  V+PP  + +S ++N Y+   CI  HID  H F RP  ++S
Sbjct: 2   VDPIPDLFK-VIIRRLVQWHVIPPTCVPDSCIVNIYEEWDCIPPHID-SHDFVRPFCTVS 59

Query: 184 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAV 243
             S   + FG           +  + +P+  G V +L    A+   HCV    T  +R  
Sbjct: 60  FLSQCNILFGSNLKVVGAGDFDGPIAIPLPVGSVLVLNGNGADVAKHCVPSVPT--KRIS 117

Query: 244 ILLRRV 249
           I  R++
Sbjct: 118 ITFRKM 123


>gi|440792230|gb|ELR13458.1| 2OG-Fe(II) oxygenase family oxidoreductase [Acanthamoeba
           castellanii str. Neff]
          Length = 214

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 51/100 (51%), Gaps = 9/100 (9%)

Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLP 211
           +  +IN+Y PG  I +H+D P +FD  I S+SL S   +    +F  K  + + PVL   
Sbjct: 110 DQVIINEYLPGQGISAHVDKPSLFDNEIASISLGSTCVM----EFKHKATKTTHPVL--- 162

Query: 212 VQRGCVTLLRDFAANGITHCV--RPQDTQHRRAVILLRRV 249
           + R  + L++  A    THC+  R +D    R +   RRV
Sbjct: 163 LGRRSLVLMKGAARYEWTHCIPARKKDKVDGRQIPRQRRV 202


>gi|356525956|ref|XP_003531587.1| PREDICTED: uncharacterized protein LOC100306211 [Glycine max]
          Length = 423

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 71/153 (46%), Gaps = 10/153 (6%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVIN 157
           FG  Y Y    V K      +  + EVDP+P   + + I ++ +  ++P   I +S ++N
Sbjct: 229 FGCCYNYA---VDKNGSPPGIMRDEEVDPLPPVFKQM-IKRMVRWNIIPSTCIPDSCIVN 284

Query: 158 DYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFK-PIRVSEPVLYLPVQRGC 216
            Y+ G CI  HID  H F RP  ++S  ++  + FG       P   + PV  +P+  G 
Sbjct: 285 IYEEGDCIPPHID-HHDFVRPFYTVSFLNECKILFGSNLQVVCPGEFAGPV-SIPLPVGS 342

Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
           V +L    A+   HC+       +R  I  R++
Sbjct: 343 VFVLNGNGADIAKHCI--PSVSSKRISITFRKM 373


>gi|148906930|gb|ABR16610.1| unknown [Picea sitchensis]
          Length = 496

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 3/110 (2%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
           EVDP+P   ++  I ++ +  V+PP+ I +S ++N Y  G CI  HID  H F RP  ++
Sbjct: 290 EVDPLPPLFKA-AIRRMVRWHVLPPSCIPDSCIVNIYDEGDCIPPHID-HHDFVRPFCTV 347

Query: 183 SLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
           SL S+  + FG                +P+  G V +L    A+   H V
Sbjct: 348 SLLSECNIIFGSNLKILGPGEFAGSTAIPLPMGSVLILNGNGADVAKHSV 397


>gi|297793433|ref|XP_002864601.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297310436|gb|EFH40860.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 302

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 99/218 (45%), Gaps = 22/218 (10%)

Query: 45  LRQKILQGVNQMSL----FSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFG 100
           +++KI+  ++ + L    FS +E    +R ++EV  L E+G+             K+  G
Sbjct: 68  VKEKIVNVLDGLELHTGVFSAKE---QKRIVNEVYQLQEKGRKGELKEHTFNAPRKWMRG 124

Query: 101 EGYT-------YGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI-N 152
           +G         Y   + R G     L  E EVDP+P   + ++I +L +  V+PP  + +
Sbjct: 125 KGRVTIQFGCCYNYAIDRSGNPPGILQRE-EVDPLPPLFK-VMIRRLIKWHVLPPTCVPD 182

Query: 153 SAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFK-PIRVSEPVLYLP 211
           S ++N Y  G CI  HID  H F RP  ++S  S+  + FG     + P   S P    P
Sbjct: 183 SCIVNIYDEGDCIPPHID-NHDFLRPFCTISFLSECDILFGSNLKTEGPGEFSGP-YSTP 240

Query: 212 VQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRV 249
           +  G V +L    A+   H V    T  +R  I  R++
Sbjct: 241 LPVGSVLVLNGNGADVAKHYVPAVPT--KRISITFRKM 276


>gi|308810579|ref|XP_003082598.1| unnamed protein product [Ostreococcus tauri]
 gi|116061067|emb|CAL56455.1| unnamed protein product [Ostreococcus tauri]
          Length = 544

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 85/189 (44%), Gaps = 13/189 (6%)

Query: 52  GVNQM-SLFSHEECLALERNIDEVSSLAEQGQYKP---CTVDRAPLRNKYFFGEGYTYGS 107
           GVN M    + +E    ER I E++  A QG        T      R K FFG  Y   S
Sbjct: 327 GVNVMEGALTDDEIHEAERGILELTRKALQGDLGGDPFVTKAGTTSRIKLFFGFAYEKRS 386

Query: 108 QLVRKGLGNERLYPE--GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDY-QPGG 163
               K    +RL     G  DP+ A +  L  T L++ GV  P+++ N  V+N Y + G 
Sbjct: 387 DSRNK---PQRLIKNVPGIDDPLAAPLHKLAET-LQKRGVFAPDFVPNQYVLNIYGRAGA 442

Query: 164 CIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDF 223
            +++H D  H+F+ PI  + LF+   L F    + + I V   ++ +    G VT +  F
Sbjct: 443 YLMAHKDALHLFEGPIYGVRLFNPRILSFAPDGHMR-INVDRGMIDVMQSVGSVTSMDGF 501

Query: 224 AANGITHCV 232
           A   + H +
Sbjct: 502 AKEDVQHSI 510


>gi|313240624|emb|CBY32949.1| unnamed protein product [Oikopleura dioica]
          Length = 265

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 28/45 (62%)

Query: 212 VQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           +QRG +T       N I HCVRP+DT+ RR  I+LRRV  + P L
Sbjct: 1   MQRGIITSFEGMCMNEIKHCVRPEDTKERRVAIILRRVCNNGPVL 45


>gi|7770341|gb|AAF69711.1|AC016041_16 F27J15.23 [Arabidopsis thaliana]
          Length = 264

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 61/129 (47%), Gaps = 9/129 (6%)

Query: 69  RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
           R +D+V  L E+   G+ K     +   R+   FG  + Y +       G   +     V
Sbjct: 118 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 174

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DP+P   + ++I +L +  V+PP  + +  V+N Y  G CI  HID  H F RP  ++S 
Sbjct: 175 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 232

Query: 185 FSDSALCFG 193
            S+  + FG
Sbjct: 233 LSECNILFG 241


>gi|3036813|emb|CAA18503.1| hypothetical protein [Arabidopsis thaliana]
 gi|7270561|emb|CAB81518.1| hypothetical protein [Arabidopsis thaliana]
          Length = 505

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 111/283 (39%), Gaps = 60/283 (21%)

Query: 49  ILQGVN-QMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYF--------- 98
           IL+G+     +FS    +  ++ +D V  L E+G+       R  LR + F         
Sbjct: 216 ILEGLELHTGVFS---AVEQKKIVDFVYELQEKGR-------RGELRERTFTAPHKWMRG 265

Query: 99  -------FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
                  FG  Y Y         G   +   G+VDP+P+  + +               I
Sbjct: 266 KGRVTIQFGCCYNYAPDKAGNPPG---ILQRGDVDPMPSIFKVI---------------I 307

Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSE--PVLY 209
            S ++N Y+   CI  HID  H F RP  ++S  S+  + FG   N K +   E      
Sbjct: 308 KSCIVNIYEEDDCIPPHID-NHDFLRPFCTVSFLSECNILFGS--NLKVLGPGEFSGSYS 364

Query: 210 LPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTPRVK---- 265
           +P+  G V +L+   A+   HCV    T  +R  I  R++      +  +  P ++    
Sbjct: 365 IPLPVGSVLVLKGNGADVAKHCVPAVPT--KRISITFRKMDESKRPVGFTPEPDLEEIKP 422

Query: 266 -PY-HSLYDVQARESGGYFRSSIETYNNNENHSNKNTSNGLDK 306
            PY H+     A       RSSI+   N  NH+N+    G  K
Sbjct: 423 LPYEHTTPSTPADAVISSSRSSID--QNGSNHNNRTAHGGGSK 463


>gi|395836690|ref|XP_003791285.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5
           [Otolemur garnettii]
          Length = 174

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/38 (57%), Positives = 28/38 (73%)

Query: 219 LLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           L   +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 35  LFSGYAADEITHCIRPQDIKERRAVIILRKTRLDAPRL 72


>gi|440789981|gb|ELR11271.1| DNA repair protein alkB, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 253

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 64/134 (47%), Gaps = 20/134 (14%)

Query: 128 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 187
           +P + Q  +I K+ Q G+  P + +  +IN+Y PG  I  HID  H F   ++SLSL S 
Sbjct: 127 LPVFAQQ-IIEKMRQRGL--PQF-DQLIINEYTPGQGINPHIDKTHCFGPCVVSLSLLST 182

Query: 188 SALCFGCKFNFKPIRVSE--PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVIL 245
                 C   F  +   E  PV+  P     + +LR  A  G  H + P     +RA I+
Sbjct: 183 ------CVMTFTSLETGEKIPVVLRPR---SLVVLRGQARYGWQHGIEP-----KRADIV 228

Query: 246 LRRVLPHAPRLTLS 259
             +  P A R++L+
Sbjct: 229 AGKHTPRARRVSLT 242


>gi|397614206|gb|EJK62658.1| hypothetical protein THAOC_16720 [Thalassiosira oceanica]
          Length = 396

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKF------NFKPIRVSE 205
           +S +IN Y  G CI  H+D    ++RPI +LSL  + A+  G KF       +KPI V +
Sbjct: 208 DSCIINQYGEGDCIPPHVDH-ESYERPISTLSLLGEEAMLLGTKFKTVKNCTWKPI-VGQ 265

Query: 206 PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLP 251
            VL LP  R  + +L   + N   HC+       RR  I LR+  P
Sbjct: 266 SVL-LP--RRSLLVLGGNSGNISKHCI--SACTRRRISITLRKQPP 306


>gi|345305229|ref|XP_001510391.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH5-like [Ornithorhynchus anatinus]
          Length = 150

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 27/34 (79%)

Query: 223 FAANGITHCVRPQDTQHRRAVILLRRVLPHAPRL 256
           +AA+ ITHC+RPQD + RRAVI+LR+    APRL
Sbjct: 13  YAADEITHCIRPQDIRERRAVIILRKTRLDAPRL 46


>gi|71425222|ref|XP_813050.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877899|gb|EAN91199.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 305

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 21/219 (9%)

Query: 88  VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
           V++  LR  Y   + YT  S  +R  L + R     ++ P+P      +I+++EQ G+VP
Sbjct: 45  VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 96

Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
             ++N+   N Y+PG  I +HID   ++D     +SL S++ L F    N + +    P 
Sbjct: 97  DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 155

Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
                  G + ++   A     H V P + Q R +++  R +L          TP   + 
Sbjct: 156 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 208

Query: 266 PY---HSLYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           PY   H L  + A++ GG   +  + Y   E     +T+
Sbjct: 209 PYRSTHILNTLYAKQIGGVRVTVDDNYLEKEGIGAFDTA 247


>gi|42567443|ref|NP_195332.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661215|gb|AEE86615.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 385

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 43/72 (59%), Gaps = 3/72 (4%)

Query: 123 GEVDPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIIS 181
           G+VDP+P+ I  ++I +L    V+PP  + +S ++N Y+   CI  HID  H F RP  +
Sbjct: 294 GDVDPMPS-IFKVIIKRLVGWHVLPPTCVPDSCIVNIYEEDDCIPPHID-NHDFLRPFCT 351

Query: 182 LSLFSDSALCFG 193
           +S  S+  + FG
Sbjct: 352 VSFLSECNILFG 363


>gi|11094804|gb|AAG29733.1|AC084414_1 hypothetical protein, 3' partial [Arabidopsis thaliana]
          Length = 235

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 9/123 (7%)

Query: 69  RNIDEVSSLAEQ---GQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEV 125
           R +D+V  L E+   G+ K     +   R+   FG  + Y +       G   +     V
Sbjct: 117 RIVDKVCELQEKVQKGELKRAFTAQGKGRSTIQFGCCFNYRTSKTGNLAG---ILKHETV 173

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYI-NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DP+P   + ++I +L +  V+PP  + +  V+N Y  G CI  HID  H F RP  ++S 
Sbjct: 174 DPLPHLFK-VIIRRLVKWHVLPPTCVPDCCVVNIYDEGDCIPPHID-HHDFLRPFCTVSF 231

Query: 185 FSD 187
            S+
Sbjct: 232 LSE 234


>gi|407408875|gb|EKF32135.1| hypothetical protein MOQ_004021 [Trypanosoma cruzi marinkellei]
          Length = 305

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 21/219 (9%)

Query: 88  VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
           V++  LR  Y   + YT  S  +R  L + R     ++ P+P      +I+++EQ G+VP
Sbjct: 45  VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 96

Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
             ++N+   N Y+PG  I +HID   ++D     +SL S++ L F    N + +    P 
Sbjct: 97  DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 155

Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
                  G + ++   A     H V P + Q R +++  R +L          TP   + 
Sbjct: 156 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 208

Query: 266 PYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           PY S   L  + A++ GG   +  + Y   E     +T+
Sbjct: 209 PYRSTQILNTLYAKQIGGVRVTVDDNYLEKEEIGAFDTA 247


>gi|407843604|gb|EKG01501.1| hypothetical protein TCSYLVIO_007502 [Trypanosoma cruzi]
          Length = 354

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 97/219 (44%), Gaps = 21/219 (9%)

Query: 88  VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
           V++  LR  Y   + YT  S  +R  L + R     ++ P+P      +I+++EQ G+VP
Sbjct: 94  VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 145

Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
             ++N+   N Y+PG  I +HID   ++D     +SL S++ L F    N + +    P 
Sbjct: 146 DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 204

Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
                  G + ++   A     H V P + Q R +++  R +L          TP   + 
Sbjct: 205 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 257

Query: 266 PYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           PY S   L  + A++ GG   +  + Y   E     +T+
Sbjct: 258 PYRSTQILNTLYAKQIGGVRVTVDDNYLEKEGIGAFDTA 296


>gi|428311031|ref|YP_007122008.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
 gi|428252643|gb|AFZ18602.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
          Length = 206

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 66/151 (43%), Gaps = 22/151 (14%)

Query: 99  FGEGYTY--GSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVI 156
           +G  Y Y  GS +    LGN           +P W Q++ +   E   +V  N     ++
Sbjct: 60  YGYKYDYKDGSFVASTHLGN-----------LPDWAQNVAVRLAEDGFMV--NVPEQVIV 106

Query: 157 NDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGC 216
           N+YQPG  IVSH D    F   II+LSL S+      C  NF   +  + V  L +Q G 
Sbjct: 107 NEYQPGQGIVSHTDCIPCFGNTIITLSLGSE------CVMNFTHSQTQKEVGIL-LQAGS 159

Query: 217 VTLLRDFAANGITHCVRPQDTQHRRAVILLR 247
           + + +  A     H + P+   + +  I +R
Sbjct: 160 LLIFKGEARYIWKHGIVPRKRDNYKGRIFMR 190


>gi|357442687|ref|XP_003591621.1| hypothetical protein MTR_1g089860 [Medicago truncatula]
 gi|355480669|gb|AES61872.1| hypothetical protein MTR_1g089860 [Medicago truncatula]
          Length = 513

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 101/218 (46%), Gaps = 17/218 (7%)

Query: 58  LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGL--- 114
           +F+  E   L   +DE+ +  + G+    T     L NK   G       +L++ G+   
Sbjct: 202 VFTDSELCKLSDFVDEIHTAGQNGELSGETFI---LFNKQMKGNK----RELIQLGVPIF 254

Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
           G  +   +  ++PIPA +Q  VI    Q  ++P     N  +IN ++ G      + PPH
Sbjct: 255 GQIKEDTKSNIEPIPALLQR-VIDHFIQWHLLPEYKRPNGCIINFFEEGEFSQPFLKPPH 313

Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
           + D+P+ +L L S+S + FG     +     +  L L +++G + ++R  +A+   H + 
Sbjct: 314 L-DQPVTTL-LLSESTMAFGRILMSENDGNYKGPLMLSLKKGSLLVMRGNSADMARHVMC 371

Query: 234 PQDTQHRRAVILLRRVLPHAPRLTLSQTPRVKPYHSLY 271
           P  + +RR  I   RV P + +   S TP +    +++
Sbjct: 372 P--SPNRRVSITFFRVRPDSNQCQ-SPTPTMTSAMTMW 406


>gi|328772232|gb|EGF82271.1| hypothetical protein BATDEDRAFT_87049 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 225

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 62/137 (45%), Gaps = 19/137 (13%)

Query: 128 IPAWIQSLVITK--LEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +PA+  S VI +  L +  V P +  N  ++N+YQPG  I+ H+D    F   + SLSL+
Sbjct: 95  LPAF-SSFVIDRMLLPEFNVFPNDPPNHVLVNEYQPGQGIMPHVDSQDTFGDVVTSLSLW 153

Query: 186 SDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT-------- 237
           S   + FG K   + + +  P       R  + +L   A    TH +  +D         
Sbjct: 154 SSCVMSFGNKMTGEKVHLELP-------RRSLLILTGDARTHYTHAIPKEDMLFAGNECV 206

Query: 238 -QHRRAVILLRRVLPHA 253
            + RR  + +R +L  A
Sbjct: 207 DRGRRVSLTIRSILKSA 223


>gi|343474281|emb|CCD14041.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 305

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 13/180 (7%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
           P+P    + ++ ++EQ G+VP  ++N+   N Y+PG  I +HID   ++D     +SL S
Sbjct: 76  PLPTGSFTPLLDRIEQDGIVPRGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAVVSLGS 135

Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
           ++ L F    N + + V  P        G + ++   +     H V P + Q R +V+  
Sbjct: 136 NALLRFVHVQNGEELDVVIP-------DGSLYIMSGPSRYVYFHMVLPVEAQ-RVSVVFR 187

Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           R +L          TP   + PY S   L  + A++ GG      ++Y   E     +T+
Sbjct: 188 RSILNSDGGFRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRIVMDDSYLEKEEIGAFDTA 247


>gi|342180903|emb|CCC90378.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 305

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 13/180 (7%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
           P+P    + ++ ++EQ G+VP  ++N+   N Y+PG  I +HID   ++D     +SL S
Sbjct: 76  PLPTGSFTPLLGRIEQDGIVPRGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAVVSLGS 135

Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
           ++ L F    N + + V  P        G + ++   +     H V P + Q R +V+  
Sbjct: 136 NALLRFVHVQNGEELDVVIP-------DGSLYIMSGPSRYVYFHMVLPVEAQ-RVSVVFR 187

Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           R +L          TP   + PY S   L  + A++ GG      ++Y   E     +T+
Sbjct: 188 RSILNSDGGFRPVTTPLGDLMPYRSTQILNTLYAKQIGGVRIVMDDSYLEKEEIGAFDTA 247


>gi|219883161|ref|YP_002478323.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
 gi|219867286|gb|ACL47624.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
          Length = 204

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 13/117 (11%)

Query: 76  SLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSL 135
           +L E+   +P   D    R    +G  Y Y ++ +          PE  +  +P W++ L
Sbjct: 41  TLLEEIDQQPWITDLK--RRVQHYGYRYDYKARAIS---------PEAYLGTLPEWLKPL 89

Query: 136 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
              +L Q G +P +  +  ++N+Y PG  I +HID    F   I+SLSL SD  + F
Sbjct: 90  T-NRLWQEGYIP-DLPDQVIVNEYIPGQGITAHIDCIDCFSDTILSLSLGSDCIMRF 144


>gi|30690892|ref|NP_182329.2| oxidoreductase, 2OG-Fe(II) oxygenase-like protein [Arabidopsis
           thaliana]
 gi|330255838|gb|AEC10932.1| oxidoreductase, 2OG-Fe(II) oxygenase-like protein [Arabidopsis
           thaliana]
          Length = 438

 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 58/229 (25%), Positives = 105/229 (45%), Gaps = 39/229 (17%)

Query: 38  SKHHARELRQKILQGVNQ-MSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNK 96
           +K H R     +++G+     +F+  +   L + +D ++ L E G+    + +   L NK
Sbjct: 153 AKEHVRGHTANVVKGLKLYQDVFTRPQ---LSKLLDSINQLREAGRNHQLSGETFVLFNK 209

Query: 97  YFFGEGYTYGSQLVRKGL---GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINS 153
              G       +L++ G+   GN     E  V+PIP  +QS VI  L Q  ++P      
Sbjct: 210 NTKGTKR----ELLQLGVPIFGNT--TDEHSVEPIPTLVQS-VIDHLLQWRLIP------ 256

Query: 154 AVINDYQ-PGGCIVSHID----------PPHIFDRPIISLSLFSDSALCFGCKFNFKPIR 202
               +Y+ P GC+++  D          PPH+ D+PI +L L S+S + FG +       
Sbjct: 257 ----EYKRPNGCVINFFDEDEHSQPFQKPPHV-DQPISTLVL-SESTMVFGHRLGVDNDG 310

Query: 203 VSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLP 251
                L LP++ G + ++R  +A+   H + P  + ++R  I   ++ P
Sbjct: 311 NFRGSLTLPLKEGSLLVMRGNSADMARHVMCP--SPNKRVAITFFKLKP 357


>gi|115345714|ref|YP_771891.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
 gi|115293033|gb|ABI93483.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
          Length = 195

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 13/120 (10%)

Query: 120 YPEGEVDPIPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDR 177
           + E  + P+P   QSL   +T       VP    +  ++N+YQPG  I +HID    F  
Sbjct: 64  WREDYLGPLPELFQSLAERLTAEGHFQTVP----DQVIVNEYQPGQGISAHIDCQPCFGE 119

Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDT 237
            I SLSL S   + F         R+    + L +Q   + +L+  A +  TH + P+ T
Sbjct: 120 TIASLSLLSACVMRFAS-------RIYSQQMELHLQPSSLLVLQSDARHLWTHAIPPRKT 172


>gi|71423866|ref|XP_812599.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877399|gb|EAN90748.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 305

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 96/219 (43%), Gaps = 21/219 (9%)

Query: 88  VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
           V++  LR  Y   + YT  S  +R  L + R     ++ P+P      +I+++EQ G+VP
Sbjct: 45  VEKITLRKGYV--DVYTQKSASIR--LSDGRF----QLPPLPPRSFLSLISRIEQDGIVP 96

Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
             ++N+   N Y+PG  I +HID   ++D     +SL S++ L F    N + +    P 
Sbjct: 97  DGWLNNQTANLYEPGDFIRAHIDNLFVYDDIFAIVSLGSNALLRFVHVQNGEELDAVVP- 155

Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
                  G + ++   A     H V P + Q R +++  R +L          TP   + 
Sbjct: 156 ------DGSLYIMSGPARYVYFHMVLPVEEQ-RFSIVFRRSILNSDGGFRPVTTPLGDLM 208

Query: 266 PYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           PY S   L  +  ++ GG   +  + Y   E     +T+
Sbjct: 209 PYRSTQILNTLYTKQIGGVRVTVDDNYLEKEGIGAFDTA 247


>gi|297828529|ref|XP_002882147.1| hypothetical protein ARALYDRAFT_322448 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327986|gb|EFH58406.1| hypothetical protein ARALYDRAFT_322448 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 454

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 88/196 (44%), Gaps = 29/196 (14%)

Query: 67  LERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVD 126
           L + +D ++ L E G+    + +   L NK   G       QL     GN     E  V+
Sbjct: 187 LSKLLDSINQLREAGRNHQLSGETFVLFNKNTKGTKREL-LQLGVPIFGNT--TDEHSVE 243

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID----------PPHIF 175
           PIP  +QS VI  L Q  V+P          +Y+ P GC+++  D          PPH+ 
Sbjct: 244 PIPTLVQS-VIDHLLQWRVIP----------EYKRPNGCVINFFDEDEHSQPFQKPPHV- 291

Query: 176 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQ 235
           D+PI +L L S+S + FG +            L LP++ G + ++R  +A+   H + P 
Sbjct: 292 DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARHVMCP- 349

Query: 236 DTQHRRAVILLRRVLP 251
            + ++R  I   ++ P
Sbjct: 350 -SPNKRVAITFFKLKP 364


>gi|4249414|gb|AAD13711.1| unknown protein [Arabidopsis thaliana]
          Length = 433

 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 88/196 (44%), Gaps = 29/196 (14%)

Query: 67  LERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVD 126
           L + +D ++ L E G+    + +   L NK   G       QL     GN     E  V+
Sbjct: 175 LSKLLDSINQLREAGRNHQLSGETFVLFNKNTKGTKREL-LQLGVPIFGNT--TDEHSVE 231

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQ-PGGCIVSHID----------PPHIF 175
           PIP  +QS VI  L Q  ++P          +Y+ P GC+++  D          PPH+ 
Sbjct: 232 PIPTLVQS-VIDHLLQWRLIP----------EYKRPNGCVINFFDEDEHSQPFQKPPHV- 279

Query: 176 DRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQ 235
           D+PI +L L S+S + FG +            L LP++ G + ++R  +A+   H + P 
Sbjct: 280 DQPISTLVL-SESTMVFGHRLGVDNDGNFRGSLTLPLKEGSLLVMRGNSADMARHVMCP- 337

Query: 236 DTQHRRAVILLRRVLP 251
            + ++R  I   ++ P
Sbjct: 338 -SPNKRVAITFFKLKP 352


>gi|356502827|ref|XP_003520217.1| PREDICTED: uncharacterized protein LOC100791586 [Glycine max]
          Length = 514

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 94/200 (47%), Gaps = 16/200 (8%)

Query: 58  LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGL--- 114
           +F+  E   L   ++E+ +  + G+    T     L NK   G       +L++ G+   
Sbjct: 201 IFTDSELCKLTDFVNEIHAAGQNGELSGETFI---LFNKQMKGNKR----ELIQLGVPIF 253

Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
           G  +   +  ++PIP  +Q  VI  L Q  ++P     N  +IN ++ G      + PPH
Sbjct: 254 GQIKEDAKSNIEPIPVLLQG-VIDHLIQWQLLPEYKRPNGCIINFFEKGEFSQPFLKPPH 312

Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
           + D+P+ +L L S+SA+ FG     +     +  L L +++G + ++R  +A+   H + 
Sbjct: 313 L-DQPVSTL-LLSESAMAFGRILMSENDGNYKGPLTLSLKQGSLLVMRGNSADMARHVMC 370

Query: 234 PQDTQHRRAVILLRRVLPHA 253
           P  + +RR  I   RV P +
Sbjct: 371 P--SPNRRVSITFFRVRPDS 388


>gi|146100771|ref|XP_001468941.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398023105|ref|XP_003864714.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134073310|emb|CAM72036.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322502950|emb|CBZ38034.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 318

 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 44/185 (23%), Positives = 83/185 (44%), Gaps = 13/185 (7%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
           P+P      ++ +LEQ  VVP N++N+   N Y+PG  I +HID   ++D       +F+
Sbjct: 79  PLPPASFMPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132

Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
             +L   C   F  ++  E +  +   R  V ++   A     H V P + Q R +++  
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190

Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           R ++         +TP   + PY +   L  + +++ GG   S  + +  + N    +TS
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESANIGAFDTS 250

Query: 302 NGLDK 306
             + +
Sbjct: 251 RWVKR 255


>gi|445456942|ref|ZP_21446201.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
           baumannii OIFC047]
 gi|444777446|gb|ELX01476.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
           baumannii OIFC047]
          Length = 200

 Score = 45.1 bits (105), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 38/67 (56%), Gaps = 4/67 (5%)

Query: 127 PIPAWIQSLVITKLEQMGV-VPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           PIP+W+  +    +++  + +PPN +   +IN+Y+PG  I  HID P  F   II +SL 
Sbjct: 76  PIPSWLIRMQKDLMDECSLDLPPNQL---IINEYEPGQGITDHIDAPDEFGETIIMVSLG 132

Query: 186 SDSALCF 192
           S   + F
Sbjct: 133 SSCVMDF 139


>gi|356570554|ref|XP_003553450.1| PREDICTED: uncharacterized protein LOC100776159 [Glycine max]
          Length = 525

 Score = 45.1 bits (105), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 89/198 (44%), Gaps = 16/198 (8%)

Query: 58  LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFG---EGYTYGSQLVRKGL 114
           +FS  E   L   + E+ + A+ G+    T     L NK   G   E    G  + R+  
Sbjct: 216 VFSESEICKLTDFVKEIHAAAQNGELSGETFI---LFNKQMKGNKRELIQLGVPIFRQIK 272

Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
            + +      ++PIPA +   VI  L Q  ++P     N  +IN ++ G      + PPH
Sbjct: 273 DDNK----SNIEPIPALLHD-VIDHLIQWKLIPEYKRPNGCIINFFEEGEFSQPFLKPPH 327

Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
           + D+P+ +L L S+S + FG     +     +  L L ++ G + ++R  +A+   H + 
Sbjct: 328 L-DQPLSTL-LLSESTMAFGRILMSENDGNYKGPLMLSLKEGSLLVMRGNSADMARHVMC 385

Query: 234 PQDTQHRRAVILLRRVLP 251
           P  + +RR  I   RV P
Sbjct: 386 P--SPNRRVSITFFRVRP 401


>gi|154344577|ref|XP_001568230.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065567|emb|CAM43337.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 318

 Score = 45.1 bits (105), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 41/159 (25%), Positives = 73/159 (45%), Gaps = 13/159 (8%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
           P+P    + ++ +LEQ  VVP N++N+   N Y+PG  I +HID   ++D       +F+
Sbjct: 79  PLPPGSFTPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132

Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
             +L   C   F  ++  E +  +   R  V ++   A     H V P + Q R +++  
Sbjct: 133 VCSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190

Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGG 280
           R ++          TP   V PY +   L  + +++ GG
Sbjct: 191 RSIMESDGGFRPVTTPLKEVMPYRATQILNALYSKQVGG 229


>gi|401429284|ref|XP_003879124.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322495374|emb|CBZ30678.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 318

 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 43/185 (23%), Positives = 84/185 (45%), Gaps = 13/185 (7%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
           P+P+     ++ +LEQ  VVP N++N+   N Y+PG  I +HID   ++D       +F+
Sbjct: 79  PLPSASFLPLLERLEQDNVVPKNWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132

Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
             +L   C   F  ++  E +  +   R  V ++   A     H V P + Q R +++  
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190

Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           R ++         +TP   + PY +   L  + +++ GG   S  + +  + +    +TS
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESADIGAFDTS 250

Query: 302 NGLDK 306
             + +
Sbjct: 251 RWVKR 255


>gi|356536752|ref|XP_003536899.1| PREDICTED: uncharacterized protein LOC100802371 [Glycine max]
          Length = 466

 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 94/200 (47%), Gaps = 16/200 (8%)

Query: 58  LFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGL--- 114
           +F+  E   L   ++E+ +  + G+    T     L NK   G       +L++ G+   
Sbjct: 187 IFTDSELCKLTDFVNEIHAAGQNGELSGETFI---LFNKQMKGNK----RELIQLGVPIF 239

Query: 115 GNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPP-NYINSAVINDYQPGGCIVSHIDPPH 173
           G  +   +  ++PIPA +Q  VI  L Q  ++P     N  +IN ++ G      + PPH
Sbjct: 240 GQIKEDAKCNIEPIPALLQG-VIDHLIQWQLLPEYKRPNGCIINFFEEGEFSQPFLKPPH 298

Query: 174 IFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
           + D+P+ +L L S+S + FG     +     +  L L +++G + ++R  +A+   H + 
Sbjct: 299 L-DQPVSTL-LLSESTMAFGRILMSENDGNYKGPLTLSLKQGSLLVMRGNSADMARHVMC 356

Query: 234 PQDTQHRRAVILLRRVLPHA 253
           P  + +RR  I   RV P +
Sbjct: 357 P--SPNRRVSITFFRVRPDS 374


>gi|195027836|ref|XP_001986788.1| GH20337 [Drosophila grimshawi]
 gi|193902788|gb|EDW01655.1| GH20337 [Drosophila grimshawi]
          Length = 616

 Score = 44.3 bits (103), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 19/112 (16%)

Query: 86  CTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPA-----WIQSLVITKL 140
           CT D    R+   FG  + YGS  V      E+        PIPA     W Q L  TK+
Sbjct: 162 CTEDTLKHRHVKHFGYEFLYGSNNVDAAQPLEQ--------PIPAACDFLW-QRLDSTKI 212

Query: 141 EQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
              G   P+ +    +N+Y+PG  I  H+D    F  PI+SLSL +D  + F
Sbjct: 213 S--GTFLPDQLT---VNEYEPGQGIPPHVDTHSAFVDPILSLSLQADVVMDF 259


>gi|124002832|ref|ZP_01687684.1| alkylated DNA repair protein [Microscilla marina ATCC 23134]
 gi|123992060|gb|EAY31447.1| alkylated DNA repair protein [Microscilla marina ATCC 23134]
          Length = 192

 Score = 44.3 bits (103), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 59/136 (43%), Gaps = 17/136 (12%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
           +V  +P W Q +V   +++   +   Y +  +IN+Y PG  I  H+D    FD  I+S+S
Sbjct: 62  KVGDLPHWAQKIVQQAVDEQ--LLSEYFDQMIINEYLPGQGIARHVDCEPCFDHTIMSVS 119

Query: 184 LFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQH---- 239
           L +   + F    + +   +  PVL  P   G   LL   A     H +R   ++     
Sbjct: 120 LGTACVMHFN---SLEDKNLDVPVLLAP---GSAILLSGDARYRWQHSIRANKSEMFEGQ 173

Query: 240 -----RRAVILLRRVL 250
                RR  +  R+V+
Sbjct: 174 KIVRGRRVSLTFRKVI 189


>gi|72388954|ref|XP_844772.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62176055|gb|AAX70176.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801306|gb|AAZ11213.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|261328019|emb|CBH10996.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 305

 Score = 44.3 bits (103), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 75/172 (43%), Gaps = 13/172 (7%)

Query: 136 VITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK 195
           ++ +LEQ G+VP  ++N+   N Y+P   I +HID   ++D     +SL +++ L F   
Sbjct: 85  LVERLEQDGIVPRGWLNNQTANLYEPEDFIRAHIDNLFVYDDIFAVISLGANALLRFVHV 144

Query: 196 FNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPR 255
            N + + V  P        G V ++   +     H V P + Q R +++  R +L     
Sbjct: 145 QNGEELDVVIP-------DGSVYIMSGPSRYVYFHMVLPVEAQ-RVSIVFRRSILNSDGG 196

Query: 256 LTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTSN 302
                TP   + PY S   L  + A++ GG   +  + Y   E     +T  
Sbjct: 197 FRPVSTPLGDLMPYRSTQILNTLYAKQIGGVRVAVDDKYLEKEEIGAFDTGK 248


>gi|414077137|ref|YP_006996455.1| alkylated DNA repair protein [Anabaena sp. 90]
 gi|413970553|gb|AFW94642.1| alkylated DNA repair protein [Anabaena sp. 90]
          Length = 226

 Score = 43.9 bits (102), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 36/74 (48%), Gaps = 9/74 (12%)

Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           + P+P W  +L   +L +  V  P   +  ++NDYQPG  I SH D    F   IISLS 
Sbjct: 98  IAPLPDWALTLA-QRLHK--VFSPTVPDQVIVNDYQPGQGISSHTDCVSCFTDVIISLS- 153

Query: 185 FSDSALCFGCKFNF 198
                LC  C  +F
Sbjct: 154 -----LCSSCVMDF 162


>gi|341897299|gb|EGT53234.1| hypothetical protein CAEBREN_14010 [Caenorhabditis brenneri]
          Length = 603

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 37/70 (52%), Gaps = 4/70 (5%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
           E +PIP  I+ L+   L    +   P+ I + V   Y+PG  I SH D    FD PI+S+
Sbjct: 193 EAEPIPQIIEDLIDRLLSDKYITERPDQITANV---YEPGHGIPSHYDTHSAFDDPIVSV 249

Query: 183 SLFSDSALCF 192
           SL SD  + F
Sbjct: 250 SLLSDVVMEF 259


>gi|418296695|ref|ZP_12908538.1| hypothetical protein ATCR1_04209 [Agrobacterium tumefaciens
           CCNWGS0286]
 gi|355538870|gb|EHH08112.1| hypothetical protein ATCR1_04209 [Agrobacterium tumefaciens
           CCNWGS0286]
          Length = 187

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 70/168 (41%), Gaps = 29/168 (17%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           FG  Y Y ++ V          P+  + P+P W   +   +L   G    N  +  + N+
Sbjct: 40  FGYRYDYKARTVT---------PDAYLGPLPQWF-GIFAERLVTEGYCR-NLPDQVIANE 88

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVT 218
           Y PG  I +HID    FD  I+S+SL S       C+  F  +R       L   R  V 
Sbjct: 89  YLPGQGISAHIDCVPCFDDTIVSISLLS------ACEMVFHDVRGPAACGVLLQPRSGV- 141

Query: 219 LLRDFAANGITHCV--RPQDT-------QHRRAVILLRRVLPHAPRLT 257
           LLRD    G TH +  R  D        + RR  +  R+V+  AP+ +
Sbjct: 142 LLRDSGRYGWTHEIPARKSDIVNGVRTDRSRRISLTFRKVI--APKWS 187


>gi|170079543|ref|YP_001736177.1| 2OG-Fe(II) oxygenase family oxidoreductase [Synechococcus sp. PCC
           7002]
 gi|169887212|gb|ACB00922.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Synechococcus sp. PCC
           7002]
          Length = 204

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 6/72 (8%)

Query: 128 IPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +P W+  ++  +  L  +  +P    + A+IN+Y PG  I SHID    F   I+SLSL 
Sbjct: 81  LPDWLFPIIEQMVSLNLISEIP----DQAIINEYLPGQGITSHIDCKPCFTDTIVSLSLN 136

Query: 186 SDSALCFGCKFN 197
           S   + F C  N
Sbjct: 137 SPCVMNFDCTVN 148


>gi|308810811|ref|XP_003082714.1| unnamed protein product [Ostreococcus tauri]
 gi|116061183|emb|CAL56571.1| unnamed protein product [Ostreococcus tauri]
          Length = 597

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 9/73 (12%)

Query: 121 PEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 180
           P+  +D + A I   VIT+ E+M        +S  IN Y+ G  I  HID P +F RP +
Sbjct: 93  PKALIDVVDALIARGVITEDERM--------DSCTINVYERGMWIPPHIDNP-LFARPFV 143

Query: 181 SLSLFSDSALCFG 193
           ++SL S  A+  G
Sbjct: 144 TVSLASAQAMTLG 156


>gi|410897523|ref|XP_003962248.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH6-like [Takifugu rubripes]
          Length = 234

 Score = 42.7 bits (99), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 65/135 (48%), Gaps = 10/135 (7%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P+G + + IP W+Q     K+  +G       N  ++N+Y+PG  I+ H D P ++   
Sbjct: 66  HPKGMLAEKIPDWLQKYC-EKISSLGAFGGKTANHVLVNEYKPGEGIMPHEDGP-LYYPT 123

Query: 179 IISLSLFSDSALCF----GCKFNFKPIRVSEPVLY-LPVQRGCVTLLRDFAANGITHCVR 233
           I ++SL S + L F    G      P       L+ L V+   + +L+D     + H +R
Sbjct: 124 ITTISLGSHTLLDFYTPVGSMQGDAPQTEENRFLFSLLVKPRSLLILQDDMYQRLLHGIR 183

Query: 234 P--QDTQHRRAVILL 246
           P  QDT   +AV LL
Sbjct: 184 PCDQDTLTEKAVNLL 198


>gi|195120121|ref|XP_002004577.1| GI20008 [Drosophila mojavensis]
 gi|193909645|gb|EDW08512.1| GI20008 [Drosophila mojavensis]
          Length = 602

 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 59/130 (45%), Gaps = 18/130 (13%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           K  G E LY    V+P+    Q +      +  +LE     PP+ +    +N+Y+ G  I
Sbjct: 170 KHFGYEFLYGSNNVNPLQPLEQGIPAACNFMWERLELPAFEPPDQLT---VNEYEAGQGI 226

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAA 225
             H+D    F  PI+SLSL SD  + F         R  + ++++ + R  + ++   + 
Sbjct: 227 PPHVDTHSAFKDPILSLSLQSDVVMDF---------RRGDQLVHVLLPRRSLLVMSGESR 277

Query: 226 NGITHCVRPQ 235
              TH +RP+
Sbjct: 278 YDWTHGIRPK 287


>gi|443704913|gb|ELU01726.1| hypothetical protein CAPTEDRAFT_156565 [Capitella teleta]
          Length = 610

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 61/137 (44%), Gaps = 25/137 (18%)

Query: 112 KGLGNERLYPEGEVDP-------IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 164
           K  G E LY    VDP       IP     ++   L Q  ++  N  +   +N YQPG  
Sbjct: 171 KHFGFEFLYSTNNVDPDHPLDMGIPPECSPILQRMLSQQIIL--NLPDQLTVNQYQPGQG 228

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF----GCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
           I  H+D    F+  ++SLSL S   + F    GC +         PV +LP QR  V ++
Sbjct: 229 IPPHVDTHSAFEEELVSLSLGSQVVMDFKAPGGCHY---------PV-FLP-QRSLV-VM 276

Query: 221 RDFAANGITHCVRPQDT 237
           R  +   +TH + P+ +
Sbjct: 277 RGESRYQLTHAIAPRKS 293


>gi|58394263|ref|XP_320626.2| AGAP011900-PA [Anopheles gambiae str. PEST]
 gi|55234784|gb|EAA00113.2| AGAP011900-PA [Anopheles gambiae str. PEST]
          Length = 621

 Score = 42.0 bits (97), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 55/125 (44%), Gaps = 14/125 (11%)

Query: 94  RNKYFFGEGYTYGSQLVRKGLGNERLYPE--GEVDPIPAWIQSLVITKLEQMGVVPPNYI 151
           R    FG  + YGS  V +    ER  P   GE+     W       KL+Q+      ++
Sbjct: 171 RKVKHFGYKFVYGSNNVDRTKPLERKIPSVCGEL-----W------QKLQQLHPHLRWHV 219

Query: 152 -NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYL 210
            +   +N Y+PG  I  H+D    FD PI+SLSL SD  + F    + K + V  P   L
Sbjct: 220 PDQLTVNQYEPGQGIPPHVDTHSAFDDPILSLSLGSDVVMEFKQPSSGKVVCVDLPSRSL 279

Query: 211 PVQRG 215
            +  G
Sbjct: 280 LIMSG 284


>gi|17552176|ref|NP_497751.1| Protein ALKB-8 [Caenorhabditis elegans]
 gi|15718120|emb|CAB63431.2| Protein ALKB-8 [Caenorhabditis elegans]
          Length = 591

 Score = 42.0 bits (97), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 36/70 (51%), Gaps = 4/70 (5%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
           E DPIP  I SL+   +    +   P+ + +   N Y+ G  I SH D    FD PI+S+
Sbjct: 193 EADPIPPVINSLIDRLISDKYITERPDQVTA---NVYESGHGIPSHYDTHSAFDDPIVSI 249

Query: 183 SLFSDSALCF 192
           SL SD  + F
Sbjct: 250 SLLSDVVMEF 259


>gi|434406443|ref|YP_007149328.1| alkylated DNA repair protein [Cylindrospermum stagnale PCC 7417]
 gi|428260698|gb|AFZ26648.1| alkylated DNA repair protein [Cylindrospermum stagnale PCC 7417]
          Length = 226

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 9/74 (12%)

Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           + P+P W  +L   +L +  V  P   +  ++N+YQPG  I SH D    F   IISLS 
Sbjct: 98  IAPLPNWALTLA-QRLHK--VFSPTLPDQVIVNEYQPGQGISSHTDCVSCFTDVIISLS- 153

Query: 185 FSDSALCFGCKFNF 198
                LC  C  +F
Sbjct: 154 -----LCSSCVMDF 162


>gi|389594597|ref|XP_003722521.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|11022582|emb|CAC14240.1| hypothetical protein L7845.08 [Leishmania major]
 gi|323363749|emb|CBZ12755.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 318

 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 42/185 (22%), Positives = 83/185 (44%), Gaps = 13/185 (7%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFS 186
           P+P      ++ +LEQ  VVP +++N+   N Y+PG  I +HID   ++D       +F+
Sbjct: 79  PLPPASFMPLLERLEQDNVVPKSWLNNQTANLYEPGDFIRAHIDNLFVYD------DIFA 132

Query: 187 DSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILL 246
             +L   C   F  ++  E +  +   R  V ++   A     H V P + Q R +++  
Sbjct: 133 ICSLGSNCLLRFVHVQNGEELDVMVPDR-SVYIMSGPARYVYFHMVLPVEAQ-RFSLVFR 190

Query: 247 RRVLPHAPRLTLSQTP--RVKPYHS---LYDVQARESGGYFRSSIETYNNNENHSNKNTS 301
           R ++         +TP   + PY +   L  + +++ GG   S  + +  + +    +TS
Sbjct: 191 RSIMESDGGFRPVKTPFKEIMPYRATQILNALYSKQVGGVRVSVDDDFLESADIGAFDTS 250

Query: 302 NGLDK 306
             + +
Sbjct: 251 RWVKR 255


>gi|255081432|ref|XP_002507938.1| predicted protein [Micromonas sp. RCC299]
 gi|226523214|gb|ACO69196.1| predicted protein [Micromonas sp. RCC299]
          Length = 684

 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 14/137 (10%)

Query: 125 VDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           V P+P  + ++V   + +  +      +S  IN Y PG  I  HID P  FDRP +++SL
Sbjct: 126 VAPLPPELDAVVDALIARGALTELQRPDSCTINLYGPGQWIPPHIDNP-AFDRPFVTVSL 184

Query: 185 FSDSALCFGCKFNF---------KPIRVSEP-VLYLPVQRGCVTLLRDFAANGITHCVRP 234
            S+  +  G    +            R++E   L LPV  G   ++   AA+   H V P
Sbjct: 185 CSEQPMVLGRGMVWPEGGRGPCGDDERLNEEHALSLPV--GSAVVVEGEAADEYEHAVPP 242

Query: 235 QDTQHRRAVILLRRVLP 251
             T  R ++   RR  P
Sbjct: 243 V-TAERISLTFRRRGRP 258


>gi|392871202|gb|EAS33085.2| Alkbh6 protein [Coccidioides immitis RS]
          Length = 265

 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
           P+PAW+ + VIT+ +++ +    P    N  +IN+YQPG  I+ H D P
Sbjct: 86  PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 134


>gi|308502209|ref|XP_003113289.1| hypothetical protein CRE_25438 [Caenorhabditis remanei]
 gi|308265590|gb|EFP09543.1| hypothetical protein CRE_25438 [Caenorhabditis remanei]
          Length = 570

 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 4/68 (5%)

Query: 126 DPIPAWIQSLVITKLEQMGVVP-PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           DPIP  I +L+   + +  +   P+ I + V   Y+PG  I SH D    F+ PI+S+SL
Sbjct: 173 DPIPPVINTLIDKLMSEKYITERPDQITANV---YEPGHGIPSHYDTHSAFEDPIVSISL 229

Query: 185 FSDSALCF 192
            SD  + F
Sbjct: 230 LSDVVMEF 237


>gi|303316794|ref|XP_003068399.1| hypothetical protein CPC735_004250 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240108080|gb|EER26254.1| hypothetical protein CPC735_004250 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 271

 Score = 41.6 bits (96), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
           P+PAW+ + VIT+ +++ +    P    N  +IN+YQPG  I+ H D P
Sbjct: 92  PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 140


>gi|320038239|gb|EFW20175.1| hypothetical protein CPSG_03351 [Coccidioides posadasii str.
           Silveira]
          Length = 248

 Score = 41.6 bits (96), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
           P+PAW+ + VIT+ +++ +    P    N  +IN+YQPG  I+ H D P
Sbjct: 69  PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 117


>gi|291383953|ref|XP_002708552.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Oryctolagus cuniculus]
          Length = 664

 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 27/162 (16%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN YQPG  
Sbjct: 177 KHFGYEFRYDNNNVDKDRPLPGGLPDICDSILEKW--LQEGYIKHKPDQLTINQYQPGHG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
           I +HID    F+  IISLSL S+  + F       P  ++ PV+ LP  R  + ++   +
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDFK-----HPDGITVPVM-LP--RRSLLVMTGES 286

Query: 225 ANGITHCVRP------QDTQHRRAVILLRRVLPHAPRLTLSQ 260
               TH + P      Q ++H ++ I+   V      LTLS+
Sbjct: 287 RYLWTHGITPRKFDTIQASEHHKSGIITSDV----GDLTLSK 324


>gi|297804104|ref|XP_002869936.1| hypothetical protein ARALYDRAFT_329536 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315772|gb|EFH46195.1| hypothetical protein ARALYDRAFT_329536 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 241

 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 66/159 (41%), Gaps = 34/159 (21%)

Query: 105 YGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGC 164
           +G  +  KGL  + L         P+W+  +     E  G+ P + IN  +IN+Y P   
Sbjct: 52  WGGMVHEKGLVPQEL---------PSWLTKITAKICESSGLFP-SAINHVLINEYHPNQG 101

Query: 165 IVSHIDPPHIFDRPIIS-LSLFSDSALCFG------------------CKFNFKPIRVSE 205
           I++H D P  F  P+++ LSL S   + F                   C  +++P R S 
Sbjct: 102 IMAHQDGPAYF--PVVAILSLGSPVVMDFSPHLRLRSGDGYISKDQSPCAESYEPERDSF 159

Query: 206 PVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVI 244
            VL +P     + + +D A +   H +    TQ    V+
Sbjct: 160 SVLMMP---RSLLIFKDDAYSDFLHGISDSPTQCYNQVV 195


>gi|194880701|ref|XP_001974503.1| GG21781 [Drosophila erecta]
 gi|190657690|gb|EDV54903.1| GG21781 [Drosophila erecta]
          Length = 615

 Score = 41.2 bits (95), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQSL---VITKLEQMGVVPPNYINSA----VINDYQPGGC 164
           K  G E LY    VDP     QS+         ++   P N+  S+     +N+Y+PG  
Sbjct: 175 KHFGFEFLYGSNNVDPFKPLEQSIPSACDILWPRLDSFPSNWDWSSPDQLTVNEYEPGHG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I  H+D    F  PI+SLSL SD  + F
Sbjct: 235 IPPHVDTHSAFLDPILSLSLQSDVVMDF 262


>gi|198474666|ref|XP_001356776.2| GA19388 [Drosophila pseudoobscura pseudoobscura]
 gi|198138491|gb|EAL33842.2| GA19388 [Drosophila pseudoobscura pseudoobscura]
          Length = 237

 Score = 41.2 bits (95), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+QS V  K+  +GV      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEIPEWLQSYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS 181
           IIS
Sbjct: 115 IIS 117


>gi|195148282|ref|XP_002015103.1| GL18598 [Drosophila persimilis]
 gi|194107056|gb|EDW29099.1| GL18598 [Drosophila persimilis]
          Length = 237

 Score = 41.2 bits (95), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+QS V  K+  +GV      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEIPEWLQSYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS 181
           IIS
Sbjct: 115 IIS 117


>gi|421597443|ref|ZP_16041063.1| hypothetical protein BCCGELA001_08731 [Bradyrhizobium sp.
           CCGE-LA001]
 gi|404270447|gb|EJZ34507.1| hypothetical protein BCCGELA001_08731 [Bradyrhizobium sp.
           CCGE-LA001]
          Length = 190

 Score = 41.2 bits (95), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 9/77 (11%)

Query: 123 GEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISL 182
           GE +PIP W+   +  ++E    +P   +   +  +Y+ G  I  H D PH FD+ ++ L
Sbjct: 71  GEAEPIPDWVLP-IARQVEAWAGLPEASVRQVLCTEYEAGVGIGWHRDKPH-FDK-VLGL 127

Query: 183 SLFSDSALCFGCKFNFK 199
           SL +       CKF F+
Sbjct: 128 SLGAP------CKFRFR 138


>gi|195439545|ref|XP_002067665.1| GK21248 [Drosophila willistoni]
 gi|194163750|gb|EDW78651.1| GK21248 [Drosophila willistoni]
          Length = 228

 Score = 40.8 bits (94), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+QS V  K+  +G+      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS 181
           IIS
Sbjct: 115 IIS 117


>gi|449305107|gb|EMD01114.1| hypothetical protein BAUCODRAFT_196023 [Baudoinia compniacensis
           UAMH 10762]
          Length = 248

 Score = 40.8 bits (94), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 2/48 (4%)

Query: 127 PIPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPP 172
           P+PAW+   V+ ++  +GV    P+ IN  +IN+Y PG  I+ H D P
Sbjct: 71  PLPAWLTEPVVGRIHDLGVFADAPHGINHCLINEYLPGQGIMPHEDGP 118


>gi|326914414|ref|XP_003203520.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Meleagris gallopavo]
          Length = 846

 Score = 40.8 bits (94), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 43/88 (48%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y   +VD   P+P  +  + I+ LE+   +   YI    +   +N Y+PG  
Sbjct: 351 KHFGYEFRYDNNDVDKDKPLPGGLPEICISFLEK--CLKQGYIKHKPDQLTVNQYEPGQG 408

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I  HID    F+  IISLSL S+  + F
Sbjct: 409 IPPHIDTHSAFEDEIISLSLGSEIVMDF 436


>gi|194761924|ref|XP_001963141.1| GF15798 [Drosophila ananassae]
 gi|190616838|gb|EDV32362.1| GF15798 [Drosophila ananassae]
          Length = 231

 Score = 40.8 bits (94), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 41/76 (53%), Gaps = 5/76 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+QS V  K+  +G+      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS-LSLFSDSALCFG 193
           IIS +S  S + L F 
Sbjct: 115 IISTISCGSHTVLEFA 130


>gi|301104152|ref|XP_002901161.1| alkylated DNA repair protein alkB [Phytophthora infestans T30-4]
 gi|262101095|gb|EEY59147.1| alkylated DNA repair protein alkB [Phytophthora infestans T30-4]
          Length = 231

 Score = 40.8 bits (94), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
           E +P+P W+Q +  T L+          N A+IN+Y  G CI+ H D P  F  P++  S
Sbjct: 94  EPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDCILPHEDGPAYF--PLV--S 149

Query: 184 LFSDSALCFGCKFNFKPIR 202
           + S  A    C+  F+P R
Sbjct: 150 IISTGA---ECRVTFEPHR 165


>gi|321463990|gb|EFX75001.1| hypothetical protein DAPPUDRAFT_306906 [Daphnia pulex]
          Length = 574

 Score = 40.4 bits (93), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 57/127 (44%), Gaps = 19/127 (14%)

Query: 112 KGLGNERLYPEGEVDP------IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           K  G E LY   +VD       IP  ++ L I K  + G+  PN +    +N Y PG  I
Sbjct: 145 KHYGFEFLYGSNKVDKNPLSSGIPVEMEPL-IEKFMEFGLARPNQL---TVNHYVPGQGI 200

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL---RD 222
             H D    F+  IIS+SL SD  +      +FK  +  E V     +R C+ +    R 
Sbjct: 201 PLHTDTHSSFEDGIISVSLGSDIVM------DFKNNKGGECVSITLPRRSCLIMTNESRY 254

Query: 223 FAANGIT 229
             ++GIT
Sbjct: 255 MWSHGIT 261


>gi|195386178|ref|XP_002051781.1| GJ17179 [Drosophila virilis]
 gi|194148238|gb|EDW63936.1| GJ17179 [Drosophila virilis]
          Length = 221

 Score = 40.4 bits (93), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+QS V  K+  +G+      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQNANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS 181
           IIS
Sbjct: 115 IIS 117


>gi|119187755|ref|XP_001244484.1| hypothetical protein CIMG_03925 [Coccidioides immitis RS]
          Length = 119

 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 30/49 (61%), Gaps = 3/49 (6%)

Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHIDPP 172
           P+PAW+ + VIT+ +++ +    P    N  +IN+YQPG  I+ H D P
Sbjct: 51  PLPAWLVTPVITRFQKLAIFSASPHKAPNHVLINEYQPGQGIMPHEDGP 99


>gi|170579523|ref|XP_001894867.1| hypothetical protein [Brugia malayi]
 gi|158598386|gb|EDP36290.1| conserved hypothetical protein [Brugia malayi]
          Length = 576

 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 33/91 (36%), Positives = 42/91 (46%), Gaps = 3/91 (3%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           DPIP+  Q  VI ++   G+      +   +N Y+PG  I SH+D    F   I SLSL 
Sbjct: 183 DPIPSVCQP-VIDRMLGAGIFKEK-PDQVTVNIYEPGNGIPSHVDTHSAFSDTIASLSLL 240

Query: 186 SDSALCFGCKFNFKPI-RVSEPVLYLPVQRG 215
           SD  + F    N   I  V  P   L V RG
Sbjct: 241 SDLVMEFRDFANTSTIYDVLLPRFSLTVMRG 271


>gi|47196062|emb|CAF89344.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 234

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/121 (24%), Positives = 59/121 (48%), Gaps = 8/121 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P+G + + IP W+Q+    ++  +G       N  ++N+Y+PG  I+ H D P ++   
Sbjct: 66  HPKGMLAETIPEWLQTYC-QRISSLGAFGGKVANHVLVNEYKPGEGIMPHEDGP-LYHPT 123

Query: 179 IISLSLFSDSALCF-----GCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVR 233
           + +LSL S + L F     G + +      +  +  L V+   + +L+D     + H +R
Sbjct: 124 VTTLSLGSHTLLDFYTPVGGVQGDAPQTEENRFLFSLLVEPRSLLILQDEMYQKLLHGIR 183

Query: 234 P 234
           P
Sbjct: 184 P 184


>gi|301118833|ref|XP_002907144.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105656|gb|EEY63708.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 134

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 7/79 (8%)

Query: 124 EVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLS 183
           E +P+P W+Q +  T L+          N A+IN+Y  G CI+ H D P  F  P++  S
Sbjct: 11  EPNPLPEWLQQISQTLLDTGIFSEEKKPNHALINEYGVGDCILPHEDGPAYF--PLV--S 66

Query: 184 LFSDSALCFGCKFNFKPIR 202
           + S  A    C+  F+P R
Sbjct: 67  IISTGA---ECRVTFEPHR 82


>gi|195114110|ref|XP_002001610.1| GI16672 [Drosophila mojavensis]
 gi|193912185|gb|EDW11052.1| GI16672 [Drosophila mojavensis]
          Length = 222

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+QS V  K+  +G+      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEIPEWLQSYV-DKVNNLGIFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS 181
           IIS
Sbjct: 115 IIS 117


>gi|395861438|ref|XP_003802993.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Otolemur
           garnettii]
          Length = 986

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 48/102 (47%), Gaps = 14/102 (13%)

Query: 114 LGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGCIV 166
            G E LY    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  I 
Sbjct: 501 FGYEFLYENNNVDKDKPLPGGLPDICDSILEKW--LKEGYIRQKPDQVTINQYEPGHGIP 558

Query: 167 SHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVL 208
           +HID    F+  I+SLSL S+  + F       P  V+ PV+
Sbjct: 559 AHIDTHSAFEDEIVSLSLGSEIVMDFK-----HPDGVTVPVM 595


>gi|291238544|ref|XP_002739188.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 742

 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 58/129 (44%), Gaps = 13/129 (10%)

Query: 126 DPIPAWIQSLVITKLEQMGVV-PPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSL 184
           D IPA ++ ++   +    +   PN I    +N YQPG  I  HID    F+  IISLSL
Sbjct: 267 DGIPAALRKIIDDIMATQNIQHGPNQI---TVNQYQPGQGIPPHIDTHSAFEGEIISLSL 323

Query: 185 FSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGIT---HCVRPQDTQHRR 241
            S+  + F    N + I V  P   L V  G     R    +GIT   H V P DT    
Sbjct: 324 GSNVIMDFK-HPNGQHIPVLVPQRSLLVMTG---ESRYLWTHGITPRKHDVAPADTD--T 377

Query: 242 AVILLRRVL 250
            + L +R L
Sbjct: 378 GLTLTKRTL 386


>gi|363729209|ref|XP_417166.3| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Gallus
           gallus]
          Length = 695

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y   +VD   P+P  +  + I  LE+   +   YI    +   +N Y+PG  
Sbjct: 201 KHFGYEFRYDNNDVDKDNPLPGGLPEICIPFLEK--CLKQGYIKHKPDQLTVNQYEPGQG 258

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I  HID    F+  IISLSL S+  + F
Sbjct: 259 IPPHIDTHSAFEDEIISLSLGSEIVMDF 286


>gi|291301566|ref|YP_003512844.1| 2OG-Fe(II) oxygenase [Stackebrandtia nassauensis DSM 44728]
 gi|290570786|gb|ADD43751.1| 2OG-Fe(II) oxygenase [Stackebrandtia nassauensis DSM 44728]
          Length = 197

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 17/105 (16%)

Query: 94  RNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINS 153
           R    +G  Y YG    R+ +G     P     PIPAW+  +V   +++  +  P   + 
Sbjct: 42  RRVQHYGHRYDYG----RRSVGQTDAAP-----PIPAWLDEVVARLVDEKVMDQPA--DQ 90

Query: 154 AVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNF 198
            ++N+Y PG  I +H+D    F   + ++SL S      GC  +F
Sbjct: 91  VIVNEYLPGQGISAHVDCVPCFGPVVAAISLES------GCVMDF 129


>gi|195033105|ref|XP_001988620.1| GH11262 [Drosophila grimshawi]
 gi|193904620|gb|EDW03487.1| GH11262 [Drosophila grimshawi]
          Length = 221

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + +P W+QS V  K+  +GV      N  ++N+Y PG  I+ H D P  +  P
Sbjct: 58  HPNGMIAEEMPEWLQSYV-DKVNNLGVFESQKANHVLVNEYLPGQGILPHTDGPLFY--P 114

Query: 179 IIS 181
           IIS
Sbjct: 115 IIS 117


>gi|348681591|gb|EGZ21407.1| hypothetical protein PHYSODRAFT_491312 [Phytophthora sojae]
          Length = 688

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 55/126 (43%), Gaps = 3/126 (2%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +PIP  +Q  +I K+ + G++  +  +   +N+Y PG  I  H+D    F   I SLS+ 
Sbjct: 208 EPIPEVLQP-IIEKIARCGIMDGDEPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSIC 266

Query: 186 SDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV--RPQDTQHRRAV 243
           S+  + F      +   V  P   L V  G    + + A    T  V    Q  + RR  
Sbjct: 267 SEVVMDFRHPDGVRNEGVLLPARSLAVMSGASRYMWEHAIVPRTFDVIDGKQVNRQRRVS 326

Query: 244 ILLRRV 249
           I  R++
Sbjct: 327 ITFRKI 332


>gi|195151073|ref|XP_002016472.1| GL10469 [Drosophila persimilis]
 gi|194110319|gb|EDW32362.1| GL10469 [Drosophila persimilis]
          Length = 614

 Score = 39.7 bits (91), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 16/131 (12%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGV-VPPNYINSAVINDYQPGGC 164
           K  G E LY    VDP     QS+      +  +LE        +  +   +N+Y+PG  
Sbjct: 173 KHFGYEFLYGSNNVDPTKPLEQSVPTACGFLWPRLESFSSSWDWSTPDQLTVNEYEPGHG 232

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
           I  H+D    F  PI+SLSL SD  + F         R  E  + + + R  + ++   A
Sbjct: 233 IPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEAQVQVKLPRRSLLIMSGEA 283

Query: 225 ANGITHCVRPQ 235
               TH ++P+
Sbjct: 284 RYDWTHGIKPK 294


>gi|195339853|ref|XP_002036531.1| GM18491 [Drosophila sechellia]
 gi|194130411|gb|EDW52454.1| GM18491 [Drosophila sechellia]
          Length = 228

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+Q+ V  K+  +GV      N  ++N+Y PG  I+ H D P +F   
Sbjct: 58  HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115

Query: 179 IISLSLFSDSALCFG 193
           I ++S  + + L F 
Sbjct: 116 ISTISTGAHTVLEFA 130


>gi|340053555|emb|CCC47848.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 305

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 86/198 (43%), Gaps = 21/198 (10%)

Query: 88  VDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVP 147
           +++  LR  Y   + YT  S  +R   G  +L P  +   +P      ++ ++EQ G+VP
Sbjct: 45  MEKIALRKGYV--DVYTQRSASIRLSDGRFQLPPLPQHSFLP------LVERMEQDGLVP 96

Query: 148 PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPV 207
             ++N+   N Y+    I +HID   I+D     +SL +++ L F    N + + V  P 
Sbjct: 97  CGWLNNQTANFYETNDFIRAHIDNLFIYDDIFAIVSLGANALLRFVHVQNGEELDVVIP- 155

Query: 208 LYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRRVLPHAPRLTLSQTP--RVK 265
                  G V ++   +     H V P + Q R +++  R +L          TP   + 
Sbjct: 156 ------DGSVYIMSGPSRYVYFHMVLPVEAQ-RVSIVFRRSILNSDGSFRPVSTPLGDLM 208

Query: 266 PYHSLYDVQ---ARESGG 280
           PY S   V    A++ GG
Sbjct: 209 PYRSTQIVNTLYAKQVGG 226


>gi|198457852|ref|XP_001360812.2| GA14676 [Drosophila pseudoobscura pseudoobscura]
 gi|198136129|gb|EAL25387.2| GA14676 [Drosophila pseudoobscura pseudoobscura]
          Length = 615

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 16/131 (12%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQSL------VITKLEQMGV-VPPNYINSAVINDYQPGGC 164
           K  G E LY    VDP     QS+      +  +LE        +  +   +N+Y+PG  
Sbjct: 173 KHFGYEFLYGSNNVDPTKPLEQSVPTACDFLWPRLESFSSSWDWSTPDQLTVNEYKPGHG 232

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFA 224
           I  H+D    F  PI+SLSL SD  + F         R  E  + + + R  + ++   A
Sbjct: 233 IPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEAQVQVKLPRRSLLIMSGEA 283

Query: 225 ANGITHCVRPQ 235
               TH ++P+
Sbjct: 284 RYDWTHGIKPK 294


>gi|258576637|ref|XP_002542500.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902766|gb|EEP77167.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 259

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 29/47 (61%), Gaps = 3/47 (6%)

Query: 127 PIPAWIQSLVITKLEQMGVV---PPNYINSAVINDYQPGGCIVSHID 170
           P+P W++S V+++  ++G+    P    N  +IN+YQPG  I+ H D
Sbjct: 80  PLPDWLRSPVVSRFTELGIFFDSPHRAPNHVLINEYQPGQGIMPHED 126


>gi|357498515|ref|XP_003619546.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
 gi|355494561|gb|AES75764.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
          Length = 266

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 45/90 (50%), Gaps = 12/90 (13%)

Query: 112 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 163
           K L N RL   G V         P+P+W+ +      E+ G+ P + IN  +IN+YQP  
Sbjct: 50  KLLKNRRLQNWGGVVHEKGLLPQPLPSWLTNFTQKISEESGLFP-SPINHVLINEYQPNQ 108

Query: 164 CIVSHIDPPHIFDRPIIS-LSLFSDSALCF 192
            I+ H D P  F  P+++ LSL S   + F
Sbjct: 109 GIMPHQDGPSYF--PVVAILSLGSPVVMDF 136


>gi|67458989|ref|YP_246613.1| alkylated DNA repair protein [Rickettsia felis URRWXCal2]
 gi|67004522|gb|AAY61448.1| Alkylated DNA repair protein [Rickettsia felis URRWXCal2]
          Length = 188

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 63/139 (45%), Gaps = 21/139 (15%)

Query: 53  VNQMSLFSHEECLALERNIDEVSSLAEQGQYKPCTVDRAPL-----RNKYFFGEGYTYGS 107
           ++Q+SLF+ +  ++  + I+E  + AEQ       +D +P      R    +G  Y Y S
Sbjct: 1   MSQLSLFNDQIIISGLKYIEEYIT-AEQEDRLIKLIDSSPWITDLKRRVQHYGYKYDYKS 59

Query: 108 QLVRKGLGNERLYPEGEVDPIPAWIQSLV--ITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           + +      E+ Y  G    +P W+Q +     K      +P    N  ++N+Y PG  I
Sbjct: 60  RSI------EQSYYLG---LLPKWLQIVADEFYKKNIFNEIP----NQVIVNEYMPGQGI 106

Query: 166 VSHIDPPHIFDRPIISLSL 184
            SHID    F   I SLSL
Sbjct: 107 ASHIDCIPCFSDTICSLSL 125


>gi|440791732|gb|ELR12970.1| hypothetical protein ACA1_096410 [Acanthamoeba castellanii str.
           Neff]
          Length = 259

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 49/112 (43%), Gaps = 13/112 (11%)

Query: 123 GEVDPIPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPPHIFDRPII 180
           G + P+P + Q  V  +L   G++P  P+ +   +IN Y PG  I  H+D  H F+  + 
Sbjct: 103 GYLGPLPDFAQQ-VTARLVSTGLMPYEPDQM---IINHYTPGQGIHPHVDKTHCFEGVVG 158

Query: 181 SLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCV 232
           SL L S       C   FK I     V     +R    +L   A  G TH +
Sbjct: 159 SLGLGSS------CIMEFKHIETGRRVDVF-FERRTALMLTGEARYGWTHGI 203


>gi|195578245|ref|XP_002078976.1| GD23711 [Drosophila simulans]
 gi|194190985|gb|EDX04561.1| GD23711 [Drosophila simulans]
          Length = 228

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+Q+ V  K+  +GV      N  ++N+Y PG  I+ H D P +F   
Sbjct: 58  HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115

Query: 179 IISLSLFSDSALCFG 193
           I ++S  + + L F 
Sbjct: 116 ISTISTGAHTVLEFA 130


>gi|358395346|gb|EHK44733.1| hypothetical protein TRIATDRAFT_184325, partial [Trichoderma
           atroviride IMI 206040]
          Length = 345

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 51/122 (41%), Gaps = 24/122 (19%)

Query: 83  YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQ 142
           +KP T+ +A  R  ++   G  Y            R+YPE      PA + + + T    
Sbjct: 159 HKPLTIKQALQRRLHWVTLGGQYD--------WTNRIYPEAPPPSFPADLSNFLTT---- 206

Query: 143 MGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCK--FNFKP 200
             + P     +A++N Y PG  ++ H D     D+ ++SLS        FGC+  F   P
Sbjct: 207 --LFPETLAQAAIVNFYTPGDTMMMHRDVSEEIDKGLVSLS--------FGCECLFMIAP 256

Query: 201 IR 202
            R
Sbjct: 257 SR 258


>gi|298707663|emb|CBJ25980.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 329

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +P+P W+ +L  + + +         N  ++N+YQPG  I++H D P  ++    +LSL 
Sbjct: 139 EPLPPWVDALCESLVVRGVFSEETRPNHVLLNEYQPGQGIMAHTDGP-FYEPRTATLSLG 197

Query: 186 SDSALCF 192
           SD+ + F
Sbjct: 198 SDAVMHF 204


>gi|85724924|ref|NP_001033895.1| CG6144, isoform B [Drosophila melanogaster]
 gi|85726418|ref|NP_609414.3| CG6144, isoform C [Drosophila melanogaster]
 gi|16076870|gb|AAL13362.1| LD37206p [Drosophila melanogaster]
 gi|84795296|gb|AAF52964.3| CG6144, isoform C [Drosophila melanogaster]
 gi|84795297|gb|ABC65892.1| CG6144, isoform B [Drosophila melanogaster]
 gi|220946136|gb|ACL85611.1| CG6144-PA [synthetic construct]
 gi|220960392|gb|ACL92732.1| CG6144-PA [synthetic construct]
          Length = 228

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+Q+ V  K+  +GV      N  ++N+Y PG  I+ H D P +F   
Sbjct: 58  HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115

Query: 179 IISLSLFSDSALCF 192
           I ++S  + + L F
Sbjct: 116 ISTISTGAHTVLEF 129


>gi|328715721|ref|XP_001945058.2| PREDICTED: hypothetical protein LOC100163323 [Acyrthosiphon pisum]
          Length = 312

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           DPIP   + ++     ++   P    N   +N Y PG  I SHID   +FD  I+SLSL 
Sbjct: 40  DPIPKEFEFILNAIYLRLKWQP----NQITVNKYLPGQGIPSHIDTHGVFDEYILSLSLN 95

Query: 186 SDSALCF 192
           SD  + F
Sbjct: 96  SDIIMEF 102


>gi|312086366|ref|XP_003145047.1| hypothetical protein LOAG_09472 [Loa loa]
 gi|307759789|gb|EFO19023.1| hypothetical protein LOAG_09472 [Loa loa]
          Length = 576

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 43/91 (47%), Gaps = 3/91 (3%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +PIPA  Q  +I ++   G+      +   +N Y+PG  I SH+D    F   I SLSL 
Sbjct: 183 NPIPAACQP-IIDRMLDAGIFKEE-PDQLTVNIYEPGNGIPSHVDTHSAFSDTIASLSLL 240

Query: 186 SDSALCFGCKFNFKPI-RVSEPVLYLPVQRG 215
           SD  + F    N   I  V  P L L V +G
Sbjct: 241 SDLVMEFRDFANTSTIYDVLLPRLSLAVMQG 271


>gi|301121774|ref|XP_002908614.1| alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]
 gi|262103645|gb|EEY61697.1| alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]
          Length = 640

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 126 DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +PIP  +Q  VI K+ + G++  +  +   +N+Y PG  I  H+D    F   I SLS+ 
Sbjct: 203 EPIPEVLQP-VIDKIVECGIMDGDRPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSIC 261

Query: 186 SDSALCF 192
           S+  + F
Sbjct: 262 SEVVMDF 268


>gi|390349756|ref|XP_790566.3| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH6-like [Strongylocentrotus purpuratus]
          Length = 245

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 46/87 (52%), Gaps = 10/87 (11%)

Query: 114 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           L N RL       +P+G + + +P W+  +   K+  +GV   +  N  ++N+YQPG  I
Sbjct: 55  LSNRRLQNWGGLPHPKGMIAEGLPKWL-DVYAKKIAGLGVFGDHIPNHVLVNEYQPGQGI 113

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCF 192
           + H D P +F   + ++SL S + L F
Sbjct: 114 MPHEDGP-LFHPVVTTISLGSHTFLDF 139


>gi|91080367|ref|XP_974935.1| PREDICTED: similar to CG17807 CG17807-PA [Tribolium castaneum]
 gi|270005595|gb|EFA02043.1| hypothetical protein TcasGA2_TC007671 [Tribolium castaneum]
          Length = 582

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 23/41 (56%)

Query: 152 NSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
           N   IN Y PG  I SH+D    F  PI+SLSL SD  + F
Sbjct: 203 NQLTINRYNPGQGIPSHVDTHSAFGDPILSLSLSSDVVMEF 243


>gi|195471890|ref|XP_002088235.1| GE18466 [Drosophila yakuba]
 gi|194174336|gb|EDW87947.1| GE18466 [Drosophila yakuba]
          Length = 234

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 3/75 (4%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+Q+ V  K+  +GV      N  ++N+Y PG  I+ H D P +F   
Sbjct: 58  HPNGMIAEEIPEWLQTYV-DKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115

Query: 179 IISLSLFSDSALCFG 193
           I ++S  + + L F 
Sbjct: 116 ISTISTGAHTVLEFA 130


>gi|443725258|gb|ELU12938.1| hypothetical protein CAPTEDRAFT_112863, partial [Capitella teleta]
          Length = 214

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 12/93 (12%)

Query: 114 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           L N RL       +P+G V + IP W+ S    ++ Q+GV      N  ++N+Y PG  I
Sbjct: 36  LSNRRLQNWGGLPHPKGMVPEKIPEWLDSFG-QRIGQLGVFDGQMPNHVLVNEYLPGQGI 94

Query: 166 VSHIDPPHIFDRPIIS-LSLFSDSALCFGCKFN 197
           + H D P  F  P +S ++L S + L F    N
Sbjct: 95  MPHTDGPLYF--PTVSTITLGSHTLLDFYTPLN 125


>gi|168057031|ref|XP_001780520.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667998|gb|EDQ54614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 258

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 127 PIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIIS-LSLF 185
           PIPAW+ S +  K+ +   + P  IN  ++N+Y PG  I SH D P  +  P+++ LSL 
Sbjct: 59  PIPAWLSS-ITEKIAKETNLFPAPINHVLVNEYLPGQGITSHQDGPVYY--PVVAILSLG 115

Query: 186 SDSALCF 192
           + + + F
Sbjct: 116 APTLMHF 122


>gi|440902148|gb|ELR52976.1| Alkylated DNA repair protein alkB-like protein 8, partial [Bos
           grunniens mutus]
          Length = 349

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   +I    N   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICESILEKW--LKEGFIKHKPNQLTINQYEPGHG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  IISLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSEIVMDF 262


>gi|351710212|gb|EHB13131.1| Alkylated DNA repair protein alkB-like protein 8, partial
           [Heterocephalus glaber]
          Length = 667

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 179 KHFGYEFHYENNNVDKDKPLPEGLPDICDSFLEKW--LAEGYIKCRPDQLTINQYEPGQG 236

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  IISLSL S+  + F
Sbjct: 237 IPAHIDTHSAFEDEIISLSLGSEVVMDF 264


>gi|340507083|gb|EGR33100.1| hypothetical protein IMG5_061460 [Ichthyophthirius multifiliis]
          Length = 392

 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 40/175 (22%), Positives = 72/175 (41%), Gaps = 23/175 (13%)

Query: 79  EQGQYKPCTVDRAPL-----RNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQ 133
           EQ  Y    +D  P      R    +G  + YG   + K         E + +PIP ++ 
Sbjct: 189 EQENYIIQQIDEKPWIKLSKRQVQHYGYEFIYGQNTINK---------EKKTNPIPDFLS 239

Query: 134 SLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFG 193
           ++V  +L +        ++   INDY+PG  I SH+D    F+  I+ +SL S   +   
Sbjct: 240 NMV-QQLNEYIKNKQKPLDQLTINDYKPGNGISSHVDAHSPFEECIVVVSLVSGVVIT-- 296

Query: 194 CKFNFKPIRVSEPVLYLPVQRGCVTLLRDFAANGITHCVRPQDTQHRRAVILLRR 248
               FK  +  E  L+L  +   + ++        THC+  +     +  ++ R+
Sbjct: 297 ----FKSYKNEEKSLFL--ENRSLLIMSGEGRYAWTHCISSRKVDRVQDEVIFRK 345


>gi|348541845|ref|XP_003458397.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH6-like [Oreochromis niloticus]
          Length = 234

 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 65/146 (44%), Gaps = 28/146 (19%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDP 171
           KG+  ERL         P W+Q+    K+  +G       N  ++N+Y+PG  I+ H D 
Sbjct: 68  KGMLGERL---------PDWLQTYC-EKISSLGAFSGKTANHVLVNEYKPGEGIMPHEDG 117

Query: 172 PHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEP----------VLYLPVQRGCVTLLR 221
           P ++   + ++SL S + L F     + P+  +E           +  L V+   + +L+
Sbjct: 118 P-LYHPTVTTISLGSHTLLDF-----YTPLSSAEGATPQTEESRFLFSLLVRPRSLLILQ 171

Query: 222 DFAANGITHCV--RPQDTQHRRAVIL 245
           D     + H +  R QDT  ++ V L
Sbjct: 172 DEMYQRLLHGIHGREQDTLTKKVVNL 197


>gi|220908820|ref|YP_002484131.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
 gi|219865431|gb|ACL45770.1| 2OG-Fe(II) oxygenase [Cyanothece sp. PCC 7425]
          Length = 207

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 45/103 (43%), Gaps = 18/103 (17%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           +G  Y Y S+ V K +          +  +P W + L     ++     P+ +   ++N+
Sbjct: 63  YGYKYDYKSRGVDKSM---------YIASLPIWAKELAHKIRKKYTTDLPDQV---IVNE 110

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 201
           Y PG  I +HID  + F   I+SLS      LC  C  +F  I
Sbjct: 111 YMPGQGIANHIDCVNCFTDTIVSLS------LCSSCVMDFVHI 147


>gi|300793742|ref|NP_001178838.1| alkylated DNA repair protein alkB homolog 8 [Rattus norvegicus]
          Length = 664

 Score = 38.9 bits (89), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE++  +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNTVDKDKPLPGGLPEICSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S   + F
Sbjct: 235 IPAHIDSHSAFEHEIVSLSLGSAIVMDF 262


>gi|194754858|ref|XP_001959711.1| GF13009 [Drosophila ananassae]
 gi|190621009|gb|EDV36533.1| GF13009 [Drosophila ananassae]
          Length = 609

 Score = 38.9 bits (89), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 59/135 (43%), Gaps = 24/135 (17%)

Query: 112 KGLGNERLYPEGEVDP-------IPAWIQSLVITKLE----QMGVVPPNYINSAVINDYQ 160
           K  G E LY    VDP       IP+    ++  +LE        + P+ +    +N+Y+
Sbjct: 174 KHFGYEFLYGSNNVDPSKPLEQPIPSACD-ILWPRLEGNSTTWDWITPDQL---TVNEYE 229

Query: 161 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
           PG  I  H+D    F  PI+SLSL SD  + F         R  E  + + + R  + ++
Sbjct: 230 PGNGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGEEHVQVRLPRRSLLVM 280

Query: 221 RDFAANGITHCVRPQ 235
              A    TH +RP+
Sbjct: 281 SGEARYDWTHGIRPK 295


>gi|327269144|ref|XP_003219355.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Anolis
           carolinensis]
          Length = 666

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 57/131 (43%), Gaps = 13/131 (9%)

Query: 69  RNIDEVSSLAEQGQYKPC-------TVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYP 121
           R I+EV S  E+GQ   C        +  A    K+   + + Y  Q     +  +R  P
Sbjct: 141 RVIEEVVSPEEEGQLLECIDWGKDEVIPNAQKSLKHRRVKHFGYEFQYDNNNVDRDRPLP 200

Query: 122 EGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIIS 181
            G  D     I + ++ K  +MG +  +  +   IN Y+PG  I  HID    F+  IIS
Sbjct: 201 GGLPD-----ICNTLLAKWLKMGYIKESS-DQLTINQYEPGQGIPPHIDTHSAFEDEIIS 254

Query: 182 LSLFSDSALCF 192
           LSL +   + F
Sbjct: 255 LSLGAGIVMDF 265


>gi|145522424|ref|XP_001447056.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124414556|emb|CAK79659.1| unnamed protein product [Paramecium tetraurelia]
          Length = 636

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 50/118 (42%), Gaps = 17/118 (14%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           +G  + YG   V      E+         IPA+++  V  K+  + V P   IN   IN+
Sbjct: 192 YGYEFIYGDNTVNVDQPAEK--------KIPAFLED-VRAKVSDL-VKPQAEINQLTINE 241

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGC 216
           Y PG  I  H D    F    +S+SL S      G   +FK  +  E  LYLP  R C
Sbjct: 242 YLPGMGIPPHFDVHPPFHEKFVSISLLS------GLVMSFKSYKGEEQHLYLP-PRSC 292


>gi|408785541|ref|ZP_11197285.1| hypothetical protein C241_04278 [Rhizobium lupini HPC(L)]
 gi|408488554|gb|EKJ96864.1| hypothetical protein C241_04278 [Rhizobium lupini HPC(L)]
          Length = 196

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 49/106 (46%), Gaps = 17/106 (16%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           FG  Y Y ++ V          P+  + P+P W+  +   +L + G    N  +  + N+
Sbjct: 49  FGYRYDYKARAVA---------PDAYLGPLPPWL-GVFARRLVRNGHCE-NLPDQVIANE 97

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVS 204
           Y PG  I +H+D    FD  I+S+SL S       C+  F+ +R S
Sbjct: 98  YFPGQGISAHVDCVPCFDDTIVSISLLS------ACEMVFRDLRGS 137


>gi|195380141|ref|XP_002048829.1| GJ21257 [Drosophila virilis]
 gi|194143626|gb|EDW60022.1| GJ21257 [Drosophila virilis]
          Length = 616

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 38/87 (43%), Gaps = 6/87 (6%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQ------SLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           K  G E LY    VDP+    Q        +  +L        +  +   +N+Y+PG  I
Sbjct: 174 KHFGYEFLYGSNNVDPLQPLEQPIPTACDFLWQRLACCEAPALDTPDQLTVNEYEPGQGI 233

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCF 192
             H+D    F  PI+SLSL SD  + F
Sbjct: 234 PPHVDTHSAFVDPILSLSLQSDVVMDF 260


>gi|194862285|ref|XP_001969966.1| GG23646 [Drosophila erecta]
 gi|190661833|gb|EDV59025.1| GG23646 [Drosophila erecta]
          Length = 232

 Score = 38.5 bits (88), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 3/75 (4%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P G + + IP W+Q + + K+  +GV      N  ++N+Y PG  I+ H D P +F   
Sbjct: 58  HPNGMIAEEIPEWLQ-IYVDKVNNLGVFESQNANHVLVNEYLPGQGILPHTDGP-LFHPI 115

Query: 179 IISLSLFSDSALCFG 193
           I ++S  + + L F 
Sbjct: 116 ISTISTGAHTVLEFA 130


>gi|348553212|ref|XP_003462421.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Cavia
           porcellus]
          Length = 664

 Score = 38.5 bits (88), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   +N Y+PG  
Sbjct: 177 KHFGYEFYYENNNVDKDKPLPEGLPDICDSFLEKW--LAEGYIKHKPDQLTVNQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +H+D    F+  IISLSL S+  + F
Sbjct: 235 IPAHVDTHSAFEDEIISLSLGSEVVMDF 262


>gi|26334747|dbj|BAC31074.1| unnamed protein product [Mus musculus]
          Length = 447

 Score = 38.5 bits (88), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE++  +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  IISLSL S   + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSAIVMDF 262


>gi|61675696|ref|NP_080579.1| alkylated DNA repair protein alkB homolog 8 [Mus musculus]
 gi|81895388|sp|Q80Y20.1|ALKB8_MOUSE RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
           Full=Probable alpha-ketoglutarate-dependent dioxygenase
           ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
           tRNA methyltransferase ABH8; AltName: Full=tRNA
           (carboxymethyluridine(34)-5-O)-methyltransferase ABH8
 gi|30046964|gb|AAH50863.1| AlkB, alkylation repair homolog 8 (E. coli) [Mus musculus]
          Length = 664

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE++  +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  IISLSL S   + F
Sbjct: 235 IPAHIDTHSAFEDEIISLSLGSAIVMDF 262


>gi|355752600|gb|EHH56720.1| hypothetical protein EGM_06185 [Macaca fascicularis]
          Length = 732

 Score = 38.5 bits (88), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 265 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 322

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 323 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 350


>gi|302854777|ref|XP_002958893.1| hypothetical protein VOLCADRAFT_108374 [Volvox carteri f.
           nagariensis]
 gi|300255737|gb|EFJ40024.1| hypothetical protein VOLCADRAFT_108374 [Volvox carteri f.
           nagariensis]
          Length = 536

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)

Query: 128 IPAWIQSLVITKLEQMGVVP--PNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLF 185
           +P W+  LV    E++  +P     ++   +N+Y PG  +  H+D    F  PIISLSL 
Sbjct: 190 LPPWVSDLV----ERISALPEVSQPLDQLTVNEYDPGVGLAPHVDTHSAFTGPIISLSLG 245

Query: 186 SDSAL 190
           S + +
Sbjct: 246 STAVM 250


>gi|346466049|gb|AEO32869.1| hypothetical protein [Amblyomma maculatum]
          Length = 241

 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 120 YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRP 178
           +P+G V +P+P W++  +  K+  +GV      N  ++N+Y+PG  I+ H D P  +  P
Sbjct: 67  HPKGMVPEPLPQWLKG-ICAKVASLGVFGEKLPNHVLVNEYKPGEGILPHEDGPLYY--P 123

Query: 179 IIS-LSLFSDSALCF 192
           +++ ++L S + + F
Sbjct: 124 VVTNVTLNSSTVIDF 138


>gi|332208092|ref|XP_003253130.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
           [Nomascus leucogenys]
          Length = 664

 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262


>gi|26349167|dbj|BAC38223.1| unnamed protein product [Mus musculus]
          Length = 537

 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE++  +   YI    +   IN Y+PG  
Sbjct: 50  KHFGYEFHYESNTVDKDKPLPGGLPEVCSSILEKL--LKEGYIKHKPDQLTINQYEPGHG 107

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  IISLSL S   + F
Sbjct: 108 IPAHIDTHSAFEDEIISLSLGSAIVMDF 135


>gi|441644875|ref|XP_004090625.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 [Nomascus
           leucogenys]
          Length = 667

 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265


>gi|380789693|gb|AFE66722.1| alkylated DNA repair protein alkB homolog 8 [Macaca mulatta]
 gi|380789695|gb|AFE66723.1| alkylated DNA repair protein alkB homolog 8 [Macaca mulatta]
          Length = 664

 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262


>gi|297690113|ref|XP_002822470.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
           [Pongo abelii]
          Length = 667

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265


>gi|402895130|ref|XP_003910687.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 2
           [Papio anubis]
          Length = 667

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265


>gi|402895128|ref|XP_003910686.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 1
           [Papio anubis]
          Length = 664

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262


>gi|297690115|ref|XP_002822471.1| PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform 2
           [Pongo abelii]
          Length = 664

 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262


>gi|195487923|ref|XP_002092096.1| GE11857 [Drosophila yakuba]
 gi|194178197|gb|EDW91808.1| GE11857 [Drosophila yakuba]
          Length = 615

 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 55/135 (40%), Gaps = 24/135 (17%)

Query: 112 KGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSA-----------VINDYQ 160
           K  G E LY    VDP     QS+         ++ P   N A            +N+Y+
Sbjct: 175 KHFGFEFLYGSNNVDPSKPLEQSIP----SACDILWPRLDNFASTWDWSSPDQLTVNEYE 230

Query: 161 PGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL 220
           PG  I  H+D    F  PI+SLSL SD  + F         R  +  + + + R  + ++
Sbjct: 231 PGHGIPPHVDTHSAFLDPILSLSLQSDVVMDF---------RRGDDQVQVRLPRRSLLIM 281

Query: 221 RDFAANGITHCVRPQ 235
              A    TH +RP+
Sbjct: 282 SGEARYDWTHGIRPK 296


>gi|75048494|sp|Q95K79.1|ALKB8_MACFA RecName: Full=Alkylated DNA repair protein alkB homolog 8; AltName:
           Full=Probable alpha-ketoglutarate-dependent dioxygenase
           ABH8; AltName: Full=S-adenosyl-L-methionine-dependent
           tRNA methyltransferase ABH8; AltName: Full=tRNA
           (carboxymethyluridine(34)-5-O)-methyltransferase ABH8
 gi|14388555|dbj|BAB60797.1| hypothetical protein [Macaca fascicularis]
          Length = 664

 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262


>gi|109108545|ref|XP_001102763.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
           1 [Macaca mulatta]
 gi|109108547|ref|XP_001102947.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
           3 [Macaca mulatta]
          Length = 664

 Score = 38.1 bits (87), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 234

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 235 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 262


>gi|297269101|ref|XP_001102856.2| PREDICTED: alkylated DNA repair protein alkB homolog 8-like isoform
           2 [Macaca mulatta]
          Length = 667

 Score = 38.1 bits (87), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 180 KHFGYEFHYENNNVDKDKPLPGGLPDICDSFLEKW--LREGYIKHKPDQMTINQYEPGQG 237

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I +HID    F+  I+SLSL S+  + F
Sbjct: 238 IPAHIDTHSAFEDEIVSLSLGSEIVMDF 265


>gi|169764793|ref|XP_001816868.1| calpain [Aspergillus oryzae RIB40]
 gi|238503944|ref|XP_002383204.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|83764722|dbj|BAE54866.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220690675|gb|EED47024.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|391863454|gb|EIT72765.1| calpain [Aspergillus oryzae 3.042]
          Length = 236

 Score = 38.1 bits (87), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 127 PIPAWIQSLVITKLEQMGV---VPPNYINSAVINDYQPGGCIVSHID 170
           P+P+W+ S VI + E +G+    P    N  ++N+Y+PG  I+ H D
Sbjct: 69  PLPSWLVSPVIPRFESLGIFADAPHGAPNHVLVNEYRPGQGIMPHED 115


>gi|340939079|gb|EGS19701.1| hypothetical protein CTHT_0041830 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 353

 Score = 38.1 bits (87), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 39/82 (47%), Gaps = 15/82 (18%)

Query: 118 RLYPEGEVDP-IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFD 176
           R+YPEGE  P  PA +   + T      V P     +A++N Y PG  ++ H D     D
Sbjct: 194 RVYPEGEEPPAFPADLAGFLET------VFPETLAQAAIVNFYTPGDTMMMHRDVSEETD 247

Query: 177 RPIISLSLFSDSALCFGCKFNF 198
           + ++SLS        FGC+  F
Sbjct: 248 KGLVSLS--------FGCEGLF 261


>gi|302803799|ref|XP_002983652.1| hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]
 gi|300148489|gb|EFJ15148.1| hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]
          Length = 556

 Score = 37.7 bits (86), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 57/118 (48%), Gaps = 13/118 (11%)

Query: 122 EGEVDPIPAWIQSLVITKLEQMGVVPPNY-INSAVINDYQPGGCIVSHIDPPHI---FDR 177
           E   +P+P +++S +I +L +  VVP +   +S  I+  +PG  +     PPH    F++
Sbjct: 279 EETAEPMPTFLES-IIDRLVKCQVVPASKRPDSCSISVLEPGDYM-----PPHKHNNFEQ 332

Query: 178 PIISLSLFSDSALCFGCKFNFKPIRVSEPV-LYLPVQRGCVTLLRDFAANGITHCVRP 234
           P+  LSL S S L FGC          E   + LP   G V +L   +A  +   V+P
Sbjct: 333 PLFILSLGSQSELAFGCNLKANSSSTDEKYKVGLPA--GSVLVLEGNSAQMVQCAVQP 388


>gi|383858166|ref|XP_003704573.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Megachile rotundata]
          Length = 582

 Score = 37.7 bits (86), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 43/94 (45%), Gaps = 11/94 (11%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           FG  + YGS  V           +  + PIP   Q L +   ++   VP +Y +   IN 
Sbjct: 166 FGYEFQYGSNKVN---------LDKPITPIPEEYQFLQVL-FKKYHNVPYDY-DQLTINH 214

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCF 192
           Y PG  I  HID   +F+  I+SLSL S   + F
Sbjct: 215 YLPGQGIPPHIDTHSVFEDSILSLSLGSAYVMNF 248


>gi|444723561|gb|ELW64212.1| Alkylated DNA repair protein alkB like protein 8 [Tupaia chinensis]
          Length = 583

 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 41/88 (46%), Gaps = 9/88 (10%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQMGVVPPNYI----NSAVINDYQPGGC 164
           K  G E  Y    VD   P+P  +  +  + LE+   +   YI    +   IN Y+PG  
Sbjct: 98  KHFGYEFCYENNNVDRDKPLPGGLPDICDSILEKW--LKEGYIKYKPDQLTINQYEPGQG 155

Query: 165 IVSHIDPPHIFDRPIISLSLFSDSALCF 192
           I  HID    F+  I+SLSL S+  + F
Sbjct: 156 IPPHIDTHSAFEDEIVSLSLGSEVVMDF 183


>gi|53292605|ref|NP_001005390.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Danio
           rerio]
 gi|82184989|sp|Q6IQE9.1|ALKB6_DANRE RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 6; AltName: Full=Alkylated DNA repair protein
           alkB homolog 6
 gi|47939433|gb|AAH71457.1| AlkB, alkylation repair homolog 6 (E. coli) [Danio rerio]
          Length = 234

 Score = 37.7 bits (86), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 30/124 (24%), Positives = 59/124 (47%), Gaps = 18/124 (14%)

Query: 121 PEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPI 179
           P+G + + +P W+      K+  +G       N  ++N+Y+PG  I+ H D P ++   +
Sbjct: 67  PKGMLAEKLPDWLLEYT-EKISALGAFAGKTANHVLVNEYKPGEGIMPHEDGP-LYHPTV 124

Query: 180 ISLSLFSDSALCFGCKFNFKPIRVSEP----------VLYLPVQRGCVTLLRDFAANGIT 229
            ++++ S + L F     ++P+  +EP          +L L VQR  + +L+D       
Sbjct: 125 TTITVGSHTLLDF-----YRPVCQAEPDAPQTEESRYMLSLLVQRKSLLILQDDMYKCYL 179

Query: 230 HCVR 233
           H +R
Sbjct: 180 HGIR 183


>gi|417403669|gb|JAA48633.1| Putative alkylated dna repair protein alkb log 8 [Desmodus
           rotundus]
          Length = 656

 Score = 37.4 bits (85), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 60/130 (46%), Gaps = 23/130 (17%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSLVITKLEQM---GVVPPNYINSAVINDYQPGGCI 165
           K  G E  Y    VD   P+P  +  +  + LE+    G++  +  +   +N Y+PG  I
Sbjct: 177 KHFGYEFHYENNNVDKDKPLPGGLPDIYDSILEKWLKEGIIR-HKPDQLTVNQYEPGHGI 235

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVSEPVLYLPVQRGCVTLL----- 220
            +HID    F+  IISLSL S+  + F       P  V+ PV+ LP    C +LL     
Sbjct: 236 PAHIDTHSAFEDEIISLSLGSEIVMDFK-----HPDGVTVPVM-LP----CRSLLVMTGE 285

Query: 221 -RDFAANGIT 229
            R    +GIT
Sbjct: 286 SRYLWTHGIT 295


>gi|424909400|ref|ZP_18332777.1| alkylated DNA repair protein [Rhizobium leguminosarum bv. viciae
           USDA 2370]
 gi|392845431|gb|EJA97953.1| alkylated DNA repair protein [Rhizobium leguminosarum bv. viciae
           USDA 2370]
          Length = 215

 Score = 37.4 bits (85), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)

Query: 99  FGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQMGVVPPNYINSAVIND 158
           FG  Y Y ++ V          P+  + P+P W+  +   +L + G    +  +  + N+
Sbjct: 68  FGYRYDYKARAVA---------PDAYLGPLPPWL-GVFARRLVRDGHCE-SLPDQVIANE 116

Query: 159 YQPGGCIVSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPIRVS 204
           Y PG  I +H+D    FD  I+S+SL S       C+  F+ +R S
Sbjct: 117 YFPGQGISAHVDCVPCFDDTIVSISLLS------ACEMVFRDLRGS 156


>gi|302823387|ref|XP_002993346.1| hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]
 gi|300138777|gb|EFJ05531.1| hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]
          Length = 229

 Score = 37.4 bits (85), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 11/79 (13%)

Query: 112 KGLGNERLYPEGEV--------DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGG 163
           K L N RL   G +         P+P+W+   +  K+  +  + P+ IN  ++N+Y PG 
Sbjct: 42  KTLTNRRLQNWGGIVESNGLVPQPLPSWLTK-ITEKISSVTGLFPSPINHVLVNEYLPGQ 100

Query: 164 CIVSHIDPPHIFDRPIISL 182
            I+ H D P  F  P++++
Sbjct: 101 GIMLHQDGPSYF--PVVAI 117


>gi|260831338|ref|XP_002610616.1| hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]
 gi|229295983|gb|EEN66626.1| hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]
          Length = 231

 Score = 37.4 bits (85), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 15/96 (15%)

Query: 114 LGNERL-------YPEGEV-DPIPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           L N RL       +P+G V + +P W+Q +   ++  +GV      N  ++N+YQ G  I
Sbjct: 43  LSNRRLQNWGGLPHPKGMVVEKLPEWLQ-MCADRIHSLGVFQDKMPNHVLVNEYQQGQGI 101

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCFGCKFNFKPI 201
           + H D P +F   I +++L S + L F     +KP+
Sbjct: 102 MPHEDGP-LFYPTITTVNLGSHTMLDF-----YKPL 131


>gi|156361264|ref|XP_001625438.1| predicted protein [Nematostella vectensis]
 gi|156212272|gb|EDO33338.1| predicted protein [Nematostella vectensis]
          Length = 648

 Score = 37.4 bits (85), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 7/87 (8%)

Query: 112 KGLGNERLYPEGEVD---PIPAWIQSL---VITKLEQMGVVPPNYINSAVINDYQPGGCI 165
           K  G E LY   ++D   P+P  + ++   ++T++   G V  N  +   +N+Y PG  I
Sbjct: 176 KHYGYEFLYGTNDIDRAKPLPGGMPAVCNDILTRMVSQGAVQ-NTPDQLTVNEYLPGQGI 234

Query: 166 VSHIDPPHIFDRPIISLSLFSDSALCF 192
             H+D    F+  I SLSL +  ++ F
Sbjct: 235 PPHVDTHSAFEDGICSLSLGAKISMDF 261


>gi|402078397|gb|EJT73662.1| oxidoreductase [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 368

 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 12/105 (11%)

Query: 83  YKPCTVDRAPLRNKYFFGEGYTYGSQLVRKGLGNERLYPEGEVDPIPAWIQSLVITKLEQ 142
           +KP TV R   R  ++   G  Y            R+YP+G   P PA+ +  V   LE 
Sbjct: 177 HKPLTVRRVLDRRLHWVTLGGQYD--------WTNRVYPDGGTSP-PAFPRD-VAAFLET 226

Query: 143 MGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 187
             V P     +A++N Y PG  ++ H D     D+ ++S+SL  D
Sbjct: 227 --VFPDTLAQAAIVNFYSPGDTMMMHRDVSEETDKGLVSISLGCD 269


>gi|145491391|ref|XP_001431695.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398800|emb|CAK64297.1| unnamed protein product [Paramecium tetraurelia]
          Length = 634

 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 42/92 (45%), Gaps = 9/92 (9%)

Query: 128 IPAWIQSLVITKLEQMGVVPPNYINSAVINDYQPGGCIVSHIDPPHIFDRPIISLSLFSD 187
           IPA+++  V  K+  + + P   IN   IN+Y PG  I  H D    F    +++SL S 
Sbjct: 211 IPAFLED-VRAKVSDL-IKPQAEINQLTINEYLPGMGIPPHFDVHPPFHEKFVTISLLS- 267

Query: 188 SALCFGCKFNFKPIRVSEPVLYLPVQRGCVTL 219
                G   +FK  +  E  LYLP  R C   
Sbjct: 268 -----GLVMSFKSFKGEEHHLYLP-PRSCALF 293


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,952,568,633
Number of Sequences: 23463169
Number of extensions: 210736851
Number of successful extensions: 638449
Number of sequences better than 100.0: 263
Number of HSP's better than 100.0 without gapping: 86
Number of HSP's successfully gapped in prelim test: 177
Number of HSP's that attempted gapping in prelim test: 638215
Number of HSP's gapped (non-prelim): 310
length of query: 307
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 165
effective length of database: 9,027,425,369
effective search space: 1489525185885
effective search space used: 1489525185885
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)