BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy4937
         (356 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|332028068|gb|EGI68119.1| Alkylated DNA repair protein alkB-like protein 4 [Acromyrmex
           echinatior]
          Length = 300

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 164/364 (45%), Positives = 216/364 (59%), Gaps = 72/364 (19%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+R+C+ CE E+                                  S PN
Sbjct: 1   METVRPCGCKGIRSCLICETEYEI--------------------------------SKPN 28

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
                       +K +    Q  ++Y+YCP+CNK   G D+  Y+ HPNHSG   +  GV
Sbjct: 29  ------------LKNEL---QKSKSYIYCPYCNKLLSGWDIDEYKKHPNHSGNLINYPGV 73

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YIK DFL E E E++MK LDDLPW  SQSGRRKQNFGPKCNFKK+K++LG F GFP  T+
Sbjct: 74  YIKLDFLKEDEAEELMKALDDLPWQPSQSGRRKQNFGPKCNFKKRKLQLGSFNGFPKTTQ 133

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+Q +F +  +L  F+T+EQCTLEYN E G+SID+H+DDCWIWGERI+TVN++GDSVLT+
Sbjct: 134 FVQKKFEEISLLHGFETVEQCTLEYNPERGASIDSHVDDCWIWGERIITVNVLGDSVLTM 193

Query: 241 TYYNGDSNKYNL----NSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRI 296
           T Y G   +YNL    N      +  Y D  S  ++D                 ++V+R+
Sbjct: 194 TPYRGPITRYNLDYVSNYDFILKKDIYDDTHSLGIDD-----------------NIVVRL 236

Query: 297 PMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKA 352
           PMPARSL+VL+G ARY+W+H VLR DV  RRVC+AYRE T PY+ N    + T+ +  +A
Sbjct: 237 PMPARSLMVLYGPARYKWEHAVLRQDVTSRRVCLAYRELTPPYLKNGQHQKETSEILRQA 296

Query: 353 KCFF 356
             F+
Sbjct: 297 SLFW 300


>gi|383858126|ref|XP_003704553.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Megachile rotundata]
          Length = 300

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/360 (45%), Positives = 218/360 (60%), Gaps = 66/360 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+R+C+ CE E+     NIV                              
Sbjct: 1   METVRPCGCKGIRSCLICEAEY-----NIV------------------------------ 25

Query: 61  HSGKSFDIGGVYIKEDFLTSQTK-ETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGG 119
                        K DF +   K  +YVYCP C+K+W G D   Y++HPNH GK+ +  G
Sbjct: 26  -------------KPDFKSCLEKCSSYVYCPLCDKSWPGWDADLYKTHPNHQGKAIEYPG 72

Query: 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACT 179
           VYI+ DFL+  E   ++K+LD++PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP  T
Sbjct: 73  VYIQLDFLNSCEITSLVKELDNIPWETSQSGRRKQNFGPKCNFKKRKLRLGSFNGFPKPT 132

Query: 180 KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
           KF+Q++  +  +L+ FQTIEQCTLEY+   G+SID HIDDCWIWGERIVTVN++G+SVLT
Sbjct: 133 KFVQEKLSKVPILADFQTIEQCTLEYDPLRGASIDPHIDDCWIWGERIVTVNVMGNSVLT 192

Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMP 299
           +T Y+G   KYNL S           + ST   DL  + +S D    +    +V+R+PMP
Sbjct: 193 MTPYHGLDTKYNLKS---------ITEYSTTCKDLKFDVESED----DNNDDIVVRLPMP 239

Query: 300 ARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN----NEATASVFEKAKCF 355
             SL+VL+G ARY W+HCVLR D+  RR+C+AYREFT PY+N    NE    + ++A  F
Sbjct: 240 EGSLMVLYGLARYSWEHCVLREDITSRRICLAYREFTPPYLNDTLKNEEVREILKRALIF 299


>gi|345485379|ref|XP_001605092.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Nasonia vitripennis]
          Length = 294

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 157/362 (43%), Positives = 209/362 (57%), Gaps = 77/362 (21%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+RTC+ CE E+     ++    ++    CP+                  
Sbjct: 4   METVRPCGCKGIRTCLICENEYSISKPDLAADLKK----CPN------------------ 41

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
                                    YVYCP CNKAW G D+  Y+ HP HSG   +  G 
Sbjct: 42  -------------------------YVYCPECNKAWPGWDMNLYKQHPRHSGIPIEFPGT 76

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YIK DFL++ E + ++  LD LPW+ SQSGRRKQNFGPK NFKK+++++G+F GFP  TK
Sbjct: 77  YIKLDFLNQDELKSLVNSLDLLPWEISQSGRRKQNFGPKTNFKKKRLRMGNFIGFPESTK 136

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+QD+F+    L +FQTIEQC+LEY+S+ G+SID HIDDCWIWGERIVTVN++GDSVLT+
Sbjct: 137 FVQDKFNDVKELKNFQTIEQCSLEYDSQRGASIDPHIDDCWIWGERIVTVNILGDSVLTM 196

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           T Y G SNKYNL+       +  P+Q                        ++V+R+PMP 
Sbjct: 197 TPYRGSSNKYNLDCVESLQNQAAPEQ------------------------NIVVRLPMPE 232

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN------NEATASVFEKAKC 354
            SL+VL G ARY W+H VLR D+  RR+C+AYREFT PY+       NE +  + E+A  
Sbjct: 233 GSLMVLHGAARYSWEHAVLREDINLRRICLAYREFTPPYLPDDKGNCNEESREILERALN 292

Query: 355 FF 356
           F+
Sbjct: 293 FW 294


>gi|66554392|ref|XP_623503.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Apis mellifera]
          Length = 296

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 209/359 (58%), Gaps = 67/359 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+R+C+ CE E+     N+    E+                         
Sbjct: 1   METIRPCGCKGIRSCLLCETEYKIVKPNLKACFEKC------------------------ 36

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
                                   +YVYCP+C KAW G +   Y+SHPNH G + +  GV
Sbjct: 37  -----------------------SSYVYCPYCEKAWPGWNFDLYKSHPNHQGTAIEFPGV 73

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YIK DFL+  E   ++  L+ +PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP  T+
Sbjct: 74  YIKLDFLTSWEINSLINILEKIPWEASQSGRRKQNFGPKCNFKKKKLQLGSFNGFPKSTQ 133

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+Q +F +  +L++FQT+EQCTLEY+   G+SID HIDDCWIWGERIVTVN+IGDSVLT+
Sbjct: 134 FVQQKFSEIPILNNFQTVEQCTLEYDPIRGASIDPHIDDCWIWGERIVTVNVIGDSVLTM 193

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           + Y+G   KYNL S           + S+ +    +  K+ +         +++R+PMP 
Sbjct: 194 SSYHGPDTKYNLKSII---------EYSSTIEGFDVKNKNEN-------EDIIVRLPMPE 237

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM----NNEATASVFEKAKCF 355
            SL+VL+G ARY+W+HCVLR D+  RR+C+AYRE T PY+    NNE    + ++A  F
Sbjct: 238 GSLMVLYGIARYKWEHCVLREDINSRRICLAYRELTPPYLCNSTNNEVVKELLKRASVF 296


>gi|357617084|gb|EHJ70575.1| hypothetical protein KGM_10717 [Danaus plexippus]
          Length = 324

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 156/358 (43%), Positives = 213/358 (59%), Gaps = 61/358 (17%)

Query: 7   CGCKGVRTCIKCEQEFGYEN-KNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
           CGCKG RTC+ CE  +G E  KN+V+  ++                              
Sbjct: 6   CGCKGCRTCLICETYYGAEELKNLVKLDKD------------------------------ 35

Query: 66  FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKED 125
                             + YV+CP CNKAW G D+  Y+ HP H G+S D  GVYIK D
Sbjct: 36  ------------------KGYVFCPFCNKAWSGWDIDVYKQHPYHEGESIDYPGVYIKLD 77

Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
           F+SE E  ++M+ +D++PWD SQSGRRKQN+GPK NFKK+KI  G F GFP  ++++QDR
Sbjct: 78  FISEYEETELMRNIDEVPWDISQSGRRKQNYGPKTNFKKKKIVPGQFNGFPKFSQYLQDR 137

Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
           F   D+L  ++ IEQC+LEY+   G+SID HIDDCW+WGERI+TVN + DSVLT+T + G
Sbjct: 138 FKTFDILKGYEVIEQCSLEYDPMKGASIDPHIDDCWVWGERILTVNCLSDSVLTMTPFKG 197

Query: 246 DSNKYNLNSPCFSYQKKYP---DQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARS 302
           D+ KYNL        K+YP       T+  D +  +   ++   +    V+IRIP+  RS
Sbjct: 198 DTIKYNLYC-----AKEYPPVVQDDGTVDMDFTSKQNMFEASKPKEDLDVIIRIPLIRRS 252

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEA----TASVFEKAKCFF 356
           LL+++GE+RY W+H VLR D++ RRVCIAYREFT P+MNN         + ++AK F+
Sbjct: 253 LLIIYGESRYHWEHRVLREDIVSRRVCIAYREFTPPFMNNGVHEILGREIRDRAKLFW 310


>gi|340727539|ref|XP_003402099.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Bombus terrestris]
          Length = 300

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 151/359 (42%), Positives = 207/359 (57%), Gaps = 64/359 (17%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+R+C+ CE E      N+    E+                         
Sbjct: 1   METVRSCGCKGIRSCLLCETEHKIVKPNLKTHFEKC------------------------ 36

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
                                   +YVYCP+C+K+W G D   Y++HP H G + +  GV
Sbjct: 37  -----------------------SSYVYCPNCDKSWPGWDADLYKNHPYHRGTAIEFPGV 73

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YIK DFL+  E   +   L+ +PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP  T+
Sbjct: 74  YIKLDFLNPWEVNSLKNALEKIPWEVSQSGRRKQNFGPKCNFKKRKLRLGSFDGFPKSTQ 133

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+Q +F Q  +L++FQTIEQCTLEY+   G+SID HIDDCWIWGERIVTVN++G+SVLT+
Sbjct: 134 FVQQKFCQVPLLNNFQTIEQCTLEYDPLRGASIDPHIDDCWIWGERIVTVNVMGNSVLTM 193

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           + Y+G   KYNL S           + STI+     +        NE+   +V+R+PMP 
Sbjct: 194 SPYHGPDTKYNLESVA---------EYSTILEGFEFDTNDK----NESNKDIVVRLPMPE 240

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM----NNEATASVFEKAKCF 355
            SL++L+  ARY+W+HCVLR D+  +R+C+AYRE T PY+    NNE    + ++A  F
Sbjct: 241 GSLMILYDSARYKWEHCVLREDITSKRICLAYRELTPPYLCTSTNNEIVKELLKRASVF 299


>gi|350423117|ref|XP_003493390.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Bombus impatiens]
          Length = 300

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 150/359 (41%), Positives = 207/359 (57%), Gaps = 64/359 (17%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+R+C+ CE E      N+    E+                         
Sbjct: 1   METVRSCGCKGIRSCLLCETEHKIVKPNLKTHFEKC------------------------ 36

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
                                   +YVYCP+C+K+W G D   Y++HP H G + +  GV
Sbjct: 37  -----------------------SSYVYCPNCDKSWPGWDADLYKNHPYHQGTAIEFPGV 73

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YIK DFL+  E   +   L+ +PW+ SQSGRRKQNFGPKCNFKK+K++LG F GFP  T+
Sbjct: 74  YIKLDFLNPWEINSLKNALEKIPWEVSQSGRRKQNFGPKCNFKKRKLQLGSFDGFPKSTQ 133

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+Q +F Q  +L++FQT+EQCTLEY+   G+SID HIDDCWIWGERIVTVN++G+SVLT+
Sbjct: 134 FVQQKFCQVPMLNNFQTVEQCTLEYDPLRGASIDPHIDDCWIWGERIVTVNVMGNSVLTM 193

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           + Y+G   KYNL S           + STI+     +        NE+   +V+R+PMP 
Sbjct: 194 SPYHGPDTKYNLESVA---------EYSTILEGFEFDTNDK----NESNKDIVVRLPMPE 240

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM----NNEATASVFEKAKCF 355
            SL++L+  ARY+W+HCVLR D+  +R+C+AYRE T PY+    NNE    + ++A  F
Sbjct: 241 GSLMILYDSARYKWEHCVLREDITSKRICLAYRELTPPYLCTSTNNEIVKELLKRASVF 299


>gi|307183142|gb|EFN70059.1| Alkylated DNA repair protein alkB-like protein 4 [Camponotus
           floridanus]
          Length = 298

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 208/360 (57%), Gaps = 66/360 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+R+C+ CE ++     ++ E                             
Sbjct: 1   METVRPCGCKGIRSCLICEAKYQILKPDLKEVL--------------------------- 33

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
                               Q  ++YVYCP+C+K+  G D+  Y+ HP H G   D  GV
Sbjct: 34  --------------------QKSKSYVYCPYCDKSSPGWDIDEYKKHPYHDGNLIDYPGV 73

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YIK +FL+E E + +M+ LD LPW  SQSGRRKQNFGPKCNFKK+K++LG F GFP  T+
Sbjct: 74  YIKLNFLNEDEAKKLMEALDYLPWQASQSGRRKQNFGPKCNFKKRKLQLGTFNGFPKTTQ 133

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           FIQ++F +  +L  F+T+EQC+LEY+ + G+SID HIDDCWIWGERIVTVN++GDSVLT+
Sbjct: 134 FIQNKFAEIPLLHGFRTVEQCSLEYDPKRGASIDPHIDDCWIWGERIVTVNILGDSVLTM 193

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           T Y G   +YNL+  C S                S+ EK   +     +   ++R+PMPA
Sbjct: 194 TPYRGSIMRYNLD--CVSSYN-------------SILEKDFCNNTQINIDDNIVRLPMPA 238

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356
           RSL++L G ARY+W+H +LR D+  RRVC+AYRE T PY+ N    +  + +  +A  F+
Sbjct: 239 RSLMILHGPARYQWEHSILRQDITSRRVCLAYRELTPPYLENGEYHKEASEILRQALSFW 298


>gi|170051906|ref|XP_001861979.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872935|gb|EDS36318.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 318

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 157/365 (43%), Positives = 214/365 (58%), Gaps = 70/365 (19%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           MD    CGCKG RTC++CE++F  E                                   
Sbjct: 1   MDHPRPCGCKGRRTCLQCEKDFAIER---------------------------------- 26

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL-YAYQSHPN-HSGKSFD-I 117
                 D   V+        QT + Y YCPHCN+ + G D     ++H   H G+  +  
Sbjct: 27  -----IDYRDVF--------QTCKAYSYCPHCNRIYPGWDTDRVMEAHAGGHDGQPGEEY 73

Query: 118 GGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPA 177
            GV +  D++SE E   +MK +D++PWD SQSGRRKQNFGPK NFKK K++LG F+GFP 
Sbjct: 74  PGVLVALDWVSEDEERALMKGIDEMPWDGSQSGRRKQNFGPKTNFKKNKLQLGQFRGFPQ 133

Query: 178 CTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
            +KF+QDRF Q  ++++FQTIEQC+LEY  E G+SID HIDDCWIWGERIVTVNL+ DSV
Sbjct: 134 FSKFVQDRFEQVPLMANFQTIEQCSLEYTPERGASIDPHIDDCWIWGERIVTVNLLSDSV 193

Query: 238 LTLTYYN--GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
           LT++ Y+  G++ +YNL+        +Y D+   ++ +L+      D    E   + V+R
Sbjct: 194 LTMSRYSAEGNATRYNLH-----LVDRYQDK---LITELA------DDASMERFENAVVR 239

Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEA----TASVFEK 351
           IPMP RSLLV++G  RY+W+HCVLR D+  RRVC+AYREFT  Y+   +    +  VF K
Sbjct: 240 IPMPRRSLLVMYGPPRYQWEHCVLREDIRERRVCLAYREFTPMYLQEGSEYAKSEEVFCK 299

Query: 352 AKCFF 356
           A+ F+
Sbjct: 300 AQTFW 304


>gi|91087937|ref|XP_971924.1| PREDICTED: similar to CG4036 CG4036-PA [Tribolium castaneum]
 gi|270011936|gb|EFA08384.1| hypothetical protein TcasGA2_TC006028 [Tribolium castaneum]
          Length = 306

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 157/365 (43%), Positives = 205/365 (56%), Gaps = 72/365 (19%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           MD+   CGCKG RTC+ CE  FG                                     
Sbjct: 1   MDKPRPCGCKGCRTCLICESTFGL------------------------------------ 24

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
            S K F              + ++TYVYCP CN A+ G D      HP+HSG      G+
Sbjct: 25  -SAKKF------------ARRREDTYVYCPQCNVAFPGWDP---PKHPDHSGAPITFPGI 68

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YI+ DFL+  E   +M+ +DD+ WD SQSGRRKQNFGPKCNFKK K++ GDF GFP  T+
Sbjct: 69  YIQLDFLTPEEEAKLMQGIDDMSWDSSQSGRRKQNFGPKCNFKKNKLREGDFTGFPKFTE 128

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+Q++F    +LS FQTIEQC+LEY+ E G+SID H+DDCW+WGER+VTVNL+ DSVLT+
Sbjct: 129 FVQEKFRNVPILSDFQTIEQCSLEYDPEKGASIDPHVDDCWVWGERVVTVNLLSDSVLTM 188

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYP---DQKSTIVNDLSLNEK--STDSIVNETLSSVVIR 295
           T YN  +N YNL+        KYP   D+K     D +  E         N+     V R
Sbjct: 189 T-YNRKNNYYNLH-----LVGKYPAVLDEKG----DRNFTESCYKVKEFENDRFYPTV-R 237

Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEK 351
           +PMP RSLLV++  ARY+W+H +LR D+  RRVC+AYREFT  Y+      +  A +  +
Sbjct: 238 VPMPRRSLLVMFNSARYDWEHQILREDIRERRVCLAYREFTPIYLKGGDKEQEGAPILRQ 297

Query: 352 AKCFF 356
           A+ F+
Sbjct: 298 ARNFW 302


>gi|158298439|ref|XP_318611.3| AGAP009588-PA [Anopheles gambiae str. PEST]
 gi|157013881|gb|EAA14576.4| AGAP009588-PA [Anopheles gambiae str. PEST]
          Length = 315

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 155/365 (42%), Positives = 207/365 (56%), Gaps = 68/365 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           MD    CGCKG RTC+ CE EFG                           D Y+      
Sbjct: 1   MDHPRPCGCKGRRTCLSCEAEFGIAP-----------------------IDFYS------ 31

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSD----LYAYQSHPNHSG-KSF 115
                             T Q  ++YVYCP C+K + G D    L  +   P H G +  
Sbjct: 32  ------------------TFQNNDSYVYCPRCSKIYPGWDWEKVLAEHSDTPQHGGVQGE 73

Query: 116 DIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGF 175
           D  GVYI  DFL+  E  ++++ LD+LPWD SQSGRRKQNFGPK NFKK +++   F GF
Sbjct: 74  DYPGVYIDLDFLTTTEEAELLQGLDELPWDVSQSGRRKQNFGPKTNFKKTRLRPAQFAGF 133

Query: 176 PACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGD 235
           P  ++F+Q RF Q  +L+ FQTIEQC+LEY  E G+SID HIDDCWIWGERIVTVNL+ D
Sbjct: 134 PQLSEFVQRRFEQVPLLATFQTIEQCSLEYCPERGASIDPHIDDCWIWGERIVTVNLLSD 193

Query: 236 SVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
           SVLT++ Y G+  K   N     + ++Y   +  ++ +L ++E++     N      ++R
Sbjct: 194 SVLTMSPYRGEEGKTKYN---LHFLEQY---REHLLGEL-MDEEALAGYENR-----IVR 241

Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEA----TASVFEK 351
           IPMP RSLLVL+G  RY+W+H VLR D+  RRVC+AYREFT  Y+   +    +  +FE+
Sbjct: 242 IPMPRRSLLVLYGPPRYQWEHSVLREDIKERRVCLAYREFTPMYLQGGSEFGQSEEIFER 301

Query: 352 AKCFF 356
           AK F+
Sbjct: 302 AKQFW 306


>gi|157127235|ref|XP_001654880.1| hypothetical protein AaeL_AAEL010768 [Aedes aegypti]
 gi|108872983|gb|EAT37208.1| AAEL010768-PA [Aedes aegypti]
          Length = 313

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 157/366 (42%), Positives = 214/366 (58%), Gaps = 72/366 (19%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           MD    CGCKG RTC+ CE++F  E                                   
Sbjct: 1   MDHPRPCGCKGRRTCLSCEKDFEIE----------------------------------- 25

Query: 61  HSGKSFDIGGVYIKEDFLTS-QTKETYVYCPHCNKAWQGSDLYAYQSH--PNHSGKSF-D 116
                        + DFL   ++ + Y YCP C+K + G D     +    NH G +  D
Sbjct: 26  -------------RVDFLKIFKSSKAYSYCPRCHKIYPGWDTAKVMAEHDRNHGGDAGED 72

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
             GVYI  DFLSE+E  ++MK +D++PWD SQSGRRKQNFGPK NFKK KIK+G+F+GFP
Sbjct: 73  YPGVYIDLDFLSESEEAELMKGIDEMPWDMSQSGRRKQNFGPKTNFKKTKIKVGNFEGFP 132

Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
             ++F+Q +F Q  ++  FQ+IEQC+LEYN E G+SID HIDDCWIWGERIVTVN + DS
Sbjct: 133 KFSEFVQRKFDQVPLMKGFQSIEQCSLEYNPEKGASIDPHIDDCWIWGERIVTVNFLSDS 192

Query: 237 VLTLTYYNGD--SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVI 294
           VLT++ Y  +   N+YNL+     +  +Y D+   ++ +L ++E++ D          ++
Sbjct: 193 VLTMSRYRKEDGKNRYNLD-----FVDRYKDK---LIGEL-VDEETMDK-----FDDRIV 238

Query: 295 RIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM--NNEATAS--VFE 350
           RIPMP RSLL+L+G  RY+W+H VLR D+  RRVC+AYREFT  Y+   NE + S  +F 
Sbjct: 239 RIPMPRRSLLILYGPPRYQWEHSVLREDITERRVCLAYREFTPMYLEDGNEYSKSEEIFR 298

Query: 351 KAKCFF 356
            AK F+
Sbjct: 299 LAKNFW 304


>gi|321472550|gb|EFX83520.1| hypothetical protein DAPPUDRAFT_48015 [Daphnia pulex]
          Length = 293

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 197/335 (58%), Gaps = 66/335 (19%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG R+C+ CE+EF     N++ F ++++                        + +S 
Sbjct: 8   CGCKGYRSCLLCEKEF-----NLLPFKDDSF------------------------TAESL 38

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
           +  G+            ++YV+CP CN AW G D +    HPNH G+  +  G+YI++DF
Sbjct: 39  EEKGI------------QSYVFCPACNLAWPGWD-FDPTHHPNHCGEPIEFPGIYIQQDF 85

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
            SE E   +M   D +PW+ S SGRRKQNFGPK NFKK+K++LG F+G P+ ++   DR 
Sbjct: 86  FSEEEMVKLMSSFDSIPWEYSVSGRRKQNFGPKTNFKKRKLQLGSFKGVPSFSQLFHDRI 145

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  ++TIEQC+LEY+ E G+SI+ HIDDCW+WGERIVT+NL GDSVLT+T Y+G 
Sbjct: 146 ESVPLLKDYRTIEQCSLEYDPERGASIEPHIDDCWVWGERIVTLNLAGDSVLTMTKYHGG 205

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
             KYNL                    D+ L    T+ I +     + +RIP PARSL+++
Sbjct: 206 VEKYNL-------------------QDVDLELPKTEDIPD-----ICVRIPQPARSLVMI 241

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN 341
           +G ARY+W HCVLR D+  RRVCIA REFT  ++N
Sbjct: 242 YGSARYQWLHCVLRRDIKERRVCIALREFTKQFLN 276


>gi|332373008|gb|AEE61645.1| unknown [Dendroctonus ponderosae]
          Length = 306

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 205/360 (56%), Gaps = 61/360 (16%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+ +  CGCKG RTC+ CE  FG                                     
Sbjct: 1   MNTTRPCGCKGCRTCLHCEALFGI------------------------------------ 24

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
               SFD      KE+ +   T+   VYCP CN  + G  + +Y+SHPNH G      GV
Sbjct: 25  ----SFD------KEN-IKKGTQNKKVYCPFCNLCYPGKSVDSYKSHPNHEGTPTRYEGV 73

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
           YI+ +FLS++E + +M+ +D + WD SQSGRRKQNFGPKCNFK +K++LG FQGFP  T+
Sbjct: 74  YIEMNFLSDSEHDQLMQGIDSMAWDLSQSGRRKQNFGPKCNFKARKLRLGKFQGFPKFTR 133

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           F+Q++F    +L  FQTIEQC+LEY+   GSSID H+DDCW+WGERIVTVNL+ DSVLTL
Sbjct: 134 FVQEKFEDVPILQGFQTIEQCSLEYHPARGSSIDPHVDDCWVWGERIVTVNLLSDSVLTL 193

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           T  N    +YNL+         YP     + + + L    +D  V   +   V+ +PMP 
Sbjct: 194 T-PNQYPTRYNLD-----LVGTYP---KVVQHSVKLETNRSDQSVEACVEDTVL-VPMPK 243

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356
            SLLV++G ARY+++H +LR D+  RRVC+AYREFT  Y+      E    V + +K FF
Sbjct: 244 NSLLVMYGSARYDYEHRILREDITERRVCLAYREFTPIYLEGGQFYEEGKLVTKMSKEFF 303


>gi|195118911|ref|XP_002003975.1| GI20151 [Drosophila mojavensis]
 gi|193914550|gb|EDW13417.1| GI20151 [Drosophila mojavensis]
          Length = 304

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 196/344 (56%), Gaps = 64/344 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ F  E  ++                               
Sbjct: 1   MNTLRPCGCKGVRTCLTCEQNFNIEKPSV------------------------------- 29

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIG 118
                        ++ F   Q  E++ YC  C   + G D  A Q+ H +H+   S  + 
Sbjct: 30  -------------QQQF---QQLESWSYCTKCEAFYPGWDTQAVQAAHSDHNLTDSLALP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+Y++E FLS AE + ++  LD+LPWD SQSGRRKQN+GPK NFKK++++LG FQGFPA 
Sbjct: 74  GIYVQEQFLSAAEGKQLIVDLDNLPWDISQSGRRKQNYGPKTNFKKRRLQLGSFQGFPAA 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF    +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TRYVQMRFESVPLLRDFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193

Query: 239 TLTYYNGD-SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TL  Y+   + KYNL+          PD +S ++  L   E+             V+RIP
Sbjct: 194 TLNLYDQKLAGKYNLDL--------VPDYESHLLEPLLTQEQLA------AYKGKVLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN 341
           M   SL+VL+G ARY+++H VLR DV  RRVCIAYREFT  Y++
Sbjct: 240 MNNLSLIVLYGPARYQFEHSVLREDVTERRVCIAYREFTPMYID 283


>gi|195473383|ref|XP_002088975.1| GE18871 [Drosophila yakuba]
 gi|194175076|gb|EDW88687.1| GE18871 [Drosophila yakuba]
          Length = 304

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 152/362 (41%), Positives = 195/362 (53%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ+F                                      
Sbjct: 1   MNTIRPCGCKGVRTCLGCEQDF-------------------------------------- 22

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK--SFDIG 118
                  I    ++E F   Q  E + YC  C+    G D    Q    H  K     + 
Sbjct: 23  ------QIAKPSLQEQF---QQLEAWSYCIQCDLLQPGWDTTKVQKDHEHHKKDEGLPLP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E+FLSE E   ++  LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP  
Sbjct: 74  GILVQENFLSEEEGSQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF +  +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 134 TEYVQRRFEEVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 193

Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y      KYNL            D  ++  N+L       D +V  +    V+RIP
Sbjct: 194 TLTPYEVQQPGKYNL------------DLVASYENELVAPLLIEDQLV--SFEGKVLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA---SVFEKAKC 354
           MP  SL+ L+G ARY+++H VLR DV  RRVC+AYREFT  Y+N E       V EK++ 
Sbjct: 240 MPNLSLIALYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGEDIQQGDPVREKSQI 299

Query: 355 FF 356
           F+
Sbjct: 300 FW 301


>gi|195339545|ref|XP_002036380.1| GM17708 [Drosophila sechellia]
 gi|194130260|gb|EDW52303.1| GM17708 [Drosophila sechellia]
          Length = 389

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 153/362 (42%), Positives = 197/362 (54%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ                                        
Sbjct: 86  MNTIRPCGCKGVRTCLSCEQ---------------------------------------- 105

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIG 118
               +F I    +KE F   Q  E + YC  C+    G D    Q  H NH   +   + 
Sbjct: 106 ----NFQIAKTSLKEQF---QQLEAWSYCIQCDLLQPGWDTNQVQKDHENHKKDEGLPLP 158

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E+FLS  E   ++  LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP  
Sbjct: 159 GILVQEEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 218

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF    +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 219 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 278

Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y      KYNL+    SY+ +     + ++ D  L           T    V+RIP
Sbjct: 279 TLTPYEVQQQGKYNLDL-VASYEDEL---LAPLLTDDQL----------ATFEGKVLRIP 324

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---EATASVFEKAKC 354
           MP  SL+VL+G ARY+++H VLR DV  RRVC+AYREFT  Y+N    +    V EK++ 
Sbjct: 325 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGADIQKGDPVREKSQI 384

Query: 355 FF 356
           F+
Sbjct: 385 FW 386


>gi|24583140|ref|NP_609314.1| CG4036, isoform A [Drosophila melanogaster]
 gi|320544830|ref|NP_001188762.1| CG4036, isoform B [Drosophila melanogaster]
 gi|7297561|gb|AAF52815.1| CG4036, isoform A [Drosophila melanogaster]
 gi|220944368|gb|ACL84727.1| CG4036-PA [synthetic construct]
 gi|220954238|gb|ACL89662.1| CG4036-PA [synthetic construct]
 gi|318068393|gb|ADV37012.1| CG4036, isoform B [Drosophila melanogaster]
          Length = 304

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 151/362 (41%), Positives = 199/362 (54%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ+F                                      
Sbjct: 1   MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 22

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL-YAYQSHPNHS-GKSFDIG 118
                  I    ++E F   Q  E + YC  C+   +G D  +  + H NH   +   + 
Sbjct: 23  ------HIAKTSLREQF---QQLEAWSYCIQCDLLQRGWDTNHVQKDHENHKKDEGLPLP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E+FLS  E   ++  LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP  
Sbjct: 74  GILVQEEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF    +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 134 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 193

Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y    S KYNL            D  ++  ++L     + D +   T    V+RIP
Sbjct: 194 TLTPYEVQQSGKYNL------------DLVASYEDELLAPLLTDDQLA--TFEGKVLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---EATASVFEKAKC 354
           MP  SL+VL+G ARY+++H VLR DV  RRVC+AYREFT  Y+N    +    V EK++ 
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGVDIQKGDPVREKSQI 299

Query: 355 FF 356
           F+
Sbjct: 300 FW 301


>gi|21483444|gb|AAM52697.1| LD42289p [Drosophila melanogaster]
          Length = 310

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 151/362 (41%), Positives = 199/362 (54%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ+F                                      
Sbjct: 7   MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 28

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL-YAYQSHPNHS-GKSFDIG 118
                  I    ++E F   Q  E + YC  C+   +G D  +  + H NH   +   + 
Sbjct: 29  ------HIAKTSLREQF---QQLEAWSYCIQCDLLQRGWDTNHVQKDHENHKKDEGLPLP 79

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E+FLS  E   ++  LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP  
Sbjct: 80  GILVQEEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 139

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF    +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GDSVL
Sbjct: 140 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVL 199

Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y    S KYNL            D  ++  ++L     + D +   T    V+RIP
Sbjct: 200 TLTPYEVQQSGKYNL------------DLVASYEDELLAPLLTDDQLA--TFEGKVLRIP 245

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---EATASVFEKAKC 354
           MP  SL+VL+G ARY+++H VLR DV  RRVC+AYREFT  Y+N    +    V EK++ 
Sbjct: 246 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGVDIQKGDPVREKSQI 305

Query: 355 FF 356
           F+
Sbjct: 306 FW 307


>gi|307200597|gb|EFN80738.1| Alkylated DNA repair protein alkB-like protein 4 [Harpegnathos
           saltator]
          Length = 240

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 122/236 (51%), Positives = 169/236 (71%), Gaps = 16/236 (6%)

Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQD 184
           DFL++ E E++MK LD LPW  SQSGRRKQNFGPKCNFKK+K++LG F GFP  T+F+Q 
Sbjct: 17  DFLNKDEAEELMKSLDHLPWQASQSGRRKQNFGPKCNFKKRKLRLGTFNGFPKITQFVQT 76

Query: 185 RFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYN 244
           +F +  +L +F+T+EQC+LEY+   G+SID HIDDCWIWGERIVTVN++GD+VLT+T Y 
Sbjct: 77  KFEEVPLLHNFKTVEQCSLEYDPARGASIDPHIDDCWIWGERIVTVNVLGDTVLTMTPYR 136

Query: 245 GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLL 304
           G  N+YNL+          P+  ++I+ D       T +I ++T    V+R+PMPARSL+
Sbjct: 137 GPINRYNLDC--------VPNYNNSILKDDVCKNTQTVNIGDDT----VVRLPMPARSLM 184

Query: 305 VLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN----EATASVFEKAKCFF 356
           +L+G  RY+W+H VLR DV  RRVC+AYRE T PY+ N    +  + + ++A+ F+
Sbjct: 185 ILYGPTRYKWEHSVLRQDVASRRVCLAYRELTPPYLENGENYKEASEILKQAEFFW 240


>gi|312381587|gb|EFR27301.1| hypothetical protein AND_06082 [Anopheles darlingi]
          Length = 517

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 132/260 (50%), Positives = 172/260 (66%), Gaps = 20/260 (7%)

Query: 103 AYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNF 162
           A  + P+      + GGV+I   FLSE E  D+++ LD++PWD SQSGRRKQNFGPK NF
Sbjct: 263 ASPTAPHGGAPGEEYGGVFIDLHFLSEEEETDLIRTLDEIPWDVSQSGRRKQNFGPKTNF 322

Query: 163 KKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWI 222
           KK ++K G F GFPA T+F+Q RF    +L+ FQTIEQC+LEY+ E G+SID HIDDCWI
Sbjct: 323 KKTRLKEGQFVGFPAATEFVQRRFESVPLLTTFQTIEQCSLEYDPERGASIDPHIDDCWI 382

Query: 223 WGERIVTVNLIGDSVLTLTYYNGDSN--KYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKS 280
           WGERIVTVNL+ DSVLT++ Y G     KYNL         ++ DQ    +  L  +E+S
Sbjct: 383 WGERIVTVNLLSDSVLTMSLYRGADGKPKYNL---------QFVDQYRDRLLGLLADEQS 433

Query: 281 TDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
                N      ++RIPMP RSLLVL+G  RY+W+H VLR D+  RRVC+AYREFT  Y+
Sbjct: 434 LACYENR-----IVRIPMPRRSLLVLYGSPRYQWEHSVLREDISDRRVCLAYREFTPMYL 488

Query: 341 NNEATAS----VFEKAKCFF 356
            + +  S    +F++AK F+
Sbjct: 489 QDGSEFSQSEEIFKRAKLFW 508


>gi|195053089|ref|XP_001993463.1| GH13824 [Drosophila grimshawi]
 gi|193900522|gb|EDV99388.1| GH13824 [Drosophila grimshawi]
          Length = 307

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 142/346 (41%), Positives = 196/346 (56%), Gaps = 65/346 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CE+++  E               P   + +Q  D ++Y     
Sbjct: 1   MNTLRPCGCKGVRTCLTCEKDYEIEK--------------PSLQQQFQQLDAWSY----- 41

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHSG-KSFDIG 118
                                       C  C + + G D  A Q+ HP H+G +S  + 
Sbjct: 42  ----------------------------CTQCKRLFAGWDTKAVQAAHPAHNGAESLPLP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E FLS  E   +M  LD+LPW  SQSGRRKQN+GPK NFKK+++++G F+GFP  
Sbjct: 74  GILVQEKFLSSTEGVQLMSDLDNLPWVISQSGRRKQNYGPKTNFKKRRLQVGAFEGFPIS 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q+RF    +L +FQTIEQC+LEY+   G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TQYVQERFESVPLLHNFQTIEQCSLEYDPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193

Query: 239 TLTYY--NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRI 296
           TL  Y     S KYNL+          P  +  ++  L LN +        T    V+RI
Sbjct: 194 TLNLYEQQQQSGKYNLDL--------VPQYEEKLLEPL-LNPQQL-----ATYEGKVLRI 239

Query: 297 PMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
           PMP  SL+VL+G AR++++H VLR DV  RRVCIAYREFT  Y++ 
Sbjct: 240 PMPNLSLIVLYGPARFQFEHAVLREDVSERRVCIAYREFTPMYIDG 285


>gi|195385022|ref|XP_002051207.1| GJ13536 [Drosophila virilis]
 gi|194147664|gb|EDW63362.1| GJ13536 [Drosophila virilis]
          Length = 306

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 147/364 (40%), Positives = 203/364 (55%), Gaps = 71/364 (19%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKN--IVEFTEETYIYCPHCNKAWQGSDLYAYQ-S 57
           M+    CGCKGVRTC+ CEQ++  +  +        E++ YC  C + + G D  A Q +
Sbjct: 1   MNTLRPCGCKGVRTCLTCEQDYQIDKPSLQQQLQQLESWSYCAQCERLYAGWDTQAVQAA 60

Query: 58  HPNHS-GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFD 116
           HP+H+  +S  + G+ ++E F+T                                     
Sbjct: 61  HPDHNLAESLPLPGILVQEQFIT------------------------------------- 83

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
                        AE   +M  LD LPW  SQSGRRKQN+GPK NFKK++++LG F+GFP
Sbjct: 84  ------------RAEGAQLMSDLDALPWAISQSGRRKQNYGPKTNFKKRRLQLGAFEGFP 131

Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
           A T+F+Q RF    +L +FQTIEQC+LEY+   G+SID H+DDCWIWGER+VTVN +GD+
Sbjct: 132 AATRFVQQRFESVPLLRNFQTIEQCSLEYDPSKGASIDPHVDDCWIWGERVVTVNCLGDA 191

Query: 237 VLTLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
           VLTL  Y    +NKYNL+       ++Y   K+ ++  L     S D +        V+R
Sbjct: 192 VLTLNLYEEQQANKYNLD-----LVQQY---KAQLLAPLL----SPDQLA--AYKGKVLR 237

Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY---MNNEATASVFEKA 352
           IPMP  SL+VL+G ARY+++H VLR DV  RRVCIAYREFT  Y   ++ +    V EK+
Sbjct: 238 IPMPNLSLIVLYGPARYQFEHAVLREDVDERRVCIAYREFTPMYIDGLDKQQGDVVREKS 297

Query: 353 KCFF 356
             F+
Sbjct: 298 HLFW 301


>gi|194859295|ref|XP_001969349.1| GG10057 [Drosophila erecta]
 gi|190661216|gb|EDV58408.1| GG10057 [Drosophila erecta]
          Length = 304

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 151/362 (41%), Positives = 192/362 (53%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ+F                                      
Sbjct: 1   MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 22

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK--SFDIG 118
                  I    ++E F   Q  E + YC  C+    G D    Q    H  K     + 
Sbjct: 23  ------QIAKPSLQEQF---QQLEAWSYCIQCDLLQPGWDTTKVQKDHEHHKKDEGLPLP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E FLS  E   ++  LDDLPWD SQSGRRKQNFGPK NFKK+K++LG F GFP  
Sbjct: 74  GILVQEKFLSAEEGSRLIADLDDLPWDISQSGRRKQNFGPKTNFKKRKLRLGSFAGFPRT 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF +  +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFEEVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193

Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT +      KYNL+         Y D+    +    LNE    S         V+RIP
Sbjct: 194 TLTPFEVQPPGKYNLD-----LVASYEDELLAPL----LNEDQLAS-----FEGKVLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA---SVFEKAKC 354
           MP  SL+VL+G ARY+++H VLR DV  RRVC+AYREFT  Y+N E       V EK+  
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMYINGEDIQKGDPVREKSHI 299

Query: 355 FF 356
           F+
Sbjct: 300 FW 301


>gi|194765625|ref|XP_001964927.1| GF22814 [Drosophila ananassae]
 gi|190617537|gb|EDV33061.1| GF22814 [Drosophila ananassae]
          Length = 304

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 147/346 (42%), Positives = 190/346 (54%), Gaps = 64/346 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CEQ+F                                      
Sbjct: 1   MNTIRPCGCKGVRTCLSCEQDF-------------------------------------- 22

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQ-SHPNHSGKS-FDIG 118
                  I    ++E F   +  E + YC HC     G D    Q SH NH  ++   + 
Sbjct: 23  ------QIAKPSLQEQF---RQLEAWSYCVHCELFQPGWDTKEVQESHSNHDQENGLPLP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ ++E F+S  E   ++  LDDLPWD SQSGRRKQNFGPK NFKK+K+++G+F GFP+ 
Sbjct: 74  GILVQEKFISAEEGAQLINDLDDLPWDISQSGRRKQNFGPKTNFKKRKLQVGNFAGFPST 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF    +L  FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFEDVPLLRGFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193

Query: 239 TLTYYN-GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y    + KYNL+     YQK+           +    K             V+RIP
Sbjct: 194 TLTPYELQQAGKYNLDL-VEEYQKELLAPLLEEEQLVRYKGK-------------VLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNE 343
           MP  SL+VL+G ARY+++H VLR DV  RRVCIAYREFT  Y+N E
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVEERRVCIAYREFTPMYINGE 285


>gi|195147010|ref|XP_002014473.1| GL18936 [Drosophila persimilis]
 gi|194106426|gb|EDW28469.1| GL18936 [Drosophila persimilis]
          Length = 304

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 149/362 (41%), Positives = 194/362 (53%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CE                                         
Sbjct: 1   MNTIRPCGCKGVRTCLSCE----------------------------------------- 19

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNH-SGKSFDIG 118
              K F I    ++E F   Q  E + YC  C + + G D  A Q+ H  H   K   + 
Sbjct: 20  ---KDFQIAKPSLQEQF---QQLEPWSYCVQCERLYSGWDTTAVQADHQQHDKAKGLLMP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           GV ++E FLS+ E   +M +LD L WD SQSGRRKQNFGPK NFKK+K+++G F GFP  
Sbjct: 74  GVLVQEQFLSKDEGSQLMTELDALRWDISQSGRRKQNFGPKTNFKKRKLQVGTFAGFPKS 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF +  +L +FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFDEVPLLRNFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193

Query: 239 TLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y    + KYNL+                   +  L E              V+RIP
Sbjct: 194 TLTVYEEQQAGKYNLD--------------LVAAYENELLEPLLPEERLSAYKGKVLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM---NNEATASVFEKAKC 354
           MP  SL+VL+G ARY+++H VLR DV+ RRVCIAYREFT  Y+   + E    V EK++ 
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVVERRVCIAYREFTPMYIAGADQEKGDPVREKSQI 299

Query: 355 FF 356
           F+
Sbjct: 300 FW 301


>gi|125985173|ref|XP_001356350.1| GA25863 [Drosophila pseudoobscura pseudoobscura]
 gi|54644673|gb|EAL33413.1| GA25863 [Drosophila pseudoobscura pseudoobscura]
          Length = 304

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 148/362 (40%), Positives = 192/362 (53%), Gaps = 67/362 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CE                                         
Sbjct: 1   MNTIRPCGCKGVRTCLSCE----------------------------------------- 19

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNH--SGKSFDIG 118
              K F I    ++E F   Q  E + YC  C + + G D  A Q+   H    K   + 
Sbjct: 20  ---KDFQIAKPSLQEQF---QQLEPWSYCVQCERLYSGWDTTAVQADHQHHDKAKGLPMP 73

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           GV ++E FLS+ E   +M  LD L WD SQSGRRKQNFGPK NFKK+K+++G F GFP  
Sbjct: 74  GVLVQEQFLSKDEGSQLMADLDALRWDISQSGRRKQNFGPKTNFKKRKLQVGTFAGFPNS 133

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           T+++Q RF +  +L +FQTIEQC+LEY    G+SID H+DDCWIWGER+VTVN +GD+VL
Sbjct: 134 TEYVQRRFDEVPLLRNFQTIEQCSLEYEPSKGASIDPHVDDCWIWGERVVTVNCLGDAVL 193

Query: 239 TLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           TLT Y    + KYNL+                   +  L E              V+RIP
Sbjct: 194 TLTVYEEQQAGKYNLD--------------LVAAYENELLEPLLPEERLSAYKGKVLRIP 239

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM---NNEATASVFEKAKC 354
           MP  SL+VL+G ARY+++H VLR DV+ RRVCIAYREFT  Y+   + E    V EK++ 
Sbjct: 240 MPNLSLIVLYGPARYQFEHSVLREDVVERRVCIAYREFTPMYIAGADQEKGDPVREKSQI 299

Query: 355 FF 356
           F+
Sbjct: 300 FW 301


>gi|195438547|ref|XP_002067198.1| GK24867 [Drosophila willistoni]
 gi|194163283|gb|EDW78184.1| GK24867 [Drosophila willistoni]
          Length = 307

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 147/364 (40%), Positives = 194/364 (53%), Gaps = 68/364 (18%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKGVRTC+ CE++F               I  P   K +Q            
Sbjct: 1   MNTIRPCGCKGVRTCLTCEEDF--------------QIAKPSLCKQFQ------------ 34

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDL----YAYQSHPNHSGKSFD 116
                               Q  E + YC  C + + G D       +++H +   K   
Sbjct: 35  --------------------QELEPWTYCIKCQRFYAGWDTDDTDREHENHQHAPDKGLH 74

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
           + G+ ++E FLS  E + +M  LDD+PW  SQSGRRKQN+GPK NFKK+K+++G F GFP
Sbjct: 75  LPGLLVQEQFLSHDEGKTLMAHLDDMPWTISQSGRRKQNYGPKTNFKKRKLQVGTFAGFP 134

Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
             T+++Q RF    +L  FQTIEQC+LEY+   G+SID H+DDCWIWGER+VTVN +GD+
Sbjct: 135 QGTEYVQQRFEDVPLLRDFQTIEQCSLEYDPTKGASIDPHVDDCWIWGERVVTVNCLGDA 194

Query: 237 VLTLTYYNG-DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIR 295
           VLTL  Y+     KYNL            D      N+L     S D + N      V+R
Sbjct: 195 VLTLNVYDKQQPGKYNL------------DLVPEYENELLAPMLSDDQLAN--FKGKVLR 240

Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM---NNEATASVFEKA 352
           IPMP  SL+VL+G  RY+++H VLR DV  RRVCIAYREFT  Y+   + E    V E A
Sbjct: 241 IPMPNLSLIVLYGPVRYQFEHSVLREDVTERRVCIAYREFTPMYIHGKDREKGDGVREMA 300

Query: 353 KCFF 356
           K F+
Sbjct: 301 KKFW 304


>gi|242018131|ref|XP_002429534.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212514482|gb|EEB16796.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 279

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 109/249 (43%), Positives = 168/249 (67%), Gaps = 4/249 (1%)

Query: 92  CNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGR 151
           C  +W+       ++ PNH GKS D+ G+++  DFL+  E + ++  ++   WD SQSGR
Sbjct: 15  CLISWENEYDLIEKNCPNHRGKSLDVNGIFVLPDFLTAEEKDFILDGINKEKWDSSQSGR 74

Query: 152 RKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGS 211
           RKQN+GPKCNFKK KI++G+F+GFP+ +K I ++F    +L  F+ IEQCTLEY+   G+
Sbjct: 75  RKQNYGPKCNFKKNKIQVGNFKGFPSYSKIIHEKFKTLPILKDFRVIEQCTLEYDPTRGA 134

Query: 212 SIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271
           SID H+DDCWIWGERIVTVN++G++ L+++ +   +NK+NLN    SY     +    I 
Sbjct: 135 SIDPHVDDCWIWGERIVTVNILGETYLSMSRHK-QNNKFNLNK-VESYPCVLNNNGEFIY 192

Query: 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIA 331
           N+     + T ++  +   +V ++I MP RS+++L+G ARYEW+H + R+D+   R+C+A
Sbjct: 193 NNNLRPFEFTHNVPQD--ENVELKILMPERSIIILYGSARYEWEHYIPRNDIKNLRICLA 250

Query: 332 YREFTAPYM 340
           +REFT PY+
Sbjct: 251 FREFTPPYL 259


>gi|380015391|ref|XP_003691686.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like, partial [Apis florea]
          Length = 194

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 107/208 (51%), Positives = 146/208 (70%), Gaps = 18/208 (8%)

Query: 152 RKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGS 211
           RKQNFGPKCNFKK+K++LG F GFP  T+F+Q +F +  +L++FQT+EQCTLEY+   G+
Sbjct: 1   RKQNFGPKCNFKKRKLQLGSFDGFPKSTQFVQQKFSEIPILNNFQTVEQCTLEYDPIRGA 60

Query: 212 SIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271
           SID HIDDCWIWGERIVTVN++GDSVLT++ Y+G   KYNL S           + S+ +
Sbjct: 61  SIDPHIDDCWIWGERIVTVNVMGDSVLTMSSYHGPDTKYNLKSII---------EYSSTI 111

Query: 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIA 331
             L  + K+ +   NE    +++R+PMP  SL+VL+G ARY+W+HCVLR D+  RR+C+A
Sbjct: 112 EGLKFDIKNKNE--NE---DIIVRLPMPEGSLMVLYGIARYKWEHCVLREDINSRRICLA 166

Query: 332 YREFTAPYM----NNEATASVFEKAKCF 355
           YRE T PY+    NNE    + ++A  F
Sbjct: 167 YRELTPPYLFNSTNNEVVKELLKRASVF 194


>gi|225718066|gb|ACO14879.1| Alkylated DNA repair protein alkB homolog 4 [Caligus clemensi]
          Length = 324

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 133/371 (35%), Positives = 189/371 (50%), Gaps = 62/371 (16%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+ +  CGCK  RTC KCE     ++      T                           
Sbjct: 1   MNTNRPCGCKSFRTCYKCEASISPDSNPYASLTR-------------------------- 34

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHSGKSFDIGG 119
                             + ++ ++ V+C  C   + G +L A  S HP    +SF   G
Sbjct: 35  ------------------SLRSSQSGVFCVECESIYPGWELAATCSDHPPPILQSF--SG 74

Query: 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACT 179
           + I  +F++  E ED++K LD LPWD S SGRRKQNFGP+  F K+K K G F GFPACT
Sbjct: 75  IQIIPNFINHQEQEDLLKGLDSLPWDLSVSGRRKQNFGPRAKFNKRKAKNGPFSGFPACT 134

Query: 180 KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
           +F+Q RF + D L  +QT+EQC++EY    G+ I+ H+DDCWIWGERIV ++++  SVLT
Sbjct: 135 EFVQKRFEEVDSLEGYQTVEQCSIEYQKNTGAWIEPHVDDCWIWGERIVQLHVLSSSVLT 194

Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKS--------TDSIVNETLSS 291
           LT     ++KYNL       +    D +  + N       S        ++ + +     
Sbjct: 195 LTKCR-KAHKYNLKDVSLYPKVLSTDGQRVLYNPFKSVSSSSPSEPFEISEEVADGRTPL 253

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-RRVCIAYREFTAPYMN-----NEAT 345
            VIRIP+P +SLLV+ G  RY+W+H VLR D+   RRV IAYREFT P++       +  
Sbjct: 254 DVIRIPLPEQSLLVMSGGPRYQWEHSVLREDIHDERRVIIAYREFTPPFLPGGDKFGDVG 313

Query: 346 ASVFEKAKCFF 356
            +V E A+ F+
Sbjct: 314 RAVIENARSFW 324


>gi|351698749|gb|EHB01668.1| Alkylated DNA repair protein alkB-like protein 4 [Heterocephalus
           glaber]
          Length = 351

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 110/339 (32%), Positives = 175/339 (51%), Gaps = 35/339 (10%)

Query: 7   CGCKGVRTCIKCEQEFGYENK-NIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
           CGCKG+RTC+ CE++   +    +       ++YC     A    D        +  G +
Sbjct: 17  CGCKGIRTCLICERQRQVDPPWQLSPKKTHRFLYCVDTGWAVGAED-------SDFEGWA 69

Query: 66  FDIGGVYIKEDFLTSQTK-----ETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
           F   GV + EDF+T + +       ++YC     A    D        +  G +F   GV
Sbjct: 70  FPFPGVTLIEDFVTREEEAEMKTHRFLYCVDTGWAVGAED-------SDFEGWAFPFPGV 122

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
            + EDF++  E  ++++ +D  PW  SQSGRRKQ++GP+ NF+KQK+K   F G P+ ++
Sbjct: 123 TLIEDFVTREEEAEMVRLMDCDPWKLSQSGRRKQDYGPRVNFRKQKLKTAGFLGLPSFSR 182

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
            +  R      L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V+++
Sbjct: 183 EVVQRMGLYPSLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVMSM 242

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           +   G  N    ++P  S +                 + +T    +     V + IP+P 
Sbjct: 243 S-QKGPGNLLLSSAPPASPKDSV--------------DSATAPCRSVPCQDVEVAIPVPR 287

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           RSLLVL G AR++W H + R  +  RRVC  +RE +A +
Sbjct: 288 RSLLVLAGAARHQWTHGIHRRHIKARRVCATFRELSAEF 326


>gi|291242331|ref|XP_002741061.1| PREDICTED: alkB, alkylation repair homolog 4-like [Saccoglossus
           kowalevskii]
          Length = 270

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 173/359 (48%), Gaps = 99/359 (27%)

Query: 1   MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPN 60
           M+    CGCKG+RTC+ CE+                                    S  +
Sbjct: 1   MNTVKLCGCKGIRTCLVCEK------------------------------------SKID 24

Query: 61  HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGV 120
            S +S    G + K D        +Y +C  CN AW  S+    + HP+H GK     GV
Sbjct: 25  LSCRS----GRFEKPD-------ASYSFCWLCNIAWLESN---TEQHPSHQGKFIRFPGV 70

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
            + E+ +SE E E +++ +D  PW  SQSGRRKQ++GPK NFKK+K+    F G PA  +
Sbjct: 71  TLIENVVSEEEEEAIIQAVDATPWKVSQSGRRKQDYGPKVNFKKRKVNSKCFSGLPAFIR 130

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
            + +R  Q D L+ FQ +EQC LEY  + GSSID H DD W+WGER+VT+NL  ++ LT+
Sbjct: 131 PLTERLVQMDGLADFQVVEQCNLEYVPDRGSSIDPHFDDVWLWGERLVTLNLNSETTLTM 190

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           T    D                                             + + IP+P 
Sbjct: 191 TQKEKD---------------------------------------------ICVSIPLPK 205

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN---NEATAS-VFEKAKCF 355
           RS++VL+G ARYEW H + R D+I RR+ + +RE +A +++   NE T S + E A+ +
Sbjct: 206 RSVIVLYGPARYEWMHAIHREDIINRRIAVTFRELSAEFLDGGVNEDTGSRLLEIARTY 264


>gi|68372246|ref|XP_687075.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Danio rerio]
          Length = 315

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 109/334 (32%), Positives = 164/334 (49%), Gaps = 95/334 (28%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC++CE +   E K+++                 Q +DL  Y           
Sbjct: 53  CGCKGIRTCLRCETD---ETKHLL-----------------QKNDLIHYD---------- 82

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                +I +  L S  +E     P C                      F+  GV + E+F
Sbjct: 83  -----FIYDPVLKSAVREEEGSTPQC----------------------FEFPGVLLWENF 115

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +SE E  +++ ++D   W +SQSGRRKQ+FGPK NFKK+++ +G F G PA ++ +  R 
Sbjct: 116 VSEDEERELVSRMDQDVWRESQSGRRKQDFGPKVNFKKRRVHVGSFSGLPAISRRLLVRM 175

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                LS F+ +EQC L+Y+S  GS+ID H+DD W+WGE +VTVNL+ D+VLTL+   G 
Sbjct: 176 SDLPQLSSFKPVEQCNLDYDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLDQG- 234

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
                                     D+   E             V + + +P RSL+VL
Sbjct: 235 ------------------------WGDMEQGE-------------VRVAVRLPRRSLVVL 257

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
           +G+AR+ W+H + R D+ GRRVC  +RE +A ++
Sbjct: 258 YGDARHRWKHAIHRKDIHGRRVCSTFRELSAEFL 291


>gi|344289767|ref|XP_003416612.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Loxodonta africana]
          Length = 358

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 106/333 (31%), Positives = 165/333 (49%), Gaps = 70/333 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    N +WQ                  
Sbjct: 71  CGCKGIRTCLICERQRGG-------------------NLSWQ------------------ 93

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                      L  Q  + + YC H      G  + A +S  +  G +F   GV + EDF
Sbjct: 94  -----------LPPQNPDRFTYCSHT-----GWAVGAEES--DFEGWAFPFPGVTLIEDF 135

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+K+K+K   F+G P+ ++ +  R 
Sbjct: 136 VTRDEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKRKLKAAGFRGLPSFSREVVRRM 195

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++++    
Sbjct: 196 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDTWLWGERLVSLNLLSPTVLSMSHEAPG 255

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S      SP              +V  +    +S           V + +P+P RSLLVL
Sbjct: 256 SLLLCPASP---------GSLEPMVEGVVAFSRSV------LCQDVEVAVPVPRRSLLVL 300

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++WQH + R  +  RRVC+ +RE +A +
Sbjct: 301 SGAARHQWQHAIHRKHITARRVCVTFRELSAEF 333


>gi|405951338|gb|EKC19259.1| Alkylated DNA repair protein alkB-like protein 4 [Crassostrea
           gigas]
          Length = 270

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 105/333 (31%), Positives = 156/333 (46%), Gaps = 97/333 (29%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+R C+ CE+         VE   ET   C                          
Sbjct: 4   CGCKGIRNCLVCEK---------VERQNETEKKC-------------------------- 28

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                          +++ + +C  CN A++G        HPNH G + +  G++I+ DF
Sbjct: 29  ---------------SRKIHSFCDLCNMAYEGG---TADDHPNHHGNTLEFPGIFIQRDF 70

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +S  E  D+++ +   P+ +SQSGRRKQ++GPK NFK++KIK   F G P  ++ + DR 
Sbjct: 71  ISPEEESDILEHIYKSPFVESQSGRRKQDYGPKVNFKRKKIKSDVFTGLPDFSRVLYDRM 130

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
              D+L  F+ +EQC LEY  E GSSID H DD W+WGER+VT+NL  D++L +T     
Sbjct: 131 KSTDILKDFEPVEQCNLEYLPERGSSIDPHFDDFWLWGERLVTLNLGSDTILCMT----- 185

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
                           YPD                          + + IP+   SL+V+
Sbjct: 186 -------------SSDYPD--------------------------ISVHIPLFRNSLIVV 206

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           +G AR+ W H + R D+  +R+ I YRE +A +
Sbjct: 207 YGPARHAWMHGIHRCDITEKRLAITYRELSAEF 239


>gi|432901333|ref|XP_004076835.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Oryzias latipes]
          Length = 272

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 88/235 (37%), Positives = 137/235 (58%), Gaps = 37/235 (15%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           SF   G+++ +DF+SE E + ++  +D   W+ SQSGRRKQ+FGPK NFKK+K+++G F 
Sbjct: 59  SFSFPGIFLWDDFISEEEEKQLINSMDQDAWNPSQSGRRKQDFGPKVNFKKRKVRVGGFS 118

Query: 174 GFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLI 233
           G P+ +K +  R  QE  LS FQ +EQC L+YN + GS+ID H DD W+WGER+VT+N++
Sbjct: 119 GLPSLSKELVLRMQQEPCLSGFQPVEQCNLDYNPQRGSAIDPHFDDSWLWGERLVTINML 178

Query: 234 GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV 293
            D+ LT++   G                         + +L L+E+            + 
Sbjct: 179 SDTTLTMSLEQG-------------------------LPELGLSEE------------IH 201

Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASV 348
           + + +P R L+V++GEAR+ W+H + R D+  RRVC  YRE +A ++     A +
Sbjct: 202 VAVRLPRRCLVVVYGEARHRWKHAIRRQDIHKRRVCSTYRELSAEFLPGGQQAEL 256


>gi|260803607|ref|XP_002596681.1| hypothetical protein BRAFLDRAFT_219277 [Branchiostoma floridae]
 gi|229281940|gb|EEN52693.1| hypothetical protein BRAFLDRAFT_219277 [Branchiostoma floridae]
          Length = 303

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 115/338 (34%), Positives = 165/338 (48%), Gaps = 67/338 (19%)

Query: 2   DRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNH 61
           D +  CGCKG+RTC+ CE              E T +  P      QG  +         
Sbjct: 7   DFTQICGCKGIRTCLLCE-------------GERTTVKAP------QGHAV--------- 38

Query: 62  SGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVY 121
                                  TY+YC  C+KAW+  +    Q+      +  D  GV 
Sbjct: 39  -----------------------TYMYCNQCHKAWRHMEGSRCQT------EEMDFPGVL 69

Query: 122 IKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKF 181
           + EDF++  E  D++  ++   W  SQSGR+KQ+FGPK NFKK+K+KLG F G P  +  
Sbjct: 70  VVEDFITPEEEADIVTVIEGTEWKVSQSGRKKQDFGPKVNFKKKKLKLGGFTGLPKFSAP 129

Query: 182 IQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLT 241
           + DR    D+L  F  +EQC L+Y+   GS+ID H DD W+WGER+VTVNL+ DSVLT++
Sbjct: 130 LLDRMKSHDLLKDFIPVEQCHLDYDPSRGSAIDPHFDDFWLWGERLVTVNLLADSVLTMS 189

Query: 242 YYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPAR 301
               D  +  +         K P  K++ ++   L     D +           IPMP R
Sbjct: 190 CEEKDEVEVTIQQTSNCEHSKKPVTKTSSLDSDILVPSDEDKVA----------IPMPRR 239

Query: 302 SLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           SLLV+ G AR++W H + R D+  RR+ +  RE +A +
Sbjct: 240 SLLVVHGNARHKWMHAIHREDITSRRIAVTLRELSAEF 277


>gi|449265929|gb|EMC77056.1| Alkylated DNA repair protein alkB like protein 4, partial [Columba
           livia]
          Length = 280

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 96/266 (36%), Positives = 153/266 (57%), Gaps = 18/266 (6%)

Query: 81  QTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLD 140
           Q ++ + YCP        + L     H   +G +F   GV++ E+F+S+ E  ++++ +D
Sbjct: 1   QGEDNFTYCP-------ATGLAKGNEHSEFAGWAFPFPGVFLTEEFISKDEESEIVELMD 53

Query: 141 DLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQ 200
              W  SQSGR+KQ++GPK NFKKQ++K G F G P+ +K I  +     VL  F  +EQ
Sbjct: 54  RDDWKPSQSGRKKQDYGPKVNFKKQRLKAGSFTGLPSFSKKIVAQMKACSVLGGFLPVEQ 113

Query: 201 CTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQ 260
           C L+Y  E GS+ID H DD W+WGER+V++NL+  +VL+++  + DS +     P FS +
Sbjct: 114 CNLDYMPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDSIQL---FPTFSKE 170

Query: 261 KKYPDQKSTIVNDLSLNEKSTDSIVNETLSS-------VVIRIPMPARSLLVLWGEARYE 313
                   ++    S  + S +  VN  LS        V + I +P RSL+VL+G+ARY+
Sbjct: 171 NGELSPPGSLTQT-SACKNSGEETVNCVLSPRLVPSKEVTVAIHLPRRSLVVLYGDARYK 229

Query: 314 WQHCVLRSDVIGRRVCIAYREFTAPY 339
           W+H + R  +  RR+C+ +RE +A +
Sbjct: 230 WKHAIYRKHIEHRRICVTFRELSAEF 255


>gi|348532550|ref|XP_003453769.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Oreochromis niloticus]
          Length = 274

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 88/235 (37%), Positives = 137/235 (58%), Gaps = 37/235 (15%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           SF   GV++ E+F+SE E ++++  +D   W +SQSGRRKQ+FGPK NFKK+K+++G F 
Sbjct: 61  SFPFSGVFLWENFISEEEEKELICAMDQDVWKESQSGRRKQDFGPKVNFKKRKVRVGGFS 120

Query: 174 GFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLI 233
           G PA +  +  R +QE  L+ FQ +EQC L+Y+ E G++ID H+DD W+WGER+VT+N++
Sbjct: 121 GLPALSHKLVLRMYQESALAGFQPVEQCNLDYHPERGAAIDPHLDDSWLWGERLVTINML 180

Query: 234 GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV 293
            D+ LT++   G                         + +L L               V 
Sbjct: 181 SDTTLTMSLEQG-------------------------LPELGLT------------GEVH 203

Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASV 348
           + + +P R L+VL+GEAR++W+H + R D+  RRVC  YRE +A ++     A +
Sbjct: 204 VAVNLPRRCLVVLYGEARHKWKHAIHRKDIHVRRVCSTYRELSAEFLPGGQQADL 258


>gi|326931240|ref|XP_003211741.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Meleagris gallopavo]
          Length = 289

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 94/263 (35%), Positives = 153/263 (58%), Gaps = 16/263 (6%)

Query: 83  KETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDL 142
           ++ + YCP        + L     H + +G +F   GV++ E+F+S+ E   V++ +D  
Sbjct: 12  EDNFTYCP-------ATGLAEGNEHSSFAGWAFPFPGVFLAEEFISKDEEVGVVELMDRD 64

Query: 143 PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCT 202
            W  SQSGR+KQ++GPK NFKKQ++K G F G P+ +K I  +     VL  F  +EQC 
Sbjct: 65  EWKPSQSGRKKQDYGPKVNFKKQRLKAGGFTGLPSFSKKIVAQMEACSVLGGFMPVEQCN 124

Query: 203 LEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKK 262
           L+Y  E GS+ID H DD W+WGER+V++NL+  +VL+++  + DS +     P FS +  
Sbjct: 125 LDYRPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDSIQL---FPTFSNENG 181

Query: 263 YPDQKSTIVNDLSL---NEKSTDSIVNETL---SSVVIRIPMPARSLLVLWGEARYEWQH 316
             +   ++    S     ++ +  +++  L     V + I +P RSL+VL+GEARY+W+H
Sbjct: 182 ELNPPGSLTQTSSCENSGKQGSSCLLSPRLVPSKEVTVAIHLPRRSLVVLYGEARYKWKH 241

Query: 317 CVLRSDVIGRRVCIAYREFTAPY 339
            + R  +  RRVC+ +RE +A +
Sbjct: 242 AIHRRHIEHRRVCVTFRELSAEF 264


>gi|63102529|gb|AAH95774.1| LOC553475 protein, partial [Danio rerio]
          Length = 276

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 163/334 (48%), Gaps = 94/334 (28%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC++CE           + T++  +         Q ++L  Y           
Sbjct: 13  CGCKGIRTCLRCE----------TDGTKQHLL---------QNNELIHYD---------- 43

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                +I +  L S  +E     P C                      F+  GV + E+F
Sbjct: 44  -----FIYDPVLKSAVREEEGSTPQC----------------------FEFPGVLLWENF 76

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +SE E  +++ ++D   W +SQSGRRKQ+FGPK NFKK+++ +G F G PA ++ +  R 
Sbjct: 77  VSEDEERELVSRMDQDVWRESQSGRRKQDFGPKVNFKKRRVHVGSFSGLPAISRRLLVRM 136

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                LS F+ +EQC L+Y+S  GS+ID H+DD W+WGE +VTVNL+ D+VLTL+   G 
Sbjct: 137 SDLPQLSSFKPVEQCNLDYDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLDQG- 195

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
                                     D+   E             V + + +P RSL+VL
Sbjct: 196 ------------------------WGDMEQGE-------------VRVAVRLPRRSLVVL 218

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
           +G+AR+ W+H + R D+ GRRVC  +RE +A ++
Sbjct: 219 YGDARHRWKHAIHRKDIHGRRVCSTFRELSAEFL 252


>gi|147900037|ref|NP_001084791.1| alkB, alkylation repair homolog 4 [Xenopus laevis]
 gi|47124930|gb|AAH70778.1| MGC83815 protein [Xenopus laevis]
          Length = 311

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 145/230 (63%), Gaps = 9/230 (3%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
           F   GV++ +DF++E E + +++ +D   W  SQSGRRKQ+FGPK NFKKQK+++G+F G
Sbjct: 61  FHFPGVHLIKDFVTEDEEQRLVQFMDQGEWRHSQSGRRKQDFGPKVNFKKQKLRVGNFSG 120

Query: 175 FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIG 234
            P  ++ + +R  Q  VL  F  +EQC L+Y+SE GS+ID H DD W+WGER+V++NL+ 
Sbjct: 121 LPNISQVLWERMKQHAVLEGFLPVEQCNLDYHSERGSAIDPHFDDSWLWGERLVSLNLLS 180

Query: 235 DSVLTLTYYNGDSNKYNLNSPCFSYQKKY----PDQKSTIVNDLSLNEKSTDSIVNET-L 289
           D+VLT+T  + D +   L  P      +      D  ++ VN +  N  S +S   +   
Sbjct: 181 DTVLTMT--SDDVSSLQL-MPVLQKGAELNMTGSDNNASDVN-IGQNPVSNNSQAKQVPC 236

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           S+V + I +P RSL+VL+G+ARY+W+H + R  +  RRVC  +RE +  +
Sbjct: 237 SNVEVAICLPRRSLVVLYGDARYKWKHAIHREHIKHRRVCSTFRELSVEF 286


>gi|301626260|ref|XP_002942310.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Xenopus (Silurana) tropicalis]
          Length = 308

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 93/230 (40%), Positives = 137/230 (59%), Gaps = 8/230 (3%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
           F   GVY+ +DF++E E + ++  +D   W  SQSGRRKQ+FGPK NFKKQK+++G+F G
Sbjct: 57  FHFPGVYLIQDFVTEDEEQHLVNFMDQEEWRLSQSGRRKQDFGPKVNFKKQKLRVGNFSG 116

Query: 175 FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIG 234
            P  ++ I +R  Q  VL  F  +EQC L+Y+SE GSSID H DD W+WGER+V++NL+ 
Sbjct: 117 LPNVSQVIWERMKQHTVLEGFLPVEQCNLDYHSERGSSIDPHFDDSWLWGERLVSLNLLS 176

Query: 235 DSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEK-----STDSIVNETL 289
           D+V T+T  + D +   L  P      +     S   N   +N       S     +   
Sbjct: 177 DTVFTMT--SDDVSSLQL-MPVLQEGSELNSTSSDGNNASDMNHTQNPGYSKSQAKHVPY 233

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
             V + I +P RSL+VL+G+ARY+W+H + R ++  RRVC  +RE +  +
Sbjct: 234 CDVEVAIRLPRRSLVVLYGDARYKWKHAIHRENIEHRRVCSTFRELSKEF 283


>gi|115495653|ref|NP_001069021.1| probable alpha-ketoglutarate-dependent dioxygenase ABH4 [Bos
           taurus]
 gi|109939932|gb|AAI18458.1| AlkB, alkylation repair homolog 4 (E. coli) [Bos taurus]
          Length = 303

 Score =  181 bits (458), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 104/333 (31%), Positives = 165/333 (49%), Gaps = 70/333 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 15  CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                        SQ    ++Y P    A     + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------SQKTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 80  VTPEEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKRAGFRGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 140 GLYPILEDFRPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F                 +L + +  S  +     V + +P+P RSL+VL
Sbjct: 200 SLLLCLAPSGFPE---------------ALADSAMASSRSVPCQEVEVAVPLPRRSLVVL 244

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++W+H + R  +  RRV + +RE +A +
Sbjct: 245 RGGARHQWKHAIHRRHIGARRVGVTFRELSAEF 277


>gi|296472939|tpg|DAA15054.1| TPA: alkB, alkylation repair homolog 4 [Bos taurus]
          Length = 303

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 166/337 (49%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 15  CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                        SQ    ++Y P    A     + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------SQRTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 80  VTPEEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKRAGFRGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 140 GLYPILEDFRPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMS----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P      +      E   DS +  + S     V + +P+P RS
Sbjct: 195 --------------REAPGSLLLCLAPSGFPEALADSAMAPSRSVPCQEVEVAVPLPRRS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           L+VL G AR++W+H + R  +  RRV + +RE +A +
Sbjct: 241 LVVLRGGARHQWKHAIHRRHIGARRVGVTFRELSAEF 277


>gi|332867922|ref|XP_003318746.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Pan troglodytes]
 gi|410206532|gb|JAA00485.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
          Length = 302

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 164/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   N  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SNTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|149755992|ref|XP_001504522.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Equus caballus]
          Length = 302

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 104/333 (31%), Positives = 163/333 (48%), Gaps = 70/333 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWQ---------HP------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                         Q    ++Y  + +  W    + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTVIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 80  VTREEEAEMVQLMDREPWKPSQSGRRKQDYGPKVNFRKQKLKTAGFRGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 140 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F            +V  +    +S           V + +P+P RSLLVL
Sbjct: 200 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAVPLPCRSLLVL 244

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++W+H + R  +  RRVC  +RE +A +
Sbjct: 245 TGAARHQWKHAIHRRHIQARRVCATFRELSAEF 277


>gi|397465108|ref|XP_003804369.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Pan paniscus]
          Length = 302

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW +SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKQSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|410984598|ref|XP_003998614.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Felis catus]
          Length = 308

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 105/333 (31%), Positives = 163/333 (48%), Gaps = 70/333 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 21  CGCKGIRTCLICERQRGG-------------------DPPWQ---------HP------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                         Q    ++Y  + +  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 85

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 86  VTREEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAGFRGLPSFSREVVRRM 145

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 146 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 205

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F            +V  +    +S           V + IP+P RSLLVL
Sbjct: 206 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAIPLPRRSLLVL 250

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++W+H + R  +  RRVC  +RE +A +
Sbjct: 251 TGAARHQWKHAIHRRHIEARRVCATFRELSAEF 283


>gi|302564965|ref|NP_001181358.1| probable alpha-ketoglutarate-dependent dioxygenase ABH4 [Macaca
           mulatta]
 gi|355560537|gb|EHH17223.1| hypothetical protein EGK_13569 [Macaca mulatta]
 gi|355747566|gb|EHH52063.1| hypothetical protein EGM_12431 [Macaca fascicularis]
          Length = 302

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +++ E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTQEEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 SLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|402863131|ref|XP_003895885.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Papio anubis]
          Length = 302

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +++ E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTQEEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 SLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|440908430|gb|ELR58444.1| Putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
           [Bos grunniens mutus]
          Length = 299

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/335 (31%), Positives = 164/335 (48%), Gaps = 78/335 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 11  CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 35

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                        SQ    ++Y P    A     + A +S  +  G +F   GV + EDF
Sbjct: 36  -------------SQKTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 75

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 76  VTPEEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKRAGFRGLPSFSREVVRRM 135

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 136 GLYPILEDFRPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMS----- 190

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P      +      E   DS +  + S     V + +P+P RS
Sbjct: 191 --------------REAPGSLLLCLAPSGFPEALADSAMAPSRSVPCQEVEVAVPLPRRS 236

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337
           L+VL G AR +W+H + R  +  RRV + +RE +A
Sbjct: 237 LVVLRGGARRQWKHAIHRRHIGARRVGVTFRELSA 271


>gi|332255155|ref|XP_003276698.1| PREDICTED: LOW QUALITY PROTEIN: probable
           alpha-ketoglutarate-dependent dioxygenase ABH4 [Nomascus
           leucogenys]
          Length = 311

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 165/337 (48%), Gaps = 69/337 (20%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G +     E        C  C K ++                  
Sbjct: 15  CGCKGIRTCLICERQRGGDPP--WELPPAPVDLCFFCRKTYR------------------ 54

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 55  -------------------FMYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 88

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 89  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 148

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 149 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 203

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+P RS
Sbjct: 204 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPTRS 249

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 250 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 286


>gi|297680122|ref|XP_002817855.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Pongo abelii]
          Length = 302

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 164/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              +VYC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------------FVYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRL 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|380795719|gb|AFE69735.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
           [Macaca mulatta]
 gi|380795721|gb|AFE69736.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
           [Macaca mulatta]
 gi|380795723|gb|AFE69737.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4, partial
           [Macaca mulatta]
          Length = 301

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 14  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 44

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 45  -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 78

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +++ E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 79  VTQEEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 138

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 139 SLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 193

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 194 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 239

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 240 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 276


>gi|62896937|dbj|BAD96409.1| hypothetical protein FLJ20013 variant [Homo sapiens]
          Length = 302

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 102/337 (30%), Positives = 161/337 (47%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGS-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W            +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SDTGWA-----VGTEESDFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          ++ E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAVPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|345801177|ref|XP_850090.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Canis lupus familiaris]
          Length = 302

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/333 (31%), Positives = 164/333 (49%), Gaps = 70/333 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 15  CGCKGIRTCLICERQRG-------------------GDPPWQ---------HP------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                         Q    ++Y  + +  W    + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           +++ E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 80  VTQEEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTTGFRGLPSFSRDVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 140 ALYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F            +V  +    +S           V + +P+P RSLLVL
Sbjct: 200 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAVPLPRRSLLVL 244

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++W+H + R  +  RRVC  +RE +A +
Sbjct: 245 TGAARHQWKHAIHRRHIEARRVCATFRELSAEF 277


>gi|426357351|ref|XP_004046007.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Gorilla gorilla gorilla]
          Length = 302

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 164/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+                  
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWE------------------ 37

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                      L       ++YC   + +W    + A +S  +  G +F   GV + EDF
Sbjct: 38  -----------LPPAKTSRFIYC--SDTSWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTESFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|410914972|ref|XP_003970961.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Takifugu rubripes]
          Length = 246

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 134/230 (58%), Gaps = 42/230 (18%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S  SF   GV++ E+F+SE E E ++  +D   W++SQSGRRKQ+FGPK NFKK++++LG
Sbjct: 35  SAASFPFHGVFLWENFISEEEEEHLISSIDQNLWNESQSGRRKQDFGPKVNFKKRRVRLG 94

Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
            F G P  ++ + +R  QE  LS F+ +EQC L+Y+ + GS+ID H+DD W+WGER+VT+
Sbjct: 95  GFNGLPVSSRKLLERMQQEPFLSDFRPVEQCNLDYHPQRGSAIDPHLDDSWLWGERLVTI 154

Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS 290
           N++ D+++T                                  +SL E  T  I      
Sbjct: 155 NMLSDTIIT----------------------------------MSLQEAPTGEI------ 174

Query: 291 SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
              + +P P R LLVL+ +AR++W+H V R DV  RRVC  +RE +A ++
Sbjct: 175 --QVAVPFPRRCLLVLYHDARHKWKHAVYRQDVEDRRVCSTFRELSAEFL 222


>gi|410249988|gb|JAA12961.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
 gi|410249990|gb|JAA12962.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
 gi|410290248|gb|JAA23724.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
 gi|410290250|gb|JAA23725.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
 gi|410354469|gb|JAA43838.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
 gi|410354471|gb|JAA43839.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
 gi|410354473|gb|JAA43840.1| alkB, alkylation repair homolog 4 [Pan troglodytes]
          Length = 302

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 164/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|431898192|gb|ELK06887.1| Alkylated DNA repair protein alkB like protein 4 [Pteropus alecto]
          Length = 302

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 106/333 (31%), Positives = 162/333 (48%), Gaps = 70/333 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKGVRTC+ CE++ G                    +  WQ         HP       
Sbjct: 15  CGCKGVRTCLICERQRG-------------------GDPPWQ---------HP------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                         Q    ++Y  + +  W    + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLLEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   FQG P+ ++ +  R 
Sbjct: 80  VTREEETEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTTGFQGLPSYSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 140 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDTWLWGERLVSLNLLSPTVLSMSREAPG 199

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F            +V  +    +S           V + IP+P RSL VL
Sbjct: 200 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAIPLPRRSLFVL 244

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++W+H + R  +  RRVC  +RE +A +
Sbjct: 245 SGAARHQWKHAIHRRHIEARRVCATFRELSAEF 277


>gi|8923019|ref|NP_060091.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Homo
           sapiens]
 gi|74734701|sp|Q9NXW9.1|ALKB4_HUMAN RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 4; AltName: Full=Alkylated DNA repair protein
           alkB homolog 4
 gi|7019832|dbj|BAA90888.1| unnamed protein product [Homo sapiens]
 gi|12803947|gb|AAH02820.1| AlkB, alkylation repair homolog 4 (E. coli) [Homo sapiens]
 gi|16877714|gb|AAH17096.1| AlkB, alkylation repair homolog 4 (E. coli) [Homo sapiens]
 gi|119570634|gb|EAW50249.1| alkB, alkylation repair homolog 4 (E. coli) [Homo sapiens]
 gi|312150638|gb|ADQ31831.1| alkB, alkylation repair homolog 4 (E. coli) [synthetic construct]
          Length = 302

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 102/337 (30%), Positives = 160/337 (47%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  W+      Y+          
Sbjct: 15  CGCKGIRTCLICERQRGS-------------------DPPWELPPAKTYR---------- 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W            +  G +F   GV + EDF
Sbjct: 46  -------------------FIYC--SDTGWA-----VGTEESDFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R 
Sbjct: 80  VTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMC----- 194

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ P          +  E   DS++  + S     V + IP+PARS
Sbjct: 195 --------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAIPLPARS 240

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 241 LLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 277


>gi|334324980|ref|XP_001378736.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Monodelphis domestica]
          Length = 367

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 139/230 (60%), Gaps = 9/230 (3%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           +G +F   GV + +DF+S  E  ++++ +D   W  SQSGRRKQ++GPK NF+KQK+K G
Sbjct: 121 AGWAFPFPGVAMIKDFVSADEETELVRLMDQDDWKLSQSGRRKQDYGPKVNFRKQKLKTG 180

Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
            F G P+ ++ I  R  Q  VL  F  +EQC L+Y+ E GS+ID H+DD W+WGER+V++
Sbjct: 181 GFDGLPSFSREIVHRMGQHPVLERFLPVEQCNLDYHPERGSAIDPHLDDSWLWGERLVSL 240

Query: 231 NLIGDSVLTLTYYNGDSN-KYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
           NL+  +VL+++    DSN +  L S      +  P       + +  + + T S  +   
Sbjct: 241 NLLSPTVLSMSR---DSNERLQLLSVAQQGTRNSPPN-----DPVPRDPEDTGSRRSVPC 292

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
             V + I +PARSLLVL+G ARY+W+H + R  +   R+C  +RE +A +
Sbjct: 293 DQVEVAIHLPARSLLVLFGAARYQWKHAIHRQHIESHRICATFRELSAEF 342


>gi|348568782|ref|XP_003470177.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Cavia porcellus]
          Length = 304

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 103/337 (30%), Positives = 165/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++                      +  WQ                  
Sbjct: 17  CGCKGIRTCLICERQL-------------------QVDPPWQ------------------ 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                      L+ Q    ++Y  + +  W    + A  S  N  G  F   GV + EDF
Sbjct: 40  -----------LSPQKTHRFLY--NVDTGWA---VGAEDS--NCEGWGFPFPGVTLIEDF 81

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGR+KQ++GP+ NF+KQK+K+ DFQG P+ ++ +  R 
Sbjct: 82  VTREEEAEMVRLMDLDPWKLSQSGRKKQDYGPRVNFRKQKLKVADFQGLPSFSREVVRRM 141

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++     
Sbjct: 142 GLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDTWLWGERLVSLNLLSPTVMSMS----- 196

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         ++ PD         +  + S D+++  + S     V + IP+P  S
Sbjct: 197 --------------REAPDTLLLSSAPTTGPKDSVDNLITPSWSVLCQDVEVAIPIPRCS 242

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W H + R  +  RRVC  +RE +A +
Sbjct: 243 LLVLTGAARHQWTHAIHRRHITARRVCATFRELSAEF 279


>gi|81905645|sp|Q9D8F1.1|ALKB4_MOUSE RecName: Full=Alpha-ketoglutarate-dependent dioxygenase alkB
           homolog 4; AltName: Full=Alkylated DNA repair protein
           alkB homolog 4
 gi|12842046|dbj|BAB25449.1| unnamed protein product [Mus musculus]
          Length = 300

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 107/340 (31%), Positives = 164/340 (48%), Gaps = 78/340 (22%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++                    H +  WQ                  
Sbjct: 13  CGCKGIRTCLICERQR-------------------HRDPPWQ------------------ 35

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAW----QGSDLYAYQSHPNHSGKSFDIGGVYI 122
                      +  Q K  ++YCP  +  W    +GSDL          G +F   GV +
Sbjct: 36  -----------ICLQKKCCFLYCP--DTGWAAGAEGSDL---------EGWAFPFPGVTL 73

Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
            +DF++  E  ++++ +D  PW  SQSGR+KQ++GPK NF+KQK+K+  FQG P  ++ +
Sbjct: 74  IQDFVTPEEEAEMVRLMDCDPWKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKV 133

Query: 183 QDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTY 242
             R      L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++ 
Sbjct: 134 VQRMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP 193

Query: 243 YNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARS 302
                      +P        P  +     D SL   S      E    V + I +P RS
Sbjct: 194 ----------EAPGSLLLCSAPSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRS 238

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
           LLVL G AR++W H + R  +  RRVC  +RE ++ ++  
Sbjct: 239 LLVLTGAARHQWTHAIHRRHIKARRVCATFRELSSEFLPG 278


>gi|148687373|gb|EDL19320.1| mCG1878, isoform CRA_b [Mus musculus]
          Length = 304

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 107/340 (31%), Positives = 164/340 (48%), Gaps = 78/340 (22%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++                    H +  WQ                  
Sbjct: 17  CGCKGIRTCLICERQR-------------------HRDPPWQ------------------ 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAW----QGSDLYAYQSHPNHSGKSFDIGGVYI 122
                      +  Q K  ++YCP  +  W    +GSDL          G +F   GV +
Sbjct: 40  -----------ICLQKKCCFLYCP--DTGWAAGAEGSDL---------EGWAFPFPGVTL 77

Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
            +DF++  E  ++++ +D  PW  SQSGR+KQ++GPK NF+KQK+K+  FQG P  ++ +
Sbjct: 78  IQDFVTPEEEAEMVRLMDCDPWKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKV 137

Query: 183 QDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTY 242
             R      L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++ 
Sbjct: 138 VQRMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP 197

Query: 243 YNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARS 302
                      +P        P  +     D SL   S      E    V + I +P RS
Sbjct: 198 ----------EAPGSLLLCSAPSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRS 242

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
           LLVL G AR++W H + R  +  RRVC  +RE ++ ++  
Sbjct: 243 LLVLTGAARHQWTHAIHRRHIKARRVCATFRELSSEFLPG 282


>gi|239791449|dbj|BAH72189.1| ACYPI008664 [Acyrthosiphon pisum]
          Length = 174

 Score =  177 bits (448), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 92/194 (47%), Positives = 122/194 (62%), Gaps = 26/194 (13%)

Query: 169 LGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV 228
           +G+F GFP  TKF+QDRF +  ++ ++ TIEQC+LEY  E G+SID H+DDCWIWGERI+
Sbjct: 1   MGNFNGFPLSTKFVQDRFDKIPIMKNYFTIEQCSLEYRPEMGASIDPHVDDCWIWGERII 60

Query: 229 TVNLIGDSVLTLTYYNGD-SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNE 287
           T++L  D+VLTLT +N   SN+YN+                    D   N K     VN 
Sbjct: 61  TLSLQSDTVLTLTPHNIKYSNQYNI--------------------DYIPNVKCLPIPVNR 100

Query: 288 TLSSV-VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN---- 342
               + V+RI MP RSLLVL+G+ RY W+HC+LR D+  RRVCIAYREFT PY+      
Sbjct: 101 VNYPIDVVRILMPRRSLLVLYGKPRYLWEHCILREDIHERRVCIAYREFTPPYLKGGEKY 160

Query: 343 EATASVFEKAKCFF 356
             +  + +KAKCF+
Sbjct: 161 TESCDILQKAKCFW 174


>gi|449480049|ref|XP_002192991.2| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Taeniopygia guttata]
          Length = 406

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 97/264 (36%), Positives = 153/264 (57%), Gaps = 20/264 (7%)

Query: 83  KETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDL 142
           ++ + YCP        + L     H   +G +F   GV++ E+F+SE E  ++++ +D  
Sbjct: 131 EDNFTYCP-------ATGLAKGNEHSEFAGWAFPFPGVFLVEEFISEDEECEIVELMDRD 183

Query: 143 PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCT 202
            W  SQSGR+KQ++GPK NFKKQ++K G F G P+ ++ I  +     VLS F  +EQC 
Sbjct: 184 DWKPSQSGRKKQDYGPKVNFKKQRLKAGSFTGLPSFSRKIVAQMKACAVLSGFLPVEQCN 243

Query: 203 LEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKK 262
           L+Y+ E GS+ID H DD W+WGER+V++NL+  +VL+++  + D+ +     P  S ++ 
Sbjct: 244 LDYSPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDTIQLF---PISSKEEL 300

Query: 263 YPD----QKSTIVNDLSLNEKSTDSIVNETL---SSVVIRIPMPARSLLVLWGEARYEWQ 315
            P     Q S   N     E+ T   ++  L     V + I +P RSL+VL G+ARY+W+
Sbjct: 301 SPPSPFMQTSACRNS---GEEGTQCFLSPRLVPGKEVSVAILLPQRSLVVLQGDARYKWK 357

Query: 316 HCVLRSDVIGRRVCIAYREFTAPY 339
           H + R  +  RRVCI +RE +A +
Sbjct: 358 HGIHRRHIEHRRVCITFRELSAEF 381


>gi|157786898|ref|NP_001099390.1| alkylated DNA repair protein alkB homolog 4 [Rattus norvegicus]
 gi|149062997|gb|EDM13320.1| alkB, alkylation repair homolog 4 (E. coli) (predicted), isoform
           CRA_a [Rattus norvegicus]
 gi|183986288|gb|AAI66568.1| AlkB, alkylation repair homolog 4 (E. coli) [Rattus norvegicus]
          Length = 301

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 104/336 (30%), Positives = 160/336 (47%), Gaps = 74/336 (22%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE+                       +  WQ                  
Sbjct: 14  CGCKGIRTCLICERRL-------------------QRDPPWQ------------------ 36

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAW--QGSDLYAYQSHPNHSGKSFDIGGVYIKE 124
                      ++ Q K  ++YCP    A   +GSD           G +F   GV + +
Sbjct: 37  -----------ISLQKKCCFLYCPDTGWAMGAEGSDF---------EGWAFPFPGVTLTQ 76

Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQD 184
           DF++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K+  FQG P  ++ +  
Sbjct: 77  DFVTPEEEAEMVRLMDCDPWKPSQSGRRKQDYGPKVNFRKQKLKMAGFQGLPGFSQKVVQ 136

Query: 185 RFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYN 244
           R      L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++   
Sbjct: 137 RMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDSWLWGERLVSLNLLSATVVSMSR-- 194

Query: 245 GDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLL 304
            ++    L  P  S            V   +    S     +     V + I +P RSLL
Sbjct: 195 -EAPGSLLLCPVPS------------VGPDAFEGNSVAPSRSVPCQEVEVAISVPRRSLL 241

Query: 305 VLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
           VL G AR++W H + R  +  RRVC  +RE  + ++
Sbjct: 242 VLTGAARHQWTHAIHRRHIKARRVCATFRELASEFL 277


>gi|395842978|ref|XP_003794283.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Otolemur garnettii]
          Length = 307

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 109/337 (32%), Positives = 163/337 (48%), Gaps = 78/337 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CEQ+                      +  WQ         HP   GK+ 
Sbjct: 20  CGCKGIRTCLICEQQRSS-------------------DPPWQ---------HP--LGKTH 49

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              ++YC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 50  H------------------FIYC--SDTGWA---VGAEES--DFEGWAFPFPGVMVIEDF 84

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  +++  +D  PW  SQSGRRKQ++GPK NF+KQK+K   FQG P+ ++ +  R 
Sbjct: 85  VTREEEAEMVHLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAGFQGLPSFSREVVRRM 144

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
                L  FQ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++      
Sbjct: 145 GLYPGLEGFQPVEQCNLDYCPEWGSAIDPHLDDTWLWGERLVSLNLLSPTVLSMC----- 199

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRIPMPARS 302
                         +K P          +  E   DS++  + S     V + I +P RS
Sbjct: 200 --------------RKAPGSLLLCSAPSAAPEVLVDSVIAPSRSVLCQEVEVAISLPRRS 245

Query: 303 LLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           LLVL G AR++W+H + R  +  RRVC  +RE +A +
Sbjct: 246 LLVLTGAARHQWKHAIHRRHIKARRVCATFRELSAEF 282


>gi|311251081|ref|XP_003124433.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Sus scrofa]
 gi|350581471|ref|XP_003481043.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Sus scrofa]
          Length = 302

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 160/331 (48%), Gaps = 70/331 (21%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ S                
Sbjct: 15  CGCKGIRTCLICERQRG-------------------GDPPWQHS---------------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                         Q    ++Y  + +  W    + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------PQKTHRFIY--YTDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 80  VTREEEAEMVQLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTASFRGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               VL  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL+++     
Sbjct: 140 GLYPVLEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSREAPG 199

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F                 +L E +     +     V + +P+P RSLLVL
Sbjct: 200 SLLLCLAPSGFPE---------------ALVEGAVAPSRSVLCQEVEVAVPLPRRSLLVL 244

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337
            G AR++W+H + R  +  RRV   +RE +A
Sbjct: 245 TGAARHQWKHAIHRRHIEARRVSATFRELSA 275


>gi|395536498|ref|XP_003770252.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4,
           partial [Sarcophilus harrisii]
          Length = 269

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 140/230 (60%), Gaps = 9/230 (3%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           +G +F   GV +  DF+S  E  ++++ +D   W  SQSGRRKQ++GP+ NF+KQK+K G
Sbjct: 23  AGWAFPFPGVTLIRDFVSVEEEAELVRLMDQDDWKLSQSGRRKQDYGPRVNFRKQKLKPG 82

Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
            F G P+ ++ I  R     +L  F  +EQC L+Y+ E GS+ID H+DD W+WGER+V++
Sbjct: 83  SFDGLPSFSREIVQRMGTYPILERFLPVEQCNLDYHPERGSAIDPHLDDSWLWGERLVSL 142

Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKS-TDSIVNETL 289
           NL+  +VL+++  +G+     L     + Q+     +S+  ND  + + S + S  + + 
Sbjct: 143 NLLSPTVLSMSQDSGE----RLQLLSVAQQRT----RSSGPNDPEMGDLSGSPSTRSVSC 194

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
             V + I +PARSLLVL G ARY+W+H + R  +   R+C  +RE +A +
Sbjct: 195 DQVEVAIHLPARSLLVLSGAARYQWKHAIHRQHIESHRICATFRELSAEF 244


>gi|198434600|ref|XP_002128162.1| PREDICTED: similar to alkB, alkylation repair homolog 4 [Ciona
           intestinalis]
          Length = 303

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/237 (39%), Positives = 140/237 (59%), Gaps = 15/237 (6%)

Query: 105 QSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKK 164
           Q  PN     F   GV +  DF+SE E +D++  ++   W  SQSGRRKQ+FGPK NFK+
Sbjct: 54  QLKPNGEFVGFSFPGVAVFPDFISENEEKDLVNAINSGQWKDSQSGRRKQDFGPKVNFKR 113

Query: 165 QKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWG 224
           QK+KL  F G P  +K I +R      L  F+T+EQC LEY+   GSSID H DD W+WG
Sbjct: 114 QKVKLASFTGLPKYSKCIDERIKTLKGLESFETVEQCNLEYDPNRGSSIDPHFDDEWLWG 173

Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQK--STIVNDLSLNEKSTD 282
           ER++++NL+ DS  T+T    DS +  L+        K PD +   T V  + +++   D
Sbjct: 174 ERLLSLNLLADSWFTMT--TDDSTECQLSVI------KTPDVQISRTGVKQIKMDK---D 222

Query: 283 SIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            I +  ++   ++IP+  RSL++L G+AR++W+H + R D+  +R+C   RE +  +
Sbjct: 223 DIFH--MNDFQVKIPLKRRSLVMLSGDARFKWKHSIQRQDIKSKRMCCTLRELSPEF 277


>gi|355668036|gb|AER94059.1| alkB, alkylation repair-like protein 4 [Mustela putorius furo]
          Length = 261

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 84/228 (36%), Positives = 130/228 (57%), Gaps = 15/228 (6%)

Query: 112 GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
           G +F   GV + EDF++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   
Sbjct: 24  GWAFPFPGVTVIEDFVTREEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAG 83

Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           F G P+ ++ +  R     +L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++N
Sbjct: 84  FSGLPSFSRDVVRRMGLYPILEDFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLN 143

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
           L+  +VL+++     S    L    F            +V  +    +S           
Sbjct: 144 LLSPTVLSMSREAPGSLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQE 188

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           V + +P+P RSLLVL G AR++W+H + R  +  RRVC  +RE +A +
Sbjct: 189 VEVAVPLPRRSLLVLSGAARHQWKHAIHRRHIEARRVCATFRELSAEF 236


>gi|213515534|ref|NP_001134079.1| Alkylated DNA repair protein alkB homolog 4 [Salmo salar]
 gi|209730564|gb|ACI66151.1| Alkylated DNA repair protein alkB homolog 4 [Salmo salar]
          Length = 231

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/236 (38%), Positives = 136/236 (57%), Gaps = 40/236 (16%)

Query: 109 NHSGK--SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQK 166
           N+ G+  SF   G+++  +F+SE E  +++  +D   W +SQSGRRKQ+FGPK NFKK K
Sbjct: 12  NYGGQQLSFPFPGIFLWNNFISEEEESELITDMDLNAWRESQSGRRKQDFGPKVNFKKHK 71

Query: 167 IKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGER 226
           ++LG F G P  ++ +  R  QE VL+ FQ +EQC L+Y+ + GS+ID H+DD W+WGER
Sbjct: 72  VRLGGFCGLPPISRKLVLRMSQEPVLAGFQPVEQCNLDYHPQRGSAIDPHLDDSWLWGER 131

Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
           +VTVNL+ D+ LT++   G                         + +L   E        
Sbjct: 132 LVTVNLLSDTTLTMSLEEG-------------------------LTELGQGE-------- 158

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNN 342
                V + + +P RSL+V++ EAR+ W+H + R D+  RRVC  YRE +A +M+ 
Sbjct: 159 -----VRVSVHLPCRSLVVVYSEARHRWKHAIHRHDIHERRVCSTYRELSAEFMSG 209


>gi|221106579|ref|XP_002170457.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Hydra magnipapillata]
          Length = 272

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 94/288 (32%), Positives = 141/288 (48%), Gaps = 53/288 (18%)

Query: 76  DFLTSQTKETYVYCPHCNKAWQGS--------DLYAYQSHPNHSGKSFDIGGVYIKEDFL 127
           DF+ +  K+    CP C   ++G+        ++     H  H+  +  I  V I  +F+
Sbjct: 24  DFIQNSVKDICWLCPECEVVFEGNVNEILLKDNINWCSLHLQHNISNLHIDKVIIINNFV 83

Query: 128 SEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH 187
           SE E   ++ +++  PW  SQSGRRKQ+FGPK NFK++K+K   F GFP  +KF+ ++  
Sbjct: 84  SETEESYLLTEINKDPWKMSQSGRRKQDFGPKVNFKRKKLKTAVFTGFPGYSKFVVEKMR 143

Query: 188 QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
           Q + L  F  +E C LEY+ E GSSID HIDD W+WGE++VT+NL   ++LTLT      
Sbjct: 144 QVESLKDFFPVELCNLEYSPERGSSIDPHIDDTWLWGEQLVTLNLNSATILTLT------ 197

Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
                                                   TL    I+IPM  RSL+++ 
Sbjct: 198 ---------------------------------------STLFQQEIQIPMMPRSLVIIE 218

Query: 308 GEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASVFEKAKCF 355
             ARY W H +  SD+   RV + +R  + P++ +     +   AK F
Sbjct: 219 KNARYNWMHGIKSSDITDLRVAMTFRNLSQPFVKDNIGQQILHIAKSF 266


>gi|148687372|gb|EDL19319.1| mCG1878, isoform CRA_a [Mus musculus]
          Length = 261

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 92/257 (35%), Positives = 142/257 (55%), Gaps = 26/257 (10%)

Query: 86  YVYCPHCNKAW--QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLP 143
           ++YCP    A   +GSDL          G +F   GV + +DF++  E  ++++ +D  P
Sbjct: 5   FLYCPDTGWAAGAEGSDL---------EGWAFPFPGVTLIQDFVTPEEEAEMVRLMDCDP 55

Query: 144 WDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTL 203
           W  SQSGR+KQ++GPK NF+KQK+K+  FQG P  ++ +  R      L  FQ +EQC L
Sbjct: 56  WKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKVVQRMGLYPGLEDFQPVEQCNL 115

Query: 204 EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKY 263
           +Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++            +P        
Sbjct: 116 DYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP----------EAPGSLLLCSA 165

Query: 264 PDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDV 323
           P  +     D SL   S      E    V + I +P RSLLVL G AR++W H + R  +
Sbjct: 166 PSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRSLLVLTGAARHQWTHAIHRRHI 220

Query: 324 IGRRVCIAYREFTAPYM 340
             RRVC  +RE ++ ++
Sbjct: 221 KARRVCATFRELSSEFL 237


>gi|296192301|ref|XP_002744023.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Callithrix jacchus]
          Length = 306

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 132/232 (56%), Gaps = 23/232 (9%)

Query: 112 GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
           G +F   GV + EDF++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   
Sbjct: 69  GWAFPFPGVMLIEDFVTREEEAELVRLMDCDPWKLSQSGRRKQDYGPKVNFRKQKLKTEG 128

Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           F G P+ ++ +  R      L  F+ +EQC L+Y  E GSSID H+DD W+WGER+V++N
Sbjct: 129 FCGLPSFSREVVRRMGLYPGLEGFRPVEQCNLDYCPERGSSIDPHLDDAWLWGERLVSLN 188

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS- 290
           L+  +VL++                    ++ P          +  E   DS++  + S 
Sbjct: 189 LLSPTVLSMC-------------------REAPGSLLLCSAPSAAPEVLVDSVIAPSRSV 229

Query: 291 ---SVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
               V + +P+P RSLLVL G AR++W+H + R  V  RRVC+ +RE +A +
Sbjct: 230 LCQEVEVAVPLPRRSLLVLRGAARHQWKHAIHRGHVEARRVCVTFRELSAEF 281


>gi|240995541|ref|XP_002404613.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215491615|gb|EEC01256.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 319

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 92/281 (32%), Positives = 147/281 (52%), Gaps = 23/281 (8%)

Query: 81  QTKETYVYCPHCNKAWQG-SDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQL 139
           + +  +V+CP C   W G          P  S     + G+ + E+FL+  E   +   +
Sbjct: 31  KVRHPFVFCPKCCLCWPGWLSTDTGGGKPCGSVPPLRVPGLEVHEEFLTPDEETRLAADI 90

Query: 140 DDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIE 199
           D   W  SQSGR KQ++GPK NFKK+++K   F+G PA    I+++  + + +S F+ +E
Sbjct: 91  DTANWAGSQSGRLKQDYGPKVNFKKKRVKSDGFRGLPAYYTRIEEKMRENETVSDFEAVE 150

Query: 200 QCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLT-------------YYNGD 246
            C L+Y+ + GS ID H DD W+WG R+VT NL+ D+VLTL+               +GD
Sbjct: 151 LCNLDYHPKRGSCIDPHYDDWWLWGPRLVTFNLLSDTVLTLSRPEEHWMLTDEDLIESGD 210

Query: 247 SNKYNLNSPCFSYQKKY-PDQKSTIVNDLSLNEKSTDSIVNETLSSV-VIRIPMPARSLL 304
               ++     +   ++ PD         S ++   DSI       + V+++  P RS+L
Sbjct: 211 RGADDVERLAAAAPLQHGPDCG-------SWSDAFKDSISTRVPPRLCVVQVSCPRRSML 263

Query: 305 VLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEAT 345
           VLW +ARY+W H +LR DV  RRV +  RE +  +++  A+
Sbjct: 264 VLWSDARYKWHHSILREDVKSRRVAMTVRELSEEFLDGGAS 304


>gi|149062998|gb|EDM13321.1| alkB, alkylation repair homolog 4 (E. coli) (predicted), isoform
           CRA_b [Rattus norvegicus]
          Length = 247

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 83/229 (36%), Positives = 130/229 (56%), Gaps = 15/229 (6%)

Query: 112 GKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
           G +F   GV + +DF++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K+  
Sbjct: 10  GWAFPFPGVTLTQDFVTPEEEAEMVRLMDCDPWKPSQSGRRKQDYGPKVNFRKQKLKMAG 69

Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           FQG P  ++ +  R      L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++N
Sbjct: 70  FQGLPGFSQKVVQRMGLYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDSWLWGERLVSLN 129

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
           L+  +V++++    ++    L  P  S            V   +    S     +     
Sbjct: 130 LLSATVVSMSR---EAPGSLLLCPVPS------------VGPDAFEGNSVAPSRSVPCQE 174

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
           V + I +P RSLLVL G AR++W H + R  +  RRVC  +RE  + ++
Sbjct: 175 VEVAISVPRRSLLVLTGAARHQWTHAIHRRHIKARRVCATFRELASEFL 223


>gi|10438837|dbj|BAB15358.1| unnamed protein product [Homo sapiens]
          Length = 229

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 128/223 (57%), Gaps = 23/223 (10%)

Query: 121 YIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTK 180
            + EDF++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++
Sbjct: 1   MLIEDFVTREEEAELVRLMDRDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSR 60

Query: 181 FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
            +  R      L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++
Sbjct: 61  EVVRRMGLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSM 120

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS----SVVIRI 296
                               ++ P          +  E   DS++  + S     V + I
Sbjct: 121 C-------------------REAPGSLLLCSAPSAAPEALVDSVIAPSRSVLCQEVEVAI 161

Query: 297 PMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           P+PARSLLVL G AR++W+H + R  +  RRVC+ +RE +A +
Sbjct: 162 PLPARSLLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAEF 204


>gi|363741354|ref|XP_415758.3| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Gallus gallus]
          Length = 317

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/266 (34%), Positives = 158/266 (59%), Gaps = 16/266 (6%)

Query: 80  SQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQL 139
           +Q  + + YCP    A +G++      H   +G +F   GV++ E+F+SE E  ++++ +
Sbjct: 37  TQGDDNFTYCPSTGLA-EGNE------HTPFAGWAFPFPGVFLAEEFVSEEEEAEMVELM 89

Query: 140 DDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSHFQTIE 199
           D   W  SQSGR+KQ++GPK NFKKQ++K G F G P+ +K I  +     VL  F  +E
Sbjct: 90  DRDEWKPSQSGRKKQDYGPKVNFKKQRLKAGGFTGLPSFSKKIVAQMEACSVLGGFLPVE 149

Query: 200 QCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSY 259
           QC L+Y  E GS+ID H DD W+WGER+V++NL+  +VL+++  + D+ +     P F  
Sbjct: 150 QCNLDYRPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCDSEDTIQL---FPTFGN 206

Query: 260 QKKYPDQKSTIVNDLSL---NEKSTDSIVNETL---SSVVIRIPMPARSLLVLWGEARYE 313
           +    + ++++    +     ++ +  +++  L     V + I +P RSL+VL+G+ARY+
Sbjct: 207 ENGELNPRASLTQTSACENSGKQGSSCLLSPRLVPSKEVTVAIHLPRRSLVVLYGDARYK 266

Query: 314 WQHCVLRSDVIGRRVCIAYREFTAPY 339
           W+H + R  +  RRVC+ +RE +A +
Sbjct: 267 WKHAIYRRHIEHRRVCVTFRELSAEF 292


>gi|226479122|emb|CAX73056.1| Alkylated repair protein alkB homolog 4 [Schistosoma japonicum]
          Length = 305

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 104/338 (30%), Positives = 164/338 (48%), Gaps = 87/338 (25%)

Query: 7   CGCKGVRTCIKCEQE--FGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGK 64
           CGCKG+R+C  C  +     E + + EFT     +CP C+ A  G++             
Sbjct: 16  CGCKGIRSCAICRNDKFPSDETQQLFEFT-----FCPFCDMAVVGAN------------- 57

Query: 65  SFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKE 124
                      + LT                    D++A+QS        F    + I +
Sbjct: 58  -----------NILT--------------------DIHAHQS-------CFPFLNIEIVQ 79

Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQD 184
           +F+ E+E   +++++D   W  SQSGRRKQ++GPK NFK+QK++LG+F G PA +KF+  
Sbjct: 80  NFIDESEEAILIEEIDKQTWVLSQSGRRKQDYGPKVNFKRQKVQLGNFTGLPAYSKFLIT 139

Query: 185 RFHQE-----DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
           R+++       +   F  +E C LEY S+ G+ I  H DD W+WG+R+VTVNL G + LT
Sbjct: 140 RYNERIKNHTSLHPEFLPVELCNLEYKSDRGACIVPHYDDSWLWGDRLVTVNLCGSTYLT 199

Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDS-IVNETLSSVVIRIPM 298
            T    D                       +V+ +    +   S I + +  +  +R+P+
Sbjct: 200 FTLPTDD-----------------------VVDGVKHEFQRIHSCIPSVSRGTFCVRVPL 236

Query: 299 PARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
             RSL+V+ G ARY WQH + RSD+  RR+ + +RE +
Sbjct: 237 DRRSLVVVSGPARYIWQHEIRRSDIPTRRIAMTFRELS 274


>gi|443717035|gb|ELU08273.1| hypothetical protein CAPTEDRAFT_150134 [Capitella teleta]
          Length = 318

 Score =  163 bits (413), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 102/348 (29%), Positives = 159/348 (45%), Gaps = 90/348 (25%)

Query: 4   SNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSG 63
           S  C CKG+R+C+ CE++     +++                +W                
Sbjct: 22  SKPCACKGIRSCLLCERDSPSTPRSV------------SVTSSW---------------- 53

Query: 64  KSFDIGGVYIKEDFLTSQTKETYVYCPHCNKA----WQGSDLYAYQSHPNH-------SG 112
            S D   V  +++     TK+ YVYC  C +A    W  +   + +   NH       S 
Sbjct: 54  -SADTHDVKDRKE----ATKQIYVYCHRCRRAHIAPWTATP--SLKDVINHAELNCSPSD 106

Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
               + G+++ E+F+S  E +++  +++   W  SQSGRRKQ++GPK NFK++K+KL  F
Sbjct: 107 SDLPLQGIHLFENFISADEEKELCDRINCTAWVVSQSGRRKQDYGPKVNFKRKKVKLASF 166

Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNL 232
            G P+ ++    R  Q   LS F  +E C LEY+ E GS+ID H DD W+WGER++T+N+
Sbjct: 167 SGLPSYSEPFIQRMLQLPQLSDFTPVELCNLEYSRERGSAIDAHFDDFWLWGERLLTINM 226

Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
           + D+V T+T                                            NE L   
Sbjct: 227 VSDTVYTMT--------------------------------------------NEGLPRT 242

Query: 293 VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
            I I  P  + +V+ GEARYEW+H + R+ V  RR+    RE T  ++
Sbjct: 243 EILIHFPRFAFIVMMGEARYEWKHAIQRTHVAQRRMATTIRELTPEFL 290


>gi|156402493|ref|XP_001639625.1| predicted protein [Nematostella vectensis]
 gi|156226754|gb|EDO47562.1| predicted protein [Nematostella vectensis]
          Length = 267

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 156/339 (46%), Gaps = 101/339 (29%)

Query: 4   SNFCGCKGVRTCIKCEQEFGY-ENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHS 62
           S  CGC G+R+C+ C+      ++   V+  +  Y++C  C++                 
Sbjct: 2   STKCGCTGIRSCLFCKDNTKTSQSTTSVDEAKTKYLFCHLCSQ----------------- 44

Query: 63  GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYI 122
             +  +GG                  C H     +  D   Y       G S D  G+ +
Sbjct: 45  --TLPLGGT-----------------CSH-----ELGDCGRY-------GPSLD--GITL 71

Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
            EDF+S+ E   +++ +D+  W  SQSGRRKQ++GP+ NFKK+K+K+  F G PA ++F+
Sbjct: 72  IEDFVSQREEARIVQVIDETVWKPSQSGRRKQDYGPQVNFKKKKVKMSHFNGLPAFSEFL 131

Query: 183 QDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
             R + +DV  L  F  +E C LEY+   GSSID H DD W+WGER+VT+NL+  + LT+
Sbjct: 132 VRRMN-DDVPGLKDFVPVELCNLEYDEARGSSIDAHFDDFWLWGERLVTLNLLSATRLTM 190

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
           T      + Y                                           I +PMP 
Sbjct: 191 T-----KDTYE------------------------------------------ISVPMPR 203

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           RSL+++ G AR+ WQH V R D+ GRR+ I  RE +  +
Sbjct: 204 RSLIIVSGAARHLWQHAVKREDISGRRIAITLRELSEEF 242


>gi|391346930|ref|XP_003747718.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Metaseiulus occidentalis]
          Length = 313

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/303 (32%), Positives = 153/303 (50%), Gaps = 50/303 (16%)

Query: 79  TSQTKETYVYCPHCNKAW----QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETED 134
            S+    +V+CP C        Q  +L A          SF + G+ + E+ ++E   + 
Sbjct: 28  VSRIGRAFVFCPKCKLCIELDPQNGELGA---------SSFRLEGIEVAENAITEDHEKF 78

Query: 135 VMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH 194
           ++  ++  PW  SQSGR KQ+FGPK NFKK+K+K   F GFPA    +Q  F +   L+ 
Sbjct: 79  LVDAIESAPWKDSQSGRLKQDFGPKVNFKKRKVKCDSFLGFPAYMDVVQMSFSRIPGLND 138

Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL-------------T 241
           F  +E C L+Y+ E GS ID H DD W+WGER+VT+NL+G +VLT              T
Sbjct: 139 FVPVELCNLKYSPERGSCIDPHFDDSWLWGERLVTLNLLGQTVLTFSQAQDFDDLTQKQT 198

Query: 242 YYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSL----NEKSTDSI--VNETLSSVVIR 295
           +Y+G S  Y L SP              +++ L      +E+ ++++   N+     V+ 
Sbjct: 199 HYDGASAVYAL-SP------------EALMDALQAHGCGSEQGSEALGTRNKCQPMDVVH 245

Query: 296 IPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMN-----NEATASVFE 350
           + +P RSL+ +   AR EW H +LR  V   R+ +  RE +  +++     +E    + E
Sbjct: 246 VLLPRRSLVTVARAARSEWHHSILREHVTETRLAMTCRELSETFLDVNNAESEQGKILLE 305

Query: 351 KAK 353
           KA+
Sbjct: 306 KAR 308


>gi|344245743|gb|EGW01847.1| Alkylated DNA repair protein alkB-like 4 [Cricetulus griseus]
          Length = 224

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 125/220 (56%), Gaps = 25/220 (11%)

Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
            ++  E  +++  +D  PW  SQSGRRKQ++GPK NF+KQK+K+  FQG P+ ++ +  R
Sbjct: 1   MVTPEEEAEMVCMMDSDPWKLSQSGRRKQDYGPKVNFRKQKLKMAGFQGLPSFSQKVVQR 60

Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
                 L  FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++    
Sbjct: 61  MGVYPGLEDFQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSCEAP 120

Query: 246 DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD-SIVNETLS----SVVIRIPMPA 300
            S                      + + LS+     D SIV  + S     V + I +P 
Sbjct: 121 GS--------------------LLLCSALSIGPDVFDGSIVAPSRSVPCQEVEVAICVPC 160

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM 340
           RSLLVL G AR++W H + R  ++ RRVC  +RE +  ++
Sbjct: 161 RSLLVLTGAARHQWTHAIHRRHIVARRVCATFRELSNEFL 200


>gi|301780878|ref|XP_002925858.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Ailuropoda melanoleuca]
 gi|281337677|gb|EFB13261.1| hypothetical protein PANDA_015429 [Ailuropoda melanoleuca]
          Length = 294

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 156/333 (46%), Gaps = 78/333 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ         HP       
Sbjct: 15  CGCKGIRTCLICERQRGG-------------------DPPWQ---------HP------- 39

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                         Q    ++Y  + +  W    + A +S  +  G +F   GV + EDF
Sbjct: 40  -------------PQKTHRFIY--YSDTGWA---VGAEES--DFEGWAFPFPGVTLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F+G P+ ++ +  R 
Sbjct: 80  VTRDEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKTAGFRGLPSFSRDVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGD 246
               +L  F+ +EQC L+Y  E GS+ID H+         +V++NL+  +VL+++     
Sbjct: 140 GLYPILEDFRPVEQCNLDYCPERGSAIDPHLG--------LVSLNLLSPTVLSMSREAPG 191

Query: 247 SNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVL 306
           S    L    F            +V  +    +S           V + +P+P RSLLVL
Sbjct: 192 SLLLCLAPSGFP---------EALVEGVMAPSRSV------LCQEVEVAVPLPRRSLLVL 236

Query: 307 WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            G AR++W+H + R  +  RRVCI +RE +A +
Sbjct: 237 TGAARHQWKHAIHRRHIEARRVCITFRELSAEF 269


>gi|47214665|emb|CAG00901.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 917

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 66/142 (46%), Positives = 103/142 (72%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S  SF   GV++ E+F+SE E E ++  +D   W++SQSGRRKQ+FGPK NFKK+++ LG
Sbjct: 38  SAASFPFHGVFLWENFISEEEEEHLISSIDQNLWNESQSGRRKQDFGPKVNFKKRRVCLG 97

Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
            F G P  ++ + +R HQE +LS F+ +EQC L+Y+ + GS+ID H+DD W+WGER+VT+
Sbjct: 98  GFSGLPVSSRKLLERMHQEPLLSDFRPVEQCNLDYHPQRGSAIDPHLDDSWLWGERLVTI 157

Query: 231 NLIGDSVLTLTYYNGDSNKYNL 252
           N++ D+ LT++ ++G S +  +
Sbjct: 158 NMLSDTTLTMSLHDGLSGEVRV 179


>gi|110625894|ref|NP_082346.1| alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Mus
           musculus]
 gi|18490963|gb|AAH22729.1| AlkB, alkylation repair homolog 4 (E. coli) [Mus musculus]
          Length = 215

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 77/206 (37%), Positives = 118/206 (57%), Gaps = 15/206 (7%)

Query: 135 VMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH 194
           +++ +D  PW  SQSGR+KQ++GPK NF+KQK+K+  FQG P  ++ +  R      L  
Sbjct: 1   MVRLMDCDPWKLSQSGRKKQDYGPKVNFRKQKLKMAGFQGLPGFSQKVVQRMGLYPGLED 60

Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
           FQ +EQC L+Y+ E GS+ID H+DD W+WGER+V++NL+  +V++++            +
Sbjct: 61  FQPVEQCNLDYSPERGSAIDPHLDDAWLWGERLVSLNLLSATVVSMSP----------EA 110

Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
           P        P  +     D SL   S      E    V + I +P RSLLVL G AR++W
Sbjct: 111 PGSLLLCSAPSVRPDAFED-SLVAPSRSVPCQE----VEVAITVPRRSLLVLTGAARHQW 165

Query: 315 QHCVLRSDVIGRRVCIAYREFTAPYM 340
            H + R  +  RRVC  +RE ++ ++
Sbjct: 166 THAIHRRHIKARRVCATFRELSSEFL 191


>gi|326433577|gb|EGD79147.1| hypothetical protein PTSG_09879 [Salpingoeca sp. ATCC 50818]
          Length = 270

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 117/228 (51%), Gaps = 44/228 (19%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           GV++  +F++  E + ++ Q+D+ PW  SQSGR KQ+FGPK NFK++K+K+G+F G P+ 
Sbjct: 68  GVHVFREFVTADEEQALLSQMDEWPWKLSQSGRWKQDFGPKVNFKRKKVKVGNFTGLPSS 127

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
           ++ +  R      L  F+ +EQC L+Y  E G++ID H DD WIWGER+VTVNL+ ++ L
Sbjct: 128 SRDLVARMQALPCLQDFEPVEQCHLDYRPERGAAIDMHFDDDWIWGERLVTVNLLSETRL 187

Query: 239 TLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPM 298
           +  +   +  +                                            + + +
Sbjct: 188 SFEHPQHEGAQ--------------------------------------------VYVVL 203

Query: 299 PARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA 346
           PARSL+ + G AR +W+H V R  +  RR+ +  RE    ++     A
Sbjct: 204 PARSLVAVQGSARTQWKHAVHRGAITDRRIAVTLRELGPDFVAGAGRA 251


>gi|386783769|gb|AFJ24779.1| alpha ketoglutarate dependent dioxygenase ABH4 [Schmidtea
           mediterranea]
          Length = 287

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 100/342 (29%), Positives = 153/342 (44%), Gaps = 73/342 (21%)

Query: 6   FCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
            C CKG+RTC  C       N N ++   +  I C  C K                    
Sbjct: 3   LCTCKGIRTCSSC-------NPNKIKIENQNCIVCYFCPK-------------------- 35

Query: 66  FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKED 125
                       ++   KE  +           SDL   + H      + ++ G+ + E+
Sbjct: 36  ------------ISKIVKENCI-----------SDLKCEEFH----KVNIELNGIILIEN 68

Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
           FL+E +   ++  +    W  SQSGRRKQ+FGPK NFKK+KI L  FQG P   +   +R
Sbjct: 69  FLTEDDKNYLLGGICSNSWVDSQSGRRKQDFGPKVNFKKRKINLTKFQGLPEYIERFVNR 128

Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
           F     L  F  +E C LEYN   G+SID H DD W+WGER+VT+N+   + LT T   G
Sbjct: 129 FSDIPELKDFNPVELCNLEYNPTRGASIDPHFDDFWLWGERLVTINVQSSTYLTFTP--G 186

Query: 246 DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLV 305
             + ++ +S  F +     +    + N+                 +V I++P+P  SL++
Sbjct: 187 FPDMFDESSQAF-FSSVCSENHENMGNN----------------CAVSIKVPLPEGSLVI 229

Query: 306 LWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATAS 347
           + G+AR++W H V  +DV   R+    RE +  +  N+   S
Sbjct: 230 VSGDARHKWMHAVSAADVQSTRIASTLRELSMEFTENDRELS 271


>gi|268563819|ref|XP_002647020.1| Hypothetical protein CBG24043 [Caenorhabditis briggsae]
          Length = 291

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 117/230 (50%), Gaps = 47/230 (20%)

Query: 107 HPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQK 166
           H   +  +  I G+ +  DFL+EAE  DV++ +D + W +SQSGRRKQ++GPK NFK +K
Sbjct: 73  HSTSAENAIQIDGLTLIHDFLTEAEESDVLEMIDTVEWVQSQSGRRKQDYGPKVNFKHKK 132

Query: 167 IKLGDFQGFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWG 224
           +K   F G P     + ++  + DV  L ++Q  E C LEY     S+I+ H DD WIWG
Sbjct: 133 VKTDSFIGMPEYADMLLNKMSEYDVTKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWG 192

Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSI 284
            R++++NLI  SV+TL+    DS +    S C+ Y                         
Sbjct: 193 NRLISINLINGSVMTLS---NDSKE----SLCYVY------------------------- 220

Query: 285 VNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
                        MP RSL+ +  E RYEW+H VL   + GRR+ +  RE
Sbjct: 221 -------------MPHRSLICMADECRYEWKHGVLAHHIRGRRIALTMRE 257


>gi|170579486|ref|XP_001894850.1| LD42289p [Brugia malayi]
 gi|158598401|gb|EDP36302.1| LD42289p, putative [Brugia malayi]
          Length = 297

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/339 (29%), Positives = 147/339 (43%), Gaps = 87/339 (25%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVE---FTE-ETYIYCPHCNKAWQGSDLYAYQSHPNHS 62
           CGCKGVR C  C+     +   + E   F + + Y+Y  H + A   + L      PN  
Sbjct: 18  CGCKGVRFCAACKDTLRVKKLRLCEEYPFAKYKKYVYSSHHHIAIHDNSL------PNRP 71

Query: 63  GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYI 122
             + DI  V                     NKA             N+      + G+Y+
Sbjct: 72  SLA-DIHDV-----------------ASRINKA------------ENNFEDDLAVPGLYV 101

Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
             DFLSEAE  D++  +D   W  SQSGRRKQ++GP+ NFK +KIK+  F G P+ T  I
Sbjct: 102 VIDFLSEAEEVDLVNAIDKTDWVLSQSGRRKQDYGPRINFKHKKIKMDRFFGMPSYTDVI 161

Query: 183 QDRFH--QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
            +R +    D+   +   E C LEY     S+I+ H DD WIWGER++ +NL+  SVLT 
Sbjct: 162 LNRMNSISSDLFGSYHPFELCNLEYRDSRWSTIEMHYDDTWIWGERLICINLLTKSVLT- 220

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
             Y  D  +                                          ++I +P+P+
Sbjct: 221 --YANDEKQ------------------------------------------LIIYVPLPS 236

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           R+++ +  E RY W+H V    + GRR+ +  RE +  +
Sbjct: 237 RTMVCMSDEIRYSWRHAVFPEHIRGRRIALTMREPSTAF 275


>gi|393907552|gb|EJD74689.1| hypothetical protein LOAG_18028 [Loa loa]
          Length = 293

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 150/336 (44%), Gaps = 81/336 (24%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKGVR C  C        K+ +  T+        C +       Y Y  +  +   + 
Sbjct: 14  CGCKGVRFCAAC--------KDTLRVTKLAL-----CGE-------YPYGEYKKYVYSTR 53

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYA-YQSHPNHSGKSFDIGGVYIKED 125
            +  ++ K   LT +   T ++           DL        N       I G+++  D
Sbjct: 54  HLAAIHDKS--LTGRPTLTEIH-----------DLAGKINGTENKPEDYLIIPGLHVITD 100

Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
           FL+EAE  D++  +D   W  SQSGRRKQ++GP+ NFK +KIK+  F G P+    I +R
Sbjct: 101 FLTEAEEMDLINVIDRTDWVLSQSGRRKQDYGPRVNFKHKKIKMERFLGMPSYIDVILNR 160

Query: 186 FH--QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYY 243
            +    D+   +Q  E C LEY+ +  S+I+ H DD WIWGER++ VNL+  SVLT   Y
Sbjct: 161 MNSISPDLFGSYQPFELCNLEYSDDRWSNIEMHYDDTWIWGERLICVNLLSKSVLT---Y 217

Query: 244 NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSL 303
             D+ +                                          ++I +P+P+R++
Sbjct: 218 TNDAKQ------------------------------------------LIIYVPLPSRTM 235

Query: 304 LVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           + +  E RY W+H V    + GRR+ +  RE +A +
Sbjct: 236 VCMSDEIRYSWRHAVFPEHIRGRRIAVTMREPSAAF 271


>gi|402593158|gb|EJW87085.1| hypothetical protein WUBG_02002 [Wuchereria bancrofti]
          Length = 296

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 92/339 (27%), Positives = 150/339 (44%), Gaps = 87/339 (25%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKGVR C  C+     +   + E     Y +  +          Y Y +H +      
Sbjct: 17  CGCKGVRFCAACKDTLRVKELRLCE----EYPFAKYKK--------YVYSTHHH------ 58

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFD----IGGVYI 122
               + I ++ L+++                 +D++   +  N +   F+    + G+++
Sbjct: 59  ----IAIHDNSLSNRP--------------SLADIHDVANRINKAENKFEDYLVVPGLHV 100

Query: 123 KEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
             +FLSEAE  D++  +D   W  SQSGRRKQ++GP+ NFK +K+K+  F G P+ T  I
Sbjct: 101 VTNFLSEAEEVDLVNAIDKTDWVLSQSGRRKQDYGPRINFKHKKVKMDRFFGMPSYTDVI 160

Query: 183 QDRFH--QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
            +R +    D+   +Q  E C LEY     S+I+ H DD WIWGER++ VNL+  SVLT 
Sbjct: 161 LNRMNSISSDLFGSYQPFELCNLEYRDSRWSTIEMHYDDTWIWGERLICVNLLSKSVLT- 219

Query: 241 TYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPA 300
             Y  D  +                                          ++I +P+P 
Sbjct: 220 --YANDEKQ------------------------------------------LIIYVPLPC 235

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           R+++ +  E RY W+H V    + GRR+ +  RE +  +
Sbjct: 236 RTMVCMSDEIRYSWRHAVFPEHIRGRRIALTMREPSTAF 274


>gi|340057250|emb|CCC51593.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 299

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 86/262 (32%), Positives = 131/262 (50%), Gaps = 57/262 (21%)

Query: 85  TYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFD----IGGVYIKEDFLSEAETEDVMKQLD 140
           T+ +C +C K +   +  A ++  +H+G + D    + G+ +  DF+S+ E + +++  D
Sbjct: 64  THSFCSNCGKIFVFPE-RAIRACADHNGLTADSETTMRGLMVSPDFVSDIEEDYLLRFFD 122

Query: 141 DL---PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQ---EDVLSH 194
                 W  SQSGRRKQ++GP+ NFKK+K+K GD  G P   K I  R +Q    + +  
Sbjct: 123 GAHHSRWKVSQSGRRKQDYGPRANFKKRKLKKGDGNGMPIQLKDIITRVNQFISNETMKR 182

Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
           +QTIE   LEY+++ GSSID HIDD W+WG+RI  +NL+ D VLTL              
Sbjct: 183 YQTIEVSVLEYSTKCGSSIDTHIDDTWLWGDRIGGLNLLEDVVLTLV------------- 229

Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
                     D K T+                         + +P RS  +L GE+RY W
Sbjct: 230 ----------DSKGTVAT-----------------------VFVPRRSFFLLSGESRYNW 256

Query: 315 QHCVLRSDVIGRRVCIAYREFT 336
            H +   D+  RR+ + +REF 
Sbjct: 257 MHGIRSEDIKSRRISMTFREFA 278


>gi|324523341|gb|ADY48232.1| Alpha-ketoglutarate-dependent dioxygenase ABH4, partial [Ascaris
           suum]
          Length = 296

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 88/272 (32%), Positives = 129/272 (47%), Gaps = 61/272 (22%)

Query: 84  ETYVYCPHCNKAWQGS---------DLYAYQSHPNH-----SGKSFDIGGVYIKEDFLSE 129
           E YVY     KA   S         D++A  +  N        K  ++ G+ + +DFLSE
Sbjct: 43  ERYVYSSRHAKAIYDSRLPADPSLDDVHAAAARVNSLSTLDPDKCLNLDGILLIDDFLSE 102

Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH-- 187
            E + +M+++D+  W  SQSGRRKQ++GP+ NFK++K+K+  F G P  T FI  R    
Sbjct: 103 QEEKTIMRKIDNKEWIISQSGRRKQDYGPRVNFKQKKVKMDRFFGMPDYTDFILQRMEAL 162

Query: 188 QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
             + L+ FQ  E C LEY     S+I+ H DD WIWG R++++NL+ D V+T T      
Sbjct: 163 SANKLNAFQPFELCNLEYVESRQSAIELHFDDWWIWGNRLISLNLLNDCVMTFT------ 216

Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
                                   ND                  +V+   +P R+LL + 
Sbjct: 217 ------------------------ND---------------ERQLVVYAALPRRTLLCMC 237

Query: 308 GEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
            E RYEW+H VL   V GRR+ +  RE +  +
Sbjct: 238 DEVRYEWKHGVLPQHVRGRRIALTMREPSPAF 269


>gi|340380835|ref|XP_003388927.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
           ABH4-like [Amphimedon queenslandica]
          Length = 240

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 113/226 (50%), Gaps = 45/226 (19%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           +F   GV I  +F+S  E ++++  +D   W +SQSGR KQ++GPK NFKK+K+K   F 
Sbjct: 39  TFKFDGVKIIRNFVSFEEEKELVSFIDSSQWVESQSGRYKQDYGPKVNFKKRKLKTDTFY 98

Query: 174 GFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLI 233
           G P  T  + +R     V   F+ +E C L+Y+   GS+ID H DD W+WG+R++T+NL+
Sbjct: 99  GLPKYTHTLLERLESLSVTPSFKPVELCNLDYHPTRGSAIDPHFDDNWLWGDRLITINLL 158

Query: 234 GDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV 293
             ++L+                                              N  +  +V
Sbjct: 159 SSTLLSF---------------------------------------------NHPVDPLV 173

Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           + +P P  S++++ G ARY W H + R +V GRRV + +RE +  +
Sbjct: 174 VHVPHPRLSIVIIEGSARYNWLHGIRRENVTGRRVAMTFRELSREF 219


>gi|403343479|gb|EJY71069.1| hypothetical protein OXYTRI_08063 [Oxytricha trifallax]
          Length = 342

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 107/391 (27%), Positives = 167/391 (42%), Gaps = 103/391 (26%)

Query: 2   DRSNFCGCKGVRTCIKC-EQEFGYENKN------IVEFTEE--TYIYCPHCNKAWQGSDL 52
           DR   C C GVR C  C + EF  + K+      I+E  ++  TY+ C  CN+    ++L
Sbjct: 8   DRERKCACTGVRYCKDCNDPEFRKQFKDLYPIDDILEAQQKVLTYVTCELCNRFKLKNEL 67

Query: 53  YAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSG 112
              QS  N+            K+D    Q+                   +  Q   + + 
Sbjct: 68  NISQSDQNNGNND--------KQDDQNYQS-------------------FIDQCIGHLTS 100

Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
           +  D GG+Y   DF+SE   +D++ +L D  W  SQSGR+K +FGP+ NFKKQK+K   F
Sbjct: 101 EQLDFGGLYTINDFISEEFEQDIVNKLQDYKWVDSQSGRKKIDFGPQVNFKKQKLKYTKF 160

Query: 173 QGFPACTK------------FIQDRFHQE-------------DVLSHFQTIEQCTLEYNS 207
            GFP   K             +Q++  ++              VL  F  IE   LEY+ 
Sbjct: 161 TGFPLFIKPILDLIQTLNDEILQEKIEEQKNQQKPQLKENLPSVLKDFIPIEVNVLEYDE 220

Query: 208 ENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQK 267
           + GS+I  H DD W+WGERI+ +NL+ D+ +T                   +Q+   +Q 
Sbjct: 221 QRGSNIAPHKDDFWLWGERIIGINLLQDTFMT-------------------FQRDSQNQI 261

Query: 268 STIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRR 327
             IV                      I +P+  R L ++ G++R+EW H +    + G+R
Sbjct: 262 GQIVE---------------------IEVPVKRRMLYIISGKSRFEWMHGIKSEHIKGKR 300

Query: 328 VCIAYREFTAPY--MNNEATASVFEKAKCFF 356
           +    REF+  +   +NE    +   AK + 
Sbjct: 301 IVCTIREFSDEFKSQDNEDANKIRAIAKTYI 331


>gi|25148697|ref|NP_741141.1| Protein F09F7.7, isoform a [Caenorhabditis elegans]
 gi|351060673|emb|CCD68390.1| Protein F09F7.7, isoform a [Caenorhabditis elegans]
          Length = 291

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 114/223 (51%), Gaps = 47/223 (21%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           S +I G+ +  +FLSE+E   ++  +D + W +SQSGRRKQ++GPK NFK +K+K   F 
Sbjct: 80  SIEIDGLTLIHNFLSESEESKILNMIDTVEWAQSQSGRRKQDYGPKVNFKHKKVKTDTFV 139

Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           G P     + ++  + DV  L ++Q  E C LEY     S+I+ H DD WIWG R++++N
Sbjct: 140 GMPEYADMLLNKMSEYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLISIN 199

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
           LI  SV+TL+    + NK  L   C+                                  
Sbjct: 200 LINGSVMTLS----NDNKSFL---CY---------------------------------- 218

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
               + MP RSLL +  E RY+W+H VL   + GRR+ +  RE
Sbjct: 219 ----VHMPHRSLLCMADECRYDWKHGVLAHHIRGRRIALTMRE 257


>gi|403286064|ref|XP_003934327.1| PREDICTED: uncharacterized protein LOC101032010 [Saimiri
           boliviensis boliviensis]
          Length = 517

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 77/235 (32%), Positives = 118/235 (50%), Gaps = 57/235 (24%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ                  
Sbjct: 23  CGCKGIRTCLICERQRGG-------------------DPPWQ------------------ 45

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQ-GSDLYAYQSHPNHSGKSFDIGGVYIKED 125
                      L       ++YC   +  W  GS+   ++      G +F   GV + ED
Sbjct: 46  -----------LPPAKTHHFIYC--SDTGWAVGSEESDFE------GWAFPFPGVMLIED 86

Query: 126 FLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR 185
           F+++ E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K   F G P+ ++ +  R
Sbjct: 87  FVTQEEEAELVRLMDCDPWKLSQSGRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRR 146

Query: 186 FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
                 L  FQ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++
Sbjct: 147 MGLYPGLEGFQPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSM 201


>gi|444715583|gb|ELW56448.1| putative alpha-ketoglutarate-dependent dioxygenase ABH4 [Tupaia
           chinensis]
          Length = 421

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 77/234 (32%), Positives = 121/234 (51%), Gaps = 55/234 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE + G                    +  WQ     A ++H        
Sbjct: 15  CGCKGIRTCLICELQRGG-------------------DPPWQP---LARKTH-------- 44

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                              +VYC   +  W    + A +S  +  G +F   GV + EDF
Sbjct: 45  ------------------RFVYC--SDTGWA---VGAEES--DFEGWAFPFPGVMLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+K+K+K+  F+G P+ ++ +  R 
Sbjct: 80  VTPEEEAEMVRLMDLDPWKLSQSGRRKQDYGPKVNFRKRKLKMAGFRGLPSFSRELVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
                L  F+ +EQC L+Y  E GS+ID H+DD W+WGER+V++NL+  +VL++
Sbjct: 140 GLYPGLEGFRPVEQCNLDYCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSM 193



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 29/39 (74%)

Query: 301 RSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           RSLL+L G AR++W+H + R D+  RRVC+ +RE +A +
Sbjct: 358 RSLLLLPGAARHQWKHAIHRRDIEARRVCVTFRELSAEF 396


>gi|341883863|gb|EGT39798.1| hypothetical protein CAEBREN_17559 [Caenorhabditis brenneri]
          Length = 292

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 74/223 (33%), Positives = 116/223 (52%), Gaps = 47/223 (21%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           S +I G+ +  +FLSE+E  +++  +D++ W +SQSGRRKQ++GPK NFK +K+K   F 
Sbjct: 80  SIEIDGLTLIHNFLSESEETNILNMIDNVEWVQSQSGRRKQDYGPKVNFKHKKVKTDSFI 139

Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           G P     + ++  Q DV  L ++Q  E C LEY     S+I+ H DD WIWG R++++N
Sbjct: 140 GMPEYADMLLEKMSQYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLISIN 199

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
           LI  SV+TL+    +S ++     C+                                  
Sbjct: 200 LINGSVMTLS---NESRQF----LCY---------------------------------- 218

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
               + MP RSL+ +  + RY+W+H VL   + GRR+ +  RE
Sbjct: 219 ----VHMPHRSLICMADDCRYDWKHGVLAHHIRGRRIALTMRE 257


>gi|403359506|gb|EJY79416.1| hypothetical protein OXYTRI_23312 [Oxytricha trifallax]
          Length = 335

 Score =  137 bits (345), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 164/395 (41%), Gaps = 107/395 (27%)

Query: 2   DRSNFCGCKGVRTCIKC-EQEFG------YENKNIVEFTEE--TYIYCPHCNKAWQGSDL 52
           DR   C C GVR C  C + EF       Y   +I+E  ++  TY+ C  CN+    ++L
Sbjct: 8   DRERKCACTGVRYCKDCNDPEFRKQFKDLYPIDDILEAQQKVLTYVTCGLCNRFKLKNEL 67

Query: 53  YAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSG 112
               S  N             K+D    Q+                   +  Q   + + 
Sbjct: 68  NISISDQNDGNND--------KQDNQNDQS-------------------FIDQCIGHLTA 100

Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
           +  D GG+Y  ++ +SE    +++ +L D  W  SQSGR+K +FGP+ NFKKQK+K   F
Sbjct: 101 EQLDFGGLYTIKEIISEDFEYNIVNKLQDYKWVDSQSGRKKIDFGPQVNFKKQKLKYTKF 160

Query: 173 QGFPACTK-----------------------------FIQDRFHQEDVLSHFQTIEQCTL 203
            GFP   K                              +Q + H   VL  FQ IE   L
Sbjct: 161 TGFPLFIKPILDLISTLNDQNLQQKEEELKEEQKDQALLQLKQHLPSVLKDFQPIEVNVL 220

Query: 204 EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKY 263
           EY+ + GS+I  H DD W+WGERI+ +NL+ D+ +T                   +Q+  
Sbjct: 221 EYDEQRGSNIAPHKDDFWLWGERIIGINLLKDTFMT-------------------FQRDS 261

Query: 264 PDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDV 323
            +Q   IV                      I +P+  R + V+ G++R+EW H +    +
Sbjct: 262 ENQLGQIVE---------------------IEVPVKRRMMYVISGKSRFEWMHGIKSEHI 300

Query: 324 IGRRVCIAYREFTAPY--MNNEATASVFEKAKCFF 356
            G+R+   +REF+  +   +NE    + E AK F 
Sbjct: 301 KGKRIVCTFREFSDEFKSQDNEDANKIREIAKNFI 335


>gi|209489362|gb|ACI49132.1| hypothetical protein Cbre_JD16.004 [Caenorhabditis brenneri]
          Length = 314

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/223 (33%), Positives = 115/223 (51%), Gaps = 47/223 (21%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           S +I G+ +  +FLSE+E  +++  +D++ W +SQSGRRKQ++GPK NFK +K+K   F 
Sbjct: 102 SIEIDGLTLIHNFLSESEESNILNMIDNVEWVQSQSGRRKQDYGPKVNFKHKKVKTDSFI 161

Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           G P     + ++  Q DV  L ++Q  E C LEY     S+I+ H DD WIWG R++++N
Sbjct: 162 GMPEYADMLLEKMSQYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLISIN 221

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
           LI  SV+TL+    +S ++     C+                                  
Sbjct: 222 LINGSVMTLS---NESRQF----LCY---------------------------------- 240

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
               + MP RSL+ +    RY+W+H VL   + GRR+ +  RE
Sbjct: 241 ----VHMPHRSLICMADACRYDWKHGVLAHHIRGRRIALTMRE 279


>gi|308497524|ref|XP_003110949.1| hypothetical protein CRE_04550 [Caenorhabditis remanei]
 gi|308242829|gb|EFO86781.1| hypothetical protein CRE_04550 [Caenorhabditis remanei]
          Length = 291

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 75/226 (33%), Positives = 113/226 (50%), Gaps = 47/226 (20%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S  + +I G+ +  DFL+E E   +++ +D++ W +SQSGRRKQ++GPK NFK +K+K  
Sbjct: 77  SDDAIEIDGLTLIHDFLTEREESSILEMIDNVEWVQSQSGRRKQDYGPKVNFKHKKVKTD 136

Query: 171 DFQGFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV 228
            F G P     + ++    D+  L ++Q  E C LEY     S+I+ H DD WIWG R++
Sbjct: 137 SFIGMPEYADMLLNKMSDYDLKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLI 196

Query: 229 TVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNET 288
           ++NLI  SV+TL                                       S DS     
Sbjct: 197 SINLISGSVMTL---------------------------------------SNDS----- 212

Query: 289 LSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
             S +  + MP RSL+ +  E RY+W+H VL   + GRR+ +  RE
Sbjct: 213 -KSFLCYVHMPHRSLICMADECRYDWKHGVLAHHIRGRRIALTMRE 257


>gi|426255358|ref|XP_004021316.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4
           [Ovis aries]
          Length = 252

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 76/235 (32%), Positives = 115/235 (48%), Gaps = 55/235 (23%)

Query: 7   CGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKSF 66
           CGCKG+RTC+ CE++ G                    +  WQ                  
Sbjct: 15  CGCKGIRTCLICERQRG-------------------GDPPWQH----------------- 38

Query: 67  DIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDF 126
                        SQ    ++Y P    A     + A +S  +  G +F   GV + EDF
Sbjct: 39  ------------LSQKTHRFIYYPDTGWA-----VGAEES--DFEGWAFPFPGVTLIEDF 79

Query: 127 LSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRF 186
           ++  E  ++++ +D  PW  SQSGRRKQ++GPK NF+KQK+K+ DF+G P+ ++ +  R 
Sbjct: 80  VTREEEAEMVQLMDREPWKLSQSGRRKQDYGPKVNFRKQKLKMADFRGLPSFSREVVRRM 139

Query: 187 HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLT 241
               +L  FQ +EQC L+Y  E GS+ID H+DD W+WGER+        S++ L+
Sbjct: 140 GLYPILEDFQPVEQCNLDYCPERGSAIDPHLDDAWLWGERLGGPQPPRRSLVVLS 194


>gi|7498760|pir||T16009 hypothetical protein F09F7.7 - Caenorhabditis elegans
          Length = 329

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/223 (33%), Positives = 119/223 (53%), Gaps = 9/223 (4%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           S +I G+ +  +FLSE+E   ++  +D + W +SQSGRRKQ++GPK NFK +K+K   F 
Sbjct: 80  SIEIDGLTLIHNFLSESEESKILNMIDTVEWAQSQSGRRKQDYGPKVNFKHKKVKTDTFV 139

Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           G P     + ++  + DV  L ++Q  E C LEY     S+I+ H DD WIWG R++   
Sbjct: 140 GMPEYADMLLNKMSEYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLIR-- 197

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
              ++     +     +K  L S   +    +  ++S     ++L   S  ++ N+   S
Sbjct: 198 --SENFRVFDFSQKLCSKKCLASRIINLNTNF--EESNQFFSINLINGSVMTLSNDN-KS 252

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
            +  + MP RSLL +  E RY+W+H VL   + GRR+ +  RE
Sbjct: 253 FLCYVHMPHRSLLCMADECRYDWKHGVLAHHIRGRRIALTMRE 295


>gi|167524358|ref|XP_001746515.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775277|gb|EDQ88902.1| predicted protein [Monosiga brevicollis MX1]
          Length = 207

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 117/229 (51%), Gaps = 44/229 (19%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           GV +  DF++EAE   ++ Q+D  PW  SQSGRRKQ+FGPK NFKK+K+K  +F G PA 
Sbjct: 5   GVRVWLDFVTEAEETALVAQMDAWPWTDSQSGRRKQDFGPKANFKKKKVKAANFTGLPAL 64

Query: 179 TKFIQDRFHQEDV-LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
            + +  R       L  F  +EQC L+Y    G++I  H DD W+WGER++T+NL+  + 
Sbjct: 65  ARPLIPRLQALTADLKDFVPVEQCHLDYEPSRGAAIVPHFDDFWLWGERLITLNLLSTTF 124

Query: 238 LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           L                 CF    + PD+                        ++ I +P
Sbjct: 125 L-----------------CF----QLPDEP----------------------DALEIAVP 141

Query: 298 MPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATA 346
           +P RSLL + G AR  W H V R DV  RR+ + +RE T  +++ E  A
Sbjct: 142 LPPRSLLEVKGVARMAWHHAVHRQDVSQRRIAMTWRELTPEFLSGERQA 190


>gi|312072236|ref|XP_003138973.1| hypothetical protein LOAG_03388 [Loa loa]
          Length = 186

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 104/208 (50%), Gaps = 47/208 (22%)

Query: 134 DVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQ--EDV 191
           D++  +D   W  SQSGRRKQ++GP+ NFK +KIK+  F G P+    I +R +    D+
Sbjct: 2   DLINVIDRTDWVLSQSGRRKQDYGPRVNFKHKKIKMERFLGMPSYIDVILNRMNSISPDL 61

Query: 192 LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYN 251
              +Q  E C LEY+ +  S+I+ H DD WIWGER++ VNL+  SVLT   Y  D+ +  
Sbjct: 62  FGSYQPFELCNLEYSDDRWSNIEMHYDDTWIWGERLICVNLLSKSVLT---YTNDAKQ-- 116

Query: 252 LNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEAR 311
                                                   ++I +P+P+R+++ +  E R
Sbjct: 117 ----------------------------------------LIIYVPLPSRTMVCMSDEIR 136

Query: 312 YEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           Y W+H V    + GRR+ +  RE +A +
Sbjct: 137 YSWRHAVFPEHIRGRRIAVTMREPSAAF 164


>gi|261332706|emb|CBH15701.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 304

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 109/229 (47%), Gaps = 54/229 (23%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
            ++ G+ I+  F+S  E E ++   D+      W  SQSGRRKQ +GPK NFKK+K+K+G
Sbjct: 102 IELSGLTIQPGFVSPNEEEYLVAFFDNPAPFAAWKVSQSGRRKQEYGPKANFKKRKLKVG 161

Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERI 227
           DF+  P   K   DR   F  E  +  +  +E   LEY +E  S +D HIDD W+WG+RI
Sbjct: 162 DFRALPHQMKTTLDRVRSFVAEQTMREYCIVEVSVLEYTAEC-SCLDPHIDDTWLWGDRI 220

Query: 228 VTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNE 287
             +NL+ D  +TLT+ + D                                         
Sbjct: 221 GGLNLLDD--VTLTFVSAD----------------------------------------- 237

Query: 288 TLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
               V + + +P  +  +L G +RYEW H + R DV  RRV + +REF 
Sbjct: 238 ---EVAVTVFVPRGAFFLLTGVSRYEWMHGIRREDVKNRRVSVTFREFA 283


>gi|71747664|ref|XP_822887.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832555|gb|EAN78059.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 304

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 109/229 (47%), Gaps = 54/229 (23%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
            ++ G+ I+  F+S  E E ++   D+      W  SQSGRRKQ +GPK NFKK+K+K+G
Sbjct: 102 IELSGLTIQPGFVSPNEEEYLVAFFDNPAPFAAWKVSQSGRRKQEYGPKANFKKRKLKVG 161

Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERI 227
           DF+  P   K   DR   F  E  +  +  +E   LEY +E  S +D HIDD W+WG+RI
Sbjct: 162 DFRALPHQMKTTLDRVRSFVAEQTMREYCIVEVSVLEYTAEC-SCLDPHIDDTWLWGDRI 220

Query: 228 VTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNE 287
             +NL+ D  +TLT+ + D                                         
Sbjct: 221 GGLNLLDD--VTLTFVSAD----------------------------------------- 237

Query: 288 TLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
               V + + +P  +  +L G +RYEW H + R DV  RRV + +REF 
Sbjct: 238 ---EVAVTVFVPRGAFFLLTGVSRYEWMHGIRREDVKNRRVSVTFREFA 283


>gi|407409697|gb|EKF32423.1| hypothetical protein MOQ_003722 [Trypanosoma cruzi marinkellei]
          Length = 304

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 75/263 (28%), Positives = 121/263 (46%), Gaps = 58/263 (22%)

Query: 86  YVYCPHCNKAWQGSDLYAYQSHPNHSGKS----FDIGGVYIKEDFLSEAETEDVMKQLDD 141
           Y YC  CN+ +      A++S  +H G +      I G+++  DFLS  + E ++  LD+
Sbjct: 69  YGYCWQCNRIFLMHH-GAFKSCADHEGATPNLDIRIEGLFVIPDFLSLLDEEKLVSFLDE 127

Query: 142 LP----WDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDR---FHQEDVLSH 194
                 W  SQSGRRKQ+FGP+ NFKK+K+     +G P   + + ++   F +E     
Sbjct: 128 PSSFSGWKHSQSGRRKQDFGPRANFKKRKLNTSGMRGMPKQLESVMEKVKSFVREITSKE 187

Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
           +  +E   LEY SEN SSID HIDD W+WG+R+  +NL+ D+V+T     G         
Sbjct: 188 YHIVEVSALEYTSENSSSIDPHIDDTWVWGDRVGGLNLLEDTVMTFVNNEG--------- 238

Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
                                                  + + +P  +  +L   +RY+W
Sbjct: 239 -------------------------------------TAVDVFLPRGAFFLLSQGSRYDW 261

Query: 315 QHCVLRSDVIGRRVCIAYREFTA 337
            H +   ++  RR+   +REF++
Sbjct: 262 LHGIRLENIKHRRISFTFREFSS 284


>gi|407849132|gb|EKG03971.1| hypothetical protein TCSYLVIO_004974 [Trypanosoma cruzi]
          Length = 304

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 90/355 (25%), Positives = 147/355 (41%), Gaps = 97/355 (27%)

Query: 1   MDRSNFCGCKGVRTCIKC---EQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQS 57
           M++   C C G+R C +C   E+  G  ++N++                 + SD+ + Q 
Sbjct: 7   MEKEEGCCCSGIRFCGRCIESERAQGIIHQNVL---------------LVKSSDVISQQY 51

Query: 58  HPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLY-----AYQSHPNHSG 112
                                TS    T V   H    WQ + ++     A++S  +H G
Sbjct: 52  SAGR-----------------TSSCSFTCVELSHYGLCWQCNRIFLMHHGAFKSCADHEG 94

Query: 113 KS----FDIGGVYIKEDFLSEAETEDVMKQLDDLP----WDKSQSGRRKQNFGPKCNFKK 164
            +      I G+++  DFLS  + E ++  LD+      W  SQSGRRKQ+FGP+ NFKK
Sbjct: 95  ATPNLDIRIEGLFVIPDFLSLLDEEKLVSFLDEPSSFSGWKHSQSGRRKQDFGPRANFKK 154

Query: 165 QKIKLGDFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
           +K+     +G P   + +  +   F ++     +  +E   LEY SEN SSID HIDD W
Sbjct: 155 RKLNTSGIRGMPKQLESVVKKVKSFVRDITSKEYHIVEVSALEYTSENSSSIDPHIDDTW 214

Query: 222 IWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKST 281
           +WG RI  +NL+ D+V+T     G +                                  
Sbjct: 215 VWGNRIGGLNLLEDTVMTFVNNEGTA---------------------------------- 240

Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
                       + + +P  +  +L   +RY+W H +   ++  RR+   +REF+
Sbjct: 241 ------------VDVFLPRGAFFLLSNGSRYDWLHGIRLENIKHRRISFTFREFS 283


>gi|71659461|ref|XP_821452.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886833|gb|EAN99601.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 304

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 147/355 (41%), Gaps = 97/355 (27%)

Query: 1   MDRSNFCGCKGVRTCIKC---EQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQS 57
           M++   C C G+R C +C   E+  G  ++N++                 + SD+ + Q 
Sbjct: 7   MEKEEGCCCSGIRFCGRCIESERAQGITHQNVL---------------LVKSSDVISQQY 51

Query: 58  HPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLY-----AYQSHPNHSG 112
                                TS    T V   H    WQ + ++      ++S  +H G
Sbjct: 52  GAGR-----------------TSSCSFTCVELSHYGLCWQCNRIFLMHHGVFKSCADHEG 94

Query: 113 KS----FDIGGVYIKEDFLSEAETEDVMKQLDDLP----WDKSQSGRRKQNFGPKCNFKK 164
            +      I G+++  DFLS  + E ++  LD+      W  SQSGRRKQ+FGP+ NFKK
Sbjct: 95  TTPNLDIRIEGLFVIPDFLSLLDEEKLVSFLDEPSSLSGWKHSQSGRRKQDFGPRANFKK 154

Query: 165 QKIKLGDFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
           +K+     +G P   + + ++   F ++     +  +E   LEY SEN SSID HIDD W
Sbjct: 155 RKLNTSGIRGMPKQLESVMEKVKSFVRDITSKEYHIVEVSALEYTSENSSSIDPHIDDTW 214

Query: 222 IWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKST 281
           +WG R+  +NL+ D+V+T     G +                                  
Sbjct: 215 VWGNRVGGLNLLEDTVMTFVNNEGTA---------------------------------- 240

Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
                       + + +P  +  +L   +RY+W H +   ++  RR+   +REF+
Sbjct: 241 ------------VDVFLPRGAFFLLSNGSRYDWLHGIRLENIKHRRISFTFREFS 283


>gi|339242841|ref|XP_003377346.1| alkylated DNA repair protein AlkB protein [Trichinella spiralis]
 gi|316973860|gb|EFV57409.1| alkylated DNA repair protein AlkB protein [Trichinella spiralis]
          Length = 273

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 136/335 (40%), Gaps = 85/335 (25%)

Query: 6   FCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNHSGKS 65
            CGCKG+R C  CE     +N        E  +        W+   ++ Y    + SG S
Sbjct: 11  ICGCKGIRFCRLCENSDRVKN-----LQRERCL-------DWKKYRIFVYCRKCDSSGHS 58

Query: 66  FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYA-YQSHPNHSGKSFDIGGVYIKE 124
                       LTS+           +   Q  +LY  +      +  SF    V + E
Sbjct: 59  ------------LTSEM----------HSVEQLLELYNNFHDLVQGNFDSFACETVKVVE 96

Query: 125 DFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-----FPACT 179
           +FL+E E   ++K +    W  SQSGRRKQ++GPK NFKK+KIK            P   
Sbjct: 97  NFLTEDEEASLLKSISQSAWQVSQSGRRKQDYGPKVNFKKRKIKYHTPNNQRDNTLPEYF 156

Query: 180 KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
            F+ DR  Q   L  F  ++   LEY SE GS I+ H DD W WGERIVT +L  D+V++
Sbjct: 157 SFLFDRMKQLKFLQDFVAVQVTNLEYCSERGSWIEFHTDDQWAWGERIVTCSLCSDAVMS 216

Query: 240 LTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMP 299
                 +S                                             +I +P+ 
Sbjct: 217 FLEEENNS---------------------------------------------LIHVPLL 231

Query: 300 ARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYRE 334
            RSL+ + G  R++ +H +L   ++G R+ I  RE
Sbjct: 232 KRSLICISGPMRHKLKHAILPGHIVGTRIAITVRE 266


>gi|154345804|ref|XP_001568839.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066181|emb|CAM43971.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 297

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 141/364 (38%), Gaps = 79/364 (21%)

Query: 2   DRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNH 61
           + S+ C C G+R C KC      +      F+   ++           S +   Q H + 
Sbjct: 3   EHSSPCVCSGIRFCAKCRDTLRVQQL----FSGSVFLS--------SASSVIEKQWHNDR 50

Query: 62  -SGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQG-SDLYAYQSHPNHSGKSFDIGG 119
            S  SF I G            K T  YC  C   ++  + + +   H      S  I G
Sbjct: 51  LSSCSFAIIG------------KSTLSYCIECMTIFKSEAPIKSCVDHQGAMSTSIVISG 98

Query: 120 VYIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGF 175
           + + +D L+E E   ++  LD+     PW +SQSGRRKQ++GPK NFKK+K++  +    
Sbjct: 99  LVVFQDVLTEEEETALIYYLDNSHPFPPWKESQSGRRKQDYGPKRNFKKKKVRPAEIPAM 158

Query: 176 PACTKFIQDRFHQED---VLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNL 232
           P   + +               ++  E   LEY     S+ D HIDD W+WG+RI  +NL
Sbjct: 159 PLALEPVCATISSTTENFTGRAYRIAEVSALEYVEGKMSNFDPHIDDTWLWGDRIAGLNL 218

Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
               V+T    +G                                              V
Sbjct: 219 NEPCVVTFVEPDG----------------------------------------------V 232

Query: 293 VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASVFEKA 352
              + +P RS  ++ G  RY+W H +    V GRR+ + +RE +   + +  T+ V   A
Sbjct: 233 CCDVYLPRRSFFLMSGNCRYKWMHGIRPEHVRGRRISLTFRELSDEILADAETSEVVLSA 292

Query: 353 KCFF 356
              F
Sbjct: 293 ASTF 296


>gi|342184304|emb|CCC93785.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 306

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 105/227 (46%), Gaps = 54/227 (23%)

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDL----PWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
           + G  +  DF+S  E + ++   DD     PW  SQSGRRKQ++GPK NFKK+K+KLG+F
Sbjct: 106 MDGFCLMTDFISPLEEQKLVTFFDDPAPFPPWKDSQSGRRKQDYGPKANFKKKKLKLGNF 165

Query: 173 QGFPACTKFIQDR---FHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVT 229
           QG P   + +  R   F        F   E   LEY ++  SS+D H+DD W+WG+RI  
Sbjct: 166 QGMPQQMEELLGRVTSFVSRYTNKEFSVAEVSALEYTTKC-SSLDPHVDDTWLWGDRIGG 224

Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
           +NL+ D VLT    +G                                            
Sbjct: 225 LNLLVDVVLTFVNASG-------------------------------------------- 240

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFT 336
             + +   +P RS  +L    RYEW H + R D++GRR+ + +RE  
Sbjct: 241 --IAVAAHIPRRSFFMLSKVCRYEWMHGIRREDIVGRRISVTFRELA 285


>gi|401420114|ref|XP_003874546.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322490782|emb|CBZ26046.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 297

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 118/282 (41%), Gaps = 55/282 (19%)

Query: 83  KETYVYCPHCNKAWQGS-DLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDD 141
           K +  +C  C   ++    + +   H      S  I G+ + +D L E +   V++ LDD
Sbjct: 61  KSSLSFCIECMSVFKSEVPIKSCADHHGAMATSITISGLAVFQDALPEEDETAVIRFLDD 120

Query: 142 ----LPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI---QDRFHQEDVLSH 194
                PW +SQSGRRKQ++GP+ NFKK+K+K+ D    P   + I        +      
Sbjct: 121 SHPFPPWKESQSGRRKQDYGPRRNFKKKKVKVADIPAMPLVFEPIFSVISSMTETFTGKA 180

Query: 195 FQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
           ++  E   LEY     S+ D H+DD W+WG+RI  +N                       
Sbjct: 181 YRIAEVSALEYMEGKMSNFDPHVDDTWLWGDRIAGLN----------------------- 217

Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEW 314
                                LNE    + VN     V   + +P R+  ++ G+ RY+W
Sbjct: 218 ---------------------LNEACAVTFVNS--EGVCCDVYLPRRTFFLMSGDCRYKW 254

Query: 315 QHCVLRSDVIGRRVCIAYREFTAPYM-NNEATASVFEKAKCF 355
            H +    V GRR+ + +RE +   + + EA+  V   A  F
Sbjct: 255 MHGIRPEHVRGRRISLTFRELSDEILADTEASEMVLSAASNF 296


>gi|25148702|ref|NP_741142.1| Protein F09F7.7, isoform b [Caenorhabditis elegans]
 gi|351060674|emb|CCD68391.1| Protein F09F7.7, isoform b [Caenorhabditis elegans]
          Length = 208

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 75/127 (59%), Gaps = 2/127 (1%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
           S +I G+ +  +FLSE+E   ++  +D + W +SQSGRRKQ++GPK NFK +K+K   F 
Sbjct: 80  SIEIDGLTLIHNFLSESEESKILNMIDTVEWAQSQSGRRKQDYGPKVNFKHKKVKTDTFV 139

Query: 174 GFPACTKFIQDRFHQEDV--LSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
           G P     + ++  + DV  L ++Q  E C LEY     S+I+ H DD WIWG R++   
Sbjct: 140 GMPEYADMLLNKMSEYDVKKLGNYQPFEMCNLEYEEVKKSAIEMHQDDMWIWGNRLIRYE 199

Query: 232 LIGDSVL 238
            +   VL
Sbjct: 200 FLFKKVL 206


>gi|146104297|ref|XP_001469788.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398024324|ref|XP_003865323.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134074158|emb|CAM72900.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322503560|emb|CBZ38646.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 297

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 144/370 (38%), Gaps = 92/370 (24%)

Query: 2   DRSNFCGCKGVRTCIKCEQEFGYE---NKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSH 58
           + S+ C C G+R C KC      +   + ++   +  T I   H ++             
Sbjct: 3   EHSSACSCAGIRFCAKCRDSSRVQQLFSGSVPLSSARTVIEKQHNDERL----------- 51

Query: 59  PNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGS-DLYAYQSHPNHSGKSFDI 117
              S  SF I G            K +  +C  C   ++    + +   H      S  I
Sbjct: 52  ---SSCSFAIIG------------KSSLSFCIECASVFKSEVPIKSCADHQGAMATSITI 96

Query: 118 GGVYIKEDFLSEAETEDVMKQLDDL----PWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ 173
            G+ + +  L+E +   V++ LDD     PW +SQSGRRKQ++GP+ NFKK+K+K+ +  
Sbjct: 97  SGLAVLQHALTEEDEAAVIRFLDDSHPFPPWKESQSGRRKQDYGPRRNFKKKKVKVAEIP 156

Query: 174 GFP-------ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGER 226
             P       A    + + F  +     ++  E   LEY     S+ D H+DD W+WG+R
Sbjct: 157 SMPLVFESVFAVISSMTETFTGK----AYRIAEVSALEYMEGKMSNFDPHVDDTWLWGDR 212

Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
           I  +NL    V+T     G                                         
Sbjct: 213 IAGLNLNEACVVTFVNPEG----------------------------------------- 231

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM-NNEAT 345
                V   + +P R+  ++ G  RY+W H +    V GRR+ + +RE +   + + EA+
Sbjct: 232 -----VCCDVYLPRRAFFLMSGNCRYKWMHGIRHEHVRGRRISLTFRELSDEILADTEAS 286

Query: 346 ASVFEKAKCF 355
             V   A  F
Sbjct: 287 EMVLSAASNF 296


>gi|195577921|ref|XP_002078817.1| GD23631 [Drosophila simulans]
 gi|194190826|gb|EDX04402.1| GD23631 [Drosophila simulans]
          Length = 349

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 91/320 (28%), Positives = 140/320 (43%), Gaps = 48/320 (15%)

Query: 66  FDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQS-HPNHS-GKSFDIGGVYIK 123
           F I    ++E F   Q  E + YC  C+    G D    Q  H NH   +   + G+ ++
Sbjct: 46  FQIVKTSLQEQF---QELEAWSYCIQCDHLQPGWDTNQVQKDHENHKKDEGLPLPGILVQ 102

Query: 124 EDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ-GFPACTKFI 182
           E+FLS  E   ++  LDDLPWD SQSGRRKQNFGPK N    KI +   +   PA    +
Sbjct: 103 EEFLSVDEGAQLIADLDDLPWDISQSGRRKQNFGPKTN----KISVNALELSAPATPHLL 158

Query: 183 QDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNH----------IDDCWIWGERIV---- 228
           Q   HQ+   +H Q   Q  ++ +    S +             ++D  +   R++    
Sbjct: 159 Q---HQQ---AHKQAKPQGWMQLSGHPESIVPTSTGIVRKRISGLEDSEVHAYRLICKEP 212

Query: 229 -TVNLIG-----DSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD 282
            T  ++        + +  +            PC    K     ++ + +++S   K   
Sbjct: 213 QTAQIVPAYFGIQEMQSQHFIELQDLLAGFRDPCVMDIKM--GSRTFLESEVSSMAKGRG 270

Query: 283 SIVNETLSSVVIRIPMPARSLLVL---WGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
                T    +     P +  L L   +G ARY+++H VLR DV  RRVC+AYREFT  Y
Sbjct: 271 RPTAGTKKPTL----QPTQRSLSLIVLYGPARYQFEHSVLREDVQERRVCVAYREFTPMY 326

Query: 340 MNN---EATASVFEKAKCFF 356
           +N    +    V EK++ F+
Sbjct: 327 INGADIQKGDPVREKSQIFW 346


>gi|157876860|ref|XP_001686772.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68129847|emb|CAJ09153.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 297

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 142/367 (38%), Gaps = 86/367 (23%)

Query: 2   DRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEETYIYCPHCNKAWQGSDLYAYQSHPNH 61
           + S+ C C G+R C KC              +    ++      +   S +   Q +   
Sbjct: 3   EHSSACSCAGIRFCAKCRGS-----------SRVQQLFNGSVPLSSARSVIEKQQDNERL 51

Query: 62  SGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGS-DLYAYQSHPNHSGKSFDIGGV 120
           S  SF + G            K +  +C  C   ++    + +   H         I G+
Sbjct: 52  SSCSFAVIG------------KSSLSFCIECASVFKSEVPIKSCADHQGAVATGITISGL 99

Query: 121 YIKEDFLSEAETEDVMKQLDD----LPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
            +  D L+E +   V++ LDD     PW +SQSGRRKQ++GPK NFKK+KIK+ +  G P
Sbjct: 100 AVFRDTLTEEDETAVIRFLDDSRPFPPWKESQSGRRKQDYGPKRNFKKKKIKVAEIPGMP 159

Query: 177 -------ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVT 229
                  A    + + F  +     ++  E   LEY     S+ D H+DD W+WG+RI  
Sbjct: 160 LVFESVFAVISSMVETFTGK----AYRIAEVSALEYMEGKMSNFDPHVDDTWLWGDRIAG 215

Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
           +N                                            LNE    + VN   
Sbjct: 216 LN--------------------------------------------LNEACAVTFVNP-- 229

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYM-NNEATASV 348
             V   + +P R+  ++ G  RY W H +    V GRR+ + +RE +   + + +A+  V
Sbjct: 230 EGVCCDVYLPRRAFFLMSGNCRYRWMHGIRPEHVRGRRISLTFRELSDEILVDTKASEMV 289

Query: 349 FEKAKCF 355
              A  F
Sbjct: 290 LSAASHF 296


>gi|358337469|dbj|GAA55822.1| alkylated DNA repair protein alkB homolog 4 [Clonorchis sinensis]
          Length = 183

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 62/99 (62%), Gaps = 4/99 (4%)

Query: 153 KQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQEDVLSH----FQTIEQCTLEYNSE 208
           +Q++GPK NFK+Q++  G F G P+ ++F+ DR +Q    SH    F  +E C LEY  E
Sbjct: 12  RQDYGPKVNFKRQRVVPGRFTGLPSYSRFLVDRINQTIQTSHLRQTFIPVELCNLEYCPE 71

Query: 209 NGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
            G+SI  H+DD W+WG+R+VT+NL   + L  T  N  S
Sbjct: 72  RGASIVPHLDDAWLWGDRLVTLNLCSSTTLIFTLPNPGS 110


>gi|294944511|ref|XP_002784292.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239897326|gb|EER16088.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 252

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 68/123 (55%), Gaps = 2/123 (1%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD-FQGFPA 177
           G+ +  D ++EAE   ++  +   PW  SQSGRRKQ++GP+ NFKK+K+K  D FQG P 
Sbjct: 55  GLVVLADAITEAEEATLLGDIYARPWKLSQSGRRKQDYGPQVNFKKRKLKCPDNFQGLPH 114

Query: 178 CTKFIQDRFHQE-DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDS 236
               +  R H    +L      E    EY    GSSID H+D  W+W + I+ ++L  D 
Sbjct: 115 SIDLVLPRIHTGLGLLLDHAWHEMVVQEYAVSRGSSIDLHVDHSWVWADGILDLSLAADC 174

Query: 237 VLT 239
           ++ 
Sbjct: 175 IMA 177


>gi|345316254|ref|XP_003429720.1| PREDICTED: LOW QUALITY PROTEIN: probable
           alpha-ketoglutarate-dependent dioxygenase ABH4-like
           [Ornithorhynchus anatinus]
          Length = 181

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/107 (40%), Positives = 63/107 (58%), Gaps = 7/107 (6%)

Query: 70  GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
           G  + ++ L   T   +VYCP    A QG++    Q      G++F   GV + EDF+S 
Sbjct: 3   GPRLGQEPLPDPTVARFVYCPETGLA-QGAEATEVQ------GRAFPFPGVTLLEDFVSA 55

Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
            E   +++ +D  PW  SQSGRRKQ++GPK NF+K+K+K+G F G P
Sbjct: 56  EEEARMVRLMDRDPWKPSQSGRRKQDYGPKVNFRKRKLKVGAFDGLP 102



 Score = 38.1 bits (87), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 36/73 (49%)

Query: 276 LNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREF 335
           L   + D +         + + +P RSLLV+ G  R+ W+H + R  V  RRVC  +RE 
Sbjct: 93  LKVGAFDGLPXXXXXXXXVAVGLPRRSLLVVHGPGRHRWRHGIRRRHVRARRVCATFREL 152

Query: 336 TAPYMNNEATASV 348
           +A +    A A +
Sbjct: 153 SAEFGPGGARAPL 165


>gi|380023187|ref|XP_003695407.1| PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase
          ABH4-like [Apis florea]
          Length = 103

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/82 (47%), Positives = 54/82 (65%), Gaps = 2/82 (2%)

Query: 1  MDRSNFCGCKGVRTCIKCEQEFGYENKNIVEFTEE--TYIYCPHCNKAWQGSDLYAYQSH 58
          M+    CGCKG+R+C+ CE E+     N+    E+  +Y+YCP+C KAW G +L  Y++H
Sbjct: 1  METIRPCGCKGIRSCLLCETEYKIVKPNLKACFEKCSSYVYCPYCEKAWPGWNLDLYKNH 60

Query: 59 PNHSGKSFDIGGVYIKEDFLTS 80
          PNH G + +  GVYIK DFLTS
Sbjct: 61 PNHQGTAIEFPGVYIKLDFLTS 82


>gi|428170226|gb|EKX39153.1| hypothetical protein GUITHDRAFT_143755 [Guillardia theta CCMP2712]
          Length = 156

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/118 (42%), Positives = 69/118 (58%), Gaps = 8/118 (6%)

Query: 109 NHSGKSFDIGGVYIKEDFLSEAETEDVMKQLD--DLPWDKSQSGRRKQNFGPKCNFKKQK 166
            H G SF   GV + E+F++E      ++++D  D  W  SQSGRRKQ FGPK NFKK++
Sbjct: 42  GHMGCSFQ--GVTVVEEFITEEVEGCWVREMDRDDRLWILSQSGRRKQEFGPKVNFKKEE 99

Query: 167 IKLGDFQG--FPACTK-FIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
           +K+ D  G  FP+ ++ F+Q       +LS F+ +E   LEY    GS I  HIDD W
Sbjct: 100 VKI-DPSGTIFPSWSQEFLQLCSSSSSLLSDFEAVELGLLEYEPLRGSCIAPHIDDSW 156


>gi|256052601|ref|XP_002569851.1| hypothetical protein [Schistosoma mansoni]
          Length = 103

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 50/78 (64%), Gaps = 1/78 (1%)

Query: 110 HSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKL 169
           H G  F    + +  +F+ E E   +++++D   W  SQSGRRKQ++GPK NFK+QK+ +
Sbjct: 16  HQG-GFPFLDIEVIPNFIDENEEAMLVEEIDKQTWVLSQSGRRKQDYGPKVNFKRQKVHI 74

Query: 170 GDFQGFPACTKFIQDRFH 187
           G F G PA ++F+  R++
Sbjct: 75  GGFYGLPAYSRFLITRYN 92


>gi|255557783|ref|XP_002519921.1| nucleic acid binding protein, putative [Ricinus communis]
 gi|223540967|gb|EEF42525.1| nucleic acid binding protein, putative [Ricinus communis]
          Length = 421

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 63/241 (26%), Positives = 109/241 (45%), Gaps = 47/241 (19%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
            +I G+Y+  DF+S  E ++++  +D  PW KS S RR Q++G +  ++ + +    +LG
Sbjct: 153 LNIPGLYLLPDFVSAEEEQELLAAVDARPW-KSLSKRRVQHYGYEFCYQTRNVDTKQQLG 211

Query: 171 DFQGFPACTKFIQDRFHQEDVL--SHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERI 227
           +    P     + +R      L  S    ++Q T+ EY    G  +  HID    +   I
Sbjct: 212 EL---PPFISLVLERILSFPELGESASSILDQLTVNEY--PRGVGLSPHIDTHSAFEGSI 266

Query: 228 VTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSL-NEKSTDSIVN 286
            +++L G  ++    Y+ DS                   K+T  N++ + N  +  ++V 
Sbjct: 267 FSLSLAGPCIMEFRRYSDDSRA----------------PKATTNNEMIIENPDNGTNLVR 310

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYREFT 336
                    I +P RS+L+L GEARY WQH +       +  +VI    RRV   +R+  
Sbjct: 311 RA-------IYLPPRSMLLLSGEARYAWQHYIPHHKIDTVNGNVITRGSRRVSFTFRKVR 363

Query: 337 A 337
           A
Sbjct: 364 A 364


>gi|428169970|gb|EKX38899.1| hypothetical protein GUITHDRAFT_115004 [Guillardia theta CCMP2712]
          Length = 568

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 6/142 (4%)

Query: 113 KSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
           K   + G+ +KEDFLS  E+ED + QLD+L W+ S S RR Q+FG   ++  +++     
Sbjct: 147 KDLVVEGLILKEDFLSPEESEDTLNQLDELEWESSLS-RRVQHFGFTFDYATRRVNTLKT 205

Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVN 231
           + FP     I  R  +E +L      +QCT+ EY  + G  I +H+D    + + I++V+
Sbjct: 206 RAFPPFLLRIAQRALREKLL--LFEPDQCTVNEY--QPGQGIRSHVDTHSAFEDGILSVS 261

Query: 232 LIGDSVLTLTYYNGDSNKYNLN 253
           L    V+     +G S    L 
Sbjct: 262 LGSSVVMEFRSPDGLSKNVQLK 283


>gi|225427651|ref|XP_002270503.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Vitis
           vinifera]
          Length = 349

 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 36/232 (15%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
            +I G+Y+  DF+S  E E+++  +D + W KS S RR Q++G +  ++ + +    + G
Sbjct: 124 LNIPGIYLLHDFVSAKEEEELLAAVDKMSW-KSLSKRRVQHYGYEFCYETRNVNTKQYLG 182

Query: 175 -FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTVNL 232
             P+    I +R      L     I    L  N    G  +  HID    +   I +++L
Sbjct: 183 KLPSFVSAIVERISSFPNLESAADIVLDQLTVNEYPPGVGLSPHIDTHSAFEGFIFSLSL 242

Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
            G  ++    Y       + +S   S +  YPD+ S+ +                     
Sbjct: 243 AGPCIMDFRRYTEGVWPKSASSSDMSVE--YPDKSSSFLRR------------------- 281

Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYRE 334
              I +P RS+L+L GEARY W H +       ++  VI    RRV   +R+
Sbjct: 282 --AIYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRK 331


>gi|449455667|ref|XP_004145573.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Cucumis sativus]
 gi|449523025|ref|XP_004168525.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Cucumis sativus]
          Length = 344

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 98/217 (45%), Gaps = 40/217 (18%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S    DI G+++  DF++  E ED+++++D  PW+ + + RR Q++G +  ++ + +   
Sbjct: 115 SASELDIPGLFLLHDFVNAKEEEDLLREVDARPWN-NLAKRRVQHYGYEFCYQTRNVNTK 173

Query: 171 DFQG-FPACTKFIQDRF----HQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWG 224
              G  P+    + DR     + ED+     +++Q T+ EY    G  +  HID    + 
Sbjct: 174 HQLGELPSFVSHVVDRISMFPNTEDIAD--ASLDQLTVNEY--PPGVGLSPHIDTHSAFE 229

Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPD---QKSTIVNDLSLNEKST 281
             I +++L G                    PC    ++YP+    K     DL +     
Sbjct: 230 GLIFSLSLAG--------------------PCIMEFRRYPEGTWHKFPSSIDLKMENSVN 269

Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
           DS      + +   I +P RS+L+L GEARY W H +
Sbjct: 270 DS------NYLRKAIYLPPRSMLLLSGEARYVWHHYI 300


>gi|440796420|gb|ELR17529.1| alkB, alkylation repair protein, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 172

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 64/124 (51%), Gaps = 5/124 (4%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ +  D+L+  E   ++  LD+LPW   +  RR Q +G   +++KQ++        PAC
Sbjct: 19  GLTLVVDYLTVDEETALVNSLDELPW-LGEIQRRTQQYGYHYSYRKQRVDDTPVPPLPAC 77

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
            +F+ DRF Q+D   H   +    +    + G  I  HID    WG+ +V+++L+ D  +
Sbjct: 78  VRFLLDRF-QDD---HVPLVAHQLIINEYQPGEQIKPHIDSTTDWGDCVVSLSLLDDWDM 133

Query: 239 TLTY 242
             T+
Sbjct: 134 IFTH 137


>gi|297603304|ref|NP_001053775.2| Os04g0602700 [Oryza sativa Japonica Group]
 gi|38344149|emb|CAD41869.2| OSJNBa0041A02.16 [Oryza sativa Japonica Group]
 gi|116310931|emb|CAH67869.1| B0403H10-OSIGBa0105A11.21 [Oryza sativa Indica Group]
 gi|125549609|gb|EAY95431.1| hypothetical protein OsI_17273 [Oryza sativa Indica Group]
 gi|125591534|gb|EAZ31884.1| hypothetical protein OsJ_16049 [Oryza sativa Japonica Group]
 gi|255675751|dbj|BAF15689.2| Os04g0602700 [Oryza sativa Japonica Group]
          Length = 348

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/243 (25%), Positives = 103/243 (42%), Gaps = 51/243 (20%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S     + G+Y+  DF++ AE ++++  +D+ PW KS + RR Q++G +  ++ + +   
Sbjct: 119 SASELGVPGIYLVPDFVTAAEEQELLAAVDNRPW-KSLAKRRVQHYGFEFLYETRNVDSK 177

Query: 171 DFQG--FPACTKFIQDRFHQEDVLSHFQTIEQCT------LEYNS-ENGSSIDNHIDDCW 221
            F G   P  +K I       D +  F    +CT      L  N    G  +  HID   
Sbjct: 178 QFLGELPPFVSKII-------DKIMSFPGANKCTSKLVDQLTVNEYPCGVGLSPHIDTHS 230

Query: 222 IWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKST 281
            + E I +++L G                    PC    +KYP       + +S  +K  
Sbjct: 231 AFEEMIFSLSLAG--------------------PCIMEFRKYPKGSWRAPSMVSGTDK-- 268

Query: 282 DSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLR---SDVIG-------RRVCIA 331
           DSI  E    +   + +P RS+L++ GE RY W H +      DV G       RRV   
Sbjct: 269 DSI--EEPQCIRKAVFLPPRSMLLMSGEGRYAWHHYIPHHKIDDVGGQVIKRNTRRVSFT 326

Query: 332 YRE 334
           +R+
Sbjct: 327 FRK 329


>gi|168033740|ref|XP_001769372.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679292|gb|EDQ65741.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 334

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/214 (22%), Positives = 97/214 (45%), Gaps = 41/214 (19%)

Query: 114 SFDIG--GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGD 171
           S D+G  G+ +  DF++ +E +++++++D+  W ++ + RR Q++G +  +K + + L  
Sbjct: 110 SADVGVPGLTLYTDFVTLSEEQELLREVDERSW-QTLAKRRVQHYGYEFLYKTRNVDLSQ 168

Query: 172 FQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS------ENGSSIDNHIDDCWIWG 224
             G  PA  + + ++     ++    + E+ TL ++         G  +  HID      
Sbjct: 169 KLGELPASLEPLLNKI---SLVPEIASAEEPTLPFDQLTVNEYPRGVGLSPHID------ 219

Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSI 284
                         T + + G     +L  PC    +KY  +   + ND        DS+
Sbjct: 220 --------------THSAFEGSILSLSLGGPCIMEFRKYIKRAIVVRND--------DSV 257

Query: 285 VNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
              ++  +   + +P RSLL+L GEARY W H +
Sbjct: 258 PTGSVYIIRKALFLPPRSLLILSGEARYAWHHYI 291


>gi|296085482|emb|CBI29214.3| unnamed protein product [Vitis vinifera]
          Length = 912

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 60/237 (25%), Positives = 100/237 (42%), Gaps = 37/237 (15%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
            +I G+Y+  DF+S  E E+++  +D + W KS S RR Q++G +  ++ + +    + G
Sbjct: 687 LNIPGIYLLHDFVSAKEEEELLAAVDKMSW-KSLSKRRVQHYGYEFCYETRNVNTKQYLG 745

Query: 175 -FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTVNL 232
             P+    I +R      L     I    L  N    G  +  HID    +   I +++L
Sbjct: 746 KLPSFVSAIVERISSFPNLESAADIVLDQLTVNEYPPGVGLSPHIDTHSAFEGFIFSLSL 805

Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
            G  ++    Y       + +S   S +  YPD+ S+ +                     
Sbjct: 806 AGPCIMDFRRYTEGVWPKSASSSDMSVE--YPDKSSSFLRRA------------------ 845

Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYREF-TAP 338
              I +P RS+L+L GEARY W H +       ++  VI    RRV   +R+  T P
Sbjct: 846 ---IYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRKVRTGP 899


>gi|147834541|emb|CAN76404.1| hypothetical protein VITISV_021238 [Vitis vinifera]
          Length = 1036

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 36/232 (15%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG 174
            +I G+Y+  DF+S  E E+++  +D + W KS S RR Q++G +  ++ + +    + G
Sbjct: 766 LNIPGIYLLHDFVSAKEEEELLAAVDKMSW-KSLSKRRVQHYGYEFCYETRNVNTKQYLG 824

Query: 175 -FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTVNL 232
             P+    I +R      L     I    L  N    G  +  HID    +   I +++L
Sbjct: 825 KLPSFVSAIVERISSFPNLESAADIVLDQLTVNEYPPGVGLSPHIDTHSAFEGFIFSLSL 884

Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
            G  ++    Y       + +S   S +  YPD+ S+ +                     
Sbjct: 885 AGPCIMDFRRYTEGVWPKSASSSDMSVE--YPDKSSSFLRRA------------------ 924

Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYRE 334
              I +P RS+L+L GEARY W H +       ++  VI    RRV   +R+
Sbjct: 925 ---IYLPPRSMLLLSGEARYAWHHYIPHHKIDMVKDSVIRRGPRRVSFTFRK 973


>gi|221485447|gb|EEE23728.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 1030

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 35/51 (68%)

Query: 190 DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           ++L++FQ +E C LEY  + GS I+ H DD W+WG R+VT NL   ++L+ 
Sbjct: 775 ELLANFQPVELCLLEYVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSF 825



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 51/113 (45%), Gaps = 15/113 (13%)

Query: 70  GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
           GVY+  D L  Q +   +     N    G+        P   G + + GG         +
Sbjct: 589 GVYLLPDALERQEEAAILAWADGNME-TGAQSREATHKPQPRGDTREYGG---------D 638

Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
           AETE          W  SQSGRRK +FGP+ NFKK+++K G F GFP   K +
Sbjct: 639 AETEPRAGL-----WVLSQSGRRKIDFGPQVNFKKKRLKPGRFNGFPPFAKAL 686



 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA---PYMNNEATASV 348
           V IR+ +P RSL+V  G  RY W H V    +  RRV +  RE      P   NE     
Sbjct: 926 VEIRVLLPRRSLVVFEGHCRYTWTHAVRSHHIFSRRVAVTLRELAPEFLPGGENEEVGQK 985

Query: 349 FEKAKCFF 356
                 FF
Sbjct: 986 LLSLSAFF 993


>gi|237835449|ref|XP_002367022.1| hypothetical protein TGME49_046140 [Toxoplasma gondii ME49]
 gi|211964686|gb|EEA99881.1| hypothetical protein TGME49_046140 [Toxoplasma gondii ME49]
          Length = 1033

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 35/51 (68%)

Query: 190 DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           ++L++FQ +E C LEY  + GS I+ H DD W+WG R+VT NL   ++L+ 
Sbjct: 778 ELLANFQPVELCLLEYVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSF 828



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 51/113 (45%), Gaps = 15/113 (13%)

Query: 70  GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
           GVY+  D L  Q +   +     N    G+        P   G + + GG         +
Sbjct: 592 GVYLLPDALERQEEAAILAWADGNME-TGAQSREATHKPQPRGDTREYGG---------D 641

Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
           A+TE          W  SQSGRRK +FGP+ NFKK+++K G F GFP   K +
Sbjct: 642 AKTEPRAGL-----WVLSQSGRRKIDFGPQVNFKKKRLKPGRFNGFPPFAKAL 689



 Score = 44.7 bits (104), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA---PYMNNEATASV 348
           V IR+ +P RSL+V  G  RY W H V    +  RRV +  RE      P   NE     
Sbjct: 929 VEIRVLLPRRSLVVFEGHCRYTWTHAVRSHHIFSRRVAVTLRELAPEFLPGGENEEVGQK 988

Query: 349 FEKAKCFF 356
                 FF
Sbjct: 989 LLSLSAFF 996


>gi|221506304|gb|EEE31939.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 1033

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 35/51 (68%)

Query: 190 DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTL 240
           ++L++FQ +E C LEY  + GS I+ H DD W+WG R+VT NL   ++L+ 
Sbjct: 778 ELLANFQPVELCLLEYVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSF 828



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/113 (33%), Positives = 51/113 (45%), Gaps = 15/113 (13%)

Query: 70  GVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSE 129
           GVY+  D L  Q +   +     N    G+        P   G + + GG         +
Sbjct: 592 GVYLLPDALERQEEAAILAWADGNME-TGAQSREATHKPQPRGDTREYGG---------D 641

Query: 130 AETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
           AETE          W  SQSGRRK +FGP+ NFKK+++K G F GFP   K +
Sbjct: 642 AETEPRAGL-----WVLSQSGRRKIDFGPQVNFKKKRLKPGRFNGFPPFAKAL 689



 Score = 44.7 bits (104), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 29/68 (42%), Gaps = 3/68 (4%)

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA---PYMNNEATASV 348
           V IR+ +P RSL+V  G  RY W H V    +  RRV +  RE      P   NE     
Sbjct: 929 VEIRVLLPRRSLVVFEGHCRYTWTHAVRSHHIFSRRVAVTLRELAPEFLPGGENEEVGQK 988

Query: 349 FEKAKCFF 356
                 FF
Sbjct: 989 LLSLSAFF 996


>gi|357165655|ref|XP_003580453.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Brachypodium distachyon]
          Length = 349

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 60/263 (22%), Positives = 107/263 (40%), Gaps = 52/263 (19%)

Query: 90  PHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQS 149
           P   KA  G+ +         S     I GV++ ++F++ AE ++++  +D  PW K  +
Sbjct: 102 PAAPKARPGAPVLPVAL----SSSELGIPGVHLVQEFVTAAEEQELLAAVDSRPW-KRLA 156

Query: 150 GRRKQNFGPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCT------ 202
            RR Q++G +  ++ + +    F G  P+    + D+      +  F  ++ CT      
Sbjct: 157 KRRVQHYGYEFLYETRNVDSKQFLGELPSFVSKVLDK------IGAFPGVKNCTSKLVDQ 210

Query: 203 LEYNS-ENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQK 261
           L  N    G  +  HID    + E I++++L G  ++    Y   S +            
Sbjct: 211 LTVNEYPCGVGLSPHIDTHSAFEEMILSLSLAGSCIMEFRQYPKGSWR------------ 258

Query: 262 KYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLR- 320
                  +I N        TD +  +    +   I +P RS+L++ GE RY W H +   
Sbjct: 259 ----APPSIGN-------GTDEVSIQDPQCIRKAIFLPPRSMLLMSGEGRYAWHHYIPHH 307

Query: 321 --SDVIG-------RRVCIAYRE 334
              DV G       RRV   +R+
Sbjct: 308 KIDDVAGQVIKRNSRRVSFTFRK 330


>gi|401412938|ref|XP_003885916.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120336|emb|CBZ55890.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 951

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 33/52 (63%)

Query: 188 QEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLT 239
           ++ +LS FQ +E C LEY    GS I+ H DD W+WG R+VT  L   ++L+
Sbjct: 691 RKKLLSTFQPVELCLLEYVPSRGSHIEEHFDDFWLWGPRLVTFTLASSTILS 742



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/39 (64%), Positives = 30/39 (76%)

Query: 144 WDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFI 182
           W  SQSGRRK +FGPK NFKK+++KLG F GFP  TK +
Sbjct: 582 WALSQSGRRKIDFGPKVNFKKKRLKLGLFNGFPPFTKRL 620



 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 31/60 (51%)

Query: 292 VVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPYMNNEATASVFEK 351
           V IR+ +P RSL+V  G  RY W H +    +  RRV +  RE    ++    +A V +K
Sbjct: 848 VEIRVLLPRRSLVVCEGPCRYTWTHAIRSHHIFSRRVAVTLRELAPEFLPGGESAEVGQK 907


>gi|302758364|ref|XP_002962605.1| hypothetical protein SELMODRAFT_78643 [Selaginella moellendorffii]
 gi|300169466|gb|EFJ36068.1| hypothetical protein SELMODRAFT_78643 [Selaginella moellendorffii]
          Length = 315

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 86/215 (40%), Gaps = 50/215 (23%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S +   I G+ +  +F+S  E E +++++D  PW ++ + RR Q++G +  +  + +   
Sbjct: 101 SAQELGIPGLSLLTEFISSREEERLLQEVDARPW-QALAKRRVQHYGYEFLYNARNVDTS 159

Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS------ENGSSIDNHIDDCWIW 223
            F G FP    F+Q    +   ++  Q   + T  ++         G  +  HID    +
Sbjct: 160 KFLGEFP---DFLQPLLEKISSIAELQETSEATFPFDQLTVNEYPRGVGLSPHIDTHSAF 216

Query: 224 GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDS 283
              I++++L G  V+    Y  +       SP F  +  +                    
Sbjct: 217 QGSIISLSLAGPCVMEFRKYASEG-----VSPEFERKALF-------------------- 251

Query: 284 IVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
                         +P RSLL+L GE+RY W H +
Sbjct: 252 --------------LPQRSLLILSGESRYGWHHYI 272


>gi|302797440|ref|XP_002980481.1| hypothetical protein SELMODRAFT_112315 [Selaginella moellendorffii]
 gi|300152097|gb|EFJ18741.1| hypothetical protein SELMODRAFT_112315 [Selaginella moellendorffii]
          Length = 315

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 86/215 (40%), Gaps = 50/215 (23%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           S +   I G+ +  +F+S  E E +++++D  PW ++ + RR Q++G +  +  + +   
Sbjct: 101 SAQELGIPGLSLLTEFISSREEERLLQEVDARPW-QALAKRRVQHYGYEFLYNARNVDTS 159

Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS------ENGSSIDNHIDDCWIW 223
            F G FP    F+Q    +   ++  Q   + T  ++         G  +  HID    +
Sbjct: 160 KFLGEFP---DFLQPLLEKISSIAELQETSEATFPFDQLTVNEYPRGVGLSPHIDTHSAF 216

Query: 224 GERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDS 283
              I++++L G  V+    Y  +       SP F  +  +                    
Sbjct: 217 QGSIISLSLAGPCVMEFRKYASEG-----VSPEFERKALF-------------------- 251

Query: 284 IVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
                         +P RSLL+L GE+RY W H +
Sbjct: 252 --------------LPQRSLLILSGESRYGWHHYI 272


>gi|224074611|ref|XP_002304398.1| predicted protein [Populus trichocarpa]
 gi|222841830|gb|EEE79377.1| predicted protein [Populus trichocarpa]
          Length = 344

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 99/234 (42%), Gaps = 41/234 (17%)

Query: 116 DIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQ----KIKLGD 171
           +I G+Y+  DFLS  E ++++  +D+ PW    S RR Q++G +  +  +    K+ LG 
Sbjct: 119 NIPGLYLVHDFLSPQEEKELLAAVDERPW-IGLSKRRVQHYGYEFCYGTRNVDTKLHLGK 177

Query: 172 FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSIDNHIDDCWIWGERIVTV 230
               P+    + +R      L+   +I    L  N    G  +  HID    +   I ++
Sbjct: 178 L---PSFVSSVLERISLFPGLNSSTSILLDQLTVNEYPPGVGLSPHIDTHSAFEGLIFSL 234

Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS 290
           +L G  ++    Y   S                PD  S+    +  N  +  ++V  +L 
Sbjct: 235 SLAGPCIMEFRRYLDGS--------------WVPDAASSAYTKVE-NVDNCSNLVRRSLY 279

Query: 291 SVVIRIPMPARSLLVLWGEARYEWQHCV-------LRSDVI---GRRVCIAYRE 334
                  +P RS+L+L GEARY WQH +       +   VI    RRV   +R+
Sbjct: 280 -------LPPRSILLLSGEARYAWQHYIPHHKIDMVNQSVIRRGARRVSFTFRK 326


>gi|290982964|ref|XP_002674199.1| hypothetical protein NAEGRDRAFT_58773 [Naegleria gruberi]
 gi|284087788|gb|EFC41455.1| hypothetical protein NAEGRDRAFT_58773 [Naegleria gruberi]
          Length = 259

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 66/285 (23%), Positives = 117/285 (41%), Gaps = 56/285 (19%)

Query: 76  DFL--TSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETE 133
           DFL  + + K++ +     N    G+ +    S+   S    +IGG+YI E+ +  AE  
Sbjct: 7   DFLQTSDKKKQSTLKKIKSNSEVSGAPVKQISSYKTDS----EIGGLYIIENIIDVAEER 62

Query: 134 DVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKF------IQDRFH 187
            ++K +D   W+   S RR Q++G   N+  + +K    +  P  ++F      I+++  
Sbjct: 63  KLVKFIDSQKWNDEIS-RRTQHYGVSYNYGARGVKEA-LKVPPVPSEFSDLLEEIKNKEG 120

Query: 188 QEDVLSHFQTIEQCTLEYNSENGSS--IDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNG 245
            + + +  + I+   +  N   G+   I  H+D C  +G  I+ ++L  + V+       
Sbjct: 121 LDSIRNLMEGIDFKQVIINEYKGAKQGISKHVDHCQDFGPLILILSLGDECVMKF----- 175

Query: 246 DSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLV 305
                                K   V +  L +K       E   S      MP RSL++
Sbjct: 176 --------------------HKLEQVKEEDLKKKKVKR--TEVSPSECYDRRMPRRSLII 213

Query: 306 LWGEARYEWQHCVLRSDVIG-------------RRVCIAYREFTA 337
           L G+ARY++QH + ++ V               RRV I YR  T 
Sbjct: 214 LSGDARYQYQHEIPKTMVFKIDGKQFLKRSESYRRVSITYRSLTT 258


>gi|242077058|ref|XP_002448465.1| hypothetical protein SORBIDRAFT_06g027580 [Sorghum bicolor]
 gi|241939648|gb|EES12793.1| hypothetical protein SORBIDRAFT_06g027580 [Sorghum bicolor]
          Length = 349

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 47/242 (19%), Positives = 99/242 (40%), Gaps = 49/242 (20%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           +     I G+Y+ ++F++ AE ++++  +D   W K  + RR Q++G +  ++ + +   
Sbjct: 118 AASELGIPGIYMVQEFVTAAEEQELLSAVDSKTW-KRLAKRRVQHYGYEFLYETRNVDSK 176

Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSEN-------GSSIDNHIDDCWI 222
            F G  P     + ++      ++ F  ++ CT     +        G  +  HID    
Sbjct: 177 QFLGELPTFVSTVLEK------IASFPGVKDCTTSLVDQLTVNEYPCGVGLSPHIDTHSA 230

Query: 223 WGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD 282
           + E I +++L G  ++    Y   S +                   ++VN +  +     
Sbjct: 231 FEEMIFSLSLAGPCIMEFRKYTKGSWR-----------------APSVVNGVDEDSSQEP 273

Query: 283 SIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVL--RSDVIG--------RRVCIAY 332
             + + +        +P RS+L++ GE RY W H +   + D +G        RRV   +
Sbjct: 274 GCIRKAIF-------LPPRSMLLMSGEGRYAWHHYIPHHKIDAVGGQVIKRNSRRVSFTF 326

Query: 333 RE 334
           R+
Sbjct: 327 RK 328


>gi|297846510|ref|XP_002891136.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297336978|gb|EFH67395.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 344

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 91/207 (43%), Gaps = 36/207 (17%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
            +I G+++  DF+S  E + ++  +D  PW    + RR Q++G +  +  + +    +LG
Sbjct: 121 LNIPGLFLLPDFVSVEEEQQLLAAVDARPW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 179

Query: 171 DFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVT 229
           +   F +        F   D+ S    ++Q T+ EY S  G  +  HID    + + I +
Sbjct: 180 ELPSFVSPILERMSLFPNFDINSASLDLDQLTVNEYPS--GVGLSPHIDTHSAFEDCIFS 237

Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
           ++L G                    PC    ++Y    ST     +  EKS DS      
Sbjct: 238 LSLAG--------------------PCIMEFRRY--SVSTWKASSADAEKSGDS------ 269

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQH 316
           S +   + +P RS+L+L GEARY W H
Sbjct: 270 SCIRKALYLPPRSMLLLSGEARYAWNH 296


>gi|226498774|ref|NP_001148410.1| nucleic acid binding protein [Zea mays]
 gi|195619080|gb|ACG31370.1| nucleic acid binding protein [Zea mays]
          Length = 349

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 54/277 (19%), Positives = 108/277 (38%), Gaps = 54/277 (19%)

Query: 76  DFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDV 135
           D L  +        P   KA  G  +        H+     I G+Y+ ++F++ AE +++
Sbjct: 88  DLLAGRVLHIRYSVPVKPKARPGGSVPV-----AHAASELGIPGIYMVQEFVTAAEEQEL 142

Query: 136 MKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSH 194
           +  +D   W K  + RR Q++G +  ++ + +    F G  P     + ++      ++ 
Sbjct: 143 LSAVDSKTW-KRLAKRRVQHYGYEFLYETRNVDSKQFLGELPTFVSTVLEK------IAS 195

Query: 195 FQTIEQCTLEYNSEN-------GSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
           F  ++ C      +        G  +  HID    + E I +++L G  ++    Y    
Sbjct: 196 FPGVKDCATRLVDQLTVNEYPCGVGLSPHIDTHSAFEEMIFSLSLAGTCIMEFRKY---- 251

Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
            K    +P             ++V+ +  +       + +        I +P RS+L++ 
Sbjct: 252 TKGTWRAP-------------SVVDGVDEDSSQEPECIRKA-------IFLPPRSMLLMS 291

Query: 308 GEARYEWQHCVL--RSDVIG--------RRVCIAYRE 334
           GE RY W H +   + D +G        RRV   +R+
Sbjct: 292 GEGRYAWHHYIPHHKIDAVGGQVIKRNSRRVSFTFRK 328


>gi|171676605|ref|XP_001903255.1| hypothetical protein [Podospora anserina S mat+]
 gi|170936369|emb|CAP61027.1| unnamed protein product [Podospora anserina S mat+]
          Length = 298

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 94/206 (45%), Gaps = 17/206 (8%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDL-PWDKSQSGRR-KQNFGPKCNFKKQKIKLGDFQGFP 176
           G+ +  +F+S AE ++++     + P   + S RR  Q+FG   ++    I        P
Sbjct: 39  GLTLVHEFISPAEEQEMISAFHAISPLSPADSKRRISQHFGHHFDYTTFGIDESKHSPVP 98

Query: 177 ACTKFIQDRFHQE-DVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGD 235
           A      DR   + D     +  +Q T++Y    G+ I  H+D   ++GE + +++  G 
Sbjct: 99  AYITNFLDRLPVDTDGKEAGRKPDQFTVQYYPP-GAGIPPHVDTHSMFGEALYSLSF-GS 156

Query: 236 SVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVI- 294
            V  +   +G++    L  P  S Q+          +D ++N K    I+++    VV  
Sbjct: 157 GVPMIFRMSGENEARKLRLPKRSLQES---------SDGNVNGKVGGEILDKAEGVVVHP 207

Query: 295 --RIPMPARSLLVLWGEARYEWQHCV 318
              + +PARSLLV+ G +RY + H +
Sbjct: 208 AWELMLPARSLLVMRGASRYGYTHGI 233


>gi|414585597|tpg|DAA36168.1| TPA: nucleic acid binding protein [Zea mays]
          Length = 536

 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 54/277 (19%), Positives = 108/277 (38%), Gaps = 54/277 (19%)

Query: 76  DFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDV 135
           D L  +        P   KA  G  +        H+     I G+Y+ ++F++ AE +++
Sbjct: 275 DLLAGRVLHIRYSVPVKPKARPGGSVPV-----AHAASELGIPGIYMVQEFVTAAEEQEL 329

Query: 136 MKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSH 194
           +  +D   W K  + RR Q++G +  ++ + +    F G  P     + ++      ++ 
Sbjct: 330 LSAVDSKTW-KRLAKRRVQHYGYEFLYETRNVDSKQFLGELPTFVSTVLEK------IAS 382

Query: 195 FQTIEQCTLEYNSEN-------GSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDS 247
           F  ++ C      +        G  +  HID    + E I +++L G  ++    Y    
Sbjct: 383 FPGVKDCATRLVDQLTVNEYPCGVGLSPHIDTHSAFEEMIFSLSLAGTCIMEFRKY---- 438

Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
            K    +P             ++V+ +  +       + +        I +P RS+L++ 
Sbjct: 439 TKGTWRAP-------------SVVDGVDEDSSQEPECIRKA-------IFLPPRSMLLMS 478

Query: 308 GEARYEWQHCVL--RSDVIG--------RRVCIAYRE 334
           GE RY W H +   + D +G        RRV   +R+
Sbjct: 479 GEGRYAWHHYIPHHKIDAVGGQVIKRNSRRVSFTFRK 515


>gi|302847355|ref|XP_002955212.1| hypothetical protein VOLCADRAFT_119009 [Volvox carteri f.
           nagariensis]
 gi|300259504|gb|EFJ43731.1| hypothetical protein VOLCADRAFT_119009 [Volvox carteri f.
           nagariensis]
          Length = 753

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/250 (21%), Positives = 102/250 (40%), Gaps = 45/250 (18%)

Query: 100 DLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDL-------PW------DK 146
           ++ A  SHP   G+       Y+  +F++ AE  +++   DD        PW      + 
Sbjct: 523 EMRAEVSHPALEGQ-------YLVLEFVTPAEEAELLAMCDDPVLKPSWSPWIGQMYGNA 575

Query: 147 SQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH-QEDVLSHFQTIEQCTLEY 205
           +    R + +G   ++ ++ +   +    P   + + +R   Q  +L  FQ  E   ++Y
Sbjct: 576 TAQKTRGKRWGVLPDYHRRGVAPVEHP-LPPLLRILTERMRVQVGLLRCFQPNEANAIDY 634

Query: 206 NSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPD 265
               GS +  H+DD  + G+ IV ++L G +V+T   +  + +K     P     ++   
Sbjct: 635 WRSRGSWLRPHVDDRILSGDLIVNLSLGGAAVMT---FARERDKDEGGHPGLQQHQQQQQ 691

Query: 266 QKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVI- 324
                  D                    +R+ +  RSL +L   ARY + H +  SD++ 
Sbjct: 692 HPRQSAGD-------------------EVRVRLAPRSLQILSRAARYSYTHAIAASDLLD 732

Query: 325 GRRVCIAYRE 334
            RRV I +R 
Sbjct: 733 ARRVSITFRR 742


>gi|445456942|ref|ZP_21446201.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
           baumannii OIFC047]
 gi|444777446|gb|ELX01476.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Acinetobacter
           baumannii OIFC047]
          Length = 200

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 98/246 (39%), Gaps = 73/246 (29%)

Query: 108 PNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI 167
           P  + K  DI GV    DF+S  E  +++K +D  PW  +  GRR Q++G + N+KK   
Sbjct: 10  PKPTIKVPDILGVEYLSDFVSVVEETEILKNIDSQPW-ITDLGRRVQHYGYRYNYKKA-- 66

Query: 168 KLGDFQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS--------ENGSSIDNHIDD 219
           KL      P    ++     Q+D++      ++C+L+           E G  I +HID 
Sbjct: 67  KLDRHITLPPIPSWLIR--MQKDLM------DECSLDLPPNQLIINEYEPGQGITDHIDA 118

Query: 220 CWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEK 279
              +GE I+ V+L    V+  T     S + N     F  Q                   
Sbjct: 119 PDEFGETIIMVSLGSSCVMDFT-----STESNQKEAIFLEQ------------------- 154

Query: 280 STDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVL--RSDV-------IGRRVCI 330
                                RSLL++  +ARY+W+H +   ++D+         RRV +
Sbjct: 155 ---------------------RSLLMIRNDARYKWKHGIAKRKTDIWEGIQYPRSRRVSL 193

Query: 331 AYREFT 336
            +R  +
Sbjct: 194 TFRHVS 199


>gi|440789981|gb|ELR11271.1| DNA repair protein alkB, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 253

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 88/219 (40%), Gaps = 60/219 (27%)

Query: 124 EDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPACTKFIQ 183
           EDF++  E   +++ +D   W +    RR Q++G + ++ +Q I        P   + I 
Sbjct: 77  EDFITADEERALVQAIDAQEWSEKLH-RRTQHYGYEFDYSRQDINTSVPIELPVFAQQII 135

Query: 184 DRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYY 243
           ++  Q  +    Q I     EY    G  I+ HID    +G  +V+++L+   V+T T  
Sbjct: 136 EKMRQRGLPQFDQLIIN---EYTP--GQGINPHIDKTHCFGPCVVSLSLLSTCVMTFT-- 188

Query: 244 NGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSL 303
                                         L   EK            VV+R     RSL
Sbjct: 189 -----------------------------SLETGEK----------IPVVLR----PRSL 205

Query: 304 LVLWGEARYEWQHCV--LRSDVI-------GRRVCIAYR 333
           +VL G+ARY WQH +   R+D++        RRV + YR
Sbjct: 206 VVLRGQARYGWQHGIEPKRADIVAGKHTPRARRVSLTYR 244


>gi|281212158|gb|EFA86318.1| 2-oxoglutarate and Fe(II)-dependent oxygenase family protein
           [Polysphondylium pallidum PN500]
          Length = 168

 Score = 50.8 bits (120), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 49/228 (21%), Positives = 86/228 (37%), Gaps = 71/228 (31%)

Query: 116 DIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGF 175
           D+ G+ + E+F++E E   +++ +D   W  S+  RR Q +G    ++ + +   + +G 
Sbjct: 6   DVDGLRLIENFITEQEENSLIENIDRYDW-SSEIARRTQQYGYHYCYRLRGVDELNDEGA 64

Query: 176 PACT------KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVT 229
           P          F+ DR  +                                         
Sbjct: 65  PTTPPIPEFLNFLIDRLAK----------------------------------------- 83

Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
              + D V+  T+     N+YN N     +     D +ST+                   
Sbjct: 84  ---LSDPVIPFTFDQVIINEYNKNQKIQPHIDSVTDWESTV------------------- 121

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTA 337
             V I  P+P RSLLVL  +ARY W+H + +S   GRRV + +R++ +
Sbjct: 122 DEVEIETPLPRRSLLVLQDDARYVWKHGI-KSQNNGRRVSLTFRKYKS 168


>gi|42571709|ref|NP_973945.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|332193254|gb|AEE31375.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
          Length = 344

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 93/212 (43%), Gaps = 42/212 (19%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
            +I G+++  DF++ AE + ++  +D   W    + RR Q++G +  +  + +    +LG
Sbjct: 121 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 179

Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
           +   F      I +R   F   D  S    ++Q T+ EY S  G  +  HID    + + 
Sbjct: 180 ELPSF---VSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHIDTHSAFEDC 234

Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
           I +++L G                    PC    ++Y    ST     +  EKS DS   
Sbjct: 235 IFSLSLAG--------------------PCIMEFRRY--SVSTWKASTTDAEKSGDS--- 269

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
              S +   + +P RS+L+L GEARY W H +
Sbjct: 270 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 298


>gi|12597837|gb|AAG60147.1|AC074360_12 hypothetical protein [Arabidopsis thaliana]
          Length = 410

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 94/212 (44%), Gaps = 42/212 (19%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
            +I G+++  DF++ AE + ++  +D   W    + RR Q++G +  +  + +    +LG
Sbjct: 196 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 254

Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
           +    P+    I +R   F   D  S    ++Q T+ EY S  G  +  HID    + + 
Sbjct: 255 EL---PSFVSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHIDTHSAFEDC 309

Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
           I +++L G                    PC    ++Y    ST     +  EKS DS   
Sbjct: 310 IFSLSLAG--------------------PCIMEFRRY--SVSTWKASTTDAEKSGDS--- 344

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
              S +   + +P RS+L+L GEARY W H +
Sbjct: 345 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 373


>gi|256069452|ref|XP_002571149.1| oxygenase-related [Schistosoma mansoni]
 gi|238652196|emb|CAZ38834.1| oxygenase-related [Schistosoma mansoni]
          Length = 75

 Score = 48.9 bits (115), Expect = 0.004,   Method: Composition-based stats.
 Identities = 18/46 (39%), Positives = 32/46 (69%)

Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYREFTAPY 339
           +R+ +  RSL+V+ G ARY WQH + R+D+  RR+ + +RE ++ +
Sbjct: 3   VRVLLDRRSLVVVSGPARYIWQHEIRRTDITTRRIAMTFRELSSTF 48


>gi|62319760|dbj|BAD93744.1| hypothetical protein [Arabidopsis thaliana]
          Length = 431

 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 57/212 (26%), Positives = 94/212 (44%), Gaps = 42/212 (19%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
            +I G+++  DF++ AE + ++  +D   W    + RR Q++G +  +  + +    +LG
Sbjct: 208 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 266

Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
           +    P+    I +R   F   D  S    ++Q T+ EY S  G  +  HID        
Sbjct: 267 EL---PSFVSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHID-------- 313

Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
             T +   D V +L+          L  PC    ++Y    ST     +  EKS DS   
Sbjct: 314 --THSAFEDCVFSLS----------LAGPCIMEFRRY--SVSTWKASTTDAEKSGDS--- 356

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
              S +   + +P RS+L+L GEARY W H +
Sbjct: 357 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 385


>gi|324504403|gb|ADY41902.1| Alkylated DNA repair protein alkB 8 [Ascaris suum]
          Length = 578

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/202 (20%), Positives = 83/202 (41%), Gaps = 48/202 (23%)

Query: 120 VYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQ-NFGPKCNFKKQKIKLGDFQGFPAC 178
           + + ++F++E E + +M  L +  + K++   RK  +FG K N+   +  +      P  
Sbjct: 150 LRVLKNFINEDEEQQLMASLKECIYAKTKLKSRKVVHFGYKFNYDTNEADVPAENAIPQS 209

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVL 238
              + DR  +  + +  +  +Q T+    E G+ I +H+D    +G+ IV+++LI D V+
Sbjct: 210 CDVVIDRMIEAGIFT--ERPDQLTVNI-YEPGNGIPSHVDSHSPFGDTIVSLSLISDLVM 266

Query: 239 TLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPM 298
               +   S+ YN                                            + +
Sbjct: 267 EFRDFANTSSIYN--------------------------------------------VLL 282

Query: 299 PARSLLVLWGEARYEWQHCVLR 320
           P  SL V+ GE+RY W+H + +
Sbjct: 283 PRYSLAVMQGESRYRWKHGIAK 304


>gi|30692572|ref|NP_174442.2| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|42571711|ref|NP_973946.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|20259468|gb|AAM13854.1| unknown protein [Arabidopsis thaliana]
 gi|22136680|gb|AAM91659.1| unknown protein [Arabidopsis thaliana]
 gi|332193253|gb|AEE31374.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
 gi|332193255|gb|AEE31376.1| RNA recognition motif-containing protein [Arabidopsis thaliana]
          Length = 431

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 94/212 (44%), Gaps = 42/212 (19%)

Query: 115 FDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKI----KLG 170
            +I G+++  DF++ AE + ++  +D   W    + RR Q++G +  +  + +    +LG
Sbjct: 208 LNIPGLFLLPDFVTVAEEQQLLAAVDARHW-IGLAKRRVQHYGYEFCYGTRNVDTKKRLG 266

Query: 171 DFQGFPACTKFIQDR---FHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGER 226
           +    P+    I +R   F   D  S    ++Q T+ EY S  G  +  HID    + + 
Sbjct: 267 EL---PSFVSPILERIYLFPNFDNGSASLNLDQLTVNEYPS--GVGLSPHIDTHSAFEDC 321

Query: 227 IVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVN 286
           I +++L G                    PC    ++Y    ST     +  EKS DS   
Sbjct: 322 IFSLSLAG--------------------PCIMEFRRY--SVSTWKASTTDAEKSGDS--- 356

Query: 287 ETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
              S +   + +P RS+L+L GEARY W H +
Sbjct: 357 ---SCIKKALYLPPRSMLLLSGEARYAWNHYI 385


>gi|170044145|ref|XP_001849718.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867399|gb|EDS30782.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 153

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 184 DRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDD 219
           DRF Q  ++++FQTIEQ +LEY  E  +SID HIDD
Sbjct: 118 DRFEQVPLMANFQTIEQYSLEYTPERRASIDPHIDD 153


>gi|145345998|ref|XP_001417484.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577711|gb|ABO95777.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 347

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 51/244 (20%), Positives = 93/244 (38%), Gaps = 52/244 (21%)

Query: 99  SDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDD----LPWDKSQ--SGRR 152
           SD +A  +          + G YI E+F++E E   ++  LDD     PW  S      +
Sbjct: 131 SDAFAALARGAKLQAKVKLPGHYILENFITEDEERRIVDWLDDDIAAGPWRDSSFNGAHQ 190

Query: 153 KQNFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFHQ-EDVLSHFQTIEQCTLEYNSENGS 211
            + +G + N  K+ ++             +  +F    + L HF   E   + Y  + GS
Sbjct: 191 GKKYGVEPNLLKRCVEPARVPMPKILRDLVVAKFAAAHETLKHFTPNECNAINYRKDLGS 250

Query: 212 SIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIV 271
            +  H DD  +  + +V ++L  D  +T                 +S++K          
Sbjct: 251 VLTPHCDDRQLSSDILVNLSLCSDCTMT-----------------YSHEK---------- 283

Query: 272 NDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-RRVCI 330
                             +S  + + +P RSL +  G  RY++ H +   ++ G RRV +
Sbjct: 284 -----------------FASKRVDVRLPRRSLQIQSGSTRYDYMHSIANENLHGNRRVSV 326

Query: 331 AYRE 334
            +RE
Sbjct: 327 TFRE 330


>gi|380025248|ref|XP_003696389.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Apis
           florea]
          Length = 584

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 58/235 (24%), Positives = 97/235 (41%), Gaps = 71/235 (30%)

Query: 118 GGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSG----RRKQNFGPKCNFKKQKIKLGD-F 172
            G+ + EDF++E E + +   L  + W+  +S     R+ ++FG +  +   K+ L    
Sbjct: 126 SGIKLIEDFITEEEEKML---LSTITWNNEESSDLKHRKVKHFGYEFQYDTNKVDLDKPI 182

Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNL 232
              P   +F+Q  F Q   +S+    +Q T+ +    G  I  HID          T ++
Sbjct: 183 IPIPKNYQFLQTLFKQYHDVSY--EYDQLTINHYLP-GQGIPPHID----------THSV 229

Query: 233 IGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSV 292
             DS+L+L+          L S C             I+N    NEK++           
Sbjct: 230 FEDSILSLS----------LGSAC-------------IMNFKKENEKAS----------- 255

Query: 293 VIRIPMPARSLLVLWGEARYEWQH--CVLRSDVI-----------GRRVCIAYRE 334
              + +P RSLL++ GEARY W H  C   +D++           G RV   +R+
Sbjct: 256 ---LFLPPRSLLIMSGEARYAWSHGICPRHNDIVQTSNGITTQSRGTRVSFTFRK 307


>gi|115345714|ref|YP_771891.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
 gi|115293033|gb|ABI93483.1| oxidoreductase, putative [Roseobacter denitrificans OCh 114]
          Length = 195

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 31/121 (25%), Positives = 62/121 (51%), Gaps = 5/121 (4%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQG-FPA 177
           G+   E+++SE E   +++++D   W ++   RR Q++G + ++K ++    D+ G  P 
Sbjct: 16  GLTYLENYISEDEAGRLVQEIDAALW-RTDLKRRVQHYGYRYDYKARQAWREDYLGPLPE 74

Query: 178 CTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
             + + +R   E    HFQT+    +    + G  I  HID    +GE I +++L+   V
Sbjct: 75  LFQSLAERLTAE---GHFQTVPDQVIVNEYQPGQGISAHIDCQPCFGETIASLSLLSACV 131

Query: 238 L 238
           +
Sbjct: 132 M 132


>gi|255085018|ref|XP_002504940.1| predicted protein [Micromonas sp. RCC299]
 gi|226520209|gb|ACO66198.1| predicted protein [Micromonas sp. RCC299]
          Length = 353

 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 50/231 (21%), Positives = 94/231 (40%), Gaps = 41/231 (17%)

Query: 117 IGGVYIKEDFLSEAETEDVMKQLD-----DLPWDKSQ-SGR-RKQNFGPKCNFKKQKIKL 169
           + G ++  DF++  E   +++ LD       PW  S  +G+ R + +G + + K++ +  
Sbjct: 146 LPGHHLIPDFITPDEESRLIEYLDRDESDTNPWRPSNFNGKHRGKKWGVEVDLKRRTVAP 205

Query: 170 GDFQGFPACTKFIQDRF-HQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIV 228
            + +  PA  + + DR       L  F   E   ++Y+   G+ +  H+DD  +  + IV
Sbjct: 206 -ERRPLPALVRAVADRMPAAHPALRGFVPNEANAIDYDRRGGAELLPHVDDRQMSTDLIV 264

Query: 229 TVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNET 288
            ++L GD V+T   Y  D+ +               D +      +    +  D      
Sbjct: 265 NLSLAGDCVMT---YVEDAGR---------------DGRRGGWEGVPAGARRVDVF---- 302

Query: 289 LSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-RRVCIAYREFTAP 338
                    +P RSL V  G  R+ + H +   ++   RRV I +R    P
Sbjct: 303 ---------LPRRSLQVQSGPCRFNFAHSIRNENLRAPRRVSITFRRSQMP 344


>gi|255086679|ref|XP_002509306.1| predicted protein [Micromonas sp. RCC299]
 gi|226524584|gb|ACO70564.1| predicted protein [Micromonas sp. RCC299]
          Length = 418

 Score = 44.7 bits (104), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 52/228 (22%), Positives = 91/228 (39%), Gaps = 51/228 (22%)

Query: 114 SFDIGGVYIKEDFLSEAETEDVMKQLD-DLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDF 172
           +  + GV +  DF++E E  +++  +D D  W +  + RR  ++G   ++  +  +    
Sbjct: 192 TLGVPGVTLITDFVTEEEEREMLACVDSDERW-QGLAKRRVLHYGYAFDYGTRDAR-DKT 249

Query: 173 QGFPACTKFIQDRFHQEDVLSHFQTI--EQCTL-EYNSENGSSIDNHIDDCWIWGERIVT 229
              PA    +  R          +++  +Q T+ EY +  G  I  H+D    +G  I++
Sbjct: 250 SPMPAFVAGLLGRAASCGAPGACESVHCDQLTVNEYVA--GVGIAPHVDTHSAFGPTILS 307

Query: 230 VNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETL 289
           ++L G +V+    + G               +K P ++  I                   
Sbjct: 308 LSLAGRAVMEFRLHEGG--------------EKEPRERRAIS------------------ 335

Query: 290 SSVVIRIPMPARSLLVLWGEARYEWQHCVL---RSDVIGRRVCIAYRE 334
                   MP RSLLVL GEARY W H +    R  ++G   C A  E
Sbjct: 336 --------MPPRSLLVLHGEARYRWLHYIPHRKRDAIVGEDECEAREE 375


>gi|356531539|ref|XP_003534335.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Glycine max]
          Length = 342

 Score = 44.3 bits (103), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 94/225 (41%), Gaps = 31/225 (13%)

Query: 97  QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNF 156
           Q SDL       + +     I G+++  DF+S  E E++++ +D  PW+ S S RR Q++
Sbjct: 102 QASDLVPV----SITASDVSIPGLFLVHDFISAKEEEELLQAVDCRPWN-SLSKRRVQHY 156

Query: 157 GPKCNFKKQKIKLGDFQG-FPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNS-ENGSSID 214
           G +  +  + +      G  P+    I DR        + + I    L  N    G  + 
Sbjct: 157 GYEFRYDIRNVNTRHCLGELPSFVSPILDRISSCPTFENVKNIVLDQLTVNEYPPGVGLS 216

Query: 215 NHIDDCWIWGERIVTVNLIGDSVLTLTYY-NGDSNKYNLNSPCFSYQKKYPDQKSTIVND 273
            HID    + + I +++L G  ++    Y NGD                 P   S+ V  
Sbjct: 217 PHIDTHSAFEDLIFSLSLSGPCIMEFRRYENGD---------------WLPRVASSAVAK 261

Query: 274 LSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
           +   E  ++ I           I +P RSLL+L GEARY W H +
Sbjct: 262 IENPEDQSNFIRRA--------IYLPPRSLLLLSGEARYAWHHYI 298


>gi|256822586|ref|YP_003146549.1| type 12 methyltransferase [Kangiella koreensis DSM 16069]
 gi|256796125|gb|ACV26781.1| Methyltransferase type 12 [Kangiella koreensis DSM 16069]
          Length = 260

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 73/160 (45%), Gaps = 19/160 (11%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           GV + EDF+ +A+ + + +++      KSQ+   +QN     +F KQ  +L D   FPAC
Sbjct: 81  GVDVVEDFIQQAKIDAISQRV------KSQAEFVRQNM---ADFLKQNTELFDVIVFPAC 131

Query: 179 TKFIQDRFHQEDVLSH------FQTIEQ--CTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
           T  + DR    D+LS       F  +E    +  +N  +G  +  ++DD     +   T 
Sbjct: 132 TDVLGDRVESLDILSKHINDDGFILVESDYKSHSFNIGSGVELTKNMDDVIAESKLQSTR 191

Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTI 270
            +I +  + +  Y  D+ +  L +     + K P+ K  +
Sbjct: 192 KIIWEQKILIDIYRRDTQR--LTTRVNELKVKQPEIKREL 229


>gi|350855225|emb|CCD58057.1| hypothetical protein Smp_197790 [Schistosoma mansoni]
          Length = 44

 Score = 42.4 bits (98), Expect = 0.33,   Method: Composition-based stats.
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 155 NFGPKCNFKKQKIKLGDFQGFPACTKFIQDRFH 187
           ++GPK NFK+QK+ +G F G PA ++F+  R++
Sbjct: 1   DYGPKVNFKRQKVHIGGFYGLPAYSRFLITRYN 33


>gi|307180204|gb|EFN68237.1| Alkylated DNA repair protein alkB-like protein 8 [Camponotus
           floridanus]
          Length = 604

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 51/216 (23%), Positives = 91/216 (42%), Gaps = 64/216 (29%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQS-------GRRKQNFGPKCNFKKQKIKLGD 171
           G+ + E+F+++ + E +++ L+   WD+ +S        R+ ++FG +  +    +   D
Sbjct: 140 GLTLIENFITKKQEETLLRTLN---WDECESVTSLQLKHRQVKHFGYEFEYGTNIVNPDD 196

Query: 172 -FQGFPACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTV 230
                P   +F+Q  F++ +   H    +Q T+      G  I  HID          T 
Sbjct: 197 PIAPIPQDYEFLQTLFNKHN---HKYKYDQLTIN-KYLPGQGIPPHID----------TH 242

Query: 231 NLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLS 290
           ++  D++L+L+          L S C                   +N K  +  +N    
Sbjct: 243 SVFEDTILSLS----------LGSACI------------------MNFKKENQNIN---- 270

Query: 291 SVVIRIPMPARSLLVLWGEARYEWQH--CVLRSDVI 324
                + +PARSLL++ GEARY W H  C   +DVI
Sbjct: 271 -----VFLPARSLLIMTGEARYAWTHGICSRHNDVI 301


>gi|326435968|gb|EGD81538.1| hypothetical protein PTSG_11856 [Salpingoeca sp. ATCC 50818]
          Length = 656

 Score = 41.2 bits (95), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 293 VIRIPMPARSLLVLWGEARYEWQHCV 318
            + + +PARSLLV+ GEARY W HC+
Sbjct: 597 AVELHLPARSLLVVRGEARYRWTHCI 622


>gi|312081720|ref|XP_003143146.1| hypothetical protein LOAG_07565 [Loa loa]
          Length = 129

 Score = 41.2 bits (95), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 26/38 (68%)

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQ 154
           I G+++  DFL+EAE  D++  +D   W  SQSGRRKQ
Sbjct: 92  IPGLHVITDFLTEAEEMDLINVIDRTDWVLSQSGRRKQ 129


>gi|67609385|ref|XP_666959.1| 2OG-Fe oxygenase superfamily (67.4 kD) (3E801) [Cryptosporidium
           hominis TU502]
 gi|54658038|gb|EAL36728.1| 2OG-Fe oxygenase superfamily (67.4 kD) (3E801) [Cryptosporidium
           hominis]
          Length = 350

 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 54/233 (23%), Positives = 96/233 (41%), Gaps = 40/233 (17%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ + EDF+++ E  +++  +D+    +++  R+ Q++G   ++  + I     +  P  
Sbjct: 123 GLVLIEDFINKLEAIELLNWIDNNGQWETKLNRKVQHYGYSFDYNNKTISSVWERDIPPI 182

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
              + +R     +++  +  +Q T+ EY  E G  I  HID     GE I  ++L G  +
Sbjct: 183 LNRLIERMLSLKIIT--EVPDQITINEY--EVGKGIGPHIDSHHTIGENISVISL-GSGI 237

Query: 238 LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           L    +N  S + NL                    D S  E S     N      V    
Sbjct: 238 LF--EFNELSKRKNL--------------------DFSSKEGSGSRKYNRISKRTVY--- 272

Query: 298 MPARSLLVLWGEARYEWQHCV--LRSDVI-------GRRVCIAYREFTAPYMN 341
           +P  SL ++  E RY W+H +   + D I        RRV I  R++   + N
Sbjct: 273 IPENSLYIMKNEIRYAWEHGIKSRKYDKIQGKFQQRKRRVSITIRKYIESHYN 325


>gi|330819033|ref|XP_003291570.1| hypothetical protein DICPUDRAFT_99071 [Dictyostelium purpureum]
 gi|325078238|gb|EGC31900.1| hypothetical protein DICPUDRAFT_99071 [Dictyostelium purpureum]
          Length = 256

 Score = 41.2 bits (95), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 52/224 (23%), Positives = 96/224 (42%), Gaps = 32/224 (14%)

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQ-GF 175
           I G+ I E+  ++   + +  +++   WD     RR Q++G K N+K +++K  D    F
Sbjct: 36  IKGLKIIENIFNDEYHDKLWAEVNKCEWDDKSLLRRTQHYGYKYNYKSRQLKDEDIAPPF 95

Query: 176 PACTKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIG 234
           P     +     ++ +++   T +Q  + EY    G  I  H+ D  I+G  I ++    
Sbjct: 96  PQWANDLIMLLLEKQIIN--DTPKQLIVNEYKDNQG--ISAHV-DSKIFGNLIFSI---- 146

Query: 235 DSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLN--EKSTDSIVNETLSSV 292
                           +L S C    KK  + KS   +D + N  +KS  +  N T + V
Sbjct: 147 ----------------SLGSKCKMVFKKKIENKSNNTDDNTSNSKDKSKATSTNNTKNKV 190

Query: 293 ---VIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCIAYR 333
               I   +   S+++L  E+RY W H + +      R+ + +R
Sbjct: 191 EYEKIEKELLPNSIVLLQDESRYLWTHEIPKLKKGDHRISLTFR 234


>gi|322799222|gb|EFZ20637.1| hypothetical protein SINV_06026 [Solenopsis invicta]
          Length = 597

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 53/236 (22%), Positives = 92/236 (38%), Gaps = 74/236 (31%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQS------GRRKQNFGPKCNFKKQKIKLGDF 172
           G+ + E+F+++ + E ++  L+   WD+ +S       R+ ++FG +  +    +     
Sbjct: 137 GLMLIENFITQEQEETLLGTLN---WDECESVSSQLKHRQVKHFGYEFEYGTNMVDPDKP 193

Query: 173 QGFPACT-KFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGERIVTVN 231
                   KF+Q  F   D   H  T +Q T+     +G  I +HID   ++ + I++++
Sbjct: 194 IPPIPEDYKFLQTLF---DKHGHKYTYDQLTIN-KYLSGQGIPSHIDTHSVFEDTILSLS 249

Query: 232 LIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSS 291
           L    V+                       K  DQK                        
Sbjct: 250 LGSAYVMNF---------------------KKEDQK------------------------ 264

Query: 292 VVIRIPMPARSLLVLWGEARYEWQH--CVLRSDVI-----------GRRVCIAYRE 334
             I + +PARSLL++ GEARY W H  C   SD++           G RV   +R+
Sbjct: 265 --IDVLLPARSLLIMTGEARYAWTHGICPRHSDIVKTENGTTTRERGTRVSFTFRK 318


>gi|428311031|ref|YP_007122008.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
 gi|428252643|gb|AFZ18602.1| alkylated DNA repair protein [Microcoleus sp. PCC 7113]
          Length = 206

 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 71/156 (45%), Gaps = 17/156 (10%)

Query: 196 QTIEQCTLEYNSENGSSI-DNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNS 254
           + I++   +Y+ ++GS +   H+ +   W +  V V L  D  +         N+Y    
Sbjct: 55  RRIQEYGYKYDYKDGSFVASTHLGNLPDWAQN-VAVRLAEDGFMVNVPEQVIVNEYQPGQ 113

Query: 255 PCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVV--IRIPMPARSLLVLWGEARY 312
              S+    P   +TI+  LSL    ++ ++N T S     + I + A SLL+  GEARY
Sbjct: 114 GIVSHTDCIPCFGNTIIT-LSL---GSECVMNFTHSQTQKEVGILLQAGSLLIFKGEARY 169

Query: 313 EWQHCVL---------RSDVIGRRVCIAYREFTAPY 339
            W+H ++         R  +  RR+ + +RE   PY
Sbjct: 170 IWKHGIVPRKRDNYKGRIFMRTRRISMTFREVLFPY 205


>gi|170079593|ref|YP_001736226.1| 2OG-Fe(II) oxygenase family oxidoreductase [Synechococcus sp. PCC
           7002]
 gi|169887262|gb|ACB00971.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Synechococcus sp. PCC
           7002]
          Length = 204

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 64/126 (50%), Gaps = 30/126 (23%)

Query: 225 ERIVTVNLIG---DSVLTLTYYNGDSNKYNLN-SPCFSYQKKYPDQKSTIVNDLSLNEK- 279
           E++V++NLI    D  +   Y  G     +++  PCF+          TI++ LSLN   
Sbjct: 90  EQMVSLNLISELPDQAIVNEYLPGQGITSHVDCKPCFT---------DTIIS-LSLNAPC 139

Query: 280 --STDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV--LRSD-------VIGRRV 328
             + DSIVN    S +++     RSL++L GE+RY W+H +   +SD       +  RR+
Sbjct: 140 IMNFDSIVNNERQSKLLK----PRSLVILQGESRYLWKHGIPPRKSDQWNGQKIMRDRRI 195

Query: 329 CIAYRE 334
            I +R+
Sbjct: 196 SITFRK 201


>gi|255084479|ref|XP_002508814.1| chromodomain-containing protein [Micromonas sp. RCC299]
 gi|226524091|gb|ACO70072.1| chromodomain-containing protein [Micromonas sp. RCC299]
          Length = 742

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 57/131 (43%), Gaps = 15/131 (11%)

Query: 215 NHIDDCWIWGERIVTVNLIGDSVLTLTYY----NGDSNKYNLNSPCFSYQKKYPDQKSTI 270
           +H+D    WGE +V V+L  D VL  T       G  N+   N+   +    YP +    
Sbjct: 441 SHLDSSERWGEVVVGVSLGADGVLFFTPNCRSDRGIGNRIR-NAALENDFADYPTESGVT 499

Query: 271 VNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIGRRVCI 330
            +D   N +   S VN+   S  + + +P RS+ V+ G AR + +H +        R   
Sbjct: 500 ESDSWRNGRG--SRVNKGQKSWALELEIPRRSVYVMSGAARTDLEHGI--------RAVK 549

Query: 331 AYREFTAPYMN 341
           A  + +AP  N
Sbjct: 550 APSQESAPAWN 560


>gi|332027277|gb|EGI67361.1| Alkylated DNA repair protein alkB-like protein 8 [Acromyrmex
           echinatior]
          Length = 583

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 23/33 (69%), Gaps = 2/33 (6%)

Query: 294 IRIPMPARSLLVLWGEARYEWQH--CVLRSDVI 324
           I + +PARSLL++ GEARY W H  C   SDVI
Sbjct: 283 IDVLLPARSLLIMTGEARYAWTHGICPRYSDVI 315


>gi|116197553|ref|XP_001224588.1| hypothetical protein CHGG_06932 [Chaetomium globosum CBS 148.51]
 gi|88178211|gb|EAQ85679.1| hypothetical protein CHGG_06932 [Chaetomium globosum CBS 148.51]
          Length = 307

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/144 (26%), Positives = 66/144 (45%), Gaps = 19/144 (13%)

Query: 199 EQCTLEYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFS 258
           +Q T++Y    GS I  H+D   ++GE + +++  G +V      +G ++   +  P  S
Sbjct: 107 DQFTVQYYPP-GSGIPPHVDTHSMFGEALYSLSF-GSAVPMQFRMSGANDARKMRLPKRS 164

Query: 259 YQKKYPDQKSTIVNDLSLNEKSTDSIVN--------ETLSS-------VVIRIPMPARSL 303
            Q++   Q+     + + N  S  S           +TL             I +PARSL
Sbjct: 165 VQQQQKQQQQQQQQNTTTNPPSPPSATTTSQEPPQPQTLPPSDPSDHHPTWEILLPARSL 224

Query: 304 LVLWGEARYEWQHCV--LRSDVIG 325
           L++ G ARY + H +   ++DVIG
Sbjct: 225 LLMTGAARYGYTHAIRPRKTDVIG 248


>gi|164423854|ref|XP_962305.2| hypothetical protein NCU07663 [Neurospora crassa OR74A]
 gi|157070260|gb|EAA33069.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 510

 Score = 39.7 bits (91), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 36/67 (53%), Gaps = 8/67 (11%)

Query: 277 NEKSTDSIVN-ETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-------RRV 328
           N  ST S  N ++L+   I I +P  SLLV+  + + EW+HCV  +  I        RR+
Sbjct: 330 NSSSTTSNANLDSLNEGQISIHLPHNSLLVMHADMQEEWKHCVTPAQAIDPHPIAGKRRI 389

Query: 329 CIAYREF 335
            + YR++
Sbjct: 390 NVTYRDY 396


>gi|357484481|ref|XP_003612528.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
 gi|355513863|gb|AES95486.1| Alkylated DNA repair protein alkB-like protein [Medicago
           truncatula]
          Length = 344

 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 89/212 (41%), Gaps = 31/212 (14%)

Query: 111 SGKSFDIGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLG 170
           +    +I G+Y+  DF+S  E E++++ +D  PW+ S + RR Q++G +  +  + +   
Sbjct: 112 TASDLNIPGLYLLHDFVSATEEEELLQSVDSRPWN-SLAKRRVQHYGYEFCYDIRNVNTK 170

Query: 171 DFQG-FPACTKFIQDRFHQEDVLSHFQ----TIEQCTL-EYNSENGSSIDNHIDDCWIWG 224
              G  P+    I +R        +       ++Q T+ EY    G  +  HID    + 
Sbjct: 171 RCLGELPSFLSPILERISSCPTFKNADPDRIVLDQLTVNEYPP--GVGLSPHIDTHSAFE 228

Query: 225 ERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSI 284
           + I +++L G  ++    Y GD +               P   S+I   +   E  ++ I
Sbjct: 229 DLIFSLSLAGPCIMEFRRY-GDGDWR-------------PRVASSIDTKVDCPEDGSNCI 274

Query: 285 VNETLSSVVIRIPMPARSLLVLWGEARYEWQH 316
                           RSLL+L GEARY W H
Sbjct: 275 KRAIYLP--------PRSLLLLSGEARYAWHH 298


>gi|47222373|emb|CAG05122.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 491

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 36/69 (52%), Gaps = 6/69 (8%)

Query: 34  EETYIYCPHCNKAW-QGSDLYAYQSHPNHSGKSFDIGGVYIKEDFLTSQTKETYVYCPHC 92
           +E    C +C K++   S LY +Q    HSGKS   GG+   +  L S  +E    CPHC
Sbjct: 127 DERQYRCSYCGKSYSHASSLYRHQQ--MHSGKS---GGLAQSKQVLESTPQEMRYTCPHC 181

Query: 93  NKAWQGSDL 101
             +++GS +
Sbjct: 182 GMSFKGSRM 190


>gi|303274827|ref|XP_003056728.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461080|gb|EEH58373.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 372

 Score = 39.3 bits (90), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 51/108 (47%), Gaps = 8/108 (7%)

Query: 211 SSIDNHIDDCWIWGERIVTVNLIGDSVLTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTI 270
           +S  +H D    WGE +V V+L   SV+   Y+  D    +  +   + + +   Q ++ 
Sbjct: 101 ASFKDHFDGRERWGETVVGVSLGQSSVI---YFCPDCRTQSGKALSDAVRSRRNTQYTS- 156

Query: 271 VNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCV 318
               S     T+ ++ ET  S  I I +P RS+ ++   ARYE +H +
Sbjct: 157 ----SDGPVKTELVIKETNVSFAIEIELPRRSIYIMSENARYELKHGI 200


>gi|440793749|gb|ELR14924.1| hypothetical protein ACA1_051460 [Acanthamoeba castellanii str.
           Neff]
          Length = 70

 Score = 38.9 bits (89), Expect = 3.8,   Method: Composition-based stats.
 Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 2/36 (5%)

Query: 295 RIPMPARSLLVLWGEARYEWQHCVLR--SDVIGRRV 328
           ++ +  RSLLVL G+ARYEW H V R   DV+G  V
Sbjct: 13  QVYLERRSLLVLEGDARYEWMHAVARRAEDVVGDTV 48


>gi|449469096|ref|XP_004152257.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Cucumis sativus]
 gi|449484295|ref|XP_004156842.1| PREDICTED: alkylated DNA repair protein alkB homolog 8-like
           [Cucumis sativus]
          Length = 260

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 55/239 (23%), Positives = 91/239 (38%), Gaps = 52/239 (21%)

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFP 176
           I G+++   FLS  +   ++  + +  W    S  +   FG   N     ++L D     
Sbjct: 50  IKGLWLCRFFLSPQQQSSLLSAIRNEGWFMEASQNQAMRFG---NLPTWAVELSD----- 101

Query: 177 ACTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIWGE-----RIVTVN 231
           +  + +    H  D           TL  +  NG  ID  +    +W E      I  V 
Sbjct: 102 SVYEAVLSSNHMTD-----------TLIVDRYNGDKIDCPLPSDILWREPLFDQMIANVY 150

Query: 232 LIGDSV---LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTD-SIVNE 287
             G+ +   + L  +       +L SPC             I++   +++ S D SI  E
Sbjct: 151 QPGEGICAHVDLMRFEDGIAIVSLESPC-------------IMHFTQVDKTSCDPSIKGE 197

Query: 288 T-LSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSD----------VIGRRVCIAYREF 335
             LS+  + + +   SL++LWGEARY W+H + R              GRR  I  R+ 
Sbjct: 198 VNLSTSKVPVYLNPGSLVILWGEARYHWKHEINRKPGFQIWEGQELAQGRRTSITLRKL 256


>gi|118368517|ref|XP_001017465.1| hypothetical protein TTHERM_00483520 [Tetrahymena thermophila]
 gi|89299232|gb|EAR97220.1| hypothetical protein TTHERM_00483520 [Tetrahymena thermophila
           SB210]
          Length = 199

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 65/139 (46%), Gaps = 16/139 (11%)

Query: 111 SGKSFD-IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKL 169
           S ++FD + G+   +  L+E E   + K++    W+ +Q  RR Q++G K ++  + I  
Sbjct: 11  SAQTFDQVQGLRYIDSILTEEEEVFIFKEIYQNEWN-TQLKRRTQHYGYKYDYSIKSIDK 69

Query: 170 GDFQG-FPA-----CTKFIQDRFHQEDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCWIW 223
             F G  P      C + I D+  +  V+     I +         G  I+ HID   I+
Sbjct: 70  NMFLGVLPKYAINFCQRLIDDKVIK--VMPDQMIINEYL------PGQGINPHIDKTDIF 121

Query: 224 GERIVTVNLIGDSVLTLTY 242
           GE I +V+L    ++ LTY
Sbjct: 122 GETIFSVSLGSGCIMKLTY 140


>gi|66362644|ref|XP_628288.1| F27M3_19 plant like RRM plus AlkB domain containing protein
           [Cryptosporidium parvum Iowa II]
 gi|46229757|gb|EAK90575.1| F27M3_19 plant like RRM plus AlkB domain containing protein
           [Cryptosporidium parvum Iowa II]
          Length = 350

 Score = 38.5 bits (88), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 51/233 (21%), Positives = 97/233 (41%), Gaps = 40/233 (17%)

Query: 119 GVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRKQNFGPKCNFKKQKIKLGDFQGFPAC 178
           G+ + EDF+++ E  +++  +D+    +++  R+ Q++G   ++  + I     +  P  
Sbjct: 123 GLVLVEDFINKLEAIELLDWIDNNGQWETKLNRKVQHYGYSFDYNNKTISSVWERDIPPI 182

Query: 179 TKFIQDRFHQEDVLSHFQTIEQCTL-EYNSENGSSIDNHIDDCWIWGERIVTVNLIGDSV 237
              + +R     +++  +  +Q T+ EY  E G  I  HID     GE I  ++L G  +
Sbjct: 183 LNRLIERMLSLKIIT--EVPDQITINEY--EVGKGIGPHIDSHHTIGENISVISL-GSGI 237

Query: 238 LTLTYYNGDSNKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIP 297
           L           +  N      ++K PD  S          +  D I   T       + 
Sbjct: 238 L-----------FEFNELS---KRKNPDCSSK----EGSGSRKYDRISKRT-------VY 272

Query: 298 MPARSLLVLWGEARYEWQHCV---LRSDVIG------RRVCIAYREFTAPYMN 341
           +P  SL ++  E RY W+H +   +   + G      RRV I  R++   + N
Sbjct: 273 IPENSLYIMKNEIRYAWEHGIKSRMYDKIQGKFQQRKRRVSITIRKYIESHYN 325


>gi|336470928|gb|EGO59089.1| hypothetical protein NEUTE1DRAFT_145169 [Neurospora tetrasperma
           FGSC 2508]
 gi|350291999|gb|EGZ73194.1| hypothetical protein NEUTE2DRAFT_165345 [Neurospora tetrasperma
           FGSC 2509]
          Length = 494

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 7/66 (10%)

Query: 277 NEKSTDSIVNETLSSVVIRIPMPARSLLVLWGEARYEWQHCVLRSDVIG-------RRVC 329
           N  +T +   ++L+   I I +P  SLLV+  + + EW+HCV  +  I        RR+ 
Sbjct: 336 NSSTTSNANLDSLNQGQISIHLPHNSLLVMHADMQEEWKHCVTPAQAIDPHPIAGKRRIN 395

Query: 330 IAYREF 335
           + YR++
Sbjct: 396 VTYRDY 401


>gi|402079391|gb|EJT74656.1| GRF zinc finger domain-containing protein [Gaeumannomyces graminis
           var. tritici R3-111a-1]
          Length = 465

 Score = 38.5 bits (88), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 7/53 (13%)

Query: 294 IRIPMPARSLLVLWGEARYEWQHCVLRSDVI-------GRRVCIAYREFTAPY 339
           I I +P  SLLV+  E + EW+HC+  +  I        RR+ + YR++ A +
Sbjct: 329 ISIHLPHNSLLVMHAEMQEEWKHCIAPAQAIQPHPVAGNRRINVTYRDYKAAF 381


>gi|290993472|ref|XP_002679357.1| predicted protein [Naegleria gruberi]
 gi|284092973|gb|EFC46613.1| predicted protein [Naegleria gruberi]
          Length = 191

 Score = 38.5 bits (88), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 59/140 (42%), Gaps = 18/140 (12%)

Query: 117 IGGVYIKEDFLSEAETEDVMKQLDDLPWDKSQSGRRK-QNFGPKCNFKKQKIKLGDFQGF 175
           + G+Y+ EDFLS  E  ++M +++   W K++S  R  Q +GPK + +   I   D    
Sbjct: 5   VPGLYLIEDFLSSTEATNLMNEINQQTWIKNRSQTRNIQIYGPKHD-QSYTIIPNDITPL 63

Query: 176 PACTKFIQDRFHQ--------------EDVLSHFQTIEQCTLEYNSENGSSIDNHIDDCW 221
           P   K +  R  +              E  L   +  E    EY  E+   +D H D   
Sbjct: 64  PEFLKELSKRILETTRNKLPQIDLKDYEPYLGIDKYTEIFINEYKPED--KLDQHFDHRS 121

Query: 222 IWGERIVTVNLIGDSVLTLT 241
            + E I  ++L   S LT T
Sbjct: 122 TYKEIIFGLSLECTSTLTFT 141


>gi|357612338|gb|EHJ67931.1| hypothetical protein KGM_21226 [Danaus plexippus]
          Length = 585

 Score = 38.1 bits (87), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 248 NKYNLNSPCFSYQKKYPDQKSTIVNDLSLNEKSTDSIVNETLSSVVIRIPMPARSLLVLW 307
           NKY+      S+  K+     TI   LSL+  S+  +  +  S   + + +P+RSLLV+ 
Sbjct: 210 NKYSPGQGIPSHVDKHSPFGDTI---LSLSLNSSVVMDWKHHSKAYVPVVVPSRSLLVMQ 266

Query: 308 GEARYEWQHCV 318
            EARY+WQH +
Sbjct: 267 AEARYDWQHGI 277


>gi|355755380|gb|EHH59127.1| Zinc finger protein 358 [Macaca fascicularis]
          Length = 451

 Score = 37.7 bits (86), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 46/107 (42%), Gaps = 9/107 (8%)

Query: 15  CIKCEQEFGYEN---KNIVEFTEETYIYCPHCNKAW-QGSDLYAYQSHPNHSGKSFDIGG 70
           C  C + FG  +   K++   T E    CP C KA+ Q S L  +Q     +    D G 
Sbjct: 163 CPVCAKGFGQGSALLKHLRTHTGERPYPCPQCGKAFGQSSALLQHQRTHTAARPCPDCGK 222

Query: 71  VYIKEDFLTSQ----TKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113
            + ++  L S     T E    CPHC+KA+  S       H  HSG+
Sbjct: 223 AFSQKSNLVSHRRIHTGERPYACPHCSKAFGQSSALLQHLH-VHSGE 268


>gi|417403069|gb|JAA48358.1| Putative c2h2-type zn-finger protein [Desmodus rotundus]
          Length = 589

 Score = 37.7 bits (86), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 47/111 (42%), Gaps = 15/111 (13%)

Query: 15  CIKCEQEFGYEN---KNIVEFTEETYIYCPHCNKAW-QGSDLYAYQ-----SHP---NHS 62
           C  C + FG  +   K++   T E    CPHC KA+ Q S L  +Q       P    H 
Sbjct: 310 CPVCAKGFGQGSALLKHLRTHTGERPYPCPHCAKAFGQSSALLQHQRTHTAERPYRCPHC 369

Query: 63  GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113
           GK+F  G     +  L   T E    CPHC KA+  S       H  HSG+
Sbjct: 370 GKAF--GQSSNLQHHLRIHTGERPYPCPHCAKAFGQSSALLQHLH-VHSGE 417


>gi|157123521|ref|XP_001660184.1| hypothetical protein AaeL_AAEL009522 [Aedes aegypti]
 gi|108874376|gb|EAT38601.1| AAEL009522-PA [Aedes aegypti]
          Length = 354

 Score = 37.7 bits (86), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 29/121 (23%), Positives = 50/121 (41%), Gaps = 14/121 (11%)

Query: 12  VRTCIKCEQEFGYE---NKNIVEFTEETYIYCPHCNKAWQGS----DLYAYQSHPN---- 60
           V  C  C ++F Y+   +K+    T+E    CP CN ++        L+++   P     
Sbjct: 186 VFVCASCPKQFLYKSLLDKHARVHTQEKPFKCPQCNTSFSQKVSRMILFSFHVQPKMFIC 245

Query: 61  -HSGKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGKSFDIGG 119
            +  KSFD       +  + + TKE    C  C K++  +   A     NH G+S  +  
Sbjct: 246 EYCQKSFDRPSTL--QIHIRTHTKERPFACMECTKSFTSNSALAMHIKTNHRGESILLQS 303

Query: 120 V 120
           +
Sbjct: 304 I 304


>gi|76621956|ref|XP_582193.2| PREDICTED: zinc finger protein 358 isoform 1 [Bos taurus]
 gi|297476686|ref|XP_002688899.1| PREDICTED: zinc finger protein 358 [Bos taurus]
 gi|296485863|tpg|DAA27978.1| TPA: zinc finger protein 358 [Bos taurus]
          Length = 596

 Score = 37.7 bits (86), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 48/111 (43%), Gaps = 15/111 (13%)

Query: 15  CIKCEQEFGYEN---KNIVEFTEETYIYCPHCNKAW-QGSDLYAYQ-----SHP---NHS 62
           C  C + FG  +   K++   T E    CP C KA+ Q S L  +Q       P    H 
Sbjct: 308 CPVCAKGFGQGSALLKHLRTHTGERPYPCPQCGKAFGQSSALLQHQRTHTAERPYRCPHC 367

Query: 63  GKSFDIGGVYIKEDFLTSQTKETYVYCPHCNKAWQGSDLYAYQSHPNHSGK 113
           GK+F  G     +  L + T E    CPHC+KA+  S       H  HSG+
Sbjct: 368 GKAF--GQSSNLQHHLRTHTGERPYACPHCSKAFGQSSALLQHLH-VHSGE 415


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.430 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,777,312,506
Number of Sequences: 23463169
Number of extensions: 246053373
Number of successful extensions: 668310
Number of sequences better than 100.0: 218
Number of HSP's better than 100.0 without gapping: 142
Number of HSP's successfully gapped in prelim test: 76
Number of HSP's that attempted gapping in prelim test: 667769
Number of HSP's gapped (non-prelim): 456
length of query: 356
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 213
effective length of database: 9,003,962,200
effective search space: 1917843948600
effective search space used: 1917843948600
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)