BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026959
         (230 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 179/225 (79%), Positives = 202/225 (89%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMF++KAQDEIVA IEARIAAWTFLP ENGE+MQILHYEHG
Sbjct: 85  MVADNESGKSIESEVRTSSGMFIAKAQDEIVADIEARIAAWTFLPEENGESMQILHYEHG 144

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK NQ+LGGHR+ATVLMYLS+VEKGGETVFPN+E  +SQ ++ +WS+CA
Sbjct: 145 QKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCA 204

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           + GYAVKP KGDALLFFSLHPDA+TDS SLHGSCPVIEGEKWSATKWIHVR+F+K  K+ 
Sbjct: 205 KGGYAVKPEKGDALLFFSLHPDATTDSDSLHGSCPVIEGEKWSATKWIHVRSFEKSFKQL 264

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              DCVDE+ +C +WAKAGECKKNPLYM+GS  + GYCRKSCKVC
Sbjct: 265 GKGDCVDENDHCPLWAKAGECKKNPLYMIGSGGANGYCRKSCKVC 309


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score =  367 bits (941), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 177/226 (78%), Positives = 194/226 (85%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMFL KAQD++VA+IEARIAAWTFLP ENGEAMQILHYE G
Sbjct: 91  MVADNESGKSIPSEVRTSSGMFLQKAQDDVVAAIEARIAAWTFLPIENGEAMQILHYERG 150

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN--WSECA 118
           QKYEPHFD+F DK+NQQLGGHRIATVLMYLS+VE+GGETVFPN+E       N   S+CA
Sbjct: 151 QKYEPHFDYFHDKVNQQLGGHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCA 210

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK-E 177
           + GY+VKP KGDALLFFSLHPDASTDS SLHGSCPVIEGEKWSATKWIHVR+FD+  K +
Sbjct: 211 KGGYSVKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEGEKWSATKWIHVRSFDRIRKDD 270

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           P   DCVD++  C  WA AGECKKNPLYMVGSK  +GYCRKSC VC
Sbjct: 271 PPSGDCVDDNALCAQWALAGECKKNPLYMVGSKDMKGYCRKSCNVC 316


>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
          Length = 308

 Score =  366 bits (939), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 172/225 (76%), Positives = 194/225 (86%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMF+ K+QDEIV  IEARIAAWTFLP ENGE++QILHYEHG
Sbjct: 82  MVADNESGKSIESEVRTSSGMFIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHG 141

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK NQ+LGGHR+ TVLMYLS+V KGGETVFPNSE    Q +D +WS+CA
Sbjct: 142 QKYEPHFDYFHDKANQELGGHRVVTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCA 201

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           + GYAVKP KGDALLFFSLHPDA+TD+ SLHGSCPVIEGEKWSATKWIHVR+F+K  K  
Sbjct: 202 KNGYAVKPQKGDALLFFSLHPDATTDTNSLHGSCPVIEGEKWSATKWIHVRSFEKSLKHA 261

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               C+DE+ NC +WAKAGEC+KNP+YMVGS+ S G CRKSCKVC
Sbjct: 262 ASGGCIDENENCPLWAKAGECQKNPVYMVGSEGSYGSCRKSCKVC 306


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score =  365 bits (937), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 173/225 (76%), Positives = 196/225 (87%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMFL KAQDEIVA IEARIAAWTFLP ENGE++QILHYE+G
Sbjct: 90  MVADNESGKSIMSEVRTSSGMFLLKAQDEIVADIEARIAAWTFLPVENGESIQILHYENG 149

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPHFD+F DK+NQ LGGHRIATVLMYL+ VE+GGETVFPNSE   SQ +D +WS+CA
Sbjct: 150 EKYEPHFDYFHDKVNQLLGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCA 209

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAV P KGDALLFFSLHPDA+TD +SLHGSCPVI GEKWSATKWIHVR+FDKP K  
Sbjct: 210 KKGYAVNPKKGDALLFFSLHPDATTDPSSLHGSCPVIAGEKWSATKWIHVRSFDKPSKRG 269

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              +CVDED +C  WA  GEC+KNP+YMVGS++S G+CRKSC VC
Sbjct: 270 AQGECVDEDEHCPKWAAVGECEKNPVYMVGSENSDGFCRKSCGVC 314


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score =  363 bits (932), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 173/225 (76%), Positives = 193/225 (85%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMFL+KAQDEIVA IEARIAAWTFLP ENGE+MQILHYE+G
Sbjct: 92  MVADNESGKSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENG 151

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
           QKYEPHFD+F DK NQ +GGHRIATVLMYLS VEKGGET+FPN  +++ Q +D +WSECA
Sbjct: 152 QKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECA 211

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +GYAVKP KGDALLFFSLH DASTD+ SLHGSCPVIEGEKWSATKWIHV +F KP K+ 
Sbjct: 212 HKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQV 271

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +  DCVDE+ NC  WAK GEC+KNPLYMVG +  +G C KSC VC
Sbjct: 272 DSGDCVDENENCPRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVC 316


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score =  363 bits (932), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 171/225 (76%), Positives = 194/225 (86%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKSI S++RTSSGMFL+KAQDEIVA IEARIAAWTFLP ENGE+MQILHYE+G
Sbjct: 93  MVADNDSGKSIMSDIRTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENGESMQILHYENG 152

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK NQ +GGHRIATVLMYLS VEKGGET+FPN+E  + Q +D +WSECA
Sbjct: 153 QKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECA 212

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +GYAVKP KGDALLFFSLH DASTD+ SLHGSCPVIEGEKWSATKWIHV +F+KP K+ 
Sbjct: 213 HKGYAVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVSDFEKPFKQV 272

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           ++ +CVDE+ NC  WAK GEC KNPLYMVG +  RG C KSC VC
Sbjct: 273 DNGECVDENENCPRWAKVGECDKNPLYMVGGEGVRGSCMKSCNVC 317


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score =  362 bits (929), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 168/225 (74%), Positives = 193/225 (85%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS++SEVRTSSGMFL KAQDE+VA +EARIAAWT LP ENGE++QILHYE+G
Sbjct: 89  MVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENG 148

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
           QKYEPHFDFF DK+NQ+LGGHRIATVLMYLS+VEKGGET+FPNSE   SQ++D +WS+C+
Sbjct: 149 QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCS 208

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R+GYAVK  KGDALLFFSL+ DA+TD  SLHGSCPVI GEKWSATKWIHVR+F+K     
Sbjct: 209 RKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRV 268

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               CVDE+ NC+ WAK GECKKNP YMVGS  + GYCRKSCK C
Sbjct: 269 SRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC 313


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  362 bits (929), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 169/228 (74%), Positives = 194/228 (85%), Gaps = 5/228 (2%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS++SEVRTSSGMFL KAQDE+VA +EARIAAWT LP ENGE++QILHYE+G
Sbjct: 68  MVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENG 127

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWS 115
           QKYEPHFDFF DK+NQ+LGGHRIATVLMYLS+VEKGGET+FPNSEV     SQ++D +WS
Sbjct: 128 QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWS 187

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           +C+R+GYAVK  KGDALLFFSL+ DA+TD  SLHGSCPVI GEKWSATKWIHVR+F+K  
Sbjct: 188 DCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT 247

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                  CVDE+ NC+ WAK GECKKNP YMVGS  + GYCRKSCK C
Sbjct: 248 SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC 295


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score =  360 bits (925), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 172/225 (76%), Positives = 192/225 (85%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMFL+KAQDEIVA IEARIAAWTFLP ENGE+MQILHYE+G
Sbjct: 92  MVADNESGKSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENG 151

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK NQ +GGHRIATVLMYLS VEKGGET+F N++  + Q +D +WSECA
Sbjct: 152 QKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECA 211

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +GYAVKP KGDALLFFSLH DASTD+ SLHGSCPVIEGEKWSATKWIHV +F KP K+ 
Sbjct: 212 HKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQV 271

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +  DCVDE+ NC  WAK GEC+KNPLYMVG +  +G C KSC VC
Sbjct: 272 DSGDCVDENENCPRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVC 316


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score =  358 bits (920), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 168/225 (74%), Positives = 194/225 (86%), Gaps = 7/225 (3%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMF  KAQD++VA++EARIAAWTFLP ENGE++QILHYEHG
Sbjct: 98  MVADNESGKSVESEVRTSSGMFFRKAQDQVVANVEARIAAWTFLPEENGESIQILHYEHG 157

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ+LGGHR+ATVLMYLS VEKGGETVFPNSE   +Q++  +WS+CA
Sbjct: 158 QKYEPHFDYFHDKVNQELGGHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCA 217

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFFSLHPDA+TD  SLHGSCPVIEGEKWSATKWIHVR+F     E 
Sbjct: 218 KKGYAVKPRKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSF-----ET 272

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               C D++ NC  WA AGEC+KNPLYM+GS+ S G+CRKSCKVC
Sbjct: 273 TSSVCKDQNPNCPQWATAGECEKNPLYMMGSEDSVGHCRKSCKVC 317


>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score =  356 bits (914), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 167/226 (73%), Positives = 193/226 (85%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QDE+V  IE RIAAWTFLPPENGE++QILHY++G
Sbjct: 76  MVADNESGKSVQSEVRTSSGMFLEKKQDEVVRGIEERIAAWTFLPPENGESIQILHYQNG 135

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E  + Q +D  WS+CA
Sbjct: 136 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCA 195

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P
Sbjct: 196 RNGYAVKPVKGDALLFFSLHPDATTDSESLHGSCPVIEGQKWSATKWIHVRSFDLPVKQP 255

Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C D+++ C  WA  GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 256 GSSDGCEDDNVLCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 301


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score =  356 bits (913), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 165/226 (73%), Positives = 193/226 (85%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL + QDE+V  IE RI+AWTFLPPENGE++QILHY++G
Sbjct: 72  MVADNESGKSVQSEVRTSSGMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNG 131

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E  + Q +D  WS+CA
Sbjct: 132 EKYEPHYDYFHDKKNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCA 191

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P
Sbjct: 192 RNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQP 251

Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C D+++ C  WA  GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 252 GSSDGCEDDNILCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 297


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score =  352 bits (903), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 170/226 (75%), Positives = 193/226 (85%), Gaps = 4/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKSI SEVRTSSGMFL+K QDEIV+ IEARIAAWTFLP ENGE+MQ+LHY +G
Sbjct: 87  MVADNESGKSIQSEVRTSSGMFLNKQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNG 146

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPHFDFF DK NQ+LGGHR+ATVLMYLS+VEKGGET+FP++E  +SQ +D +WSECA
Sbjct: 147 EKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECA 206

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +GYAVKP KGDALLFFSLH DA+TDS SLHGSCPVIEGEKWSATKWIHV +F+KP ++ 
Sbjct: 207 HKGYAVKPRKGDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATKWIHVADFEKPVRQA 266

Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            ED  C DE+ NC  WAK GEC+KNPLYMVG K   G C KSC VC
Sbjct: 267 LEDRVCADENENCARWAKVGECEKNPLYMVG-KGGNGKCMKSCNVC 311


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  351 bits (901), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 164/226 (72%), Positives = 192/226 (84%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL K QDE+V  IE RI+AWTFLPPENGEA+QILHY++G
Sbjct: 71  MVADNKSGKSVQSEVRTSSGMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E  + Q +D  WS+CA
Sbjct: 131 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCA 190

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R GYAVKP+KGDALLFFSLHPD++TDS SLHGSCPVIEG+KWSATKWIHVR+FD   K+P
Sbjct: 191 RNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLTVKQP 250

Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C D+++ C  WA  GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 251 GPSDGCEDDNVLCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 296


>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
          Length = 280

 Score =  350 bits (897), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 168/225 (74%), Positives = 191/225 (84%), Gaps = 3/225 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SE+RTSSGMFL+KAQDEIVAS+E RIAAWTFLP ENGEAMQ+LHYE G
Sbjct: 57  MVADNESGKSVMSEIRTSSGMFLNKAQDEIVASVEDRIAAWTFLPIENGEAMQVLHYELG 116

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ +GGHRIATVLMYLS V KGGETVFPN+E   SQ +D +WSECA
Sbjct: 117 QKYEPHFDYFHDKINQAMGGHRIATVLMYLSDVVKGGETVFPNAETKDSQPKDDSWSECA 176

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           + GY+VKP KGDALLFFSL PDA+TD +SLHGSCPVIEGEKWSATKWIHVR+F+   ++ 
Sbjct: 177 KGGYSVKPNKGDALLFFSLRPDATTDQSSLHGSCPVIEGEKWSATKWIHVRSFEVSNRKI 236

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             + CVDE+ +C  WA  GECKKNP YMVGS  S G CRKSC+VC
Sbjct: 237 S-EGCVDENDSCTHWASIGECKKNPTYMVGSPDSPGACRKSCQVC 280


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  349 bits (896), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 163/226 (72%), Positives = 191/226 (84%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL K QDE+V  IE RI+AWTFLPPENGEA+QILHY++G
Sbjct: 71  MVADNKSGKSVQSEVRTSSGMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E  + Q +D  WS+CA
Sbjct: 131 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCA 190

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R GYAVKP+KGDALLFFSLHPD++TDS SLHGSCP IEG+KWSATKWIHVR+FD   K+P
Sbjct: 191 RNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQP 250

Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C D+++ C  WA  GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 251 GPSDGCEDDNVLCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 296


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score =  349 bits (895), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 160/225 (71%), Positives = 194/225 (86%), Gaps = 4/225 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SG+S+ SEVRTSSGMFLSK QD+IVA++EA++AAWTF+P ENGE+MQILHYE+G
Sbjct: 92  MVADNDSGESVESEVRTSSGMFLSKRQDDIVANVEAKLAAWTFIPEENGESMQILHYENG 151

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
           QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP    + +Q +D +W+ECA
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECA 211

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHVR+FD+     
Sbjct: 212 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVRSFDRA--FS 269

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +   CVDE+++C  WAKAGEC+KNP YMVGS    GYCRKSC VC
Sbjct: 270 KQSGCVDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCNVC 314


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score =  348 bits (892), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 163/226 (72%), Positives = 190/226 (84%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLP ENGE++QILHY++G
Sbjct: 71  MVADNESGKSVQSEVRTSSGMFLEKRQDEVVARIEERIAAWTFLPSENGESIQILHYKNG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E  ++Q +D   SECA
Sbjct: 131 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECA 190

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
           + GYAVKPMKGDALLFFSLHPDA+TD  SLHGSCPVIEG+KWSATKWIHVR+F+ P K+ 
Sbjct: 191 KNGYAVKPMKGDALLFFSLHPDATTDPDSLHGSCPVIEGQKWSATKWIHVRSFENPGKQG 250

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C DE++ C  WA  GEC KNP YMVG+K + G+CRKSC +C
Sbjct: 251 ASGDGCEDENVLCAQWAAVGECAKNPNYMVGTKEAPGFCRKSCNLC 296


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score =  345 bits (886), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 157/225 (69%), Positives = 194/225 (86%), Gaps = 4/225 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SG+S+ SEVRTSSGMFLSK QD+IV+++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 108 MVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENG 167

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
           QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP    + +Q +D +W+ECA
Sbjct: 168 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECA 227

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++F++     
Sbjct: 228 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-- 285

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +   C+DE+++C  WAKAGEC+KNP YMVGS    GYCRKSCK C
Sbjct: 286 KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 330


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score =  344 bits (883), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 157/225 (69%), Positives = 194/225 (86%), Gaps = 4/225 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SG+S+ SEVRTSSGMFLSK QD+IV+++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 92  MVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENG 151

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
           QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP    + +Q +D +W+ECA
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECA 211

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++F++     
Sbjct: 212 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-- 269

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +   C+DE+++C  WAKAGEC+KNP YMVGS    GYCRKSCK C
Sbjct: 270 KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score =  343 bits (881), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 162/227 (71%), Positives = 187/227 (82%), Gaps = 4/227 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 81  MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 140

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS---QSRDGNWSEC 117
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +EV    Q +D  WS+C
Sbjct: 141 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDC 200

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE 177
           A+ GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD   K+
Sbjct: 201 AKNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQ 260

Query: 178 -PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               D C DE++ C  WA  GEC KNP YMVG+  + G+CRKSC VC
Sbjct: 261 GASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 307


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score =  343 bits (880), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 157/225 (69%), Positives = 193/225 (85%), Gaps = 4/225 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SG+S+ SEVRTSSGMFLSK QD+IV ++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 92  MVADNDSGESVESEVRTSSGMFLSKRQDDIVNNVEAKLAAWTFLPEENGESMQILHYENG 151

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
           QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP    + +Q +D +W+ECA
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECA 211

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++F++     
Sbjct: 212 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-- 269

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +   C+DE+++C  WAKAGEC+KNP YMVGS    GYCRKSCK C
Sbjct: 270 KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score =  342 bits (878), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 161/226 (71%), Positives = 187/226 (82%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 81  MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 140

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +E  + Q +D  WS+CA
Sbjct: 141 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCA 200

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
           + GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD   K+ 
Sbjct: 201 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQG 260

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C DE++ C  WA  GEC KNP YMVG+  + G+CRKSC VC
Sbjct: 261 ASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 306


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score =  342 bits (878), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 161/226 (71%), Positives = 187/226 (82%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 81  MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 140

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +E  + Q +D  WS+CA
Sbjct: 141 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCA 200

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
           + GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD   K+ 
Sbjct: 201 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQG 260

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C DE++ C  WA  GEC KNP YMVG+  + G+CRKSC VC
Sbjct: 261 ASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 306


>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
 gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
          Length = 241

 Score =  341 bits (875), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 161/226 (71%), Positives = 187/226 (82%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 1   MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 60

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +E  + Q +D  WS+CA
Sbjct: 61  EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCA 120

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
           + GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD   K+ 
Sbjct: 121 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQG 180

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              D C DE++ C  WA  GEC KNP YMVG+  + G+CRKSC VC
Sbjct: 181 ASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 226


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score =  335 bits (860), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 158/225 (70%), Positives = 184/225 (81%), Gaps = 4/225 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SE+RTSSGMFL + QDE +  IE RIAAWTFLP ENGE +QILHYE G
Sbjct: 24  MVADNESGKSVMSEIRTSSGMFLERRQDETITRIEKRIAAWTFLPEENGEPIQILHYEKG 83

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKY+ H+D+F DK NQ++GGHR+ATVLMYLS V+KGGETVFP++E  + Q +D  WS+CA
Sbjct: 84  QKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGGETVFPDAEGKLLQVKDDTWSDCA 143

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R GYAVKP KGDALLFFS HP+A+TD  SLH SCPVIEGEKWSAT+WIHVR+F K  KE 
Sbjct: 144 RSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCPVIEGEKWSATRWIHVRSFAK--KER 201

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             D+CVDE+ NC  WA  GEC+KN LYMVG+  + GYCRKSCKVC
Sbjct: 202 NKDECVDEEDNCSFWASNGECEKNVLYMVGNNETLGYCRKSCKVC 246


>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 289

 Score =  332 bits (850), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 154/225 (68%), Positives = 189/225 (84%), Gaps = 6/225 (2%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VAD+ SG+SI SE RTSSG+FL+K QD+IVA++EA++A WTFLP ENGEA+QILHYE+G
Sbjct: 69  VVADDNSGESIDSEERTSSGVFLTKRQDDIVANVEAKLATWTFLPEENGEALQILHYENG 128

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
           QKY+PHFD++ DK   +LGGHRIATVLMYLS+V KGGETVFP    +  Q +D  WSECA
Sbjct: 129 QKYDPHFDYYYDKETLKLGGHRIATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECA 188

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LHP+A+TD TSLHGSCPVIEGEKWSAT+WIHVR+F K +   
Sbjct: 189 KQGYAVKPRKGDALLFFNLHPNATTDPTSLHGSCPVIEGEKWSATRWIHVRSFGKKQS-- 246

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             D CVD+  +C +WAKAGEC+KNP+YM+GS++  GYCRKSCK C
Sbjct: 247 --DGCVDDHESCEIWAKAGECEKNPMYMMGSETDLGYCRKSCKAC 289


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score =  329 bits (843), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 156/223 (69%), Positives = 179/223 (80%), Gaps = 4/223 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ S +RTSSGMFLSK QDE++  IE RIAAWTFLP ENGEA+Q+L YE G
Sbjct: 78  MVADNESGKSVLSNIRTSSGMFLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFG 137

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGETVFP+SE +  +D +WS+CA++
Sbjct: 138 EKYEPHYDYFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKK 197

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
           G AVKP KGDALLF+SLHPDA+ D +SLHG CPVIEGEKWSATKWIHV  F KP+KE   
Sbjct: 198 GIAVKPRKGDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIHVLPFGKPKKE--- 254

Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             C DE+  C  WA  GEC KNP YMVG++   G CRKSCKVC
Sbjct: 255 -GCADENEKCGEWAAYGECDKNPSYMVGTQEWPGACRKSCKVC 296


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score =  323 bits (829), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 155/224 (69%), Positives = 178/224 (79%), Gaps = 5/224 (2%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ S +RTSSGMFLSK QDE++  IE RIAAWTFLP ENGEA+Q+L YE G
Sbjct: 64  MVADNESGKSVLSNIRTSSGMFLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFG 123

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS-RDGNWSECAR 119
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS   KGGETVFP+SE   + +D +WS+CA+
Sbjct: 124 EKYEPHYDYFHDKYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAK 183

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
           +G AVKP KGDALLF+SLHPDA+ D +SLHG CPVIEGEKWSATKWIHV  F KP+KE  
Sbjct: 184 KGIAVKPRKGDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIHVLPFGKPKKE-- 241

Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              C DE+  C  WA  GEC KNP YMVG++   G CRKSCKVC
Sbjct: 242 --GCADENEKCGEWAAYGECDKNPSYMVGTQEWPGACRKSCKVC 283


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score =  323 bits (829), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 154/228 (67%), Positives = 178/228 (78%), Gaps = 2/228 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKSI S+VRTSSG FLSK +D+IV+ IE R+AAWTFLP EN E++QILHYE G
Sbjct: 73  MVADNDSGKSIMSQVRTSSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELG 132

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F DK N + GGHR+ATVLMYL+ V+KGGETVFPN+     Q +D  WS+CA
Sbjct: 133 QKYDAHFDYFHDKNNLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCA 192

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R G AVKP KGDALLFFSLH +A+TD  SLHGSCPVIEGEKWSATKWIHVR+FD P    
Sbjct: 193 RSGLAVKPKKGDALLFFSLHVNATTDPASLHGSCPVIEGEKWSATKWIHVRSFDNPPDVS 252

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPS 226
            D  C DE+  C  WA  GEC +NP YMVG+K S G+CRKSC VC  S
Sbjct: 253 LDLPCSDENERCTRWAAVGECYRNPKYMVGTKDSLGFCRKSCGVCSRS 300


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score =  323 bits (828), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 150/225 (66%), Positives = 181/225 (80%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ S+VRTSSG FL+K +DEI++ IE R+AAWTFLP EN E++Q+LHYE G
Sbjct: 67  MVADNDSGKSVMSQVRTSSGTFLNKHEDEIISGIEKRVAAWTFLPEENAESIQVLHYEVG 126

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F DK NQ+LGGHR+ATVLMYL+ V+KGGETVFPN+E    Q +D  WSECA
Sbjct: 127 QKYDAHFDYFHDKNNQKLGGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECA 186

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R G AVKP KGDALLFFSLH +A+TD +SLHGSCPVIEGEKWSATKWIHVR+FD P    
Sbjct: 187 RSGLAVKPRKGDALLFFSLHINATTDPSSLHGSCPVIEGEKWSATKWIHVRSFDNPPIVR 246

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D  C D++  C  WA  GEC +NP YM+G+K + G+CRKSC +C
Sbjct: 247 MDVRCSDDNELCSKWAAVGECYRNPKYMIGTKDTLGFCRKSCGIC 291


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score =  321 bits (823), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 153/225 (68%), Positives = 187/225 (83%), Gaps = 7/225 (3%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VAD +SG+S  SEVRTSSGMFL+K QD+IVA++EA++AAWTFLP ENGEA+QILHYE+G
Sbjct: 69  VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 128

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
           QKY+PHFD+F DK   +LGGHRIATVLMYLS+V KGGETVFPN   +  Q +D +WS+CA
Sbjct: 129 QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCA 188

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LH + +TD  SLHGSCPVIEGEKWSAT+WIHVR+F K +   
Sbjct: 189 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV- 247

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               CVD+  +C  WA AGEC+KNP+YMVGS++S G+CRKSCK C
Sbjct: 248 ----CVDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288


>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 253

 Score =  321 bits (823), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 153/225 (68%), Positives = 187/225 (83%), Gaps = 7/225 (3%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VAD +SG+S  SEVRTSSGMFL+K QD+IVA++EA++AAWTFLP ENGEA+QILHYE+G
Sbjct: 34  VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 93

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
           QKY+PHFD+F DK   +LGGHRIATVLMYLS+V KGGETVFPN   +  Q +D +WS+CA
Sbjct: 94  QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCA 153

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LH + +TD  SLHGSCPVIEGEKWSAT+WIHVR+F K +   
Sbjct: 154 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV- 212

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               CVD+  +C  WA AGEC+KNP+YMVGS++S G+CRKSCK C
Sbjct: 213 ----CVDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 253


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score =  320 bits (819), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 153/225 (68%), Positives = 177/225 (78%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKSI S+VRTSSG FLSK +D+IV+ IE R+AAWTFLP EN E++QILHYE G
Sbjct: 73  MVADNDSGKSIMSQVRTSSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELG 132

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F DK N + GGHR+ATVLMYL+ V+KGGETVFPN+     Q +D  WS+CA
Sbjct: 133 QKYDAHFDYFHDKNNLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCA 192

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R G AVKP KGDALLFFSLH +A+TD  SLHGSCPVIEGEKWSATKWIHVR+FD P    
Sbjct: 193 RSGLAVKPKKGDALLFFSLHVNATTDPASLHGSCPVIEGEKWSATKWIHVRSFDNPPDVS 252

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D  C DE+  C  WA  GEC +NP YMVG+K S G+CRKSC VC
Sbjct: 253 LDLPCSDENERCTRWAAVGECYRNPKYMVGTKDSLGFCRKSCGVC 297


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score =  318 bits (816), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 153/226 (67%), Positives = 179/226 (79%), Gaps = 3/226 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL+K QD +V+ IE RIAAWTFLP EN E MQIL YEHG
Sbjct: 80  MVADNQSGKSVMSEVRTSSGMFLNKRQDPVVSRIEERIAAWTFLPQENAENMQILRYEHG 139

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ  GGHR ATVLMYLS V+KGGETVFPN++   SQ +D  +SECA
Sbjct: 140 QKYEPHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECA 199

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +G AVKP+KGDA+LFFSLH D   D  SLHGSCPVI+GEKWSA KWIHVR+++ P   P
Sbjct: 200 HQGLAVKPVKGDAVLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIHVRSYENPPVVP 259

Query: 179 EDD-DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +D   C D+  +C  WA AGEC KNP+YMVG++ + G CRKSC VC
Sbjct: 260 KDTRGCADKSEHCAEWAAAGECGKNPVYMVGAEGAPGQCRKSCNVC 305


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score =  317 bits (811), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 146/225 (64%), Positives = 178/225 (79%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+AS+ RTSSG FL+K +DEIV++IE R+AAWTFLP EN E++Q+L YE G
Sbjct: 71  MVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F D+ N +LGG R+ATVLMYL+ V+KGGETVFPN+E S  Q +D  WSEC+
Sbjct: 131 QKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECS 190

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R G AVKP KGDALLFF+LH +A+ D+ SLHGSCPVIEGEKWSATKWIHVR+FD P    
Sbjct: 191 RSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVR 250

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D  C D+   C  WA  GEC +NP YMVG+K + G+CRKSC +C
Sbjct: 251 TDAPCSDDKELCPRWAAIGECHRNPTYMVGTKDTLGFCRKSCGIC 295


>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
          Length = 394

 Score =  316 bits (810), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 144/201 (71%), Positives = 170/201 (84%), Gaps = 3/201 (1%)

Query: 26  AQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIAT 85
            QDE+V  IE RI+AWTFLPPENGE++QILHY++G+KYEPH+D+F DK NQ LGGHRIAT
Sbjct: 192 TQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIAT 251

Query: 86  VLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
           VLMYLS+VEKGGET+FPN+E  + Q +D  WS+CAR GYAVKP+KGDALLFFSLHPDA+T
Sbjct: 252 VLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311

Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWAKAGECKKN 202
           DS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P   D C D+++ C  WA  GEC KN
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQWAAVGECAKN 371

Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
           P YMVG+K + G+CRKSCKVC
Sbjct: 372 PNYMVGTKEAPGFCRKSCKVC 392


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score =  316 bits (809), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 146/225 (64%), Positives = 177/225 (78%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+AS+ RTSSG FL+K +DEIV++IE R+AAWTFLP EN E++Q+L YE G
Sbjct: 71  MVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F D+ N +LGG R+ATVLMYL+ V KGGETVFPN+E S  Q +D  WSEC+
Sbjct: 131 QKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECS 190

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R G AVKP KGDALLFF+LH +A+ D+ SLHGSCPVIEGEKWSATKWIHVR+FD P    
Sbjct: 191 RSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVR 250

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D  C D+   C  WA  GEC +NP YMVG+K + G+CRKSC +C
Sbjct: 251 TDAPCSDDKELCPRWAAIGECHRNPTYMVGTKDTLGFCRKSCGIC 295


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score =  315 bits (808), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 153/224 (68%), Positives = 176/224 (78%), Gaps = 3/224 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SE+RTSSGMFL K QD+I++ IE RIAAWTFLP ENGEA+Q+L Y+ G
Sbjct: 42  MVADNESGKSVKSEIRTSSGMFLMKGQDDIISRIEDRIAAWTFLPKENGEAIQVLRYQDG 101

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-VSQSRDGNWSECAR 119
           +KYEPHFD+F DK NQ LGGHRIATVLMYLS V KGGETVFP+SE     +D +WS C +
Sbjct: 102 EKYEPHFDYFHDKNNQALGGHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGK 161

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
            G AVKP KGDALLFFSLHP A  D +SLH  CPVIEGEKWSATKWIHV  F+KP   P+
Sbjct: 162 TGVAVKPRKGDALLFFSLHPSAVPDESSLHTGCPVIEGEKWSATKWIHVAAFEKP--RPK 219

Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +  CV+E  +C  WA  GEC+KNP YMVG+K   GYCRK+C VC
Sbjct: 220 NGACVNEVDSCEEWAAYGECQKNPAYMVGTKEWPGYCRKACHVC 263


>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  315 bits (808), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 150/225 (66%), Positives = 175/225 (77%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 93  MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 152

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E   +Q +D  +SECA
Sbjct: 153 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 212

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++G AVKP+KGD +LFFSLH D   D  SLHGSCPVIEGEKWSA KWI +R+++ P    
Sbjct: 213 QKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 272

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             + C D    C  WA+AGEC+KNP+YMVG++   G CRKSC VC
Sbjct: 273 VTEGCSDNSARCAKWAEAGECEKNPVYMVGAEGLPGNCRKSCGVC 317


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score =  315 bits (806), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 149/225 (66%), Positives = 178/225 (79%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ S+VRTSSG FL+K +DEIV++IE R+AAWTFLP EN E+MQ+L YE G
Sbjct: 71  MVADNDSGKSLMSQVRTSSGAFLAKHEDEIVSAIEKRVAAWTFLPEENAESMQVLRYEIG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F DK N + GG R ATVLMYL+ V+KGGETVFPN+E S  Q +D  WSEC+
Sbjct: 131 QKYDAHFDYFHDKNNVKHGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECS 190

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R G AVKP KGDALLFF LH +A+TD++SLHGSCPVIEGEKWSATKWIHVR+FD P    
Sbjct: 191 RSGLAVKPKKGDALLFFGLHLNATTDTSSLHGSCPVIEGEKWSATKWIHVRSFDNPPNVR 250

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D  C D++  C  WA  GEC KNP YMVG+K + G+CRKSC +C
Sbjct: 251 MDAPCSDDNELCPKWAAIGECYKNPTYMVGTKDTNGFCRKSCGLC 295


>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
 gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
          Length = 313

 Score =  315 bits (806), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 150/225 (66%), Positives = 175/225 (77%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 87  MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 146

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E   +Q +D  +SECA
Sbjct: 147 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 206

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++G AVKP+KGD +LFFSLH D   D  SLHGSCPVIEGEKWSA KWI +R+++ P    
Sbjct: 207 QKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 266

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             + C D    C  WA+AGEC+KNP+YMVG++   G CRKSC VC
Sbjct: 267 VTEGCSDNSARCAKWAEAGECEKNPVYMVGAEGLPGNCRKSCGVC 311


>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 324

 Score =  311 bits (797), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 148/233 (63%), Positives = 187/233 (80%), Gaps = 12/233 (5%)

Query: 1   MVADNESGKSIASE----VRTSSGMFLSKAQ----DEIVASIEARIAAWTFLPPENGEAM 52
           MVADN+SG+S+ SE    V   S  F++       D+IV+++EA++AAWTFLP ENGE+M
Sbjct: 92  MVADNDSGESVESEDSVSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESM 151

Query: 53  QILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSR 110
           QILHYE+GQKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP    + +Q +
Sbjct: 152 QILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK 211

Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
           D +W+ECA++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++
Sbjct: 212 DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKS 271

Query: 171 FDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           F++     +   C+DE+++C  WAKAGEC+KNP YMVGS    GYCRKSCK C
Sbjct: 272 FERAFN--KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322


>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 319

 Score =  309 bits (791), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 146/222 (65%), Positives = 178/222 (80%), Gaps = 5/222 (2%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G+S+ S+ RTS+GMFL KAQDEIVA IE+RIAAWTFLP +NGE +QIL YE+GQKYEPH
Sbjct: 101 TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPH 160

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAV 124
           FDFF+D  N  +GGHRIAT+LMYLS+VEKGGETVFPNS V  S+    + SEC + GY V
Sbjct: 161 FDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGV 220

Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCV 184
           +P  GDALLFFS++P+ + D+TS HGSCPVIEGEKWSATKWIH+   D+  + P    CV
Sbjct: 221 RPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPA---CV 277

Query: 185 DEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPS 226
           DE+ +C  WAKAGEC+KNP+YM+GSK+  G+CR SCKVC PS
Sbjct: 278 DENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS 319


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  308 bits (790), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 147/232 (63%), Positives = 175/232 (75%), Gaps = 4/232 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MV D ++GKS+ SEVRTSSG FL+K QD++VA+IEARIAAWT LP ENGE++Q+L YE+G
Sbjct: 75  MVVDRQTGKSVMSEVRTSSGTFLAKKQDQVVATIEARIAAWTLLPQENGESIQVLRYENG 134

Query: 61  QKYEPHFDFFRD--KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSE 116
           QKYEPH DF R   K +   GGHR+ATVLMYLS V+ GGETVFPNS+    Q +D   SE
Sbjct: 135 QKYEPHVDFIRHAAKGHHSRGGHRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSE 194

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CARRGYAVKP+KGDA+LFFSLHP+ +TD  SLHG CPVIEGEKWSATKWIHVR FD   +
Sbjct: 195 CARRGYAVKPVKGDAVLFFSLHPNGTTDRDSLHGGCPVIEGEKWSATKWIHVRPFDNRRR 254

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPSSV 228
            P    C D+D  C   A  GEC +NP YMVG+  S G+CRKSC  C  +++
Sbjct: 255 VPSTAGCGDDDELCPRLAANGECDRNPRYMVGTAGSPGFCRKSCNACNGTTL 306


>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 210

 Score =  308 bits (790), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 142/200 (71%), Positives = 168/200 (84%), Gaps = 3/200 (1%)

Query: 27  QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
           QDE+V  IE RI+AWTFLPPENGEA+QILHY++G+KYEPH+D+F DK NQ LGGHRIATV
Sbjct: 9   QDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 68

Query: 87  LMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
           LMYLS+VEKGGET+FPN+E  + Q +D  WS+CAR GYAVKP+KGDALLFFSLHPD++TD
Sbjct: 69  LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 128

Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWAKAGECKKNP 203
           S SLHGSCP IEG+KWSATKWIHVR+FD   K+P   D C D+++ C  WA  GEC KNP
Sbjct: 129 SDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQWAAVGECAKNP 188

Query: 204 LYMVGSKSSRGYCRKSCKVC 223
            YMVG+K + G+CRKSCKVC
Sbjct: 189 NYMVGTKEAPGFCRKSCKVC 208


>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 204

 Score =  307 bits (786), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 141/201 (70%), Positives = 168/201 (83%), Gaps = 3/201 (1%)

Query: 26  AQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIAT 85
           + DE+V  IE RI+AWTFLPPENGEA+QILHY++G+KYEPH+D+F DK NQ LGGHRIAT
Sbjct: 2   SNDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIAT 61

Query: 86  VLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
           VLMYLS+VEKGGET+FPN+E  + Q +D  WS+CAR GYAVKP+KGDALLFFSLHPD++T
Sbjct: 62  VLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTT 121

Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWAKAGECKKN 202
           DS SLHGSCP IEG+KWSATKWIHVR+FD   K+P   D C D+++ C  WA  GEC KN
Sbjct: 122 DSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQWAAVGECAKN 181

Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
           P YMVG+K + G+CRKSCKVC
Sbjct: 182 PNYMVGTKEAPGFCRKSCKVC 202


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score =  305 bits (781), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 148/227 (65%), Positives = 172/227 (75%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADNESG S  SEVRTSSGMF+ KA+D IV+ IE +IA WTFLP ENGE +Q+L YE GQ
Sbjct: 70  VADNESGNSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQ 129

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           KYEPH+D+F DK+N   GGHR+ATVLMYL++VEKGGETVFP +E S  R     D + SE
Sbjct: 130 KYEPHYDYFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSE 189

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G  VKP KGDALLF+SLHP+A+ D  SLHG CPVI+GEKWSATKWIHV +FDK   
Sbjct: 190 CAKKGIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHVDSFDKTVD 249

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              + +C D D NC  WA  GEC KNP YM+GS    GYCRKSCKVC
Sbjct: 250 --TEGNCSDRDENCERWAALGECTKNPEYMLGSAGLPGYCRKSCKVC 294


>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
 gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
          Length = 310

 Score =  304 bits (779), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 141/225 (62%), Positives = 174/225 (77%), Gaps = 2/225 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWT LP EN E +QIL YE+G
Sbjct: 84  MVADNESGKSVMSEVRTSSGMFLDKQQDPVVSGIEERIAAWTLLPQENAENIQILRYENG 143

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKY+PHFD+F+DK+NQ  GGHR ATVL YLS VEKGGETVFPN+E   SQ +D ++S+CA
Sbjct: 144 QKYDPHFDYFQDKVNQLQGGHRYATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCA 203

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++G AVK +KGD++LFF+L PD + D  SLHGSCPVIEGEKWSA KWIHVR++D      
Sbjct: 204 KKGLAVKAVKGDSVLFFNLQPDGTPDPLSLHGSCPVIEGEKWSAPKWIHVRSYDNASSMK 263

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           + ++C D   NC  WA +GEC  N +YM+G++ + G C+KSC  C
Sbjct: 264 QSEECSDLSENCAAWAASGECNNNAVYMIGTEDAPGQCQKSCNAC 308


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score =  304 bits (779), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 150/228 (65%), Positives = 173/228 (75%), Gaps = 10/228 (4%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADNESGKS  SEVRTSSG F+SKA+D IV  IE ++A WTFLP ENGE +Q+L YE GQ
Sbjct: 74  VADNESGKSQVSEVRTSSGAFISKAKDAIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR------DGNWS 115
           KYE HFDFF DK+N   GGHR ATVLMYLS+VEKGG+TVFPN+E+S+ +      + + S
Sbjct: 134 KYENHFDFFSDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLS 193

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           ECA+RG +VKP KGDALLFFSL P A+ D  SLHG CPVIEGEKWSATKWIHV +FDK  
Sbjct: 194 ECAKRGISVKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWSATKWIHVDSFDK-- 251

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               +D C D + NC  WA  GEC KNP YMVG+ S  GYCR+SCKVC
Sbjct: 252 --ILEDGCNDHNQNCERWAALGECTKNPEYMVGTSSLPGYCRRSCKVC 297


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score =  300 bits (767), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 142/213 (66%), Positives = 166/213 (77%), Gaps = 2/213 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 93  MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 152

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E   +Q +D  +SECA
Sbjct: 153 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 212

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++G AVKP+KGDA+LFFSLH D   D  SLHGSCPVIEGEKWSA KWI +R+++ P    
Sbjct: 213 QKGLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 272

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKS 211
             + C D    C  WA+AGEC+KNP+YM  + S
Sbjct: 273 VTEGCSDNSARCAKWAEAGECEKNPVYMTVNSS 305


>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
          Length = 328

 Score =  298 bits (764), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 144/216 (66%), Positives = 175/216 (81%), Gaps = 7/216 (3%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VAD +SG+S  SEVRTSSGMFL+K QD+IVA++EA++AAWTFLP ENGEA+QILHYE+G
Sbjct: 2   VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 61

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
           QKY+PHFD+F DK   +LGGHRIATVLMYLS+V KGGETVFPN   +  Q +D +WS+CA
Sbjct: 62  QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCA 121

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++GYAVKP KGDALLFF+LH + +TD  SLHGSCPVIEGEKWSAT+WIHVR+F K +   
Sbjct: 122 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV- 180

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRG 214
               CVD+  +C  WA AGEC+KNP+YMVG     G
Sbjct: 181 ----CVDDHESCQEWADAGECEKNPMYMVGVGKKTG 212


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score =  298 bits (763), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 145/227 (63%), Positives = 173/227 (76%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADNESGKS  SEVRTSSGMF++KA+D IVA IE +IA WTFLP ENGE +Q+L YEHGQ
Sbjct: 76  VADNESGKSKLSEVRTSSGMFITKAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEHGQ 135

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL+ VEKGGETVFP++E    R       + SE
Sbjct: 136 KYDPHYDYFSDKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSE 195

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CAR+G AVKP +GDALLFFSL+P A  D++S+H  CPVIEGEKWSATKWIHV +FDK  +
Sbjct: 196 CARKGIAVKPRRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWIHVDSFDKNLE 255

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C D++ +C  WA  GEC KN  YMVGS    GYCR+SCKVC
Sbjct: 256 --AGGNCTDQNESCGRWAALGECTKNVEYMVGSSGLPGYCRRSCKVC 300


>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
          Length = 487

 Score =  298 bits (763), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 141/213 (66%), Positives = 165/213 (77%), Gaps = 2/213 (0%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 93  MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 152

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E   +Q +D  +SECA
Sbjct: 153 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 212

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           ++G AVKP+KGD +LFFSLH D   D  SLHGSCPVIEGEKWSA KWI +R+++ P    
Sbjct: 213 QKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 272

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKS 211
             + C D    C  WA+AGEC+KNP+YM  + S
Sbjct: 273 VTEGCSDNSARCAKWAEAGECEKNPVYMTVNSS 305


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score =  297 bits (760), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 150/225 (66%), Positives = 169/225 (75%), Gaps = 5/225 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTSSGMF+ K +D IVA IE +IAAWTFLP +NGE MQ+L YE GQ
Sbjct: 74  VADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS---RDGNWSECA 118
           KY+ H+D+F DK+N   GGHRIATVLMYLS V KGGETVFP +EVS S    + + SECA
Sbjct: 134 KYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECA 193

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           R+G AVKP KGDALLFFSLHP A  D  SLHG CPVIEGEKWSATKWIHV +FDK  K  
Sbjct: 194 RKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIHVDSFDKILK-- 251

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              +C DE+ +C  WA  GEC KNP YM+GS    G CR+SCKVC
Sbjct: 252 PGGNCTDENDSCERWAALGECTKNPEYMLGSSDLPGACRRSCKVC 296


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score =  296 bits (759), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 139/224 (62%), Positives = 171/224 (76%), Gaps = 3/224 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + +SG+S+ S+ RTSSGMFL + QDE+VA IE RIAAWT  P ENGE+MQ+L Y  G+
Sbjct: 75  VVNGKSGESVMSKTRTSSGMFLIRKQDEVVARIEERIAAWTMFPAENGESMQMLRYGQGE 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECAR 119
           KYEPHFD+ R +     GGHRIATVLMYLS+V+ GGETVFP++E  +SQ +D  WS+CA 
Sbjct: 135 KYEPHFDYIRGRQASARGGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAE 194

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
           +G+AVKP KG A+LFFSL+P+A+ D  SLHGSCPVI+GEKWSATKWIHVR++D+  +   
Sbjct: 195 QGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDENGRR-S 253

Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D C DE   C  WA AGEC KNP YMVG+  S G+CRKSC VC
Sbjct: 254 SDKCEDEHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCNVC 297


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score =  296 bits (758), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 143/229 (62%), Positives = 172/229 (75%), Gaps = 9/229 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  SEVRTSSGMF+SK +D IV+ IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 77  VADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQ 136

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS-------RDGNW 114
           KY+PH+D+F DK+N   GGHR+ATVLMYL++V KGGETVFPN+E+ +S        D + 
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDL 196

Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
           SEC ++G AVKP +GDALLFFSLHP+A  D+ SLH  CPVIEGEKWSATKWIHV +FDK 
Sbjct: 197 SECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKT 256

Query: 175 EKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                  DC D+  +C  WA  GEC KNP YMVG+    GYCRKSCK C
Sbjct: 257 VG--AGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSCKTC 303


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score =  296 bits (757), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 138/224 (61%), Positives = 171/224 (76%), Gaps = 3/224 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + +SG+S+ S+ RTSSGMFL + QDE+VA IE RIAAWT  P ENGE+MQ+L Y  G+
Sbjct: 75  VVNGKSGESVMSKTRTSSGMFLIRKQDEVVARIEERIAAWTMFPAENGESMQMLRYGQGE 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECAR 119
           KYEPHFD+ R +     GGHRIATVLMYLS+V+ GGETVFP++E  +SQ +D  WS+CA 
Sbjct: 135 KYEPHFDYIRGRQASARGGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAE 194

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
           +G+AVKP KG A+LFFSL+P+A+ D  SLHGSCPVI+GEKWSATKWIHVR++D+  +   
Sbjct: 195 QGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDENGRR-S 253

Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D C D+   C  WA AGEC KNP YMVG+  S G+CRKSC VC
Sbjct: 254 SDKCEDQHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCNVC 297


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score =  295 bits (754), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 143/227 (62%), Positives = 169/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADNESGKS  SEVRTSSGMF++K +D I+A IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 73  VADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEHGQ 132

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYLS V KGGETVFPN+E    R       + SE
Sbjct: 133 KYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSE 192

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G +VKP +GDALLFFSLHP A  D  SLH  CPVIEGEKWSATKWIHV +FDK  +
Sbjct: 193 CAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVDSFDKNIE 252

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C D++ +C  WA  GEC  NP YMVGS    GYCR+SCKVC
Sbjct: 253 --AGGNCTDKNESCERWAALGECTNNPEYMVGSPELPGYCRRSCKVC 297


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score =  294 bits (753), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 143/227 (62%), Positives = 170/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  SEVRTSSGMF+SK +D IV+ IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 77  VADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQ 136

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS-----QSRDGNWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL++V KGGETVFPN+E S        D + SE
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSE 196

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           C ++G AVKP +GDALLFFSLHP+A  D+ SLH  CPVIEGEKWSATKWIHV +FDK   
Sbjct: 197 CGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 256

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                DC D+  +C  WA  GEC KNP YMVG+    GYCRKSCK C
Sbjct: 257 --AGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSCKTC 301


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score =  294 bits (752), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 145/227 (63%), Positives = 173/227 (76%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTSSG F+ KA+D IV+ IE +IAAWTFLP +NGE +Q+L YE+GQ
Sbjct: 78  VADNLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQ 137

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+ HFD+F DK+N   GGHR+ATVLMYLS VEKGGETVFP++E SQ R       + S+
Sbjct: 138 KYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSD 197

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP KGDALLFFSLHP+A  D++SLHG CPVIEGEKWSATKWI V +FD   +
Sbjct: 198 CAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVR 257

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             +  +C DE+ +C  WA+ GEC  NP YMVGS    GYCRKSCK C
Sbjct: 258 --DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC 302


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score =  292 bits (747), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 141/227 (62%), Positives = 172/227 (75%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  S+VRTSSGMF+SK +D I++ IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNLSGESQLSDVRTSSGMFISKNKDPIISGIEDKISSWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHRIATVLMYL++V KGGETVFP++E    R G     + SE
Sbjct: 134 KYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSE 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSLH +A+ D++SLH  CPVIEGEKWSATKWIHV +FDK   
Sbjct: 194 CAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                DC D  ++C  WA  GEC KNP YM+GS    GYCRKSCK C
Sbjct: 254 --AGGDCSDHHVSCERWASLGECTKNPEYMIGSSDVPGYCRKSCKSC 298


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score =  292 bits (747), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 148/227 (65%), Positives = 167/227 (73%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTSSGMF+ K +D IVA IE +IAAWTFLP +NGE MQ+L YE GQ
Sbjct: 74  VADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           KY+ H+D+F DK+N   GGHRIATVLMYLS V KGGETVFP +E    R     + + SE
Sbjct: 134 KYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSE 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CAR+G AVKP KGDALLFFSLHP A  D  SLHG CPVIEGEKWSATKWIHV +FDK  K
Sbjct: 194 CARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIHVDSFDKILK 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C DE+ +C  WA  GEC KNP YM+GS    G CR+SCKVC
Sbjct: 254 --PGGNCTDENDSCERWAALGECTKNPEYMLGSSDLPGACRRSCKVC 298


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score =  291 bits (744), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 140/227 (61%), Positives = 170/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN+SG+S  SEVRTSSG F+SK +D IV+ IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           KY+ HFD+F DK+N   GGHR+AT+LMYLS+V KGGETVFP++E+   R     + + S+
Sbjct: 134 KYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSD 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA+RG AVKP KGDALLFF+LHPDA  D  SLHG CPVIEGEKWSATKWIHV +FD+   
Sbjct: 194 CAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVT 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C D + +C  WA  GEC KNP YMVG+    GYCR+SCK C
Sbjct: 254 --PSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score =  290 bits (743), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 142/227 (62%), Positives = 170/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  S+VRTSSGMF+SK +D IVA IE +I++WTFLP ENGE +Q+  YEHGQ
Sbjct: 73  VADNLSGESQLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQ 132

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHRIATVLMYL+ V KGGETVFP++E    R G     + SE
Sbjct: 133 KYDPHYDYFTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSE 192

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSLH +A+ D++SLH  CPVIEGEKWSATKWIHV +FDK   
Sbjct: 193 CAKKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 252

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                DC D  ++C  WA  GEC KNP YM+GS    GYCRKSCK C
Sbjct: 253 --AGGDCSDNHVSCERWASLGECTKNPEYMIGSSDIPGYCRKSCKAC 297


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score =  290 bits (742), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 140/227 (61%), Positives = 169/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN+SG+S  SEVRTSSG F+SK +D IV+ IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           KY+ HFD+F DK+N   GGHR+AT+LMYLS+V KGGETVFP++E+   R       + S+
Sbjct: 134 KYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSD 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA+RG AVKP KGDALLFF+LHPDA  D  SLHG CPVIEGEKWSATKWIHV +FD+   
Sbjct: 194 CAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVT 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C D + +C  WA  GEC KNP YMVG+    GYCR+SCK C
Sbjct: 254 --PSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score =  290 bits (742), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 140/223 (62%), Positives = 169/223 (75%), Gaps = 3/223 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTSSG FL K QD IV  IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88  VADNMSGKSTLSEVRTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 147

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECARR 120
           KYEPH+D+F D +N   GGHR ATVL+YL+ V +GGETVFP + E   ++D   SECA++
Sbjct: 148 KYEPHYDYFTDNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK 207

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
           G AV+P KGDALLFF+L+PD +TDS SLHG CPVI+GEKWSATKWI V +FDK    P+ 
Sbjct: 208 GIAVRPRKGDALLFFNLNPDGTTDSVSLHGGCPVIKGEKWSATKWIRVASFDKVH-HPQ- 265

Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            +C DE+ +C  WA  GEC KNP YMVG+ +  GYCR+SC VC
Sbjct: 266 GNCTDENESCAKWAALGECIKNPEYMVGTTALPGYCRRSCNVC 308


>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
          Length = 839

 Score =  290 bits (741), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 143/227 (62%), Positives = 168/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  SEVRTSSGMF+ K +D IVA IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 615 VADNLSGESKLSEVRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQ 674

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL+ V KGGETVFP++E S    G     N SE
Sbjct: 675 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSE 734

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSL+P+A  D+ SLH  CPVIEGEKWSATKWIHV +FDK   
Sbjct: 735 CAQKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKVVG 794

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             +  DC D+  NC  WA  GEC  NP YMVGS    GYC KSCK C
Sbjct: 795 --DGGDCNDKHENCERWATLGECTSNPEYMVGSPGLPGYCMKSCKEC 839


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score =  289 bits (739), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 140/223 (62%), Positives = 167/223 (74%), Gaps = 3/223 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  SEVRTSSGMF+ K +D IVA +E +I++WT LP ENGE +Q+L YEHGQ
Sbjct: 77  VADNLSGESKLSEVRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQ 136

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARR 120
           KY+PH+D+F DK+N   GGHR+ATVLMYL+ V KGGETVFPN+E+  S    + SECA++
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQK 196

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
           G AVKP +GDALLFFSL+P+A  D+ SLH  CPVIEGEKWSATKWIHV +FDK     + 
Sbjct: 197 GIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDK--MVADG 254

Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            DC D+  NC  WA  GEC  NP YMVGS    GYC KSCK C
Sbjct: 255 GDCNDKQENCDRWATLGECTSNPNYMVGSPGLPGYCMKSCKAC 297


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score =  288 bits (738), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 141/230 (61%), Positives = 170/230 (73%), Gaps = 13/230 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG S  S+VRTSSGMF+SK +D IVA IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 78  VADNLSGDSKLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQ 137

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--------SQSRDGN 113
           KY+PH+DFF DK+N   GGHR+ATVLMYL++V +GGETVFPN+EV        S++ D +
Sbjct: 138 KYDPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETID-D 196

Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
            SECA++G AVKP +GDALLFFSL+P+A  D+ SLH  CPVIEGEKWSATKWIHV +FD+
Sbjct: 197 LSECAKKGIAVKPRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIHVDSFDR 256

Query: 174 PEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                   DC D   +C  WA  GEC  NP YMVGS    GYC +SCK C
Sbjct: 257 ----KAGGDCTDHHESCASWAAVGECTNNPEYMVGSAGLPGYCMRSCKAC 302


>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 300

 Score =  287 bits (735), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 141/227 (62%), Positives = 170/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN+SGKS  S VRTSSGMF+SK +D IV+ IE +I+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 76  VADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQ 135

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           KYE H+D+F DK+N   GGHR+ATVLMYLS+V +GGETVFP +E    R     D + SE
Sbjct: 136 KYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSE 195

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP KGDALLFFSL P+A  D+ SLHG CPV+EGEKWSATKWIHV +F K   
Sbjct: 196 CAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSK--N 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             +  +C D + +C  WA  GEC KNP YMVGS    GYCR+SC++C
Sbjct: 254 LGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score =  287 bits (734), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 141/227 (62%), Positives = 167/227 (73%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  SEVRTSSGMF+ K +D IVA +E +I++WT LP ENGE +Q+L YEHGQ
Sbjct: 77  VADNLSGESKLSEVRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQ 136

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL+ V KGGETVFPN+E S    G     + SE
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSE 196

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSL+P+A  D+ SLH  CPVIEGEKWSATKWIHV +FDK   
Sbjct: 197 CAQKGIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDK--M 254

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             +  DC D+  NC  WA  GEC  NP YMVGS    GYC KSCK C
Sbjct: 255 VADGGDCNDKQENCDRWATLGECTSNPNYMVGSPGLPGYCMKSCKAC 301


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score =  287 bits (734), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 142/227 (62%), Positives = 166/227 (73%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN+SG+S  SEVRTSSG F+ K +D IV+ IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNDSGESKFSEVRTSSGTFIPKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           KY+ HFD+F DK+N   GGHRIATVLMYLS+V KGGETVFP++EV   R       + S+
Sbjct: 134 KYDAHFDYFHDKVNIVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSD 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA+RG AVKP KGDALLFF+LHPDA  D  SLHG CPVIEGEKWSATKWIHV +FDK   
Sbjct: 194 CAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDKIVT 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C +   +C  WA  GEC KNP YMVG+    GYCR SCK C
Sbjct: 254 --PSGNCTNMHESCERWAVLGECTKNPEYMVGTTELPGYCRHSCKAC 298


>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score =  286 bits (732), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 142/227 (62%), Positives = 168/227 (74%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG+S  SEVRTSSGMF+ K +D IVA IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 77  VADNLSGESKLSEVRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQ 136

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL+ V KGGETVFP++E S    G     N SE
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSE 196

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSL+P+A  D+ SLH  CPVIEGEKWSAT+WIHV +FDK   
Sbjct: 197 CAQKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIHVDSFDKVVG 256

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             +  DC D+  NC  WA  GEC  NP YMVGS    GYC KSCK C
Sbjct: 257 --DGGDCNDKHENCERWATLGECTSNPEYMVGSPGLPGYCMKSCKEC 301


>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
 gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
          Length = 308

 Score =  286 bits (732), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 138/223 (61%), Positives = 165/223 (73%), Gaps = 3/223 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  S+VRTSSG FL K QD IV  IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88  VADNMSGKSTLSDVRTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 147

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECARR 120
           KYEPH+D+F D +N   GGHR ATVL+YL+ V +GGETVFP + EV  ++D  +SECA++
Sbjct: 148 KYEPHYDYFTDNVNTIRGGHRYATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQK 207

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
           G AVKP KGDALLFF+L PD +TD  SLHG C VI GEKWSATKWI V +FDK       
Sbjct: 208 GIAVKPRKGDALLFFNLKPDGTTDPVSLHGGCAVIRGEKWSATKWIRVASFDKVHY--PQ 265

Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            +C DE+ +C  WA  GEC KNP YMVG+ +  GYCR+SC VC
Sbjct: 266 GNCTDENESCSKWAALGECIKNPEYMVGTTALPGYCRRSCNVC 308


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score =  284 bits (727), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 140/227 (61%), Positives = 166/227 (73%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG S  S+VRTSSGMF+SK +D IV+ IE RI+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNLSGDSQLSDVRTSSGMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL++V KGGETVFP +E    R G     + SE
Sbjct: 134 KYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSE 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSL  +A  D+ SLH  CPV+EGEKWSATKWIHV +FDK   
Sbjct: 194 CAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSFDKIVG 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                 C D+  +C  WA  GEC  NP+YMVGS    GYCRKSCK C
Sbjct: 254 --AGGGCSDQHDSCERWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298


>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 308

 Score =  283 bits (725), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 140/226 (61%), Positives = 167/226 (73%), Gaps = 9/226 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VAD  SGKS  SEVRTSSG F+SK +D IVA IE +IAAWTFLP ENGE MQ+L Y+ G+
Sbjct: 88  VADETSGKSQLSEVRTSSGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGE 147

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN----WSEC 117
           KYEPH+DFF D +N  LGGHR+ATVL+YL+ V +GGETVFP   +++ R G+     SEC
Sbjct: 148 KYEPHYDFFTDSVNTILGGHRVATVLLYLTDVAEGGETVFP---LAKGRKGSHHKGLSEC 204

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE 177
           A++G AVKP KGDALLFF+L PDA+TD TSLHG C VI+GEKWSATKWI V +FDK    
Sbjct: 205 AQKGIAVKPRKGDALLFFNLRPDAATDPTSLHGGCEVIKGEKWSATKWIRVASFDKVYHS 264

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           P   +C D   +C  WA  GEC KNP YMVG+    G+CR+SC VC
Sbjct: 265 P--GNCTDNSNSCSQWAALGECTKNPAYMVGTAVLPGHCRRSCNVC 308


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score =  283 bits (725), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 143/228 (62%), Positives = 171/228 (75%), Gaps = 8/228 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTSSG F+ KA+D IV+ IE +IAAWTFLP +NGE +Q+L YE+GQ
Sbjct: 78  VADNLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQ 137

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVF----PNSEVSQSRDGN--WS 115
           KY+ HFD+F DK+N   GGHR+ATVLMYLS VEKGGETVF      S+  Q+ + N   S
Sbjct: 138 KYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLS 197

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           +CA++G AVKP KGDALLFFSLHP+A  D++SLHG CPVIEGEKWSATKWI V +FD   
Sbjct: 198 DCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVV 257

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +  +  +C DE+ +C  WA+ GEC  NP YMVGS    GYCRKSCK C
Sbjct: 258 R--DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC 303


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score =  282 bits (722), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 139/227 (61%), Positives = 165/227 (72%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG S  S+VRTSSGM +SK +D IV+ IE RI+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNLSGDSQLSDVRTSSGMLISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL++V KGGETVFP +E    R G     + SE
Sbjct: 134 KYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSE 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSL  +A  D+ SLH  CPV+EGEKWSATKWIHV +FDK   
Sbjct: 194 CAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSFDKIVG 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                 C D+  +C  WA  GEC  NP+YMVGS    GYCRKSCK C
Sbjct: 254 --AGGGCSDQHDSCERWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score =  282 bits (722), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 139/227 (61%), Positives = 165/227 (72%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SG S  S+VRTSSGMF+SK +D IV+ IE RI+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 74  VADNLSGDSQLSDVRTSSGMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQ 133

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVLMYL++V KGGETVFP +E    R G     + SE
Sbjct: 134 KYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSE 193

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSL  +A  D+ SLH  CPV+EGEKWSATKWIHV + DK   
Sbjct: 194 CAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSLDKIVG 253

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                 C D+  +C  WA  GEC  NP+YMVGS    GYCRKSCK C
Sbjct: 254 --AGGGCSDQHDSCERWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298


>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
           Group]
 gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
 gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  281 bits (720), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 137/227 (60%), Positives = 166/227 (73%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  S+ RTSSG F+ K+QD IVA IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 95  VADNLSGKSELSDARTSSGTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGE 154

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSE 116
           KYE H+D+F D +N   GGHRIATVLMYL+ V +GGETVFP +E      + + D   SE
Sbjct: 155 KYERHYDYFSDNVNTLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSE 214

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP KGDALLFF+L PDAS DS SLH  CPVI+GEKWSATKWI V +FDK   
Sbjct: 215 CAKKGVAVKPRKGDALLFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIRVASFDKVYH 274

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C D++ +C  WA  GEC KNP YM+G+ +  GYCRKSC +C
Sbjct: 275 --TQGNCTDDNESCEKWAALGECIKNPEYMIGTAALPGYCRKSCNIC 319


>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
          Length = 319

 Score =  281 bits (719), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 137/227 (60%), Positives = 166/227 (73%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  S+ RTSSG F+ K+QD IVA IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 95  VADNLSGKSELSDARTSSGTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGE 154

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSE 116
           KYE H+D+F D +N   GGHRIATVLMYL+ V +GGETVFP +E      + + D   SE
Sbjct: 155 KYERHYDYFSDNVNTLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSE 214

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP KGDALLFF+L PDAS DS SLH  CPVI+GEKWSATKWI V +FDK   
Sbjct: 215 CAKKGVAVKPRKGDALLFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIRVASFDKVYH 274

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                +C D++ +C  WA  GEC KNP YM+G+ +  GYCRKSC +C
Sbjct: 275 --TQGNCTDDNESCEKWAALGECIKNPEYMIGTAALPGYCRKSCNIC 319


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score =  281 bits (718), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/227 (61%), Positives = 164/227 (72%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN  G S  SEVRTSSGMF+SK +D IVA IE +I+AWTFLP ENGE MQ+L YEHGQ
Sbjct: 73  VADNLPGDSKLSEVRTSSGMFISKKKDPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQ 132

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
           KY+PH+D+F DK+N   GGHR+ATVL+YL++V +GGETVFP +E    R G     + SE
Sbjct: 133 KYDPHYDYFTDKVNIVRGGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSE 192

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP +GDALLFFSLH  A  D+ SLH  CPVIEGEKWSATKWIHV +FDK   
Sbjct: 193 CAKKGIAVKPRRGDALLFFSLHTTAIPDTDSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 252

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                DC D+  +C  WA  GEC  NP YMVGS    G CR+SCK C
Sbjct: 253 --AGGDCSDQHESCQRWASLGECTNNPEYMVGSSDLPGSCRRSCKAC 297


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score =  279 bits (714), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 141/242 (58%), Positives = 168/242 (69%), Gaps = 22/242 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE-------------- 47
           V D ESG+S+ S+VRTSSGMFL K QDE+VA IE RIAAWT LP E              
Sbjct: 80  VVDGESGESVTSKVRTSSGMFLDKKQDEVVARIEERIAAWTMLPTECIIFYCFANFAILK 139

Query: 48  ---NGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS 104
              NGE+MQIL Y  G+KYEPHFD+   +      G R+ATVLMYLS+V+ GGET+FP+ 
Sbjct: 140 LSENGESMQILRYGQGEKYEPHFDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDC 199

Query: 105 E--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSA 162
           E  +SQ +D  WS+CA +G+AVKP KG A+LFFSLHP+A+ D+ SLHGSCPVIEGEKWSA
Sbjct: 200 EARLSQPKDETWSDCAEQGFAVKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSA 259

Query: 163 TKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVG-SKSSRGYCRKSCK 221
           TKWIHVR++    +      C DE + C  WA AGEC KNP YMVG S S  G+CRKSC 
Sbjct: 260 TKWIHVRSYSYRRRSA--GKCEDEHVLCSSWAAAGECAKNPGYMVGTSDSPPGFCRKSCN 317

Query: 222 VC 223
           VC
Sbjct: 318 VC 319


>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
          Length = 299

 Score =  277 bits (709), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 135/228 (59%), Positives = 171/228 (75%), Gaps = 9/228 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN++G+S  S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 75  VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
           KY+ HFD+F DK+N   GGHRIATVL+YLS+V KGGETVFP+++      +S+++D + S
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKD-DLS 193

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           +CA++G AVKP KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV +FDK  
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 253

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               D +C D + +C  WA  GEC KNP YMVG+    G CR+SCK C
Sbjct: 254 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299


>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
 gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
          Length = 299

 Score =  277 bits (709), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 135/228 (59%), Positives = 171/228 (75%), Gaps = 9/228 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN++G+S  S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 75  VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
           KY+ HFD+F DK+N   GGHRIATVL+YLS+V KGGETVFP+++      +S+++D + S
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKD-DLS 193

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           +CA++G AVKP KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV +FDK  
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 253

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               D +C D + +C  WA  GEC KNP YMVG+    G CR+SCK C
Sbjct: 254 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299


>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 297

 Score =  277 bits (709), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 135/228 (59%), Positives = 171/228 (75%), Gaps = 9/228 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN++G+S  S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 73  VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 132

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
           KY+ HFD+F DK+N   GGHRIATVL+YLS+V KGGETVFP+++      +S+++D + S
Sbjct: 133 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKD-DLS 191

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           +CA++G AVKP KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV +FDK  
Sbjct: 192 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 251

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               D +C D + +C  WA  GEC KNP YMVG+    G CR+SCK C
Sbjct: 252 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 297


>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
          Length = 187

 Score =  275 bits (704), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 126/179 (70%), Positives = 150/179 (83%), Gaps = 2/179 (1%)

Query: 47  ENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE- 105
           ENGE++QILHYE+G+KYEPH+D+F D+ NQ +GGHRIATVLMYLS V KGGET+FPN+E 
Sbjct: 7   ENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAES 66

Query: 106 -VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATK 164
            +SQ +D +WSECA +GYAVKP KGDALLFFSLH +A+TDS SLHGSCPVIEGEKWSATK
Sbjct: 67  KLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWSATK 126

Query: 165 WIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           WIHV +F+K  K+ ++ DC DE+ NC  WAK GEC KNPLYM+G K  +GYC KSC VC
Sbjct: 127 WIHVSDFEKAIKQDDNGDCTDENENCSRWAKLGECVKNPLYMIGGKGVKGYCMKSCNVC 185


>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 313

 Score =  275 bits (703), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 135/227 (59%), Positives = 162/227 (71%), Gaps = 7/227 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTS G F+SK +D IVA IE +IAAWTFLP ENGE MQ+L Y+ G+
Sbjct: 89  VADNTSGKSTLSEVRTSYGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGE 148

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP-----NSEVSQSRDGNWSE 116
           K EP FDFF D +N   GGHR+ATVL+YL+ V +GGETVFP            +D   SE
Sbjct: 149 KDEPQFDFFTDTVNTVRGGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSE 208

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           CA++G AVKP KGDALLFF+L PDA+TD  SLHG C VI+GEKW+ATKWI V +FDK   
Sbjct: 209 CAQKGIAVKPRKGDALLFFNLRPDAATDPLSLHGGCTVIKGEKWTATKWIRVASFDKVYH 268

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            P   +C D + +CV WA  GEC KNP YM+G+ +  G+CR+SC VC
Sbjct: 269 MP--GNCSDNNDSCVRWAALGECIKNPPYMIGTAALPGHCRRSCNVC 313


>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
 gi|194706408|gb|ACF87288.1| unknown [Zea mays]
 gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
 gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
          Length = 217

 Score =  275 bits (703), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 127/201 (63%), Positives = 155/201 (77%), Gaps = 2/201 (0%)

Query: 25  KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIA 84
           + +DEIV++IE R+AAWTFLP EN E++Q+L YE GQKY+ HFD+F D+ N +LGG R+A
Sbjct: 15  QPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVA 74

Query: 85  TVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAS 142
           TVLMYL+ V KGGETVFPN+E S  Q +D  WSEC+R G AVKP KGDALLFF+LH +A+
Sbjct: 75  TVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNAT 134

Query: 143 TDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKN 202
            D+ SLHGSCPVIEGEKWSATKWIHVR+FD P     D  C D+   C  WA  GEC +N
Sbjct: 135 ADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVRTDAPCSDDKELCPRWAAIGECHRN 194

Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
           P YMVG+K + G+CRKSC +C
Sbjct: 195 PTYMVGTKDTLGFCRKSCGIC 215


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score =  272 bits (695), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 129/178 (72%), Positives = 150/178 (84%), Gaps = 2/178 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVA++E+G+S+ S+ RTSSGMF+ K +DEIV  IEARIAAWTFLP ENGE +QIL YEHG
Sbjct: 54  MVANDETGESMESQERTSSGMFIFKTEDEIVNGIEARIAAWTFLPEENGEPIQILRYEHG 113

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
           QKYE H D+F DK NQ+ GGHR ATVLMYLS V+KGGETVFP SE   SQ++D +WS+CA
Sbjct: 114 QKYEAHIDYFVDKANQEEGGHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCA 173

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           ++GYAVKP KGDALLFFSLHPDA+ D  SLH SCPVIEGEKWSATKWIHVR+F +P K
Sbjct: 174 KKGYAVKPNKGDALLFFSLHPDATPDPGSLHASCPVIEGEKWSATKWIHVRSFSEPVK 231


>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score =  272 bits (695), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 134/228 (58%), Positives = 169/228 (74%), Gaps = 9/228 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN++G+S  S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YE GQ
Sbjct: 75  VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQ 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
           KY+ HFD+F DK+N   GGHRIATVL+YLS+V KGGETVFP+++      +S+++D + S
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKD-DLS 193

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           +CA++G AVKP KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV +FDK  
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 253

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               D +C D + +C  WA  GEC KNP YMVG+    G CR SCK C
Sbjct: 254 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPELPGNCRHSCKAC 299


>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
          Length = 369

 Score =  271 bits (694), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 124/172 (72%), Positives = 147/172 (85%), Gaps = 3/172 (1%)

Query: 26  AQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIAT 85
            QDE+V  IE RI+AWTFLPPENGE++QILHY++G+KYEPH+D+F DK NQ LGGHRIAT
Sbjct: 192 TQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIAT 251

Query: 86  VLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
           VLMYLS+VEKGGET+FPN+E  + Q +D  WS+CAR GYAVKP+KGDALLFFSLHPDA+T
Sbjct: 252 VLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311

Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWA 194
           DS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P   D C D+++ C  WA
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQWA 363


>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
          Length = 208

 Score =  266 bits (681), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 127/210 (60%), Positives = 152/210 (72%), Gaps = 9/210 (4%)

Query: 21  MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
           MF+ K +D I++ IE +IAAWTFLP ENGE MQ+L YE G+KY+PHFDFF+DK+N   GG
Sbjct: 1   MFIPKGKDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGG 60

Query: 81  HRIATVLMYLSHVEKGGETVFPNSEVSQSR-------DGNWSECARRGYAVKPMKGDALL 133
           HR+ATVLMYL+ V KGGETVFP++E    R       D   S+CA+RG AVKP +GDALL
Sbjct: 61  HRVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAKRGTAVKPKRGDALL 120

Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVW 193
           FFSL   A  D+ SLH  CPVIEGEKWS TKWIHV +FDKP +    D+CVD++  C  W
Sbjct: 121 FFSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWIHVESFDKPRQ--SSDNCVDQNPRCGEW 178

Query: 194 AKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           A  GEC  NP+YM+GS    G CRKSCKVC
Sbjct: 179 AAYGECNNNPIYMLGSPDLPGACRKSCKVC 208


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score =  261 bits (668), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 130/223 (58%), Positives = 159/223 (71%), Gaps = 16/223 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SE             D IV  IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88  VADNMSGKSTLSE-------------DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECARR 120
           KYEPH+D+F D +N   GGHR ATVL+YL+ V +GGETVFP + E   ++D   SECA++
Sbjct: 135 KYEPHYDYFTDNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK 194

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
           G AV+P KGDALLFF+L+PD +TDS SLHG CPVI+GEKWSATKWI V +FDK    P+ 
Sbjct: 195 GIAVRPRKGDALLFFNLNPDGTTDSVSLHGGCPVIKGEKWSATKWIRVASFDKVH-HPQ- 252

Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            +C DE+ +C  WA  GEC KNP YMVG+ +  GYCR+SC VC
Sbjct: 253 GNCTDENESCAKWAALGECIKNPEYMVGTTALPGYCRRSCNVC 295


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score =  261 bits (667), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 128/188 (68%), Positives = 149/188 (79%), Gaps = 6/188 (3%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E MQ+L YE G
Sbjct: 81  MVADNESGKSVKSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPG 140

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F D++NQ  GGHR ATVLMYLS V +GGETVFPN++   SQ +D  +SECA
Sbjct: 141 QKYEPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECA 200

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEKE 177
            +G AVKP+KGDA+LFFSLH D + D  SLHGSCPVI GEKWSA KWIHVR++ D+P+  
Sbjct: 201 HKGLAVKPVKGDAVLFFSLHADGTPDPLSLHGSCPVIRGEKWSAPKWIHVRSYEDEPQAV 260

Query: 178 ---PEDDD 182
              PE+ D
Sbjct: 261 LVLPEETD 268


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score =  259 bits (663), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 128/222 (57%), Positives = 157/222 (70%), Gaps = 18/222 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN++G+S  S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 75  VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 134

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           KY+ HFD+F DK+N   GGHRIATVL+YLS+V KGGETVFP+++V               
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQV--------------- 179

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
             +KP KG+ALLFF+L  DA  D  SLHG CPVIEGEKWSATKWIHV +FDK      D 
Sbjct: 180 -CLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKILT--HDG 236

Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +C D + +C  WA  GEC KNP YMVG+    G CR+SCK C
Sbjct: 237 NCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 278


>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
 gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score =  252 bits (644), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 126/227 (55%), Positives = 158/227 (69%), Gaps = 6/227 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DN++G+S+ S +RTS GMF  + +D+I+  IE RIA WT +P ENGE +Q+L YE GQ
Sbjct: 42  VVDNKTGQSVPSNIRTSDGMFFDRHEDDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQ 101

Query: 62  KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECA 118
           KYEPH D F DK N  +  GG R+ATVLMYLS VE+GGETVFP S +     D  WSECA
Sbjct: 102 KYEPHLDAFSDKFNTEESKGGQRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECA 161

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE--K 176
           +RG AVK  KGDALLF+SL  D++ D  SLHG CPVI+G KWSATKW+H+++FD     K
Sbjct: 162 QRGVAVKARKGDALLFWSLDIDSNVDELSLHGGCPVIKGTKWSATKWMHLKSFDTANSFK 221

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            PE   C D +  C  WA  GEC+KNP YM+G+  + GYC ++C  C
Sbjct: 222 FPE-GVCDDVNEQCEGWASTGECEKNPKYMIGNGKTDGYCVRACGKC 267


>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
           C-169]
          Length = 347

 Score =  252 bits (644), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 129/232 (55%), Positives = 156/232 (67%), Gaps = 10/232 (4%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DN++GKSI S VRTS+G F  + +DE++  IE RI+  T LP  NGE +QILHYE GQ
Sbjct: 120 VVDNDTGKSIDSTVRTSTGTFFGREEDEVIQGIERRISMITHLPEVNGEGLQILHYEDGQ 179

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYE H DFF DK N   + GG RIATVLMYL+  E+GGETVFP +  ++     WSECAR
Sbjct: 180 KYEAHHDFFHDKFNSRPENGGQRIATVLMYLTTAEEGGETVFPMA-ANKVTGPQWSECAR 238

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEKEP 178
            G AVK  +GDALLF+SL P+  TD TSLHGSCP  +GEKWSATKWIHV  F    E++ 
Sbjct: 239 GGAAVKSRRGDALLFYSLLPNGETDPTSLHGSCPTTKGEKWSATKWIHVGPFGGSSEQQR 298

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPSSVSS 230
              +C+D D  C  WA  GECKKNP YM+ S      CR SC  C P+S ++
Sbjct: 299 AKGECIDADERCSGWAADGECKKNPGYMMSS------CRLSCHTCTPASKTT 344


>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 297

 Score =  248 bits (634), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 123/228 (53%), Positives = 158/228 (69%), Gaps = 16/228 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A  T +P EN E +Q+LHY  GQ
Sbjct: 79  VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 138

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH+D+F D +N   + GG R+ T+LMYL+ VE+GGETV PN+E   + DG WSECA+
Sbjct: 139 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 197

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK--- 176
           RG AVKP+KGDAL+F+SL PD S D  SLHGSCP ++G+KWSATKWIHV      +K   
Sbjct: 198 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHVAPIGGKKKLNL 257

Query: 177 -EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             PE   C DED  C  WA  GEC+KNP +M         C++SCK C
Sbjct: 258 GTPE---CHDEDERCQEWAFFGECEKNPGFM------DAQCKRSCKKC 296


>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
 gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
          Length = 309

 Score =  246 bits (629), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 123/226 (54%), Positives = 156/226 (69%), Gaps = 10/226 (4%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DN SGKS+ SE+RTS+G +L+K +DEI++ IE R+A  T +P EN E +Q+LHY  GQ
Sbjct: 91  VVDNASGKSVDSEIRTSTGAWLAKGEDEIISRIEKRVAQVTMIPLENHEGLQVLHYHDGQ 150

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH+D+F D +N   + GG R+ TVLMYL+ VE+GGETV P+++   S +G WSECA+
Sbjct: 151 KYEPHYDYFHDPVNASPEHGGQRVVTVLMYLTTVEEGGETVLPHADQKVSGEG-WSECAK 209

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEKEP 178
           RG AVKP+KGDAL+F+SL PD S D  SLHGSCP ++G+KWSATKWIHV     K     
Sbjct: 210 RGLAVKPVKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHVGPIGGKKAVSL 269

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
              +C D    C  WA  GEC+KNP YM      R  C +SCK CK
Sbjct: 270 GTPECHDSMEQCTEWAFFGECEKNPGYM------RENCARSCKTCK 309


>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
          Length = 287

 Score =  237 bits (605), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 120/224 (53%), Positives = 152/224 (67%), Gaps = 13/224 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DN++GKS+ S VRTSSG FL++ +DE+V +IE RI+  T +P ENGEA+QIL Y  GQ
Sbjct: 75  VVDNKTGKSMDSTVRTSSGTFLARGEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQ 134

Query: 62  KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH D+F DK N +   GG R+AT+LMYLS  E+GGETVFP +E     +G WSECAR
Sbjct: 135 KYEPHTDYFHDKYNSRTENGGQRVATILMYLSTPEEGGETVFPYAEKKVEGEG-WSECAR 193

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
           +G AVK +KG ALLF+SL P+   D  S HGSCP + GEKWSAT+WIHV  F     +  
Sbjct: 194 KGLAVKAVKGSALLFYSLKPNGEEDQASTHGSCPTLAGEKWSATRWIHVGAFQPGGAK-- 251

Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              C DE+  C  WA  GEC+ NP +M      +  C+KSC++C
Sbjct: 252 --GCKDENEKCEEWAVMGECQNNPAFM------KSNCKKSCELC 287


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score =  234 bits (596), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 114/233 (48%), Positives = 154/233 (66%), Gaps = 7/233 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G S+ S++RTSSGMFL + +D++VASIE RIA+WT +P  +GE  Q+L YE GQ
Sbjct: 93  VVDASNGGSVPSDIRTSSGMFLLRGEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQ 152

Query: 62  KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG--NWSEC 117
           +Y PHFD+F+D+ NQ+   GG R+ATVLMYL+ VE+GGET+FP++E   +  G  + S C
Sbjct: 153 EYRPHFDYFQDEFNQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSC 212

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE 177
           A    AVKP KGDAL F SLH + ++D+ S H  CPV++G K+SATKW+HV   +     
Sbjct: 213 AAGKLAVKPRKGDALFFRSLHHNGTSDAMSSHAGCPVVKGVKFSATKWMHVAPIEDSATA 272

Query: 178 P---EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPSS 227
               E   C D +  C  WA +GEC KNP +MVG   + G C +SC  C P +
Sbjct: 273 SVRFEPGVCKDVNAACEGWASSGECTKNPSFMVGRGRANGNCMRSCGACPPGT 325


>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
          Length = 300

 Score =  232 bits (591), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 108/226 (47%), Positives = 155/226 (68%), Gaps = 7/226 (3%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN  G+S+ S++RTS GMF  + +DE+V  +E R++ W+ +PP +GE +Q+L YE+G++Y
Sbjct: 57  DNPGGESV-SDIRTSYGMFFDRGEDEVVREVERRLSEWSLIPPGHGEGIQVLRYENGEEY 115

Query: 64  EPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRG 121
           +PHFD+F D ++ Q GG+R+AT+LMYL+  E GGETVFPN +    Q+ +  +SECA +G
Sbjct: 116 KPHFDYFFDNLSVQNGGNRLATILMYLAEPEFGGETVFPNVKAPPEQTLEAGYSECATQG 175

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF----DKPEKE 177
            AVKP KGDA+LFFSL  + + D  SLHGSCP ++G K++ATKW HV ++    ++    
Sbjct: 176 LAVKPRKGDAVLFFSLRTEGTLDKGSLHGSCPTLKGFKFAATKWYHVAHYAMGGERAPVL 235

Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           P    C DE   CV WA+ GEC+ NP +MVG+K   G C  +C  C
Sbjct: 236 PASAGCKDEKDACVGWAEGGECESNPGFMVGTKEQPGACLLACGRC 281


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 113/234 (48%), Positives = 153/234 (65%), Gaps = 12/234 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + +SGKS    VRTS G FL++  D ++A IEARIA WT +P  NGE +Q+L YEHGQ
Sbjct: 100 VVETDSGKSKIDNVRTSKGTFLNRGHDSVIADIEARIAKWTLMPAGNGEGLQVLKYEHGQ 159

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARR 120
           +YE H+D+F  K     GG+R  TVLMYL+ VE+GGET FPN       +G  +SECAR+
Sbjct: 160 EYEGHYDYFFHKAGTANGGNRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARK 219

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF----DKP-- 174
             A KP KG+A+LF S+ P    +  SLH +CPVI+G KWSA KW+HV ++    +KP  
Sbjct: 220 VLAAKPKKGNAVLFHSIKPTGELERRSLHTACPVIKGVKWSAPKWVHVGHYAVGGEKPQH 279

Query: 175 -EKEPEDD----DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            ++ P+ D    +C ++D  C  WA  GEC+KNP++MVG+K   G+C K+C  C
Sbjct: 280 IQQIPQGDSTYPECKNKDAACDSWAGNGECEKNPVFMVGTKQRPGHCIKACGKC 333


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score =  226 bits (576), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 106/173 (61%), Positives = 135/173 (78%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++GKS  S VRTSSGMFL + +D+IV +IE RIA +TF+P ENGE +QILHYE GQ
Sbjct: 116 VVDSKTGKSTESRVRTSSGMFLKRGKDKIVQNIEKRIADFTFIPEENGEGLQILHYEVGQ 175

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP +  + S    W   S+CA
Sbjct: 176 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCA 235

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           R+G +VKP  GDALLF+S+ PDA+ D +SLHG CPVI+G KWS+TKW+H+R +
Sbjct: 236 RKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHLREY 288


>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
 gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
          Length = 275

 Score =  226 bits (575), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 111/174 (63%), Positives = 131/174 (75%), Gaps = 4/174 (2%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVA N S  S   + RTSSGMFL K QD +V+ IE RIAAWT LP EN E MQI  Y+HG
Sbjct: 81  MVAHNRS--SYYRQTRTSSGMFLRKRQDPVVSRIEERIAAWTLLPRENVEKMQIQRYQHG 138

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKY+PHFD+F DK++   GG R ATVLMYLS V+KGGETVFP ++   SQ +D  +SECA
Sbjct: 139 QKYDPHFDYFDDKIHHTRGGPRYATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECA 198

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
            +G AVKP+KGDA+LFFSLH D   D  +LHGSCPVI+GEKWSA  WIHVR+F+
Sbjct: 199 HKGLAVKPVKGDAVLFFSLHVDGGPDPLTLHGSCPVIQGEKWSAPNWIHVRSFE 252


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 115/226 (50%), Positives = 145/226 (64%), Gaps = 4/226 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G   +SE+RTSSGMFL +A+D+++ +IEARIAAWT +P  +GE  Q+L YE  Q
Sbjct: 63  VVDASTGGDASSEIRTSSGMFLGRAEDDVIEAIEARIAAWTHVPESHGEGFQVLRYEKHQ 122

Query: 62  KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +Y  H+D+F DK N  ++ GG R+ TVLMYLS VE+GGETVFP  E         SECAR
Sbjct: 123 EYRAHYDYFHDKFNVKREKGGQRMGTVLMYLSDVEEGGETVFPKFEDGTPAGSEASECAR 182

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
              AV+P KGDAL F SL  D   D+ S H  CPVI G K+SATKW+HV   +       
Sbjct: 183 NKLAVRPRKGDALFFRSLRHDGVPDTFSEHAGCPVIRGVKFSATKWMHVSPIEDGSNGLL 242

Query: 180 DDDCVDEDLN--CVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               V +DL+  CV WAK+GEC+KN  YMVG   S+G C +SC  C
Sbjct: 243 LPPGVCKDLHAACVAWAKSGECEKNKNYMVGRGRSKGNCMRSCGAC 288


>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 369

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 118/229 (51%), Positives = 147/229 (64%), Gaps = 11/229 (4%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+ + S +RTS GMF  + +D++V ++E RI+AWT LP ENGE MQ+L Y  GQ
Sbjct: 113 VVDTDTGEGVPSAIRTSDGMFFDRGEDDVVDAVERRISAWTRLPTENGEGMQVLRYAGGQ 172

Query: 62  KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVS-QSRDGNWSECA 118
           KY+ H D F DK N     GG R+ATVLMYL+ V+ GGETVFP +       D  +S CA
Sbjct: 173 KYDAHLDAFVDKFNADDAHGGQRVATVLMYLNDVDDGGETVFPETTAKPHVGDERYSACA 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPV-IEGEKWSATKWIHVRNFDKPEKE 177
           RRG AVKP +GDALLF+S+     T + SLHG CPV   G KWS TKWIH   F +  K 
Sbjct: 233 RRGVAVKPRRGDALLFWSMD---ETFTRSLHGGCPVGAGGVKWSMTKWIHKGAFSRGHKM 289

Query: 178 --PEDDDCVDEDLNCVVWAKAGECKKNPLYMVG-SKSSRGYCRKSCKVC 223
             PE   C DED NC  WAK+GEC+KNP YM G  + + G+C  SC  C
Sbjct: 290 KFPE-GVCDDEDANCAGWAKSGECEKNPAYMTGDGRENDGHCAFSCGTC 337


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 106/173 (61%), Positives = 135/173 (78%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DN++GKS  S VRTSSG FL + QDEI++ IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 83  VVDNQTGKSKDSRVRTSSGTFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQ 142

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KY+ H D+F DK+N + GG R+ATVLMYLS VE+GGETVFP+++V+ S    W   SECA
Sbjct: 143 KYDAHHDYFHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECA 202

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP KGDALLF+S+ PDA  D  SLHG CPVI+G KWSATKW+H+R +
Sbjct: 203 KKGVSVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLREY 255


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score =  223 bits (567), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 102/173 (58%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 141 VVDSETGKSKDSRVRTSSGMFLQRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQ 200

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS +E+GGET+FP++ V+ S        SECA
Sbjct: 201 KYEPHFDYFLDEFNTKNGGQRMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECA 260

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           R+G AVKP  GDALLF+S+ PDA+ D  SLHG CPVI+G KWS+TKW+HV  +
Sbjct: 261 RKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWLHVGEY 313


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score =  223 bits (567), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 105/173 (60%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DN++GKS  S VRTSSG FL + QDEI++ IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 83  VVDNQTGKSKDSRVRTSSGTFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQ 142

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KY+ H D+F DK+N + GG R+ATVLMYLS VE+GGETVFP+++V+ S    W   SEC 
Sbjct: 143 KYDAHHDYFHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECG 202

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP KGDALLF+S+ PDA  D  SLHG CPVI+G KWSATKW+H+R +
Sbjct: 203 KKGVSVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLREY 255


>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
 gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
          Length = 329

 Score =  222 bits (566), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 115/238 (48%), Positives = 151/238 (63%), Gaps = 17/238 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V+D  +G+   S++RTSSGMF ++ ++++V  IE R+A WT LP ENGE +Q+L YE  Q
Sbjct: 87  VSDATTGEGGVSDIRTSSGMFYTRGENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQ 146

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECAR 119
           KY+PH D+F  +     GG+R+ATVLMYL+  E+GGETVFP   V   Q+R  N+SEC  
Sbjct: 147 KYDPHHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFPKIPVPAGQTR-ANFSECGM 205

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK-PEKEP 178
           +G AVKP+KGDA+LF+S+ PD   +  SLHGSCPVI G KWSATKWIHV  +    EK  
Sbjct: 206 KGLAVKPVKGDAVLFWSIRPDGRFEPGSLHGSCPVIRGVKWSATKWIHVGPYSMGAEKAV 265

Query: 179 E-------------DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           E                C++    C  WA++GEC+ NP YMVG   S G C  +C  C
Sbjct: 266 EVTRVIYAPPPPPAVPGCINTHKLCDHWAESGECESNPGYMVGQLGSPGACNLACNRC 323


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score =  222 bits (566), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 103/173 (59%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 136 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQ 195

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 196 KYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECA 255

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           R+G AVKP  GDALLF+S+ PDA+ D  SLHG CPVI+G KWS+TKW+HVR +
Sbjct: 256 RKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVREY 308


>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
          Length = 233

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 101/169 (59%), Positives = 131/169 (77%), Gaps = 3/169 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A  T +P EN E +Q+LHY  GQ
Sbjct: 59  VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 118

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH+D+F D +N   + GG R+ T+LMYL+ VE+GGETV PN+E   + DG WSECA+
Sbjct: 119 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 177

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           RG AVKP+KGDAL+F+SL PD S D  SLHGSCP ++G+KWSATKWIHV
Sbjct: 178 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226


>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
 gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
          Length = 224

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 101/169 (59%), Positives = 131/169 (77%), Gaps = 3/169 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A  T +P EN E +Q+LHY  GQ
Sbjct: 50  VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 109

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH+D+F D +N   + GG R+ T+LMYL+ VE+GGETV PN+E   + DG WSECA+
Sbjct: 110 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 168

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           RG AVKP+KGDAL+F+SL PD S D  SLHGSCP ++G+KWSATKWIHV
Sbjct: 169 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 217


>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
          Length = 225

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 101/169 (59%), Positives = 131/169 (77%), Gaps = 3/169 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A  T +P EN E +Q+LHY  GQ
Sbjct: 51  VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 110

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH+D+F D +N   + GG R+ T+LMYL+ VE+GGETV PN+E   + DG WSECA+
Sbjct: 111 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 169

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           RG AVKP+KGDAL+F+SL PD S D  SLHGSCP ++G+KWSATKWIHV
Sbjct: 170 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 218


>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 330

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 113/245 (46%), Positives = 155/245 (63%), Gaps = 27/245 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E+G S+ S++RTS+GMFL K QD+IV +IE RIA  +  P +NGE MQIL Y+ GQKY+P
Sbjct: 82  EAGDSVPSDIRTSAGMFLRKGQDKIVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDP 141

Query: 66  HFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE------- 116
           HFD+F DK+N   + GG R+AT+L+YL   +KGGET FPN+++ QS + +  E       
Sbjct: 142 HFDYFHDKVNPAPKRGGQRLATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHI 201

Query: 117 ----CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
               CA++G  VK ++GDA+LFFS+  D   D  SLHG+CPVIEG+KW+A KWI V  FD
Sbjct: 202 EHTDCAKKGIPVKSVRGDAILFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIRVGKFD 261

Query: 173 ----------KPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSR----GYCRK 218
                     K  +  +++ CVD+   C  WA  G C+ NP +M  + S+R      C K
Sbjct: 262 GNYQEEIPMPKLSRRTDEEPCVDDWDECAKWASQGWCELNPEFMTTADSARDSQSAACAK 321

Query: 219 SCKVC 223
           SC +C
Sbjct: 322 SCGLC 326


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 102/173 (58%), Positives = 133/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 144 VVDSTTGKSKDSRVRTSSGMFLRRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQ 203

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 204 KYEPHFDYFLDEFNTKNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECA 263

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           R+G AVKP  GDALLF+S++PDA+ D  SLHG CPVI G KWS+TKW+HV  +
Sbjct: 264 RKGLAVKPKMGDALLFWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGEY 316


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score =  220 bits (561), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 105/173 (60%), Positives = 131/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G S  S VRTSSG FL + QD+IV +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 126 VVDSATGGSKDSRVRTSSGTFLRRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQ 185

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D  N + GG RIATVLMYLS VE+GGETVFP+++V+ S        SECA
Sbjct: 186 KYEPHFDYFHDDFNTKNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECA 245

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PD + D TSLHG CPVI+G+KWS+TKWI V  +
Sbjct: 246 KRGISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEY 298


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score =  220 bits (560), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 100/173 (57%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++GKS  S +RTSSG FL + QD ++  IE RIA +TF+P E GE +Q+L Y+  +
Sbjct: 40  VVDSDTGKSKDSRLRTSSGTFLMRGQDPVIKRIEKRIADFTFIPAEQGEGLQVLQYKESE 99

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D  N + GG RIATVLMYLS+VE+GGETVFP ++V+++   +W   SECA
Sbjct: 100 KYEPHYDYFHDAYNTKNGGQRIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECA 159

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +V+P  GDALLF+S+ PDA+ DSTSLHG CPVI+G KWSATKW+HV N+
Sbjct: 160 QKGLSVRPRMGDALLFWSMKPDATLDSTSLHGGCPVIKGTKWSATKWLHVENY 212


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score =  219 bits (559), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 104/173 (60%), Positives = 128/173 (73%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSG FL + QD I+  IE RIA +TF+P E GE +Q+L Y   +
Sbjct: 39  VIDSATGKSKDSRVRTSSGTFLVRGQDHIIKRIEKRIADFTFIPVEQGEGLQVLQYRESE 98

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D  N + GG RIATVLMYLS VEKGGETVFP S+V+ S   +W   SECA
Sbjct: 99  KYEPHYDYFHDAFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECA 158

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +V+P  GDALLF+S+ PDA  D TSLHG+CPVI+G KWSATKW+HV  +
Sbjct: 159 KRGLSVRPRMGDALLFWSMKPDAKLDPTSLHGACPVIQGTKWSATKWLHVEKY 211


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score =  219 bits (558), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 98/173 (56%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++GKS+ S VRTSSGMFL + +D+++ +IE RIA + F+P ENGE +Q+LHYE GQ
Sbjct: 14  VVDSQTGKSVGSRVRTSSGMFLKRGKDKVIQTIEKRIADFAFIPVENGEGLQVLHYEVGQ 73

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGET+FP ++ + S      + S CA
Sbjct: 74  KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAKANFSSVPWYNDLSVCA 133

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP +GDALLF+S+ PDA+ D +SLHG CPVI G KWS+TKW+H+  +
Sbjct: 134 KKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSSTKWMHLEEY 186


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score =  218 bits (556), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 102/173 (58%), Positives = 131/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G S  S VRTSSG FL + QD+++ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 132 VVDSATGGSKDSRVRTSSGTFLRRGQDKVIRTIEKRISDFTFIPAENGEGLQVLHYEVGQ 191

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D  N + GG RIAT+LMYLS VE+GGETVFP+++V+ S        SECA
Sbjct: 192 KYEPHFDYFHDDFNTKNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECA 251

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PD + D TSLHG CPVI+G+KWS+TKWI V  +
Sbjct: 252 KRGISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEY 304


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score =  218 bits (555), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 100/173 (57%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++GKS  S VRTSSGMFL + +D+I+ +IE RIA +TF+P ENGE +Q+LHY  G+
Sbjct: 106 VVDSKTGKSTESRVRTSSGMFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGE 165

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D+ N + GG R+ATVLMYLS VE+GGETVFP ++ + S    W   SECA
Sbjct: 166 KYEPHYDYFLDEFNTKNGGQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECA 225

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           R+G ++KP  GDALLF+S+ PDA+ D++SLHG CPVI G KWS+TKW+H+  +
Sbjct: 226 RKGLSLKPKMGDALLFWSMRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEY 278


>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
          Length = 303

 Score =  218 bits (555), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 105/166 (63%), Positives = 128/166 (77%), Gaps = 3/166 (1%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           KS  S VRTSSGMFL++ QD+ + SIE RIA +TF+P E+GE +Q+LHYE GQKYEPHFD
Sbjct: 136 KSNDSRVRTSSGMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFD 195

Query: 69  FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECARRGYAVK 125
           +F D+ N + GG RIATVLMYLS VEKGGETVFP S+V+ S    W   SECA+ G +V+
Sbjct: 196 YFLDEFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVR 255

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           P  GDALLF+S+ PDA  D +SLH  CPVI+G+KWSATKWIHV  +
Sbjct: 256 PRMGDALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIHVGEY 301


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score =  218 bits (554), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 112/235 (47%), Positives = 147/235 (62%), Gaps = 16/235 (6%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G S  S++RTS GMFL +  D+ VA+IE RIA WT LP  NGE +Q+L+Y  G+
Sbjct: 69  VVDTATGGSEISDIRTSKGMFLERGHDDTVAAIEERIARWTLLPVGNGEGLQVLNYHPGE 128

Query: 62  KYEPHFDFFRDKMN-QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECAR 119
           KY+   D+F DK+N +  GG+R ATVLMYL+ VE+GGETVFPN       +G  ++ECAR
Sbjct: 129 KYD---DYFFDKVNGESNGGNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECAR 185

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-------- 171
           R  A KP KG A+LF S+ P    +  SLH +CPV++GEKWSA KWIHV ++        
Sbjct: 186 RHLAAKPTKGSAVLFHSIKPSGDLERRSLHTACPVVKGEKWSAPKWIHVGHYAMGGEAAV 245

Query: 172 ---DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                P+K      C D D NC  WA  GEC+ N ++M+G++   G C KSC  C
Sbjct: 246 PVPQHPQKVGNLLGCEDADENCEQWAANGECENNKVFMIGTRDRPGSCVKSCDAC 300


>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
          Length = 180

 Score =  218 bits (554), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 133/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 65

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S        S+CA
Sbjct: 66  KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 125

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI+G KWS+TKW+H+  +
Sbjct: 126 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 178


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score =  216 bits (551), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPADHGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECA 252

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI G KWS+TKW+H+  +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 305


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score =  216 bits (551), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 133/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S        S+CA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 252

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI+G KWS+TKW+H+  +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305


>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
          Length = 180

 Score =  216 bits (550), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 131/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++  IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQ 65

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 66  KYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECA 125

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI G KWS+TKW+H+  +
Sbjct: 126 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 178


>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii
          Length = 233

 Score =  216 bits (550), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 100/169 (59%), Positives = 128/169 (75%), Gaps = 3/169 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A  T +P EN E +Q+LHY  GQ
Sbjct: 59  VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTXIPLENHEGLQVLHYHDGQ 118

Query: 62  KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           KYEPH+D+F D +N   + GG R+ T L YL+ VE+GGETV PN+E   + DG WSECA+
Sbjct: 119 KYEPHYDYFHDPVNAGPEHGGQRVVTXLXYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 177

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           RG AVKP+KGDAL F+SL PD S D  SLHGSCP ++G+KWSATKWIHV
Sbjct: 178 RGLAVKPIKGDALXFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score =  216 bits (549), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 103/173 (59%), Positives = 131/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G S  S VRTSSGMFL + QD+I+ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 147 VVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQ 206

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP+S+ + S        SECA
Sbjct: 207 KYEPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECA 266

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G AVKP  GDALLF+S+ PD S D+TSLHG CPVI+G KWS+TKW+ V  +
Sbjct: 267 KKGLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEY 319


>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score =  216 bits (549), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 114/244 (46%), Positives = 148/244 (60%), Gaps = 23/244 (9%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VAD    KS  S +RTS+GMFL+K Q   V  +E R+AA   LP ENGE MQIL YEHG
Sbjct: 266 VVADG-GKKSTKSGIRTSAGMFLTKGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHG 324

Query: 61  QKYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS-----RDGN 113
           QKY+PH+D+F DK+N     GG R+AT+L+YL   E+GGET+FPN++  +      +DG 
Sbjct: 325 QKYDPHYDYFHDKINPSPNRGGQRMATMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGA 384

Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           +S+CA+RG  VK  +GDA+LF+SL  D   D  SLHG+CPV+ GEKW+A KWI V  FD 
Sbjct: 385 FSDCAKRGLPVKSKRGDAVLFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIRVAKFDG 444

Query: 174 ------PEKEPEDDD---------CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRK 218
                 P       D         CVDE   C  WA+ G C++NP +M G   +R     
Sbjct: 445 RFTGELPMPSLTRGDRAAVDATARCVDEWDECAEWARKGWCERNPEFMTGVNGARDSKGP 504

Query: 219 SCKV 222
           +C V
Sbjct: 505 ACAV 508


>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
 gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
          Length = 454

 Score =  215 bits (547), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 114/244 (46%), Positives = 154/244 (63%), Gaps = 25/244 (10%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  SG S+ S++RTS+GMFL + QD  V +IE RIAA + LP  NGE +QIL YE+G
Sbjct: 207 VVGDKGSG-SMVSKIRTSAGMFLGRGQDPTVRAIEERIAAASGLPEPNGEGLQILRYENG 265

Query: 61  QKYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN----- 113
           QKY+PHFD+F D++N   + GG R+AT+L+YL    +GGET+FPN    +  D +     
Sbjct: 266 QKYDPHFDYFHDQVNSSPRRGGQRMATMLIYLEDTTEGGETIFPNGVRPEDWDADEPGNH 325

Query: 114 --WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             WS+CA++G  VK  +GDA+LF+SL  D + D+ SLHG+CPVI GEKW+A KWI V  F
Sbjct: 326 NSWSDCAKKGIPVKSHRGDAVLFWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIRVAKF 385

Query: 172 DKPEKEP-----------EDDDCVDEDLNCVVWAKAGECKKNPLYMV---GSKSSRG-YC 216
           D    +P              +C+DE   C  WAK G C +NP +M    G++ SRG  C
Sbjct: 386 DGGFTDPLPMPALARSDRTKGECLDEWDECGEWAKKGWCDRNPSFMTGLEGARDSRGPAC 445

Query: 217 RKSC 220
            +SC
Sbjct: 446 PQSC 449


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score =  215 bits (547), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 103/173 (59%), Positives = 131/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G S  S VRTSSGMFL + QD+I+ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 48  VVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQ 107

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP+S+ + S        SECA
Sbjct: 108 KYEPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECA 167

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G AVKP  GDALLF+S+ PD S D+TSLHG CPVI+G KWS+TKW+ V  +
Sbjct: 168 KKGLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEY 220


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score =  214 bits (546), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 98/173 (56%), Positives = 133/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +++++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRNKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S        S+CA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 252

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI+G KWS+TKW+H+  +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305


>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
 gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
          Length = 369

 Score =  214 bits (546), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 112/248 (45%), Positives = 155/248 (62%), Gaps = 34/248 (13%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           + G S+ASE+RTS+GMFL K+QD+ V  IE RIA  + +P +NGE MQIL Y+ GQKY+P
Sbjct: 123 DGGSSVASEIRTSAGMFLRKSQDDTVREIEERIARLSGVPVDNGEGMQILRYDKGQKYDP 182

Query: 66  HFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN---------- 113
           HFD+F DK+N   + GG R+ATVL+YL   E+GGET FPN  + ++ + +          
Sbjct: 183 HFDYFHDKVNPAPKRGGQRVATVLIYLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHI 242

Query: 114 -WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
             ++CA+ G  VK ++GDA+LFFS+  D   D  SLHG+CPVI G+KW+A KW+ V  FD
Sbjct: 243 KHTDCAKNGIPVKSVRGDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLRVAKFD 302

Query: 173 --------------KPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYM--VGSKSSRG-Y 215
                         + E+EP    CVDE  +C  WA+ G C++NP +M   G++ S    
Sbjct: 303 GGFKDELPMIPLTRRTEREP----CVDEWDDCASWARDGWCERNPEFMKFAGARDSHTPA 358

Query: 216 CRKSCKVC 223
           C KSC +C
Sbjct: 359 CPKSCGLC 366


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score =  214 bits (546), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 131/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++  IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 134 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQ 193

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 194 KYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECA 253

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI G KWS+TKW+H+  +
Sbjct: 254 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 306


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score =  214 bits (545), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 100/170 (58%), Positives = 130/170 (76%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL + +D+I+ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 113 VVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPH+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP + ++ S        SEC 
Sbjct: 173 KYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECG 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           ++G +VKP  GDALLF+S+ PDA+ D TSLHG CPVI G KWS+TKWIHV
Sbjct: 233 KKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWIHV 282


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score =  214 bits (544), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 98/173 (56%), Positives = 132/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S        S+CA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 252

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ P A+ D  SLHG CPVI+G KWS+TKW+H+  +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score =  214 bits (544), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 99/172 (57%), Positives = 127/172 (73%), Gaps = 1/172 (0%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNE+GKS  S+VRTSSGMFL++ +D+++  IEARIA +T +P ENGE +QILHY+  +
Sbjct: 38  VVDNETGKSAPSKVRTSSGMFLNRGEDDVIERIEARIAKYTAIPKENGEGLQILHYQASE 97

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARR 120
           +Y PHFD+F D  N Q GG RIAT+LMYLS VE GGETVFP +S+     +  +S+CA+ 
Sbjct: 98  EYRPHFDYFHDNFNTQNGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQA 157

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           G A KP KGDAL F+SL PD   D  SLH  CPV++G+KWSATKW+ V  F+
Sbjct: 158 GAAAKPKKGDALFFYSLTPDGRMDEKSLHAGCPVMKGDKWSATKWLRVDRFE 209


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score =  213 bits (543), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 100/170 (58%), Positives = 130/170 (76%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL + +D+I+ +IE RIA +TF+P ++GE +QILHYE GQ
Sbjct: 113 VVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQILHYEAGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPH+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP + ++ S        SEC 
Sbjct: 173 KYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECG 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           ++G +VKP  GDALLF+S+ PDA+ D TSLHG CPVI G KWS+TKW+HV
Sbjct: 233 KKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHV 282


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  213 bits (542), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 101/173 (58%), Positives = 133/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNE+GK++   VRTSSGMFL++ QD+IV++IE RIA +TF+P E+GE +QILHYE GQ
Sbjct: 111 VVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQ 170

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KY+ H+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 171 KYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCG 230

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PDA+ D TSLHG+CPVI G KWS TKW+HV  +
Sbjct: 231 KGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score =  213 bits (542), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 99/170 (58%), Positives = 130/170 (76%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL + +D+I+ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 113 VVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPH+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP + ++ S        SEC 
Sbjct: 173 KYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECG 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           ++G +VKP  GDALLF+S+ PDA+ D TSLHG CPVI G KWS+TKW+HV
Sbjct: 233 KKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHV 282


>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
 gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
          Length = 180

 Score =  213 bits (541), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 97/173 (56%), Positives = 130/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++  IE RI  +TF+P ++GE +Q+LHYE GQ
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRITDYTFIPVDHGEGLQVLHYEVGQ 65

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LM+LS VE+GGET+FP++ V+ S        SECA
Sbjct: 66  KYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDANVNDSSLPWYNELSECA 125

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           +RG +VKP  GDALLF+S+ PDA+ D  SLHG CPVI G KWS+TKW+H+  +
Sbjct: 126 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 178


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  213 bits (541), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+G+S  S VRTSSGMFL + +D+I+  IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 114 VVDSETGRSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQ 173

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KY+ H+D+F D+ N + GG RIAT+LMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 174 KYDAHYDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECG 233

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP  GDALLF+S+ PDA+ D +SLHG CPVI+G KWS+TKW+HV  +
Sbjct: 234 KKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHVEEY 286


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score =  213 bits (541), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 102/174 (58%), Positives = 130/174 (74%), Gaps = 3/174 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKSI S VRTSSG FL++  DEIV  IE RI+ +TF+PPENGE +Q+LHYE GQ
Sbjct: 117 VVDVKTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQ 176

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           +YEPH D+F D+ N + GG RIATVLMYLS V++GGETVFP ++ + S    W   S+C 
Sbjct: 177 RYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCG 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           + G +V P K DALLF+S+ PDAS D +SLHG CPVI+G KWS+TKW HV  ++
Sbjct: 237 KEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290


>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 281

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/217 (48%), Positives = 136/217 (62%), Gaps = 6/217 (2%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           S +RTS G+FL + +DEIV  +E RIAAWT +P  NGE +Q+L Y+  QKY+ H+D+F  
Sbjct: 36  SNIRTSYGVFLDRGEDEIVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFH 95

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
           K     GG+R ATVLMYL   E+GGETVFPN       +  +SECAR   A KP KG A+
Sbjct: 96  KDGITNGGNRYATVLMYLVDTEEGGETVFPNVAAPGGENVGFSECARYHLAAKPKKGTAI 155

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH----VRNFDKPEKEPED--DDCVDE 186
           LF S+ P    +  SLH +CPVI G KWSA KWIH    +    +P+ +P+D    C D 
Sbjct: 156 LFHSIKPTGELERKSLHTACPVIRGIKWSAAKWIHHAETIEQHPQPKVKPQDLPPGCEDS 215

Query: 187 DLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           D  C  WA AGEC++N  +MVGS++  G C  SCK C
Sbjct: 216 DEMCPEWADAGECERNASFMVGSRARPGKCVASCKRC 252


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 98/173 (56%), Positives = 134/173 (77%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL++ +D+IV +IE +IA +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPH+D+F D+ N + GG RIATVLMYL+ VE+GGETVFP ++ + S        S+C 
Sbjct: 175 KYEPHYDYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCG 234

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G ++KP +GDALLF+S+ PDA+ D++SLHG CPVI+G KWS+TKWI V  +
Sbjct: 235 KKGLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKWSSTKWIRVNEY 287


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL++ +D+IV  IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 116 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQ 175

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D  N + GG RIATVLMYL+ VE+GGETVFP ++ + S    W   SEC 
Sbjct: 176 KYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECG 235

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G ++KP +GDALLF+S+ PDA+ D +SLHG CPVI+G KWS+TKW+ V  +
Sbjct: 236 KKGLSIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSEY 288


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 99/170 (58%), Positives = 131/170 (77%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++G+S  S VRTSSGMFL + +D+I+ +IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSKTGRSKDSRVRTSSGMFLRRGRDKIIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYE H+D+F D+ N + GG R AT+LMYLS VE+GGETVFP ++ + S   +W   SECA
Sbjct: 175 KYEAHYDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECA 234

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           R+G +VKP  G+ALLF+S  PDA+ D  SLHGSCPVI G KWSATKW+H+
Sbjct: 235 RQGLSVKPKMGNALLFWSTRPDATLDPASLHGSCPVIRGNKWSATKWMHL 284


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score =  211 bits (538), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 104/173 (60%), Positives = 128/173 (73%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G S  S VRTSSGMFL + QD+I+ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 133 VVDSATGASKDSRVRTSSGMFLRRGQDKIIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D  N + GG RIAT+LMYLS VE GGETVFP+S  + S        SECA
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 252

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PD S DSTSLHG CPVI+G KWS+TKW+ V  +
Sbjct: 253 KGGLSVKPKMGDALLFWSMKPDGSMDSTSLHGGCPVIKGNKWSSTKWMRVHEY 305


>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 541

 Score =  211 bits (536), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 112/237 (47%), Positives = 147/237 (62%), Gaps = 27/237 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G S  SEVRTS+G F+S+  D+I+A +E RI  W+ +P  + EA QIL YE GQ
Sbjct: 295 VVDAQTGGSSLSEVRTSTGTFISRKYDDIIAGVEERIELWSQIPQSHHEAFQILRYEPGQ 354

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN-WSECARR 120
           +Y+ HFD+F  K   +   +RIATVL+YLS VE+GGETVFPN++V  SR+ + +SEC   
Sbjct: 355 EYKAHFDYFFHKSGMR--NNRIATVLLYLSDVEEGGETVFPNTDVPTSRNRSMYSECGNG 412

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
           G A+K  KGDALLF+S+ P    D+ S H  CPVI+GEKW+ATKW+HV     P   P D
Sbjct: 413 GKALKARKGDALLFWSMKPGGELDAGSSHAGCPVIKGEKWTATKWMHV----NPLAGPND 468

Query: 181 D--------------DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           D               C D    C  WA++GEC KNP +M      R  C+ SC+VC
Sbjct: 469 DAHNVFYDGGPRSTASCSDAQAECRGWAESGECDKNPGFM------RESCKMSCRVC 519


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  211 bits (536), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+G+S  S VRTSSG FL + +D+ V +IE R++ ++F+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDSETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPHFD+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 173 KYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCG 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP +GDALLF+S+ PDAS D +SLHG CPVI+G KWSATKW+ V  +
Sbjct: 233 KKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWVRVEEY 285


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score =  210 bits (535), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 101/173 (58%), Positives = 130/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNE+GK++   VRTSSGMFL++ QD+IV++IE RIA +TF+P E+GE +QILHYE GQ
Sbjct: 76  VVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQ 135

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KY+ H+DFF D+ N +  G R+AT+LMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 136 KYDAHYDFFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCG 195

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PD + D TSLHG+CPVI G KWS TKWIHV   
Sbjct: 196 KGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIHVNQL 248


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score =  210 bits (535), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 98/174 (56%), Positives = 134/174 (77%), Gaps = 3/174 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL++ +D+IV +IE +I+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS---ECA 118
           KYEPH+D+F D  N + GG RIATVLMYL+ VE+GGETVFP ++ + S    W+   EC 
Sbjct: 175 KYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECG 234

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           ++G ++KP +GDALLF+S+ PDAS D +SLHG CPVI+G KWS+TKW+ V  ++
Sbjct: 235 KKGLSIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSEYN 288


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  210 bits (535), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 102/173 (58%), Positives = 130/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S+VRTSSG FL + +D+IV  IE RIA ++F+P E+GE +QILHYE GQ
Sbjct: 117 VVDSSTGKSKDSKVRTSSGTFLPRGRDKIVRDIEKRIADFSFIPVEHGEGLQILHYEVGQ 176

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           +YEPHFD+F D+ N + GG RIATVLMYLS VE+GGETVFP++E + S    W   SEC 
Sbjct: 177 RYEPHFDYFMDEYNTKNGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECG 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S++PD S D +SLHG CPVI G KWS+TKW+ V  +
Sbjct: 237 KGGLSVKPKMGDALLFWSMNPDGSPDPSSLHGGCPVIRGNKWSSTKWMRVNEY 289


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score =  210 bits (534), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 101/173 (58%), Positives = 129/173 (74%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSG FL++ QD+I+  IE R++ +TFLP E+GE +QILHYE GQ
Sbjct: 114 VVDSSTGKSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQ 173

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D  N + GG R+ATVLMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 174 KYEPHYDYFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCG 233

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PDAS D +SLHG CPVI+G KWS+TKWI V  +
Sbjct: 234 KEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  210 bits (534), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 101/173 (58%), Positives = 129/173 (74%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSG FL++ QD+I+  IE R++ +TFLP E+GE +QILHYE GQ
Sbjct: 114 VVDSSTGKSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQ 173

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D  N + GG R+ATVLMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 174 KYEPHYDYFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCG 233

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PDAS D +SLHG CPVI+G KWS+TKWI V  +
Sbjct: 234 KEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  210 bits (534), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 100/173 (57%), Positives = 133/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++G+S+ S VRTSSGMFL++ QD+I+ +IE RIA +TF+P E+GE +QILHYE GQ
Sbjct: 111 VVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQ 170

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KY+ H+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 171 KYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECG 230

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PDA+ D TSLHG+CPVI G KWS TKW+HV  +
Sbjct: 231 KGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  210 bits (534), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+G+S  S VRTSSG FL + +D+ V +IE R++ ++F+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDSETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPHFD+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 173 KYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCG 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP +GDALLF+S+ PDAS D +SLHG CPVI+G KWSATKW+ V  +
Sbjct: 233 KKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score =  209 bits (533), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 103/173 (59%), Positives = 126/173 (72%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G S  S VRTSSGMFL + QD+I+ +IE RIA +TF+P E GE +Q+LHYE GQ
Sbjct: 190 VVDSATGGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQ 249

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D  N + GG RIAT+LMYLS VE GGETVFP+S  + S        SECA
Sbjct: 250 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 309

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PD S D TSLHG CPVI+G KWS+TKW+ V  +
Sbjct: 310 KGGLSVKPKMGDALLFWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEY 362


>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
 gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
          Length = 343

 Score =  209 bits (533), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 111/238 (46%), Positives = 143/238 (60%), Gaps = 17/238 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V+D  +G    S++RTSSGMF  + + E+V  IE R+A WT LP ENGE +Q+L YE  Q
Sbjct: 104 VSDATTGAGAVSDIRTSSGMFYERGETELVKRIENRLAMWTMLPVENGEGIQVLRYEKTQ 163

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECAR 119
           KY+PH D+F        GG+R+ATVLMYL+  E+GGETVFP     V Q      + C R
Sbjct: 164 KYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPC-R 222

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
           +G AVKP KGDA+LF+S+ PD   D  SLHGSCPVI+G KWSATKWIHV ++    +  E
Sbjct: 223 QGLAVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATKWIHVGHYAMSGERSE 282

Query: 180 D--------------DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                            C ++   C  WA++GEC+ NP YM+G K   G C  +C  C
Sbjct: 283 TVKRVQYVPPPPPAVPGCENQHKLCSHWAESGECESNPGYMIGKKGMPGACILACNRC 340


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score =  209 bits (533), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 99/173 (57%), Positives = 130/173 (75%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+G+S  S VRTSSG FLS+ +D+ +  IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDSETGRSKDSRVRTSSGTFLSRGRDKKIRDIEKRIADFSFIPVEHGEGLQVLHYEVGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 173 KYEPHFDYFNDEFNTKNGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECG 232

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G +VKP  GDALLF+S+ PDA+ D +SLHG CPVI G KWSATKW+ V  +
Sbjct: 233 KKGLSVKPNMGDALLFWSMKPDATLDPSSLHGGCPVINGNKWSATKWMRVNEY 285


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score =  209 bits (533), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 103/173 (59%), Positives = 126/173 (72%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G S  S VRTSSGMFL + QD+I+ +IE RIA +TF+P E GE +Q+LHYE GQ
Sbjct: 133 VVDSATGGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D  N + GG RIAT+LMYLS VE GGETVFP+S  + S        SECA
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 252

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PD S D TSLHG CPVI+G KWS+TKW+ V  +
Sbjct: 253 KGGLSVKPKMGDALLFWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEY 305


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  209 bits (532), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 102/174 (58%), Positives = 128/174 (73%), Gaps = 3/174 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKSI S VRTSSG FL +  DEIV  IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 117 VVDVKTGKSIDSRVRTSSGTFLKRGHDEIVEEIENRISDFTFIPIENGEGLQVLHYEVGQ 176

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH D+F D+ N + GG RIATVLMYLS V++GGETVFP ++ + S    W   S+C 
Sbjct: 177 KYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCG 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           + G +V P K DALLF+S+ PDAS D +SLHG CPVI+G KWS+TKW HV  ++
Sbjct: 237 KEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290


>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
          Length = 564

 Score =  207 bits (527), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 114/242 (47%), Positives = 150/242 (61%), Gaps = 24/242 (9%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G S+ S +RTS+GMFL KA D+ + +IE RIAA +  P  NGE MQIL Y+ GQKY+PHF
Sbjct: 321 GTSVPSTIRTSAGMFLRKAADKTLENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHF 380

Query: 68  DFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD----GN---WSECA 118
           D+F D +N   + GG R+AT+L+YL + ++GGET+FP    +++ D    GN   WSEC 
Sbjct: 381 DYFHDAVNPSPKRGGQRMATMLIYLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECT 440

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           + G  VK +KGDALLF+SL  D   D  SLHG+CPV++G+KW+A KWI V  FD     P
Sbjct: 441 KHGLPVKSVKGDALLFWSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIRVAKFDGMFTSP 500

Query: 179 -----------EDDDCVDEDLNCVVWAKAGECKKNPLYMV---GSKSSRG-YCRKSCKVC 223
                      +   CVDE   C  WAK G C+KN  +MV   G++ S+G  C  SC V 
Sbjct: 501 LPMPALSRRTEQHGKCVDEWDECAKWAKDGWCEKNKDFMVSNGGARDSKGPACPVSCNVP 560

Query: 224 KP 225
            P
Sbjct: 561 CP 562


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 97/170 (57%), Positives = 129/170 (75%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++G+S  S VRTSSGMFL + +D ++  IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 114 VVDSKTGRSKDSRVRTSSGMFLRRGRDRVIREIEKRIADFSFIPVEHGEGLQVLHYEVGQ 173

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYE HFD+F D+ N + GG R AT+LMYLS VE+GGETVFP + ++ S    W   SECA
Sbjct: 174 KYEAHFDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECA 233

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           ++G ++KP  G+ALLF+S  PDA+ D +SLHGSCPVI G KWSATKW+H+
Sbjct: 234 KQGLSLKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNKWSATKWMHL 283


>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 108/233 (46%), Positives = 148/233 (63%), Gaps = 19/233 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G S  S++RTS+G F+S+A D  + +IE RI  W+ +P ++GEA+Q+L YE+GQ
Sbjct: 31  VVDAKTGGSTTSDIRTSTGTFISRAHDPTITAIEERIELWSQIPVDHGEALQVLRYENGQ 90

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD-GNWSECARR 120
           +Y+ HFD+F  K  ++   +RIATVL+YLS VE+GGETVFPN++V   RD   +SEC   
Sbjct: 91  EYKAHFDYFFHKGGKR--NNRIATVLLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNG 148

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP------ 174
           G +VK  KGDALLF+S+ P    D  S H  CPVI+G KW+ATKW+HV    K       
Sbjct: 149 GKSVKARKGDALLFWSMKPGGELDPGSSHAGCPVIKGVKWTATKWMHVNAIGKHGDDVHK 208

Query: 175 ---EKEPE-DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
              E  P+  + C D D  C  WA++GEC KNP +M+ S      C  SC+ C
Sbjct: 209 IFYEGGPQATESCKDTDDACRGWAESGECDKNPGFMLKS------CAMSCRAC 255


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score =  207 bits (526), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 97/173 (56%), Positives = 128/173 (73%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++G S  S VRTSSGMFL++ QD +++ IE +IA  TF+P ++GE +Q+LHYE GQ
Sbjct: 39  VVDSQTGGSRDSRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQ 98

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KY+ H DFF D +N + GG RIAT+LMYL+ VE+GGETVFP S  + S        SEC 
Sbjct: 99  KYDAHHDFFYDTVNTRNGGQRIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECG 158

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           RRG +V+P +GDALLF+S+ PDA  D +SLHG CPVI+G+KWSATKW+ V  +
Sbjct: 159 RRGVSVRPKRGDALLFWSMSPDAQLDHSSLHGGCPVIKGDKWSATKWMRVSEY 211


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 97/173 (56%), Positives = 128/173 (73%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++G S  S VRTSSGMFL++ QD +++ IE +IA  TF+P ++GE +Q+LHYE GQ
Sbjct: 51  VVDSQTGGSRDSRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQ 110

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KY+ H DFF D +N + GG RIAT+LMYL+ VE+GGETVFP S  + S        SEC 
Sbjct: 111 KYDAHHDFFYDTVNTRNGGQRIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECG 170

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           RRG +V+P +GDALLF+S+ PDA  D +SLHG CPVI+G+KWSATKW+ V  +
Sbjct: 171 RRGVSVRPKRGDALLFWSMSPDAQLDHSSLHGGCPVIKGDKWSATKWMRVSEY 223


>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
           [Brachypodium distachyon]
          Length = 314

 Score =  205 bits (522), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 103/204 (50%), Positives = 135/204 (66%), Gaps = 10/204 (4%)

Query: 23  LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHR 82
           L+ ++D +V+ IE RI+ W+F+P E+GE+MQIL Y   Q      D  +D      GG+R
Sbjct: 118 LADSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNR 172

Query: 83  IATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPD 140
           + T+LMYLS V++GGETVFP SE+  +Q+++G  SECA  GYAVKP+KGDA+L F+L PD
Sbjct: 173 LVTILMYLSDVKQGGETVFPRSELKDTQAKEGALSECA--GYAVKPVKGDAILLFNLRPD 230

Query: 141 ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGEC 199
             TDS S +  C V+EGEKW A K +H+   DK     P +D C DED  CV WA AGEC
Sbjct: 231 GVTDSDSHYEDCSVLEGEKWLAIKHLHISKIDKSRSSLPSEDLCTDEDDKCVSWAAAGEC 290

Query: 200 KKNPLYMVGSKSSRGYCRKSCKVC 223
             NP++M+GS    G CRKSC  C
Sbjct: 291 YSNPVFMIGSPDYYGTCRKSCHAC 314


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 106/223 (47%), Positives = 140/223 (62%), Gaps = 8/223 (3%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VA N  G S  S++RTS G+FL + +D +V  +E RI+A T +P  NGE +Q+L Y+  
Sbjct: 30  VVATN--GGSEESQIRTSFGVFLERGEDPVVKGVEERISALTLMPVGNGEGLQVLRYQKE 87

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           QKY+ H+D+F  K     GG+R ATVLMYL   E+GGETVFPN       +  +SECAR 
Sbjct: 88  QKYDAHWDYFFHKDGIANGGNRYATVLMYLVDTEEGGETVFPNIAAPGGENVGFSECARY 147

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
             A KP KG A+LF S+ P    +  SLH +CPVI+G KWSA KWIHV    KP+  P  
Sbjct: 148 HLAAKPKKGTAILFHSIKPTGELERKSLHTACPVIKGIKWSAAKWIHV----KPQNLP-- 201

Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             C D D  C  WA+AGEC++N  +M+G+++  G C  SCK C
Sbjct: 202 PGCEDSDEMCPDWAEAGECERNASFMIGTRARPGKCVASCKRC 244


>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 278

 Score =  205 bits (522), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 98/173 (56%), Positives = 126/173 (72%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V DNE+GKS  S VRTSSG FL +  DEIV +IE RIA +TF+P ENGE+  +L YE GQ
Sbjct: 104 VVDNETGKSKDSSVRTSSGTFLDRGGDEIVRNIEKRIADFTFIPVENGESFNVLRYEVGQ 163

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KY+PH D+F D  N   GG RIAT+LMYLS VE+GGETVFP ++ + S    W+E   C 
Sbjct: 164 KYDPHLDYFADDYNTVNGGQRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCG 223

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ++G ++KP  GDALLF+S+ PD + D +SLHG+CPVI+G+KWS TKW+ +  F
Sbjct: 224 KKGLSIKPKMGDALLFWSMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINEF 276


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 97/173 (56%), Positives = 124/173 (71%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G S  S VRTSSG FL +  DE+V  IE RI+ +TF+P ENGE +Q+LHY+ GQ
Sbjct: 117 VVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQ 176

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS V+ GGETVFP +  + S    W+E   C 
Sbjct: 177 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCG 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +V P K DALLF+++ PDAS D +SLHG CPV++G KWS+TKW HV  F
Sbjct: 237 KEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 97/173 (56%), Positives = 128/173 (73%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKS  S VRTSSG FL++ +D+ +  IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 175 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 234

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +VKP  GDALLF+S+ PDA+ D +SLHG C VI+G KWS+TKW+ V  +
Sbjct: 235 KGGLSVKPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEY 287


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 97/173 (56%), Positives = 124/173 (71%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G S  S VRTSSG FL +  DE+V  IE RI+ +TF+P ENGE +Q+LHY+ GQ
Sbjct: 117 VVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQ 176

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS V+ GGETVFP +  + S    W+E   C 
Sbjct: 177 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCG 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +V P K DALLF+++ PDAS D +SLHG CPV++G KWS+TKW HV  F
Sbjct: 237 KEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score =  204 bits (518), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 98/174 (56%), Positives = 129/174 (74%), Gaps = 3/174 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MV D+++G S+ S VRTSSG FL++ QD+I+  IE RIA ++ +P E+GE + +LHYE  
Sbjct: 127 MVVDSKTGGSMDSNVRTSSGWFLNRGQDKIIRRIEKRIADFSHIPVEHGEGLHVLHYEVE 186

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SEC 117
           QKY+ H+D+F D +N + GG R AT+LMYLS VEKGGETVFP S+V+ S    W   SEC
Sbjct: 187 QKYDAHYDYFSDTINVKNGGQRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSEC 246

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            R G +V+P  GDALLF+S+ PDAS D +SLHGSCPVI+G KWSATKW+ +  +
Sbjct: 247 GRSGLSVRPKMGDALLFWSVKPDASLDPSSLHGSCPVIQGNKWSATKWMRLNKY 300


>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
          Length = 277

 Score =  203 bits (517), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 100/175 (57%), Positives = 123/175 (70%), Gaps = 18/175 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE-------------- 47
           V D ESG+S+ S+VRTSSGMFL K QDE+VA IE RIAAWT LP E              
Sbjct: 80  VVDGESGESVTSKVRTSSGMFLDKKQDEVVARIEERIAAWTMLPTECIIFYCFANFAILK 139

Query: 48  ---NGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS 104
              NGE+MQIL Y  G+KYEPHFD+   +      G R+ATVLMYLS+V K G+++ P +
Sbjct: 140 LSENGESMQILRYGQGEKYEPHFDYISGRQGSTREGDRVATVLMYLSNV-KMGDSLLPQA 198

Query: 105 EVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEK 159
            +SQ +D  WS+CA +G+AVKP KG A+LFFSLHP+A+ D+ SLHGSCPVIEGEK
Sbjct: 199 RLSQPKDETWSDCAEQGFAVKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEK 253


>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
           [Brachypodium distachyon]
          Length = 303

 Score =  202 bits (515), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 102/199 (51%), Positives = 131/199 (65%), Gaps = 10/199 (5%)

Query: 28  DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
           D +V+ IE RI+ W+F+P E+GE+MQIL Y   Q      D  +D      GG+R+ T+L
Sbjct: 112 DIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNRLVTIL 166

Query: 88  MYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDS 145
           MYLS V++GGETVFP SE+  +Q+++G  SECA  GYAVKP+KGDA+L F+L PD  TDS
Sbjct: 167 MYLSDVKQGGETVFPRSELKDTQAKEGALSECA--GYAVKPVKGDAILLFNLRPDGVTDS 224

Query: 146 TSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNPL 204
            S +  C V+EGEKW A K +H+   DK     P +D C DED  CV WA AGEC  NP+
Sbjct: 225 DSHYEDCSVLEGEKWLAIKHLHISKIDKSRSSLPSEDLCTDEDDKCVSWAAAGECYSNPV 284

Query: 205 YMVGSKSSRGYCRKSCKVC 223
           +M+GS    G CRKSC  C
Sbjct: 285 FMIGSPDYYGTCRKSCHAC 303


>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
          Length = 327

 Score =  202 bits (515), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 110/230 (47%), Positives = 140/230 (60%), Gaps = 14/230 (6%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G S+  E+RTSSGMF+ K  D +++ +E R+AA T LP  + E +Q+L YE GQKY  H+
Sbjct: 84  GGSMLDEIRTSSGMFILKGHDAVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHW 143

Query: 68  DFFRDKMNQQ-------LGGHRIATVLMYLSHVEKGGETVFPNS----EVSQSRDGNWSE 116
           D        Q       LGG R AT+LMYLS VE+GGET FP+     E  Q+    ++E
Sbjct: 144 DINDSPERAQQMRAKGVLGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAP-PYTE 202

Query: 117 CARRGYAVKPMKGDALLFFSLHPDAS-TDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           CA +G  VKP KGDA+LFFSL  +    D  SLH  CPV+ G K+SATKW+HV  F    
Sbjct: 203 CASKGVVVKPRKGDAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWVHVEPFGHTT 262

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKP 225
            + +   C D  + C  WA AGEC  NP+YM GS+ S G CR SCKVC+P
Sbjct: 263 VQ-QPSRCEDARVECPQWAAAGECDSNPVYMKGSEVSVGSCRLSCKVCRP 311


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  202 bits (514), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 96/173 (55%), Positives = 123/173 (71%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G S  S VRTSSG FL +  DE+V  IE RI+ +TF+P ENGE +Q+LHY+ GQ
Sbjct: 117 VVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQ 176

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS V+ GGETVFP +  + S    W+E   C 
Sbjct: 177 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCG 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G +V P   DALLF+++ PDAS D +SLHG CPV++G KWS+TKW HV  F
Sbjct: 237 KEGLSVLPKXRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  201 bits (512), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 93/173 (53%), Positives = 128/173 (73%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN+SG+S+  +VR S+G FL + QDEIV +IE RIA  TF+P ENGE + ++HYE GQ
Sbjct: 111 VADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKRIADVTFIPIENGEPIYVIHYEVGQ 170

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
            Y+PH+D+F D  N + GG RIAT+LMYLS+VE+GGET+FP ++ + S    W+E   C 
Sbjct: 171 YYDPHYDYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCG 230

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G ++KP  GDALLF+S+ P+A+ D+ +LH +CPVI+G KWS TKW+H   F
Sbjct: 231 KMGLSIKPKMGDALLFWSMKPNATLDALTLHSACPVIKGNKWSCTKWMHPTEF 283


>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
 gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
          Length = 204

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 92/125 (73%), Positives = 109/125 (87%), Gaps = 2/125 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL + QDE+V  IE RI+AWTFLPPENGE++QILHY++G
Sbjct: 72  MVADNESGKSVQSEVRTSSGMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNG 131

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           +KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E  + Q +D  WS+CA
Sbjct: 132 EKYEPHYDYFHDKKNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCA 191

Query: 119 RRGYA 123
           R GYA
Sbjct: 192 RNGYA 196


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 104/174 (59%), Positives = 129/174 (74%), Gaps = 3/174 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MV D+ SGKS  S VRTSSG FL + +D+I+  IE RIA ++F+P E+GE +QILHYE G
Sbjct: 91  MVVDSSSGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFSFIPSEHGEGLQILHYEVG 150

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SEC 117
           QKYEPHFD+F D  N + GG RIATVLMYLS VE+GGETVFP+++ + S    W   SEC
Sbjct: 151 QKYEPHFDYFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSEC 210

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            + G +VKP  GDALLF+S+ PDAS D +SLHG CPVI G KWS+TKW+ V  +
Sbjct: 211 GKGGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKWMRVNEY 264


>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 225

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 97/200 (48%), Positives = 137/200 (68%), Gaps = 10/200 (5%)

Query: 27  QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
           +D +V+ IE RI+ W+FLP ENGE++Q+L Y   +         +++     G HR+AT+
Sbjct: 33  EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRSGS-----IKEEPKSSSGAHRLATI 87

Query: 87  LMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
           LMYLS V++GGETVFP SE+  +Q+++G  S+C+  GYAV+P KG+A+L F+L PD  TD
Sbjct: 88  LMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCS--GYAVRPAKGNAILLFNLRPDGETD 145

Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNP 203
             S +  CPV+EGEKW A K I++R FD P+     +D+C DED  CV WA +GEC +NP
Sbjct: 146 KDSQYEECPVLEGEKWLAIKHINLRKFDYPKSSLASEDECTDEDDRCVSWAASGECDRNP 205

Query: 204 LYMVGSKSSRGYCRKSCKVC 223
           ++M+GS    G CRKSC+VC
Sbjct: 206 VFMIGSSDYYGSCRKSCRVC 225


>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
 gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
          Length = 303

 Score =  201 bits (510), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 97/200 (48%), Positives = 137/200 (68%), Gaps = 10/200 (5%)

Query: 27  QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
           +D +V+ IE RI+ W+FLP ENGE++Q+L Y   +         +++     G HR+AT+
Sbjct: 111 EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRSGS-----IKEEPKSSSGAHRLATI 165

Query: 87  LMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
           LMYLS V++GGETVFP SE+  +Q+++G  S+C+  GYAV+P KG+A+L F+L PD  TD
Sbjct: 166 LMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCS--GYAVRPAKGNAILLFNLRPDGETD 223

Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNP 203
             S +  CPV+EGEKW A K I++R FD P+     +D+C DED  CV WA +GEC +NP
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHINLRKFDYPKSSLASEDECTDEDDRCVSWAASGECDRNP 283

Query: 204 LYMVGSKSSRGYCRKSCKVC 223
           ++M+GS    G CRKSC+VC
Sbjct: 284 VFMIGSSDYYGSCRKSCRVC 303


>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 296

 Score =  200 bits (508), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 94/170 (55%), Positives = 129/170 (75%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++E+G SI S VRTSSG FL++ +D+IV +IE RIA +TF+P +NGE +Q+LHY+ G+
Sbjct: 122 VIESETGMSIESRVRTSSGTFLARGRDKIVRNIENRIADFTFIPVDNGEELQVLHYQVGE 181

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KY PH D+F D +N   GG RIAT+LMYLS VE+GGETVFP+++ + S    W+E   C 
Sbjct: 182 KYVPHHDYFMDDINTANGGDRIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCG 241

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           ++G ++KP   +ALLF+S+ PDA+ D  SLHGSCPVI+G KWS+TKWI +
Sbjct: 242 KKGLSIKPKMRNALLFWSIKPDATYDPLSLHGSCPVIKGNKWSSTKWIRI 291


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score =  199 bits (507), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 93/173 (53%), Positives = 129/173 (74%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKS+ S +RTSSG FL +  DEIV++IE RIA +TF+P E+GE+  +LHYE GQ
Sbjct: 103 VIDEKTGKSLNSSIRTSSGTFLDREGDEIVSNIEKRIADFTFIPVEHGESFNVLHYEVGQ 162

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D  + +  G RIAT+LMYLS VE+GGETVFPN++ + S    W+E   C 
Sbjct: 163 KYEPHYDYFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCG 222

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G ++KP  G+A+LF+S+ PDA+ D +SLHG+CPVI+G+KWS  KW+H   +
Sbjct: 223 KGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADEY 275


>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 259

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 105/222 (47%), Positives = 140/222 (63%), Gaps = 11/222 (4%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           GKS+    RTS G FL + QDEIV  IE R+AAWT +P  + E  QIL Y  GQ+Y+ H 
Sbjct: 43  GKSVEDNYRTSYGTFLKRYQDEIVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHA 102

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRDGNWSECARRGY 122
           D  RD    +  G R+ATVL+YL+  + GGET FP+SE     ++++   N+S+CA+   
Sbjct: 103 DTLRD----EEAGVRVATVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHV 158

Query: 123 AVKPMKGDALLFFSLHPDASTDST-SLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
           A  P +GDALLF+S++PD +T+ T + H  CPV+ G KW+ATKWIH R F +P +  +  
Sbjct: 159 AFAPKRGDALLFWSINPDGNTEDTHASHTGCPVLSGVKWTATKWIHARPF-RPNEMADPG 217

Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            C DE  NC  WA  G+C+KN  YMV +  S G CRKSC  C
Sbjct: 218 VCYDESPNCPEWAARGDCEKNSDYMVVNAVSPGVCRKSCGAC 259


>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
          Length = 549

 Score =  196 bits (499), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 96/200 (48%), Positives = 135/200 (67%), Gaps = 10/200 (5%)

Query: 27  QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
           +D +V+ IE RI+ W+FLP ENGE +Q+L Y   ++        +++     GGH +AT+
Sbjct: 357 EDIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRRGS-----IKEEPKSSTGGHWLATI 411

Query: 87  LMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
           L+YLS V++GGETVFP SE+  +Q+++G  S+C+  GYAV+P KG+ALL F+L PD   D
Sbjct: 412 LIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCS--GYAVRPAKGNALLLFNLRPDGEID 469

Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNP 203
             S +  CPV+EGEKW A K IH+R  D P+     +D+C DED  CV WA +GEC +NP
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHIHLRKLDSPKSSLASEDECTDEDDRCVSWAASGECDRNP 529

Query: 204 LYMVGSKSSRGYCRKSCKVC 223
           ++M+GS    G CRKSC+VC
Sbjct: 530 VFMIGSSDYYGSCRKSCRVC 549


>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 274

 Score =  195 bits (495), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 99/233 (42%), Positives = 137/233 (58%), Gaps = 17/233 (7%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+ +   +RTS GMF+ + QD +VA IE RI+ WT LP E+ E +Q+L Y HGQ Y  H+
Sbjct: 43  GEGVVDNIRTSYGMFIRRLQDPVVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHY 102

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV------SQSRDGNWSECARRG 121
           D   DK N+     R+AT LMYLS VE+GGET FP++ V       +     +S+CA+  
Sbjct: 103 D-SGDKSNEPGPKWRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEKVGDKFSDCAKGN 161

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK-------- 173
            A KP  GDA+LF+S +P+ + D  ++H  CPVI+G KW+A  W+H   F          
Sbjct: 162 VAAKPKAGDAVLFYSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHDIPFRPSEISGMVQ 221

Query: 174 --PEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
             P+ EP+   C D    CV WA AGEC+ N  +M+G   + G CRK+CK C+
Sbjct: 222 RIPDNEPDAGTCTDLHPRCVEWAAAGECEHNKGFMMGGPDNLGTCRKTCKACE 274


>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 156

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 88/154 (57%), Positives = 119/154 (77%), Gaps = 3/154 (1%)

Query: 21  MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
           MFL + +D+I+ +IE RIA +TF+P ENGE +Q+LHY  G+KYEPH+D+F D+ N + GG
Sbjct: 1   MFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGG 60

Query: 81  HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECARRGYAVKPMKGDALLFFSL 137
            R+ATVLMYLS VE+GGETVFP ++ + S    W   SECAR+G ++KP  GDALLF+S+
Sbjct: 61  QRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSM 120

Query: 138 HPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            PDA+ D++SLHG CPVI G KWS+TKW+H+  +
Sbjct: 121 RPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEY 154


>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 91/200 (45%), Positives = 133/200 (66%), Gaps = 5/200 (2%)

Query: 28  DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
           DE+ A IE RI+AWTFLP EN E ++++ Y+  +  +  +++F +K   + G   +ATVL
Sbjct: 119 DEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEPLMATVL 177

Query: 88  MYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDS 145
           ++LS+V +GGE  FP SE+  SQS+ G  S+C      ++P+KG+A+LFF++HP+AS D 
Sbjct: 178 LHLSNVTRGGELFFPESELKNSQSKSGILSDCTESSSGLRPVKGNAILFFNVHPNASPDK 237

Query: 146 TSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--DCVDEDLNCVVWAKAGECKKNP 203
           +S +  CPV+EGE W ATK+ H+R   +     + D  +C DED NC  WA  GEC++NP
Sbjct: 238 SSSYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCPKWASIGECQRNP 297

Query: 204 LYMVGSKSSRGYCRKSCKVC 223
           +YM+GS    G CRKSC VC
Sbjct: 298 IYMIGSPDYYGTCRKSCNVC 317


>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 326

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 92/173 (53%), Positives = 122/173 (70%), Gaps = 3/173 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+G  + S  RTSSG FL +  D IV +IE RIA +TF+P E+GE   +LHYE GQ
Sbjct: 152 VIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLHYEVGQ 211

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH+D+F D  +    G RIAT+LMYLS VE+GGETVFPN++ + S    W+E   C 
Sbjct: 212 KYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCG 271

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + G ++KP  G+A+LF+S+ PDA+ D +SLHG+CPVI+G+KW   KW+HV  F
Sbjct: 272 KGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEF 324


>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
 gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
          Length = 364

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 104/234 (44%), Positives = 138/234 (58%), Gaps = 20/234 (8%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+ +   +RTS GMF+ +  D I+A IE RI+ WT LP E+ E +Q+L Y HGQ Y  H+
Sbjct: 90  GEGVVDNIRTSFGMFIRRLSDPIIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHY 149

Query: 68  DFFRDKMNQQLGGH-RIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSECARRG 121
           D      +  +G   R+AT LMYLS VE+GGET FP + V        R G  SECA+  
Sbjct: 150 D--SGASSDHVGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGH 207

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE------ 175
            A KP  GDA+LF+S  P+ + D  ++H  CPVI+G KW+A  W+H   F +PE      
Sbjct: 208 VAAKPKAGDAVLFYSFLPNNTMDPAAMHTGCPVIKGIKWAAPVWMHDIPF-RPEEVQGGK 266

Query: 176 -----KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
                ++PE   CVD    C  WA AGEC+KNP+YM G  +S G CRKSC+ C+
Sbjct: 267 QLIMDRDPEAGLCVDGHPRCGEWAAAGECEKNPMYMAGGPNSLGTCRKSCRTCE 320


>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
           Group]
          Length = 343

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 92/157 (58%), Positives = 118/157 (75%), Gaps = 3/157 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G S  S VRTSSGMFL + QD+I+ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 147 VVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQ 206

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
           KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP+S+ + S        SECA
Sbjct: 207 KYEPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECA 266

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVI 155
           ++G AVKP  GDALLF+S+ PD S D+TSLHG  P++
Sbjct: 267 KKGLAVKPKMGDALLFWSMRPDGSLDATSLHGEIPIL 303


>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 279

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 89/170 (52%), Positives = 127/170 (74%), Gaps = 3/170 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS+ S  RTSSG F+ +  D+I++ IE RIA +TF+P E+GE + ILHYE GQ
Sbjct: 107 VVDDTTGKSVNSSARTSSGTFIDRGYDKILSDIEKRIADFTFIPVEHGEDVNILHYEVGQ 166

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KY+ H D+F D++N + GG RIAT+LMYLS VE+GGETVFP+++ + S    W+E   C 
Sbjct: 167 KYDFHTDYFEDEVNTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCG 226

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           ++G ++KP  G+A+LF+ + PDA+ D  S+HG+CPVI+G+KWS TKW+ V
Sbjct: 227 KKGLSIKPKMGNAILFWGMKPDATVDPLSVHGACPVIKGDKWSCTKWMRV 276


>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
          Length = 322

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 110/244 (45%), Positives = 145/244 (59%), Gaps = 17/244 (6%)

Query: 1   MVADNESGKSIASEVR---TSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY 57
           +V+ + SGK  +   R   +SSG FL+K QD +VA +E RI   T LP  + E +Q+L Y
Sbjct: 52  VVSRDGSGKLDSVRTRQGLSSSGTFLTKRQDSVVAGVEDRIELATHLPFSHSEQLQVLKY 111

Query: 58  EHGQKYEPHFDFFRDKMNQQL-------GGHRIATVLMYLSHVEKGGETVFPNS----EV 106
           E GQKY  H+D        QL       GG R AT+LMYLS VE+GGET FP+     E 
Sbjct: 112 ELGQKYSAHYDVHGSNEQAQLAIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEG 171

Query: 107 SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA-STDSTSLHGSCPVIEGEKWSATKW 165
           +Q++   +SEC  RG AVKP KGDA+LF+SL  D  S D  SLH  CPV +G K+SAT W
Sbjct: 172 AQAQP-PYSECGSRGVAVKPRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAW 230

Query: 166 IHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKP 225
           IHV  +           C D +  C  WA  GEC++N ++M G+ + RG+CR SCKVC+P
Sbjct: 231 IHVEPYSN-TGPLHPGFCRDNNAKCPEWAALGECERNVVFMRGNGTYRGHCRLSCKVCQP 289

Query: 226 SSVS 229
            + +
Sbjct: 290 CAAN 293


>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score =  189 bits (481), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 89/198 (44%), Positives = 130/198 (65%), Gaps = 6/198 (3%)

Query: 28  DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
           DE+ A IE RI+AWTFLP EN E ++++ Y+  +  +  +++F +K   + G   +ATVL
Sbjct: 119 DEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEPLMATVL 177

Query: 88  MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTS 147
           ++LS+V +GGE  FP SE   S+ G  S+C      ++P+KG+A+LFF++HP+AS D +S
Sbjct: 178 LHLSNVTRGGELFFPESE---SKSGILSDCTESSSGLRPVKGNAILFFNVHPNASPDKSS 234

Query: 148 LHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--DCVDEDLNCVVWAKAGECKKNPLY 205
            +  CPV+EGE W ATK+ H+R   +     + D  +C DED NC  WA  GEC++NP+Y
Sbjct: 235 SYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCPKWASIGECQRNPIY 294

Query: 206 MVGSKSSRGYCRKSCKVC 223
           M+GS    G CRKSC VC
Sbjct: 295 MIGSPDYYGTCRKSCNVC 312


>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 309

 Score =  189 bits (481), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 96/221 (43%), Positives = 139/221 (62%), Gaps = 13/221 (5%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G    + ++ +S    S   D+++A IE RI+AWTF+P EN + +Q++HY   +  E HF
Sbjct: 97  GDGSRNNIQLASSESRSHIYDDLLARIEERISAWTFIPKENSKPLQVMHYGIEEARE-HF 155

Query: 68  DFFRDKM---NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
           D+F +K    N  L    +AT+++YLS+V +GGE +FP SE+   +D  WS+C +    +
Sbjct: 156 DYFDNKTLISNVSL----MATLVLYLSNVTRGGEILFPKSEL---KDKVWSDCTKDSSIL 208

Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--D 182
           +P+KG+A+L F+ H +AS DS S HG CPV+EGE W ATK   VR  ++ +  P+ D  D
Sbjct: 209 RPVKGNAVLIFNAHLNASADSRSTHGRCPVLEGEMWCATKQFLVRATNEEKSLPDSDGSD 268

Query: 183 CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           C DED NC  WA  GEC++NP++M GS    G CRKSC  C
Sbjct: 269 CTDEDDNCPKWAALGECQRNPIFMTGSPDYYGTCRKSCNAC 309


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 100/232 (43%), Positives = 137/232 (59%), Gaps = 17/232 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G+     +RTS   FL++ +  +V  +E R++ +T LP  NGE MQIL Y  G+
Sbjct: 117 VVDSITGEIKTDPIRTSKQTFLARGKYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGE 176

Query: 62  KYEPHFDFFRD--KMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSE----VSQSRDG 112
           KY  H D      K  QQL   GG R+ATVL+YL   E+GGET FP+SE     S+    
Sbjct: 177 KYSAHHDVGEKNTKSGQQLSADGGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQ 236

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF- 171
            +SECA+ G A KP +GD LLFFS+ P+   D  S+H  CPV++G KW+ATKWIH R F 
Sbjct: 237 KFSECAKNGVAFKPKRGDGLLFFSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPFH 296

Query: 172 -DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKV 222
              P  +P  + C + D  C  WA AGEC++NP +M  +      C+ +C+V
Sbjct: 297 YKLPNPKPPKEGCENTDERCKGWANAGECERNPGFMTKN------CKWACRV 342


>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
          Length = 387

 Score =  186 bits (472), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 89/157 (56%), Positives = 117/157 (74%), Gaps = 3/157 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 136 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQ 195

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 196 KYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECA 255

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVI 155
           R+G AVKP  GDALLF+S+ PDA+ D  SLH +  V 
Sbjct: 256 RKGLAVKPKMGDALLFWSMKPDATLDPLSLHDTLRVF 292


>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
 gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
          Length = 303

 Score =  186 bits (471), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 95/201 (47%), Positives = 131/201 (65%), Gaps = 12/201 (5%)

Query: 27  QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGH-RIAT 85
           +D IV++IE RI+ W+FLP + GE+MQIL      KYE +   + +  +Q   GH R+ T
Sbjct: 111 EDTIVSTIEDRISVWSFLPKDFGESMQIL------KYEVNKSDYNNYESQSSSGHDRLVT 164

Query: 86  VLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
           VLMYLS V++GGET FP SE+  ++      SECA  GYAV+P++G+A+L F+L PD   
Sbjct: 165 VLMYLSDVKRGGETAFPRSELKGTKVELAAPSECA--GYAVQPVRGNAILLFNLKPDGVI 222

Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKN 202
           D  S +  C V+EGE+W A K IH+R  D P+     +D+C DED  CV WA  GEC +N
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHIHLRKIDTPKSSLVSEDECTDEDDRCVSWAAGGECDRN 282

Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
           P++M+G+    G CRKSC+VC
Sbjct: 283 PIFMIGTPDYYGSCRKSCRVC 303


>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
          Length = 376

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 88/151 (58%), Positives = 115/151 (76%), Gaps = 3/151 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 136 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQ 195

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
           KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S        SECA
Sbjct: 196 KYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECA 255

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLH 149
           R+G AVKP  GDALLF+S+ PDA+ D  SLH
Sbjct: 256 RKGLAVKPKMGDALLFWSMKPDATLDPLSLH 286


>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
 gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
          Length = 310

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 95/222 (42%), Positives = 138/222 (62%), Gaps = 10/222 (4%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY--EHGQ 61
           D++SG+   + +  SS   L+   D I++ IE R++AWT LP EN + +Q++HY  E  +
Sbjct: 97  DDDSGRIERNRLFASSTSLLN-MDDNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAK 155

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
            Y   FD+F +K         +AT++ YLS+V +GGE  FP SEV   ++  WS+C +  
Sbjct: 156 NY---FDYFGNKSAIISSEPLMATLVFYLSNVTQGGEIFFPKSEV---KNKIWSDCTKIS 209

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
            +++P+KG+A+LFF++HP+ S D  S H  CPV+EGE W ATK  ++R   K   + E  
Sbjct: 210 DSLRPIKGNAILFFTVHPNTSPDMGSSHSRCPVLEGEMWYATKKFYLRAI-KVFSDSEGS 268

Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           +C DED NC  WA  GEC+KNP+YM+GS    G CRKSC  C
Sbjct: 269 ECTDEDENCPSWAALGECEKNPVYMIGSPDYFGTCRKSCNAC 310


>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 336

 Score =  184 bits (466), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 98/229 (42%), Positives = 134/229 (58%), Gaps = 21/229 (9%)

Query: 14  EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           ++RTS GMF+ +  D +V  IE RI+ WT LP E+ E +QIL Y HGQ Y  H+D     
Sbjct: 67  DIRTSYGMFIRRLSDPVVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYD--SGA 124

Query: 74  MNQQLGGH-RIATVLMYLSHVEKGGETVFPNSEV------SQSRDGNWSECARRGYAVKP 126
            +  +G   R+AT LMYLS VE+GGET FP++ V       +     +S+CA+   A KP
Sbjct: 125 SSDHVGPKWRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKP 184

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE----------- 175
             GDA+LF+S +P+ + D  S+H  CPVI+G KW+A  W+H   F +PE           
Sbjct: 185 KAGDAVLFYSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMHDIPF-RPEEISGMTQHNMD 243

Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
           ++P+   C D    C  WA AGEC+ N  YM G  ++ G CRKSCKVC+
Sbjct: 244 RDPDAGTCTDLHARCTEWAAAGECENNKAYMCGGSNNLGACRKSCKVCE 292


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score =  182 bits (461), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 88/169 (52%), Positives = 120/169 (71%), Gaps = 3/169 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  +G+   S  RTSSGMFL + +D+IV +IE RIA  T +P ENGE + ++HY  G
Sbjct: 148 LVVDGVTGEVKESSSRTSSGMFLDRGKDKIVQNIERRIADITSVPIENGEGLHVIHYGVG 207

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           QK EPH+D+  D +  + GG R+ATVLMYLS VE+GGETVFP+   +Q    + S+C+  
Sbjct: 208 QKCEPHYDYTSDGVVTKNGGPRVATVLMYLSDVEEGGETVFPD---AQPNFTSVSKCSGD 264

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           G +VKP  GDALLF+S+ PD + D++SLHG  PVI G KW++TKW+H+R
Sbjct: 265 GLSVKPKMGDALLFWSMKPDGTLDTSSLHGGSPVIRGNKWASTKWLHLR 313



 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 75/171 (43%), Positives = 102/171 (59%), Gaps = 18/171 (10%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  +GK   S  RTSSG FL + +D+IV +IE RIA  T +P    + M        
Sbjct: 393 LVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIADITSIPRMARDFML------- 445

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
                   F    +  + GG R+ATVLMYLS VE+GGETVFPN++ + +    + E   +
Sbjct: 446 --------FTAGGVVTKNGGPRVATVLMYLSDVEEGGETVFPNAKPNINSVSKYPE---K 494

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           G +VKP  GDALLF S+ PD + D++SLHG  PVI G KW++TKW+H+  F
Sbjct: 495 GLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIRGNKWASTKWLHLTEF 545


>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
          Length = 325

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/234 (43%), Positives = 135/234 (57%), Gaps = 22/234 (9%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+S+    RTS GMF+ +  DE+V+++E R+A WT     + E +Q+L Y   Q+Y+ HF
Sbjct: 74  GESVVDNYRTSYGMFIRRHHDEVVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHF 133

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE----VSQSRDGNWSECARRGYA 123
           D   D         R ATVL+YLS VE GGET FPNSE          G +SECA+   A
Sbjct: 134 DSLDDD------SPRTATVLIYLSDVESGGETTFPNSEWIDPALPKALGPFSECAQGHVA 187

Query: 124 VKPMKGDALLFFSLHPDA-STDSTSLHGSCPVIEGEKWSATKWIHVRNFD--------KP 174
           +KP +GDA++F SL+PD  S D  +LH +CPVI G K+ A  WIH + F          P
Sbjct: 188 MKPKRGDAIVFHSLNPDGRSHDQHALHTACPVIVGVKYVAIFWIHTKPFRPEQLKGPLAP 247

Query: 175 EKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCK---VCKP 225
           E     +DCVD D  C  WA +GEC +NP +M G+ ++ G CR SC    VCKP
Sbjct: 248 EPPMVPEDCVDADPGCPGWAASGECDRNPGFMRGAATTLGTCRASCGDCVVCKP 301


>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
 gi|194704960|gb|ACF86564.1| unknown [Zea mays]
 gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
          Length = 207

 Score =  180 bits (457), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 87/125 (69%), Positives = 100/125 (80%), Gaps = 2/125 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E MQ+L YE G
Sbjct: 81  MVADNESGKSVKSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPG 140

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F D++NQ  GGHR ATVLMYLS V +GGETVFPN++   SQ +D  +SECA
Sbjct: 141 QKYEPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECA 200

Query: 119 RRGYA 123
            +G A
Sbjct: 201 HKGLA 205


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 102/219 (46%), Positives = 133/219 (60%), Gaps = 13/219 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G+S    +RTS   FL++   +IV  +E R+A  T LP  +GE MQIL Y  GQ
Sbjct: 35  VIDSVTGQSKVDPIRTSEQTFLNRGTWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQ 94

Query: 62  KYEPHFDF--FRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRD 111
           KY+ H D         +QL   GGHR+ATVL+YLS VE+GGET FP+SE     + +  +
Sbjct: 95  KYDAHHDVGELTSASGKQLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAE 154

Query: 112 GN-WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
           G  WS+CA    AVKP KGD LLF+S++ + + D  S+H  CPVI GEKW+ATKWIH R 
Sbjct: 155 GQKWSDCAEGNVAVKPRKGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWIHARP 214

Query: 171 F--DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
           F    P        C ++   C  WA AGECKKNP +M+
Sbjct: 215 FRWTAPPPPKAPPGCDNKHELCKAWANAGECKKNPGFML 253


>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 207

 Score =  180 bits (457), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 87/125 (69%), Positives = 100/125 (80%), Gaps = 2/125 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E MQ+L YE G
Sbjct: 81  MVADNESGKSVKSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPG 140

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
           QKYEPHFD+F D++NQ  GGHR ATVLMYLS V +GGETVFPN++   SQ +D  +SECA
Sbjct: 141 QKYEPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECA 200

Query: 119 RRGYA 123
            +G A
Sbjct: 201 HKGLA 205


>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
          Length = 458

 Score =  179 bits (455), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 95/237 (40%), Positives = 134/237 (56%), Gaps = 23/237 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+G +  S++RTS+G F+    ++++  +E R+A ++ LP ++ EA Q+L YE  Q
Sbjct: 215 VVDAETGGTAKSDIRTSTGSFVGIGANDLMKKLEKRVATFSMLPVKHQEATQVLRYEVKQ 274

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
           +Y  H+D+F  K    +  +RI T+LMYL   E GGETVFPN+EV   R       N+SE
Sbjct: 275 EYRAHYDYFFHKGG--MANNRIVTILMYLHEPEFGGETVFPNTEVPLERAEKGWGKNFSE 332

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           C  RG A    KGDAL+F+S+ P    D  S H  CPV+ GEKW+ATKWIHV   ++  +
Sbjct: 333 CGNRGRAAVVRKGDALIFWSMKPGGELDPGSSHAGCPVVRGEKWTATKWIHVNPTNQWNQ 392

Query: 177 E----------PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                         + C D +  C  WA+ GEC  NP +MV S      C+ SC+ C
Sbjct: 393 NNHKVHYAGGPANSETCKDTNAACPGWAEGGECTANPGFMVNS------CKVSCRQC 443


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score =  179 bits (453), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 98/236 (41%), Positives = 139/236 (58%), Gaps = 22/236 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSK----AQDEIVASIEARIAAWTFLPPENGEAMQILHY 57
           V D  +G S  S +RTS+G F+        +++V  IE RIAAWT +P  +GE +Q+L Y
Sbjct: 196 VVDASNGGSSFSNIRTSTGSFVPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRY 255

Query: 58  EHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-DGNWSE 116
           + GQ+Y+ HFD+F  +   +   +RIATVLMYLS V+ GGETVFP++E  Q + +     
Sbjct: 256 QIGQEYQSHFDYFFHEGGMK--NNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHA 313

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN---FDK 173
           CA+ G  V P KGDA+LF+++      D  S H  CPV+ GEKW+ATKW+HV +   FD 
Sbjct: 314 CAKNGITVIPKKGDAILFWNMKVGGDLDGGSTHAGCPVVLGEKWTATKWLHVSSSTEFDA 373

Query: 174 PE------KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            +      +E     C + ++ C VWA+  EC++NP YM      R  C  SC +C
Sbjct: 374 RQRVLREGRETNFGGCRNANIQCQVWAEQNECERNPQYM------RDTCHLSCGMC 423


>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 266

 Score =  177 bits (450), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 86/152 (56%), Positives = 113/152 (74%), Gaps = 3/152 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKS  S VRTSSG FL++ +D+ +  IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 114 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 173

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 174 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 233

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           + G +VKP  GDALLF+S+ PDA+ D +SLHG
Sbjct: 234 KGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 265


>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 86/152 (56%), Positives = 113/152 (74%), Gaps = 3/152 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKS  S VRTSSG FL++ +D+ +  IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 175 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 234

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           + G +VKP  GDALLF+S+ PDA+ D +SLHG
Sbjct: 235 KGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 266


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 87/175 (49%), Positives = 119/175 (68%), Gaps = 10/175 (5%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  +G+ I + VRTSSG FL + +D+IV ++E RIA  T +P ENGE +QI+HYE G
Sbjct: 121 LVVDGVTGQGILNSVRTSSGTFLERGKDKIVQNVERRIADITSIPIENGEGLQIIHYEVG 180

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR- 119
           QK+EPH+D+  +      GG R+ATVLMYLS VE+GGETVFPN++       N++  ++ 
Sbjct: 181 QKFEPHYDYNFNWRITNNGGPRVATVLMYLSDVEEGGETVFPNAK------PNFNSVSKY 234

Query: 120 ---RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
              +G  VKP  GDALLF+S+ PD S D+ SLHG  PVI G KW++ K +H+  F
Sbjct: 235 HPGKGLVVKPKMGDALLFWSVKPDGSLDTASLHGGSPVIRGSKWASNKLLHLTEF 289


>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
 gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
          Length = 147

 Score =  176 bits (447), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 85/152 (55%), Positives = 107/152 (70%), Gaps = 5/152 (3%)

Query: 21  MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
           MFL + QD IV +IE RIA +T +P ENGE +Q+LHY  GQK+EPHFD+       ++GG
Sbjct: 1   MFLKRGQDTIVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGG 60

Query: 81  HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPD 140
            R AT LMYLS VE+GGETVFPN+    S     +  A+ G +VKP  GDALLF+S+ PD
Sbjct: 61  PRKATFLMYLSDVEEGGETVFPNATAKGS-----APSAKSGISVKPKMGDALLFWSMKPD 115

Query: 141 ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
            S D  SLHG+ PVI+G+KWSATKWIHV  ++
Sbjct: 116 GSLDPKSLHGASPVIKGDKWSATKWIHVNKYN 147


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score =  176 bits (446), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 82/170 (48%), Positives = 111/170 (65%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  SG S+ S++RTS GMF  + +D I+ ++E R+A WT  P   GE++Q+L Y   Q
Sbjct: 73  VVDTGSGGSVVSDIRTSDGMFFERGEDAIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQ 132

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           KY+ H+D+F  K     GG+R ATVL+YL+  E+GGETVFP        +  +SECA+  
Sbjct: 133 KYDSHWDYFFHKDGSSNGGNRWATVLLYLTETEEGGETVFPKIPAPNGINVGFSECAKYN 192

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            AVKP KGDALLF S+ P    +  S+HG+CPVI GEK+S TKWIH  ++
Sbjct: 193 LAVKPHKGDALLFHSMKPTGELEERSMHGACPVIRGEKFSMTKWIHAGHY 242


>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 196

 Score =  176 bits (446), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 86/177 (48%), Positives = 121/177 (68%), Gaps = 16/177 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
             D+E+GKS+ +  RTSSG F+++  D+I+ +IE RIA +TF+P ENGE++ ILHYE GQ
Sbjct: 33  TVDDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILHYEVGQ 92

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
           KYEPH DFF D++N + GG             E+GGETVFP +E + S    W+E   C 
Sbjct: 93  KYEPHPDFFTDEINTKNGG-------------EQGGETVFPFAEGNFSSVPWWNELSDCG 139

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           ++G ++KP  GDALLF+S+ PD + D  S+HG+CPVI+G+KWS TKW+ V  +  P+
Sbjct: 140 KKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRVGKWSIPK 196


>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
 gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
          Length = 262

 Score =  174 bits (441), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 101/231 (43%), Positives = 136/231 (58%), Gaps = 20/231 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V   +    +  +VRTS G FL K  D+++  IE R+  ++ +  EN E +Q+L Y  GQ
Sbjct: 38  VVGGKDDTGVLDDVRTSFGTFLPKKYDDVLYGIERRVEDFSQISYENQEQLQLLKYHDGQ 97

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE----VSQSRDG---NW 114
           +Y+ H    +D +    GG RIATVLM+L   EKGGET FP  +    V+Q   G     
Sbjct: 98  EYKDH----QDGLTSPNGGRRIATVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDEL 153

Query: 115 SECA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           S+CA    RG AVKP +GDA+LFFS   +  +D  S H SCP + G KW+ATKWIH + F
Sbjct: 154 SDCAWRDGRGLAVKPRRGDAVLFFSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEKRF 213

Query: 172 DKPE-KEPEDDDCVDED-LNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
           D    +EP+   CVDE+  NC  WAK+GEC  NP YM+G ++  G C +SC
Sbjct: 214 DTGVWREPK---CVDEEPANCPGWAKSGECANNPAYMLGGETP-GKCLRSC 260


>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
 gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 93/223 (41%), Positives = 126/223 (56%), Gaps = 18/223 (8%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           + +  ++RTS G FL +AQD ++ +IE R+A W+ +PP + E MQ+L Y    KY PH D
Sbjct: 84  EGVVDDIRTSYGTFLRRAQDPVIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHID 143

Query: 69  FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS--EVSQSRDGNWSECARRGYAVKP 126
                     G  R+ATVLMYL   E  G  + P S  E   +   N S CA+   A KP
Sbjct: 144 ----------GLERVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYKP 192

Query: 127 MKGDALLFFSLHPD-ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE----PEDD 181
            +GDAL+FF + PD  +TD  S+H  CPV+ G KW+A KWIH   F +  +     P+  
Sbjct: 193 KRGDALMFFDVKPDYTTTDGHSMHTGCPVVAGVKWNAVKWIHGTPFRRMRRNKPPLPDPG 252

Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
            C D    C  WA+AGEC+ NP YM+GS +  G CR +CK C+
Sbjct: 253 VCTDLHEMCDTWARAGECQNNPGYMLGSNTGIGNCRLACKDCE 295


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score =  174 bits (440), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 82/171 (47%), Positives = 111/171 (64%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  SG S+ S++RTS GMF  + +D I+ ++E R+A WT  P   GEA+Q+L Y   Q
Sbjct: 73  VVDTASGSSVVSDIRTSDGMFFERGEDAILEAVEQRLADWTMTPIWAGEALQVLRYRKDQ 132

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           KY+ H ++F  K     GG+R ATVL YL+  E+GGETVFP        +  +SECA+  
Sbjct: 133 KYDSHVNYFFHKEGSANGGNRWATVLTYLTDTEEGGETVFPKIPAPGGVNVGFSECAKYN 192

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
            AVKP KGDA+LF S+  +   +  SLHG+CPVI+GEK+S TKWIH  ++D
Sbjct: 193 LAVKPRKGDAILFHSMKTNGQLEERSLHGACPVIKGEKFSMTKWIHAGHYD 243


>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
          Length = 190

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 78/99 (78%), Positives = 93/99 (93%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SG+S+ SEVRTSSGMFLSK QD+IV+++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 92  MVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENG 151

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGET 99
           QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGET
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGET 190


>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
 gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
          Length = 797

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 96/232 (41%), Positives = 137/232 (59%), Gaps = 12/232 (5%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D+++G+S   ++RTS G    + +D ++A IE RIA WT LPPE+GE MQIL Y  G
Sbjct: 529 LVVDSQTGQSKLDDIRTSYGAAFGRGEDPVIAEIEERIAEWTHLPPEHGEPMQILRYVDG 588

Query: 61  QKYEPHFDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNS---EVSQSRDGNW 114
           QKY+ H+D+F D ++ +   + G+R ATVL+YLS VE GGET  P +   ++S     N 
Sbjct: 589 QKYDAHWDWFDDPVHHRSYLVDGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENP 648

Query: 115 SEC-ARRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNF- 171
           S C A+ G +++P KGDALLF+ +  +    D  +LH SCP ++G KW+ATKWIH + + 
Sbjct: 649 SPCAAKMGLSIRPRKGDALLFYDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSKPYM 708

Query: 172 DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            + +       C D   +C      G C  +   MVG     G CRKSC  C
Sbjct: 709 GRFDPLRTAGVCRDTAQDCAALVAEGRCTSDLDTMVGPA---GKCRKSCGDC 757


>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
 gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
          Length = 304

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 97/232 (41%), Positives = 133/232 (57%), Gaps = 19/232 (8%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+S+    RT     + + QD++V  IE R+AAWT +   + E MQIL Y  GQ+Y+ H 
Sbjct: 43  GQSVEDSYRTLYTAGVRRYQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHA 102

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRDGNWSECARRGY 122
           D  RD       G R+ATVL+YL+  E GGET FP+S+     ++++   N+S CA+   
Sbjct: 103 DTLRDDE----AGVRVATVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHV 158

Query: 123 AVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE------ 175
           A  P +GDALLF+S+ PD +T D  + H  CPV+ G KW+ATKWIH + F   E      
Sbjct: 159 AFAPKRGDALLFWSIGPDGTTEDYHASHTGCPVLSGVKWTATKWIHAKPFRPQEMAAGRP 218

Query: 176 KEPEDDD---CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
            +P   D   C DE   C  WA  G+C+KN  YM+ +  S G CRK+C  CK
Sbjct: 219 HQPYVRDPGVCYDESPRCAEWAARGDCEKNRDYMIVNAVSPGVCRKACGACK 270


>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
 gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
          Length = 208

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 79/125 (63%), Positives = 102/125 (81%), Gaps = 2/125 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKS+AS+ RTSSG FL+K +DEIV++IE R+AAWTFLP EN E++Q+L YE G
Sbjct: 71  MVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
           QKY+ HFD+F D+ N +LGG R+ATVLMYL+ V+KGGE VFP++E S  Q +D  WS+C+
Sbjct: 131 QKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVKKGGEAVFPDAEGSHLQYKDETWSDCS 190

Query: 119 RRGYA 123
           R G A
Sbjct: 191 RSGLA 195


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 91/178 (51%), Positives = 117/178 (65%), Gaps = 14/178 (7%)

Query: 2   VADNESGKS---IASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILH 56
           V D  +GK+   I S+VRTS+GMFLS       ++ +IE RIA ++ +P ENGE +Q+L 
Sbjct: 97  VVDTSTGKARHGIESKVRTSTGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLR 156

Query: 57  YEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           YE  Q Y+PH D+F D+ N + GG R+ATVLMYLS VE+GGET+FP+       DG   E
Sbjct: 157 YEPNQYYKPHHDYFSDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVG-----DGE-CE 210

Query: 117 CA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           C    R+G  VKP KGDA+LF+S   D + DS SLHG C V+ GEKWSATKW+    F
Sbjct: 211 CGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGCSVLRGEKWSATKWLRQSRF 268


>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
 gi|255644463|gb|ACU22735.1| unknown [Glycine max]
          Length = 285

 Score =  172 bits (437), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 84/167 (50%), Positives = 115/167 (68%), Gaps = 3/167 (1%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V DNESG+ I +  RTS+   + + +D+IV +IE RIA  TF+P E+GE + ++ Y  G
Sbjct: 119 LVIDNESGEGIETSYRTSTEYVVERGKDKIVRNIEKRIADVTFIPIEHGEPLHVIRYAVG 178

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SEC 117
           Q YEPH D+F ++ +   GG RIAT+LMYLS+VE GGETVFP +  + S    W   SEC
Sbjct: 179 QYYEPHVDYFEEEFSLVNGGQRIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSEC 238

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATK 164
            + G ++KP  GDALLF+S+ PDA+ D  +LH +CPVI+G KWS TK
Sbjct: 239 GQTGLSIKPKMGDALLFWSMKPDATLDPLTLHRACPVIKGNKWSCTK 285


>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 88/196 (44%), Positives = 123/196 (62%), Gaps = 9/196 (4%)

Query: 28  DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
           D +VA IE +I+AWTFLP ENG ++++  Y   +K     D+F ++ +  L    +ATV+
Sbjct: 104 DPVVAGIEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLRESLLATVV 162

Query: 88  MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTS 147
           +YLS+  +GGE +FPNSEV   +      C+  G  ++P+KG+A+LFFS   +AS D TS
Sbjct: 163 LYLSNTTQGGELLFPNSEVKPKKS-----CSEDGNILRPVKGNAVLFFSRLLNASLDETS 217

Query: 148 LHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
            H  CPV++GE   ATK I+ +   K  +  E+ +C DED NC  WA  GECKKNP+YM+
Sbjct: 218 THLICPVVKGELLVATKLIYAK---KQARNEENGECSDEDENCERWANLGECKKNPVYMI 274

Query: 208 GSKSSRGYCRKSCKVC 223
           GS    G CRKSC  C
Sbjct: 275 GSPDYYGTCRKSCNAC 290


>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 252

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 101/227 (44%), Positives = 127/227 (55%), Gaps = 16/227 (7%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VADN  G S+  + RTS G F+++    +VA +E R+A  T +P    E MQ+L Y +G
Sbjct: 38  VVADN--GSSVLDDYRTSYGTFINRYATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNG 95

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSEC 117
           Q Y  H D   +         R+ATVL+YLS  E GGET FP +          G +SEC
Sbjct: 96  QYYHRHTDSLEND------SPRLATVLLYLSDPELGGETAFPLAWAHPDMPKVFGPFSEC 149

Query: 118 ARRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
            +   A KP KGDALLF+S+ PD  T D  S H  CPVI G KW+AT W+H + F +PE 
Sbjct: 150 VKNNVAFKPRKGDALLFWSVKPDGKTEDPLSEHEGCPVIRGVKWTATVWVHTKPF-RPE- 207

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
             E DDC D    C  W  AGEC+KN  YM G  +  G CR SC VC
Sbjct: 208 --EWDDCTDRHKECPKWKAAGECEKNHGYMQGDANQVGSCRLSCGVC 252


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 90/174 (51%), Positives = 116/174 (66%), Gaps = 14/174 (8%)

Query: 2   VADNESGKS---IASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILH 56
           V D  +GK+   I S+VRTS+GMFLS       ++ +IE RIA ++ +P ENGE +Q+L 
Sbjct: 96  VVDTSTGKARHGIESKVRTSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLR 155

Query: 57  YEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           YE  Q Y+PH D+F D+ N + GG R+ATVLMYLS VE+GGET+FP+       DG   E
Sbjct: 156 YEPNQYYKPHHDYFSDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVG-----DGE-CE 209

Query: 117 CA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           C    R+G  VKP KGDA+LF+S   D + DS SLHG C V+ GEKWSATKW+ 
Sbjct: 210 CGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263


>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
 gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 251

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 96/225 (42%), Positives = 125/225 (55%), Gaps = 24/225 (10%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S+   +RTS G F+ +  D +V  +  R+AAWT  PPEN E +Q+L Y  GQKY  H
Sbjct: 43  NGSSVLDTIRTSYGTFIRRRHDPVVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAH 102

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS------EVSQSRDGNWSECARR 120
            D   D         R+ATVL+YL   E GGET FP+S       ++QS  G +SECA+ 
Sbjct: 103 MDSLIDD------SPRMATVLLYLHDTEYGGETAFPDSGHWLDPSLAQSM-GPFSECAQG 155

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPEKEP 178
             A +P KGDAL+F+S+ PD + D  SLH  CPV+ G KW+AT W+H    N+D   K  
Sbjct: 156 HVAFRPKKGDALMFWSIKPDGTHDPLSLHTGCPVVTGVKWTATSWVHSMPYNYDDYFKP- 214

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
               C D    C  W + GECKKNP YM        +C +SC  C
Sbjct: 215 --GACTDLHDQCKHWERMGECKKNPAYM------ESHCGRSCGAC 251


>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score =  171 bits (432), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 87/179 (48%), Positives = 111/179 (62%), Gaps = 12/179 (6%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V +   G S  S+ RTSSG ++S    E++A+IE R+AAWT LP   GE  Q++ YE GQ
Sbjct: 115 VVNEADGTSKTSDERTSSGGWVSGEDSEVMANIERRVAAWTMLPRNRGETTQVMRYEAGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS--------EVSQSRDGN 113
           +Y  H D+F D++N + GG R ATVLMYLS VE+GGETVFP          E S    GN
Sbjct: 175 EYAAHDDYFHDEVNVKNGGQRAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGN 234

Query: 114 WSECARRG----YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
             E A RG     AVKP +GDALLFF++H +   D  + H  CPV+ G KW+AT+W HV
Sbjct: 235 ACERALRGDPNVLAVKPRRGDALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRWQHV 293


>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
 gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
          Length = 201

 Score =  170 bits (430), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 79/168 (47%), Positives = 111/168 (66%), Gaps = 2/168 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    S  RTS G FL +  D IV+ IE RI++ TF+P E GE++Q++ Y+ GQ
Sbjct: 28  VIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGIEDRISSITFIPKEYGESLQVVRYKTGQ 87

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECAR 119
           K+EPH D+++   N   GGHRI T+L+YL++VE GGETVFP +  +   D   N SEC +
Sbjct: 88  KFEPHQDYYKLTENNNNGGHRIGTLLLYLTNVENGGETVFPRALANVINDYSTNTSECTK 147

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           +G  ++P +GD LLF+   P    D  S HG CPV++GEKW ATK++H
Sbjct: 148 KGIVIRPRRGDGLLFWITRPSGEIDPFSFHGGCPVVKGEKWLATKFLH 195


>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
 gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
          Length = 298

 Score =  170 bits (430), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 94/239 (39%), Positives = 128/239 (53%), Gaps = 32/239 (13%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQ------------ 53
           ++G S+   +RTS G F+ +  D ++  I  R+AAWT  PPEN E +Q            
Sbjct: 42  QNGSSVTDNIRTSYGTFIRRRHDPVIERILRRVAAWTKAPPENQEDLQAGRGEGGREKER 101

Query: 54  ILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQ 108
           +L Y  GQKY  H D   D         R+ATVL+YL   E+GGET FP+S         
Sbjct: 102 VLRYGIGQKYGAHMDSLIDD------SPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLA 155

Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           +R G +SECA+   A +P KGDAL+F+S+ PD + D  S+H  CPV++G KW+AT W+H 
Sbjct: 156 TRMGPFSECAQGHVAFRPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWVHS 215

Query: 169 RNFDKPEKEPEDDD---CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
             +        D +   C D    C VWA AGEC +NP+YM        +C  SCK C+
Sbjct: 216 MPYAYDRYISHDGEPGACTDLHDMCTVWAAAGECDRNPVYM------STHCGPSCKTCE 268


>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
          Length = 352

 Score =  169 bits (429), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 96/231 (41%), Positives = 133/231 (57%), Gaps = 23/231 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V   ++G+   S++RTS G F+ K  DE++  IE R A ++ +P  + E MQ+L Y  GQ
Sbjct: 105 VVGGQTGR--VSDIRTSFGTFIPKKYDEVLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQ 162

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--------NSEVSQSRDGN 113
           KY  H     D +  + GG RIAT+LM+L    +GGET F            + +++D  
Sbjct: 163 KYSDH----TDGLISENGGKRIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKD-Q 217

Query: 114 WSECARR---GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
           +S+C  R   G+AVKP  GDA+LFFS      TD+ S+H SCP + G KW+AT WIH R 
Sbjct: 218 FSDCGYRSGKGFAVKPKVGDAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHERP 277

Query: 171 FDKPE-KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
           FD    ++P   DC D    C  WA  GECKKNP+YM+G++   G C +SC
Sbjct: 278 FDTATWRKP---DCKDLHQECANWANRGECKKNPIYMLGNEVV-GTCSRSC 324


>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
 gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
 gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
 gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score =  169 bits (428), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 87/174 (50%), Positives = 116/174 (66%), Gaps = 9/174 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S VRTSSGMF+S  + +  ++ SIE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 98  VVDVATGKGVKSNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEP 157

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y PH D+F D  N + GG R+AT+LMYL+   +GGET FP     Q+ DG  S   +
Sbjct: 158 SQYYRPHHDYFSDTFNIKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECSCGGK 212

Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             +G  VKP KGDA+LF+S+  D  TDS S+HG CPV+EGEKWSATKW+  + F
Sbjct: 213 MVKGLCVKPNKGDAVLFWSMGLDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266


>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 259

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 98/223 (43%), Positives = 127/223 (56%), Gaps = 18/223 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQ-----ILH 56
           V D+ +G+S    +RTS   FL++    IV+ IE R+  +T LP  NGE +Q     +L 
Sbjct: 38  VVDSTTGESKVDPIRTSEQCFLNRGHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLK 97

Query: 57  YEHGQKYEPHFDF--FRDKMNQQL---GGHRIATVLMYLSHVEK--GGETVFPNSE---V 106
           Y +GQKY+ H D         +QL   GGHR+ATVL+YLS V+   GGET FP+SE    
Sbjct: 98  YSNGQKYDAHHDVGELDTASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDP 157

Query: 107 SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           +  R   WSECA    AVKP KGD LLF+S+ P+   D  S+H  CPV+ G+ W+ATKWI
Sbjct: 158 TADRGSGWSECAEDHVAVKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWI 216

Query: 167 HVRNFDKP--EKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
           H R F             C D    C  WA +GECKKNP +M+
Sbjct: 217 HARPFRHQFPPPPAAPPGCADTVAMCKSWANSGECKKNPGFML 259


>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
          Length = 267

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 87/174 (50%), Positives = 116/174 (66%), Gaps = 9/174 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S VRTSSGMF+S  + +  ++ SIE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 98  VVDVATGKGVKSNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEP 157

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y PH D+F D  N + GG R+AT+LMYL+   +GGET FP     Q+ DG  S   +
Sbjct: 158 SQYYRPHHDYFSDTFNIKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECSCGGK 212

Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             +G  VKP KGDA+LF+S+  D  TDS S+HG CPV+EGEKWSATKW+  + F
Sbjct: 213 MVKGLCVKPNKGDAVLFWSMGLDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266


>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
 gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
          Length = 274

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 135/231 (58%), Gaps = 17/231 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ESGKS+ + +RTS   FLS+  D +V  +  R+++ T LP  + E +Q+L Y  G+
Sbjct: 46  VIDSESGKSVVNPIRTSKQTFLSR-NDPVVRKVLERMSSVTHLPWYHCEDLQVLEYSAGE 104

Query: 62  KYEPHFDFFRD--KMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSE---VSQSRDGN 113
           KY+ H D   +  K   QL   GG R+AT+L+YL   E+GGET FP+SE     +++   
Sbjct: 105 KYDAHEDVGEEGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTET 164

Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNF 171
           WS+CA R  A+KP +GD L+F+S+ PD + D  +LH  CP   G KW+AT W+H    N+
Sbjct: 165 WSKCAHRRVAMKPTRGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWVHADPYNW 224

Query: 172 DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKV 222
            KP        C D+   C  WA  GEC KNP +M+ +      C+ SC+V
Sbjct: 225 IKPPDPVPTIGCEDKSDRCRGWANIGECDKNPSFMLEN------CKWSCRV 269


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 94/226 (41%), Positives = 123/226 (54%), Gaps = 30/226 (13%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G S+  ++RTS G FL + QD IV ++E R+A WT L   + E MQIL Y  GQKY  H+
Sbjct: 74  GASVEDQIRTSYGTFLKRLQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHY 133

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHV--EKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           D   +         R+ TVL+YLS V  + GGET FP                 R  A+ 
Sbjct: 134 DSLDND------SPRVCTVLLYLSDVPADGGGETAFPGV---------------RRQALY 172

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-------DKPEKEP 178
           P KGDALLF+SL PD ++D+ SLH  CP+I G KW+ATKWIH   F       ++ E   
Sbjct: 173 PKKGDALLFYSLKPDGTSDAYSLHTGCPIISGVKWTATKWIHTLPFRPHLLGKEQAEAIV 232

Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
             ++C D   +C  WA AGEC+ N  +M G   + G CR SC  C+
Sbjct: 233 YPEECKDAQADCKAWADAGECENNEQFMRGDAFTLGNCRASCGDCE 278


>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
 gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
          Length = 307

 Score =  168 bits (426), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 82/139 (58%), Positives = 104/139 (74%), Gaps = 3/139 (2%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           VRTSSG FL++  DEIV  IE RI+ +TF+PPENGE +Q+LHYE GQ+YEPH D+F D+ 
Sbjct: 161 VRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEF 220

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECARRGYAVKPMKGDA 131
           N + GG RIATVLMYLS V++GGETVFP ++ + S    W   S+C + G +V P K DA
Sbjct: 221 NVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDA 280

Query: 132 LLFFSLHPDASTDSTSLHG 150
           LLF+S+ PDAS D +SLHG
Sbjct: 281 LLFWSMKPDASLDPSSLHG 299


>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 409

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 104/243 (42%), Positives = 132/243 (54%), Gaps = 43/243 (17%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQ---ILHYEHGQKYEPHFDF 69
           S+ RTS+G FL K  D++V  +E R+ A++ LP EN E +Q   +L YE GQ+Y  H D 
Sbjct: 135 SDYRTSTGAFLPKLYDDVVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDHVDG 194

Query: 70  FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---------GNWSECA-- 118
           F      + GG R+ATVLM+L+  E+GGET FPN E S++           G  S+CA  
Sbjct: 195 F----ATENGGKRVATVLMFLAEPEEGGETAFPNGEPSEAVAARVAAQRARGELSDCAWR 250

Query: 119 -------------RRGYAVKPMKGDALLFFSLHPDASTDS-------TSLHGSCPVIEGE 158
                         RG+AVKP  GDA+LFFS   D             S H SCP   G 
Sbjct: 251 GGGGGTAGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGV 310

Query: 159 KWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCR 217
           KW+ATKWIH R F     E PE   CVD D  C  WA+ GEC KNP +M+G +++ G C 
Sbjct: 311 KWTATKWIHERAFATGTWETPE---CVDRDDGCAGWARGGECAKNPGFMLG-EATPGSCL 366

Query: 218 KSC 220
           KSC
Sbjct: 367 KSC 369


>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
 gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
 gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 291

 Score =  166 bits (420), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 85/196 (43%), Positives = 121/196 (61%), Gaps = 9/196 (4%)

Query: 28  DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
           D +VA IE +++AWTFLP ENG ++++  Y   +K     D+F ++ +  L    +ATV+
Sbjct: 105 DPVVAGIEEKVSAWTFLPGENGGSIKVRSYTS-EKSGKKLDYFGEEPSSVLHESLLATVV 163

Query: 88  MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTS 147
           +YLS+  +GGE +FPNSE+        + C   G  ++P+KG+A+LFF+   +AS D  S
Sbjct: 164 LYLSNTTQGGELLFPNSEMKPK-----NSCLEGGNILRPVKGNAILFFTRLLNASLDGKS 218

Query: 148 LHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
            H  CPV++GE   ATK I+ +   K  +  E  +C DED NC  WAK GECKKNP+YM+
Sbjct: 219 THLRCPVVKGELLVATKLIYAK---KQARIEESGECSDEDENCGRWAKLGECKKNPVYMI 275

Query: 208 GSKSSRGYCRKSCKVC 223
           GS    G CRKSC  C
Sbjct: 276 GSPDYYGTCRKSCNAC 291


>gi|356559784|ref|XP_003548177.1| PREDICTED: uncharacterized protein LOC100795761 [Glycine max]
          Length = 264

 Score =  166 bits (419), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 91/226 (40%), Positives = 134/226 (59%), Gaps = 13/226 (5%)

Query: 2   VADNESGKSIASE-VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           V +  SG    SE V TS  M     +D+I+A IE R++ W FLP E  + +Q++HY   
Sbjct: 48  VKEKSSGNGGLSEGVETSLDM-----EDDILARIEERLSVWAFLPKEYSKPLQVMHYGPE 102

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSH-VEKGGETVFPNSEVSQSRDGNWSECAR 119
           Q    + D+F +K   +L G  +AT+++YLS+ V +GG+ +FP S    S     S C+ 
Sbjct: 103 QNGR-NLDYFTNKTQLELSGPLMATIILYLSNDVTQGGQILFPESVPGSSSW---SSCSN 158

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
               ++P+KG+A+LFFSLHP AS D +S H  CPV+EG+ WSA K+ + +   + +    
Sbjct: 159 SSNILQPVKGNAILFFSLHPSASPDKSSFHARCPVLEGDMWSAIKYFYAKPISRGKVSAT 218

Query: 180 DD--DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            D  +C DED +C  WA  GEC++NP++M+GS    G CRKSC  C
Sbjct: 219 LDGGECTDEDDSCPAWAAVGECQRNPVFMIGSPDYYGTCRKSCNAC 264


>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
 gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
          Length = 311

 Score =  165 bits (418), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 127/236 (53%), Gaps = 21/236 (8%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VADN  G S+  + RTS G F+++ Q  ++A++E R+A  T  P    E MQ+L Y  G
Sbjct: 38  VVADN--GSSVLDDYRTSYGTFINRYQTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLG 95

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRDGNWS 115
           Q Y  H D   +         R+ATVL+YLS  E GGET FP +            G +S
Sbjct: 96  QYYHRHTDSLEND------SPRMATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFS 149

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
           +C +   A KP +GDALLF+S+ PD  T D  S H  CPVI G KW+AT W+H + F +P
Sbjct: 150 DCVKGNVAFKPRRGDALLFWSVKPDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQPF-RP 208

Query: 175 EKEPEDDD------CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
           E  P          C D    C  WA+AGEC  N  YM G  +  G CR++C VC+
Sbjct: 209 EDFPPQPRSRLSGLCTDRHAECPRWARAGECDNNSNYMKGDANQVGSCRRTCGVCE 264


>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
          Length = 178

 Score =  165 bits (417), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 76/106 (71%), Positives = 91/106 (85%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MVADN+SGKSI S+VRTSSG FLSK +D+IV+ IE R+AAWTFLP EN E++QILHYE G
Sbjct: 73  MVADNDSGKSIMSQVRTSSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELG 132

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV 106
           QKY+ HFD+F DK N + GGHR+ATVLMYL+ V+KGGETVFPN+ V
Sbjct: 133 QKYDAHFDYFHDKNNLKRGGHRVATVLMYLTDVKKGGETVFPNAAV 178


>gi|356530852|ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
          Length = 302

 Score =  165 bits (417), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 85/203 (41%), Positives = 123/203 (60%), Gaps = 13/203 (6%)

Query: 27  QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH---FDFFRDKMNQQLGGHRI 83
           +D+I+A IE R++ W FLP E  + +Q++HY      EP+    D+F +K   +L G  +
Sbjct: 107 EDDILARIEERLSLWAFLPKEYSKPLQVMHYGP----EPNGRNLDYFTNKTQLELSGPLM 162

Query: 84  ATVLMYLSHV-EKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAS 142
           AT+++YLS+   +GG+ +FP S     R  +WS C+     ++P+KG+A+LFFSLHP AS
Sbjct: 163 ATIVLYLSNAATQGGQILFPES---VPRSSSWSSCSNSSNILQPVKGNAILFFSLHPSAS 219

Query: 143 TDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--DCVDEDLNCVVWAKAGECK 200
            D  S H  CPV+EG  WSA K+ + +     E     D  +C DED NC  WA  GEC+
Sbjct: 220 PDKNSFHARCPVLEGNMWSAIKYFYAKPISSGEVSAISDGGECTDEDDNCPAWAAMGECQ 279

Query: 201 KNPLYMVGSKSSRGYCRKSCKVC 223
           +NP++M+GS    G CRKSC  C
Sbjct: 280 RNPVFMIGSPDYYGTCRKSCNAC 302


>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 84/178 (47%), Positives = 116/178 (65%), Gaps = 15/178 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLS--KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK I S+VRTS+GMFL+    +   + +IE RIAA++ +P +NGE +Q+L YE 
Sbjct: 115 VVDATTGKGIESKVRTSTGMFLNGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYES 174

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA- 118
            Q Y+ H D+F D+ N + GG R+AT+LMYL+   +GGET+FP     Q+ D    EC+ 
Sbjct: 175 DQYYKAHHDYFSDEFNLKRGGQRVATMLMYLTEGVEGGETIFP-----QAGD---KECSC 226

Query: 119 ----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
               + G  VKP +GDA+LF+S+  D   D TSLHG C V+ GEKWS+TKW+  R FD
Sbjct: 227 GGEMKIGVCVKPKRGDAVLFWSIKLDGQVDPTSLHGGCKVLSGEKWSSTKWMRQRAFD 284


>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score =  163 bits (412), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 83/167 (49%), Positives = 115/167 (68%), Gaps = 5/167 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK I S+VRTSSGMFL+  + +  +V +IE RI+ ++ +P ENGE MQ+L YE 
Sbjct: 118 VVDTKTGKGIKSDVRTSSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEK 177

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+PH D+F D  N + GG RIAT+LMYLS   +GGET FP   ++ S + +      
Sbjct: 178 NQYYKPHHDYFSDTFNLKRGGQRIATMLMYLSDNIEGGETYFP---LAGSGECSCGGKLV 234

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           +G +VKP+KG+A+LF+S+  D  +D  S+HG C VI GEKWSATKW+
Sbjct: 235 KGLSVKPIKGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWM 281


>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 221

 Score =  162 bits (409), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 115/214 (53%), Gaps = 29/214 (13%)

Query: 11  IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           +  ++RTS G FL +  D ++A+IE R+A W+ LP  + E MQ+L Y    KY PH D  
Sbjct: 36  VVDDIRTSYGTFLRRVPDPVIAAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHID-- 93

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
                   G  R+ATVL+YL   E+                 N S+CAR   A KP +GD
Sbjct: 94  --------GLERVATVLIYLGQAER----------------ANLSQCARGRVAYKPKRGD 129

Query: 131 ALLFFSLHPD-ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLN 189
           AL+FF   PD   TD  S+H  CPV+EG KW+A KW+H   + +P  +P    C +    
Sbjct: 130 ALMFFDTMPDYKQTDVHSMHTGCPVVEGVKWNAVKWLHGTPYGRPLPDP--GICANLHEM 187

Query: 190 CVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           C  WA  GECK NP +M+G+ +S G CR +C  C
Sbjct: 188 CETWALQGECKNNPGFMIGAGASMGSCRLACNDC 221


>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 281

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 82/167 (49%), Positives = 111/167 (66%), Gaps = 5/167 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK I S+VRTSSGMFLS  + +  ++ +IE RI+ ++ +P ENGE MQ+L YE 
Sbjct: 112 VVDANTGKGIKSDVRTSSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEK 171

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y PH D+F D  N + GG RIAT+LMYL    +GGET FP++   +   G       
Sbjct: 172 NQYYRPHHDYFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGG---KLT 228

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           +G  VKP+KG+A+LF+S+  D  +D  S+HG CPV+ GEKWSATKW+
Sbjct: 229 KGLCVKPVKGNAVLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWM 275


>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 266

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 83/169 (49%), Positives = 114/169 (67%), Gaps = 9/169 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S+VRTSSGMF++  + +  ++ +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 97  VVDVATGKGVKSDVRTSSGMFVNSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEP 156

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y PH D+F D  N + GG R+AT+LMYL+   +GGET FP     Q+ DG  S   R
Sbjct: 157 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECSCGGR 211

Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             RG  VKP KGDA+LF+S+  D +TDS S+H  C V++GEKWSATKW+
Sbjct: 212 IVRGLCVKPNKGDAVLFWSMGLDGNTDSNSIHSGCAVLKGEKWSATKWM 260


>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
 gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
          Length = 283

 Score =  160 bits (405), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 80/168 (47%), Positives = 112/168 (66%), Gaps = 5/168 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQ--DEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK + S+VRTSSGMFL+  +  + I+ +IE RIA ++ +P ENGE +Q+L YE 
Sbjct: 114 VVDVKTGKGVKSDVRTSSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEP 173

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+PH D+F D  N + GG R+AT+LMYL+   +GGET FP   ++   D        
Sbjct: 174 KQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFP---LAGDGDCTCGGKIM 230

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           +G +VKP KGDA+LF+S+  D  +D  S+HG C V+ GEKWSATKW+ 
Sbjct: 231 KGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278


>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 266

 Score =  160 bits (405), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 83/169 (49%), Positives = 113/169 (66%), Gaps = 9/169 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S+VRTSSGMF++  + +  ++ +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 97  VVDVATGKGVKSDVRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEP 156

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y PH D+F D  N + GG R+AT+LMYL+   +GGET FP     Q+ DG      R
Sbjct: 157 NQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECICGGR 211

Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             RG  VKP KGDA+LF+S+  D +TDS SLH  C V++GEKWSATKW+
Sbjct: 212 LVRGLCVKPNKGDAVLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWM 260


>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
          Length = 287

 Score =  160 bits (405), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 83/171 (48%), Positives = 114/171 (66%), Gaps = 5/171 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLS--KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK I S+VRTSSGMFLS   +   IV +IE RI+ ++ +P ENGE +Q+L Y+ 
Sbjct: 118 VVDAQTGKGIQSDVRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKK 177

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+PH D+F D  N + GG R+AT+L+YLS   +GGET FP +     R G  S    
Sbjct: 178 SQFYKPHHDYFSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCGGKSV--- 234

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
           RG +V P+KG+A+LF+S+  D  +D  S+HG C V+ GEKWSATKW+  R+
Sbjct: 235 RGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRS 285


>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score =  160 bits (404), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 81/170 (47%), Positives = 113/170 (66%), Gaps = 5/170 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK I S+VRTSSGMFLS  +   ++V +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 118 VVDVKTGKGIESKVRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEK 177

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+PH D+F D  N + GG R+AT+LMYLS   +GGET FP +   +   G       
Sbjct: 178 NQYYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCGG---KVV 234

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G +VKP+KG+A+LF+S+  D  +D +S+HG C V+ G KWSATKW+  R
Sbjct: 235 DGLSVKPIKGNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSATKWMRQR 284


>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score =  159 bits (403), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 80/168 (47%), Positives = 111/168 (66%), Gaps = 5/168 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK + S+VRTSSGMFL+  +    I+ +IE RIA ++ +P ENGE +Q+L YE 
Sbjct: 114 VVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEP 173

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+PH D+F D  N + GG R+AT+LMYL+   +GGET FP   ++   D        
Sbjct: 174 QQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFP---LAGDGDCTCGGKIM 230

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           +G +VKP KGDA+LF+S+  D  +D  S+HG C V+ GEKWSATKW+ 
Sbjct: 231 KGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278


>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 263

 Score =  159 bits (403), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 83/173 (47%), Positives = 114/173 (65%), Gaps = 15/173 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S+VRTSSGMF++  + +  +V +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 94  VVDVATGKGVKSDVRTSSGMFVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEA 153

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA- 118
            Q Y PH D+F D  N + GG R+AT+LMYL+    GGET FP     Q+ DG   EC+ 
Sbjct: 154 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVVGGETHFP-----QAGDG---ECSC 205

Query: 119 ----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                +G  VKP KGDA+LF+S+  D +TD  S+H  CPV++GEKWSATKW+ 
Sbjct: 206 GGNVVKGLCVKPNKGDAVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWMR 258


>gi|297600382|ref|NP_001049073.2| Os03g0166200 [Oryza sativa Japonica Group]
 gi|255674232|dbj|BAF10987.2| Os03g0166200, partial [Oryza sativa Japonica Group]
          Length = 135

 Score =  159 bits (402), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 73/122 (59%), Positives = 91/122 (74%), Gaps = 1/122 (0%)

Query: 102 PNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWS 161
           P + +SQ +D  WS+CA +G+AVKP KG A+LFFSL+P+A+ D  SLHGSCPVI+GEKWS
Sbjct: 13  PQARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWS 72

Query: 162 ATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCK 221
           ATKWIHVR++D+  +    D C D+   C  WA AGEC KNP YMVG+  S G+CRKSC 
Sbjct: 73  ATKWIHVRSYDENGRR-SSDKCEDQHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCN 131

Query: 222 VC 223
           VC
Sbjct: 132 VC 133


>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 82/175 (46%), Positives = 113/175 (64%), Gaps = 5/175 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK + S+ RTSSGMFLS  +    +V +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 119 VVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEK 178

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+PH D+F D  N + GG RIAT+LMYLS   +GGET FP +   +   G  +    
Sbjct: 179 NQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVP-- 236

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
            G +VKP KGDA+LF+S+  D  +D  S+HG C V+ GEKWSATKW+  ++   P
Sbjct: 237 -GLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP 290


>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 323

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/176 (47%), Positives = 114/176 (64%), Gaps = 6/176 (3%)

Query: 3   ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
           A N + + + S  RTSSG FL+K Q+++V  IE RIA +TF+P ENGE + ILHYE GQK
Sbjct: 106 AQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQK 165

Query: 63  YEPHFDFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW----SEC 117
           +EPH D+   D  + +  G R AT++MYLS V++GG TVFP ++   S    W     E 
Sbjct: 166 FEPHHDYTHPDSFSFKSLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEY 225

Query: 118 AR-RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
            +  G +VKP  GDALLF+S+ PD + D TSLH S PV++G+KW   K +HV+  D
Sbjct: 226 GKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHVKAKD 281



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 38/63 (60%), Gaps = 3/63 (4%)

Query: 90  LSHVEKGGETVFPNSEVSQSRDGNWSEC---ARRGYAVKPMKGDALLFFSLHPDASTDST 146
           + ++E+GGETVFP +    S    W +     + G ++KP  GDAL F+S+ PD + D T
Sbjct: 9   ILNIEEGGETVFPAANKCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYT 68

Query: 147 SLH 149
           SLH
Sbjct: 69  SLH 71


>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/176 (47%), Positives = 114/176 (64%), Gaps = 6/176 (3%)

Query: 3   ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
           A N + + + S  RTSSG FL+K Q+++V  IE RIA +TF+P ENGE + ILHYE GQK
Sbjct: 115 AQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQK 174

Query: 63  YEPHFDFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW----SEC 117
           +EPH D+   D  + +  G R AT++MYLS V++GG TVFP ++   S    W     E 
Sbjct: 175 FEPHHDYTHPDSFSFKSLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEY 234

Query: 118 AR-RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
            +  G +VKP  GDALLF+S+ PD + D TSLH S PV++G+KW   K +HV+  D
Sbjct: 235 GKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHVKAKD 290



 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 47/74 (63%), Gaps = 3/74 (4%)

Query: 90  LSHVEKGGETVFPNSEVSQSRDGNWSEC---ARRGYAVKPMKGDALLFFSLHPDASTDST 146
           + ++E+GGETVFP +    S    W +     + G ++KP  GDAL F+S+ PD + D T
Sbjct: 9   ILNIEEGGETVFPAANQCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYT 68

Query: 147 SLHGSCPVIEGEKW 160
           SLHGS PVI G++W
Sbjct: 69  SLHGSYPVIRGDEW 82


>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 297

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 81/154 (52%), Positives = 106/154 (68%), Gaps = 5/154 (3%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+F+S ++DE  I+ +IE +IA  T +P  +GEA  IL YE GQKY  H+D F +
Sbjct: 140 IRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSHYDAFDE 199

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                L   R+A+ L+YL+ V +GGET+FP  E   +RDGN  +C   G  V+P KGDAL
Sbjct: 200 AEYGPLQSQRVASFLLYLTDVPEGGETMFP-YENGFNRDGNVEDCI--GLRVRPRKGDAL 256

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           LF+SL P+ + D TS HGSCPVI+GEKW ATKWI
Sbjct: 257 LFYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWI 290


>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
           C-169]
          Length = 285

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 77/175 (44%), Positives = 111/175 (63%), Gaps = 1/175 (0%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++ K + +++R +   ++  + D+++  IE RIA +TFLP  +GE   I+ Y  GQ
Sbjct: 90  VLDAKTKKQVPNKLRNNKEAYIDGSADDVIDQIERRIARYTFLPAAHGEPFHIMQYLPGQ 149

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS-QSRDGNWSECARR 120
            Y PH D+  D  + +LG  RIAT+++YLS V +GGETVFPNS +     D  +S+CA++
Sbjct: 150 GYAPHTDWLDDWWHPRLGNERIATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQ 209

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           G AVKP+KGDALL ++L  +   D  SLH  CPVI G KW+ATK I V     P+
Sbjct: 210 GIAVKPVKGDALLLYNLLENGRNDGESLHQGCPVIRGVKWTATKRILVNQLPSPD 264


>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 81/168 (48%), Positives = 113/168 (67%), Gaps = 5/168 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK I S+VRTSSGMFL+  + +  +V +IE RI+ ++ +P ENGE MQ+L YE 
Sbjct: 118 VVDTKTGKGIKSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEK 177

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y+P  D+F D  N + GG  IAT+LMYLS   +GGET FP   ++ S + +      
Sbjct: 178 NQYYKPRHDYFFDTFNLKRGGQGIATMLMYLSDNIEGGETYFP---LAGSGECSCGGKLV 234

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           +G +VKP+KG+A+LF+S+  D  +D  S+HG C VI GEKWSATKW+ 
Sbjct: 235 KGLSVKPIKGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWLR 282


>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 80/168 (47%), Positives = 109/168 (64%), Gaps = 7/168 (4%)

Query: 9   KSIASEVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           ++   E+RTSSG FL  ++D+   +A +E ++A  T +P +NGEA  +L Y  GQKY+ H
Sbjct: 120 EATTKEIRTSSGTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCH 179

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVK 125
           +D F           R+A+ L+YLS VE+GGET+FP         G N+ +C   G  VK
Sbjct: 180 YDVFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCI--GLKVK 237

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           P +GDALLF+S+HP+ + D T+LHGSCPVI+GEKW ATKWI  RN DK
Sbjct: 238 PRQGDALLFYSMHPNGTFDKTALHGSCPVIKGEKWVATKWI--RNTDK 283


>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
 gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
          Length = 263

 Score =  156 bits (394), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 81/172 (47%), Positives = 114/172 (66%), Gaps = 15/172 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S+VRTSSGMF++  + +  ++ +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 94  VVDVATGKGVKSDVRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEA 153

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA- 118
            Q Y PH D+F D  N + GG R+AT+LMYL+   +GGET F      Q+ DG   EC+ 
Sbjct: 154 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHF-----LQAGDG---ECSC 205

Query: 119 ----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
                +G  VKP KGDA+LF+S+  D +TD  S+H  CPV++GEKWSATKW+
Sbjct: 206 GGNVVKGLCVKPNKGDAVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWM 257


>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
          Length = 297

 Score =  156 bits (394), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 84/170 (49%), Positives = 110/170 (64%), Gaps = 10/170 (5%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+FLS ++D+   + +IE +IA  T +P  +GEA  IL YE GQ+Y  H+D F  
Sbjct: 136 IRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFNP 195

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DG +    R G  VKP +GD L
Sbjct: 196 DEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYGYEDRVGLRVKPRQGDGL 254

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDD 182
           LF+SL P+ + D TSLHGSCPVI+GEKW ATKWI  RN D+     EDDD
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWI--RNLDQ-----EDDD 297


>gi|7269410|emb|CAB81370.1| hypothetical protein [Arabidopsis thaliana]
          Length = 315

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/216 (39%), Positives = 121/216 (56%), Gaps = 29/216 (13%)

Query: 28  DEIVASIEARIAAWTFLP--------------------PENGEAMQILHYEHGQKYEPHF 67
           D +VA IE +++AWTFLP                     ENG ++++  Y   +K     
Sbjct: 109 DPVVAGIEEKVSAWTFLPGGLFSCGQTAGLCFSLDAHFSENGGSIKVRSYTS-EKSGKKL 167

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
           D+F ++ +  L    +ATV++YLS+  +GGE +FPNSEV        + C   G  ++P+
Sbjct: 168 DYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEVKPK-----NSCLEGGNILRPV 222

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDED 187
           KG+A+LFF+   +AS D  S H  CPV++GE   ATK I+ +   K  +  E  +C DED
Sbjct: 223 KGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAK---KQARIEESGECSDED 279

Query: 188 LNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            NC  WAK GECKKNP+YM+GS    G CRKSC  C
Sbjct: 280 ENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 315


>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 165

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 77/163 (47%), Positives = 106/163 (65%), Gaps = 3/163 (1%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           + + + S VRTSSGMFLS  + +   +IE RI+ ++ +P ENGE +Q+L YE  Q Y PH
Sbjct: 3   TNQGMKSNVRTSSGMFLSSEERKSPMAIEKRISVYSQVPIENGELVQVLRYEKSQFYRPH 62

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
            D+F D  N + GG R+AT+LMYLS   +GGET FP +   +   G       +G +VKP
Sbjct: 63  HDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGECSCGG---KIVKGLSVKP 119

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           +KGDA+LF+S+  D  +D  S+HG C V+ GEKWSATKW+  R
Sbjct: 120 IKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162


>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 290

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 81/158 (51%), Positives = 106/158 (67%), Gaps = 9/158 (5%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSGMFLS ++D+  ++ +IE +IA  T LP  NGEA  IL YE GQKY  H+D F  
Sbjct: 131 IRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSHYDAFNP 190

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFP---NSEVSQSRDGNWSECARRGYAVKPMKG 129
                    R+A+ L+YLS VE+GGET+FP   + +V +S D  + +C   G  V+P +G
Sbjct: 191 AEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYD--FEKCI--GLQVRPRRG 246

Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           D LLF+SL P+ + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 247 DGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIR 284


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 81/179 (45%), Positives = 105/179 (58%), Gaps = 12/179 (6%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D++SG++   + R+S G ++S   DE++ +IE R + W  LP   GE MQ+L YE GQ
Sbjct: 105 VTDDDSGEARPDDARSSIGGWVSGDDDEVIRNIELRASTWAMLPMNRGETMQVLRYEKGQ 164

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--------GN 113
           KY+ H DFF D+ N + GG R+AT+LMYLS VE+GGETVFP       RD         N
Sbjct: 165 KYDAHDDFFHDEHNVKNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDN 224

Query: 114 WSECAR----RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
             E A     R  AVKP +GDALLFF+ H     D  + H  CPV  G KW+ T+W  V
Sbjct: 225 ACELASQNDPRVLAVKPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHRV 283


>gi|2980790|emb|CAA18166.1| hypothetical protein [Arabidopsis thaliana]
          Length = 316

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 85/216 (39%), Positives = 121/216 (56%), Gaps = 29/216 (13%)

Query: 28  DEIVASIEARIAAWTFLP--------------------PENGEAMQILHYEHGQKYEPHF 67
           D +VA IE +++AWTFLP                     ENG ++++  Y   +K     
Sbjct: 110 DPVVAGIEEKVSAWTFLPGGLFSCGQTAGLCFSLDAHFSENGGSIKVRSYTS-EKSGKKL 168

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
           D+F ++ +  L    +ATV++YLS+  +GGE +FPNSE+        + C   G  ++P+
Sbjct: 169 DYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEMKPK-----NSCLEGGNILRPV 223

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDED 187
           KG+A+LFF+   +AS D  S H  CPV++GE   ATK I+ +   K  +  E  +C DED
Sbjct: 224 KGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAK---KQARIEESGECSDED 280

Query: 188 LNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
            NC  WAK GECKKNP+YM+GS    G CRKSC  C
Sbjct: 281 ENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 316


>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 254

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 84/216 (38%), Positives = 120/216 (55%), Gaps = 11/216 (5%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G+S    +RTS   FL++  +E+V  I   ++A T LP  + E MQ+L Y  G+
Sbjct: 40  VVDSVTGESKVDPIRTSKQTFLNR-DEEVVREIYDALSAVTMLPWTHNEDMQVLEYRVGE 98

Query: 62  KYEPHFDF-----FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE---VSQSRDGN 113
           KY+ H D         +   + GG R+ATVL+YL   E GGET FP+SE      +   +
Sbjct: 99  KYDAHEDVGAEDSLSGRELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTS 158

Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-- 171
           WS+CA    A+KP +GD L+F+S+ P+   D  +LH  CPV+ G KW+AT W+H   +  
Sbjct: 159 WSKCAEHRVAMKPRRGDGLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWVHAEPYRW 218

Query: 172 DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
            KP +      C D    C  WA  GEC KNP +M+
Sbjct: 219 QKPPEASATPGCEDAHDQCRGWANTGECDKNPGFML 254


>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
          Length = 311

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 85/219 (38%), Positives = 129/219 (58%), Gaps = 7/219 (3%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           SG ++++E+  SSG+ L+   D+IVA IE R+A WT LP ++    QI+ Y   +    +
Sbjct: 98  SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKY 156

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           F   R  M        +ATV++YLS    GGE +FP S+V   +   WS   ++   ++P
Sbjct: 157 FYGNRSAMLPS-SEPLMATVVLYLSDSASGGEILFPESKV---KSKFWSGRRKKNNFLRP 212

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKPEKEPEDDDCV 184
           +KG+A+LFFS+H +AS D +S H   P+ +GE W ATK++++     +K   + + D C 
Sbjct: 213 VKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCF 272

Query: 185 DEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           DED +C  WA  GEC++N ++MVGS    G CRKSC  C
Sbjct: 273 DEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 311


>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 297

 Score =  152 bits (385), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 83/170 (48%), Positives = 109/170 (64%), Gaps = 10/170 (5%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+FLS ++D+   + +IE +IA  T +P  +GEA  IL YE GQ+Y  H+D F  
Sbjct: 136 IRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFNP 195

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DG +      G  VKP +GD L
Sbjct: 196 DEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYGYEDCVGLRVKPRQGDGL 254

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDD 182
           LF+SL P+ + D TSLHGSCPVI+GEKW ATKWI  RN D+     EDDD
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWI--RNLDQ-----EDDD 297


>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 317

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 92/231 (39%), Positives = 129/231 (55%), Gaps = 21/231 (9%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V  +ESG    S  RTS G F+++   E +  +E R+A ++ +P E+ E +Q+L Y  G
Sbjct: 73  VVNSDESGA--VSTARTSFGTFVTRRLTETLQRVEDRVAKYSGIPWEHQEQLQLLRYRDG 130

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS----EVSQSRDGN--- 113
           Q+Y  H     D +  + GG RIATVLM+L     GGET FP      E   +   N   
Sbjct: 131 QEYVAH----HDGIISENGGKRIATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDK 186

Query: 114 WSECA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
            SEC      G++V P KG+A+LFFS H + + D  + H SCP + G K++ATKWIH   
Sbjct: 187 LSECGWNDGNGFSVIPKKGEAVLFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHENP 246

Query: 171 FDK-PEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
           F+    K P    C DE   C VWA+  EC++NP++M+G +S  G C KSC
Sbjct: 247 FETGTAKTP---TCTDETELCPVWAQGHECERNPVFMMGEESV-GACSKSC 293


>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
 gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 83/178 (46%), Positives = 106/178 (59%), Gaps = 20/178 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VAD  +G +        SG FL +  D IV  IE RI+A+  +P ++GE M+IL Y  G+
Sbjct: 42  VADARTGGTFPG-----SGAFLLRNHDPIVTRIEERISAFAMIPADHGEGMRILRYGRGE 96

Query: 62  KYEPHFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV------------SQ 108
           KY+PH D+F D   N +  G R+ATVLMYLS VE GGETVFP                S 
Sbjct: 97  KYDPHHDYFDDGDKNLRFYGQRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSS 156

Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           S+D   S+CA+    VKP +GDALLF + H +   D TSLH  CPV+ GEKW+ATKW+
Sbjct: 157 SKDS--SKCAKGALHVKPRRGDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWM 212


>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 299

 Score =  151 bits (381), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 80/160 (50%), Positives = 106/160 (66%), Gaps = 3/160 (1%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTS G+F+S ++DE  I+ SIE +IA  T +P  +GEA  IL YE GQKY PH+D F +
Sbjct: 140 IRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPHYDAFDE 199

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                L   R A+ L+YL+ V +GGET+FP  E   +RDG++      G  V+P KGD L
Sbjct: 200 AEFGPLQSQRAASFLLYLTDVPEGGETLFP-YENGFNRDGSYDFEDCIGLRVRPRKGDGL 258

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           LF+SL P+ + D TS+HGSCPVI+GEKW ATKWI  +  D
Sbjct: 259 LFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQVLD 298


>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
           [Cucumis sativus]
          Length = 311

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 84/219 (38%), Positives = 128/219 (58%), Gaps = 7/219 (3%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           SG ++++E+  SSG+ L+   D+IVA IE R+A WT LP ++    QI+ Y   +    +
Sbjct: 98  SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKY 156

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           F   R  M        +ATV++YLS    GGE +FP S+V   +   WS   ++   ++P
Sbjct: 157 FYGNRSAMLPS-SEPLMATVVLYLSDSASGGEILFPESKV---KSKFWSGRRKKNNFLRP 212

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKPEKEPEDDDCV 184
           +KG+A+L FS+H +AS D +S H   P+ +GE W ATK++++     +K   + + D C 
Sbjct: 213 VKGNAILXFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCF 272

Query: 185 DEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
           DED +C  WA  GEC++N ++MVGS    G CRKSC  C
Sbjct: 273 DEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 311


>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 239

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 71/106 (66%), Positives = 84/106 (79%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VADN SGKS  SEVRTSSG FL K QD IV  IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88  VADNMSGKSTLSEVRTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 147

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS 107
           KYEPH+D+F D +N   GGHR ATVL+YL+ V +GGETVFP +EV+
Sbjct: 148 KYEPHYDYFTDNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEVN 193


>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
           sativus]
          Length = 164

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 78/166 (46%), Positives = 107/166 (64%), Gaps = 5/166 (3%)

Query: 11  IASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           + S+ RTSSGMFLS  +    +V +IE RI+ ++ +P ENGE +Q+L YE  Q Y+PH D
Sbjct: 2   VKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHD 61

Query: 69  FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
           +F D  N + GG RIAT+LMYLS   +GGET FP +   +   G  +     G +VKP K
Sbjct: 62  YFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKT---VPGLSVKPAK 118

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
           GDA+LF+S+  D  +D  S+HG C V+ GEKWSATKW+  ++   P
Sbjct: 119 GDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP 164


>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
          Length = 286

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 80/161 (49%), Positives = 104/161 (64%), Gaps = 5/161 (3%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG F+S ++D+  I+  IE +IA  T +P  +GEA  +L YE GQ+Y+ H+D F  
Sbjct: 127 IRTSSGTFISASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDP 186

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R A+ L+YLS VE+GGETVFP  E  Q+ D ++      G  VKP +GD L
Sbjct: 187 AQYGPQKSQRAASFLLYLSDVEEGGETVFPY-ENGQNMDASYDFSKCIGLKVKPRRGDGL 245

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           LF+SL P+ + D TSLHGSCPVI GEKW ATKWI  RN D+
Sbjct: 246 LFYSLFPNGTIDLTSLHGSCPVIRGEKWVATKWI--RNQDQ 284


>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 294

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 101/155 (65%), Gaps = 3/155 (1%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           VRTSSG+F S ++DE   +  IE +IA  T +P  +GEA  IL YE GQKY  H+D F+ 
Sbjct: 132 VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 191

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DG ++     G  VKP +GD L
Sbjct: 192 SEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYNFQTCIGLKVKPRQGDGL 250

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+S+ P+ + D TSLHGSCPVI+G+KW ATKWI 
Sbjct: 251 LFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 285


>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 286

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 80/155 (51%), Positives = 101/155 (65%), Gaps = 5/155 (3%)

Query: 16  RTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           RTSSG FLS ++D    +  IE +IA  T +P  +GEA  IL YE GQKY+ H+D F   
Sbjct: 128 RTSSGTFLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPA 187

Query: 74  MNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                   R+A+ L+YLS VEKGGET+FP  + V  S   ++ +CA  G  VKP +GD +
Sbjct: 188 EYGPQMSQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCA--GLKVKPRQGDGI 245

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+SL P+ + D TSLHGSCPVIEGEKW ATKWI 
Sbjct: 246 LFYSLLPNGTIDQTSLHGSCPVIEGEKWVATKWIR 280


>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
          Length = 285

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 103/156 (66%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+F+S ++D+   +  IE +IA    +P  +GEA  +L YE GQ+Y  H+D F  
Sbjct: 126 IRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAFDP 185

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDA 131
                   HRIAT L+YLS VE+GGET+FP  + ++  +D ++  C   G  VKP +GD 
Sbjct: 186 AEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCI--GLKVKPHQGDG 243

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+S+ P+ + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 244 LLFYSMFPNGTIDPTSLHGSCPVIKGEKWVATKWIR 279


>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/159 (48%), Positives = 103/159 (64%), Gaps = 5/159 (3%)

Query: 14  EVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           + RTSSG F+S ++D+  I+  +E +IA  T +P  +GE   IL YE GQKY+ H+D F 
Sbjct: 126 DTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSHYDAFN 185

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGD 130
                 +   RIA+ L+YLS+VE GGET+FP    ++  R  ++ +C   G  VKP +GD
Sbjct: 186 PDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCI--GLKVKPRQGD 243

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            LLF+SL P+   D TSLHGSCPVI+GEKW ATKWI  R
Sbjct: 244 GLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDR 282


>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 77/163 (47%), Positives = 104/163 (63%), Gaps = 7/163 (4%)

Query: 14  EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           ++RTSSG FL   +D    +  +E ++A  T +P ENGEA  +L Y  GQKY+ H+D F 
Sbjct: 140 DIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYDVFD 199

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
                     R+A+ L+YLS VE+GGET+FP         G ++ +C   G  VKP +GD
Sbjct: 200 PAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCI--GMKVKPRQGD 257

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           ALLF+S+HP+ + D ++LHGSCPVI+GEKW ATKWI  RN DK
Sbjct: 258 ALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWI--RNTDK 298


>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 363

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 87/218 (39%), Positives = 125/218 (57%), Gaps = 30/218 (13%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
            RTS G F+++     ++++E R+A ++ +P  + E +Q+L YE GQ+Y           
Sbjct: 141 ARTSFGTFITRRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGN--------- 191

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPN--------SEVSQSR----DGNWSECARRGY 122
               G  RIATVLM+L   E GGET FP+        SE   SR    D  W+E   RG+
Sbjct: 192 ----GEKRIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEG--RGF 245

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDD 182
           +V P KGDA+LFFS H + ++D  + H SCP + G K++ATKWIH + FD    E     
Sbjct: 246 SVIPRKGDAILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKEFDTTTFE--TPM 303

Query: 183 CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
           C D++  C  WA +GEC+KNP++M+G ++  G C KSC
Sbjct: 304 CEDKEDMCDQWANSGECEKNPVFMMGIETV-GSCSKSC 340


>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
 gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
          Length = 294

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 78/157 (49%), Positives = 101/157 (64%), Gaps = 5/157 (3%)

Query: 14  EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           + RTSSG F+S ++DE   +  IE +IA  T +P  +GEA  IL YE GQKY+ H+D F 
Sbjct: 132 DTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFN 191

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
                Q    R A+ L+YLS+VE+GGET+FP    S    G ++ +C   G  VKP +GD
Sbjct: 192 PDEYGQQSSQRTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCV--GLKVKPRQGD 249

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            LLF+SL P+ + D TSLHGSCPVI+G KW ATKWI 
Sbjct: 250 GLLFYSLFPNGTIDPTSLHGSCPVIKGVKWVATKWIR 286


>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 288

 Score =  146 bits (368), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 79/164 (48%), Positives = 103/164 (62%), Gaps = 7/164 (4%)

Query: 15  VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            RTSSG F+S ++D   A   +E +IA  T +P  +GE+  IL YE GQKY+ H+D F  
Sbjct: 129 TRTSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    RIA+ L+YLS VE+GGET+FP    S    G ++ +C   G  VKP KGD 
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCI--GLKVKPRKGDG 246

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
           LLF+S+ P+ + D TSLHGSCPV +GEKW ATKWI  R+ D+ E
Sbjct: 247 LLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWI--RDQDQEE 288


>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1-like [Cucumis sativus]
          Length = 294

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 100/155 (64%), Gaps = 3/155 (1%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           VRTSSG+F S ++DE   +  IE + A  T +P  +GEA  IL YE GQKY  H+D F+ 
Sbjct: 132 VRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 191

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DG ++     G  VKP +GD L
Sbjct: 192 SEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYNFQTCIGLKVKPRQGDGL 250

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+S+ P+ + D TSLHGSCPVI+G+KW ATKWI 
Sbjct: 251 LFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 285


>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 245

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/149 (51%), Positives = 98/149 (65%), Gaps = 19/149 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V +  +G    S  RTSSG FL K  D+IV  IE RI+ +TF+P ENGEA+Q++HYE GQ
Sbjct: 100 VRNAITGLGEESSSRTSSGTFLRKGHDKIVKEIEKRISEFTFIPEENGEALQVIHYEVGQ 159

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           K+EPHFD          G  RIATVLMYLS V+KGGETVFP ++  +S         ++G
Sbjct: 160 KFEPHFD----------GFQRIATVLMYLSDVDKGGETVFPEAKGIKS---------KKG 200

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHG 150
            +V+P KGDALLF+S+ PD S D +S HG
Sbjct: 201 VSVRPKKGDALLFWSMRPDGSQDPSSKHG 229


>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
 gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
          Length = 231

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 14  EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           +VRTS G FLS  QD+   +A +E ++A  T +P  +GEA  +L YE GQKY  H+D F 
Sbjct: 71  DVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFN 130

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
                     R+A+ L+YLS VE+GGET+FP        +  ++ EC   G  VKP +GD
Sbjct: 131 PAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECI--GLKVKPKQGD 188

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           ALLF+S+ P+ + D T+LHGSCPVI+GEKW ATKWI
Sbjct: 189 ALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWI 224


>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
 gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
          Length = 292

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 14  EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           +VRTS G FLS  QD+   +A +E ++A  T +P  +GEA  +L YE GQKY  H+D F 
Sbjct: 132 DVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFN 191

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
                     R+A+ L+YLS VE+GGET+FP        +  ++ EC   G  VKP +GD
Sbjct: 192 PAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECI--GLKVKPKQGD 249

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           ALLF+S+ P+ + D T+LHGSCPVI+GEKW ATKWI
Sbjct: 250 ALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWI 285


>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
          Length = 293

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 75/160 (46%), Positives = 101/160 (63%), Gaps = 3/160 (1%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+F+S ++D+   +  IE +IA  T +P  +GEA  IL YE  Q+Y  H+D F  
Sbjct: 134 IRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNP 193

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DGN+      G  VKP +GD L
Sbjct: 194 AEYGPQKSQRMASFLLYLTDVEEGGETMFP-FENGLNMDGNYGYEGCIGLKVKPRQGDGL 252

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           LF+SL  + + D TSLHGSCPVI+GEKW ATKWI  +  D
Sbjct: 253 LFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELD 292


>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 98/156 (62%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            RTSSG F+S +++   A   +E +IA  T +P  +GE+  IL YE GQKY+ H+D F  
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    RIA+ L+YLS VE+GGET+FP    S    G ++ +C   G  VKP KGD 
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCI--GLKVKPRKGDG 246

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+S+ P+ + D TSLHGSCPV +GEKW ATKWI 
Sbjct: 247 LLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282


>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
 gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
 gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
 gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 288

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 75/156 (48%), Positives = 98/156 (62%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            RTSSG F+S +++   A   +E +IA  T +P  +GE+  IL YE GQKY+ H+D F  
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    RIA+ L+YLS VE+GGET+FP    S    G ++ +C   G  VKP KGD 
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCI--GLKVKPRKGDG 246

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+S+ P+ + D TSLHGSCPV +GEKW ATKWI 
Sbjct: 247 LLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282


>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 290

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/158 (47%), Positives = 97/158 (61%), Gaps = 2/158 (1%)

Query: 14  EVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           + RTSSG F+S ++D+  I+  +E +IA  T +P  +GE   IL YE  QKY+ H+D F 
Sbjct: 125 DTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAFN 184

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
                 +   RIA+ L+YLS+VE GGET+FP         G +      G  VKP +GD 
Sbjct: 185 PDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKPRQGDG 244

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           LLF+SL P+   D TSLHGSCPVI+GEKW ATKWI  R
Sbjct: 245 LLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDR 282


>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
 gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
 gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
          Length = 294

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 97/155 (62%), Gaps = 3/155 (1%)

Query: 15  VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D  E +A IE +IA  T LP  +GE   +L Y  GQ+Y  H+D F  
Sbjct: 137 IRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDP 196

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E S++ D  +      G  VKP KGD L
Sbjct: 197 AQYGPQKNQRVASFLLYLTDVEEGGETMFP-YENSENMDIGYDYEKCIGLKVKPRKGDGL 255

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 256 LFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290


>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
 gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
          Length = 294

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 76/155 (49%), Positives = 97/155 (62%), Gaps = 3/155 (1%)

Query: 15  VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D  E +A IE +IA  T LP  +GE   +L Y  GQ+Y  H+D F  
Sbjct: 137 IRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDP 196

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E S++ D  +      G  VKP KGD L
Sbjct: 197 AQYGPQKNQRVASFLLYLTDVEEGGETMFP-YENSENMDIGYDYEKCIGLKVKPRKGDGL 255

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 256 LFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290


>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
          Length = 293

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 75/160 (46%), Positives = 101/160 (63%), Gaps = 3/160 (1%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+F+S ++D+   +  IE +IA  T +P  +GEA  IL YE  Q+Y  H+D F  
Sbjct: 134 IRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNP 193

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DGN+      G  VKP +GD L
Sbjct: 194 AEYGPQKSQRMASFLLYLTDVEEGGETMFP-FENGLNMDGNYGYEDCIGLKVKPRQGDGL 252

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           LF+SL  + + D TSLHGSCPVI+GEKW ATKWI  +  D
Sbjct: 253 LFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELD 292


>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
          Length = 284

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            RTSSG F+S ++D+  I+  +E +IA  T +P  +GEA  IL YE GQ+Y  H+D F  
Sbjct: 125 TRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNP 184

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDA 131
                    R+A+ L+YLS VE+GGET+FP   +++     ++ +C   G  VKP +GD 
Sbjct: 185 AEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCI--GLKVKPQRGDG 242

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+S+ P+ + D TSLHGSCPVI GEKW ATKWI 
Sbjct: 243 LLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIR 278


>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
 gi|255641811|gb|ACU21174.1| unknown [Glycine max]
          Length = 293

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 99/155 (63%), Gaps = 3/155 (1%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+F+S + D+   +A IE +IA  T +P  +GEA  IL YE  Q+Y  H+D F  
Sbjct: 134 IRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSHYDAFNP 193

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    R+A+ L+YL+ VE+GGET+FP  E   + DGN+      G  VKP +GD L
Sbjct: 194 AEYGPQKSQRMASFLLYLTDVEEGGETMFP-FENGLNMDGNYGYEDCIGLKVKPRQGDGL 252

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 253 LFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIR 287


>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 347

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 95/156 (60%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D    +A IE +IA  T +P  +GE   +L YE GQKY  H+D F  
Sbjct: 190 IRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAFDP 249

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    R+A+ L+YL+ VE+GGET+FP         G ++ +C   G  VKP KGD 
Sbjct: 250 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCI--GLKVKPRKGDG 307

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+SL  + + D TSLHGSCPV+ GEKW ATKWI 
Sbjct: 308 LLFYSLMVNGTIDPTSLHGSCPVVRGEKWVATKWIR 343


>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            RTSSG F+S ++D+  I+  +E +IA  T +P  +GEA  IL YE GQ+Y  H+D F  
Sbjct: 117 TRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNP 176

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDA 131
                    R+A+ L+YLS VE+GGET+FP   +++     ++ +C   G  VKP +GD 
Sbjct: 177 AEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCI--GLKVKPQRGDG 234

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+S+ P+ + D TSLHGSCPVI GEKW ATKWI 
Sbjct: 235 LLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIR 270


>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
 gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
          Length = 231

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 93/163 (57%), Gaps = 4/163 (2%)

Query: 12  ASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           + + RTS+G FL+ A D   ++  +E RIAA T LP ENGEA  +LHYE  Q Y+ H+D 
Sbjct: 65  SQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSHYDT 124

Query: 70  FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECARRGYAVKPM 127
           F  K        RIATVL+YLS V +GGETVF    V       G+W  C    +   P 
Sbjct: 125 FDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKREGVDGENRVIGDWRNCDDGSFKYMPR 184

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
            GDA+LF+   P+   D  +LHG CPV  GEKW ATKWI  R 
Sbjct: 185 MGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWIRSRG 227


>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 295

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 96/156 (61%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D    +A +E +IA  T +P  +GE   +L YE GQKY  H+D F  
Sbjct: 138 IRTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDP 197

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    R+A+ L+YL+ VE+GGET+FP         G ++ +C   G  VKP KGD 
Sbjct: 198 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCI--GLKVKPRKGDG 255

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 256 LLFYSLMVNGTIDLTSLHGSCPVIKGEKWVATKWIR 291


>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
          Length = 282

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 96/155 (61%), Gaps = 4/155 (2%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG F+S ++D+  I+  IE +IA  T +P  +GE   IL YE GQ+Y  H+D    
Sbjct: 124 IRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHYDAISP 183

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                    RIA+ L+YLS VE+GGET+FP          N  +C   G  VKP +GD L
Sbjct: 184 AEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNINTFNSRKCI--GLKVKPRRGDGL 241

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LF+S+ P+ + D TS+HGSCPVIEGEKW ATKWI 
Sbjct: 242 LFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWIR 276


>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 243

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 68/125 (54%), Positives = 91/125 (72%), Gaps = 3/125 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GKS  S VRTSSG FL++ +D+ +  IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 172

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
           KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S    W   SEC 
Sbjct: 173 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 232

Query: 119 RRGYA 123
           + G+ 
Sbjct: 233 KGGWV 237


>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
          Length = 151

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/148 (48%), Positives = 97/148 (65%), Gaps = 5/148 (3%)

Query: 21  MFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQL 78
           MFL+  + +  +V +IE RI+ ++ +P ENGE MQ+L YE  Q Y+PH D+F D  N + 
Sbjct: 1   MFLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKR 60

Query: 79  GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
           GG RIAT+LMYLS   +GGET FPN    Q   G  +     G +VKP KG+A+LF+S+ 
Sbjct: 61  GGQRIATMLMYLSDNVEGGETYFPNIGSGQCSCGGKT---VEGLSVKPTKGNAVLFWSMG 117

Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWI 166
            D  +D  S+HG C V+ GEKWSATKW+
Sbjct: 118 LDGQSDPLSVHGGCEVLAGEKWSATKWM 145


>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
 gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
 gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
          Length = 310

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 96/156 (61%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D    +A +E +IA  T +P  +GE   IL YE GQ+Y  H+D F  
Sbjct: 151 IRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDP 210

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    R+A+ L+YL+ VE+GGET+FP         G ++ +C   G  VKP KGD 
Sbjct: 211 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCI--GLKVKPRKGDG 268

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 269 LLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIR 304


>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 294

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 72/156 (46%), Positives = 99/156 (63%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSGMF+S ++D+  ++  I+ +IA    +P  +G A  IL Y+ GQKY  H+D F  
Sbjct: 134 IRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAFNP 193

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    R+A+ L+YL+ V +GGET+FP    S      N+ +C   G  +KP+KGD 
Sbjct: 194 AEYGPQESQRVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFEDCI--GLKIKPLKGDG 251

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+SL P+ + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 252 LLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWIR 287


>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 231

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 75/163 (46%), Positives = 93/163 (57%), Gaps = 4/163 (2%)

Query: 12  ASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           + + RTS+G FLS   D   ++  +E RIAA T LP +NGEA  +LHYEH Q Y+ H D 
Sbjct: 65  SQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSHMDS 124

Query: 70  FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECARRGYAVKPM 127
           F  K        RIATVL+YLS V +GGETVF    V  +     +W  C    +   P 
Sbjct: 125 FDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKYAPR 184

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
            GDA+LF+   P+   D  SLHG CPV +GEKW ATKWI  R 
Sbjct: 185 MGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWIRSRG 227


>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
          Length = 280

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 74/156 (47%), Positives = 96/156 (61%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D    +A +E +IA  T +P  +GE   IL YE GQ+Y  H+D F  
Sbjct: 121 IRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDP 180

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    R+A+ L+YL+ VE+GGET+FP         G ++ +C   G  VKP KGD 
Sbjct: 181 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCI--GLKVKPRKGDG 238

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 239 LLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIR 274


>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
 gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
          Length = 294

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 96/156 (61%), Gaps = 5/156 (3%)

Query: 15  VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FLS  +D    +A IE +IA  T +P  +GE   +L Y  GQ+Y  H+D F  
Sbjct: 137 IRTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDP 196

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    R+A+ L+YL++VE+GGET+FP         G ++ +C   G  VKP KGD 
Sbjct: 197 VQYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCI--GLKVKPRKGDG 254

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           LLF+SL  + + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 255 LLFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290


>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
 gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
 gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 272

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 73/149 (48%), Positives = 96/149 (64%), Gaps = 19/149 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V +  +G    S  RTSSG F+    D+IV  IE RI+ +TF+P ENGE +Q+++YE GQ
Sbjct: 133 VRNALTGLGEESSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVINYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           K+EPHFD          G  RIATVLMYLS V+KGGETVFP ++  +S         ++G
Sbjct: 193 KFEPHFD----------GFQRIATVLMYLSDVDKGGETVFPEAKGIKS---------KKG 233

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHG 150
            +V+P KGDALLF+S+ PD S D +S HG
Sbjct: 234 VSVRPKKGDALLFWSMRPDGSRDPSSKHG 262


>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
          Length = 318

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 105/172 (61%), Gaps = 10/172 (5%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG+F+S  +D+  ++  IE +IA  T +P  +GEA  +L Y+ GQKY  H+D    
Sbjct: 143 IRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSHYDALHP 202

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
            +       R+A+ L+YLS V +GGET+FP  E   + DG++      G  VKP KGD L
Sbjct: 203 DIYGPQKSQRMASFLLYLSDVPEGGETMFP-FENGLNMDGSYYYEKCIGLKVKPRKGDGL 261

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCV 184
           LF+SL P+ + D  SLHGSCPVI+GEKW ATKWI  +  D       D+D V
Sbjct: 262 LFYSLFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQVLD-------DEDTV 306


>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 70/174 (40%), Positives = 100/174 (57%), Gaps = 8/174 (4%)

Query: 8   GKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           G      VRTS G F+S+  D   ++A +E + A  T LP  +GE   +L Y+ GQ Y+ 
Sbjct: 153 GPQETENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDS 212

Query: 66  HFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP-----NSEVSQSRDGNWSECARR 120
           H+D F  +        R+AT+L YL+ VE+GGET+FP       ++ +    N+  C   
Sbjct: 213 HYDIFEPESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTT- 271

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
           G+  KP  GDAL+F+S+HP+ + D  +LHG CPV+ GEKW ATKWI  + F  P
Sbjct: 272 GFKYKPRMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDKCFTPP 325


>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
 gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
          Length = 297

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 73/156 (46%), Positives = 99/156 (63%), Gaps = 8/156 (5%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSGMF+  ++D+  ++  IE +IA  T +P  +GEA  +L YE GQKY+ H+D F  
Sbjct: 140 IRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAHYDAFNP 199

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECARRGYAVKPMKGD 130
                    R+AT L+YLS+ E+GGET FP  N E  +  D    +C   G  VKP +GD
Sbjct: 200 AEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFEGYDAQ--KC--NGLRVKPHQGD 255

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           A+LF+S+ P+ + D  SLH SC VI+GEKW ATKWI
Sbjct: 256 AILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWI 291


>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
 gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
          Length = 175

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/154 (46%), Positives = 95/154 (61%), Gaps = 5/154 (3%)

Query: 17  TSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           T+   F+  ++D+   +  IE +IA  T +P  +GEA  IL YE GQKY+ H+D F    
Sbjct: 18  TTESTFIGGSEDKTGTLDFIERKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDE 77

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDALL 133
                  R+A+ L+YLS VE+GGET+FP    S    G  + +C   G  VKP +GD LL
Sbjct: 78  YGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAVSSGFEYKQCV--GLKVKPRQGDGLL 135

Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           F+SL P+ + D TSLHGSCPVI+GEKW ATKWI 
Sbjct: 136 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 169


>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
          Length = 341

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 70/158 (44%), Positives = 100/158 (63%), Gaps = 6/158 (3%)

Query: 15  VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +RTSSG FL+   ++   +  +E ++A  T +P  +GEA  IL YE GQKY+ H+D F  
Sbjct: 179 IRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDMFDP 238

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFP---NSEVSQSRDGNWSECARRGYAVKPMKG 129
                    R+A+ L+YL+  ++GGETVFP    + + + R  +++ C   G  VKP KG
Sbjct: 239 SQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSC-EAGLKVKPRKG 297

Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           DALLF+S+HP+ + D +SLHG CPVI G K+ ATKWIH
Sbjct: 298 DALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIH 335


>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
 gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
          Length = 299

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 68/176 (38%), Positives = 99/176 (56%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  DN+SG    ++ RTS+GMF  + ++E+++ +E RIA     P ENGE MQ+LHY  G
Sbjct: 141 LTVDNQSGGEAVNDDRTSNGMFFQRGENELISLVEQRIARLLNWPLENGEGMQVLHYRPG 200

Query: 61  QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           + GG R+ T++MYL+   +GG T FP+            
Sbjct: 201 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 249

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P +G+A+ F    PD +T   +LHG  PV+EGEKW ATKW+  R F
Sbjct: 250 -----GLQVVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIATKWLREREF 298


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 72/174 (41%), Positives = 101/174 (58%), Gaps = 23/174 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN++G S  +E RTS GMF  + + E+++ IEARIAA    P ENGE +Q+LHY  G +Y
Sbjct: 132 DNDTGGSEVNEARTSQGMFFMRGEGELISRIEARIAALLDWPLENGEGVQVLHYRPGAEY 191

Query: 64  EPHFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           +PH+D+F           + GG R+ T++MYL+  E+GG T FP+  +            
Sbjct: 192 KPHYDYFDPAQPGTPTILKRGGQRVGTLVMYLNTPERGGGTTFPDVNLE----------- 240

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
                V P+KG+A +FFS +  A   + SLHG  PV+ GEKW ATKW+    FD
Sbjct: 241 -----VAPIKGNA-VFFS-YERAHPSTRSLHGGAPVLAGEKWVATKWLRQARFD 287


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/175 (41%), Positives = 100/175 (57%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G     A+  ++A IEARIAA T +P E+GE +QIL+Y+ G 
Sbjct: 135 VVNPDTGDENLIDARTSMGAMFQVAEHALIARIEARIAAVTGVPAEHGEGLQILNYKPGG 194

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG RIAT+++YL+  E GG T FP              
Sbjct: 195 EYQPHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-------------- 240

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             R G  V P+KG+A+ F  L PD + D  +LH   PV  GEKW ATKW+  R +
Sbjct: 241 --RVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASGEKWIATKWLRERPY 293


>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
 gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
          Length = 299

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  DN+SG    ++ RTS+GMF  + +++++  +E RIA     P ENGE MQ+LHY  G
Sbjct: 141 LTVDNQSGGEAVNDDRTSNGMFFQRGENDLICRVEQRIARLLNWPLENGEGMQVLHYRPG 200

Query: 61  QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           + GG R+ T++MYL+   +GG T FP+            
Sbjct: 201 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 249

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P +G+A+ F    PD +T   +LHG  PV+EGEKW ATKW+  R F
Sbjct: 250 -----GLQVVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIATKWLREREF 298


>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
 gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
          Length = 306

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 99/176 (56%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  DN+SG    ++ RTS+GMF  + ++++++ +E RIA     P ENGE MQ+LHY  G
Sbjct: 148 LTVDNQSGGEAVNDDRTSNGMFFQRGENDLISLVEQRIARLLNWPLENGEGMQVLHYRPG 207

Query: 61  QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           + GG R+ T++MYL+   +GG T FP+            
Sbjct: 208 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 256

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  + P +G+A+ F    PD +T   +LHG  PV+EGEKW ATKW+  R F
Sbjct: 257 -----GLQIVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIATKWLREREF 305


>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 274

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 65/143 (45%), Positives = 92/143 (64%), Gaps = 1/143 (0%)

Query: 30  IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMY 89
           ++A+IE +IA  T  P +  E+  IL Y+ GQKY+ H+D F       L   R+ T L++
Sbjct: 133 VLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQRVVTFLLF 192

Query: 90  LSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLH 149
           LS VE+GGET+FP  E  ++ +G +      G  VKP +GDA+ F++L P+ + D TSLH
Sbjct: 193 LSSVEEGGETMFP-FENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQTSLH 251

Query: 150 GSCPVIEGEKWSATKWIHVRNFD 172
           GSCPVI+GEKW ATKWI  + +D
Sbjct: 252 GSCPVIKGEKWVATKWIRDQTYD 274


>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 245

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 68/151 (45%), Positives = 92/151 (60%), Gaps = 6/151 (3%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+ +  E+RTS GMF+ +  D ++  IE RI+ WT LP E+ E +Q+L Y HGQ Y  H+
Sbjct: 96  GEGVVDEIRTSYGMFIRRLADPVITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHY 155

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSECARRGY 122
           D   DK N+     R+AT LMYLS VE+GGET FP + V        R G  SECA+   
Sbjct: 156 D-SGDKSNEPGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGHV 214

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCP 153
           A KP  GDA+LF+S +P+ + D  ++H  CP
Sbjct: 215 AAKPKAGDAVLFYSFYPNLTMDPAAMHTGCP 245


>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
 gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
          Length = 299

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 99/176 (56%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  DN+SG    ++ RTS+GMF  + ++++++ +E RIA     P ENGE MQ+LHY  G
Sbjct: 141 LTVDNQSGGEAVNDDRTSNGMFFQRGENDLISRVEQRIARLLNWPLENGEGMQVLHYRPG 200

Query: 61  QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           + GG R+ T++MYL+   +GG T FP+            
Sbjct: 201 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 249

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P +G+A+ F    P+ +T   +LHG  PV+EGEKW ATKW+  R F
Sbjct: 250 -----GLQVVPRRGNAVFFSYNRPEPATK--TLHGGAPVLEGEKWIATKWLREREF 298


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 72/175 (41%), Positives = 100/175 (57%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G     A+  ++A IEARIAA T +P ++GE +QIL+Y+ G 
Sbjct: 134 VVNPDTGDENLIDARTSMGAMFQVAEHALIARIEARIAAVTGVPADHGEGLQILNYKPGG 193

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG RIAT+++YL+  E GG T FP              
Sbjct: 194 EYQPHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-------------- 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             R G  V P+KG+A+ F  L PD + D  +LH   PV  GEKW ATKW+  R +
Sbjct: 240 --RVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKWIATKWLRERPY 292


>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
          Length = 322

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 80/237 (33%), Positives = 119/237 (50%), Gaps = 38/237 (16%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           ++    + K + S  RT+   +L   QD++V  +E +IA  T   PE GE +Q+LHY   
Sbjct: 112 LITPYGTNKLVESTTRTNKQAWLDFQQDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKS 171

Query: 61  QKYEPHFDFFRDKM----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           Q++  H D+F        N + GG+R+ TV++YL   E+GGET F  + +          
Sbjct: 172 QQFTEHHDYFDPATDPPENYEKGGNRLITVIVYLQAAEEGGETHFGAANLK--------- 222

Query: 117 CARRGYAVKPMKGDALLFFSLH------PDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
                  +   KGDA++F++L            D  +LH   P I+GEKW ATKWIH R 
Sbjct: 223 -------LTAAKGDAVMFYNLKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHERG 275

Query: 171 FDKPEKEPEDDDCVDEDLNCVVWA--KAGECKKNPLYMVGSKSSRGYCRKSCKVCKP 225
           +    +      C D+   C  WA     ECK NP++M  SK+    CR+SCK+C+P
Sbjct: 276 Y----QSETSGGCFDKHPKCTYWAGKTPTECKLNPVWM--SKN----CRRSCKICQP 322


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/170 (39%), Positives = 96/170 (56%), Gaps = 21/170 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+++G +   E RTSSG F  +     +A I+ R+AA   +P  +GE +QIL+Y+ G 
Sbjct: 130 VVDHQTGNTKLHEHRTSSGTFFHRGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGG 189

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PH+D+FR        +   GG R AT+++YL+ V+ GGET+FP              
Sbjct: 190 EYRPHYDYFRPDAPGSAKHLARGGQRTATLIIYLNDVDGGGETIFP-------------- 235

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             R G ++ P KG A+ F   + +   DS S HG  PVIEGEKW ATKW+
Sbjct: 236 --RNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVIEGEKWIATKWV 283


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/177 (40%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ESG S  S VR S G    + ++E+V  IEAR++A   LP   GE +QILHY  G 
Sbjct: 151 VVDRESGGSYESSVRKSEGSHFERGENELVRRIEARLSALVDLPVNRGEPLQILHYGPGG 210

Query: 62  KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+ H DFF  K     +  ++GG RI TV+MYL+ V +GGET FP+             
Sbjct: 211 EYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFPDI------------ 258

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
               G++ KP+KG A+ F   + D   D   LH   PVI G+KW  TKW+  R +++
Sbjct: 259 ----GFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKWIMTKWLRERPYEQ 311


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score =  131 bits (329), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/175 (41%), Positives = 99/175 (56%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G     A+  ++  IEARIAA T +P E+GE +QIL+Y+ G 
Sbjct: 135 VVNPDTGDENLIDARTSMGAMFQVAEHPLITRIEARIAAVTGVPAEHGEGLQILNYKPGG 194

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG RIAT+++YL+  E GG T FP              
Sbjct: 195 EYQPHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-------------- 240

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             R G  V P+KG+A+ F  L PD + D  +LH   PV  GEKW ATKW+  R +
Sbjct: 241 --RVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFGEKWIATKWLRERPY 293


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  ++A IEARIA  T +P E+GE  Q+LHY+ G 
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  ++A IEARIA  T +P E+GE  Q+LHY+ G 
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  ++A IEARIA  T +P E+GE  Q+LHY+ G 
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY+ G 
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYQPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V  GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY  G 
Sbjct: 130 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 189

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 190 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 235

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 236 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 290


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY  G 
Sbjct: 127 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY  G 
Sbjct: 118 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 177

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 178 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 223

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 224 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 278


>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 258

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 112/227 (49%), Gaps = 51/227 (22%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D+++G+S   ++RTS G    + +D ++A++E RIA WT LPPE GE MQIL Y  G
Sbjct: 44  LVVDSKTGQSKLDDIRTSYGAAFGRGEDPVIAAVEERIAEWTHLPPEYGEPMQILRYVDG 103

Query: 61  QKYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           QKY+ H+D+F D ++       G+R ATVL+YLS VE GGET  P ++            
Sbjct: 104 QKYDAHWDWFDDPVHHAAYLHEGNRYATVLLYLSGVEGGGETNLPLAD------------ 151

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEK 176
                   P+  +A                        +G KW+ATKWIH + +  K + 
Sbjct: 152 --------PIDKEA------------------------QGMKWTATKWIHNKPYMGKYDP 179

Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
                 C D   NC   A AGEC  N   MVG     G CRKSC  C
Sbjct: 180 LRTAGRCADTGGNCAARAAAGECTSNMDKMVGPA---GECRKSCNDC 223


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 97/175 (55%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+++ +  R+S GMF    +  ++A +EARIA  T LP ENGE +Q+LHYE G 
Sbjct: 132 VVDPVTGRNVVAGHRSSDGMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEAGA 191

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +  PH D+       ++ +    G R+ T+LMYL+ VE GGET+FP +            
Sbjct: 192 ESTPHVDYLIAGNPANRESIARSGQRVGTLLMYLNDVEGGGETMFPQT------------ 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G++V P +G AL F   +     D +SLH S P+  GEKW ATKWI  R F
Sbjct: 240 ----GWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRAGEKWVATKWIRTRRF 290


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 71/175 (40%), Positives = 98/175 (56%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+++ +  R+S GMF    +  ++A +EARIA  T LP ENGE +Q+LHYE G 
Sbjct: 132 VVDPVTGRNVVAGHRSSDGMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEVGA 191

Query: 62  KYEPHFDFF--RDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +  PH D+    +  NQ+     G R+ T+LMYL+ VE GGET+FP +            
Sbjct: 192 ESTPHVDYLIAGNPANQESIARSGQRVGTLLMYLNDVEGGGETMFPQT------------ 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G++V P +G AL F   +     D +SLH S P+  GEKW ATKWI  R F
Sbjct: 240 ----GWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRVGEKWVATKWIRTRRF 290


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 71/175 (40%), Positives = 97/175 (55%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D +SGK I  E R S G F++ + D +V +I+ RIA     P ENGE + IL Y  G 
Sbjct: 124 VVDPDSGKEITIEERRSEGAFVNASTDALVETIDRRIAELFRQPVENGEDLHILRYGMGG 183

Query: 62  KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PH+D+F +     K + Q GG RIATV++YL+ VE+GG+T FP+             
Sbjct: 184 EYRPHYDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQGGDTTFPDI------------ 231

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G A+ P +G AL F  ++    +D  +LH   PV +GEKW ATKWI    F
Sbjct: 232 ----GLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKWIATKWIRRGRF 282


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY  G 
Sbjct: 126 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 185

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 186 EYQPHFDYFNPGRSGEARQLDVGGQRVATLVIYLNSVQAGGATGFP-------------- 231

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 232 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 286


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY  G 
Sbjct: 127 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 70/174 (40%), Positives = 94/174 (54%), Gaps = 23/174 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           D  +G S  +  RTS GMF ++ +  + A  EARIAA    P ENGE +Q+LHY  G +Y
Sbjct: 134 DTATGASEVNAARTSDGMFFTRGEHPVCARFEARIAALLNWPVENGEGLQVLHYRPGAEY 193

Query: 64  EPHFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           +PH+D+F           + GG R+AT++ YL+   +GG T FP+               
Sbjct: 194 KPHYDYFDPDQPGTPAVLRRGGQRVATLVTYLNTPTRGGGTTFPDI-------------- 239

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
             G  V P+KG A+ F    P  ST   SLHG  PV+EG+KW ATKW+ V  FD
Sbjct: 240 --GLEVTPLKGHAVFFSYDRPHPST--RSLHGGAPVLEGDKWVATKWLRVGRFD 289


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 95/177 (53%), Gaps = 23/177 (12%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +    ++G    +E RTSSGMF  + ++E+VA IEARIA     P ENGE +Q+LHY  G
Sbjct: 128 LTVATKTGGEEVNEDRTSSGMFFQRGENELVARIEARIARLVNWPVENGEGLQVLHYRPG 187

Query: 61  QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           + GG R+ T++MYL   EKGG T FP+  +         
Sbjct: 188 AEYKPHYDYFDPAEPGTPTILKRGGQRVGTLVMYLGEPEKGGGTTFPDVHLE-------- 239

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
                   V P +G  + F    P  ST   +LHG  PV+ GEKW ATKW+  R F+
Sbjct: 240 --------VAPKRGHGVFFSYERPHPST--RTLHGGAPVLAGEKWIATKWLRERRFE 286


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 72/172 (41%), Positives = 94/172 (54%), Gaps = 17/172 (9%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  SG+  A   RTS  M     ++E+V  IE RIA  T  P ENGE +QIL+Y  G
Sbjct: 125 LVVDRGSGEERAGSGRTSKSMAFRLKENELVERIETRIAELTGYPAENGEGLQILNYGLG 184

Query: 61  QKYEPHFDFFRDKM-NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           ++Y+PHFDFF   M +   GG R+ T L+YL+ VE GGETVF                ++
Sbjct: 185 EEYKPHFDFFPPHMADASKGGQRVGTFLIYLNDVEDGGETVF----------------SK 228

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            G +  P KG A+ F   +     D  S+H S PV +GEKW+ATKWI   N 
Sbjct: 229 AGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEKWAATKWIRESNI 280


>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 64/143 (44%), Positives = 92/143 (64%), Gaps = 1/143 (0%)

Query: 30  IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMY 89
           I+A+IE +IA  T +P +  E+  IL Y+ GQKY+ H+D F           R+ T +++
Sbjct: 131 ILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYGPQISQRVVTFILF 190

Query: 90  LSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLH 149
           LS VE+GGET+FP  E  ++ +G +      G  VKP +GDA+ F++L P+ + D TSLH
Sbjct: 191 LSSVEEGGETMFP-FENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQTSLH 249

Query: 150 GSCPVIEGEKWSATKWIHVRNFD 172
           GSCPVI+GEKW ATKWI  + +D
Sbjct: 250 GSCPVIKGEKWVATKWIRDQTYD 272


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score =  129 bits (324), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 95/177 (53%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  +VA IEARIA  T +P E+GE  Q+LHY  G 
Sbjct: 121 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 180

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F      +    ++GG R+AT+++YL+ V+ GG T FP              
Sbjct: 181 EYQPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 226

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD   D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 227 --KLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKWIATKWLRERPYRR 281


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score =  129 bits (324), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 69/177 (38%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+      RTS G      +  ++A IEARIA  T +P E+GE  Q+LHY+ G 
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F   R    +QL  GG R+AT+++YL+ V  GG T FP              
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATGFP-------------- 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVERGEKWIATKWLRERPYRR 287


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 68/161 (42%), Positives = 90/161 (55%), Gaps = 23/161 (14%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-- 73
           RTSSGMF ++ Q   V ++E RIA     P ENGE +Q+LHY  G +Y+PH+D+F  K  
Sbjct: 153 RTSSGMFFTRGQTPEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEA 212

Query: 74  ---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
                 + GG R+AT++MYL+   +GG T FP+                 G  V P+KG 
Sbjct: 213 GTPTILKRGGQRVATLVMYLNEPARGGGTTFPDV----------------GLEVAPVKGS 256

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           A+ F    P  +T   SLHG  PV+EGEKW ATKW+  R F
Sbjct: 257 AVFFSYDRPHPTTR--SLHGGAPVLEGEKWVATKWLREREF 295


>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Ectocarpus siliculosus]
          Length = 404

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/168 (38%), Positives = 102/168 (60%), Gaps = 6/168 (3%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D++ GK   +  RTS+  F+   +D ++  I+ R+  +T +P  + E +Q+L Y+ GQ
Sbjct: 232 LMDHDKGKP-DTNWRTSTTYFMPSTRDPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQ 290

Query: 62  KYEPHFDFFRDKMNQQLGG---HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           +Y  H DF  ++  + + G   +R+ TV  YLS VE+GGET+FP       R  ++S+C 
Sbjct: 291 RYTAHHDFLDERTMRNMDGGRKNRMITVFWYLSDVEEGGETIFPRYGGRTGRV-DFSDCT 349

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             G  VKP++G   +F+SL PD   D  SLHG+CPVI G+KW+A KW+
Sbjct: 350 T-GLKVKPVEGKVAMFYSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 68/175 (38%), Positives = 95/175 (54%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D +SG  +  + R S G F++ + D +VA+I+ RIA     P ENGE + IL Y  G 
Sbjct: 121 VVDPDSGGEVLIDARKSEGAFVNGSTDPLVATIDRRIAELVQQPVENGEDLHILRYGAGG 180

Query: 62  KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PHFD+F +     K + Q GG RIAT+++YL+ VE+GG+T FP+             
Sbjct: 181 EYRPHFDYFPEEQAGSKHHMQRGGQRIATLILYLNQVEEGGDTTFPDI------------ 228

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G  + P +G AL F  ++    TD  +LH   PV  GEKW ATKW+    F
Sbjct: 229 ----GLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPVERGEKWIATKWMRRGRF 279


>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 226

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 71/175 (40%), Positives = 103/175 (58%), Gaps = 14/175 (8%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D + G+  AS+ RTS   F+    D I+  I+ R A+   +P  + E +Q+L Y+  +
Sbjct: 46  LMDKDQGRP-ASDFRTSQSAFIRAHDDAILTDIDYRTASLVRIPRRHQEDVQVLRYDVTE 104

Query: 62  KYEPHFDFF------RDKMNQQL--GGHR--IATVLMYLSHVEKGGETVFPNSEVSQSRD 111
           KY+ H D+F      +DK    L   GHR  +ATV  YLS VEKGGETVFP    +Q  +
Sbjct: 105 KYDSHADYFDPALYTKDKRTLALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQ--E 162

Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            +  +C + G  VKP KG  ++F+S+ PD + D  SLHG+CPV +G KW+A KW+
Sbjct: 163 TSMKDC-KTGLKVKPEKGKVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216


>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
          Length = 282

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 74/188 (39%), Positives = 101/188 (53%), Gaps = 38/188 (20%)

Query: 15  VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGE---------------------- 50
           +R  SG+F+S ++D+   +  IE +IA    +P  +GE                      
Sbjct: 90  IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149

Query: 51  -----------AMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGET 99
                      A  IL YE GQ+Y  H+D F          HRIAT L+YLS VE+GGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209

Query: 100 VFP-NSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE 158
           +FP  + ++  +D ++  C   G  VKP +GD LLF+S+ P+ + D TSLHGSCPVI+GE
Sbjct: 210 MFPFENGLNMDKDYDFQRCI--GLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGE 267

Query: 159 KWSATKWI 166
           KW ATKWI
Sbjct: 268 KWVATKWI 275


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 69/173 (39%), Positives = 95/173 (54%), Gaps = 17/173 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+  A+  RTS GM     ++E +  +E RIA     P ENGE +Q+L+Y  G+
Sbjct: 139 VIDPKTGEEKAATGRTSKGMSFYLQENEFIKKVEKRIAELIEFPVENGEGLQVLNYGIGE 198

Query: 62  KYEPHFDFF-RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           +Y+ HFD+F + K+  + GG R+ T L+YL+ V  GGETVFP +                
Sbjct: 199 EYKSHFDYFPQSKVVPEKGGQRVGTFLIYLNDVPAGGETVFPKA---------------- 242

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           G ++ P KG A+ F   +     D  SLH S PV EGEKW ATKWI   N  K
Sbjct: 243 GVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEGEKWVATKWIRQENIYK 295


>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
          Length = 190

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 75/180 (41%), Positives = 100/180 (55%), Gaps = 18/180 (10%)

Query: 3   ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
           A NE+   + S  RTSS  +LSK  D +VA I  R+A    LP E  E MQ+LHY   Q 
Sbjct: 9   AGNEAKNGVGS-ARTSSTAWLSKTADPLVAKIRTRVAELVKLPMELAEDMQVLHYSKNQH 67

Query: 63  YEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           Y  H DFF   + +      G +R  TV  YLS VE+GGETVFP +     R  ++++C+
Sbjct: 68  YWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRVTDFADCS 127

Query: 119 RRGYAVKPMKGDALLFFSLH---------PD---ASTDSTSLHGSCPVIEGEKWSATKWI 166
            RG  VKP  G+A++F+S+          PD    + D  SLHG C VI+G+KW+A  WI
Sbjct: 128 -RGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDKWAANYWI 186


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score =  126 bits (316), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 24/176 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ +  + RTS+GMF  + ++ +VA +EARIA     P ENGE +Q+LHY  G 
Sbjct: 153 VATRTGGEEVNDD-RTSNGMFFQREENPVVARLEARIARLVNWPLENGEGLQVLHYRPGA 211

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH+D+F           + GG R+AT+++YL+  EKGG T FP+  +          
Sbjct: 212 EYKPHYDYFDPAEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLE--------- 262

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
                  V P +G+A+ F    P  ST   +LHG  PV+ G+KW ATKW+  R F+
Sbjct: 263 -------VAPRRGNAVFFSYERPHPST--RTLHGGAPVVAGDKWIATKWLRERRFE 309


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 94/175 (53%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+++ +  R+S GMF    +  ++  IEARIAA T  P ENGE +Q+LHYE G 
Sbjct: 132 VVDPVTGRNVVAGHRSSHGMFFRLGETPLIVRIEARIAALTGTPVENGEGLQMLHYEEGA 191

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +  PH D+       ++ +    G R+ T+LMYL  VE GGETVFP              
Sbjct: 192 ESTPHVDYLITGNEANRESIARSGQRMGTLLMYLKDVEGGGETVFPQI------------ 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G++V P +G AL F   +     D +SLH S P+  G+KW ATKWI  R F
Sbjct: 240 ----GWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPLRVGDKWVATKWIRTRRF 290


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 95/175 (54%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+ + +  R+S GMF    +  ++A IEARIA  T  P ENGE +Q+LHYE G 
Sbjct: 132 VVDPVTGRDVIATHRSSHGMFFRLGETPLIARIEARIAELTATPVENGEGLQMLHYEEGA 191

Query: 62  KYEPHFDFFR--DKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +  PH D+    ++ N++     G R+ T+LMYL  VE GGETVFP              
Sbjct: 192 ESTPHVDYLMTGNEANRESIARSGQRMGTLLMYLKDVEGGGETVFPQV------------ 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G+++ P +G AL F   +     D +SLH S P+  G+KW ATKWI  R F
Sbjct: 240 ----GWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPLRTGDKWVATKWIRTRRF 290


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 24/176 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ +  + RTS+GMF  + ++ +VA +EARIA     P ENGE +Q+LHY  G 
Sbjct: 142 VATRTGGEEVNDD-RTSNGMFFQREENPMVAKLEARIARLVNWPLENGEGLQVLHYRPGA 200

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH+D+F           + GG R+AT+++YL+  EKGG T FP+  +          
Sbjct: 201 EYKPHYDYFDPTEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLE--------- 251

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
                  V P +G+A+ F    P  ST   +LHG  PV+ G+KW ATKW+  R F+
Sbjct: 252 -------VAPRRGNAVFFSYERPHPST--RTLHGGAPVVAGDKWIATKWLRERRFE 298


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+      RTS G      +  ++A IEARIA    +P E+GE  Q+L+Y+ G 
Sbjct: 126 VVNPDTGEENLISARTSQGGMFQVGEHPLIAKIEARIAQAVGVPVEHGEGFQVLNYQPGG 185

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFDFF   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP-------------- 231

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 232 --KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRERPYRR 286


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 62/153 (40%), Positives = 94/153 (61%), Gaps = 17/153 (11%)

Query: 14  EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           E+RTSS  F  + ++EIVA IE RI+    +P E+GE +QIL+Y+ GQ+Y+ HFDFF   
Sbjct: 76  ELRTSSSTFFHEGENEIVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFF-SS 134

Query: 74  MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
            ++     RI+T++MYL+ VE+GGET FP                +  ++V P KG A+ 
Sbjct: 135 TSRAASNPRISTLVMYLNDVEQGGETYFP----------------KLNFSVSPQKGMAVY 178

Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           F   + D + +  +LHG  PV+ G+KW+AT+W+
Sbjct: 179 FEYFYNDQNLNDLTLHGGAPVVMGDKWAATQWM 211


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 65/171 (38%), Positives = 92/171 (53%), Gaps = 21/171 (12%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           ESG+    ++RTS G +  + +D  +  ++ RI+A    P E+GE +QILHY  G +Y P
Sbjct: 150 ESGREDVIQLRTSEGFWFQRCEDAFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRP 209

Query: 66  HFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           HFD+F        ++   GG R+AT+++YLS V  GGETVFPN+                
Sbjct: 210 HFDYFPPSQSGSVLHTSRGGQRVATLIVYLSDVAGGGETVFPNA---------------- 253

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           G AV   +G A+ F  L+     D  +LHG  PV  GEKW  TKW+  R +
Sbjct: 254 GLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNGEKWIMTKWMRERPY 304


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 93/175 (53%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + + G       RTS G   ++ +  ++A IEARIA+   +P  +GE +QILHY    
Sbjct: 133 VVNTQHGAFELKPSRTSGGTHFARGETPLIADIEARIASLLKVPEAHGEPLQILHYPVSG 192

Query: 62  KYEPHFDFFRDKM--NQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PH+DFF  +   NQ++   GG R+ T++MYLS VE GG TVFP              
Sbjct: 193 EYRPHYDFFDPEKPGNQEVLAAGGQRVGTLIMYLSDVESGGATVFP-------------- 238

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             R G  V+P KG AL F  +      D  SLHG  PV+ GEKW ATKW+    +
Sbjct: 239 --RVGLEVQPQKGAALFFSYVGEHGKLDLQSLHGGSPVLAGEKWIATKWLRAAEY 291


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 60/154 (38%), Positives = 94/154 (61%), Gaps = 21/154 (13%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +++RTSS  F+ + + E+V  +E RI+    +P ENGE +QIL+Y+ GQ+Y+ HFDFF++
Sbjct: 74  NDMRTSSSTFMEEGESEVVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKN 133

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
             N      RI+T++MYL+ VE+GGET FP                +  ++V P KG A+
Sbjct: 134 ASNP-----RISTLVMYLNDVEEGGETYFP----------------KLNFSVSPQKGMAV 172

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            F   + +   +  +LHG  PVI G+KW+AT+W+
Sbjct: 173 YFEYFYDNQELNDLTLHGGAPVIIGDKWAATQWM 206


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 98/176 (55%), Gaps = 23/176 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+    + RTS GMF  +  + + A +EARIAA    P ENGE +Q+L Y  G 
Sbjct: 126 VFDPDTGQDQQHQARTSEGMFFGRGANPLCARVEARIAALLNWPLENGEGLQVLRYGPGA 185

Query: 62  KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +YEPH+D+F       ++  + GG R+A++++YL+   +GG T FP++ +          
Sbjct: 186 QYEPHYDYFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAHLE--------- 236

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
                  V P+KG+A+ F    P   T   +LHG  PV+EGEKW ATKW+  R  D
Sbjct: 237 -------VAPIKGNAVYFSYDRPHPMTG--TLHGGAPVVEGEKWVATKWLRERRHD 283


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 92/175 (52%), Gaps = 24/175 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ I  + RTS GMF  + Q  ++  IE RIA     P ENGE +Q+LHY  G 
Sbjct: 147 VATKTGGEEINDD-RTSDGMFFQRGQSPLIQRIEERIARLLNWPIENGEGLQVLHYRPGA 205

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH+D+F           + GG R+ T++MYL+  EKGG T FP+  V          
Sbjct: 206 EYKPHYDYFDPAEPGTPTIVKRGGQRVGTLVMYLNTPEKGGGTTFPDVHVE--------- 256

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                  V P +G+A+ F    P  ST   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 257 -------VAPQRGNAVFFSYERPHPST--RTLHGGAPVLAGEKWIATKWLREREF 302


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 94/177 (53%), Gaps = 28/177 (15%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ I  + RTS GMF  + Q  ++  IE RIA     P ENGE +Q+LHY  G 
Sbjct: 146 VATKTGGEEINDD-RTSDGMFFQRGQSPLIQRIEERIARLLNWPIENGEGLQVLHYRPGA 204

Query: 62  KYEPHFDFFRDK-------MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW 114
           +Y+PH+D+F          +N+  GG R+ T++MYL+  EKGG T FP+  +        
Sbjct: 205 EYKPHYDYFDPAEPGTPSIVNR--GGQRVGTLVMYLNTPEKGGGTTFPDVHLE------- 255

Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                    V P +G+A+ F    P  ST   +LHG  PVI GEKW ATKW+  R F
Sbjct: 256 ---------VAPQRGNAVFFSYERPHPST--RTLHGGAPVIAGEKWIATKWLREREF 301


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 67/175 (38%), Positives = 92/175 (52%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G++I +  R+S GMF    +  +++ IE RIAA T  P ENGE +Q+LHYE G 
Sbjct: 132 VVDPVTGRNIVAGHRSSDGMFFRLGETPLISRIEQRIAALTGFPVENGEGLQMLHYEAGA 191

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +  PH D+       +  +    G R+ T+LMYL+ VE GGET+FP              
Sbjct: 192 ESTPHVDYLVPGNPANAESIARSGQRVGTLLMYLNDVESGGETLFPQV------------ 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V P +G A  F   +    +D  SLH S P+  G+KW ATKWI  R F
Sbjct: 240 ----GCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIGSGDKWVATKWIRTRRF 290


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 66/177 (37%), Positives = 96/177 (54%), Gaps = 21/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+      RTS G      +  ++A IE RIA    +P E+GE  Q+L+Y+ G 
Sbjct: 126 VVNPDTGEENLISARTSQGGMFQVGEHPLIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGG 185

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFDFF   R    +QL  GG R+AT+++YL+ V+ GG T FP              
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP-------------- 231

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
             + G  V P+KG+A+ F    PD + D  +LH   PV  GEKW ATKW+  R + +
Sbjct: 232 --KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRERPYRR 286


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score =  124 bits (310), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 19/158 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL ++  EI   IE RIA+   +P  +GE +QIL Y  GQ+Y+ H+DFF 
Sbjct: 78  TNDIRTSSGAFLEES--EITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV 135

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  +     +R++T++MYL+HVE+GGET FP   +S                V P KG A
Sbjct: 136 EN-SAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS----------------VSPKKGMA 178

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + F   + D S +  +LHG  PVI+GEKW AT+W+  R
Sbjct: 179 VYFEYFYQDESINKLTLHGGAPVIKGEKWVATQWMRRR 216


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 19/158 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL ++  EI   IE RIA+   +P  +GE +QIL Y  GQ+Y+ H+DFF 
Sbjct: 78  TNDIRTSSGAFLEES--EITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV 135

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  +     +R++T++MYL+HVE+GGET FP   +S                V P KG A
Sbjct: 136 EN-SAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS----------------VSPKKGMA 178

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + F   + D S +  +LHG  PVI+GEKW AT+W+  R
Sbjct: 179 VYFEYFYQDESINKLTLHGGAPVIKGEKWVATQWMRRR 216


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 92/175 (52%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+ +A+  R+S G F   A+  +VA +E RIAA T L  ENGE +Q+L Y+ G 
Sbjct: 132 VVDPVTGRDVAAGHRSSDGTFFRLAETPLVARLEMRIAALTGLAAENGEGLQLLRYQPGA 191

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +  PH D+       ++ +    G R+ T+LMYL+ VE GGETVFP              
Sbjct: 192 ESTPHVDYLVAGNETNRESIARSGQRVGTLLMYLNDVEGGGETVFPQV------------ 239

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V P +G AL F   +     D  SLH S P+  GEKW ATKWI  R F
Sbjct: 240 ----GCSVVPRRGQALYFEYCNRAGVCDPASLHASTPLRSGEKWVATKWIRARRF 290


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 95/154 (61%), Gaps = 18/154 (11%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +E+RTSS MF+   ++ IV  ++ RI+A   +P E+GE +QIL Y  GQ+Y+ H DFF  
Sbjct: 70  NELRTSSSMFIEDDENLIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSS 129

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
             + ++  +RI+T++MYL+ VE+GGET FP+ +                ++V P KG A+
Sbjct: 130 --DSKITNNRISTLVMYLNDVEQGGETFFPHLK----------------FSVSPRKGMAV 171

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            F   + D + +  +LHG  PV+EGEKW AT+W+
Sbjct: 172 YFEYFYSDQTLNDFTLHGGAPVVEGEKWVATQWM 205


>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 280

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 91/176 (51%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  +  +G  + +  RTS GMF  + ++EIVA +E R+A     P E GE +QIL Y  G
Sbjct: 122 LTVETRTGGEVLNVDRTSDGMFFERGENEIVARLEQRLAMLLRWPLEYGEGLQILRYAPG 181

Query: 61  QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y PH+D+F           + GG R+AT++MYL   E+GG T FP+            
Sbjct: 182 AQYRPHYDYFDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFPDV----------- 230

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P++G  + F    PD  T   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 231 -----GLEVAPVRGTGVFFSYDRPDPVT--RTLHGGAPVLAGEKWVATKWLREREF 279


>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
 gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 253

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 75/177 (42%), Positives = 103/177 (58%), Gaps = 12/177 (6%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V  N+S +     +RTS    +   + ++VA IE RIA WT LP  + E M++L Y +G
Sbjct: 74  LVVGNKSDE--VDPIRTSYSASIGYNETDVVADIEGRIARWTHLPRSHQEPMEVLRYING 131

Query: 61  QKYEPHFDFF-RDKMNQQLGGHRIATVLMYLSHVE--KGGETVFP-----NSEVSQSRDG 112
           QKY+ H+D+F   +     GG+R+AT LMYLS +E   GGET  P     + EV      
Sbjct: 132 QKYDAHWDWFDETETGGTGGGNRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGR 191

Query: 113 NWSECA-RRGYAVKPMKGDALLFFSLHPDA-STDSTSLHGSCPVIEGEKWSATKWIH 167
            +SECA + G +V+P KGD LLF+ + P     D  +LH SCP   G KW+ATKWIH
Sbjct: 192 GYSECASKMGISVRPKKGDVLLFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIH 248


>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 279

 Score =  123 bits (308), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 92/176 (52%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  +  +G  + +  RTS GMF  + +++IVA +E RIAA    P E GE +QIL Y  G
Sbjct: 121 LTVETRTGGEVLNVDRTSEGMFFERGENDIVARLEQRIAALLRWPVEFGEGLQILRYAPG 180

Query: 61  QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y PH+D+F           + GG R+AT++MYL    +GG T FP+            
Sbjct: 181 AQYRPHYDYFDPGEPGTPTILKRGGQRVATLVMYLQEPGQGGATTFPDV----------- 229

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P++G  + F    PD +T   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 230 -----GLEVAPVRGTGVFFSYEEPDPAT--RTLHGGAPVLAGEKWVATKWLREREF 278


>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
 gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
          Length = 318

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 94/176 (53%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +    ++G    ++ RTS GMF  + +  +V  IE RIA+    P ENGE +Q+LHY  G
Sbjct: 160 LTVATQTGGEEVNDDRTSHGMFFQRGESPLVQRIEERIASLLNWPIENGEGLQVLHYRPG 219

Query: 61  QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           Q GG R+ T++MYL+  E+GG T FP++++         
Sbjct: 220 AEYKPHYDYFDPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFPDAQIE-------- 271

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                   V P +G+A  F    P  ST   +LHG  PV+ G+KW ATKW+  R F
Sbjct: 272 --------VAPQRGNAAFFSYERPTPSTR--TLHGGAPVLAGDKWIATKWLREREF 317


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 94/170 (55%), Gaps = 21/170 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G+      RTS G++  + +D+++A +E RIA+ T  P ENGE +Q+LHY    +Y PH
Sbjct: 132 TGREDVIRNRTSEGVWYRRGEDQLIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPH 191

Query: 67  FDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           FDFF        ++   GG R+AT+++YL+ V  GGETVFP +                G
Sbjct: 192 FDFFAPDQPGSAVHTTQGGQRVATLIIYLNDVADGGETVFPTA----------------G 235

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            +V    G A+ F  ++ +   D ++LHG  PV+ G+KW  TKW+  R +
Sbjct: 236 LSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPVLAGDKWIMTKWMRERAY 285


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 62/152 (40%), Positives = 92/152 (60%), Gaps = 17/152 (11%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           +RTSS  F+ + ++ IV+ IE RI+    +P E GE +QIL+Y+ GQ+Y+ HFDFF    
Sbjct: 77  MRTSSSTFIEENENIIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPH 136

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
           N  +   RI+T++MYLS VE+GGET FP                +  ++V P KG A+ F
Sbjct: 137 N-AINNPRISTLVMYLSDVEQGGETYFP----------------KLHFSVSPQKGMAVYF 179

Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
              + D + +  +LHG  PVI G+KW+AT+W+
Sbjct: 180 EYFYNDQTLNELTLHGGAPVIVGDKWAATQWM 211


>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
          Length = 280

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 67/176 (38%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  +  +G  + +  RTS GMF  + ++EIVA +E RIAA    P E GE +QIL Y  G
Sbjct: 122 LTVETRTGGEVLNVDRTSDGMFFERGENEIVARVEQRIAALLRWPLEFGEGLQILRYAPG 181

Query: 61  QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y PH+D+F           + GG R+AT++MYL   E GG T FP+            
Sbjct: 182 AQYRPHYDYFDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFPDV----------- 230

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P +G  + F    PD  T   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 231 -----GLEVAPARGCGVFFSYDRPDPVT--RTLHGGAPVLAGEKWVATKWLREREF 279


>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
          Length = 207

 Score =  122 bits (307), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 56/92 (60%), Positives = 72/92 (78%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL++ +D+IV  IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 116 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQ 175

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHV 93
           KYEPH+D+F D  N + GG RIATVLMYL+ V
Sbjct: 176 KYEPHYDYFLDDFNTKNGGQRIATVLMYLTDV 207


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score =  122 bits (307), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 64/175 (36%), Positives = 91/175 (52%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  +GK      R+S G F     D+ +A ++ RI+A   LP ++GE +QILHY  G 
Sbjct: 125 IVDPTTGKHETIADRSSEGTFFEINADDFIARLDRRISALMNLPVDHGEGLQILHYGPGG 184

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFDFF        +    GG R++T++MYL+ VE GG T+FP              
Sbjct: 185 EYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNEVEDGGATIFPEL------------ 232

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V P KG A+ F   +     D  +LHG  PV+ GEKW  TKW+  R +
Sbjct: 233 ----GLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRGEKWIVTKWMRQRRY 283


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 96/171 (56%), Gaps = 21/171 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G      +  ++  +EARIAA T +P E+GE +QIL+Y+ G 
Sbjct: 157 VVNPDTGDENLIDARTSMGAMFQVGEHPLIERLEARIAAVTGVPVEHGEGLQILNYKPGA 216

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH+DFF   R    +QL  GG R+AT+++YL+ V  GG T FP              
Sbjct: 217 EYQPHYDFFNPQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP-------------- 262

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
             + G  V P++G+A+ F  L  D S D  +LH   PV +GEKW ATKW+ 
Sbjct: 263 --KLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKWIATKWLR 311


>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
 gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
          Length = 284

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/181 (38%), Positives = 100/181 (55%), Gaps = 26/181 (14%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  D+E G+      RTS GMF +  +  +V  IE R+AA   +P  +GE +QILHY  G
Sbjct: 124 LTVDSE-GRQQVDRRRTSEGMFFTLNEVPLVGRIEQRLAALLRVPASHGEGLQILHYLPG 182

Query: 61  QKYEPHFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
           Q+YEPHFD+F  +         +GG RIA+V+MYL+   +GG T FP   ++ +      
Sbjct: 183 QEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYLNTPARGGGTAFPELGLTVT------ 236

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
             ARRG AV         +F+       D +SLH   PV++GEKW ATKW+  R + +P+
Sbjct: 237 --ARRGSAV---------YFAYE---GGDPSSLHAGLPVLDGEKWIATKWLRERPYKRPK 282

Query: 176 K 176
           K
Sbjct: 283 K 283


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 92/171 (53%), Gaps = 21/171 (12%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E+G     ++RTS G +  + +D  +  ++ RI+A    P E+GE +QILHY  G +Y P
Sbjct: 130 ENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRP 189

Query: 66  HFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           HFD+F    N  +     GG R+AT+++YLS VE GGETVFP++                
Sbjct: 190 HFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFPDA---------------- 233

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           G AV   +G A+ F  ++     D  +LHG  PV  G+KW  TKW+  R +
Sbjct: 234 GLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRERPY 284


>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
 gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
          Length = 263

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/181 (41%), Positives = 104/181 (57%), Gaps = 23/181 (12%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MV   +S   +  ++RTS    +   +  IV+SIE RIA WT           +L Y +G
Sbjct: 95  MVVGTDS--DLIDDIRTSFSASIMYGETSIVSSIEERIARWT-----------VLRYVNG 141

Query: 61  QKYEPHFDFFRDKMNQQLGG-HRIATVLMYLSHVE--KGGETVFPNSEV----SQSRDGN 113
           QKY+ H+D+F D    + GG +R+ATVLMYLS V+   GGET  P +E      QS DG 
Sbjct: 142 QKYDAHWDWFDDNEVAKAGGSNRMATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVDGQ 201

Query: 114 -WSECA-RRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRN 170
            +S+CA R G +++P KGD LLF+ + P     D  +LH SCP   G KW+ATKWIH + 
Sbjct: 202 GYSQCAARMGISIRPRKGDVLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNKP 261

Query: 171 F 171
           +
Sbjct: 262 Y 262


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 95/181 (52%), Gaps = 21/181 (11%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           D  +G++     R+S GMF    ++  VA ++ R++    LP ENGE +Q+LHY  G + 
Sbjct: 131 DPLTGRNRLGAQRSSLGMFFRLRENAFVARLDERLSELMNLPVENGEGLQVLHYPAGAQS 190

Query: 64  EPHFDFF--RDKMNQ---QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PHFDF    +  NQ   Q  G R++T++ YL+ VE+GGETVFP +              
Sbjct: 191 LPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEVEEGGETVFPET-------------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
             G++V P +G A+ F   +     D  SLH   PV+ GEKW ATKW+  R F    + P
Sbjct: 237 --GWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSGEKWVATKWMRQRRFVAAAQAP 294

Query: 179 E 179
            
Sbjct: 295 R 295


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/175 (39%), Positives = 95/175 (54%), Gaps = 24/175 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ +  + RTS GMF  + ++ +V  IE RIA     P ENGE +Q+LHY  G 
Sbjct: 143 VATKTGGEEVNDD-RTSDGMFFQRGENPVVQRIEERIARLLDWPIENGEGLQVLHYRPGA 201

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH+D+F           + GG R+ T++MYL+  EKGG T FP+  V          
Sbjct: 202 EYKPHYDYFDPGEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFPDVHVE--------- 252

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                  V P +G+A +FFS +  A   + +LHG  PVI GEKW ATKW+  R F
Sbjct: 253 -------VAPQRGNA-VFFS-YERAHPATRTLHGGAPVIAGEKWIATKWLREREF 298


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/176 (38%), Positives = 94/176 (53%), Gaps = 26/176 (14%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ I ++ RTS GMF  + Q  ++  IE RIA     P ENGE +Q+LHY  G 
Sbjct: 147 VATKTGGEEINAD-RTSDGMFFQRGQSPLIQRIEERIARLLQWPIENGEGLQVLHYRPGA 205

Query: 62  KYEPHFDFFRDKMNQ------QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
           +Y+PH+D+F D          + GG R+ T++MYL+  +KGG T FP+  +         
Sbjct: 206 EYKPHYDYF-DPAEPGTPSIIKRGGQRVGTLVMYLNTPDKGGGTTFPDVHLE-------- 256

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                   V P +G+A+ F    P  ST   +LHG  PVI G+KW ATKW+  R F
Sbjct: 257 --------VAPQRGNAVFFSYERPHPST--RTLHGGAPVIAGDKWIATKWLREREF 302


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/171 (36%), Positives = 92/171 (53%), Gaps = 21/171 (12%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E+G     ++RTS G +  + +D  +  ++ RI+A    P E+GE +QILHY  G +Y P
Sbjct: 127 ENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRP 186

Query: 66  HFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           HFD+F    N  +     GG R+AT+++YLS VE GGETVFP++                
Sbjct: 187 HFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFPDA---------------- 230

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           G AV   +G A+ F  ++     D  +LHG  PV  G+KW  TKW+  R +
Sbjct: 231 GLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRERPY 281


>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 222

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 55/89 (61%), Positives = 68/89 (76%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +G S  S VRTSSGMFL + QD+I+ +IE RIA +TF+P E GE +Q+LHYE GQ
Sbjct: 133 VVDSATGGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQ 192

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYL 90
           KYEPHFD+F D  N + GG RIAT+LMYL
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYL 221


>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 204

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 55/89 (61%), Positives = 72/89 (80%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+E+GKS  S VRTSSG FL++ +D+IV +IE +IA +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQ 174

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYL 90
           KYEPH+D+F D+ N + GG RIATVLMYL
Sbjct: 175 KYEPHYDYFLDEFNTKNGGQRIATVLMYL 203


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/169 (39%), Positives = 91/169 (53%), Gaps = 25/169 (14%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G S  +  RTS GMF  + +  ++  IE RIA     P E GE +Q+LHY  G +Y
Sbjct: 69  DNSTGGSEVNAARTSDGMFFERGETPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQY 128

Query: 64  EPHFDFFRDKMNQ------QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +PH DFF D  +       + GG R+ TV++YL+    GG T FP               
Sbjct: 129 KPHHDFF-DPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFPEV------------- 174

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
              G  V+P+KG+A+ F    P AST   +LHG  PV++GEKW ATKW+
Sbjct: 175 ---GLEVQPIKGNAVFFSYERPLASTR--TLHGGAPVLDGEKWVATKWL 218


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 59/160 (36%), Positives = 98/160 (61%), Gaps = 18/160 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           S KS+  ++RTSS MF   A++++V+++E R++    +P ++GE +QIL+Y  GQ+Y+ H
Sbjct: 75  SNKSV-HDLRTSSSMFFDDAENDVVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAH 133

Query: 67  FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           +D+F    N ++   RI+T++MYL+ VE GGET FP                +  + V P
Sbjct: 134 YDYFSSG-NSKVNNPRISTLVMYLNDVEAGGETYFP----------------KLNFYVAP 176

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            KG A+ F   + D + +  +LHG  PV+ G+KW+AT+W+
Sbjct: 177 KKGMAVYFEYFYNDTTLNELTLHGGAPVVIGDKWAATQWM 216


>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
 gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
          Length = 284

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/170 (40%), Positives = 97/170 (57%), Gaps = 23/170 (13%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           SG    ++ RTS GMF  + ++E VA +E RIA     P ENGE +Q+LHY  G +Y+PH
Sbjct: 132 SGGEEVNKDRTSDGMFFQRGENEAVARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPH 191

Query: 67  FDFF--RDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +D+F   +    +L   GG R+AT+++YL+   +GG T FP+  +               
Sbjct: 192 YDYFDPAEPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFPDVPLE-------------- 237

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             + P +G+A +FFS +  A   S +LHG  PVIEGEKW ATKW+  R F
Sbjct: 238 --IGPRQGNA-VFFS-YGRAHPSSRTLHGGAPVIEGEKWIATKWLREREF 283


>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 280

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  +  +G  + +  RTS GMF  + ++EIVA +E R+A     P E GE +QIL Y  G
Sbjct: 122 LTVETRTGGEVLNVDRTSDGMFFERGENEIVARLEQRLATLLRWPLEYGEGLQILRYAPG 181

Query: 61  QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y PH+D+F           + GG R+AT++MYL   E GG T FP+            
Sbjct: 182 AQYRPHYDYFDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFPDV----------- 230

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V P++G  + F    PD  T   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 231 -----GLEVAPVRGCGVFFSYDRPDPVT--RTLHGGAPVLAGEKWVATKWLREREF 279


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/164 (38%), Positives = 94/164 (57%), Gaps = 20/164 (12%)

Query: 11  IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           + SE+RTS GMF  + ++  +  IE RI+A   +P E+ E +Q+LHY  GQ+Y+ H+DFF
Sbjct: 64  VVSEIRTSRGMFFEEEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFF 123

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
               +     +RI+T+++YL+ VE GGETVFP  ++                 VKP +G 
Sbjct: 124 GPN-SPSASNNRISTLIIYLNDVEAGGETVFPLLDLE----------------VKPERGS 166

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI---HVRNF 171
           AL F   +     ++ +LH S PV+ GEKW AT+W+    VR F
Sbjct: 167 ALYFEYFYRQQELNNLTLHSSVPVVRGEKWVATQWMRRQRVREF 210


>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
          Length = 334

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 89/147 (60%), Gaps = 4/147 (2%)

Query: 28  DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
           D ++A IE ++AA T +P  +GE   +L YE  Q Y+ H+D F ++        RIATVL
Sbjct: 185 DGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSHYDSFSEEEYGPQFSQRIATVL 244

Query: 88  MYLSHVEKGGETVF---PNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
           +YL+ VE+GGETVF       +++    ++  C   G  VKP +GDALLFFS+  + + D
Sbjct: 245 LYLADVEEGGETVFLLEGKGGLARLERIDYKAC-DTGIKVKPRQGDALLFFSVSVNGTLD 303

Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNF 171
             SLHG CPV+ G KW+ TKWI  R F
Sbjct: 304 KHSLHGGCPVVAGTKWAMTKWIRNRCF 330


>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
 gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
          Length = 284

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/181 (39%), Positives = 98/181 (54%), Gaps = 26/181 (14%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  D+E G+      RTS GMF +  +  +V  IE R+AA   +P  +GE +QILHY  G
Sbjct: 124 LTVDSE-GRQQVDRRRTSEGMFFTLDEVPLVGRIERRVAALLDVPASHGEGLQILHYLPG 182

Query: 61  QKYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
           Q YEPHFD+F       +    +GG RIA+V+MYL+   +GG T FP   ++ +      
Sbjct: 183 QAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYLNTPARGGGTAFPALGLTVT------ 236

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
             ARRG AV         +F+       D +SLH   PV+EGEKW ATKW+  R + +P 
Sbjct: 237 --ARRGAAV---------YFAYE---GGDCSSLHAGLPVLEGEKWIATKWLRERPYRRPT 282

Query: 176 K 176
           K
Sbjct: 283 K 283


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 62/159 (38%), Positives = 91/159 (57%), Gaps = 17/159 (10%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
            K   S +RTSSGMF  + ++ +++ IE RI++   LP E+ E +Q+LHYE GQ+++PHF
Sbjct: 61  AKKEISSIRTSSGMFFEENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHF 120

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
           DFF    +     +RI T+++YL+ VE+GG T FPN                 G    P 
Sbjct: 121 DFFGPN-HPSSSNNRICTLVVYLNDVEEGGVTTFPN----------------LGIVNVPK 163

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           KG A+ F   + D   +  +LH   PVI+GEKW AT+W+
Sbjct: 164 KGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWM 202


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 65/161 (40%), Positives = 90/161 (55%), Gaps = 23/161 (14%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           RTS GMF ++ ++ +V  +EARIA     P + GE +Q+L Y  G +Y+PH+D+F     
Sbjct: 134 RTSQGMFFARGENPLVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEP 193

Query: 72  -DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
                 Q GG R+AT++MYL+  E+GG TVFP+                 G  V P +G 
Sbjct: 194 GTPAILQRGGQRVATLIMYLNEPEQGGATVFPDI----------------GLQVTPRRGT 237

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           A +FFS +P A+  S + HG  PV  GEKW ATKW+  R F
Sbjct: 238 A-VFFS-YPAANPASLTRHGGEPVKAGEKWIATKWLREREF 276


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 92/170 (54%), Gaps = 21/170 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G      +  ++  IEARIAA T  P E+GE  Q+L+Y+ G 
Sbjct: 131 VVNPDTGDENLIDARTSMGAMFQVGEHALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGG 190

Query: 62  KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFDFF  K        ++GG R+AT+++YL+    GG T FP              
Sbjct: 191 EYQPHFDFFNPKRPGEARQLRVGGQRVATMVIYLNSPASGGATAFP-------------- 236

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             R G  V P+KG+A+LF    PD + D  +LH   PV  GEKW ATKW+
Sbjct: 237 --RIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKWIATKWL 284


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 92/170 (54%), Gaps = 21/170 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G      +  ++  IEARIAA T  P E+GE  Q+L+Y+ G 
Sbjct: 131 VVNPDTGDENLIDARTSMGAMFQVGEHALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGG 190

Query: 62  KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFDFF  K        ++GG R+AT+++YL+    GG T FP              
Sbjct: 191 EYQPHFDFFNPKRPGEARQLRVGGQRVATMVIYLNSPASGGATAFP-------------- 236

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             R G  V P+KG+A+LF    PD + D  +LH   PV  GEKW ATKW+
Sbjct: 237 --RIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKWIATKWL 284


>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 186

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 72/168 (42%), Positives = 91/168 (54%), Gaps = 7/168 (4%)

Query: 14  EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           +VRTS G FL       +  +E +IAA T LP  NGE   +L+Y+H Q Y+ H D F  K
Sbjct: 19  QVRTSKGTFLGGDSSPALRWLEDKIAAVTLLPRTNGEFWNVLNYKHSQHYDSHMDSFDPK 78

Query: 74  MNQQLGGHRIATVLMYLSHVE-KGGETVFPNSEVSQSRD--GNWSEC-ARRGYAVKPMKG 129
                   RIATV++ LS     GGETVF     S       NW++C A  G   KP  G
Sbjct: 79  EYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKPISNWTDCDADGGLKYKPRAG 138

Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR---NFDKP 174
           DA+LF+S  PD   D  +LHGSCPV+ G KW A KW+  +   + DKP
Sbjct: 139 DAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNKGEYDHDKP 186


>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 276

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 65/140 (46%), Positives = 91/140 (65%), Gaps = 9/140 (6%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S+VRTSSGMF++  + +  ++ +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 97  VVDVATGKGVKSDVRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEP 156

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Q Y PH D+F D  N + GG R+AT+LMYL+   +GGET FP     Q+ DG      R
Sbjct: 157 NQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECICGGR 211

Query: 120 --RGYAVKPMKGDALLFFSL 137
             RG  VKP KGDA+LF+S+
Sbjct: 212 LVRGLCVKPNKGDAVLFWSM 231


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 21/170 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     + RTS G      +  ++  IE RIAA   +P ++GE +QIL+Y+ G 
Sbjct: 120 VINPDTGDENLIDARTSMGAMFQVGEHTLIQRIEDRIAAVLGVPVDHGEGLQILNYKPGG 179

Query: 62  KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFDFF  K        ++GG R AT+++YL+  + GG T FP              
Sbjct: 180 EYQPHFDFFNPKRPGEARQLRVGGQRTATLVIYLNTPQAGGATAFP-------------- 225

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             R G  V P+KG+A+ F  L PD   D  +LH   PV  GEKW ATKW+
Sbjct: 226 --RIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLPVQSGEKWIATKWL 273


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 67/170 (39%), Positives = 90/170 (52%), Gaps = 25/170 (14%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G S  +  RTS GMF  + +  ++  IE RIA     P E GE +Q+L Y  G +Y
Sbjct: 124 DNSTGGSEVNAARTSDGMFFERGEKPLIERIERRIAELVRWPVERGEGLQVLRYRPGAQY 183

Query: 64  EPHFDFFRDKMNQ------QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +PH DFF D  +       + GG R+ TV+MYL+    GG T FP               
Sbjct: 184 KPHHDFF-DPAHPGTANILRRGGQRVGTVVMYLNTPAGGGATTFPEV------------- 229

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              G  V+P+KG+A+ F    P AST   +LHG  PV++GEKW ATKW+ 
Sbjct: 230 ---GLEVQPVKGNAVFFSYERPLAST--RTLHGGAPVLDGEKWVATKWMR 274


>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
 gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
          Length = 429

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 71/158 (44%), Positives = 91/158 (57%), Gaps = 6/158 (3%)

Query: 14  EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           +VRTS G FL       +  +E++IAA T +P +NGE   +L+Y+H Q Y+ H D F  K
Sbjct: 265 QVRTSKGTFLGGDSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPK 324

Query: 74  MNQQLGGHRIATVLMYLS-HVEKGGETVFPNSEVSQSRD---GNWSEC-ARRGYAVKPMK 128
              Q    RIATV++ LS     GGETVF   E   + D    NW++C A  G   KP  
Sbjct: 325 EYGQQYSQRIATVIVVLSDEGLVGGETVF-KREGKANIDKPITNWTDCDADGGLRYKPRA 383

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           GDA+LF+S  PD   D  +LHGSCPV+ G KW A KWI
Sbjct: 384 GDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWI 421


>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
 gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
          Length = 282

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 69/178 (38%), Positives = 97/178 (54%), Gaps = 27/178 (15%)

Query: 5   NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
           +  GK    + RTS GMF    +  +VA+IE R+A    +P  +GE +QILHY  GQ+YE
Sbjct: 125 DSDGKQQIDQRRTSEGMFFRAGETPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYE 184

Query: 65  PHFDFF------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           PH+D+F       DK+  +  G RIA+V+MYL+  E+GG T FP   ++ +        A
Sbjct: 185 PHYDWFDPALPGYDKLTAR-AGQRIASVVMYLNTPERGGGTAFPEIGLTVT--------A 235

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           RRG AV         +F+       D +SLH   PV++GEKW AT W+  R F +  K
Sbjct: 236 RRGAAV---------YFAYE---GGDQSSLHAGLPVLQGEKWIATHWLRERPFGQGSK 281


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 64/173 (36%), Positives = 93/173 (53%), Gaps = 21/173 (12%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           D  SG+ +  E R+S GMF    ++  +A ++ R++    LP ENGE +Q+L Y  G + 
Sbjct: 131 DPLSGRDLVGEQRSSLGMFFRLRENAFIARLDQRVSELMNLPVENGEGLQVLCYPAGAQS 190

Query: 64  EPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PHFDF       +K +    G R++T++ YL+ VE+GGET+FP             EC 
Sbjct: 191 MPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEVEEGGETIFP-------------EC- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             G++V P +G A+ F   +     D  SLH   PV+ GEKW ATKW+  R F
Sbjct: 237 --GWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHGEKWVATKWMRQRRF 287


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/157 (39%), Positives = 94/157 (59%), Gaps = 19/157 (12%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           +RTSSG FL   ++E VA IE R+++   +P E+GE + IL Y  GQ+Y+ H+D+F +  
Sbjct: 82  IRTSSGTFLE--ENETVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEH- 138

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
           ++    +RI+T++MYL+ VE+GGET FP   +S                + P KG A+ F
Sbjct: 139 SRAAENNRISTLVMYLNDVEEGGETFFPKLNLS----------------IAPKKGSAVYF 182

Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
              + D S +  +LHG  PVI+GEKW AT+W+  R+ 
Sbjct: 183 EYFYNDKSLNELTLHGGAPVIKGEKWVATQWMKRRSL 219


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 62/159 (38%), Positives = 90/159 (56%), Gaps = 16/159 (10%)

Query: 14  EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           E R S+  +L    D++V ++  RIA  T L  E  EA+Q+ +Y  G  YE H+D    +
Sbjct: 344 EFRISTAAWLQPDHDDVVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASR 403

Query: 74  MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
             +   G RIAT ++YL+ VE+GG T FP                R G AV+P  GDA+ 
Sbjct: 404 ERELPEGDRIATFMIYLNQVEQGGYTAFP----------------RLGAAVEPGHGDAVF 447

Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           +++L PD  +D+ +LHG+CPV++G KW A KWIH +  D
Sbjct: 448 WYNLLPDGESDNNTLHGACPVLQGSKWVANKWIHEKKND 486


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/171 (39%), Positives = 92/171 (53%), Gaps = 19/171 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +GK ++  VRTS G +L +  + +   IE R+   T L  +  EA  I++Y  G  Y  H
Sbjct: 350 NGKYVSRRVRTSKGAWLERDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAH 409

Query: 67  FDFFRDKMNQQL-GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           +DFF     Q    G RIATVL YLS VE+GG TVFPN ++                AV 
Sbjct: 410 YDFFNTTKQQTSETGDRIATVLFYLSDVEQGGATVFPNLKL----------------AVS 453

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           P +G AL +++L  + + D+ +LHG CPV+ G KW  T WIH R   F +P
Sbjct: 454 PERGMALFWYNLLDNGTGDTRTLHGGCPVLVGSKWVMTLWIHERAQLFTRP 504


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 61/159 (38%), Positives = 91/159 (57%), Gaps = 17/159 (10%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
            K   S +RTSSGMF  + ++ +++ IE RI++   LP E+ E +Q+LHYE GQ+++ HF
Sbjct: 61  AKKEISSIRTSSGMFFEENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHF 120

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
           DFF    +     +RI+T+++YL+ VE+GG T FPN                 G    P 
Sbjct: 121 DFFGPN-HPSSSNNRISTLVVYLNDVEEGGVTTFPN----------------LGIVNVPK 163

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           KG A+ F   + D   +  +LH   PVI+GEKW AT+W+
Sbjct: 164 KGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWM 202


>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
          Length = 222

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 52/89 (58%), Positives = 70/89 (78%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GKS  S VRTSSGMFL + +D+++  IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 134 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQ 193

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYL 90
           KYEPHFD+F D+ N + GG R+AT+LMYL
Sbjct: 194 KYEPHFDYFLDEFNTKNGGQRMATLLMYL 222


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 65/175 (37%), Positives = 94/175 (53%), Gaps = 24/175 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ I  + RTS+GMF  + +  IV+ +E RIA     P ++GE +Q+LHY  G 
Sbjct: 138 VATQSGGEEINDD-RTSNGMFFQRGETGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGA 196

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH D+F           + GG R+ T+++YL+  E+GG T+FP   +          
Sbjct: 197 EYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFPEVPLQ--------- 247

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                  V P +G+A+ F    PD ST   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 248 -------VVPRRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREF 293


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 65/175 (37%), Positives = 94/175 (53%), Gaps = 24/175 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA    G+ I  + RTS+GMF  + +  IV+ +E RIA     P ++GE +Q+LHY  G 
Sbjct: 138 VATQSGGEEINDD-RTSNGMFFQRGETGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGA 196

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH D+F           + GG R+ T+++YL+  E+GG T+FP   +          
Sbjct: 197 EYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFPEVPLQ--------- 247

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                  V P +G+A+ F    PD ST   +LHG  PV+ GEKW ATKW+  R F
Sbjct: 248 -------VVPRRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREF 293


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   H D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFHQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/170 (37%), Positives = 89/170 (52%), Gaps = 21/170 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E G  +    RTS+     + + +I+ +IEARIA     P ++GE +Q+L YE G 
Sbjct: 109 VVDPEDGSFVEHSARTSTSTGYHRGEIDIIKTIEARIADLINWPVDHGEGLQVLRYEDGG 168

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PHFDFF       ++  + GG R+ T LMYLS V+ GG T FPN             
Sbjct: 169 EYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFPN------------- 215

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
                + ++P KG AL F + +  A  +  +LH   PV EG K+ ATKW+
Sbjct: 216 ---LNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKYLATKWL 262


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    DE+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DDELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 277

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/178 (37%), Positives = 96/178 (53%), Gaps = 27/178 (15%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +  D  +G    +  RTS GMF ++ ++E++  IEARIA     P +NGE +Q+L Y  G
Sbjct: 119 LTVDIRTGGEELNHDRTSHGMFYTRGENEVIRRIEARIARLLNWPVQNGEGLQVLRYRRG 178

Query: 61  QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y+PH+D+F           + GG R+A+++MYL    +GG TVFP+            
Sbjct: 179 AEYKPHYDYFDPGEPGTAAILRRGGQRVASLIMYLREPGEGGATVFPDI----------- 227

Query: 116 ECARRGYAVKPMKGDALLF-FSL-HPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                G  V+P +G A+ F ++L HP     S +LHG  PV  GEKW ATKW+  R F
Sbjct: 228 -----GLKVRPQQGSAVFFSYALAHP----ASLTLHGGEPVKSGEKWIATKWLREREF 276


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
 gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
          Length = 289

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 87/161 (54%), Gaps = 23/161 (14%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           RTS GMF  + +  +V  +E RIA     P +NGE +Q+LHY  G +Y+PH+D+F     
Sbjct: 146 RTSDGMFFQRGETPVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQP 205

Query: 76  Q-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
                 + GG R+AT+++YL++  KGG T FP+  +                 V P +G+
Sbjct: 206 GTSTIVRRGGQRVATLVIYLNNPRKGGGTTFPDVPLE----------------VAPRQGN 249

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           A+ F    P  ST   +LHG   VIEGEKW ATKW+  R F
Sbjct: 250 AVFFSYERPHPST--RTLHGGASVIEGEKWIATKWLREREF 288


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score =  117 bits (292), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score =  117 bits (292), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 60/162 (37%), Positives = 92/162 (56%), Gaps = 23/162 (14%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            SEVRTSS MF  ++++E +  +EAR+A    +P  + E +Q+L Y+ G++Y PHFD+F 
Sbjct: 68  VSEVRTSSSMFFEESENECIGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFT 127

Query: 72  D--KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
               MN     +RI+T++MYL+ VE+GGET FP+                  ++V P KG
Sbjct: 128 QGSSMN-----NRISTLVMYLNDVEEGGETYFPSLH----------------FSVTPKKG 166

Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            A+ F   + D   +  +LH   PV  GEKW AT+W+  + +
Sbjct: 167 SAVYFEYFYNDTRLNELTLHAGHPVEAGEKWVATQWMRRQRY 208


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 104/188 (55%), Gaps = 25/188 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   ++ R S   +L + + + +A +  R++  T L     E +Q+++Y  G 
Sbjct: 362 VQNTDTGELEIAQYRISKSAWLKEEEHKHIADVSQRVSDMTGLTMSTAEELQVVNYGIGG 421

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R D+ N  + LG G+RIATVL Y+S VE+GG TVFP+ +VS      W   
Sbjct: 422 HYEPHFDFARRDERNAFKSLGTGNRIATVLFYMSDVEQGGATVFPSIQVSL-----W--- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP- 174
                   P KG A  +++LHP    D  + H +CPV+ G KW + KWIH R   F +P 
Sbjct: 474 --------PQKGSAAFWYNLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQEFRRPC 525

Query: 175 --EKEPED 180
             E+  ED
Sbjct: 526 TLERPSED 533


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 21/171 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D ++GK      R+S G +  + +  +++ ++ RI+     P ++GE +QILHY  G 
Sbjct: 126 IVDPQTGKFQVIADRSSEGTYFQRGESPLISRLDRRISELMNWPEDHGEGIQILHYGVGA 185

Query: 62  KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PHFD+F +      +     G R+AT++MYL+ V +GGETVFP+             
Sbjct: 186 QYKPHFDYFLENESGGALQMTQSGQRVATLVMYLNEVTEGGETVFPDV------------ 233

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
               G ++ P +G A  F   +     D  +LHG  PV+ GEKW ATKW+ 
Sbjct: 234 ----GISITPKRGSAAYFAYCNSLGQVDPATLHGGAPVLTGEKWIATKWMR 280


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score =  116 bits (290), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 90/156 (57%), Gaps = 17/156 (10%)

Query: 11  IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           + S++RTS GMF  + +   +  IE RIA    +P E+ E +Q+LHY  GQ+Y+ H DFF
Sbjct: 64  VVSDIRTSRGMFFEEEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFF 123

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
               +     +RI+T+++YL+ VE+GGETVFP                  G A+KP +G 
Sbjct: 124 APG-SPAARNNRISTLIVYLNDVEEGGETVFP----------------LLGIAMKPKRGA 166

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           AL F   + + + +  +LH S PV+ GEKW AT+W+
Sbjct: 167 ALYFEYFYRNQALNDLTLHSSVPVVRGEKWVATQWM 202


>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           AH820]
 gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
 gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH820]
 gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
          Length = 216

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
           anthracis str. CI]
 gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
           anthracis str. CI]
          Length = 216

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
 gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
 gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CNEVA-9066]
 gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A1055]
 gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Western North America USA6153]
 gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Kruger B]
 gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Vollum]
 gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Australia 94]
 gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
 gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Ames]
 gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. 'Ames Ancestor']
 gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
          Length = 216

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
 gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
          Length = 216

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
 gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
          Length = 232

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
 gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
 gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Sterne]
 gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
          Length = 232

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 61/161 (37%), Positives = 86/161 (53%), Gaps = 21/161 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R+S G F     D+ +A ++ RIA     P ENGE +Q+LHY  G +Y+PHFD+F     
Sbjct: 141 RSSEGTFFPVNADDFIARLDRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDP 200

Query: 72  -DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
             +    +GG R++T+L+YL+ V +GG TVFP                  G  V P KG 
Sbjct: 201 GSEAQMVVGGQRVSTLLIYLNDVAQGGATVFPT----------------LGLRVLPRKGM 244

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           A+ F   + D   D  +LHG  PV +GEKW  TKW+  R++
Sbjct: 245 AVYFEYSNRDGQVDPLTLHGGEPVEKGEKWIITKWMRQRSY 285


>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
          Length = 315

 Score =  115 bits (289), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 88/165 (53%), Gaps = 20/165 (12%)

Query: 11  IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           + S  RT++  +L   Q  +V  +E  +A  T   PENGE +QILHY+  Q+++ H D+F
Sbjct: 128 VESSTRTNTAAWLEYHQGPVVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYF 187

Query: 71  RDKM----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
                   N + GG+R+AT ++YL + E+GGET F   +                  VKP
Sbjct: 188 DPATDPPENFEPGGNRLATAIIYLQNAEEGGETDFMKIDTK----------------VKP 231

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             G A+LF+ L PD S D  ++H   P   GEKW ATKWIH R +
Sbjct: 232 EAGSAVLFYDLKPDGSVDKLTIHSGNPPKGGEKWVATKWIHERRY 276


>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
 gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
          Length = 289

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 87/161 (54%), Gaps = 23/161 (14%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           RTS GMF  + +  +V  +E RIA     P +NGE +Q+LHY  G +Y+PH+D+F     
Sbjct: 146 RTSDGMFFQRGETPVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQP 205

Query: 76  Q-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
                 + GG R+AT+++YL++  KGG T FP+  +                 V P +G+
Sbjct: 206 GTSTIVRRGGQRVATLVIYLNNPLKGGGTTFPDVPLE----------------VAPRQGN 249

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           A+ F    P  ST   +LHG   VIEGEKW ATKW+  R F
Sbjct: 250 AVFFSYERPHPST--RTLHGGASVIEGEKWIATKWLREREF 288


>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus AH187]
 gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           Q1]
 gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus NC7401]
 gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
 gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH187]
 gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           Q1]
 gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NC7401]
 gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
          Length = 216

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 92/175 (52%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  +G+      RTS G++  + +D ++  ++ RIA+    P ENGE +QILHY    
Sbjct: 154 IVDPATGREDVIRNRTSEGIWYQRGEDALIERLDQRIASLMNWPLENGEGLQILHYGPSG 213

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PHFD+F        ++   GG R+AT+++YL+ V  GGET+FP +            
Sbjct: 214 EYRPHFDYFPPDQPGSAVHTARGGQRVATLVVYLNDVPDGGETIFPEA------------ 261

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V   +G A+ F  ++     D  +LHG  PV+ G+KW  TKW+  R +
Sbjct: 262 ----GLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKWIMTKWVRERPY 312


>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 120

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 57/116 (49%), Positives = 74/116 (63%), Gaps = 2/116 (1%)

Query: 51  AMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR 110
           A  +L YE GQKY  H+D F           RIA+ L+YLS VE+GGET+FP    +   
Sbjct: 1   AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDNIDS 60

Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + ++ +C   G  VKP +GD LLF+SL  + + D TS+HGSCPVI+GEKW ATKWI
Sbjct: 61  NYDYVQCI--GLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWI 114


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
 gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
          Length = 216

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/174 (35%), Positives = 89/174 (51%), Gaps = 21/174 (12%)

Query: 3   ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
            + E+G       RTS G +    +D ++  IE R+AA    P ENGE +Q+L Y  G +
Sbjct: 127 VNAETGTQEVIRHRTSHGTWFQNGEDALIRRIETRLAALMNCPVENGEGLQVLRYTPGGE 186

Query: 63  YEPHFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           Y  H+D+F+      L     GG R+AT+++YL+ V  GGETVFP +             
Sbjct: 187 YRSHYDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFPEA------------- 233

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
              G +V P +GDA+ F  ++     D  +LH   PV +GEKW  TKW+  R +
Sbjct: 234 ---GISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKWIMTKWVRERPY 284


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   ++ R S   +L + +D +VA++  R+   T L  E  E +Q+++Y  G 
Sbjct: 74  VQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGG 133

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PH+DF R +++N  + LG G+RIATVL Y+S V +GG TVFP           W   
Sbjct: 134 HYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFP-----------WL-- 180

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              G A++P+KG A ++F+L+P  + D  + H +CPV++G KW   KW+H
Sbjct: 181 ---GVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 227


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
          Length = 216

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDRSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 93/154 (60%), Gaps = 18/154 (11%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           +++RTS+ +FL +   E+V  +E RI+    +P E+GE +Q+L+Y+ GQ+Y+ HFDFF  
Sbjct: 69  NDIRTSTSVFLPEDASEVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSP 128

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
           K  + +   RI+T+++YL+ VE+GG+T FPN ++S                V P KG A+
Sbjct: 129 K--KLIENPRISTLVLYLNDVEEGGDTYFPNLKLS----------------VSPHKGMAV 170

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            F   + D   +  +LHG  PV  G+KW+AT W+
Sbjct: 171 YFEYFYDDPMLNELTLHGGAPVTIGDKWAATMWM 204


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/168 (38%), Positives = 91/168 (54%), Gaps = 23/168 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           D ++G S     RTS G F  +    + A+IEARIA     P ENGE +Q+LHY  G ++
Sbjct: 134 DAQTGGSQVHADRTSRGTFFERGAHPVCATIEARIARLLEWPVENGEGLQVLHYPPGAEF 193

Query: 64  EPHFDFFR-DKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F  D+   ++    GG R+ATV+MYL+   +GG T FP++ +            
Sbjct: 194 RPHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPARGGATTFPDAHLE----------- 242

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
                V  +KG+A+ F    P   T   +LHG  PV EGEKW ATKW+
Sbjct: 243 -----VAAVKGNAVFFSYDRPHPMT--RTLHGGAPVTEGEKWIATKWL 283


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 17/155 (10%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
             ++RTSS MF  + ++E+VA IE R++    +P E+GE +Q+L+Y  GQ+Y+ HFDFF 
Sbjct: 75  VDDIRTSSSMFFEEGENELVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFS 134

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
                     RI+T++MYL+ VE+GGET FP                +  ++V P KG A
Sbjct: 135 SSSRAASNP-RISTLVMYLNDVEEGGETYFP----------------KLNFSVNPQKGSA 177

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + +   +  +LHG  PVI+G KW+AT+W+
Sbjct: 178 VYFEYFYDNQDLNDLTLHGGAPVIKGSKWAATQWM 212


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GK   ++ R S   +L+  +  +V  I  RI   T L  +  E +Q+ +Y  G 
Sbjct: 353 VHDPQTGKLTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGG 412

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 413 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEV------------- 459

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G AVKP+KG A+ +++L P    D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 460 ---GAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 515


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
 gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
          Length = 248

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSINELTLHGGAPVTKGEKWIATQWV 242


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
 gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           ATCC 10987]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 210


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
 gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWV 210


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 88  VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 145

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 146 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VNPRKGMA 188

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 189 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 223


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   ++ R S   +L + +D +VA++  R+   T L  E  E +Q+++Y  G 
Sbjct: 377 VQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGG 436

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PH+DF R +++N  + LG G+RIATVL Y+S V +GG TVFP           W   
Sbjct: 437 HYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFP-----------WL-- 483

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              G A++P+KG A ++F+L+P  + D  + H +CPV++G KW   KW+H
Sbjct: 484 ---GVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 530


>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
 gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 210


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   ++ R S   +L + +D +VA++  R+   T L  E  E +Q+++Y  G 
Sbjct: 359 VQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGG 418

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PH+DF R +++N  + LG G+RIATVL Y+S V +GG TVFP           W   
Sbjct: 419 HYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFP-----------W--- 464

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              G A++P+KG A ++F+L+P  + D  + H +CPV++G KW   KW+H
Sbjct: 465 --LGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 512


>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
 gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWV 210


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
 gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           B4264]
 gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           B4264]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSINELTLHGGAPVTKGEKWIATQWV 210


>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
 gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
 gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
          Length = 216

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/160 (36%), Positives = 92/160 (57%), Gaps = 19/160 (11%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            S++RTSS  FL   +D++   IE R+A    +P E+GE + IL+Y+ GQ+Y+ H+D+FR
Sbjct: 70  VSDIRTSSSTFL--PEDDLTNRIEKRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFR 127

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
            K  +     RI+T+++YL+ VE+GGET FP+  +S                + P KG A
Sbjct: 128 SKA-KAANNPRISTLVLYLNDVEEGGETYFPHMNLS----------------ISPHKGMA 170

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + F   + D   +  +LHG  PV  GEKW+AT W+  + +
Sbjct: 171 VYFEYFYSDPLINERTLHGGSPVTSGEKWAATMWVRRKQY 210


>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
 gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
          Length = 216

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPQLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 96/179 (53%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   +  R S   +L  A+DE++ +I  R+   T L  E  E +Q+++Y  G 
Sbjct: 379 VQNYKTGELEFANYRISKSAWLKDAEDEMIRTISQRVEDMTGLTMETAEELQVVNYGIGG 438

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S V +GG TVFP+  +           
Sbjct: 439 HYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLNL----------- 487

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  +F+LH     D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 488 -----ALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKWVSNKWIHERGQEFRRP 541


>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
 gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
          Length = 295

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 90/172 (52%), Gaps = 23/172 (13%)

Query: 5   NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
           N SG S  +  RTS GMF  + +  +  +IE RIAA    P ENGE +Q+L Y  G +Y+
Sbjct: 141 NGSGGSEVNAARTSDGMFFDRGEFPLCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYK 200

Query: 65  PHFDFFRDKMNQ-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            H D+F           + GG R+ TV+MYL+H  +GG T FP+                
Sbjct: 201 AHHDYFDPAQPGTPTILKRGGQRVGTVVMYLNHPIRGGGTAFPDV--------------- 245

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            G  V P KG+A +FFS +  A   + +LH   PV+EGEKW ATKW+    F
Sbjct: 246 -GLEVAPFKGNA-VFFS-YDRAHPMTRTLHAGTPVLEGEKWVATKWVREGEF 294


>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
 gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
          Length = 216

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 286

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 91/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS GM L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 128 DNANGEHLVHAARTSDGMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 188 RPHYDYFDPDAVGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 285


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
 gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
          Length = 216

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++ YL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVXYLNDVEEGGETFFPKLNLS----------------VHPRKGXA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
 gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
          Length = 216

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T+++YL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVIYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/179 (38%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GK   +  R S   +LS  ++ IVA I  RI   T L     E +Q+ +Y  G 
Sbjct: 373 VHDPQTGKLTTAHYRVSKSAWLSGYENPIVARINTRIQDLTGLDVSTAEELQVANYGVGG 432

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 433 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 479

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 480 ---GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 535


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
 gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
          Length = 248

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
 gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
          Length = 248

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
 gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
          Length = 216

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
 gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
          Length = 216

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWM 210


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/160 (38%), Positives = 93/160 (58%), Gaps = 19/160 (11%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG+F    ++E VA IE RI+    +P E+G+ +Q+L Y  GQ+Y+PHFDFF 
Sbjct: 72  VNQIRTSSGVFCE--ENETVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFA 129

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 130 DT-SRASANNRISTLVMYLNDVEEGGETTFPMLNLS----------------VFPSKGMA 172

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + F   + +   +  +LH   PV +GEKW AT W+  + F
Sbjct: 173 VYFEYFYSNHELNERTLHAGAPVRKGEKWVATMWMRRQTF 212


>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
 gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
          Length = 216

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
 gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
          Length = 216

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
           Hakam]
 gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
           Hakam]
          Length = 232

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ A IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T+++YL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVIYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +     +  +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EHSRSAVN-NRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL  +  E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E  + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLE--DNEFTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
 gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
          Length = 232

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +     +  +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EHSRSAVN-NRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 91/175 (52%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+ +  + RTS     ++A+  ++A +EARIAA    P ENGE MQ+L Y  G 
Sbjct: 121 VVDPATGEFVKHQDRTSMNAAFARAEHPLIARLEARIAAAIHWPAENGEGMQVLRYRSGG 180

Query: 62  KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+ HFD+F       + N Q GG R+ T L+YL  V+ GG T FP              
Sbjct: 181 EYKAHFDYFDTQSEGGRKNMQTGGQRVGTFLVYLCDVDAGGATRFP-------------- 226

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
                + ++P KG AL F +  P+   +  +LH   PV+ G K+ A+KW+  + +
Sbjct: 227 --ALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPVVSGVKYLASKWLREKPY 279


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 64/177 (36%), Positives = 90/177 (50%), Gaps = 23/177 (12%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G  +    RTS  M L   QD +   IEARIA     P ENGE +Q+L Y  G +Y
Sbjct: 128 DNANGAHVVHAARTSDSMCLQLGQDALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           +PH+D+F        +  Q GG R+A+++MYL+  ++GG T FP+  +            
Sbjct: 188 QPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPDRGGATRFPDVHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
                +  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P+
Sbjct: 237 -----IAAIKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAARMPD 286


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL  +  E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 107 VNDIRTSSGAFLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN++G  I    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 128 DNDNGAQIVHAARTSDSMCLQLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           +PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP+  +            
Sbjct: 188 QPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERGGATRFPDVHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRLP 285


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL  +  E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 288

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN  G  I    RTS  M L   QD +   IEARIA     P E+GE +Q+L Y  G +Y
Sbjct: 130 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 189

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP+  +            
Sbjct: 190 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLD----------- 238

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   +LH   PV+ GEKW ATKW+  R    P
Sbjct: 239 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 287


>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 308

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN  G  I    RTS  M L   QD +   IEARIA     P E+GE +Q+L Y  G +Y
Sbjct: 150 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 209

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP+  +            
Sbjct: 210 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLD----------- 258

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   +LH   PV+ GEKW ATKW+  R    P
Sbjct: 259 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 307


>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
 gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
          Length = 286

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN  G  I    RTS  M L   QD +   IEARIA     P E+GE +Q+L Y  G +Y
Sbjct: 128 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP+  +            
Sbjct: 188 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   +LH   PV+ GEKW ATKW+  R    P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 285


>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
 gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
          Length = 144

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/155 (40%), Positives = 83/155 (53%), Gaps = 18/155 (11%)

Query: 22  FLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGH 81
           +L   +DE+V  I  R+ A++ L     E +Q+++Y  G  YEPH+DF RDK      G+
Sbjct: 5   WLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGN 64

Query: 82  RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA 141
           RIAT L YLS VE GG TVF                 R G  V P KGDA  +++L    
Sbjct: 65  RIATFLSYLSDVEAGGGTVF----------------TRVGATVWPQKGDAAFWYNLKRSG 108

Query: 142 STDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             DS++ H +CPV+ G KW A KWIH   + F KP
Sbjct: 109 DGDSSTRHAACPVLVGSKWVANKWIHEVGQEFLKP 143


>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
          Length = 244

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 68/178 (38%), Positives = 90/178 (50%), Gaps = 14/178 (7%)

Query: 3   ADNESGK-SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
            D  +G+  +  EVRTS   +L   +  IVA I  R+     +P    E MQ+L Y   Q
Sbjct: 61  GDQSNGEEKVKDEVRTSETAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQ 120

Query: 62  KYEPHFDFFRDKM---NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS---QSRDGNWS 115
            Y  H+DFF  KM       G +R+ TV  YL+ VEKGGET+FP    S     +  +W 
Sbjct: 121 HYHVHYDFFDPKMYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPFGNTSAEEHHKIQSWG 180

Query: 116 EC---ARRGYAVKPMKGDALLFFSLHPDAST----DSTSLHGSCPVIEGEKWSATKWI 166
            C         VKP++G A++F+ + P   T    D TSLHG C  I GEKW+A  WI
Sbjct: 181 PCENAVESSIKVKPVRGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWI 238


>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
 gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
          Length = 232

 Score =  113 bits (283), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 226


>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
 gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
          Length = 232

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +++ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLE--DNKLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 216 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 275

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 276 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 322

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 323 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 378


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSKGAFLD--DNELTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
 gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
          Length = 216

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSKGAFLD--DNELTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score =  112 bits (281), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 59/175 (33%), Positives = 93/175 (53%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  SG+ + S+ R S GMF    ++++VA ++ R++A   LP ENGE + +L+Y  G 
Sbjct: 130 LVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRLSALMNLPLENGEGLHLLYYPTGA 189

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
             EPH D+       ++ +    G R++T++ YL+   +GG+TVFP              
Sbjct: 190 GSEPHHDYLAPTNAANRESIARSGQRVSTLVTYLNDAPEGGQTVFP-------------- 235

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             + G AV P++G+A  F     +   D+ SLH S PV  G+KW  TKW+  R F
Sbjct: 236 --QLGLAVSPIRGNACYFEYCDGNGRVDARSLHASAPVTRGDKWVMTKWMRERRF 288


>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 216

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 58  DNANGEHLVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 117

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 118 RPHYDYFDPDAVGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 166

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 167 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 215


>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
 gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
          Length = 216

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+   IE RI++ T +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D   +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210


>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 296

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 138 DNANGEHVVHAARTSDSMCLRVGQDALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEY 197

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 198 RPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 246

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 247 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 295


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E  + IE RI++ T +P  +GE + IL+Y   Q+Y+ H+D+F 
Sbjct: 113 VNDIRTSSGAFLE--ENEFTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFA 170

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 171 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 213

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 214 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 248


>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 296

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 138 DNANGEHVVHAARTSDSMCLRVGQDALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEY 197

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 198 RPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 246

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 247 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 295


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSKGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSKGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 90/171 (52%), Gaps = 21/171 (12%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E+GK      RTS G++  + +D  +  ++ RI++    P ENGE +QILHY    +Y P
Sbjct: 140 ETGKEDVIRNRTSEGIWYQRGEDAFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRP 199

Query: 66  HFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           HFD+F        ++   GG R+AT+++YL+ V  GGET+FP +                
Sbjct: 200 HFDYFPPDQPGSAVHTAQGGQRVATLVIYLNDVPDGGETIFPEA---------------- 243

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           G +V   +G A+ F  ++     D  +LHG  PV+ G+KW  TKW+  R +
Sbjct: 244 GISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVLGGDKWIMTKWMRERAY 294


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 90/160 (56%), Gaps = 19/160 (11%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            S++RTSS  FL    DE+   IE R+A    +P E+GE + ILHY+ GQ+Y+ H D+FR
Sbjct: 70  VSDIRTSSSAFL--PDDELTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFR 127

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
              ++     RI+T+++YL+ VE+GGET FP   ++                V P KG A
Sbjct: 128 -STSRAAKNPRISTLVLYLNDVEEGGETYFPEMNLT----------------VSPHKGMA 170

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + F   + D + +  +LHG  PV  GEKW+AT W+  + +
Sbjct: 171 VYFEYFYNDPAINERTLHGGSPVTAGEKWAATMWVRRQQY 210


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTS G FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSKGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+   IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 91  VNDIRTSSGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG  PV +GEKW  T+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWITTQWV 226


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG   V +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWV 210


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL   ++E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   + D S +  +LHG   V +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWV 210


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFRD---KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R       Q+LG G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 306

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 148 DNANGEHMVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 207

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 208 RPHYDYFDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 256

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 257 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 305


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 368 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 427

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 428 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 474

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 475 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 530


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 285 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 344

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 345 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 391

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 392 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 447


>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 286

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 128 DNANGEHMVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 188 RPHYDYFDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 285


>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
           campestris pv. vesicatoria str. 85-10]
          Length = 296

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/176 (35%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 138 DNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 197

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 198 RPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 246

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ G+KW ATKW+  R    P
Sbjct: 247 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKWLRERAVRMP 295


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 17/152 (11%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           +RTSSGMF  ++++E+V  IE R++       E  E +QIL Y   Q+Y+ H D+F    
Sbjct: 78  IRTSSGMFFDESENELVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA- 136

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
           ++    +RI+T++MYL+ VE+GGET FP                + G +V P KG A+ F
Sbjct: 137 SKASKNNRISTLVMYLNDVEEGGETYFP----------------KLGLSVSPTKGMAVYF 180

Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
              + DA  +  +LHG  PVI+GEKW AT+W+
Sbjct: 181 EYFYSDAELNDRTLHGGAPVIKGEKWVATQWM 212


>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
 gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
          Length = 216

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 91/155 (58%), Gaps = 19/155 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            +++RTSSG FL    +E+ + IE RI++   +P  +GE + IL+YE  Q+Y+ H+D+F 
Sbjct: 75  VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGET FP   +S                V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           + F   +   S +  +LHG  PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQGQSLNELTLHGGAPVTKGEKWIATQWV 210


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 320 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 379

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 380 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 426

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 427 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 482


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 324 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 383

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 384 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 430

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 431 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 486


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 427 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 474 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 357 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 416

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 417 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 463

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 464 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 519


>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
 gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
          Length = 286

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/176 (35%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 128 DNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 188 RPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ G+KW ATKW+  R    P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKWLRERAVRMP 285


>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 286

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/176 (35%), Positives = 90/176 (51%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN +G+ +    RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y
Sbjct: 128 DNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 187

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T FP++ +            
Sbjct: 188 RPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   SLH   PV+ G+KW ATKW+  R    P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKWLRERAVRMP 285


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 474 ---GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 88/152 (57%), Gaps = 17/152 (11%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           +RTSSGMF  ++++E+V  IE R++       E  E +QIL Y   Q+Y+ H D+F    
Sbjct: 78  IRTSSGMFFEESENELVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA- 136

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
           ++    +RI+T++MYL+ VE+GGET FP                + G ++ P KG A+ F
Sbjct: 137 SKASKNNRISTLVMYLNDVEEGGETYFP----------------KLGLSISPTKGMAVYF 180

Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
              + DA  +  +LHG  PVI+GEKW AT+W+
Sbjct: 181 EYFYSDAELNDRTLHGGAPVIKGEKWVATQWM 212


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  +G+      RTS G++  + +D  +  ++ RIA+    P ENGE +QILHY    
Sbjct: 141 IVDPATGQEGVIRNRTSEGIWYQRGEDAFIERLDRRIASLMNWPVENGEGLQILHYGPTG 200

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PHFD+F        ++   GG R+AT+++YL+ V  GGET+FP +            
Sbjct: 201 EYRPHFDYFPPDQPGSAVHTARGGQRVATLVVYLNDVADGGETIFPAA------------ 248

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V   +G A+ F  ++     D  +LHG  PV  G+KW  TKW+  R +
Sbjct: 249 ----GLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWIMTKWMRERAY 299


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 347 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 406

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 407 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 453

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 454 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 509


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/162 (39%), Positives = 84/162 (51%), Gaps = 18/162 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+    E R S+  +L    D IV  I  RI   T +  E  EA+QI +Y  G  YEPHF
Sbjct: 372 GRFQPVEFRISTAAWLQPDHDAIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHF 431

Query: 68  DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
           D      N    G R+AT ++YL+ V++GG T FP                R G AV+P 
Sbjct: 432 DHSSRGTNPD--GERLATFMIYLNPVKQGGFTAFP----------------RLGAAVQPG 473

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            GDA+ +++L P    D  +LHG+CPV+ G KW A KWIH R
Sbjct: 474 YGDAVFWYNLQPSGVGDPLTLHGACPVLRGSKWVANKWIHER 515


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/168 (35%), Positives = 87/168 (51%), Gaps = 18/168 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+ + +  R S   +L   +  +VA +  R+A  T L     E +Q+++Y  G 
Sbjct: 52  VHDPATGELVPAHYRISKSAWLKDEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGG 111

Query: 62  KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            Y+PHFDF R + N  ++  G+RIATVL Y+S V +GG TVF                  
Sbjct: 112 HYDPHFDFARKEENAFEKFNGNRIATVLFYMSDVAQGGATVF----------------TE 155

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            G +V P +G A+ + +LHP    D  + H +CPV+ G KW   KWIH
Sbjct: 156 LGLSVFPRRGSAVFWLNLHPSGEGDLATRHAACPVLRGSKWVCNKWIH 203


>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
 gi|255628535|gb|ACU14612.1| unknown [Glycine max]
          Length = 238

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/118 (49%), Positives = 80/118 (67%), Gaps = 3/118 (2%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D ++GK I S+VRTSSGMFL+  + +  +V +IE RI+ ++ +P ENGE MQ+L YE 
Sbjct: 118 VVDTKTGKGIKSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEK 177

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSE 116
            Q Y+PH D+F D  N + GG RIAT+LMYLS   + GET FP +  V+ +  GN S+
Sbjct: 178 NQYYKPHHDYFSDTFNLKRGGQRIATMLMYLSDNIERGETYFPLAGSVNAAVVGNLSK 235


>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 288

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 86/176 (48%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN  G  I    RTS  M L   QD +   IEARIA     P E+GE +Q+L Y  G +Y
Sbjct: 130 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 189

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T  P+  +            
Sbjct: 190 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLD----------- 238

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   +LH   PV+ GEKW ATKW+  R    P
Sbjct: 239 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 287


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 474 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  +G+      RTS G++  + +D  +  ++ RIA+    P ENGE +QILHY    
Sbjct: 148 IVDPATGQEDVIRNRTSEGIWYQRGEDAFIERLDQRIASLMNWPVENGEGLQILHYGPTG 207

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PHFD+F        ++   GG R+AT+++YL+ V  GGET+FP +            
Sbjct: 208 EYRPHFDYFPPDQPGSMVHTARGGQRVATLVIYLNDVPDGGETIFPEA------------ 255

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V   +G A+ F  ++     D  +LHG  PV  G+KW  TKW+  R +
Sbjct: 256 ----GLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWIMTKWMRERAY 306


>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
 gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/173 (38%), Positives = 89/173 (51%), Gaps = 21/173 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +GK   +  R S   +L   +DE+V  I  R+ A++ L     E +Q+++Y  G  YEPH
Sbjct: 367 TGKLEFANYRISKSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPH 426

Query: 67  FDFFRD---KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +DF RD   K      G+RIAT L YLS VE GG TVF                 R G  
Sbjct: 427 YDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVF----------------TRVGAT 470

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
           V P KGDA  +++L      DS++ H +CPV+ G KW A KWIH   + F KP
Sbjct: 471 VWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQEFRKP 523


>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 308

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/176 (36%), Positives = 86/176 (48%), Gaps = 23/176 (13%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN  G  I    RTS  M L   QD +   IEARIA     P E+GE +Q+L Y  G +Y
Sbjct: 150 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 209

Query: 64  EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PH+D+F        +  Q GG R+A+++MYL+  E+GG T  P+  +            
Sbjct: 210 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLD----------- 258

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
                V  +KG+A+ F    P   T   +LH   PV+ GEKW ATKW+  R    P
Sbjct: 259 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 307


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 474 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 88/152 (57%), Gaps = 17/152 (11%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
           +RTSSGMF  ++++E+V  IE R++       E  E +Q+L Y   Q+Y+ H D+F    
Sbjct: 78  IRTSSGMFFEESENELVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSA- 136

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
           ++    +RI+T++MYL+ VE+GGET FP                + G +V P KG A+ F
Sbjct: 137 SKASKNNRISTLVMYLNDVEEGGETYFP----------------KLGLSVSPTKGMAVYF 180

Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
              + DA  +  +LHG  PVI+GEKW AT+W+
Sbjct: 181 EYFYSDAELNDRTLHGGAPVIKGEKWVATQWM 212


>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 285

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 65/174 (37%), Positives = 87/174 (50%), Gaps = 25/174 (14%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E G     E RTS GMF    +  ++  IEARIAA   +P ++GE +Q+LHY  GQ+YEP
Sbjct: 128 EDGAQQIDEHRTSDGMFFGLGEQPLIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEP 187

Query: 66  HFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           H D+F             GG RIA++++YL+  + GG T FP                  
Sbjct: 188 HQDWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFPEI---------------- 231

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
           G  V  ++G A+ F       S D  SLH   PV  GEKW ATKW+  R + +P
Sbjct: 232 GLTVTALRGSAVCFTY----ESGDVFSLHAGLPVTRGEKWIATKWLRERPYREP 281


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP+              
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 21/175 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  +G+      RTS G++  + +D  +  ++ RIA+    P ENGE +QILHY    
Sbjct: 141 IVDPATGQEGVIRNRTSEGIWYQRGEDAFIERLDQRIASLMNWPVENGEGLQILHYGPTG 200

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PHFD+F        ++   GG R+AT+++YL+ V  GGET+FP +            
Sbjct: 201 EYRPHFDYFPPDQPGSAVHTARGGQRVATLVVYLNDVADGGETIFPAA------------ 248

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
               G +V   +G A+ F  ++     D  +LHG  PV  G+KW  TKW+  R +
Sbjct: 249 ----GLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVHAGDKWIMTKWMRERAY 299


>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 483

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 69/176 (39%), Positives = 98/176 (55%), Gaps = 14/176 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D + GK  +SE RTS   FLS   DE++  I+ R+A+ T +P  + E +Q+L Y  G+
Sbjct: 300 LKDADKGKD-SSEWRTSQSAFLSARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGAGE 358

Query: 62  KYEPHFDFF------RDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD 111
           KY+ H D+F       DK   +L      +R ATV  YL+ V  GGET+FP    + +  
Sbjct: 359 KYDSHHDYFDPSAYRSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAPAPR 418

Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE-KWSATKWI 166
            +  +C+  G  VKP KG  ++F+SL      D  SLHG+CPV E   KW+A KWI
Sbjct: 419 SH-KDCS-IGLKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 91/160 (56%), Gaps = 19/160 (11%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            + +RTSSG+F    Q E +  IE RI+    +P E+G+ +Q+L Y  GQ+Y+PH+DFF 
Sbjct: 72  VNSIRTSSGVFCE--QTETITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA 129

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGETVFP   +S                V P KG A
Sbjct: 130 ET-SRASTNNRISTLVMYLNDVEQGGETVFPLLHLS----------------VFPTKGMA 172

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + F   + +   +  +LH    VI GEKW AT W+  ++F
Sbjct: 173 VYFEYFYRNQEVNEFTLHAGAQVIHGEKWVATMWMRRQSF 212


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 75  VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 134

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 135 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 181

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 182 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 237


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L P    D ++ H +CPV+ G KW   KW+H R   F +P
Sbjct: 474 ---GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVFNKWLHERGQEFRRP 529


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 21/170 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +GK      RTS G++  + +D  +  ++ RI++    P ENGE +QILHY    +Y PH
Sbjct: 138 TGKEDVIRNRTSEGIWYQRGEDPFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPH 197

Query: 67  FDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           FD+F        ++   GG R+AT+++YL+ V  GGET+FP +                G
Sbjct: 198 FDYFPPDQPGSAVHTAQGGQRVATLVIYLNDVPDGGETIFPEA----------------G 241

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            +V   +G A+ F  ++     D  +LHG  PV+ G+KW  TKW+  R +
Sbjct: 242 MSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVLAGDKWIMTKWMRERAY 291


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D +SG    +  R S   +L   +D I+A +  RI   T L  +  E +Q+ +Y  G 
Sbjct: 369 VRDPKSGVLTTASYRVSKSAWLEGEEDPIIARVNQRIEDLTGLTVKTAELLQVANYGVGG 428

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 429 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 475

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 476 ---GAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKWVSNKWIHERGQEFRRP 531


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 133 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 192

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 193 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 239

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 240 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 295


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 91/160 (56%), Gaps = 19/160 (11%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            + +RTSSG+F    Q E +  IE RI+    +P E+G+ +Q+L Y  GQ+Y+PH+DFF 
Sbjct: 77  VNSIRTSSGVFCE--QTETITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA 134

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           +  ++    +RI+T++MYL+ VE+GGETVFP   +S                V P KG A
Sbjct: 135 ET-SRASTNNRISTLVMYLNDVEQGGETVFPLLHLS----------------VFPTKGMA 177

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           + F   + +   +  +LH    VI GEKW AT W+  ++F
Sbjct: 178 VYFEYFYSNQELNDFTLHAGTQVIHGEKWVATMWMRRQSF 217


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 210 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 269

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 270 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 316

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 317 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 372


>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
 gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
          Length = 540

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 91/173 (52%), Gaps = 20/173 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +SG S  +E+RTS   +L    +  +A I+ R+   T L  E+ E +Q+++Y  G +YEP
Sbjct: 368 QSGNSTTTEIRTSQNTWLWYDANPWLAKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 427

Query: 66  HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           HFDF  D   +  G  G+R+AT L YL+ V  GG T FP   +                A
Sbjct: 428 HFDFMEDDGQKVFGWKGNRLATALFYLNDVALGGATAFPFLRL----------------A 471

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
           V P+KG  L++++LH     D  + H  CPV++G KW   +W HV  + F +P
Sbjct: 472 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 524


>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
 gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
          Length = 284

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 65/174 (37%), Positives = 90/174 (51%), Gaps = 25/174 (14%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G +   + RTS GMF +  +  +V  IE R+A    +P  +GE +QILHY  GQ+YEPHF
Sbjct: 130 GSNQVDQRRTSEGMFFTLNELPLVGRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHF 189

Query: 68  DFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           D+F  +         +GG R+A+V+MYL+   +GG T FP   ++ +        ARRG 
Sbjct: 190 DWFDPQQPGYDTITAVGGQRVASVVMYLNTPAQGGGTAFPELGLTVT--------ARRGA 241

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
           AV         +F+       D  SLH   PV  GEKW ATKW+  R +    K
Sbjct: 242 AV---------YFAYE---GGDQQSLHAGLPVQRGEKWIATKWLRERPYGHSHK 283


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 66/179 (36%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 333 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 392

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV           
Sbjct: 393 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 439

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 440 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 495


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 319 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 378

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 379 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 425

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 426 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 481


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
 gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
          Length = 495

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/166 (38%), Positives = 85/166 (51%), Gaps = 21/166 (12%)

Query: 7   SGKSIASEVRTSSGMFLS-KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +G    +  R S   +LS +   E++  +E RIAA T L  E  E  Q+ +Y    +Y+P
Sbjct: 330 TGHLETAHYRISKNCWLSGREHGEVIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDP 389

Query: 66  HFDFFRDKMNQQLG----GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           HFDF RD  N  LG    G+RIATVL+++S VE GG TVFP                  G
Sbjct: 390 HFDFSRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPYV----------------G 433

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
             + P KGDA+ + +L      D  + H  CPV+ G KW A KWIH
Sbjct: 434 ARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIH 479


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 89/172 (51%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GK   ++ R S   +L+  +  ++ +I  RI   T L  +  E +Q+ +Y  G 
Sbjct: 426 VHDPQTGKLTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGG 485

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP+              
Sbjct: 486 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPDV------------- 532

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G AV P KG A+ +++L      D ++ H +CPV+ G KW + KWIH R
Sbjct: 533 ---GAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWIHER 581


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
 gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
          Length = 281

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/161 (39%), Positives = 82/161 (50%), Gaps = 20/161 (12%)

Query: 11  IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPHFD 68
           + S +R S   +L    DEIVA +  RI   T L   P + E +Q+L+Y  G +YEPH D
Sbjct: 121 VESHIRISQQAWLHDKDDEIVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHD 180

Query: 69  FF--RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           +    +KM   + G+R+AT LMYLS V  GG TVFP + V+                V  
Sbjct: 181 YMTAEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVANVT----------------VPV 224

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           +K   LLF  L      D  SLH  CPV+ G KW A KWIH
Sbjct: 225 VKNAGLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIH 265


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP               
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEV------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 90/174 (51%), Gaps = 21/174 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G     E RTS G      +  ++  IE  IAA T +  E GE +QIL+Y+ G 
Sbjct: 162 VINPDTGDENLIEARTSLGAMFQVGEHPLIERIEDCIAAVTGIAAERGEGLQILNYKPGG 221

Query: 62  KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y+PH+DFF   R    +QL  GG R+ T+++YL+    GG T FP              
Sbjct: 222 EYQPHYDFFNPQRPGEARQLKVGGQRVGTLVIYLNSPLAGGATAFP-------------- 267

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
             + G  V P+KG+A+ F     D + D  +LH   PV  GEKW ATKW++ R 
Sbjct: 268 --KLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKWIATKWLNART 319


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L   +D ++  +  RI A T L  E  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTTANYRVSKSAWLEGEEDPVIDRVNQRIEAITGLTVETAELLQVANYGVGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 427 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G A+ P KG ++ +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 474 ---GAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRP 529


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D ++  + +R+ A T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVANYRVSKSAWLEEYDDPVIGRVNSRMQAITGLTKDTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSNLKTEGNRLATYLNYMSDVEAGGATVFPDF--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 471 -GAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHER 519


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/161 (39%), Positives = 86/161 (53%), Gaps = 20/161 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           R S   +L   + + +A +  R+A  T L     E  Q+++Y  G  YEPHFDF +  ++
Sbjct: 365 RISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF-QSTVD 423

Query: 76  QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFF 135
             +G  RI TVL YLS VE+GG TVFP  +VS                V P KG A+++F
Sbjct: 424 PAIGS-RIETVLFYLSDVEQGGATVFPEIQVS----------------VWPQKGSAVVWF 466

Query: 136 SLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           +LHP    D  + H  CPV+ G KW ATKWIH R   F +P
Sbjct: 467 NLHPSGDGDQRTKHAGCPVLIGSKWIATKWIHERGQEFLRP 507


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/161 (39%), Positives = 86/161 (53%), Gaps = 20/161 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           R S   +L   + + +A +  R+A  T L     E  Q+++Y  G  YEPHFDF +  ++
Sbjct: 371 RISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF-QSTVD 429

Query: 76  QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFF 135
             +G  RI TVL YLS VE+GG TVFP  +VS                V P KG A+++F
Sbjct: 430 PAIGS-RIETVLFYLSDVEQGGATVFPEIQVS----------------VWPQKGSAVVWF 472

Query: 136 SLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           +LHP    D  + H  CPV+ G KW ATKWIH R   F +P
Sbjct: 473 NLHPSGDGDQRTKHAGCPVLIGSKWIATKWIHERGQEFLRP 513


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/173 (35%), Positives = 92/173 (53%), Gaps = 22/173 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           SG++   + RTS   +   + + +   + ARIA  T       E +Q+++Y  G  Y+ H
Sbjct: 364 SGRNEVVKTRTSKVAWFPDSYNPLTVRLNARIADMTGFNLYGSEMLQLMNYGLGGHYDQH 423

Query: 67  FDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +DFF + +N  L    G RIATVL YL+ VE+GG TVFPN               R+  A
Sbjct: 424 YDFF-NTINSNLTAMSGDRIATVLFYLTDVEQGGATVFPN--------------IRK--A 466

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           V P +G  +++++L  +  TD+ +LH +CPVI G KW   KWI  R   F +P
Sbjct: 467 VFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKWVCNKWIREREQIFSRP 519


>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 244

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/117 (45%), Positives = 76/117 (64%), Gaps = 2/117 (1%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V D  +GK + S+VRTSSGMF++  + +  +V +IE RI+ ++ +P ENGE +Q+L YE 
Sbjct: 94  VVDVATGKGVKSDVRTSSGMFVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEA 153

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
            Q Y PH D+F D  N + GG R+AT+LMYL+    GGET FP    S + +  WS+
Sbjct: 154 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVVGGETHFPQEMESAAVEETWSK 210


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 97/181 (53%), Gaps = 21/181 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           +A+ ++GK+  S+ R S   +        + +I  R+A  T L  +  E +Q+++Y  G 
Sbjct: 359 IANQQTGKAERSKDRVSKSSWFPDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGG 418

Query: 62  KYEPHFDFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           +Y+PHFDFF   K+ +    +RIATVL Y+S V  GG TVFP                + 
Sbjct: 419 QYDPHFDFFHWGKLKEV---NRIATVLFYMSDVSIGGATVFP----------------KL 459

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK-EPE 179
           G  ++  KG A  +++LH     D ++LHG+CPV+ GEKW A KWI  R  +   K +P+
Sbjct: 460 GVTLEARKGTAAFWYNLHSSGELDYSTLHGACPVLIGEKWVANKWIRERGQEFRRKCDPK 519

Query: 180 D 180
           D
Sbjct: 520 D 520


>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
          Length = 212

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/175 (37%), Positives = 96/175 (54%), Gaps = 24/175 (13%)

Query: 1   MVADNESGKSIASEV---RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY 57
           M+  +  G+S + EV   RTS   +L+    E+V  +  R    T L  ++ E++Q+ +Y
Sbjct: 37  MLKRSMVGESFSKEVSNERTSQNAWLADYDFELVKVLSLRTEDMTGLDRKSYESLQVNNY 96

Query: 58  EHGQKYEPHFDFFRDKMNQQ----LG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
             G  Y PHFD+ R    ++    +G G+RIAT++ YLS VE+GG TVFP          
Sbjct: 97  GIGGFYLPHFDWVRTNGTEEPYKDMGLGNRIATLMYYLSDVEQGGATVFP---------- 146

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 + G  V P KG A+ +++L PD + D  +LHG+CPV+ G KW A KWIH
Sbjct: 147 ------QIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWVANKWIH 195


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 22/169 (13%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
            ++ R +   +LS  +D +VA +  RI   T L     E +Q+ +Y  G +YEPHFDF R
Sbjct: 367 TAQYRITKSAWLSGYEDPVVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR 426

Query: 72  ----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
               D   +   G+R+AT L Y+S VE GG TVFP                  G AV P 
Sbjct: 427 KYEPDAFKKLGTGNRVATWLFYMSDVEAGGATVFPEV----------------GAAVYPK 470

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           KG A+ +++L      D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 471 KGTAVFWYNLLESGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 519


>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
 gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
          Length = 487

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 93/185 (50%), Gaps = 22/185 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++ ++  R A  T L  E+ E +Q+++Y  G 
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTHEDRVIGTVVQRTADMTGLDMESAEELQVVNYGIGG 372

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 373 HYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 421

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPE 175
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P 
Sbjct: 422 -----ALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRPC 476

Query: 176 KEPED 180
              ED
Sbjct: 477 DLEED 481


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 85/165 (51%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +L+  +  +V  I  RI   T L  +  E +Q+ +Y  G +YEPHFDF R    
Sbjct: 386 RISKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEP 445

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G AVKP+KG A
Sbjct: 446 DAFKELGTGNRIATWLFYMSDVAAGGATVFPEV----------------GAAVKPLKGTA 489

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           + +++L P    D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 490 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 534


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 92/167 (55%), Gaps = 17/167 (10%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           M  + +  + +    RTS+ ++L+  ++ ++  +E R+   T    EN E  Q+++Y  G
Sbjct: 415 MTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIG 474

Query: 61  QKYEPHFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
             Y+PH D F   ++  + GG RIATVL YLS V +GG T+FP   +S            
Sbjct: 475 GHYKPHTDHFETPQLEHRGGGDRIATVLFYLSDVPQGGATLFPRLNIS------------ 522

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
               V+P +GDALL+++L+     +  ++H SCP+I+G KW+  KWI
Sbjct: 523 ----VQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWALVKWI 565


>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
 gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
          Length = 534

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++E+GK   +  R S   +L       VA +  R+   T L     E++Q+++Y  G 
Sbjct: 364 VQNSETGKLEVAHYRISKSAWLEDVDHPYVAKVSQRVEDITGLNMATAESLQVVNYGIGG 423

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     Q LG G+RIAT+L Y+S V +GG TVFP  +VS      W   
Sbjct: 424 HYEPHFDFARKEEKNAFQSLGTGNRIATILFYMSDVSQGGATVFPGIKVSL-----W--- 475

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P KG A  +++L  +   D  + H +CPV+ G KW   KWIH R   F +P
Sbjct: 476 --------PKKGTAAFWYNLRKNGEGDYLTRHAACPVLTGSKWVCNKWIHERGQEFRRP 526


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/178 (32%), Positives = 95/178 (53%), Gaps = 19/178 (10%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           M  + +  + +    RTS+ ++L+  ++ ++  +E R+   T    EN E  Q+++Y  G
Sbjct: 352 MTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIG 411

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
             Y+PH D F    ++  GG RIATVL YLS V +GG T+FP   +S             
Sbjct: 412 GHYKPHTDHFETPQHRG-GGDRIATVLFYLSDVPQGGATLFPRLNIS------------- 457

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
              V+P +GDALL+++L+     +  ++H SCP+I+G KW+  KWI      +P + P
Sbjct: 458 ---VQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWALVKWID--ELSQPFRRP 510


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 92/177 (51%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G S  ++ R +   FL  ++ + +  +  RI   T L     E +Q+ +Y  G 
Sbjct: 372 VQNSLTGASEPTKYRIAKAAFLQNSEHDHIVKMTRRIGDVTGLDMTTAEELQVCNYGIGG 431

Query: 62  KYEPHFDFFRD-KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            YEPH+D  R  ++ +  G G+RIAT + Y+S VE GG TVFP   +             
Sbjct: 432 HYEPHYDHARKGEVQKDFGWGNRIATWMFYMSDVEAGGATVFPQINL------------- 478

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              A+ P KG A  +F+LHP+   D  + H +CPV+ G KW + KWIH RN  F +P
Sbjct: 479 ---ALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQEFRRP 532


>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
          Length = 173

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 69/189 (36%), Positives = 94/189 (49%), Gaps = 50/189 (26%)

Query: 9   KSIASEVRTSSGMFLSKAQDE---------------------------IVASIEARIAAW 41
           K I S+VRTSSGMFLS                                 + +IE RI+ +
Sbjct: 6   KGIQSDVRTSSGMFLSPDDSTYPIVRVFVVPPMEGFWNSCGLSNSLCLFLQAIEKRISVY 65

Query: 42  TFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVF 101
           + +P ENGE +Q                     N + GG R+AT+L+YLS   +GGET F
Sbjct: 66  SQVPVENGELIQF--------------------NLKRGGQRVATMLIYLSDNVEGGETYF 105

Query: 102 PNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWS 161
           P +     R G  S    RG +V P+KG+A+LF+S+  D  +D  S+HG C V+ GEKWS
Sbjct: 106 PMAGSGFCRCGGKSV---RGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWS 162

Query: 162 ATKWIHVRN 170
           ATKW+  R+
Sbjct: 163 ATKWMRQRS 171


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 55/170 (32%), Positives = 88/170 (51%), Gaps = 21/170 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +GK      RTS G++  + +D  +  ++ RI++    P ENGE +Q+L Y    +Y PH
Sbjct: 138 TGKEDVIRNRTSEGIWYQRGEDPFIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPH 197

Query: 67  FDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           FD+F        ++   GG R+AT+++YL+ V  GGET+FP +                G
Sbjct: 198 FDYFPPDQPGSTVHTAQGGQRVATLVIYLNDVPDGGETIFPEA----------------G 241

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
            +V   +G A+ F  ++     D  +LHG  PV+ G+KW  TKW+  R +
Sbjct: 242 MSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKWIMTKWMRERAY 291


>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 545

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   +  R S   +L   +  +V ++  R+   T L     E +Q+++Y  G 
Sbjct: 372 VQNYKTGELEVANYRISKSAWLKDEEHSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGG 431

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S V +GG TVFP+  V           
Sbjct: 432 HYEPHFDFARREEKNAFKSLGTGNRIATVLFYMSDVSQGGATVFPSIRV----------- 480

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A++P KG A  +++LH     D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 481 -----ALRPKKGTAAFWYNLHASGHGDYATRHAACPVLTGTKWVSNKWIHERGQEFLRP 534


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L    D ++  +  RI   T L  E  E +Q+ +Y  G 
Sbjct: 365 VRDPKTGVLTTAPYRVSKSAWLEGEDDPVIDRVNQRIQDITGLTVETAELLQVANYGVGG 424

Query: 62  KYEPHFDFFRDKM--NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R     N ++ G+R+AT L Y+S VE GG TVFP+                
Sbjct: 425 QYEPHFDFSRRPFDSNLKVDGNRLATFLNYMSDVEAGGATVFPDF--------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
            G ++ P KG A+ +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 470 -GASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRP 525


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 63/166 (37%), Positives = 89/166 (53%), Gaps = 24/166 (14%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           +  +++G  + S++RTS  +FL +     VA I  RIA  T L  E+ E + + +Y  G 
Sbjct: 333 LTRSQTGWGVISDIRTSQSVFLEEVG--TVARISQRIADITGLSVESAEKLHVQNYGIGG 390

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +Y PHFD   D++N+     R AT L+Y+S VE GG TVF N                 G
Sbjct: 391 RYTPHFDT-GDEVNE-----RTATFLIYMSDVEVGGATVFTNV----------------G 428

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            AVKP KG A+ +++LH +   D  + H  CPV+ G KW A KWIH
Sbjct: 429 VAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKWVANKWIH 474


>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 65/190 (34%), Positives = 99/190 (52%), Gaps = 25/190 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   +  R S   +L + + + V ++  R+   T +  E  E +Q+++Y  G 
Sbjct: 379 VQNYKTGELEIANYRISKSAWLQEHEHKHVRAVSQRVEHMTSMSIETAEELQVVNYGIGG 438

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S          
Sbjct: 439 HYEPHFDFARREEKNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTKINIS---------- 488

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP- 174
                 + P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P 
Sbjct: 489 ------LWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLTGSKWVANKWLHERGQEFHRPC 542

Query: 175 --EKEPEDDD 182
             E +P D D
Sbjct: 543 TLENQPADVD 552


>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
 gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
          Length = 547

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 22/185 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 373 VQNSVTGALETANYRISKSAWLKTEEDHVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 432

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 433 HYEPHFDFARKEEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 481

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPE 175
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P 
Sbjct: 482 -----ALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRPC 536

Query: 176 KEPED 180
              ED
Sbjct: 537 SMDED 541


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 87/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+   +  R S   +L   +  +V  I  RI   T L     E +Q+ +Y  G 
Sbjct: 362 VHDPQTGQLTTAPYRVSKSAWLGAFEHPVVDRINQRIEDITGLDVSTAEDLQVANYGVGG 421

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPH+DF R    D   +   G+RIAT L+Y+S V+ GG TVF +              
Sbjct: 422 QYEPHYDFGRKDEPDAFKELGTGNRIATWLLYMSEVQAGGATVFTDI------------- 468

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G +V P KG A+ +++LHP    D  + H +CPV+ G KW + KWIH R
Sbjct: 469 ---GASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLLGNKWVSNKWIHER 517


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/166 (37%), Positives = 89/166 (53%), Gaps = 24/166 (14%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           +  +++G  + S++RTS  +FL +     VA I  RIA  T L  E+ E + + +Y  G 
Sbjct: 351 LTRSQTGWGVISDIRTSQSVFLEEVG--TVARISQRIADITGLSVESAEKLHVQNYGIGG 408

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +Y PHFD   D++N+     R AT L+Y+S VE GG TVF N                 G
Sbjct: 409 RYTPHFDT-GDEVNE-----RTATFLIYMSDVEVGGATVFTNV----------------G 446

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            AVKP KG A+ +++LH +   D  + H  CPV+ G KW A KWIH
Sbjct: 447 VAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKWVANKWIH 492


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 436 HYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 538


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 436 HYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 538


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + + VA++  R+   T L  E  E +Q+++Y  G 
Sbjct: 373 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSLNVETAEELQVVNYGIGG 432

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S      W   
Sbjct: 433 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 484

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P KG A  +F+L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 485 --------PRKGSAAFWFNLKPNGEGDLRTRHAACPVLTGSKWVANKWLHERGQEFLRP 535


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 436 HYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 538


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L   +D ++  +  RI   T L  +  E +QI +Y  G 
Sbjct: 369 VRDPKTGVLTTANYRVSKSAWLEGEEDPVIERVNQRIEDITGLTTQTAELLQIANYGVGG 428

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 429 QYEPHFDFSRKDEPDAFKTLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 475

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 476 ---GAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWIHERGQEFRRP 531


>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
 gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++A +  R A  T L  E+ E +Q+++Y  G 
Sbjct: 375 VQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMESAEELQVVNYGIGG 434

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y PHFDF R +  +   G    +RIATVL Y+S VE+GG TVF     +  R   W   
Sbjct: 435 HYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVF-----TTLRTALW--- 486

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P +G A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 487 --------PKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKWVSNKWIHERGQEFRRP 537


>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
 gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
          Length = 550

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 89/172 (51%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D+++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTEEDQVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 435

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 436 HYEPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHA----------- 484

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R
Sbjct: 485 -----ALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKWVSNKWIHER 531


>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
 gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
          Length = 550

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 58/169 (34%), Positives = 87/169 (51%), Gaps = 18/169 (10%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D ++G+   +  R S   +L   +  ++A I  R+   T L   + E +Q+++Y  G
Sbjct: 366 VVHDPKTGELTPAHYRISKSSWLRDEESPVIARITQRVTDMTGLSMLHAEELQVVNYGIG 425

Query: 61  QKYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
             YEPHFDF R + N   + GG+RIATVL Y+S V +GG TVF                 
Sbjct: 426 GHYEPHFDFARKRENPFTKFGGNRIATVLFYMSDVAQGGATVF----------------T 469

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
             G ++ P+K  A  + +LH     D  + H +CPV+ G KW + KWIH
Sbjct: 470 ELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPVLRGSKWVSNKWIH 518


>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
 gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
          Length = 487

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +D ++A +  R A  T L  E+ E +Q+++Y  G 
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMESAEELQVVNYGIGG 372

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y PHFDF R +  +   G    +RIATVL Y+S VE+GG TVF     +  R   W   
Sbjct: 373 HYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVF-----TTLRTALW--- 424

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P +G A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 425 --------PKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKWVSNKWIHERGQEFRRP 475


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 86/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++GK   ++ R S   +L   +  IV  I  RI   T L     E +Q+ +Y  G 
Sbjct: 354 VHDPQTGKLTTAQYRVSKSAWLGSHEHPIVDRINQRIEDITGLDVSTAEDLQVANYGVGG 413

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L+Y+S V+ GG TVF +              
Sbjct: 414 QYEPHFDFGRKDEADAFEELGTGNRIATWLLYMSDVQAGGNTVFTDI------------- 460

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G  V P KG A+ +++LH     D  + H +CPV+ G KW + KWIH R
Sbjct: 461 ---GAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPVLVGNKWVSNKWIHER 509


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 81/158 (51%), Gaps = 20/158 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +VA I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 388 RISKSAWLSGYENPVVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEP 447

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 448 DAFKELGTGNRIATWLFYMSDVAAGGATVFPEV----------------GASVWPKKGTA 491

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + +++L P    D ++ H +CPV+ G KW + KWIH R
Sbjct: 492 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHER 529


>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 294

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/173 (35%), Positives = 82/173 (47%), Gaps = 18/173 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D       A+  R++    L  A  E+V  +EARI   T  P    E +Q+  Y  GQ
Sbjct: 125 VVDPHQDAVHAAHFRSNDSAQLPAAGSELVRRVEARIERLTGWPSAFCETLQLQRYAQGQ 184

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
            Y PH+DFF   M +  GG R+AT+++YL   E GG T F N                 G
Sbjct: 185 DYRPHYDFFGQDMVEAQGGQRLATLILYLRAPEAGGATYFAN----------------LG 228

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
             + P KG AL F   +PD   +S +LHG   V+ GEKW AT+W   R +  P
Sbjct: 229 MRIAPRKGSALFF--TYPDPGNNSGTLHGGEAVLAGEKWIATQWFRDRAWRHP 279


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 95/178 (53%), Gaps = 21/178 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + + G++   + RTS   +L+ + + +   +  RI+  T       E +Q+++Y  G 
Sbjct: 359 VFNQKMGRNTVVKTRTSKVTWLTDSLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGG 418

Query: 62  KYEPHFDFFRDKMNQ---QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            Y+ HFD+F   + +   +L G RIATVL YL+ VE+GG TVFPN  + Q          
Sbjct: 419 HYDLHFDYFNATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFPN--IKQ---------- 466

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI--HVRNFDKP 174
               A+ P KG A+++++L  +   D  +LH +CPVI G KW   KWI  H + F +P
Sbjct: 467 ----AIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQLFRRP 520


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + + VA++  R+   T +  E  E +Q+++Y  G 
Sbjct: 238 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSMSVETAEELQVVNYGIGG 297

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S          
Sbjct: 298 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINIS---------- 347

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 + P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 348 ------LWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFHRP 400


>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 152

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 84/164 (51%), Gaps = 23/164 (14%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           RTS  M L   QD +   IEARIA     P ++GE +Q+L Y  G +Y PH+D+F     
Sbjct: 6   RTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAA 65

Query: 72  -DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
              +  Q GG R+A+++MYL+  E+GG T FP++ +                 V  +KG+
Sbjct: 66  GTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------------VAAVKGN 109

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
           A+ F    P   T   SLH   PV+ GEKW ATKW+  R    P
Sbjct: 110 AVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRERAVRMP 151


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/168 (39%), Positives = 89/168 (52%), Gaps = 28/168 (16%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEH 59
           +  +++G  + SE+RTS  +FL    DE+  VA I  RIA  T L  E+ E + + +Y  
Sbjct: 78  LTRSQTGWGVISEIRTSQSVFL----DEVGTVARISQRIADITGLSVESAEKLHVQNYGI 133

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           G +Y PHFD   D +N+     R AT L+Y+S VE GG TVF N                
Sbjct: 134 GGRYTPHFDAGGD-VNE-----RTATFLIYMSDVEVGGATVFTNV--------------- 172

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            G AVKP KG A+ + +LH +   D  + H  CPV+ G KW A KWIH
Sbjct: 173 -GVAVKPEKGSAVFWNNLHKNGELDLKTKHAGCPVLVGNKWVANKWIH 219


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + + VA++  R+   T +  E  E +Q+++Y  G 
Sbjct: 299 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSMSIETAEELQVVNYGIGG 358

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S      W   
Sbjct: 359 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 410

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 411 --------PRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFHRP 461


>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
 gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
          Length = 502

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 100/194 (51%), Gaps = 26/194 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYEH 59
           + D +   ++    RTS+ +FL      +V  +  R+A  T L     + + +Q+++Y  
Sbjct: 317 IYDYDKEGNVPVNFRTSNSVFLLNNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGL 376

Query: 60  GQKYEPHFDFF-RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           G  Y  HFDFF +D+   +L G RI TVL+Y++ V++GG TVFP   ++           
Sbjct: 377 GGYYRYHFDFFGKDESPNKLLGDRIITVLIYMTDVQQGGATVFPALRITNF--------- 427

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP-- 174
                  P KG AL+F +L  + S D ++LH  CPV+ G KW+ATKWI+   + F KP  
Sbjct: 428 -------PKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWAATKWIYSAEQMFRKPCL 480

Query: 175 ---EKEPEDDDCVD 185
              E  P D   ++
Sbjct: 481 PQNELRPYDTHVIE 494


>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
           4-dioxygenase (proline 4-hydroxylase), alpha 1
           polypeptide [Ciona intestinalis]
          Length = 195

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/159 (36%), Positives = 87/159 (54%), Gaps = 21/159 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD--- 72
           R S   +L      ++  +  RI+  T L  E  E +QI +Y  G +YEPHFD+ R    
Sbjct: 36  RVSKSAWLKDEDHPVIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDF 95

Query: 73  -KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
            K + ++G +RIAT L Y+S+VE+GG TVF +                 G AV+P+KG A
Sbjct: 96  GKFDDEVG-NRIATFLTYMSNVEQGGSTVFLHP----------------GIAVRPIKGSA 138

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
           + +++L P  + D  + H +CPV+ G KW + KWIH R+
Sbjct: 139 VFWYNLLPSGAGDERTRHAACPVLTGVKWVSNKWIHERD 177


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 91/181 (50%), Gaps = 26/181 (14%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+S+  + R +   FL  ++  ++  +  R+   T L     E +Q+ +Y  G 
Sbjct: 359 VTDRDTGRSMPVQYRIAKAAFLKDSEHNLIVKMSRRVGDITGLDMAASEDLQVCNYGIGG 418

Query: 62  KYEPHFDFFRDKMNQQLG------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            Y PHFD+ R    +  G      G+RIAT L Y+S VE GG TVFP             
Sbjct: 419 HYVPHFDYARQ--GEIHGPRDLDWGNRIATWLFYMSDVEAGGATVFPAV----------- 465

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDK 173
                G A+ P KG A  +++L P+ + D  +LH  CPV+ G KW + KWIH R+  F +
Sbjct: 466 -----GAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQEFRR 520

Query: 174 P 174
           P
Sbjct: 521 P 521


>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
           domestica]
          Length = 534

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G  I    R S   +L +  D I+A +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGHLIVVSYRISKSSWLKEDDDPIIAQVNRRMQYITGLSVKTAELLQVSNYGMGG 426

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDF--------------- 471

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G A+ P KG ++ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 472 -GAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSKWVSNKWFHER 520


>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
          Length = 488

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/178 (38%), Positives = 95/178 (53%), Gaps = 18/178 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D + G+  AS+ RTS   F++   D I+  IE R A+ T +P  + E +Q+L Y   +
Sbjct: 306 LKDADKGRP-ASDWRTSQSTFVAAMGDPILRDIELRTASLTRVPVTHQEFVQVLRYGVTE 364

Query: 62  KYEPHFDFF------RDKMNQQL----GGHRIATVLMYLSHVEKGGETVFP-NSEVSQSR 110
           KY+ H DFF       D    QL      +R ATV  YL+ V +GGET FP +      R
Sbjct: 365 KYDAHHDFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFPRHGGAPPPR 424

Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE--KWSATKWI 166
           D  +S C   G  VKP KG  ++F+SL      D  SLHG+CPV+  E  KW+A KW+
Sbjct: 425 D--FSMCT--GLKVKPQKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 94/180 (52%), Gaps = 20/180 (11%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           ++ +N   +      RTS+ ++L+  ++ ++  +E R+   T    EN E  Q+++Y  G
Sbjct: 353 VMVNNLKVRPFIDSGRTSNSVWLASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIG 412

Query: 61  QKYEPHFDFFRDKM--NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
             Y+PH D F        + GG RIATVL YLS V +GG T+FP   +S           
Sbjct: 413 GHYKPHTDHFETPQAPEHRGGGDRIATVLFYLSDVPQGGATLFPRLNIS----------- 461

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
                V+P +GDALL+++L+     +  ++H SCP+I+G KW+  KWI      +P + P
Sbjct: 462 -----VQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWALVKWID--ELSQPFRRP 514


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 20/158 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +L+  +D +V  I  RI   T L  +  E +Q+ +Y  G +YEPHFDF R    
Sbjct: 395 RISKSAWLTAYEDPVVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 454

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP+                 G +V P KG A
Sbjct: 455 DAFKELGTGNRIATWLFYMSDVSAGGATVFPDV----------------GASVGPQKGTA 498

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + +++L      D ++ H +CPV+ G KW + KWIH R
Sbjct: 499 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHER 536


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 426

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 427 QYEPHFDFSRRPFDSTLKSEGNRLATFLNYMSDVEAGGATVFPDF--------------- 471

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 472 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 520


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + E VA++  R+   T +  +  E +Q+++Y  G 
Sbjct: 380 VQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 439

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S      W   
Sbjct: 440 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 491

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 492 --------PKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 542


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + E VA++  R+   T +  +  E +Q+++Y  G 
Sbjct: 380 VQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 439

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S      W   
Sbjct: 440 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 491

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                   P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 492 --------PKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 542


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 90/165 (54%), Gaps = 20/165 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           +S  RT+ G +L ++ + +   I  R+   + L  E  E MQ+++Y  G  Y PH D+F 
Sbjct: 357 SSPTRTAMGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWF- 415

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
              + ++ G+R+ATVL YL+ VE+GG T+F  +E                + V P +G A
Sbjct: 416 -TQHPEVMGNRLATVLFYLTDVEQGGATMFNKAE----------------HKVLPRRGTA 458

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           L +++LH D   D ++ H +CP+I G KW  T+WI  RN  F +P
Sbjct: 459 LFWYNLHTDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQIFIRP 503


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 427

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 428 QYEPHFDFSRRPFDSTLKSEGNRLATFLNYMSDVEAGGATVFPDF--------------- 472

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 473 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 521


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 89/172 (51%), Gaps = 20/172 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           SG++   + RTS   +     + +   + ARI+  T       E +Q+++Y  G  Y+ H
Sbjct: 364 SGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQH 423

Query: 67  FDFFRDKMNQQ--LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
           +DFF +  +    + G RIATVL YL+ VE+GG TVFPN               R+  AV
Sbjct: 424 YDFFNNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------IRK--AV 467

Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            P +G  +++++L  +   D+ +LH +CPVI G KW   KWI  R   F +P
Sbjct: 468 FPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 519


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 90/165 (54%), Gaps = 20/165 (12%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           +S  RT+ G +L ++ + +   I  R+   + L  E  E MQ+++Y  G  Y PH D+F 
Sbjct: 382 SSPTRTALGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWF- 440

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
              + ++ G+R+ATVL YL+ VE+GG T+F  +E                + V P +G A
Sbjct: 441 -TQHPEVMGNRLATVLFYLTDVEQGGATMFNKAE----------------HKVLPRRGTA 483

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           L +++LH D   D ++ H +CP+I G KW  T+WI  RN  F +P
Sbjct: 484 LFWYNLHTDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQIFIRP 528


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/158 (37%), Positives = 82/158 (51%), Gaps = 20/158 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +L+  +D +V  I  RI   T L  +  E +Q+ +Y  G +YEPHFDF R    
Sbjct: 390 RISKSAWLTAYEDPVVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEP 449

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L+Y+S V  GG TVF +                 G AV P KG A
Sbjct: 450 DAFKELGTGNRIATWLIYMSDVPSGGATVFTDV----------------GAAVWPKKGSA 493

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + +++L P    D ++ H +CPV+ G KW + KWIH R
Sbjct: 494 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHER 531


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 88/174 (50%), Gaps = 22/174 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G    +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G +YEPH
Sbjct: 372 TGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPH 431

Query: 67  FDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           FDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV              G 
Sbjct: 432 FDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GA 475

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 476 SVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 22/178 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V    SG++   + RTS   +     + +   + ARI+  T       E +Q+++Y  G 
Sbjct: 359 VYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGG 418

Query: 62  KYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            Y+ H+DFF +K N  +    G RIATVL YL+ VE+GG TVFPN               
Sbjct: 419 HYDQHYDFF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------I 463

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           R+  AV P +G  +++++L  +   D+ +LH +CPVI G KW   KWI  R   F +P
Sbjct: 464 RK--AVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 519


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 372 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQHITGLTVKTAELLQVANYGMGG 431

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 432 QYEPHFDFSRRPFDSTLKSEGNRLATFLNYMSDVEAGGATVFPDF--------------- 476

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 477 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 525


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 22/178 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V    SG++   + RTS   +     + +   + ARI+  T       E +Q+++Y  G 
Sbjct: 359 VYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGG 418

Query: 62  KYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            Y+ H+DFF +K N  +    G RIATVL YL+ VE+GG TVFPN               
Sbjct: 419 HYDQHYDFF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------I 463

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           R+  AV P +G  +++++L  +   D+ +LH +CPVI G KW   KWI  R   F +P
Sbjct: 464 RK--AVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 519


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 22/178 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V    SG++   + RTS   +     + +   + ARI+  T       E +Q+++Y  G 
Sbjct: 150 VYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGG 209

Query: 62  KYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            Y+ H+DFF +K N  +    G RIATVL YL+ VE+GG TVFPN               
Sbjct: 210 HYDQHYDFF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------I 254

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           R+  AV P +G  +++++L  +   D+ +LH +CPVI G KW   KWI  R   F +P
Sbjct: 255 RK--AVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 310


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 88/174 (50%), Gaps = 22/174 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G    +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G +YEPH
Sbjct: 372 TGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPH 431

Query: 67  FDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           FDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV              G 
Sbjct: 432 FDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GA 475

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 476 SVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 20  VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 79

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 80  QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 124

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 125 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 180


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 63/174 (36%), Positives = 88/174 (50%), Gaps = 22/174 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G    +  R S   +LS  +  +V+ I  RI   T L     E +Q+ +Y  G +YEPH
Sbjct: 372 TGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPH 431

Query: 67  FDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           FDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV              G 
Sbjct: 432 FDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GA 475

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           +V P KG A+ +++L P    D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 476 SVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529


>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
          Length = 538

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 86/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 369 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 428

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 429 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 475

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 476 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 524


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 64/165 (38%), Positives = 87/165 (52%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-- 73
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 352 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 411

Query: 74  -MNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
              Q+LG G+RIAT L Y+S V  GG TVFP  EV              G +V P KG A
Sbjct: 412 DAFQELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GASVWPKKGTA 455

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 456 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 500


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTTASYRVSKSSWLEETDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 22  VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 81

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 82  QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 126

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 127 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 182


>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Ornithorhynchus anatinus]
          Length = 888

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 719 VRDPKTGVLTVANYRVSKSSWLEEEDDPVVAQVNRRMQYITGLTVKTAELLQVANYGMGG 778

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 779 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 825

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 826 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 881


>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
          Length = 545

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   +  R S   +L   +   + +I  R+   T L     E +Q+++Y  G 
Sbjct: 372 VQNYKTGELEVANYRISKSAWLKDHEHPYIKAIGERVEDMTGLTMSTAEELQVVNYGIGG 431

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S V +GG TVFP+  +           
Sbjct: 432 HYEPHFDFARREETNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLRL----------- 480

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  +F+LH     D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 481 -----ALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLTGTKWVSNKWIHERGQEFRRP 534


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVANYRVSKSSWLEEEDDLVVARVNHRMEQITGLTTKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDITLKTEGNRLATFLNYMSDVEAGGATVFPDF--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 471 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 519


>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 3 [Oryctolagus
           cuniculus]
          Length = 535

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA I  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      ++LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNNERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
          Length = 538

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 86/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 370 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 429

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 430 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 476

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 477 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 525


>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 615

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 20/158 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +L++  D ++  I  RI   T L  +  E +Q+ +Y  G +YEPHFDF R    
Sbjct: 460 RISKSAWLTEYDDPMIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 519

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP+                 G AV P KG A
Sbjct: 520 DAFKELGTGNRIATWLFYMSDVSAGGATVFPDV----------------GAAVWPQKGTA 563

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + +++L      D ++ H +CPV+ G KW + KWIH R
Sbjct: 564 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHER 601


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/177 (35%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA I  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
 gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
          Length = 257

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 57/136 (41%), Positives = 80/136 (58%), Gaps = 5/136 (3%)

Query: 15  VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            RTSSG F+S +++   A   +E +IA  T +P  +GE+  IL YE GQKY+ H+D F  
Sbjct: 78  TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 137

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
                    RIA+ L+YLS VE+GGET+FP    S    G ++ +C   G  VKP KGD 
Sbjct: 138 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCI--GLKVKPRKGDG 195

Query: 132 LLFFSLHPDASTDSTS 147
           LLF+S+ P+ + D  +
Sbjct: 196 LLFYSVFPNGTIDQVN 211


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 93/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + + VA++  R+   T +  E  E +Q+++Y  G 
Sbjct: 238 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSMSVETAEELQVVNYGIGG 297

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +S          
Sbjct: 298 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINIS---------- 347

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 + P KG A  + +L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 348 ------LWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFHRP 400


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 426 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 485

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 486 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 529

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 530 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 586


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 85/165 (51%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-- 73
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 74  -MNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
              Q+LG G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFQELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score =  103 bits (256), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
 gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
          Length = 550

 Score =  103 bits (256), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 62/185 (33%), Positives = 92/185 (49%), Gaps = 22/185 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   +  ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTPEHRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 436 HYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPE 175
                A+ P KG A  + +LH D   D  + H +CPV+ G KW + KWIH R   F +P 
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRPC 539

Query: 176 KEPED 180
              ED
Sbjct: 540 SLEED 544


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 90/178 (50%), Gaps = 21/178 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + +S ++   + RTS   +L    +++   +  RI   T       E +Q+++Y  G 
Sbjct: 360 VFNQQSMRNHVVKTRTSKVTWLLDTLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGG 419

Query: 62  KYEPHFDFFRDKMN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            Y+ H+D+F   +     +L G RIATVL YL+ VE+GG TVFPN E             
Sbjct: 420 HYDKHYDYFNSSVAADLTRLNGDRIATVLFYLTDVEQGGATVFPNIE------------- 466

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
               AV P  G A+++++L  D + D  +LH +CPVI G KW   KWI  R   F +P
Sbjct: 467 ---KAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQVFRRP 521


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
          Length = 595

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 85/173 (49%), Gaps = 21/173 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +GK   +  RTS   +L    DE+   +  RI A T L  E  E +Q+ +Y  G  Y PH
Sbjct: 425 TGKLENAYYRTSKSAWLQDGLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYYAPH 484

Query: 67  FDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           FDF R +         G+RIAT++ YL+ V+ GG TVF                 R G +
Sbjct: 485 FDFGRKREKDAFEVENGNRIATIIFYLTDVKAGGATVF----------------NRFGAS 528

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           VKP++G A  +++LHP    D  + H +CPV+ G KW    W H R   F +P
Sbjct: 529 VKPVRGAAGFWYNLHPSGEGDLRTRHVACPVLVGSKWVMNVWFHERGQEFRRP 581


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/173 (34%), Positives = 87/173 (50%), Gaps = 22/173 (12%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           SG++     RTS   +       +   + ARI   T       E +Q+++Y  G  Y+ H
Sbjct: 367 SGRNEVVRTRTSKVAWFPDGYSPLTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQH 426

Query: 67  FDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D+F + +N  L    G RIATVL YL+ VE+GG TVFPN               R+  A
Sbjct: 427 YDYF-NTINSNLTAMSGDRIATVLFYLTDVEQGGATVFPN--------------IRK--A 469

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           V P +G  +++++L  D   D+ +LH +CPVI G KW   KWI  R   F +P
Sbjct: 470 VFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWVCNKWIREREQLFRRP 522


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/176 (34%), Positives = 87/176 (49%), Gaps = 25/176 (14%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +GK   ++ R S   FL   +   V  +  R+ A T L     E +Q+ +Y  G 
Sbjct: 377 VMNSATGKLETAKYRISKAAFLKNKEHHHVLKMSRRVGAITGLDMSTAEDLQVCNYGIGG 436

Query: 62  KYEPHFDFFRDKMNQQLG-------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW 114
            YEPHFD+ R   N+ +G        +RIAT L Y+S VE GG TVFP   V        
Sbjct: 437 HYEPHFDYARK--NETIGFNKDSGWRNRIATWLFYMSDVEAGGATVFPALNV-------- 486

Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
                   A+ P KG A  +++L P+   +  + H +CPV+ G KW A KWIH +N
Sbjct: 487 --------ALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKN 534


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 320 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 379

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 380 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 423

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 424 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 480


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 320 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 379

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 380 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 423

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 424 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 480


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGG 426

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 471

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 527


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 408 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 467

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 468 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 512

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 513 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 568


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 382 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEP 441

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 442 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 485

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           + +++L      D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 486 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 530


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 62/165 (37%), Positives = 85/165 (51%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP  EV              G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score =  102 bits (255), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 427

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 428 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 472

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score =  102 bits (255), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 427

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 428 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 471

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score =  102 bits (255), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score =  102 bits (255), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + + VA++  R+   T +  +  E +Q+++Y  G 
Sbjct: 238 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 297

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +           
Sbjct: 298 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINI----------- 346

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 347 -----ALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 400


>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 248

 Score =  102 bits (255), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 59/166 (35%), Positives = 87/166 (52%), Gaps = 17/166 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+ +A   R S   +  +    I+ S+   IA  T +P +  E +QILHY  G 
Sbjct: 93  VTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEGIAQLTGIPIDCQEPLQILHYRPGG 152

Query: 62  KYEPHFD-FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           +Y+PH+D F  D    + GG+R AT+++YL+ VE+GGET FP                  
Sbjct: 153 EYKPHYDAFAADAPTLRQGGNRQATLILYLNAVEEGGETAFPE----------------L 196

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           G  V P+ G  + F +L+ +      SLH   PV +GEKW AT+WI
Sbjct: 197 GLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEKWIATQWI 242


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score =  102 bits (255), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Cavia porcellus]
          Length = 535

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRSHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 506

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 337 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 396

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 397 QYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 443

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 444 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 499


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  +D +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
           melanoleuca]
          Length = 535

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
 gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
          Length = 487

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V +  +G    +  R S   +L  A+  ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 313 VQNAVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 372

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIATVL Y+S VE+GG TVF +              
Sbjct: 373 HYEPHFDFARREEIRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHA----------- 421

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
                 +KP KG A  + +LH     D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 422 -----VLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKWVSNKWIHERGQEFRRP 475


>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 251

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 87/162 (53%), Gaps = 24/162 (14%)

Query: 16  RTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
           R+  G+F+ + +++  +  +I  R+  +  L  E+ E MQ++ Y  G++   HFD+F   
Sbjct: 101 RSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLT-ESSEVMQVIRYNPGEETSAHFDYFNPL 159

Query: 72  ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
                M   L G RI T+LMYL+ VE+GGET FP   V                 VKP+K
Sbjct: 160 TTNGAMKIGLYGQRICTILMYLADVEEGGETSFPEVNVK----------------VKPIK 203

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
           GDA+LF++  P+   D  SLH   PVI+G KW A K ++ +N
Sbjct: 204 GDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVNQKN 245


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 87/171 (50%), Gaps = 20/171 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D ++A +  R+ A T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLSVANYRVSKSAWLEENDDPVIARVNLRMQAITGLTVDTAELLQVANYGMGG 427

Query: 62  KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 428 QYEPHFDFSRRPFDSNLKTDGNRLATFLNYMSDVEAGGATVFPDF--------------- 472

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
            G A+ P KG A+ +++L      D  + H +CPV+ G KW   KW H ++
Sbjct: 473 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWG--KWTHTQD 520


>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
           fasciculatum]
          Length = 244

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 84/159 (52%), Gaps = 24/159 (15%)

Query: 16  RTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
           R+  G+F+ + +++  +V  I  R+     L  EN E MQ++ Y  G++   H+D+F   
Sbjct: 70  RSGWGLFMKEGEEDHDVVKKIFQRMKMLVNLT-ENCEVMQVIRYHPGEETSAHYDYFNPL 128

Query: 72  ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
                M   L G R+ T+LMYLS VE+GGET FP                  G  VKP+K
Sbjct: 129 TTNGAMKIGLYGQRVCTILMYLSEVEEGGETSFP----------------EVGVKVKPVK 172

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           GDA+LF++  P+   D  SLH   PVI+G KW A K I+
Sbjct: 173 GDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWVAIKLIN 211


>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Ovis aries]
          Length = 535

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
           carolinensis]
          Length = 554

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 86/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 385 VRDPKTGVLTVANYRVSKSSWLEEEDDLVVAKVNQRMEHITGLTVKTAELLQVANYGMGG 444

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 445 QYEPHFDFSRKEEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 491

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 492 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 540


>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 506

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 337 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 396

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 397 QYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 443

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 444 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 499


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G    +  R S   +L + + + VA++  R+   T +  +  E +Q+++Y  G 
Sbjct: 360 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 419

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     + LG G+RIATVL Y+S VE+GG TVF    +           
Sbjct: 420 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINI----------- 468

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                A+ P KG A  +++L P+   D  + H +CPV+ G KW A KW+H R   F +P
Sbjct: 469 -----ALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 522


>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
 gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 535

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Loxodonta africana]
          Length = 536

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 427 QYEPHFDFSRSHEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 474 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 529


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 389 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 448

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 449 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 492

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 493 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 549


>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
          Length = 535

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQYITGLTVQTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      ++LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
           taurus]
 gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
          Length = 535

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
           musculus]
 gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
           musculus]
          Length = 537

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 427

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 428 QYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 474

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 475 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 530


>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
 gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
 gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
          Length = 537

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 427

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 428 QYEPHFDFSRSDDEDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 474

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 475 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 530


>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
           catus]
          Length = 535

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Cricetulus griseus]
          Length = 535

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 577

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 408 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 467

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 468 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 514

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 515 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 570


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
          Length = 535

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/178 (35%), Positives = 89/178 (50%), Gaps = 21/178 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G+   ++ R S   +L   +   VA I  R +A T L     E +QI +Y  G 
Sbjct: 367 VVNSVTGELEFAKYRISKSGWLKDEEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGG 426

Query: 62  KYEPHFDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            YEPHFD+ R           G+RI TV+ YLS VE GG TVF  +              
Sbjct: 427 HYEPHFDYSRLAEVTSFDHWRGNRILTVIFYLSDVEAGGGTVFMTA-------------- 472

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
             G  ++P KG A ++++LHPD + D  + H +CPV+ G KW A KW H R   F +P
Sbjct: 473 --GTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKWVANKWFHERGQEFTRP 528


>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
          Length = 535

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 427

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 428 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 472

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 337 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 396

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 397 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 441

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 442 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 497


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 321 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 380

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 381 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 425

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 426 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 481


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++++G+   +  R S   +L    D ++  +  RI  +T L     E +Q+ +Y  G 
Sbjct: 355 VQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGG 414

Query: 62  KYEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R +          G+RIATVL Y+S  E+GG TVF +              
Sbjct: 415 HYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHL------------- 461

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G AV P K DAL +++L  D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 462 ---GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQEFTRP 517


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 398 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 457

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 458 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 502

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 503 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 558


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D ++A +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQVANYGMGG 426

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDF--------------- 471

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            G  + P KG ++ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 472 -GATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHER 520


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/173 (34%), Positives = 91/173 (52%), Gaps = 20/173 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   + + V  +  R+   T L     E +Q+++Y  G 
Sbjct: 374 VQNSVTGNLEPANYRISKSAWLKSEEHDHVFKVTRRVGDVTGLDMATAEDLQVVNYGIGG 433

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFD+ R +++N  + LG G+R+AT L Y+S VE GG TVFP               
Sbjct: 434 HYEPHFDYARKEEVNAFKDLGWGNRVATWLFYMSEVEAGGATVFP--------------- 478

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
            +   A+ P KG A  +++LHP+   +  + H +CPV+ G KW + KWIH RN
Sbjct: 479 -KLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIHERN 530


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score =  102 bits (253), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 398 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 457

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 458 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 502

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 503 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 558


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 84/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP+                 G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPDV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
 gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
          Length = 540

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G S  SEVRTS   +L   Q   + +++ R+   T L  E+ E +Q+++Y  G  YEPH+
Sbjct: 366 GNSTVSEVRTSQNTWLWYEQQPWLKNLKLRLEDITGLGMESAEPLQLVNYGIGGHYEPHY 425

Query: 68  DFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DF  DK+      G+R+ T L+YL+ V  GG T FP  ++                AV P
Sbjct: 426 DFVEDKVTTFGWKGNRLLTALLYLNEVPMGGATAFPYLKL----------------AVPP 469

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
           +KG  L++++LH     D  + H  CPV+ G KW   +W H   + F +P
Sbjct: 470 VKGSLLVWYNLHRSLDPDFRTKHAGCPVLMGSKWVCNEWFHEGAQEFRRP 519


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 89/173 (51%), Gaps = 21/173 (12%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           DN SG +   + RTS    + + + E++A I+AR+AA +  P ++GE +Q+  Y+ G +Y
Sbjct: 124 DNASGINRFDDSRTSESAHIQRGETELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEY 183

Query: 64  EPHFDFFRDKM-----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            PHFD+F   +     + +  G R+AT+++YL+ VE+GG T FP                
Sbjct: 184 RPHFDWFDPALAGTAKHLEKSGQRLATIILYLTDVEEGGGTSFPGI-------------- 229

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
             G  V P KG AL F +  P    D  + H   PV +G K  A KW+  + +
Sbjct: 230 --GLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVEKGTKIIANKWLREKPY 280


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 371 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGG 430

Query: 62  KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           +YEPHFDF R   +  L   G+R+AT L Y+S VE GG TVFP+                
Sbjct: 431 QYEPHFDFSRRPFDSGLKTEGNRVATFLNYMSDVEAGGATVFPD---------------- 474

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 475 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 531


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/177 (35%), Positives = 87/177 (49%), Gaps = 20/177 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V   ESG+   S  R +   +L   + + V+ I  R+   T L     E +Q+ +Y  G 
Sbjct: 376 VQKKESGEREFSRYRIAKSAWLKHEEHDYVSDINFRVGDITGLDMATSEDLQVCNYGIGG 435

Query: 62  KYEPHFDFFRD-KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            YEPH+D+ R  ++ Q  G G RIAT L Y+S VE GG TVFP   +S            
Sbjct: 436 HYEPHYDYARKGEVQQDFGWGGRIATWLFYMSDVEAGGATVFPKLNLS------------ 483

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
               + P KG A  +F+L+P+   +  + H  CPV+ G KW A  WIH R   F +P
Sbjct: 484 ----LWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGSKWVANYWIHERGQEFRRP 536


>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 60/174 (34%), Positives = 87/174 (50%), Gaps = 23/174 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           MV D+   +   S+ RTSS  +L      +V ++  R    T L     E +Q+ +Y  G
Sbjct: 345 MVGDDH--EKAVSKTRTSSNAWLDDVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIG 402

Query: 61  QKYEPHFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
             Y PH+D+   +  +++      G+RIATV+ YLS V  GG TVFP             
Sbjct: 403 GHYLPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP------------- 449

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              + G  V P KG A+ +++LH + + D  +LHG+CPV  G KW   KWIH R
Sbjct: 450 ---QLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIHER 500


>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 196

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 86/166 (51%), Gaps = 17/166 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G+ +A   R S   +  +    I+ S+   IA  T +P +  E +QILHY  G 
Sbjct: 41  VTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLAEGIAQLTGIPIDCQEPLQILHYRPGG 100

Query: 62  KYEPHFD-FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           +Y+PH+D F  D    + GG+R  T+++YL+ VE+GGET FP                  
Sbjct: 101 EYKPHYDAFAADAPTLRQGGNRQGTLILYLNAVEEGGETAFPE----------------L 144

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           G  V P+ G  + F +L+ +      SLH   PV +GEKW AT+WI
Sbjct: 145 GLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEKWIATQWI 190


>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
           leucogenys]
          Length = 558

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 389 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 448

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 449 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 494

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 495 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 551


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 83/163 (50%), Gaps = 17/163 (10%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           GK+   + RTS   +   + + +   + ARI   T       E +Q+++Y  G  Y+ H+
Sbjct: 363 GKNEVVKTRTSKVAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHY 422

Query: 68  DFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFF   + +  L G RIATVL Y+S VE+GG TVFPN   +                V P
Sbjct: 423 DFFNATEKSSSLTGDRIATVLFYMSDVEQGGATVFPNIYKT----------------VYP 466

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            +G A+++++L  D   D  +LH +CPV+ G KW   KWI  R
Sbjct: 467 QRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKWVCNKWIRER 509


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 82/170 (48%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G+    + R S   +LS +  +IV  I  R+   T L    GE +Q+ +Y  G 
Sbjct: 356 VNNLETGEIEDVDYRISQIAWLSDSDGDIVRRINRRVGFITGLNTNTGECLQVNNYGVGG 415

Query: 62  KYEPHFDFFRDKMNQQLG----GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFD   D  N  +     G+RIAT + YLS VE GG TVF                
Sbjct: 416 HYEPHFDHSLDMENSPIASLGQGNRIATFMFYLSEVEAGGSTVF---------------- 459

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            + G    P KG A+ +++L      D  SLH  CPV+ G KW A KW+H
Sbjct: 460 IKTGVKTNPFKGGAVFWYNLKKSGEGDWDSLHAGCPVLIGNKWVANKWLH 509


>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
           boliviensis boliviensis]
 gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
           boliviensis boliviensis]
          Length = 535

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      + LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 88/172 (51%), Gaps = 19/172 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           ++G S  SE+RTS   +L    +  +A I+ R+   T L  +  E +Q+++Y  G +YEP
Sbjct: 378 QTGNSTVSEIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEP 437

Query: 66  HFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
           HFDF  D + N    G+R+ T L YL+ V  GG T FP   +                AV
Sbjct: 438 HFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGATAFPFLHL----------------AV 481

Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
            P+KG  L++++LH     D  + H  CPV++G KW   +W H   + F +P
Sbjct: 482 PPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQEFRRP 533


>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
           leucogenys]
          Length = 537

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 427

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      + LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 428 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 474

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 475 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 530


>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           troglodytes]
 gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
           troglodytes]
 gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
           paniscus]
 gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
           paniscus]
 gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 535

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
 gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
 gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
 gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
          Length = 535

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
           abelii]
          Length = 535

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF     RD       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + ++ +G    +E R S   +LS+   ++V  +  RI  +T L  +  E +Q+ +Y  G 
Sbjct: 363 IQNSVTGNLEFAEYRISKSAWLSEDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGG 422

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R    +       G+RIAT L Y+S VE GG TVFP               
Sbjct: 423 HYEPHFDFARKEEINAFKSLNTGNRIATFLFYMSDVEAGGATVFPQV------------- 469

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G  + P KG A  +++L  +   D ++ H +CPV+ G KW + KWIH R
Sbjct: 470 ---GARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLVGSKWVSNKWIHER 518


>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
 gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
 gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
           mulatta]
          Length = 535

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      + LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 426 QYEPHFDFSRNDERHTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 81/158 (51%), Gaps = 20/158 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +L+  +  ++  I  RI   T L  +  E +Q+ +Y  G +YEPHFDF R    
Sbjct: 393 RISKSAWLTGYEHPVIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 452

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP+                 G AV P KG A
Sbjct: 453 DAFKELGTGNRIATWLFYMSDVAAGGATVFPDV----------------GAAVWPQKGTA 496

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           + +++L  +   D ++ H +CPV+ G KW + KWIH R
Sbjct: 497 VFWYNLFANGEGDYSTRHAACPVLVGNKWVSNKWIHER 534


>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_f
           [Homo sapiens]
          Length = 567

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 398 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 457

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      + LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 458 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 504

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 505 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 560


>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
 gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
          Length = 487

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/179 (32%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L  A+  ++ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 372

Query: 62  KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +  +   G    +RIAT+L Y+S VE+GG TVF +              
Sbjct: 373 HYEPHFDFARREEKRAFEGLNLGNRIATMLFYMSDVEQGGATVFTSLHA----------- 421

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
                A+ P KG A  + +LH     D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 422 -----ALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKWVSNKWIHERGQEFRRP 475


>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 571

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 19/169 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + D  +GK   ++ R S   +LS  +   + ++EAR  A T L     E +Q+ +Y  G 
Sbjct: 403 IQDPITGKLRHADYRISKSAWLSTNKYNFLQALEARTQATTGLDLSYAEQLQVANYGLGG 462

Query: 62  KYEPHFDFFR---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
            YEPHFD  R   D+      G+RIATVL YLS VE GG TVF   +             
Sbjct: 463 HYEPHFDHSRENEDRFTDLGMGNRIATVLFYLSDVEAGGATVFTVGKT------------ 510

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
               AV P KGDA+ +F+L  +   +  + H +CPV+ G+KW +  WIH
Sbjct: 511 ----AVFPSKGDAVFWFNLKRNGKGNPNTRHAACPVLVGQKWVSNWWIH 555


>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
 gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
          Length = 487

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/187 (33%), Positives = 95/187 (50%), Gaps = 26/187 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G    +  R S   +L   + EI+ ++  R A  T L  ++ E +Q+++Y  G 
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTPEHEIIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 372

Query: 62  KYEPHFDFFRDKMNQQLG------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            YEPHFDF R +  ++L       G+RIAT+L Y+S V++GG TVF     +  R   W 
Sbjct: 373 HYEPHFDFARRE--EKLAFEGLNLGNRIATMLFYMSDVQQGGATVF-----TSLRTALW- 424

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDK 173
                     P KG A  + +LH     D+ + H +CPV+ G KW + KWIH R   F +
Sbjct: 425 ----------PKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKWVSNKWIHERGQEFRR 474

Query: 174 PEKEPED 180
           P    ED
Sbjct: 475 PCALEED 481


>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Sarcophilus harrisii]
          Length = 536

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/172 (32%), Positives = 84/172 (48%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D ++A +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQVANYGMGG 426

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D       G+R+AT L Y+S VE GG TVFP+              
Sbjct: 427 QYEPHFDFSRKGEQDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 473

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G  + P KG ++ +++L      D  + H +CPV+ G KW + KW H R
Sbjct: 474 ---GATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHER 522


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 82/165 (49%), Gaps = 16/165 (9%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  + +++  + RTS   +L  A +     +  RI   +       E +Q+++Y  G 
Sbjct: 351 VVDQVTHRNMMVKERTSKVTWLGDATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGG 410

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
            Y  H+DF       +L G RIATV+ YLS VE+GG TVFP  +                
Sbjct: 411 HYASHYDFLNATSKTRLNGDRIATVMFYLSDVEQGGATVFPKIQ---------------- 454

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            AV P +G A+++++L  +   D+ ++H +CPVI G KW   KWI
Sbjct: 455 KAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKWVCNKWI 499


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 371 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGG 430

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R+      ++LG G+R+AT L Y+S VE GG TVFP+              
Sbjct: 431 QYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 477

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
              G A+ P KG A+ +++L      D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 478 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 533


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 594

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 87/172 (50%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           +++  +G    +  R S   +L   +  +V  I   I   T L  +  E +Q+ +Y  G 
Sbjct: 425 ISNPVTGVLETAHYRISKSAWLGAYEHPVVDKINQLIEDVTGLNVKTAEDLQVANYGLGG 484

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+RIAT L+Y++ V+ GG TVF +              
Sbjct: 485 QYEPHFDFGRKDEPDAFEELGTGNRIATWLLYMTDVQAGGATVFTDI------------- 531

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
              G AVKP KG A+ +++L+P    D  + H +CPV+ G KW + KWIH R
Sbjct: 532 ---GAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLLGNKWVSNKWIHER 580


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 81/170 (47%), Gaps = 21/170 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V     G S+  E RTS   F+ + + E+   IE R+AA    P E  E  Q+  Y+  Q
Sbjct: 113 VTGEADGSSMVHEGRTSEMAFIQRGEAEVAERIERRLAALAHWPAECSEPFQLQKYDATQ 172

Query: 62  KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
           +Y PH+D+        + +   GG R+AT ++YLS VE+GG TVFP              
Sbjct: 173 EYRPHYDWLDPDSSGHRSHLARGGQRLATFILYLSDVEQGGGTVFPG------------- 219

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
               G  V P KG AL F +   +   D  +LHG  PV+ G K  A KW+
Sbjct: 220 ---LGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTKIIANKWL 266


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 82/165 (49%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  +   +  I  RI   T L  +  E +Q+ +Y  G +YEPHFDF R    
Sbjct: 381 RISKSAWLSGYEHSTIERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 440

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVF +                 G AV P KG A
Sbjct: 441 DAFKELGTGNRIATWLFYMSDVSAGGATVFTDV----------------GAAVWPKKGTA 484

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
           + +++L P    D ++ H +CPV+ G KW + KWIH R   F +P
Sbjct: 485 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 529


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
           gallopavo]
          Length = 539

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  + +  E R S   +L    D +V ++E R+AA T L   P   E +Q+++Y 
Sbjct: 370 VVASGEKQQKV--EYRISKSAWLKDTADPVVRALELRMAAITGLDLRPPYAEYLQVVNYG 427

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD    +   + +   G+RIATV++YLS VE GG T F  +           
Sbjct: 428 LGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAGGSTAFIYAN---------- 477

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++L  +   D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 478 ------FSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIHEYGQEFRR 531

Query: 174 P-EKEPED 180
           P  ++P D
Sbjct: 532 PCSRDPRD 539


>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
          Length = 542

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  + +  E R S   +L    D +V ++E R+AA T L   P   E +Q+++Y 
Sbjct: 373 VVASGEKQQKV--EYRISKSAWLKDTADPVVQALELRMAAITGLDLRPPYAEYLQVVNYG 430

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD    +   + +   G+RIATV++YLS VE GG T F  +           
Sbjct: 431 LGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAGGSTAFIYAN---------- 480

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++L  +   D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 481 ------FSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIHEYGQEFRR 534

Query: 174 P-EKEPED 180
           P  ++P D
Sbjct: 535 PCSRDPRD 542


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 79/161 (49%), Gaps = 25/161 (15%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           RT+ G +L K  +E+   I  RI   T     + E  Q+++Y  G  Y  HFD+F    +
Sbjct: 368 RTAKGYWLKKESNEMTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASS 427

Query: 76  QQLG---------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
              G         G RIATVL YL+ VE+GG TVF N                 GY+V P
Sbjct: 428 NYTGERSHHSIVLGDRIATVLFYLTDVEQGGATVFGNV----------------GYSVYP 471

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
             G A+ +++L  D + D  + H SCPV+ G KW  T+WIH
Sbjct: 472 QAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKWVMTEWIH 512


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 57/172 (33%), Positives = 88/172 (51%), Gaps = 19/172 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           ++G S  S++RTS   +L    +  +A I+ R+   T L  +  E +Q+++Y  G +YEP
Sbjct: 378 QTGNSTVSDIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEP 437

Query: 66  HFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
           HFDF  D + N    G+R+ T L YL+ V  GG T FP   +                AV
Sbjct: 438 HFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGATAFPFLHL----------------AV 481

Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
            P+KG  L++++LH     D  + H  CPV++G KW   +W H   + F +P
Sbjct: 482 PPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQEFRRP 533


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/170 (32%), Positives = 90/170 (52%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           + ++++G+   +  R S   +L   + + +  +  R+   T L     E +Q+++Y  G 
Sbjct: 366 IRNSKTGELEPANYRISKSAWLKSEEHDHILKVTRRVGDITGLDMSTAEDLQVVNYGIGG 425

Query: 62  KYEPHFDFFRDKMNQ---QLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFD+ R +  +   +LG G+RIAT L Y+S VE GG TVFP +             
Sbjct: 426 HYEPHFDYARTETTEAFKELGWGNRIATWLFYMSDVEAGGATVFPPT------------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              G AV P KG A  +++L+P+   +  + H +CPV+ G KW + +WIH
Sbjct: 473 ---GAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIH 519


>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 490

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + +SG+   +  R S   +L   +  ++A +  RI   T L  +  E +Q+++Y  G 
Sbjct: 321 VQNYKSGELEVANYRISKSAWLRNEEHGVIARVTRRIEHITGLSADTAEELQVVNYGIGG 380

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YEPHFDF R +     Q LG G+RIAT L Y+S V  GG TVFP     Q R   W   
Sbjct: 381 HYEPHFDFARREEKNAFQSLGTGNRIATWLNYMSDVPAGGATVFP-----QLRLTLW--- 432

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
                   P KG A  +++LH     D  + H +CPV+ G KW + KW H R   F +P
Sbjct: 433 --------PEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKWVSNKWFHERGQEFTRP 483


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVLAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 89/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++++G+   +  R S   +L    D ++  +  RI  +T L     E +Q+ +Y  G 
Sbjct: 355 VQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGG 414

Query: 62  KYEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R +          G+RIATVL Y+S  E+GG TVF +              
Sbjct: 415 HYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHL------------- 461

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G AV P K DAL +++L  D   D  + H +CPV+ G KW + KWIH +   F +P
Sbjct: 462 ---GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQEFTRP 517


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVLAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ +  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           + +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527


>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
 gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
          Length = 584

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/164 (37%), Positives = 78/164 (47%), Gaps = 21/164 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           RTS   +L  +  EI   I  RI A T L  E  E +Q+ +Y  G  Y PHFDF R +  
Sbjct: 423 RTSKSAWLPHSMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREK 482

Query: 76  QQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                  G+RIAT++ YLS V+ GG TVF                 R G  V P KG A 
Sbjct: 483 DAFEVKNGNRIATIIFYLSDVQAGGATVF----------------NRIGTRVVPKKGAAG 526

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            +F+L P+   D  + H +CPV+ G KW    W H R   F +P
Sbjct: 527 FWFNLLPNGEGDLRTRHAACPVLAGSKWVMNLWFHERGQEFRRP 570


>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
          Length = 584

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/186 (34%), Positives = 91/186 (48%), Gaps = 34/186 (18%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYEHGQKYEP 65
           GK +  E R S   +L    D ++ ++  RIAA T L   P   E +Q+++Y  G  YEP
Sbjct: 420 GKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEP 479

Query: 66  HFD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           HFD         FR K      G+R+AT ++YLS VE GG T F  +             
Sbjct: 480 HFDHATSPSSPLFRMK-----SGNRVATFMIYLSSVEAGGATAFIYA------------- 521

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP- 174
               ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +P 
Sbjct: 522 ---NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPC 578

Query: 175 EKEPED 180
              PED
Sbjct: 579 SSSPED 584


>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 331

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 63/175 (36%), Positives = 84/175 (48%), Gaps = 23/175 (13%)

Query: 2   VADNESGKSIASEVRTSS--GMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           V + ESG+ + +E   S     F  +    +   I  R A     P  + E +    Y  
Sbjct: 162 VIEYESGQEVVNEATRSCSCASFPPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLP 221

Query: 60  GQKYEPHFDFFRDKM---NQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW 114
           G+++ PH D+FR  +   ++ +G  GHRIATVL+YL+ VE GG T FPN           
Sbjct: 222 GEQFRPHVDYFRGAVLNNDKIMGSSGHRIATVLLYLNEVEAGGATFFPNP---------- 271

Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
                 G+ V+P KG AL F     D S D TSLH  C V +GEKW AT W   R
Sbjct: 272 ------GFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320


>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
 gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
          Length = 575

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 63/184 (34%), Positives = 88/184 (47%), Gaps = 24/184 (13%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G S  S+ RT S  +L   ++ +   I  R+A     P E  E +Q++HY H Q+Y PH+
Sbjct: 91  GSSGVSQGRTGSNCWLRYQEEPLARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHY 150

Query: 68  DFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           D +     R     + GG R+ T L+YL+ VE+GG T FPN+                G 
Sbjct: 151 DAYDLDTPRGLRCTRQGGQRMVTALLYLNEVEEGGATAFPNA----------------GV 194

Query: 123 AVKPMKGDALLFFSLHPD-ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
            V P KG   +F ++  D       SLHG  PV  GEKW+A+ W   R     E++P  D
Sbjct: 195 EVAPRKGRIAIFNNVGADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAH--ERQPWFD 252

Query: 182 DCVD 185
           D  D
Sbjct: 253 DVED 256


>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
 gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
          Length = 534

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 88/171 (51%), Gaps = 21/171 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++GK   +  R S   +L+     +V  I   I   T L  E+ EA+QI +Y  G 
Sbjct: 364 VHNKDTGKLEYATYRISKSAWLNDDDHPLVRRISTLIEDVTGLTMESAEALQIANYGIGG 423

Query: 62  KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
            YEPHFD        D      GG+RIAT+L+YLS VE GG TVF ++            
Sbjct: 424 HYEPHFDHADVRSGTDVFKTWKGGNRIATMLIYLSSVELGGATVFSSA------------ 471

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
               G  ++P +G A  +++LH + + ++ + H +CPV+ G KW A KWIH
Sbjct: 472 ----GVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLIGSKWIANKWIH 518


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 21/178 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G+   ++ R S   +L   +D ++A I  R +A T L     E +Q+++Y  G 
Sbjct: 366 VHNSATGQLEHAKYRISKSGWLRDEEDPLIARISERCSALTNLSLTTVEELQVVNYGIGG 425

Query: 62  KYEPHFDFFRD---KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           +YEPHFDF R       ++  G+RI TV+ Y++ VE GG TVF ++              
Sbjct: 426 QYEPHFDFSRRSEPTAFEKWRGNRILTVIYYMTDVEAGGATVFLDA-------------- 471

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
             G  V P KG A ++ +L P    D  + H +CPV+ G KW A KW H R+  F +P
Sbjct: 472 --GVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKWVANKWFHERDQEFRRP 527


>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 548

 Score =  100 bits (248), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 71/219 (32%), Positives = 107/219 (48%), Gaps = 39/219 (17%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN-----GEAMQILHYEHGQK 62
           GK+I S+ RTS   F++        +++ RI  +  L  E       + +Q+L Y   Q 
Sbjct: 254 GKAI-SKTRTSDNAFVTHTN--TAQALKRRI--FQLLGIEEYHETWADGLQVLRYNESQA 308

Query: 63  YEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVF---------------- 101
           Y  HFD+       D  ++ LG +R ATV++Y + V +GGETVF                
Sbjct: 309 YVAHFDYLESAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKV 368

Query: 102 PNSEVSQSRD---GNWSECA----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPV 154
           P  EV ++ D     W E      RR   V P +G A+LF++ HPD   D +S HG+CPV
Sbjct: 369 PVREVLENLDLPRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPV 428

Query: 155 IEGEKWSATKWI-HVRNFDKPEKEPEDDDCVDEDLNCVV 192
           I+G+KW+A  W+ +   +     +PE    VD+  N +V
Sbjct: 429 IDGQKWAANLWVWNGPRYGLSSVDPETGRTVDKAGNNIV 467


>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 324

 Score =  100 bits (248), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 91/165 (55%), Gaps = 12/165 (7%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           A++ RTS+  FLS ++   +  I+ R+A  T +P ++ E +Q+L YE  QKY+ H D+F 
Sbjct: 156 ATDWRTSTTYFLSSSKHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYFP 215

Query: 72  DKMNQQLG----------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
            + ++              +R+ TV  Y+S V KGG T+FP +     R  +  +C+  G
Sbjct: 216 VEHHKNSPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAG-GAPRPQSMKDCST-G 273

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             V P K   ++F+S+ P+   D  SLHG CPV +G K+S  KW+
Sbjct: 274 LKVSPKKRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318


>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
 gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
          Length = 221

 Score = 99.8 bits (247), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 55/159 (34%), Positives = 85/159 (53%), Gaps = 24/159 (15%)

Query: 16  RTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
           R+  G+F+ + ++  +I  +I  ++ ++  +  E+ E MQ++ Y  G++   HFD+F   
Sbjct: 69  RSGWGLFMKEGEEDHQITKNIFNKMKSFVNIS-ESCEVMQVIRYNQGEETSSHFDYFNPL 127

Query: 72  ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
                M   L G R+ T+LMYL  VE+GGET FP                  G  VKP+K
Sbjct: 128 TTNGSMKIGLYGQRVCTILMYLCDVEEGGETTFPEV----------------GIKVKPIK 171

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           GDA+LF++  P+   D  SLH   PV++G KW A K I+
Sbjct: 172 GDAVLFYNCKPNGDVDPLSLHQGDPVLKGNKWVAIKLIN 210


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score = 99.8 bits (247), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 32/191 (16%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
            G  YEPHFD          +MN    G+R+AT ++YLS VE GG T F          G
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMN---SGNRVATFMIYLSSVEAGGATAFIY--------G 481

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
           N+S        V  +K  AL +++LH     D  +LH +CPV+ G+KW A KWIH   + 
Sbjct: 482 NFS--------VPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQE 533

Query: 171 FDKP-EKEPED 180
           F +P    PED
Sbjct: 534 FRRPCSSRPED 544


>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
 gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
          Length = 541

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 86/173 (49%), Gaps = 20/173 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +S  S  SEVR S   +L    +  ++ I+ R+   T L  E+ E +Q+++Y  G +YEP
Sbjct: 369 QSENSTTSEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 428

Query: 66  HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           HFDF  D         G+R+ T L YL+ V  GG T FP   +                A
Sbjct: 429 HFDFVEDDGQSVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------A 472

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
           V P+KG  L++++LH     D  + H  CPV++G KW   +W HV  + F +P
Sbjct: 473 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 525


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score = 99.4 bits (246), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 32/191 (16%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 345 VVASGE--KQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 402

Query: 59  HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
            G  YEPHFD          +MN    G+R+AT ++YLS VE GG T F          G
Sbjct: 403 IGGHYEPHFDHATSPSSPLYRMN---SGNRVATFMIYLSSVEAGGATAFIY--------G 451

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
           N+S        V  +K  AL +++LH     D  +LH +CPV+ G+KW A KWIH   + 
Sbjct: 452 NFS--------VPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQE 503

Query: 171 FDKP-EKEPED 180
           F +P    PED
Sbjct: 504 FRRPCSSRPED 514


>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
 gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
 gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
          Length = 540

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 86/173 (49%), Gaps = 20/173 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +S  S  SEVR S   +L    +  ++ I+ R+   T L  E+ E +Q+++Y  G +YEP
Sbjct: 368 QSENSTTSEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 427

Query: 66  HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           HFDF  D         G+R+ T L YL+ V  GG T FP   +                A
Sbjct: 428 HFDFVEDDGQSVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------A 471

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
           V P+KG  L++++LH     D  + H  CPV++G KW   +W HV  + F +P
Sbjct: 472 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 524


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 83/171 (48%), Gaps = 21/171 (12%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           +S   + RTS   +     +E+   +  RIA  T       E +Q ++Y  G  Y+ H+D
Sbjct: 332 RSEVVKTRTSKVAWFPDTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYD 391

Query: 69  FFRDKMN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           FF         Q+ G RIATVL YL+ VE+GG TVFPN                   AV 
Sbjct: 392 FFNASTAANLTQMNGDRIATVLFYLTDVEQGGATVFPNIR----------------KAVF 435

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           P +G A+++++L  D   +  +LH +CPV+ G KW   KWI  R   F +P
Sbjct: 436 PQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQLFKRP 486


>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
           [Drosophila melanogaster]
          Length = 540

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 86/173 (49%), Gaps = 20/173 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +S  S  SEVR S   +L    +  ++ I+ R+   T L  E+ E +Q+++Y  G +YEP
Sbjct: 368 QSENSTTSEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 427

Query: 66  HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           HFDF  D         G+R+ T L YL+ V  GG T FP   +                A
Sbjct: 428 HFDFVEDDGQSVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------A 471

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
           V P+KG  L++++LH     D  + H  CPV++G KW   +W HV  + F +P
Sbjct: 472 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 524


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 32/191 (16%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 309 VVASGE--KQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 366

Query: 59  HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
            G  YEPHFD          +MN    G+R+AT ++YLS VE GG T F          G
Sbjct: 367 IGGHYEPHFDHATSPSSPLYRMN---SGNRVATFMIYLSSVEAGGATAFIY--------G 415

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
           N+S        V  +K  AL +++LH     D  +LH +CPV+ G+KW A KWIH   + 
Sbjct: 416 NFS--------VPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQE 467

Query: 171 FDKP-EKEPED 180
           F +P    PED
Sbjct: 468 FRRPCSSRPED 478


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 379 VVASGE--KQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 436

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 437 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 485

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 486 -----NFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 540

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 541 PCSSSPED 548


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G+   +  RTS   +L   + E+V  I  RI   T L  E  E +Q+ +Y  G 
Sbjct: 362 VQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGG 421

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R +++N  Q L  G+R+AT+L Y++  E GG TVF  +EV  +        
Sbjct: 422 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 V P K DAL +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 472 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 524


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G+   +  RTS   +L   + EIV  I  RI   T L  E  E +Q+ +Y  G 
Sbjct: 362 VQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGG 421

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R +++N  Q L  G+R+AT+L Y++  E GG TVF  +EV  +        
Sbjct: 422 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 V P K DAL +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 472 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRP 524


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 65/179 (36%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G+   +  RTS   +L   + EIV  I  RI   T L  E  E +Q+ +Y  G 
Sbjct: 363 VQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGG 422

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R +++N  Q L  G+R+AT+L Y++  E GG TVF  +EV  +        
Sbjct: 423 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 472

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 V P K DAL +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 473 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRP 525


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++ +G+   +  RTS   +L   + E+V  I  RI   T L  E  E +Q+ +Y  G 
Sbjct: 362 VQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGG 421

Query: 62  KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R +++N  Q L  G+R+AT+L Y++  E GG TVF  +EV  +        
Sbjct: 422 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 471

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 V P K DAL +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 472 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 524


>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
 gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 90/181 (49%), Gaps = 29/181 (16%)

Query: 5   NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENG-------EAMQILHY 57
           N+ G +     RTS   F      +I   +  RI    F     G       + +QIL Y
Sbjct: 36  NQGGSNAKLTTRTSMNAF------DITTKLSFRIKRRAFRLLRMGAYKENLADGIQILRY 89

Query: 58  EHGQKYEPHFDFFRDKM-NQQL------GGHRIATVLMYLSHVEKGGETVFPNSEVSQSR 110
           E GQ Y  H D+F  +  N  L      G +R AT+ +YLS VE GG+T+  ++ V    
Sbjct: 90  ELGQAYIAHHDYFPVRQSNDHLWDPSKGGSNRFATIFLYLSDVEVGGQTLEKDAGVDA-- 147

Query: 111 DGNW-----SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKW 165
            G+W      +C  +  AV P +GDA+LF+S +PD   D  SLHG+CP+++G KW A  W
Sbjct: 148 -GSWEDKLVDQCYSK-LAVPPRRGDAILFYSQYPDGHLDPNSLHGACPILKGTKWGANLW 205

Query: 166 I 166
           +
Sbjct: 206 V 206


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 83/171 (48%), Gaps = 21/171 (12%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           +S   + RTS   +     +E+   +  RIA  T       E +Q ++Y  G  Y+ H+D
Sbjct: 367 RSEVVKTRTSKVAWFPDTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYD 426

Query: 69  FFRDKMNQ---QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           FF         Q+ G RIATVL YL+ VE+GG TVFPN                   AV 
Sbjct: 427 FFNASTATNLTQMNGDRIATVLFYLTDVEQGGATVFPNIR----------------KAVF 470

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           P +G A+++++L  D   +  +LH +CPV+ G KW   KWI  R   F +P
Sbjct: 471 PQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQLFKRP 521


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 94/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ ++  RIAA T L   P   E +Q+++Y 
Sbjct: 366 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 423

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 424 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 475

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 476 --------VPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 527

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 528 PCSSSPED 535


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 376 LVASGE--KQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 433

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F          GN+S
Sbjct: 434 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 485

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 486 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 537

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 538 PCSSSPED 545


>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
 gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
          Length = 544

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 96/188 (51%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R AT+++YLS VE GG T F          GN+S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYKMKSGNRAATLMIYLSSVEAGGATAFIY--------GNFS 484

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 485 --------VPVVKNAALFWWNLHRSGEGDDDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P +  PED
Sbjct: 537 PCDTNPED 544


>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 617

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/179 (32%), Positives = 90/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V +  +GK   +E R S   +L    D ++ ++  RI+  T L     E +QI +Y  G 
Sbjct: 448 VHNPRTGKLETAEYRVSKSAWLKDGDDPVIHNVNNRISDITGLSMATAEELQIANYGLGG 507

Query: 62  KYEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R +  +       G+RIAT L Y+++V+ GG TVF +              
Sbjct: 508 QYEPHFDFARREETEAFRDLGSGNRIATWLTYMTNVDAGGATVFTHI------------- 554

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G  + P+KG A  +++L+        + H +CPV+ G+KW + KWIH R   F +P
Sbjct: 555 ---GVKLFPIKGAAAFWYNLYRSGDGIFDTRHAACPVLVGQKWVSNKWIHERGQEFRRP 610


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/179 (35%), Positives = 88/179 (49%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++++G+   +  R S   +L     E++  I  RI   T L  E  E +QI +Y  G 
Sbjct: 359 VQNSKTGELETAAYRISKSAWLKGGDHELIDRINRRIELMTNLIQETSEELQIANYGVGG 418

Query: 62  KYEPHFDFFRD---KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R    K  + LG G+R+ATVL YL+  E GG TVF                
Sbjct: 419 HYDPHFDFARKEEPKAFESLGTGNRLATVLFYLTEPEIGGGTVFTELRT----------- 467

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                AV P K  AL +++L+     D  + H +CPV+ G KW A KWIH R   F +P
Sbjct: 468 -----AVMPSKNGALFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQEFLRP 521


>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
 gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
          Length = 531

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 61/175 (34%), Positives = 91/175 (52%), Gaps = 25/175 (14%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDE----IVASIEARIAAWTFL--PPENGEAMQIL 55
           V + ++G+   ++ R S   +L   +D+    I+  +  R +  T L   P + EA+QI+
Sbjct: 357 VTNLKTGELEFADYRISKSGWLEDPRDDNEEKILNRVNRRTSIITGLDTTPRSAEALQIV 416

Query: 56  HYEHGQKYEPHFDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
           +Y     YEPHFD   + ++  L    G+RIATVL Y+S VE GG TVF ++E       
Sbjct: 417 NYGAAGHYEPHFDHATEAVSSILKLGIGNRIATVLYYMSDVEAGGATVFVDAEA------ 470

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                      VKP KGDA  +++LH +   D  + H +CP+I G KW   KWIH
Sbjct: 471 ----------IVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKWVCNKWIH 515


>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
 gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
          Length = 559

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GK + +  R S   +L + + E+V  +  RI   T L  E  E +QI +Y  G 
Sbjct: 358 VHDSATGKLVTATYRISKSAWLKEWEHEVVERVNKRIELMTNLEMETAEELQIANYGIGG 417

Query: 62  KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFD  +    K  + LG G+RIATVL Y+S    GG TVF  +EV  +        
Sbjct: 418 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVF--TEVKST-------- 467

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                 V P K DAL +++L      +  + H +CPV+ G KW + KWIH +   F +P
Sbjct: 468 ------VLPTKNDALFWYNLFKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRP 520


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 89/172 (51%), Gaps = 23/172 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 314 VVASGE--KQLPVEYRISKSAWLKDTADPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 371

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 372 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 420

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 ++V  +K  AL +++LH     DS +LH +CPV+ G+KW A KWIH
Sbjct: 421 -----NFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPVLVGDKWVANKWIH 467


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/170 (34%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + +++     +   +  ++ L  E  E +Q+ +Y  G  YEPH
Sbjct: 362 NGDSTAAAFRTSQGASFNYSRNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPH 421

Query: 67  FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F D    Q   L G+RIAT + YLS VE GG T FP   +                 
Sbjct: 422 WDSFPDNHVYQEGDLHGNRIATAIYYLSDVEAGGGTAFPFLPL----------------L 465

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P +G  L +++LHP    D  + H +CPV++G KW A  WI  RN D 
Sbjct: 466 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/179 (32%), Positives = 91/179 (50%), Gaps = 19/179 (10%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
            V D  +G  I ++ R S   ++++  D I A I  R+   T L     E +Q+ +Y   
Sbjct: 362 FVHDMVTGDLIYADYRVSKNTWIAEDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIA 421

Query: 61  QKYEPHFDF---FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            +YEPHFD     R K   + GG+RIAT+L+YLS V+ GG TVF N+             
Sbjct: 422 GQYEPHFDHSTGTRPKHFDRWGGNRIATMLLYLSDVDWGGRTVFTNTA------------ 469

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
              G    P+KG  + +++L  +  ++  + H  CPV+ G+KW A  WIH   + F++P
Sbjct: 470 --PGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQEFNRP 526


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 235 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 292

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F          GN+S
Sbjct: 293 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 344

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 345 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 396

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 397 PCSTNPED 404


>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
 gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 559

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 87/170 (51%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GK + +  R S   +L   + E+V  +  RI   T L  E  E +QI +Y  G 
Sbjct: 358 VHDSVTGKLVTATYRISKSAWLKAWEHEVVERVNKRIDLMTNLEMETAEELQIANYGIGG 417

Query: 62  KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFD  +    K  + LG G+RIATVL Y+S    GG TVF  +EV  +        
Sbjct: 418 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVF--TEVKST-------- 467

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 V P K DAL +++L+     +  + H +CPV+ G KW + KWIH
Sbjct: 468 ------VLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511


>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
          Length = 262

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 93/185 (50%), Gaps = 29/185 (15%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           M+    + K + S  RT+ G +L   QD++V  +E  +   T   P+ GE +Q+LHY +G
Sbjct: 94  MIMPYGTHKLVESTTRTNDGAWLDFLQDDVVRRLEETLGKLTKTTPQQGENLQVLHYSNG 153

Query: 61  -QKYEPHFDFF---RDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            Q ++ H+D+F   RD     + GG+R  TV++YL    +GGET FP             
Sbjct: 154 AQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFPE------------ 201

Query: 116 ECARRGYAVKPMKGDALLFFSLH-------PDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
                G  +    GDAL+F++L        PD   +  ++H + P + GEKW A KWIH 
Sbjct: 202 ----LGLKLTAQPGDALMFYNLKEHCSGTDPDC-VEKKTIHAALPPVRGEKWVAVKWIHE 256

Query: 169 RNFDK 173
           + + K
Sbjct: 257 KPYQK 261


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ ++  RIAA T L   P   E +Q+++Y 
Sbjct: 365 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 422

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 423 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 474

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     DS +LH  CPV+ G KW A KWIH   + F +
Sbjct: 475 --------VPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQEFRR 526

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 527 PCSSSPED 534


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 97/185 (52%), Gaps = 24/185 (12%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENG-EAMQILHYEH 59
           MV D  + K   S+ RTS   +L+     +VA++  R         E   E++Q+ +Y  
Sbjct: 349 MVGD--AAKKEVSKSRTSQNSWLTDYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGI 406

Query: 60  GQKYEPHFDFFRDK--MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           G  Y PH+D+ R++    +   G+RIAT++ YLS VE+GG TVFP+              
Sbjct: 407 GGHYLPHYDWSREENPYPELNTGNRIATLMFYLSDVEEGGATVFPH-------------- 452

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP- 174
              G  V P KG A+ +++L      D  +LHG+CPV+ G KW A KWIH R+  F +P 
Sbjct: 453 --LGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKWVANKWIHERHQEFVRPC 510

Query: 175 EKEPE 179
           + +PE
Sbjct: 511 DPDPE 515


>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
           Precursor
 gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
          Length = 559

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 88/170 (51%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GK + +  R S   +L + + ++V ++  RI   T L  E  E +QI +Y  G 
Sbjct: 358 VHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGG 417

Query: 62  KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFD  +    K  + LG G+RIATVL Y+S    GG TVF  ++ +          
Sbjct: 418 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKST---------- 467

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 + P K DAL +++L+     +  + H +CPV+ G KW + KWIH
Sbjct: 468 ------ILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 96/188 (51%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  + R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVDYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 481

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++LH +   DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 482 -----NFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 373 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F          GN+S
Sbjct: 431 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 482

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 483 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 534

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 535 PCSTNPED 542


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ ++  RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     DS +LH  CPV+ G KW A KWIH   + F +
Sbjct: 485 --------VPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 373 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F          GN+S
Sbjct: 431 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 482

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 483 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 534

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 535 PCSTNPED 542


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ ++  RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  ++  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ ++  RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  ++  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
          Length = 558

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 88/170 (51%), Gaps = 20/170 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D+ +GK + +  R S   +L + + ++V ++  RI   T L  E  E +QI +Y  G 
Sbjct: 357 VHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGG 416

Query: 62  KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFD  +    K  + LG G+RIATVL Y+S    GG TVF  ++ +          
Sbjct: 417 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKST---------- 466

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 + P K DAL +++L+     +  + H +CPV+ G KW + KWIH
Sbjct: 467 ------ILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 510


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 88/175 (50%), Gaps = 29/175 (16%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  + +  E R S   +L    D I+ S++ RIAA T L   P   E +Q+++Y 
Sbjct: 352 VVASGEKQQQV--EYRISKSAWLKDTVDPILVSLDRRIAALTGLNVQPPYAEHLQVVNYG 409

Query: 59  HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
            G  YEPHFD          +MN    G+R+AT ++YLS VE GG T F  +        
Sbjct: 410 IGGHYEPHFDHATSPSSPLYRMNS---GNRVATFMIYLSSVEAGGSTAFIYAN------- 459

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                    ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH
Sbjct: 460 ---------FSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPVLVGDKWVANKWIH 505


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 97/201 (48%), Gaps = 28/201 (13%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           + +S+ S VRTS   FL   +D+++A+I+ R+A  T       E  Q  +Y  G  Y  H
Sbjct: 335 TNESVVSNVRTSQFTFLPVTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQH 394

Query: 67  FD-FFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
            D F++   +  L      G+RIATVL YLS V +GG T FP+  V              
Sbjct: 395 MDWFYQPSFDAGLVSSPEMGNRIATVLFYLSDVTQGGGTAFPHLRV-------------- 440

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
              +KP K  A  +++LH     D  + HG+CP+I G KW   +WI  R F + ++ P  
Sbjct: 441 --LLKPKKYAAAFWYNLHASGVGDPRTQHGACPIISGSKWVQNRWI--REFIQSDRRP-- 494

Query: 181 DDCVDEDLNCVVWAKAGECKK 201
             C+  D +    A+  E +K
Sbjct: 495 --CLTWDDSLATLAEIRELEK 513


>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
          Length = 480

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 91/203 (44%), Gaps = 42/203 (20%)

Query: 5   NESGKSIASEVRTSSGMF-LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           N+ G       RTS   F ++  Q   V     R+           + +QIL Y+ GQ Y
Sbjct: 247 NQGGDGAVLTTRTSENAFDITTKQSFDVKKRAFRLLRMNGYQENMADGIQILRYKVGQAY 306

Query: 64  EPHFDFFRDKMNQQL-------GGHRIATVLMYLSHVEKGGETVFPNSE----------- 105
             H D+F    ++         G +R AT+ +YLS V  GG+TVFPN E           
Sbjct: 307 VAHHDYFPTHQSKDFNWDPLSGGSNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELV 366

Query: 106 ---------------VSQS--RDGNWSE-----CARRGYAVKPMKGDALLFFSLHPDAST 143
                          VS +   +G+W +     C  + +AV P +GDA+LF+S  PD   
Sbjct: 367 ERLGESPSASELKEFVSNAGLMEGSWEDNLIHKCYEK-FAVPPRRGDAILFYSQRPDGLL 425

Query: 144 DSTSLHGSCPVIEGEKWSATKWI 166
           D+ SLHG+CP++ G KW A  W+
Sbjct: 426 DTNSLHGACPILNGTKWGANLWV 448


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 96/191 (50%), Gaps = 32/191 (16%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  + +  E R S   +L    D ++ S++ RIAA T L   P   E +Q+++Y 
Sbjct: 390 VVASGEKQQQV--EYRISKSAWLKDTVDPMLVSLDHRIAALTGLNVQPPYAEHLQVVNYG 447

Query: 59  HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
            G  YEPHFD          +MN    G+R+AT ++YLS VE GG T F  +        
Sbjct: 448 IGGHYEPHFDHATSPSSPLYRMNS---GNRVATFMIYLSSVEAGGSTAFIYAN------- 497

Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
                    ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + 
Sbjct: 498 ---------FSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQE 548

Query: 171 FDKP-EKEPED 180
           F +P   +PED
Sbjct: 549 FRRPCSAKPED 559


>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 544

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 94/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLPVEYRISKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 481

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 482 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
 gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
          Length = 537

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 85/169 (50%), Gaps = 20/169 (11%)

Query: 10  SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           S  +EVR S   +L    +  ++ I+ R+   T L  E+ E +Q+++Y  G +YEPHFDF
Sbjct: 369 STTTEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDF 428

Query: 70  FRD--KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
             D  K      G+R+ T L YL+ V  GG T FP   +                AV P+
Sbjct: 429 VEDDGKTVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------AVPPV 472

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
           KG  L++++LH     D  + H  CPV++G KW   +W HV  + F +P
Sbjct: 473 KGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVAAQEFRRP 521


>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
          Length = 327

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/173 (34%), Positives = 82/173 (47%), Gaps = 22/173 (12%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVA-SIEARIAAWTFLPPENGEAMQILHYEH 59
            V D  SG+ I   +RTS G  +    + +V  +I  RIAA T    E GE++ +L Y  
Sbjct: 169 FVLDPNSGRPIPHPIRTSDGGAIGPTNENLVVRAINLRIAAATGTAVEQGESLTVLRYAR 228

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           GQ+Y  H D      NQ     RIAT ++YL+   +GGET FP   +             
Sbjct: 229 GQEYRRHLDTIAGAENQ-----RIATFIVYLNDGFEGGETHFPLLNIQ------------ 271

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
               V+P  GDA+ F ++ PD + D   +H   PV  G KW AT+WI     D
Sbjct: 272 ----VRPRIGDAIRFDTIRPDGTPDPRLVHAGQPVRNGVKWIATRWIRREPVD 320


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 90/180 (50%), Gaps = 24/180 (13%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPH 66
           K +  E R S   +L    D ++ +++ RIAA T L   P   E +Q+++Y  G  YEPH
Sbjct: 346 KQLPVEYRISKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPH 405

Query: 67  FDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           FD        + +   G+R+AT ++YLS VE GG T F  +                 ++
Sbjct: 406 FDHATSPSSPLYRMKSGNRVATFMIYLSAVEAGGATAFIYA----------------NFS 449

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP-EKEPED 180
           V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +P    PED
Sbjct: 450 VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPCSTNPED 509


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 56/166 (33%), Positives = 83/166 (50%), Gaps = 21/166 (12%)

Query: 14  EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           + RT+   +     +++   +  RI   T       E +Q+++Y  G  Y  HFD+F   
Sbjct: 367 KTRTAKVAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTT 426

Query: 74  MN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
            N    Q+ G RIATVL YL+ VE+GG TVFP  E+ +              AV P +G 
Sbjct: 427 TNPHISQINGDRIATVLFYLNDVEQGGATVFP--EIKK--------------AVFPKRGS 470

Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           A+++++L  D   +  +LH +CPVI G KW   KWI  R   F +P
Sbjct: 471 AIMWYNLKDDGEGNRDTLHAACPVIVGSKWVCNKWIREREQIFRRP 516


>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
          Length = 534

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 84/179 (46%), Gaps = 22/179 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V +  +G    +  R S   +L       +  I  RI   T L  E  E +Q  +Y  G 
Sbjct: 355 VQNARTGDLEYANYRISKSAWLKGTDHPAIDRINKRIDLMTNLNQETAEELQAQNYGIGG 414

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            Y+PHFDF R    +       G+RIAT+L+Y+S VE GG TVF +              
Sbjct: 415 HYDPHFDFARKEDINAFKTLNTGNRIATILIYMSDVESGGATVFNH-------------- 460

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
              G AV P K DAL +++L  D   D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 461 --LGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQEFRRP 517


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 81/172 (47%), Gaps = 20/172 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + +SG+   +  R S   +L     E+V  +  R    T L     E +Q+++Y  G 
Sbjct: 356 VQNAKSGELEVANYRISKSAWLKNHDHEVVERLSFRFEYLTGLTHLTAEELQVVNYGIGG 415

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
            YE HFDF R    D   Q   G+RIAT + Y+S V+ GG TVFP               
Sbjct: 416 HYEAHFDFARRDEKDAFKQLGTGNRIATWINYMSDVKAGGATVFP--------------- 460

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            R G  V P KG A  +++LH     D  + H +CPV+ G KW + KW H R
Sbjct: 461 -RLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKWVSNKWFHER 511


>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
          Length = 491

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 89/187 (47%), Gaps = 29/187 (15%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWT-----FLPPENG-EAMQILHYEHGQKYEP 65
           +S+ R S   +L    D ++  + ARI   T     + P  +  EAMQ+++Y  G +YEP
Sbjct: 320 SSDQRISKVGWLFDNVDTLIKKLSARIGDVTGLNTVYTPVRSPVEAMQVVNYGIGGQYEP 379

Query: 66  HFDFFRD-----KMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           H DF+ D      +N  L   G RI+T L YLS V  GG TVFP   V            
Sbjct: 380 HLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSRVHLGGATVFPKLNVR----------- 428

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
                V P+K  A  +++  P+   D  +LH  CPV+ GEKW A KWI  R  +     P
Sbjct: 429 -----VPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVANKWIRERGQEFYRPCP 483

Query: 179 EDDDCVD 185
            D + +D
Sbjct: 484 LDKEAID 490


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 59/161 (36%), Positives = 80/161 (49%), Gaps = 27/161 (16%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
           RT+ G +L K  +E+   I  RI   T     + E  Q+++Y  G  Y  H D+F     
Sbjct: 369 RTAKGHWLKKESNELTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASS 428

Query: 71  -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
                R + ++ LG  RIATVL YLS VE+GG TVF N                 GY+V 
Sbjct: 429 NYTGPRSRQSKVLGD-RIATVLFYLSDVEQGGATVFGNV----------------GYSVY 471

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           P  G A+ +++L  D + D  + H SCPVI G KW  T+WI
Sbjct: 472 PQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGSKWVMTEWI 512


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 84/170 (49%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+ + S  R S   +L  +   ++  +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 363 GQRMKSAFRVSKNAWLPYSTHPMMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHW 422

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP                   +AV+P
Sbjct: 423 DFFRDSRHYPAAEGNRIATAIFYLSDVEQGGATAFPFL----------------NFAVRP 466

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH  +  D  + H  CPV++G KW A  WIH   + F +P
Sbjct: 467 QLGNILFWYNLHRSSDEDYRTKHAGCPVLKGSKWIANIWIHEATQTFARP 516


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 91/173 (52%), Gaps = 21/173 (12%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +S  +  SE+RTS+  +L   ++  ++ I+ R+   T L  E+ E +Q+++Y  G +YEP
Sbjct: 371 QSQNATTSEIRTSANTWLWYNENPWLSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEP 430

Query: 66  HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           HFDF  +   +  G  G+R+ T L Y++ V  GG T FP  ++                A
Sbjct: 431 HFDFVEEP-QKVFGWKGNRMLTALFYINDVALGGATAFPFLQL----------------A 473

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
           V P+KG  L++++LH     D  + H  CPVI+G KW   +W H   + F +P
Sbjct: 474 VPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQVFKRP 526


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/176 (34%), Positives = 89/176 (50%), Gaps = 23/176 (13%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           E+G+ + +  R S   +LS + D +  V  I+ RI   T L     E +Q+++Y  G +Y
Sbjct: 359 ENGELLHATYRISKSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQY 418

Query: 64  EPHFDFFR---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           EPH+DF R   D       G+RI+T+L+Y+S VEKGG TVFP                  
Sbjct: 419 EPHYDFARTGEDTFTSLGSGNRISTLLIYMSDVEKGGATVFPGV---------------- 462

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           G  + P+K  A  +++L      D ++ H  CPV+ G KW   KWIH R   F +P
Sbjct: 463 GARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQEFRRP 518


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 88/165 (53%), Gaps = 23/165 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWT--FLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
           RTS+ +F+ +    ++ +I  R A  T  ++   + E +Q+++Y  G +Y PH D+F + 
Sbjct: 368 RTSNSVFMEETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQYTPHCDYFDEN 427

Query: 74  MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
                 G R+ATVL YL+ V++GG TVFP   +S                  P KG AL+
Sbjct: 428 AE---NGDRLATVLFYLTDVQQGGATVFPFLRLSYF----------------PKKGSALI 468

Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           F +L    S D  S H +CPV+ G KW ATKWI+  +FD+  + P
Sbjct: 469 FRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIY--HFDQMTRWP 511


>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
          Length = 417

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 87/165 (52%), Gaps = 12/165 (7%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           A++ RTS+  FL       +  I+ R++  T +P ++ E +Q+L YE  QKY+ H D+F 
Sbjct: 249 ATDWRTSTTYFLPSDAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYFP 308

Query: 72  DKMNQQLG----------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
            + ++              +R+ TV  Y+S V KGG T+FP +     R  +  +C   G
Sbjct: 309 VEHHKNAPHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAG-GAPRPTSMKDCTT-G 366

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             V P K   ++F+S+ P+   D  SLHG CPV EG K+S  KW+
Sbjct: 367 LNVPPKKRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWV 411


>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
          Length = 522

 Score = 96.7 bits (239), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 55/166 (33%), Positives = 84/166 (50%), Gaps = 17/166 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  + K + ++ R S   +L     + V     RI+  T L  E  E +Q+ +Y  G 
Sbjct: 353 VFDPATHKLVNADYRVSKSAWLKDEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGG 412

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +YEPH+D+ R + +      RIAT L YL+ VE+GG TVF                   G
Sbjct: 413 QYEPHYDYSRREWDI-YNNRRIATWLSYLTTVEQGGGTVF----------------TELG 455

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
             ++ +KG A+ +++L P+ S D  + H +CPV+ G KW + KWIH
Sbjct: 456 LHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKWVSNKWIH 501


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RI A T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 433 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 481

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 482 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSRPED 544


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D  + ++  RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  ++  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
 gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
          Length = 535

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + ++      +   +  ++ L  E  E +Q+ +Y  G  YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPH 421

Query: 67  FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F +    Q   L G+RIAT + YLS VE GG T FP   +                 
Sbjct: 422 WDSFPENHVYQEGDLHGNRIATGIYYLSDVEAGGGTAFPFLPL----------------L 465

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P KG  L +++LHP    D  + H +CPV++G KW A  WI  RN DK
Sbjct: 466 VTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDK 515


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D  + ++  RIAA T L   P   E +Q+++Y 
Sbjct: 359 VVASGE--KQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 416

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 417 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 468

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  ++  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 469 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 520

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 521 PCSSSPED 528


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RIAA T L  ++   E +Q+++Y 
Sbjct: 403 VVASGE--KQLQVEYRISKSAWLKDTADPVLVTLDHRIAALTGLDVQHPYAEYLQVVNYG 460

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 461 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 509

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 510 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 564

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 565 PCSSNPED 572


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 98/202 (48%), Gaps = 28/202 (13%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           + +S  S VRTS   F++K + E++ +I+ R+A  T L  +  E  Q  +Y  G  Y  H
Sbjct: 361 TNQSTVSNVRTSQITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQH 420

Query: 67  FDFFRDK------MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
            D+F +       ++    G+RIATVL YLS V +GG T FP  +               
Sbjct: 421 MDWFTETTFDNGLVSSTEMGNRIATVLFYLSDVAQGGGTAFPYLKQH------------- 467

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
              ++P K  A  + +LH     D+ + HG+CP+I G KW   +WI  R F + ++ P  
Sbjct: 468 ---LRPKKYAAAFWHNLHAAGRGDARTQHGACPIIAGSKWVLNRWI--REFVQSDRRP-- 520

Query: 181 DDCVDEDLNCVVWAKAGECKKN 202
             C+  D +   +A+  E  KN
Sbjct: 521 --CLLWDDSLATYAQIMELAKN 540


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 81/151 (53%), Gaps = 19/151 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           R S+G ++ +  + +   IE RIA    L  E  E   +++Y  G +Y+ H+DFF     
Sbjct: 361 RISAGTWVERKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFG---A 417

Query: 76  QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFF 135
             +  +R+ATVL Y++ VE+GG TVFP                R G  V+  +G+AL ++
Sbjct: 418 DTVEDNRLATVLFYMNDVEQGGATVFP----------------RLGQTVRAKRGNALFWY 461

Query: 136 SLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           ++  + + D  +LHG CP++ G KW  T+WI
Sbjct: 462 NMQHNGTVDDRTLHGGCPILVGSKWIFTQWI 492


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 84/162 (51%), Gaps = 19/162 (11%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-M 74
           RTS G ++ +  + +   IE RI     L     E  Q+++Y  G  Y  H DF  D   
Sbjct: 345 RTSKGTWIERDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWA 404

Query: 75  NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
           +++    RIATVL YL+ VE+GG TVF  + ++Q              AV P +G AL +
Sbjct: 405 DKKEEDDRIATVLFYLTDVEQGGATVF--TILNQ--------------AVSPKRGTALFW 448

Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           ++LH + + D+ +LHG CPV+ G KW  T WI  R   F +P
Sbjct: 449 YNLHRNGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQLFTRP 490


>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Rhinolophus ferrumequinum]
          Length = 555

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 90/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L + +D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEETEDPVVARLNLRMQHITGLSVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMNQQL------------------------GGHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +  L                         G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDNGLKTEGNRLATFLNYNDEHDVFKHLGTGNRVATFLNYMSDVEAGG 485

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 486 ATVFPD----------------LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RI A T L   P   E +Q+++Y 
Sbjct: 364 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYG 421

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 422 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 470

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 471 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 525

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 526 PCSSSPED 533


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D  + ++  RIAA T L   P   E +Q+++Y 
Sbjct: 348 VVASGE--KQLQVEYRISKSAWLKDTVDPKLVALNHRIAALTGLDVRPPYAEYLQVVNYG 405

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 406 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 457

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  ++  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 458 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 509

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 510 PCSSSPED 517


>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 2 [Oryctolagus
           cuniculus]
 gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Oryctolagus cuniculus]
          Length = 555

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 92/199 (46%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA I  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       ++LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNNERDAFKRLGTGNRVATFLNYMSDVEAGG 485

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 86/165 (52%), Gaps = 19/165 (11%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTF-LPPENGEAMQILHYEHGQKYE 64
           + G+   S  RTS   +L    D +V +++ R+   T  L  ++ E +Q+ +Y  G  Y 
Sbjct: 341 DDGEPQVSNARTSQNAWLDAGDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYV 400

Query: 65  PHFDFFRDKMNQQ--LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
            H D+  + +       G+RIATV+ YLS VE GG TVFP                + G 
Sbjct: 401 AHHDWAMEAVPYAGLRVGNRIATVMFYLSDVEIGGATVFP----------------QLGL 444

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           AV P KG A+L+++L+ +   D  +LH +CPV+ G KW A +WIH
Sbjct: 445 AVFPRKGSAILWYNLYRNGKGDRRTLHAACPVLSGSKWVANQWIH 489


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 62/207 (29%), Positives = 96/207 (46%), Gaps = 35/207 (16%)

Query: 10  SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           S+ S  RTS   FL K + +++ +I+ R+A  T L  E  E  Q+ +Y  G  Y  H D+
Sbjct: 370 SVVSNARTSQFTFLPKTRHKVLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDW 429

Query: 70  F------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           F        +++    G+RI TVL YLS VE+GG T FP  +                  
Sbjct: 430 FYPITFETKQVSNPEMGNRIGTVLFYLSDVEQGGATAFPALK----------------QL 473

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDC 183
           ++P K  A  +++LH     D+ ++HG+CP+I G KW   +WI  R F + ++ P     
Sbjct: 474 LRPKKHAAAFWYNLHASGVGDARTMHGACPIIVGSKWVLNRWI--REFVQSDRRP----- 526

Query: 184 VDEDLNCVVWAKAGECKKNPLYMVGSK 210
                 C  W  +       L + GS+
Sbjct: 527 ------CYQWDDSKLTLSQVLELTGSQ 547


>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
 gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
          Length = 484

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 46/137 (33%), Positives = 77/137 (56%), Gaps = 19/137 (13%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYL 90
           +  RI   T    +    +QI ++  G +++PH+D+F ++   +N  + G RIA+++ Y+
Sbjct: 348 LNLRIRDITGFNVDEIRGLQIANFGVGGQFKPHYDYFTERILRLNNTILGDRIASIIFYV 407

Query: 91  SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
             V  GG+TVFP+ ++                AVKP KG +L +F+   DA+ D  SLH 
Sbjct: 408 GDVVHGGQTVFPDIQI----------------AVKPQKGSSLFWFNTFDDATPDPRSLHS 451

Query: 151 SCPVIEGEKWSATKWIH 167
            CPV+ G++W+ TKW+H
Sbjct: 452 VCPVLIGDRWTITKWLH 468


>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
           alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 525

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 86/168 (51%), Gaps = 23/168 (13%)

Query: 5   NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
           N +G  I  ++RTS   +  K     V  +  RI+  T L  E  E +Q+ +Y    +Y+
Sbjct: 355 NNTG--IVEDIRTSKVAWFKKNDFTAVKKLYTRISEMTGLSEETFEDLQVANYGLAGEYQ 412

Query: 65  PHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           PHFD+  D     + +    G+RIAT+L+YL+ V++GG T F   ++             
Sbjct: 413 PHFDYTEDPSIYKREDGAEVGNRIATMLLYLNDVKEGGRTAFIEPKI------------- 459

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                KP+KG A+ +++L+P    D  + H SCPV+ G KW++  W+H
Sbjct: 460 ---VAKPIKGSAVFWYNLYPSGLGDPRTRHASCPVVIGNKWASNVWVH 504


>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 511

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 85/179 (47%), Gaps = 27/179 (15%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +G+   +  R S   +L   +  IV  I  RI   T L     E +Q+ +Y  G 
Sbjct: 337 VHDPRTGQLTTAPYRVSKSAWLGAFEHPIVDQINQRIEDITGLDVSTAEDLQVANYGVGG 396

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMY-------LSHVEKGGETVFPNSEVSQSR 110
           +YEPHFDF +    D   +   G+RIAT L+Y       +S V+ GG TVF +       
Sbjct: 397 QYEPHFDFGQKDEPDAFEELGTGNRIATWLLYVSAAVLRMSDVQAGGATVFTDI------ 450

Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
                     G +V P KG A+ +++L P    D  + H +CPV+ G KW + KWIH R
Sbjct: 451 ----------GASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKWVSNKWIHER 499


>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 284

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 87/169 (51%), Gaps = 31/169 (18%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYEHGQKYEPH 66
           K + +E R S   +L  +    V+ ++ RI+  T L  ++  GE +Q+++Y  G  YEPH
Sbjct: 121 KQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPH 180

Query: 67  FD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           FD         F+ K      G+R+ATV++YLS VE GG T F  +              
Sbjct: 181 FDHATSPSSPVFKLKT-----GNRVATVMIYLSSVEAGGSTAFIYAN------------- 222

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              ++V  MK  A+ +++LH +   D  +LH  CPV+ G+KW A KWIH
Sbjct: 223 ---FSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKWVANKWIH 268


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 91/181 (50%), Gaps = 24/181 (13%)

Query: 10  SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           S  S+ RTS  +F++  + +++ +I+ R+A  T L  +  E  Q+  Y  G  Y  HFD+
Sbjct: 371 STVSKKRTSQHIFIAATRHKVLRTIDQRVADMTNLNMQYAEDHQLADYGIGGHYSQHFDW 430

Query: 70  F--RDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           F   D  N +    G+RIATVL YLS V +GG T FP  +                  +K
Sbjct: 431 FGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILK----------------QLLK 474

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED--DDC 183
           P K  A  +++LH     D  +LHG CP+I G KW   +WI  R +D+ +  P D  DD 
Sbjct: 475 PKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWI--REYDQSDLRPCDLWDDS 532

Query: 184 V 184
           V
Sbjct: 533 V 533


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 58/169 (34%), Positives = 81/169 (47%), Gaps = 24/169 (14%)

Query: 4   DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
           D E G+   +  R S   +L K     V  I   I     L  E  E +QI +Y  G  Y
Sbjct: 387 DKEYGEE--TTYRISKTAWLDKEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHY 444

Query: 64  EPHFDFFRDKMNQQLG------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           EPH DF   +  + L       G+RIATVL+YLS+VE GG TVFP               
Sbjct: 445 EPHLDFIESEDKEALSEYTSRIGNRIATVLIYLSNVEAGGATVFP--------------- 489

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            + G  V+P +G A  ++++H +   +  S+H +CPV+ G KW+A  W 
Sbjct: 490 -KAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACPVLIGSKWAANLWF 537


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/167 (33%), Positives = 82/167 (49%), Gaps = 21/167 (12%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           S+ RT+   +     +++   +  RI   T       E +Q+++Y  G  Y  HFD+F  
Sbjct: 329 SKTRTAKLAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNT 388

Query: 73  KMN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
                  Q+ G RIATVL YL+ VE+GG TVFP  E+ +              AV P +G
Sbjct: 389 TKGPHITQINGDRIATVLFYLNDVEQGGATVFP--EIKK--------------AVFPKRG 432

Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
            A+++++L  D   +  +LH  CPVI G KW   KWI  R   F +P
Sbjct: 433 SAIMWYNLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQIFRRP 479


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 23/172 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RI A T L   P   E +Q+++Y 
Sbjct: 370 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYG 427

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 428 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 476

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH
Sbjct: 477 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 523


>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
 gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
          Length = 193

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 78/161 (48%), Gaps = 27/161 (16%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
           RT+ G +L K  +E+   I  RI   T     + E  Q+++Y  G  Y  H D+F     
Sbjct: 28  RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 87

Query: 71  -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
                R + +  LG  RIATVL YL+ VE+GG TVF +                 GY V 
Sbjct: 88  NHTDTRSRYSIDLG-DRIATVLFYLTDVEQGGATVFGDV----------------GYYVS 130

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           P  G A+ +++L  D + D  + H +CPVI G KW  T+WI
Sbjct: 131 PQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 171


>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
          Length = 535

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/187 (32%), Positives = 95/187 (50%), Gaps = 23/187 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D  +GK I ++ R +   +L      +V  ++ RI A T L  ++ +A+Q+ +Y  G 
Sbjct: 365 VHDPTTGKLIHAKYRITKTAWLDDRDHLVVDRVQNRIKAVTGLDLDSADALQVANYGIGG 424

Query: 62  KYEPHFDF-FRDKMN----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
            Y+PH+DF  RD  +    ++  G+RIAT L+Y++ V+ GG TVFP  +V          
Sbjct: 425 HYDPHYDFSTRDDDDTSETEKRDGNRIATFLLYMTDVDAGGATVFPIIDVR--------- 475

Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
                  V P KG A+ +++L         + H +CPV+ G KW + KWI  R   F +P
Sbjct: 476 -------VLPKKGTAVFWYNLRRSGKGIMETRHAACPVLVGTKWVSNKWIRTRGQEFRRP 528

Query: 175 EKEPEDD 181
               ED+
Sbjct: 529 CGLTEDE 535


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 87/169 (51%), Gaps = 31/169 (18%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYEHGQKYEPH 66
           K + +E R S   +L  +    V+ ++ RI+  T L  ++  GE +Q+++Y  G  YEPH
Sbjct: 122 KQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPH 181

Query: 67  FD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           FD         F+ K      G+R+ATV++YLS VE GG T F  +              
Sbjct: 182 FDHATSPSSPVFKLKT-----GNRVATVMIYLSSVEAGGSTAFIYAN------------- 223

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              ++V  MK  A+ +++LH +   D  +LH  CPV+ G+KW A KWIH
Sbjct: 224 ---FSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKWVANKWIH 269


>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
          Length = 555

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       + LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNYMSDVEAGG 485

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 486 ATVFPD----------------LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 90/198 (45%), Gaps = 42/198 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V ++++G+   +  R S   +L      ++  +  RI  +T L     E +Q+ +Y  G 
Sbjct: 371 VQNSKTGELEHATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGG 430

Query: 62  KYEPHFDFFRDKMNQQLGGH-----------------------RIATVLMYLSHVEKGGE 98
            Y+PHFDF R   N  LGGH                       RIATVL Y+S  E+GG 
Sbjct: 431 HYDPHFDFAR-IANYGLGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGA 489

Query: 99  TVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE 158
           TVF +                 G AV P K DAL +++L  D   D  + H +CPV+ G 
Sbjct: 490 TVFNHL----------------GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGV 533

Query: 159 KWSATKWIHVRN--FDKP 174
           KW + KWIH R   F +P
Sbjct: 534 KWVSNKWIHERGQEFTRP 551


>gi|307103831|gb|EFN52088.1| hypothetical protein CHLNCDRAFT_139357 [Chlorella variabilis]
          Length = 1038

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 59/143 (41%), Positives = 85/143 (59%), Gaps = 11/143 (7%)

Query: 15  VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN-GEAMQILHYEHGQKYEPHFDFFRDK 73
           +RTS G FL++AQDE+V +IE R+A WT LP EN G  +Q   + +G  ++   D   D+
Sbjct: 10  IRTSWGTFLTRAQDEVVYAIEHRVANWTHLPVENAGGVLQGKRFHYGAHWD---DLDLDE 66

Query: 74  MNQQLGGH--RIATVLMYLSHVEKGGETVFPNS----EVSQSRDGNWSECARRGYAVKPM 127
               LGG   R+ATVL+YLS  E+GGET FP+S    +  Q+    +S CA+ G A    
Sbjct: 67  NPDGLGGGSVRVATVLIYLSDAEEGGETAFPHSRWLDKEKQTAGKAFSNCAKDGVAALAR 126

Query: 128 KGDALLFFSLHPDA-STDSTSLH 149
           KG+A++F+   P +   D  S+H
Sbjct: 127 KGNAIMFWDAKPGSMRQDKWSMH 149


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 95/188 (50%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  + R S   +L  + D ++ +++ RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVDYRISKSAWLKDSVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSAVEAGGATAFIYA----------- 481

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                 +++  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 482 -----NFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-2 [Callithrix jacchus]
          Length = 579

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 390 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 449

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       + LG G+R+AT L Y+S VE GG
Sbjct: 450 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGG 509

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 510 ATVFPD----------------LGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVG 553

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 554 CKWVSNKWFHERGQEFLRP 572


>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Dasypus novemcinctus]
          Length = 556

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEENDDPVVAQVNRRMEHITGLTVKTAELLQVANYGMGG 426

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       + LG G+R+AT L Y+S VE GG
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNHEQDVFKHLGTGNRVATFLNYMSDVEAGG 486

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 487 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 530

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 531 CKWVSNKWFHERGQEFLRP 549


>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
          Length = 544

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    +  + ++  RIAA T L   P   E +Q+++Y 
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVNPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  + +S        
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
                   V  ++  AL +++LH     DS +LH  CPV+ G+KW A KWIH   + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536

Query: 174 P-EKEPED 180
           P    PED
Sbjct: 537 PCSSSPED 544


>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
          Length = 553

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 82/170 (48%), Gaps = 18/170 (10%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + E+G    +  R S   +L   + E+V  I  R+   T L     E +Q+ +Y  G 
Sbjct: 357 VHNVETGNLETASYRISKSAWLRSTEHEVVNRINRRLDLATNLEIATAEELQVQNYGIGG 416

Query: 62  KYEPHFDFFRDK--MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            YEPH D  RD+    +   G+RIAT+L+Y++  E GG TVF N + S            
Sbjct: 417 HYEPHLDCSRDEDAFERTGTGNRIATILIYMTEPEIGGRTVFINLKAS------------ 464

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
               V   K  AL +++L    + D  S H +CPV+ G KW+A KW H R
Sbjct: 465 ----VPCTKNAALFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHER 510


>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
          Length = 174

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 88/172 (51%), Gaps = 23/172 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E  K   +E R S   +L      +V ++E R+AA T L   P   E +Q+++Y 
Sbjct: 5   VVASGE--KQQKAEYRISKSAWLKDTAHPVVQTLEKRMAAVTGLDLRPPYAEYLQVVNYG 62

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD    +   + +   G+RIAT+++YLS V  GG T F ++ +S        
Sbjct: 63  LGGHYEPHFDHATSRKSPLYRMKSGNRIATLMIYLSAVGAGGSTAFVHANLS-------- 114

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                   V  +K  AL +++L  +   D  +LH  CPV+ G+KW A KWIH
Sbjct: 115 --------VPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIH 158


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 82/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+   S  R S   +L+      +A +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 367 GQHKKSAFRVSKNAWLAYEAHPTMAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 426

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP  +                +AVKP
Sbjct: 427 DFFRDPSHYPAAEGNRIATAIFYLSEVEQGGATAFPFLD----------------FAVKP 470

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 471 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 520


>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
          Length = 264

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 80/142 (56%), Gaps = 18/142 (12%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHV 93
           +E  IA    LP EN E  Q+L Y+  Q Y+ H D+  ++  QQ  G R+AT  +YL+ V
Sbjct: 133 MEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQ-RQQPCGIRVATFFLYLNDV 191

Query: 94  EKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAS-TDSTSLHGSC 152
           E+GG T FPN  ++                V+P KG+A+L++S +P+ +  DS + H + 
Sbjct: 192 EEGGGTRFPNLNLT----------------VQPAKGNAVLWYSAYPNTTRMDSRTDHEAM 235

Query: 153 PVIEGEKWSATKWIHVRNFDKP 174
           PV +G K+ A KWIH+ +F  P
Sbjct: 236 PVAKGMKYGANKWIHIHDFVTP 257


>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
 gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
          Length = 535

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + +++     +   +  ++ L  +  E +Q+ +Y  G  YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 421

Query: 67  FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F +    Q   L G+RIAT + YLS VE GG T FP   +                 
Sbjct: 422 WDSFPENHIYQEGDLHGNRIATGIYYLSDVEAGGGTAFPFLPL----------------L 465

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P KG  L +++LHP    D  + H +CPV++G KW A  WI  RN D 
Sbjct: 466 VTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515


>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
 gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
           polypeptide II, isoform 1 (predicted) [Papio anubis]
          Length = 578

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 389 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 448

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       + LG G+R+AT L Y+S VE GG
Sbjct: 449 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERHTFKHLGTGNRVATFLNYMSDVEAGG 508

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 509 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 552

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 553 CKWVSNKWFHERGQEFLRP 571


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 23/172 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
           +VA  E  K +  E R S   +L    D ++ +++ RI A T L   P   E +Q+++Y 
Sbjct: 309 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVNLDHRIGALTGLDVQPPYAEYLQVVNYG 366

Query: 59  HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            G  YEPHFD        + +   G+R+AT ++YLS VE GG T F  +           
Sbjct: 367 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 415

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                 ++V  +K  AL +++LH     D  +LH  CPV+ G+KW A KWIH
Sbjct: 416 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 462


>gi|427410040|ref|ZP_18900242.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425712173|gb|EKU75188.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 225

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 86/162 (53%), Gaps = 22/162 (13%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           ++ RTS    L + +D +V +I ARI A T L P++GE +Q   Y  GQ+Y+ H D+F  
Sbjct: 77  ADYRTSHSCNLDR-EDPLVHAISARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPV 135

Query: 73  KMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
             +     ++ GG R  T ++YLS VE GGET FP  E                + V P+
Sbjct: 136 NASYWPDMRKTGGQRNWTAMIYLSPVEGGGETHFPRCE----------------FMVPPI 179

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           +G  L++ +L PD + +  SLH + PV +G K+  TKW   R
Sbjct: 180 EGMILIWNNLKPDGAPNPYSLHAARPVAQGTKYVVTKWFRER 221


>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
           precursor (4-PH alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 527

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/142 (38%), Positives = 72/142 (50%), Gaps = 22/142 (15%)

Query: 31  VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF-----FRDKMNQQLGGHRIAT 85
           VA I  RI+  T L     E +Q+ +Y  G +Y PHFD       RD +  Q  G RIAT
Sbjct: 378 VAKITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQ-DGERIAT 436

Query: 86  VLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDS 145
            L+YLS VE GG T F N+                G + KP+KG A+ ++++ P    D 
Sbjct: 437 FLIYLSDVEVGGRTAFVNA----------------GVSAKPIKGSAVFWYNVFPSGEPDL 480

Query: 146 TSLHGSCPVIEGEKWSATKWIH 167
            + HG+CPV  G KW+  KWI 
Sbjct: 481 RTYHGACPVAFGNKWAGNKWIR 502


>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callithrix jacchus]
          Length = 555

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       + LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGG 485

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548


>gi|381200505|ref|ZP_09907642.1| procollagen-proline dioxygenase [Sphingobium yanoikuyae XLDN2-5]
          Length = 221

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 58/162 (35%), Positives = 86/162 (53%), Gaps = 22/162 (13%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           ++ RTS    L + +D +V +I ARI A T L P++GE +Q   Y  GQ+Y+ H D+F  
Sbjct: 73  ADYRTSHSCNLDR-EDPLVHAISARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPV 131

Query: 73  KMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
             +     ++ GG R  T ++YLS VE GGET FP  E                + V P+
Sbjct: 132 NASYWPEMRKTGGQRNWTAMIYLSPVEGGGETHFPRCE----------------FMVPPI 175

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           +G  L++ +L PD + +  SLH + PV +G K+  TKW   R
Sbjct: 176 EGMILIWNNLKPDGAPNPYSLHAARPVAQGTKYVVTKWFRER 217


>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callicebus moloch]
          Length = 555

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       + LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGG 485

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548


>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
 gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
          Length = 535

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + +++     +   +  ++ L  +  E +Q+ +Y  G  YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSKNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPH 421

Query: 67  FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F +    Q   L G+RIAT + YLS VE GG T FP   +                 
Sbjct: 422 WDSFPENHIYQEGDLHGNRIATGIYYLSDVEAGGGTAFPFLPL----------------L 465

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P KG  L +++LHP    D  + H +CPV++G KW A  WI  RN D 
Sbjct: 466 VTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+   S  R S   +L  +    +  +   ++  T L     E +Q+ +Y  G  YEPH+
Sbjct: 364 GQRRKSAFRVSKNAWLPYSTHPTMGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHW 423

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP                   +AV+P
Sbjct: 424 DFFRDSRHYPAAEGNRIATAIFYLSDVEQGGATAFPFL----------------NFAVRP 467

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH  +  D  + H  CPV++G KW A  WIH   + F +P
Sbjct: 468 QLGNILFWYNLHRSSDMDFRTKHAGCPVLKGSKWIANIWIHEATQTFARP 517


>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
          Length = 150

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 75/141 (53%), Gaps = 24/141 (17%)

Query: 34  IEARIAAWTFLPPEN-GEAMQILHYEHGQKYEPHFDFFRDK------MNQQLGGHRIATV 86
           +  R+++ T L  E   E  Q+  Y  G  YEPHFDF + K      +N+Q+G  RIAT 
Sbjct: 14  LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFDFSKVKYFTNPVLNEQMGD-RIATF 72

Query: 87  LMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDST 146
           ++YL+ VE GG TVFP                R    ++P+K  A+ + +L  D   D  
Sbjct: 73  MIYLNDVEAGGRTVFP----------------RLNLVIEPIKNSAVFWHNLLDDGQQDDR 116

Query: 147 SLHGSCPVIEGEKWSATKWIH 167
           ++HG+CPV+ G KW A KWIH
Sbjct: 117 TIHGACPVVLGRKWVANKWIH 137


>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
          Length = 505

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 84/169 (49%), Gaps = 22/169 (13%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 355 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 414

Query: 62  KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
           +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP+              
Sbjct: 415 QYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 460

Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
              G A+ P KG A+ +++L      D  + H +CPV+ G KW   KW+
Sbjct: 461 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWG--KWL 505


>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Otolemur garnettii]
          Length = 555

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 92/199 (46%), Gaps = 42/199 (21%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L +  D +VA +  R+   T L  +  E +Q+ +Y  G 
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGG 425

Query: 62  KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
           +YEPHFDF R   +                       ++LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRVATFLNYNHERDAFKRLGTGNRVATFLNYMSDVEAGG 485

Query: 98  ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
            TVFP+                 G A+ P KG A+ +++L      D  + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529

Query: 158 EKWSATKWIHVRN--FDKP 174
            KW + KW H R   F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548


>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
 gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
          Length = 515

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 49/139 (35%), Positives = 77/139 (55%), Gaps = 22/139 (15%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM-----NQQLGGHRIATVLM 88
           I  RI+  T    E   A+Q+ ++  G  ++PH+DF+ D++     N  LG  RI +++ 
Sbjct: 377 INQRISDMTGFKLEEFPAIQLANFGVGGYFKPHYDFYTDRLKEVDVNNTLGD-RIGSIIF 435

Query: 89  YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
           Y   V +GG+TVFP+ +V                AV+P KG+AL +F+   D++ D  SL
Sbjct: 436 YAGEVSQGGQTVFPDLKV----------------AVEPKKGNALFWFNAFDDSTPDPRSL 479

Query: 149 HGSCPVIEGEKWSATKWIH 167
           H  CPV+ G +W+ TKW+H
Sbjct: 480 HSVCPVLVGSRWTITKWLH 498


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 78/161 (48%), Gaps = 27/161 (16%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
           RT+ G +L K  +E+   I  RI   T     + E  Q+++Y  G  Y  H D+F     
Sbjct: 369 RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 428

Query: 71  -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
                R + +  LG  RIATVL YL+ VE+GG TVF +                 GY V 
Sbjct: 429 NHTDTRSRYSIDLGD-RIATVLFYLTDVEQGGATVFGDV----------------GYYVS 471

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           P  G A+ +++L  D + D  + H +CPVI G KW  T+WI
Sbjct: 472 PQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 512


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 89/172 (51%), Gaps = 23/172 (13%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +VA+  SG+ +  + RTS G + +K ++ +VA+I+ RIA  T  P  + E +QIL+Y  G
Sbjct: 156 VVANRGSGEFV-DDTRTSYGAYFNKGENSLVATIQRRIAELTRWPLTHAEPLQILNYGLG 214

Query: 61  QKYEPHFDFFRDKM-----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
            +Y PHFD+F  +        + GG RIATV+MYL+ VE GG T+FP+  +         
Sbjct: 215 GEYLPHFDYFEPQQPGLPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLNLE-------- 266

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                    +P KG A ++FS     +    S   +   I   KW AT+W  
Sbjct: 267 --------TRPRKGGA-IYFSYQLAVARSIRSRCMAARRIARRKWIATQWFR 309


>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
 gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
          Length = 220

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 54/159 (33%), Positives = 83/159 (52%), Gaps = 24/159 (15%)

Query: 16  RTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
           R+  G+F+ + ++E  +  +I  ++  +  +  ++ E MQI+ Y  G++   H+D+F   
Sbjct: 69  RSGWGLFMKEGEEEHPVTKNIFNKMKNFVNIS-DSCEVMQIIRYNPGEETSAHYDYFNPL 127

Query: 72  ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
                M   L G RI T+LMYL  VE+GGET FP                  G  VKP++
Sbjct: 128 TTNGSMKIGLYGQRICTILMYLCDVEEGGETSFPEV----------------GIKVKPIR 171

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           GDA+LF++  P+   D  SLH   PV +G KW A K I+
Sbjct: 172 GDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKWVAIKLIN 210


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 56/161 (34%), Positives = 78/161 (48%), Gaps = 27/161 (16%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
           RT+ G +L K  +E+   I  RI   T     + E  Q+++Y  G  Y  H D+F     
Sbjct: 369 RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 428

Query: 71  -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
                R + +  LG  RIATVL YL+ VE+GG TVF +                 GY V 
Sbjct: 429 NHTDTRSRYSIDLGD-RIATVLFYLTDVEQGGATVFGDV----------------GYYVS 471

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           P  G A+ +++L  D + D  + H +CPVI G KW  T+WI
Sbjct: 472 PQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 512


>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
          Length = 341

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 94/202 (46%), Gaps = 45/202 (22%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   +  R S   +L   + E++ ++  R+   T L     E +Q+++Y  G 
Sbjct: 144 VQNYKTGELEFANYRISKSAWLKDTEHEVIRTVNQRVEDMTGLTMATAEELQVVNYGIGG 203

Query: 62  KYEPHFDFFRDKMN---QQLG-GHRIATVLMY-----------------------LSHVE 94
            YEPHFDF R +     + LG G+RIATVL Y                       +S V 
Sbjct: 204 HYEPHFDFARREERNAFKSLGTGNRIATVLFYVSDLCLCHTSHTNADFRFLSVGQMSDVT 263

Query: 95  KGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPV 154
           +GG TVFP+  +                A++P KG A  + +LH   + D  + H +CPV
Sbjct: 264 QGGATVFPSLNL----------------ALRPRKGTAAFWHNLHASGNGDYATRHAACPV 307

Query: 155 IEGEKWSATKWIHVR--NFDKP 174
           + G KW + KWIH R   F +P
Sbjct: 308 LTGTKWVSNKWIHERGQEFRRP 329


>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
 gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
          Length = 515

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 49/139 (35%), Positives = 77/139 (55%), Gaps = 22/139 (15%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM-----NQQLGGHRIATVLM 88
           I  RI+  T    E   A+Q+ ++  G  ++PH+D++ D++     N  LG  RI +++ 
Sbjct: 377 INQRISDMTGFKLEEFPAIQLANFGVGGYFKPHYDYYTDRLKEVDVNNTLGD-RIGSIIF 435

Query: 89  YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
           Y   V +GG+TVFP+ +V                AV+P KG+AL +F+   D+S D  +L
Sbjct: 436 YAGEVSQGGQTVFPDLKV----------------AVEPKKGNALFWFNAFDDSSPDPRTL 479

Query: 149 HGSCPVIEGEKWSATKWIH 167
           H  CPVI G +W+ TKW+H
Sbjct: 480 HSVCPVIVGSRWTITKWLH 498


>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
           ANT32C12]
          Length = 186

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 78/158 (49%), Gaps = 24/158 (15%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
           RT+S  ++     EI+  +  R +    +P  N E  Q++HY  G +Y+PHFD F     
Sbjct: 40  RTNSYAWIQHDASEIIHEVSKRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTE 99

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
             + N   GG R+ T L YL+ VE GG T FP+  VS                VKP KGD
Sbjct: 100 EGRNNWFPGGQRMVTALAYLNDVEDGGATDFPDIHVS----------------VKPNKGD 143

Query: 131 ALLFFSLHPDASTD--STSLHGSCPVIEGEKWSATKWI 166
            ++F +   D ++D    SLHG  PVI GEKW+   W 
Sbjct: 144 VVVFHNC-KDGTSDINPNSLHGGSPVISGEKWAVNLWF 180


>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
          Length = 305

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 59/167 (35%), Positives = 81/167 (48%), Gaps = 22/167 (13%)

Query: 1   MVADNESGKSIASEVRTS-SGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           MV D  SG+ +   VRTS  G+F    +D ++ +I  RIAA +      GE + +L Y  
Sbjct: 150 MVIDPRSGRPMPHPVRTSDGGIFGPAREDLVIQAINRRIAAASGTMLSGGEPLTLLRYAV 209

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           GQ+Y  H D      NQ     R  T+L+YL+    GGET+FP                R
Sbjct: 210 GQQYRQHHDCLPHVRNQ-----RAWTMLIYLNEGYAGGETIFP----------------R 248

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            G +VK  KGDALLF +          ++H   PV+ G+KW  T+WI
Sbjct: 249 LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLGAPVMAGQKWLCTRWI 295


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 82/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+   S  R S   +L+      +  +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 367 GQHKKSAFRVSKNAWLAYESHPTMVGMLRDLKEATGLDTTYCEQLQVANYGVGGHYEPHW 426

Query: 68  DFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +  +  G+RIAT + YLS VE+GG T FP  ++                AVKP
Sbjct: 427 DFFRDPNHYPEEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 470

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 471 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 520


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 58/167 (34%), Positives = 81/167 (48%), Gaps = 28/167 (16%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE----NGEAMQILHYEHGQKYEPHF 67
            S VRTS   +L +    ++  +  RI   T L  +      E +Q+ +Y  G  Y PH 
Sbjct: 359 VSNVRTSKTAWLPEGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHH 418

Query: 68  DFF-RDKMNQQL-------GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           D+  +DK + +         G RIAT + YL+ VE+GG T FP                R
Sbjct: 419 DYLMKDKADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFP----------------R 462

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            G AVKP+KG A  +F+L      D  +LHG+CPV+ G KW + KWI
Sbjct: 463 AGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWI 509


>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
          Length = 551

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 54/154 (35%), Positives = 78/154 (50%), Gaps = 18/154 (11%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR--DK 73
           RTS   +L   + E+V  I  R+   T L  E  E +Q+ +Y  G  YEPH+D  R  + 
Sbjct: 372 RTSQSSWLGSTEHEVVKRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSRRENV 431

Query: 74  MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
             +   G+RIAT+L+Y++  E GG TVF + + S S       C          K  AL 
Sbjct: 432 FEKTKNGNRIATILIYMTEPEIGGGTVFIDLKTSVS-------CT---------KNAALF 475

Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           +++L    + D  S H +CPV+ G KW+A KW H
Sbjct: 476 WYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFH 509


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 89/176 (50%), Gaps = 26/176 (14%)

Query: 10  SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           S+ S  RTS   F+ K + +++ +I+ R+A  T L     E  Q+ +Y  G  Y  H D+
Sbjct: 366 SVVSNARTSQFTFIPKTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDW 425

Query: 70  F-------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           F       +   N ++G +RIATVL YL+ VE+GG T FP  +                 
Sbjct: 426 FSPNAFETKQVANSEMG-NRIATVLFYLTDVEQGGGTAFPVLK----------------Q 468

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +KP K  A  +++LH   + D  ++HG+CP+I G KW   +WI  R F + ++ P
Sbjct: 469 LLKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKWVLNRWI--REFVQSDRRP 522


>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
 gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
          Length = 570

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 81/164 (49%), Gaps = 22/164 (13%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF-- 70
           SE RT S  +L   +D++V S+  RI+    LP E  E+MQI+HY   Q+Y PHFD F  
Sbjct: 60  SEGRTGSNHWLKYDEDDVVQSVGQRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNL 119

Query: 71  ---RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
              + +   + GG R+ T L+YL+ VE GG T FP                + G  V  +
Sbjct: 120 SLPKGQRAAKWGGQRLVTALVYLNKVEAGGATQFP----------------KLGITVPAL 163

Query: 128 KGDALLFFSLHPDAS-TDSTSLHGSCPVIEGEKWSATKWIHVRN 170
            G  ++F +   D S     SLH   PV  GEKW+   W  +++
Sbjct: 164 PGRMVIFHNTTHDISGPHPLSLHAGMPVEAGEKWAFNMWFRLQD 207


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 82/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G++  S  R S   +L+      +  +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 367 GQNKKSAFRVSKNAWLAYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHW 426

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP  +                +AVKP
Sbjct: 427 DFFRDPNHYPAEEGNRIATAIFYLSDVEQGGATAFPFLD----------------FAVKP 470

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 471 QLGNVLFWYNLHRSLDMDYRTKHAGCPVLKGSKWIGNVWIHDMTQTFARP 520


>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
          Length = 535

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/170 (32%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + +++     +   +  ++ L  +  E +Q+ +Y  G  YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 421

Query: 67  FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F +    Q   L G+R+AT + YLS VE GG T FP   +                 
Sbjct: 422 WDSFPENHIYQEGDLHGNRMATGIYYLSDVEAGGGTAFPFLPL----------------L 465

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P +G  L +++LHP    D  + H +CPV++G KW A  WI  RN D 
Sbjct: 466 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515


>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
 gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
          Length = 448

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/156 (35%), Positives = 80/156 (51%), Gaps = 20/156 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWT---FLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           RTS   F +  Q   V  +  R+   T    L   + + + +L+Y    +Y  H D+F  
Sbjct: 295 RTSMSAFQTDHQYTAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGP 354

Query: 73  KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
             ++ +  G RIATVL YL+ VE+GG+TVFP                R G    PMKG A
Sbjct: 355 AYSEYIQRGDRIATVLFYLNDVEQGGKTVFP----------------RLGIFRSPMKGSA 398

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           ++F++L+     D  + HG CPV+ G KW+ATKWI+
Sbjct: 399 VVFYNLNSSLQGDPRTEHGGCPVLVGTKWAATKWIY 434


>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
 gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
          Length = 455

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 81/156 (51%), Gaps = 20/156 (12%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWT---FLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
           RTS   F +  Q + V  +  R+   T    L   + + + +L+Y    +Y  H D+F  
Sbjct: 302 RTSMSAFQTDHQYKAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGP 361

Query: 73  KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
             ++ +  G RIATVL YL+ VE+GG+TVFP                R G    PMKG A
Sbjct: 362 AYSEYIQRGDRIATVLFYLNDVEQGGKTVFP----------------RLGIFRSPMKGSA 405

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           ++F++++     D  + HG CPV+ G KW+ATKWI+
Sbjct: 406 VVFYNMNSSLQGDPRTEHGGCPVLVGTKWAATKWIY 441


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+   S  R S   +L+      +  +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 297 GQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 356

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP  ++                AVKP
Sbjct: 357 DFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 400

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 401 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 450


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 86/176 (48%), Gaps = 24/176 (13%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           +S+ S VRTS   F+  +  +++++I+ R+A  T L  +  E  Q  +Y  G  Y  H D
Sbjct: 365 ESLVSNVRTSQFTFIPASAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMD 424

Query: 69  -FFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
            F++   +  L      G+RIATVL YLS V +GG T FP                    
Sbjct: 425 WFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRT---------------- 468

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
            +KP K  A  + +LH     D  + HG+CP+I G KW   +WI  R FD+ ++ P
Sbjct: 469 LLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWI--REFDQSDRRP 522


>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
          Length = 409

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + +++     +   +  ++ L  +  E +Q+ +Y  G  YEPH
Sbjct: 236 NGGSTAAAFRTSQGASFNYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 295

Query: 67  FDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F +    Q G   G+R+AT + YL+ VE GG T FP   +                 
Sbjct: 296 WDSFPENHIYQEGDLHGNRMATGIYYLADVEAGGGTAFPFLPL----------------L 339

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P +G  L +++LHP    D  + H +CPV++G KW A  WI  RN D 
Sbjct: 340 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 389


>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
          Length = 478

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 79/164 (48%), Gaps = 45/164 (27%)

Query: 50  EAMQILHYEHGQKYEPHFDFF-----------RDKMNQQL-----GGHRIATVLMYLSHV 93
           + +Q+LHYE  Q Y+PH D+F            D  +  +     G +R ATV +YL++ 
Sbjct: 232 DGLQVLHYERPQWYKPHVDYFTSRNAGGGGASEDAFSNAIPTANNGTNRFATVFLYLNNA 291

Query: 94  EKGGETVFPNS---EVSQS-----------------RDGNW-----SECARRGYAVKPMK 128
             GGETVFP S   E+ Q                   D  W     SE  R    V P  
Sbjct: 292 GSGGETVFPLSTTHEIYQGGRLTQAGTNRTPGFIRDADAAWVCDTKSEALR----VTPRT 347

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           GD++LF+S   DAS D  SLHGSCP+ +GEKW+A  W+  R  D
Sbjct: 348 GDSVLFYSQRGDASLDGYSLHGSCPMGDGEKWAANLWVWNRPRD 391


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+   S  R S   +L+      +  +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 116 GQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 175

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP  ++                AVKP
Sbjct: 176 DFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 219

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 220 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 269


>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 215

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/181 (34%), Positives = 94/181 (51%), Gaps = 19/181 (10%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
           A++ RTS+  +L  +   +V +I+ R A    +P  + E++Q+L YE  Q Y+ H D+F 
Sbjct: 44  ATDWRTSTTYWLDSSSHPVVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFS 103

Query: 72  --------DKMNQQLGGH--RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
                   D + +   G+  R+ TV  Y+S V KGG T F  S     R  +  +C++ G
Sbjct: 104 AERHRNSPDVLKRIEYGYKNRMITVFWYMSDVAKGGHTNFARSG-GLPRPSSNKDCSQ-G 161

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
            +V P K   ++F+S+ P+   D  SLH  CPV EG K S  KWI    ++KP     DD
Sbjct: 162 ISVAPKKRKVVVFYSMLPNGEGDPMSLHAGCPVEEGIKLSGNKWI----WNKPR---SDD 214

Query: 182 D 182
           D
Sbjct: 215 D 215


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 19/128 (14%)

Query: 50  EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
           E +Q+ +Y  G  YEPH+DFFRD  +     G+RIAT + YLS VE+GG T FP  ++  
Sbjct: 409 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI-- 466

Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
                         AVKP  G+ L +++LH     D  + H  CPV++G KW    WIH 
Sbjct: 467 --------------AVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHE 512

Query: 168 -VRNFDKP 174
             + F +P
Sbjct: 513 VTQTFARP 520


>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
          Length = 187

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+   S  R S   +L+      +  +   +   T L     E +Q+ +Y  G  YEPH+
Sbjct: 17  GQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 76

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+RIAT + YLS VE+GG T FP  ++                AVKP
Sbjct: 77  DFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 120

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 121 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 170


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 19/128 (14%)

Query: 50  EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
           E +Q+ +Y  G  YEPH+DFFRD  +     G+RIAT + YLS VE+GG T FP  ++  
Sbjct: 409 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI-- 466

Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
                         AVKP  G+ L +++LH     D  + H  CPV++G KW    WIH 
Sbjct: 467 --------------AVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHE 512

Query: 168 -VRNFDKP 174
             + F +P
Sbjct: 513 VTQTFARP 520


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 19/128 (14%)

Query: 50  EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
           E +Q+ +Y  G  YEPH+DFFRD  +     G+RIAT + YLS VE+GG T FP  ++  
Sbjct: 409 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI-- 466

Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
                         AVKP  G+ L +++LH     D  + H  CPV++G KW    WIH 
Sbjct: 467 --------------AVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHE 512

Query: 168 -VRNFDKP 174
             + F +P
Sbjct: 513 VTQTFARP 520


>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
          Length = 190

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 22/165 (13%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE------NGEAMQILHYEHGQ 61
           G    S+ RTS   +L ++   I+ +I  R      +  +      N E +Q++ Y+  Q
Sbjct: 34  GGGFTSKTRTSENGWLRRSASPILENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQ 93

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +Y PH DF  D   QQ    R  T+L+Y+   E+GG T FP     ++ DG        G
Sbjct: 94  EYAPHHDFGDDGTPQQ----RFLTLLLYIQLPEEGGATSFP-----KANDG-------MG 137

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             V P +GDA+LF+S+ PD + D  +LH   PV +G+KW    W+
Sbjct: 138 VQVVPARGDAVLFYSMLPDGNADDLALHAGMPVRKGQKWVCNLWV 182


>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
          Length = 562

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 86/167 (51%), Gaps = 24/167 (14%)

Query: 22  FLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPHFDFFRDK---MNQ 76
           +L    D ++ +++ RIAA T L   P   E +Q+++Y  G  YEPHFD        + +
Sbjct: 412 WLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYR 471

Query: 77  QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFS 136
              G+R+AT ++YLS VE GG T F  +                 ++V  +K  AL +++
Sbjct: 472 MKSGNRVATFMIYLSSVEAGGATAFIYA----------------NFSVPVVKNAALFWWN 515

Query: 137 LHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP-EKEPED 180
           LH     +S +LH  CPV+ G+KW A KWIH   + F +P    PED
Sbjct: 516 LHRSGEGNSDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPCTSSPED 562


>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
          Length = 475

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/169 (34%), Positives = 82/169 (48%), Gaps = 27/169 (15%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE------NGEAMQIL 55
           V D+  G+S     R SS  F++ + D +VAS+  R++  T L  E        E++Q+L
Sbjct: 214 VLDDTGGESFFDVSRLSSTAFVNDSND-LVASLNRRVSKLTGLQTEVLDSFSESESLQVL 272

Query: 56  HYEHGQKYEPHFDFFRDKMNQ----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD 111
            Y  G  Y PH+D    + +     Q  G RIAT ++YL     GG TVFP   +S    
Sbjct: 273 RYGPGGLYTPHYDTLGSEADLPPYIQHTGDRIATFILYLDIATAGGATVFPLLPMS---- 328

Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKW 160
                       +   KG A  +F+LHPD S D  +LH +CPVI G KW
Sbjct: 329 ------------IPIQKGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKW 365


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 79/171 (46%), Gaps = 27/171 (15%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E G    +  RT+ G +  K  +E+   I  RI   T     + E  Q+++Y  G  Y  
Sbjct: 359 EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLL 418

Query: 66  HFDFF----------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
           H D+F          R   +  LG  RIATVL YL+ VE+GG TVF              
Sbjct: 419 HMDYFDFASSNHTDTRSSYSMDLGD-RIATVLFYLTDVEQGGATVF-------------- 463

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             A  GY+V P  G A+ +++L  +   D  + H +CPVI G KW  T+WI
Sbjct: 464 --ADVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKWVMTEWI 512


>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
          Length = 535

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G S A+  RTS G   + +++     +   +  ++ L  +  E +Q+ +Y  G  YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 421

Query: 67  FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           +D F +    Q   L G+R+AT + YL+ VE GG T FP   +                 
Sbjct: 422 WDSFPENHIYQEGDLHGNRMATGIYYLADVEAGGGTAFPFLPL----------------L 465

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
           V P +G  L +++LHP    D  + H +CPV++G KW A  WI  RN D 
Sbjct: 466 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515


>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
 gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
          Length = 314

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 78/171 (45%), Gaps = 22/171 (12%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQ-DEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           + D ++G      VRTS G  LS  + D +V  +  RIAA T      GE + IL Y   
Sbjct: 159 ILDPQTGARRPDPVRTSVGAALSPVEEDLVVGMLNRRIAAATGTDRMQGEPLHILRYSGA 218

Query: 61  QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
           Q+Y PH D      NQ     R  T+++YL+   +GGET FP                  
Sbjct: 219 QEYRPHHDAVAGLENQ-----RSHTLIVYLTADYEGGETAFPEL---------------- 257

Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           G+ ++  +GDALLF +L  D   D    H   P   G KW AT+WI  R +
Sbjct: 258 GFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSGAKWIATRWIRTRPY 308


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/173 (32%), Positives = 88/173 (50%), Gaps = 24/173 (13%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF- 70
            S+VRTS   F+ K + +++ +I+ R+A  + L  +  E  Q  +Y  G  Y  H D+F 
Sbjct: 368 VSKVRTSQFTFIPKTRHKVLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFG 427

Query: 71  RDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           +D  + +L      G+RIATVL YLS V +GG T FP+ +                  ++
Sbjct: 428 QDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLK----------------QLLQ 471

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
           P K  A  + +LH     D  +LHG+CP+I G KW   +WI  R F + ++ P
Sbjct: 472 PKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKWVQNRWI--REFIQADRRP 522


>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
 gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
          Length = 539

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 78/165 (47%), Gaps = 19/165 (11%)

Query: 11  IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           I++  RTS G         I+  +   +A  + L   + E +QI +Y  G  YEPH D F
Sbjct: 370 ISANFRTSQGTTFEYTDHPIMQKMSHHVAEISGLDMRSAEPLQIANYGIGGHYEPHMDSF 429

Query: 71  RDKMNQQLGGH---RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
            D  +  L  +   R+AT + YLS+VE GG T FP   +                 V P 
Sbjct: 430 PDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPFLPL----------------LVTPE 473

Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
           +G  L +++LHP    D  + H +CPV++G KW A  WI + N D
Sbjct: 474 RGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVWIRLSNQD 518


>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 322

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 22/167 (13%)

Query: 1   MVADNESGKSIASEVRTS-SGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
           MV D  SG+ +   +RTS  G+F    +D ++ +I  RIAA +      GE + +L Y  
Sbjct: 167 MVIDPRSGRPMPHPIRTSDGGIFGPAREDLVIQAINRRIAAASGTMLSGGEPLTLLRYAV 226

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           GQ+Y  H D      NQ     R  T+L+YL+    GGET+FP                R
Sbjct: 227 GQQYRQHHDCLPHVRNQ-----RAWTMLIYLNEGYAGGETIFP----------------R 265

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            G +VK  KG+ALLF +          ++H   PV+ G+KW  T+WI
Sbjct: 266 LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLGAPVMAGQKWLCTRWI 312


>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Strongylocentrotus purpuratus]
          Length = 121

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 48/118 (40%), Positives = 65/118 (55%), Gaps = 17/118 (14%)

Query: 50  EAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS 109
           E +QI +Y  G  Y PHFDF RD    +  G+RIA++L YLS V KGG+TVF ++     
Sbjct: 5   EFLQIANYGLGGHYLPHFDFTRDVATHK-NGNRIASMLFYLSDVAKGGDTVFIDA----- 58

Query: 110 RDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                      G  +KP KG A+ +++L  +   D  + H SCPVI G KW A  W+H
Sbjct: 59  -----------GAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMH 105


>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
 gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
          Length = 480

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 84/164 (51%), Gaps = 19/164 (11%)

Query: 13  SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
            + RTSS   L   QD ++  I+ +I  +  + P   E +Q  HY+ GQ+++PH D+F  
Sbjct: 133 QQFRTSSTCHLGNMQDPVIRKIDLQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEP 192

Query: 73  KMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
                 G   G R  T ++YL+ VE+GG+TVFP   +              G+  K  KG
Sbjct: 193 YELAHYGGIQGQRTYTFMIYLNEVEQGGDTVFPELAI--------------GFKAK--KG 236

Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
            A+++ +++PD S +  +LH   PV +GEK   TKW    + ++
Sbjct: 237 MAVIWNNINPDGSVNYQTLHQGMPVQKGEKLIITKWFRQHSLEQ 280


>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
 gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
          Length = 515

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 82/148 (55%), Gaps = 24/148 (16%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM-----NQQLGGHRIATVLM 88
           I  RI+  T    E   A+Q+ ++  G  ++PH+D++ +++     N  LG  R+A++++
Sbjct: 377 INDRISDMTGFKVEEFPAIQLANFGVGGYFKPHYDYYTERLKELDANNTLGD-RLASIII 435

Query: 89  YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
           Y   V +GG+TVFP+ +V                AV+P KG AL +F+   D+S D  SL
Sbjct: 436 YAGEVSQGGQTVFPDIKV----------------AVEPKKGKALFWFNDFDDSSPDPRSL 479

Query: 149 HGSCPVIEGEKWSATKWIHV--RNFDKP 174
           H  CPVI G +W+ TKW+H   + F KP
Sbjct: 480 HSVCPVIVGSRWTITKWLHYAPQMFVKP 507


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 55/169 (32%), Positives = 86/169 (50%), Gaps = 31/169 (18%)

Query: 9   KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYEHGQKYEPH 66
           K   +E R S   +L  +    V+ ++ +I+  T L  ++  GE +Q+++Y  G  YEPH
Sbjct: 377 KQATAEYRISKSAWLKGSAHSTVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPH 436

Query: 67  FD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
           FD         F+ K      G+R+AT ++YLS VE GG T F  +              
Sbjct: 437 FDHATSPSSPVFKLKT-----GNRVATFMIYLSSVEAGGSTAFIYA-------------- 477

Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
              ++V  MK  A+ +++LH +   D+ +LH  CPV+ G+KW A KWIH
Sbjct: 478 --NFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLIGDKWVANKWIH 524


>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
           mulatta]
          Length = 512

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 56/175 (32%), Positives = 81/175 (46%), Gaps = 36/175 (20%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D E+GK   ++ R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G 
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +YEPHFDF R                  +S V  GG TVFP                  G
Sbjct: 425 QYEPHFDFAR------------------MSDVSAGGATVFPEV----------------G 450

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
            +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+H R   F +P
Sbjct: 451 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 505


>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
           anophagefferens]
          Length = 148

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/159 (37%), Positives = 79/159 (49%), Gaps = 15/159 (9%)

Query: 13  SEVRTSSGMFLSKA--QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           S  RTS   + + A   +    ++ ARI   T +P EN E+ Q+L Y HGQ+Y  H D  
Sbjct: 1   STSRTSENAWCTGACESNRATRAVMARIEEVTGVPKENYESFQVLRYTHGQQYRAHHDMS 60

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
           R   N    G RI T  MY S VEKGGET FP  +    +             + P +G 
Sbjct: 61  RGD-NALACGPRIYTFFMYFSDVEKGGETEFPMVKRPSGKT----------VKIAPKRGS 109

Query: 131 ALLFFSLHPDAST--DSTSLHGSCPVIEGEKWSATKWIH 167
           ALL+ S+  D  T  D  + H + PV+EG K++A  WIH
Sbjct: 110 ALLWPSVTSDDPTAQDPRTRHAALPVVEGTKFAANAWIH 148


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 57/171 (33%), Positives = 79/171 (46%), Gaps = 27/171 (15%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           E G    +  RT+ G +  K  +E+   I  RI   T     + E  Q+++Y  G  Y  
Sbjct: 359 EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLL 418

Query: 66  HFDFF----------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
           H D+F          R   +  LG  RIATVL YL+ VE+GG TVF              
Sbjct: 419 HMDYFDFASSNHTDTRSGYSMDLGD-RIATVLFYLTDVEQGGATVF-------------- 463

Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
             A  GY+V P  G A+ +++L  +   D  + H +CPVI G KW  T+WI
Sbjct: 464 --ADVGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKWVMTEWI 512


>gi|334343683|ref|YP_004552235.1| procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
 gi|334100305|gb|AEG47729.1| Procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
          Length = 225

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 57/165 (34%), Positives = 88/165 (53%), Gaps = 22/165 (13%)

Query: 10  SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
           S  ++ RTS+   LS   D +V+++  RI A T +  ++GE +Q   Y  GQ+Y+PH+D+
Sbjct: 74  SANADYRTSASCNLSP-WDPLVSAVSDRICALTGIAADHGETLQGQRYHPGQEYKPHWDY 132

Query: 70  FRDKMNQ-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
           F    N      + GG R  T ++YLS VE GGET FP+ E                + V
Sbjct: 133 FPVTANYWPAMLKTGGQRSWTAMIYLSPVEAGGETHFPHCE----------------FMV 176

Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            P++G  L++ ++  D S + +SLH + PV +G K+  TKW   R
Sbjct: 177 PPVEGMLLIWNNMDRDGSPNGSSLHAARPVEQGTKYVVTKWFRER 221


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 57/168 (33%), Positives = 87/168 (51%), Gaps = 23/168 (13%)

Query: 13  SEVRTSSGMFLSKAQDEIVASI--EARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           S++R S   +     D IV ++   AR  A     P + E +Q+ +Y  G  Y  H+D+ 
Sbjct: 363 SKIRISQNAWFENEHDPIVETLNQRARDMAGGLNEP-SYELLQVNNYGLGGFYSIHYDWS 421

Query: 71  R--DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
              +    +  G+RIAT++ YLS V++GG TVFP                R   AV+P K
Sbjct: 422 TSANPFPNKGMGNRIATLMFYLSDVQEGGSTVFP----------------RLNLAVRPRK 465

Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
           G A+ +++LH +   +  +LH +CPV+ G KW A KWIH R+  F +P
Sbjct: 466 GTAIFWYNLHRNGKGNKKTLHAACPVLIGSKWVANKWIHERHQEFVRP 513


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G++  S  R S   +L+      +  +   ++  T L     E +Q+ +Y  G  YEPH+
Sbjct: 366 GQNKKSSFRVSKNAWLAYETHPTMGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHW 425

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFR+  +     G+RIAT + YLS VE+GG T FP                   +AV+P
Sbjct: 426 DFFRNPDHYPAEEGNRIATAIYYLSEVEQGGATAFP----------------FLNFAVRP 469

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L +++LH  +  D  + H  CPV++G KW    WIH   + F +P
Sbjct: 470 QLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 519


>gi|224107311|ref|XP_002314441.1| predicted protein [Populus trichocarpa]
 gi|222863481|gb|EEF00612.1| predicted protein [Populus trichocarpa]
          Length = 84

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 49/87 (56%), Positives = 56/87 (64%), Gaps = 3/87 (3%)

Query: 137 LHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKA 196
           LHP A  D +SLH  CPVIEGEKWSATKWIHV +FDK  +     +C D++ +C  WA  
Sbjct: 1   LHPTAVPDISSLHAGCPVIEGEKWSATKWIHVDSFDKNVE--AGGNCTDQNESCERWAAL 58

Query: 197 GECKKNPLYMVGSKSSRGYCRKSCKVC 223
           GE  KN  Y VGS    GYCR S KVC
Sbjct: 59  GERTKNTEYTVGSPDLPGYCRSS-KVC 84


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G++  S  R S   +L+      +  + + ++  T L     E +Q+ +Y  G  YEPH+
Sbjct: 364 GQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHW 423

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+R+AT + YLS VE+GG T FP                   +AVKP
Sbjct: 424 DFFRDPDHYPAEEGNRMATAIFYLSDVEQGGATAFPF----------------LNFAVKP 467

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L ++++H     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 468 QLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKWIGNVWIHEATQTFARP 517


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G++  S  R S   +L+      +  + + ++  T L     E +Q+ +Y  G  YEPH+
Sbjct: 364 GQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHW 423

Query: 68  DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DFFRD  +     G+R+AT + YLS VE+GG T FP                   +AVKP
Sbjct: 424 DFFRDPDHYPAEEGNRMATAIFYLSDVEQGGATAFPF----------------LNFAVKP 467

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             G+ L ++++H     D  + H  CPV++G KW    WIH   + F +P
Sbjct: 468 QLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKWIGNVWIHEATQTFARP 517


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 90/177 (50%), Gaps = 33/177 (18%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
           +VA  E+  ++  E R S   +L  ++  IV  ++ RI+  T L   P   E +Q+++Y 
Sbjct: 348 VVASGENQATV--EYRISKSAWLKGSESCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYG 405

Query: 59  HGQKYEPHFD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR 110
            G  YEPHFD         F+ K      G+R+AT ++YLS VE GG T F  +      
Sbjct: 406 IGGHYEPHFDHATSPSSPVFKLKT-----GNRVATFMIYLSSVEAGGSTAFIYA------ 454

Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                      ++V  +K  A+ +++LH +   D+ +LH  CPV+ G+KW A KW+H
Sbjct: 455 ----------NFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLIGDKWVANKWVH 501


>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
 gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
          Length = 512

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 19/136 (13%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
           I  RI   T       E +QI +Y  G  ++PHFD+  D     N    G R+A++L Y 
Sbjct: 380 INQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 439

Query: 91  SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           S V +GG TVFP   V+                V P KG  L +F+LH D   D  SLH 
Sbjct: 440 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGKPDIRSLHS 483

Query: 151 SCPVIEGEKWSATKWI 166
            CPV+ G++W+ TKW+
Sbjct: 484 VCPVLNGDRWTLTKWV 499


>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
 gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
          Length = 558

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 62/207 (29%), Positives = 92/207 (44%), Gaps = 43/207 (20%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V + ++G+   +  R S   +L   + E+V  I  RI   T L  E  E +QI +Y  G 
Sbjct: 366 VHNADTGQLETASYRISKSAWLKDTEHEVVKRISDRIDMMTDLTMETAELLQIANYGIGG 425

Query: 62  KYEPHFDF--------FRDKMNQQL-----------------GGHRIATVLMYLSHVEKG 96
            Y+PHFD         + +    ++                  G+RIATVL Y+S  E G
Sbjct: 426 HYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPYSFESLNAGNRIATVLFYISQPEAG 485

Query: 97  GETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIE 156
           G TVF + +++                V+P K DA  +F++      D ++ H +CPV+ 
Sbjct: 486 GGTVFTSHKIT----------------VEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLA 529

Query: 157 GEKWSATKWIHVR--NFDKPEKEPEDD 181
           G KW A KWIH R   F +P    E D
Sbjct: 530 GTKWVANKWIHERGQEFRRPCSTKETD 556


>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 56/151 (37%), Positives = 80/151 (52%), Gaps = 20/151 (13%)

Query: 23  LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD--FFRDKMNQQLG- 79
           L   + E V  I+ R+   T L  E  E + +L+Y  G ++EPHFD     D+  ++LG 
Sbjct: 389 LRSTEYETVKRIDKRLELATNLEIETAEDLAVLNYGIGGQFEPHFDCALKGDQCFEKLGT 448

Query: 80  GHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
           G+RIAT L+YL+  E GG TVF  N ++S                V  +K  AL +++L 
Sbjct: 449 GNRIATFLIYLTEPEIGGRTVFTSNLKIS----------------VPCVKNAALFWYNLM 492

Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            +   D+ SLH +CPV  G KW+A KW H R
Sbjct: 493 RNGEVDTRSLHAACPVATGIKWTANKWFHER 523


>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
          Length = 451

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 19/136 (13%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
           I  RI   T       E +QI +Y  G  ++PHFD+  D     N    G R+A++L Y 
Sbjct: 319 INQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 378

Query: 91  SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           S V +GG TVFP   V+                V P KG  L +F+LH D   D  SLH 
Sbjct: 379 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGKPDIRSLHS 422

Query: 151 SCPVIEGEKWSATKWI 166
            CPV+ G++W+ TKW+
Sbjct: 423 VCPVLNGDRWTLTKWV 438


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 20/149 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
           R S   +LS  ++ +V+ I  RI   T L     E +Q+ +Y  G +YEPHFDF R    
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438

Query: 72  DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
           D   +   G+RIAT L Y+S V  GG TVFP                  G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482

Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKW 160
           + +++L      D ++ H +CPV+ G KW
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
 gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
          Length = 520

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 19/136 (13%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
           I  RI   T       E +QI +Y  G  ++PHFD+  D     N    G R+A++L Y 
Sbjct: 388 INQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 447

Query: 91  SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           S V +GG TVFP   V+                V P KG  L +F+LH D   D  SLH 
Sbjct: 448 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGKPDIRSLHS 491

Query: 151 SCPVIEGEKWSATKWI 166
            CPV+ G++W+ TKW+
Sbjct: 492 VCPVLNGDRWTLTKWV 507


>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 56/151 (37%), Positives = 80/151 (52%), Gaps = 20/151 (13%)

Query: 23  LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD--FFRDKMNQQLG- 79
           L   + E V  I+ R+   T L  E  E + +L+Y  G ++EPHFD     D+  ++LG 
Sbjct: 389 LRSTEYETVKRIDKRLELATNLEIETAEDLAVLNYGIGGQFEPHFDCALKGDQCFEKLGT 448

Query: 80  GHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
           G+RIAT L+YL+  E GG TVF  N ++S                V  +K  AL +++L 
Sbjct: 449 GNRIATFLIYLTEPEIGGRTVFTSNLKIS----------------VPCVKNAALFWYNLM 492

Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            +   D+ SLH +CPV  G KW+A KW H R
Sbjct: 493 RNGEVDTRSLHAACPVATGIKWTANKWFHER 523


>gi|219126272|ref|XP_002183385.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405141|gb|EEC45085.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 474

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 59/164 (35%), Positives = 85/164 (51%), Gaps = 23/164 (14%)

Query: 13  SEVRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
           SE RTS+  +     D  E+   I  R+   T +PPEN E++Q+L YE GQ Y  H D+ 
Sbjct: 321 SETRTSTNAWCYNECDDHEVTQIIWERMTFLTQIPPENSESLQMLRYEPGQFYAVHHDYI 380

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
            +  N+ +G  RI TV +YL+ VE+GG T FP  E+                AV+P +G 
Sbjct: 381 ENDWNRAVGS-RILTVFLYLNDVEEGGATNFPELEL----------------AVQPKRGR 423

Query: 131 ALLFFSL---HPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
           ALL+ S+   +P    D T  H +  V +G K+ A  W H R++
Sbjct: 424 ALLWPSVLDQYPHKKDDRTE-HEAQVVTKGIKYGANAWFHQRDY 466


>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
 gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
          Length = 708

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 87/175 (49%), Gaps = 20/175 (11%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           VA N +GKS  +++R S   +L+     I+ SI   I   +       E MQ+ +Y  G 
Sbjct: 537 VAGN-AGKSTVADLRVSQQTWLNYT-SPIMKSISRIIQFVSGFDIAGAEFMQVANYGVGG 594

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +YEPH D+F   + QQ  G RI+T + YLS+VE+GG TVF    V               
Sbjct: 595 QYEPHPDYFEFNLPQQFQGDRISTSMFYLSNVEQGGYTVFTKLNV--------------- 639

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
             + P++G  +++ +LH     D+ +LH  CPV+ G K     W+H   + F +P
Sbjct: 640 -FLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSKRIGNIWMHSGFQEFRRP 693


>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
 gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
          Length = 533

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 74/141 (52%), Gaps = 18/141 (12%)

Query: 29  EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-MNQQLGGHRIATVL 87
           E++  I  RI   T L   +G  MQ+L Y  G  + PHFD+F  K +  +  G RIATVL
Sbjct: 382 EVLNRIGRRIGDITGLSTRSGRQMQLLKYGFGGHFTPHFDYFDSKTLYLEKVGDRIATVL 441

Query: 88  MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA-STDST 146
            YL++VE GG TVFP+  +                AV   KG AL + +L   +   D+ 
Sbjct: 442 FYLNNVEHGGATVFPSINL----------------AVPTQKGSALFWHNLDGQSYDYDTR 485

Query: 147 SLHGSCPVIEGEKWSATKWIH 167
           + HG+CP+I G K   T+WI+
Sbjct: 486 TFHGACPLISGTKLVMTRWIY 506


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 78/158 (49%), Gaps = 24/158 (15%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
           RT+   +L  +  +++  +  R +    +P  N E  Q+++Y  G +Y+PHFD F DK  
Sbjct: 59  RTNDFCWLEHSASDVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAF-DKTT 117

Query: 76  QQ------LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
           ++       GG R+ T L YL+ VE+GG T FP   VS                VKP KG
Sbjct: 118 KEGQNNWFPGGQRMVTALAYLNDVEEGGATDFPKINVS----------------VKPNKG 161

Query: 130 DALLFFS-LHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
           D ++F + +      +  +LHG  PV+ GEKW+   W 
Sbjct: 162 DVVVFHNCIEGTTEINPQALHGGSPVVAGEKWAVNLWF 199


>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 504

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 49/129 (37%), Positives = 68/129 (52%), Gaps = 22/129 (17%)

Query: 52  MQILHYEHGQKYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS 107
           +++ +Y  G +YEPHFDF R    D   +   G+R+AT L Y+S VE GG TVFP     
Sbjct: 385 LEVANYGMGGQYEPHFDFARKDEPDAFKELGTGNRVATWLFYMSDVEAGGATVFPEV--- 441

Query: 108 QSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                        G AV P KG A+ +++L      D ++ H +CPV+ G KW + KWIH
Sbjct: 442 -------------GAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLVGNKWVSNKWIH 488

Query: 168 VRN--FDKP 174
            R   F +P
Sbjct: 489 ERGQEFRRP 497


>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
          Length = 538

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 83/166 (50%), Gaps = 31/166 (18%)

Query: 12  ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPHFD- 68
            +E R S   +L ++  E+V  ++ RI   T L   P   E +Q+++Y  G  YEPHFD 
Sbjct: 378 TAEYRISKSAWLKESAHEVVGKLDQRITLVTGLNVQPPYAEYLQVVNYGIGGHYEPHFDH 437

Query: 69  -------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
                   +R K      G+R+AT+++YLS V+ GG T F  +                 
Sbjct: 438 ATSDSSPLYRLKT-----GNRVATIMIYLSPVQAGGSTAFIYA----------------N 476

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           ++V  ++  AL +++LH +   +  +LH  CPVI G KW A KW+H
Sbjct: 477 FSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGNKWVANKWVH 522


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score = 89.4 bits (220), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 86/177 (48%), Gaps = 24/177 (13%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
            +S+ S VRTS   F+  +  +++++I+ R+A  T L  +  E  Q  +Y  G  Y  H 
Sbjct: 336 NESVVSNVRTSQFTFIPVSAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHM 395

Query: 68  D-FFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           D F++  ++  L      G+RIATVL YLS V +GG T FP                   
Sbjct: 396 DWFYQTTIDAGLISSPEMGNRIATVLFYLSDVSQGGGTAFPQLRT--------------- 440

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
             +KP K  A  + +LH     D  + HG+CP+I G KW   +WI  R  D+ ++ P
Sbjct: 441 -LLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWI--REVDQSDRRP 494


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/128 (37%), Positives = 67/128 (52%), Gaps = 19/128 (14%)

Query: 50  EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
           E +Q+ +Y  G  YEPH+DFF D  +     G+RIAT + YLS VE+GG T FP      
Sbjct: 410 EQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQGGATAFPF----- 464

Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
                        +AV+P  G+ L +++LH     D  + H  CPV++G KW A  WIH 
Sbjct: 465 -----------LNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGSKWIANIWIHE 513

Query: 168 -VRNFDKP 174
             + F +P
Sbjct: 514 ATQTFARP 521


>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
 gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
          Length = 460

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 80/161 (49%), Gaps = 17/161 (10%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G+S  S +RTS  M       E++ +IE RI   T L  +  E   +++Y  G  Y+ H+
Sbjct: 298 GESQISTLRTSQDMPFGANSGEVMRNIEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHY 357

Query: 68  DFF-RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
           DF+   +  + L G RI TVL YL  VE  G TVFP   +S                + P
Sbjct: 358 DFYVYSEPLRFLRGERIVTVLFYLGDVELSGSTVFPFLNIS----------------ITP 401

Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
            KG A+++++LH        + H +CPV+ G K+  TKWI+
Sbjct: 402 KKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKYVLTKWIN 442


>gi|268562483|ref|XP_002638619.1| Hypothetical protein CBG05671 [Caenorhabditis briggsae]
          Length = 520

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 82/162 (50%), Gaps = 26/162 (16%)

Query: 13  SEVRTSSGMFLSKAQD----EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
           S+VR ++G +L   +     +I  +++  I A   L     E  QIL Y     Y PH+D
Sbjct: 99  SQVRAANGTWLIHTKRPNFAKIFWNLQVNIRA---LDLSTAEPWQILSYNSEGYYAPHYD 155

Query: 69  FFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
           F   + N+ L    G+RIATVL+ L   +KGG TVFP   ++                ++
Sbjct: 156 FLNPETNKVLVESRGNRIATVLVILQIAKKGGTTVFPKININ----------------IR 199

Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
           P  GD +++ +  PD  +DS +LH +CP+ EG K  AT W+H
Sbjct: 200 PKIGDVVVWLNTVPDGESDSQTLHAACPIKEGTKIGATLWVH 241



 Score = 60.1 bits (144), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 73/179 (40%), Gaps = 35/179 (19%)

Query: 5   NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAW----TFLPP---ENGEAMQILHY 57
           N+ G    S+ R ++G  +      I     A ++ W      +P    E+ E +  L Y
Sbjct: 341 NDDGTEYYSKYRKANGTQI------IAPDFPAALSIWKTVKILIPTLNIESSEDIVALSY 394

Query: 58  EHGQKYEPHFDFFRDKMNQQLGG------HRIATVLMYLSHVEKGGETVFPNSEVSQSRD 111
             G  Y  H DF      ++  G      +R  T++M     E GG T+FP+        
Sbjct: 395 IRGGHYAAHHDFLEYPSEKEWDGWMKDYGNRFGTLIMAFETAELGGATIFPSLNA----- 449

Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
                      A++P  GDA  +F+   +   +  S HG CP+ EG+K  +T W  ++N
Sbjct: 450 -----------AIRPNTGDAFFWFNAMGNTKQEDLSDHGGCPIYEGKKSISTIWFRMKN 497


>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
 gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/140 (34%), Positives = 75/140 (53%), Gaps = 21/140 (15%)

Query: 32  ASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLG---GHRIATVLM 88
           A I  RI   T       E + + +Y  G  + PH+D+  +  N  +G   G  + T+L 
Sbjct: 391 ARIYQRITDITGFQLFVQEELNVANYGLGTIFGPHYDYTPE--NYDIGWFMGGPLGTILF 448

Query: 89  YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
           Y+S +++GG T+FP+  ++                V P KG ALL+F+L+ D   D  +L
Sbjct: 449 YVSDLQQGGATIFPSINIT----------------VSPRKGSALLWFNLYDDGEPDPRTL 492

Query: 149 HGSCPVIEGEKWSATKWIHV 168
           H SCPVIEG++W+ TKW+H+
Sbjct: 493 HSSCPVIEGDRWTLTKWVHL 512


>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
          Length = 187

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 76/150 (50%), Gaps = 18/150 (12%)

Query: 22  FLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK--MNQQLG 79
           +L   + E+V  I  R+   T L  E  E +Q+ +Y  G  YEPH+D  R +    +   
Sbjct: 9   WLGSTEHEVVNRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSRRESVFEKTKN 68

Query: 80  GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHP 139
           G+RIAT+L+Y++  E GG TVF + + S S       C          K  AL +++L  
Sbjct: 69  GNRIATILIYMTKPEIGGGTVFIDLKTSIS-------CT---------KNAALFWYNLMR 112

Query: 140 DASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
             + D  S H +CPV+ G KW+A KW H R
Sbjct: 113 SGAVDIRSYHAACPVLTGTKWTANKWFHER 142


>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 156

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 69/106 (65%), Gaps = 8/106 (7%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           ++D  +GK I +    + G F+   +D+I+ +IE RI     +P ENGE +Q++HY  GQ
Sbjct: 47  ISDKRTGKGIENRFAYACGGFV---KDKIIKNIEQRIPDIISIPVENGEGLQVIHYGVGQ 103

Query: 62  KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSE 105
           K+ PH+D    + N+    GG R+AT LMYLS VE+GGETVFP+++
Sbjct: 104 KFVPHYD---SRSNESFWNGGPRVATFLMYLSDVEEGGETVFPSAK 146


>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
 gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
          Length = 151

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/149 (37%), Positives = 70/149 (46%), Gaps = 19/149 (12%)

Query: 22  FLSKAQDEIVASIEARIAAWTFLPPE--NGEAMQILHYEHGQKYEPHFDFFRDKMNQQL- 78
           +L   +  ++A +  R+   T L      GEA Q+L+Y  G  YEPH D+FRD+    L 
Sbjct: 3   WLFDTEHTVIAKLSRRVEYITGLDVNWPYGEAFQVLNYGLGGFYEPHVDYFRDEQPALLT 62

Query: 79  GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
            G RI T L YLS VE GG TVF                 R    V  +K  A+LF  L 
Sbjct: 63  NGQRIVTFLFYLSDVEAGGATVF----------------TRLNLTVPAVKNSAVLFHDLK 106

Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWIH 167
                +  S H  CPV+ G KW A KWIH
Sbjct: 107 RSLEFEKDSEHAGCPVLMGSKWIANKWIH 135


>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
 gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
           adhaerens]
          Length = 495

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 71/145 (48%), Gaps = 18/145 (12%)

Query: 22  FLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGH 81
           +L  A D +V  I       T L     E +Q+ +Y  G  Y PH+D         L   
Sbjct: 352 WLEDAYDPVVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDPL--Q 409

Query: 82  RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA 141
           R+AT++ YLS+VE GG T+FP                R G AV+P KG AL + +L  + 
Sbjct: 410 RLATMMFYLSNVEIGGATIFP----------------RLGVAVRPQKGSALFWINLKRNG 453

Query: 142 STDSTSLHGSCPVIEGEKWSATKWI 166
            T+  +LH +CPV+ G KW A KWI
Sbjct: 454 LTNRQTLHAACPVVIGSKWIANKWI 478


>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
 gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
          Length = 498

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 92/181 (50%), Gaps = 30/181 (16%)

Query: 7   SGKSIASEVRTSSGMF-----LSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEH 59
           + +S+ S+VRT+ G F     LS    ++V  ++ R+   + L    E    MQ L+Y+ 
Sbjct: 325 NNESVVSKVRTAKGAFMHADRLSPESAQVVQRLKQRMGDLSDLNIKREGYNEMQYLNYDF 384

Query: 60  GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
           G  Y  H D+F   MN      RIAT L+YL+ V +GG T+FP  +V Q           
Sbjct: 385 GDHYLLHMDYFNISMND-----RIATFLIYLNDVTRGGGTIFP--QVKQ----------- 426

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI--HVRNFDKPEKE 177
              AV P KG  +L+++++ +   +  SLHG+CPV+ G K +   WI  H + F KP   
Sbjct: 427 ---AVHPEKGKLILWYNMNSNLDYELASLHGACPVLIGRKIAIVYWIREHDQMFVKPCLN 483

Query: 178 P 178
           P
Sbjct: 484 P 484


>gi|167524906|ref|XP_001746788.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774568|gb|EDQ88195.1| predicted protein [Monosiga brevicollis MX1]
          Length = 321

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 53/151 (35%), Positives = 75/151 (49%), Gaps = 17/151 (11%)

Query: 21  MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
           M ++     IV  +E RI     LP  N E  Q+L Y + Q Y  H D   ++ +   G 
Sbjct: 178 MAVNATAATIVRQLEERIGKLVGLPVVNQEHFQVLRYNNNQYYRVHNDLIDEQYDMPCGP 237

Query: 81  HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPD 140
            R+ T+ +YL+ V  GGET F                 R G AVKP KG A+L++S+  D
Sbjct: 238 -RVLTLFIYLNDVPAGGETSF----------------TRLGLAVKPKKGKAVLWYSVTND 280

Query: 141 ASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
              +  + H + PV +G K++A KWIHV NF
Sbjct: 281 LEPEERTDHEARPVKQGTKYAANKWIHVGNF 311


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 80/164 (48%), Gaps = 18/164 (10%)

Query: 5   NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
           N +   + S+ RTS  ++L +  +E    +  R+A  T L  ++ E  Q+++Y  G  +E
Sbjct: 363 NAANDFVVSKFRTSKSVWLDRDANEATVKLTQRLADATGLDVKHSEHFQVINYGIGGVFE 422

Query: 65  PHFDFFRDKMNQQLGGH--RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
            HFD   +  N+ +GG   RIAT L YL+ V +GG T FP   ++               
Sbjct: 423 SHFDTTLEDTNRFVGGFIDRIATTLFYLNDVPQGGATHFPGLNIT--------------- 467

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
            V P  G AL +++L         ++H  CPVI G KW  +KWI
Sbjct: 468 -VFPRLGAALFWYNLDTQGMLQVRTMHTGCPVIVGSKWVVSKWI 510


>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
 gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
          Length = 538

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 83/166 (50%), Gaps = 21/166 (12%)

Query: 6   ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
           +S  S  ++ RTS   +L    +  ++ I+ R+   T L  E+ E +Q+L+Y  G +YEP
Sbjct: 367 QSENSKIADRRTSQNTWLWYDVNPWLSRIKQRLEDVTGLSTESAEPLQLLNYGIGGQYEP 426

Query: 66  HFDFFRDKMNQQLGG---HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           HFDF  D   +++ G    R+ T + Y++ V  GG T FP   +                
Sbjct: 427 HFDFVEDA--EKIFGWQDDRLMTAIFYINDVALGGATAFPFLRL---------------- 468

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
           AV P KG  L++ +LH     D  S H  CP+++G KW  T+W HV
Sbjct: 469 AVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKWICTEWFHV 514


>gi|372272594|ref|ZP_09508642.1| Procollagen-proline dioxygenase [Marinobacterium stanieri S30]
          Length = 217

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 22/160 (13%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
           R+    +L  A   +   +  RIA    +P EN E++Q+LHY   Q+Y  H+D +     
Sbjct: 50  RSGQNCWLRYADYPLAKQVGDRIAKLAGIPLENAESLQVLHYGPEQEYRAHYDAYDLSTA 109

Query: 71  RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
           R +   + GG R+ T L+YL+ VE GG T FP                R G  V P  G 
Sbjct: 110 RGQRCCRYGGQRLVTALVYLNAVEAGGGTAFP----------------RLGLEVSPALGR 153

Query: 131 ALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVR 169
            +LF +   D S     SLH   PV +GEKW+   W HVR
Sbjct: 154 MVLFQNTDEDVSKPHRDSLHAGMPVTQGEKWAFNIWFHVR 193


>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 213

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/181 (32%), Positives = 84/181 (46%), Gaps = 17/181 (9%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  S  +  +  R S+ +  S     I+  I  RI  ++ +  EN E +QILHY  G
Sbjct: 42  VVVDGASDAAYETPGRCSTVVAPSVDAYPIILEIRRRIELFSGISQENQEPLQILHYTRG 101

Query: 61  QKYEPHFDFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            KY+ H+D F D   Q + GG+R+ TVL+YL+ VE GG T FP+   +            
Sbjct: 102 GKYDIHYDAFSDGSPQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMAN------------ 149

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
               + P  G  +LF +          SLH   PV  GEKW A+ WI    +  P  +  
Sbjct: 150 ----IVPNAGSGILFRNTDAQNRQLRESLHAGLPVTHGEKWIASIWIRENPYITPSVDRV 205

Query: 180 D 180
           D
Sbjct: 206 D 206


>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
           anophagefferens]
          Length = 182

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/167 (34%), Positives = 84/167 (50%), Gaps = 25/167 (14%)

Query: 8   GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
           G    S  RTSS  +L++   E + S+  ++ A T  P E+ E  Q+  Y  G+ Y+PH+
Sbjct: 35  GNGEVSVSRTSSTCYLAR---EDLPSVCTKVCALTGKPLEHLELPQVGRYRGGEFYKPHY 91

Query: 68  DFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
           D F       +   Q GG R+ATVL+YL+ VE+GGET F                ++ G 
Sbjct: 92  DAFDTSSADGRRFAQNGGQRVATVLVYLNDVERGGETSF----------------SKLGV 135

Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
            +KP KG+AL+FF    D   D   LH + P ++  KW +  WI  R
Sbjct: 136 RIKPRKGNALIFFPATLDGVLDQNYLHAAEPAVD-PKWVSQIWIRQR 181


>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 215

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/181 (32%), Positives = 84/181 (46%), Gaps = 17/181 (9%)

Query: 1   MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
           +V D  S  +  +  R S+ +  S     I+  I  RI  ++ +  EN E +QILHY  G
Sbjct: 44  VVVDGASDAAYETPGRCSTVVAPSVDAYPIILEIRRRIELFSGISQENQEPLQILHYTRG 103

Query: 61  QKYEPHFDFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
            KY+ H+D F D   Q + GG+R+ TVL+YL+ VE GG T FP+   +            
Sbjct: 104 GKYDIHYDAFSDGSPQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMAN------------ 151

Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
               + P  G  +LF +          SLH   PV  GEKW A+ WI    +  P  +  
Sbjct: 152 ----IVPNAGSGILFRNTDAQNRQLRESLHAGLPVTHGEKWIASIWIRENPYITPSVDRV 207

Query: 180 D 180
           D
Sbjct: 208 D 208


>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
 gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide 2 [Danio rerio]
 gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
          Length = 514

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/175 (31%), Positives = 79/175 (45%), Gaps = 36/175 (20%)

Query: 2   VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
           V D ++G    +  R S   +L    D ++A +  RI   T L  +  E +Q+ +Y  G 
Sbjct: 367 VRDPKTGVLTVAHYRVSKSAWLEGEDDPVIARVNQRIEDITGLTVDTAELLQVANYGVGG 426

Query: 62  KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
           +YEPHFDF R                  +S VE GG TVFP+                 G
Sbjct: 427 QYEPHFDFSR------------------MSDVEAGGATVFPDF----------------G 452

Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
            +V P KG A+ +++L      D  + H +CPV+ G KW + KWIH R   F +P
Sbjct: 453 ASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRP 507


>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 173

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/165 (35%), Positives = 80/165 (48%), Gaps = 32/165 (19%)

Query: 19  SGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF-FRDKMN-- 75
           S  +LS     +V  +  RIAA T L   + E +Q+++Y  G  Y PHFDF  +DK    
Sbjct: 2   SAAWLSDHHHPVVKKLSRRIAAATGLSTSSAEHLQVVNYGVGGHYSPHFDFSTKDKPLRG 61

Query: 76  -QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
            +   G R AT L+YLS VE+GG T+F    V                 V+P  G AL +
Sbjct: 62  WETFAGQRQATWLVYLSSVERGGATLFKRLRVR----------------VQPEAGMALFW 105

Query: 135 FSLHPDAST------------DSTSLHGSCPVIEGEKWSATKWIH 167
            +L P ++             D  + HG+CPV+ G KW ATKWIH
Sbjct: 106 HNLPPGSTNSLPSCCVHRSVGDERTEHGACPVLVGSKWIATKWIH 150


>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
           boliviensis boliviensis]
          Length = 149

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 69/130 (53%), Gaps = 22/130 (16%)

Query: 51  AMQILHYEHGQKYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV 106
            +Q+ +Y  G +YEPHFDF R    D   +   G+RIAT L Y+S V  GG TVFP  EV
Sbjct: 29  GLQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV 86

Query: 107 SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
                         G +V P KG A+ +++L      D ++ H +CPV+ G KW + KW+
Sbjct: 87  --------------GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWL 132

Query: 167 HVRN--FDKP 174
           H R   F +P
Sbjct: 133 HERGQEFRRP 142


>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
 gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
          Length = 525

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 66/136 (48%), Gaps = 19/136 (13%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
           I  RI   T       E +QI +Y  G  ++PHFD+  D     N    G R+A++L Y 
Sbjct: 388 INQRIIDMTEFNFSKDEKLQITNYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 447

Query: 91  SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           S V +GG TVFP   V+                V P KG  L +F+LH D   D  S H 
Sbjct: 448 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGRPDIRSKHS 491

Query: 151 SCPVIEGEKWSATKWI 166
            CPVI G++W+ TKW+
Sbjct: 492 VCPVINGDRWTLTKWV 507


>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
 gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
          Length = 536

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 19/161 (11%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF---RD 72
           RTS G   + +Q      +   +A  + L  +  E +QI +Y  G  YEPH+D F    +
Sbjct: 372 RTSQGASFNYSQYATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHE 431

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                L G+R+AT + YLS V  GG T FP   +                 V P +G  L
Sbjct: 432 YPEDDLYGNRLATAIYYLSDVVAGGGTAFPFLPL----------------LVTPERGSLL 475

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
            +++LHP    D  + H +CPV++G KW A  WI  RN D+
Sbjct: 476 FWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDR 516


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 76/166 (45%), Gaps = 19/166 (11%)

Query: 7   SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
           +G ++ S  R S   +L   +  ++  ++ R+   T L  E  E +Q+++Y  G  YEPH
Sbjct: 356 TGGAVLSSYRISKNAWLYYWEHRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPH 415

Query: 67  FDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
           FD         L    G RIAT+L Y+S VE GG TVFP                  G  
Sbjct: 416 FDCATKDEEFALDPNEGDRIATMLFYMSDVEAGGATVFPQV----------------GAR 459

Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
           V P KG    +++L      D  + H  CPV+ G KW +  WIH R
Sbjct: 460 VVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKWVSNMWIHER 505


>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
 gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
          Length = 536

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 19/161 (11%)

Query: 16  RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF---RD 72
           RTS G   + +Q      +   +A  + L  +  E +QI +Y  G  YEPH+D F    +
Sbjct: 372 RTSQGASFNYSQYATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHE 431

Query: 73  KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
                L G+R+AT + YLS V  GG T FP   +                 V P +G  L
Sbjct: 432 YPEDDLYGNRLATAIYYLSDVVAGGGTAFPFLPL----------------LVTPERGSLL 475

Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
            +++LHP    D  + H +CPV++G KW A  WI  RN D+
Sbjct: 476 FWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDR 516


>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
 gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
          Length = 520

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 66/136 (48%), Gaps = 19/136 (13%)

Query: 34  IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
           I  RI   T       E +QI +Y  G  ++PHFD+  D     N    G R+A++L Y 
Sbjct: 388 INQRIIDMTEFNFSKDEKLQIANYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 447

Query: 91  SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
           S V +GG TVFP   V+                V P KG  L +F+LH D   D  S H 
Sbjct: 448 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGRPDIRSKHS 491

Query: 151 SCPVIEGEKWSATKWI 166
            CPVI G++W+ TKW+
Sbjct: 492 VCPVINGDRWTLTKWL 507


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.404 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,918,866,119
Number of Sequences: 23463169
Number of extensions: 163928495
Number of successful extensions: 335651
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1467
Number of HSP's successfully gapped in prelim test: 588
Number of HSP's that attempted gapping in prelim test: 330647
Number of HSP's gapped (non-prelim): 2346
length of query: 230
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 92
effective length of database: 9,121,278,045
effective search space: 839157580140
effective search space used: 839157580140
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)