BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023180
         (286 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 288

 Score =  453 bits (1165), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 208/277 (75%), Positives = 242/277 (87%), Gaps = 2/277 (0%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
           KLGL T ++ CS  FL GFYGST LS++VP ++P+LR LE+VE  +E+   +PHG TG+ 
Sbjct: 12  KLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRLRMLEMVENGEEDTGSMPHGVTGEE 71

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           S+ SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LALR+GET E+TKGTRT
Sbjct: 72  SVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSGTFISASED TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           GPQ SQR+ASFLLYLSDVEEGGETMFPFENG  + +GYDYK+CIGLKVKPR+GDGLLFYS
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCIGLKVKPRKGDGLLFYS 251

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           +FPNGTID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 252 VFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288


>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 286

 Score =  452 bits (1164), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 206/273 (75%), Positives = 242/273 (88%)

Query: 13  GLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQS 72
           GLP  +L+CS FF+AGFY ST +S+DVP I+P+LR LEV ++E    +P G TG++ I+S
Sbjct: 14  GLPAVILVCSVFFVAGFYASTLISQDVPVIKPRLRMLEVTDEEKHQAMPRGVTGESYIES 73

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           IPFQVLSW+PRA+YFP+FA+ EQC++II  AK RLKPS LALR+GET ESTKGTRTSSGT
Sbjct: 74  IPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTRTSSGT 133

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F+SASED TG L+ IEHKIARATM+P++HGEAFN+LRYEIGQKYDSHYD+FNPAEYGPQM
Sbjct: 134 FLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQM 193

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           SQR+ASFLLYLSDVE+GGETMFPFENG+ + S YDYKKC GLKVKPR+GDG+LFYSL PN
Sbjct: 194 SQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCAGLKVKPRQGDGILFYSLLPN 253

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           GTID+TSLHGSCPVI+GEKWVATKWIRDQ Q +
Sbjct: 254 GTIDQTSLHGSCPVIEGEKWVATKWIRDQVQMD 286


>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score =  449 bits (1155), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 210/277 (75%), Positives = 242/277 (87%), Gaps = 2/277 (0%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEK-ENESG-LPHGETGDA 68
           KLGL T ++ CS  FL GFYGST LS++VP ++P+LR LE+VE  E E+G +PHG TG+ 
Sbjct: 12  KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLEMVENGEEEAGSMPHGVTGEE 71

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LALR+GET E+TKGTRT
Sbjct: 72  SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           GPQ SQR+ASFLLYLSDVEEGGETMFPFENG  +  GYDYK+CIGLKVKPR+GDGLLFYS
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYS 251

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           +FPNGTID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 252 VFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288


>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
 gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
 gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
 gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 288

 Score =  449 bits (1154), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 208/277 (75%), Positives = 241/277 (87%), Gaps = 2/277 (0%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
           KLGL T ++ CS  FL GFYGST LS++VP ++P+LR L++VE  +E  S +PHG TG+ 
Sbjct: 12  KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVTGEE 71

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LALR+GET E+TKGTRT
Sbjct: 72  SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           GPQ SQR+ASFLLYLSDVEEGGETMFPFENG  +  GYDYK+CIGLKVKPR+GDGLLFYS
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIGLKVKPRKGDGLLFYS 251

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           +FPNGTID+TSLHGSCPV KGEKWVATKWIRDQ+Q E
Sbjct: 252 VFPNGTIDQTSLHGSCPVTKGEKWVATKWIRDQDQEE 288


>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
 gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
          Length = 294

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 209/289 (72%), Positives = 240/289 (83%), Gaps = 9/289 (3%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR--------DVPSIRPKLRTLEVV 52
           MK K KRS  KLGLP A++LCS FFLAGFY S F           DVP  RP+LR LEV 
Sbjct: 1   MKIKTKRSKRKLGLPAAIILCSLFFLAGFYSSIFFPHVSASLSACDVPVTRPRLRMLEVA 60

Query: 53  EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
           E ++++ +PHG TG+AS++SIPFQVLSW+PRALYFP FA+ EQC+SII   + +LKPS L
Sbjct: 61  EGDHDT-MPHGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTL 119

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
           ALR+GET ESTK TRTSSG+F+S SED+TG L+ IE KIA+ATM+PQ+HGEAFN+LRYEI
Sbjct: 120 ALRKGETAESTKDTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEI 179

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
           GQKYDSHYDAFNP EYG Q SQR ASFLLYLS+VEEGGETMFPFENG  +  G+DYK+C+
Sbjct: 180 GQKYDSHYDAFNPDEYGQQSSQRTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCV 239

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           GLKVKPR+GDGLLFYSLFPNGTID TSLHGSCPVIKG KWVATKWIRDQ
Sbjct: 240 GLKVKPRQGDGLLFYSLFPNGTIDPTSLHGSCPVIKGVKWVATKWIRDQ 288


>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
          Length = 284

 Score =  436 bits (1122), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 211/286 (73%), Positives = 243/286 (84%), Gaps = 3/286 (1%)

Query: 1   MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
           MKGK K     KLGL    +  SFFFLAG +GS   S+DV  +R + R LE VE+   S 
Sbjct: 1   MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESVEEY--SP 58

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +PHGETG++S+  IPFQVLSW+PRALYFP FA+AEQCQSII  AK  L+PS LALRQGET
Sbjct: 59  MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 118

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 119 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 178

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
           YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE+ + + +GYDYKKCIGLKVKP+
Sbjct: 179 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQ 238

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           RGDGLLFYS+FPNGTIDRTSLHGSCPVI GEKWVATKWIRD++Q +
Sbjct: 239 RGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 284


>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 294

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 206/291 (70%), Positives = 240/291 (82%), Gaps = 6/291 (2%)

Query: 1   MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
           MKGK+ RS+    +KLGLP  + +LC F FLAGF+GST LS+DV   RP+ R L+     
Sbjct: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60

Query: 56  NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
            E  L   GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+  AK +L+PS LAL
Sbjct: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R+GET ESTKG RTSSG F SASED++G L +IE KIARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
           KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y+++ CIGL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           KVKPR+GDGLLFYS+FPNGTID TSLHGSCPVIKG+KWVATKWIRDQ Q +
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291


>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1-like [Cucumis sativus]
          Length = 294

 Score =  422 bits (1086), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 205/291 (70%), Positives = 239/291 (82%), Gaps = 6/291 (2%)

Query: 1   MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
           MKGK+ RS+    +KLGLP  + +LC F FLAGF+GST LS+DV   RP+ R L+     
Sbjct: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFXFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60

Query: 56  NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
            E  L   GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+  AK +L+PS LAL
Sbjct: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R+GET ESTKG RTSSG F SASED++G L +IE K ARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQ 180

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
           KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y+++ CIGL
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYNFQTCIGL 240

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           KVKPR+GDGLLFYS+FPNGTID TSLHGSCPVIKG+KWVATKWIRDQ Q +
Sbjct: 241 KVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIRDQMQED 291


>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score =  422 bits (1086), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 208/286 (72%), Positives = 238/286 (83%), Gaps = 11/286 (3%)

Query: 1   MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
           MKGK K     KLGL    +  SFFFLAG +GS   S+         R LE VE+   S 
Sbjct: 1   MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQP--------RLLESVEEY--SP 50

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +PHGETG++S+  IPFQVLSW+PRALYFP FA+AEQCQSII  AK  L+PS LALRQGET
Sbjct: 51  MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 110

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 111 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 170

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
           YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE+ + + +GYDYKKCIGLKVKP+
Sbjct: 171 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCIGLKVKPQ 230

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           RGDGLLFYS+FPNGTIDRTSLHGSCPVI GEKWVATKWIRD++Q +
Sbjct: 231 RGDGLLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIRDEQQDD 276


>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 290

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 202/291 (69%), Positives = 235/291 (80%), Gaps = 7/291 (2%)

Query: 1   MKGKAKRSS----TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE- 55
           MK K  +      +KLGLP   L C FFFLAG + S  +S++V   + + R L+ V++E 
Sbjct: 1   MKAKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDKNR-RQLQWVKEEI 59

Query: 56  -NESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
                LP G+TGD  +  IPFQVLSW+PRALYFPNFA+AEQCQS+I  AK  L PS LAL
Sbjct: 60  IEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLAL 119

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R+GET E+TKG RTSSG F+SASEDKTG+L+ IE KIARATMLP+ +GEAFN+LRYEIGQ
Sbjct: 120 RKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQ 179

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
           KY+SHYDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN + +D  YD++KCIGL
Sbjct: 180 KYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYDFEKCIGL 239

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           +V+PRRGDGLLFYSLFPN TID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 240 QVRPRRGDGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 290


>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 197/285 (69%), Positives = 234/285 (82%), Gaps = 2/285 (0%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTL-EVVEKENESG 59
           MKGK K S  KLG+PT  +LC+ FF  GF+ S  L +D+  + P+ R L E V+KE E  
Sbjct: 2   MKGKVKSSKLKLGVPTLFILCALFFFVGFFVSPLLFQDLDDVGPRSRILQESVKKEYEP- 60

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           L HGE+G+  + SIP Q+LSWRPRA++FPNF S E CQ II  AK +L+PS+LALR+GET
Sbjct: 61  LEHGESGEPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGET 120

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTK TRTSSGTFISASEDK+GIL+L+E KIA+ TM+P+THGE FN+L+YE+GQKYDSH
Sbjct: 121 AESTKDTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSH 180

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
           YDAFNP EYG   SQR+ASFLLYLS+VE GGETMFP+E G+ +D GYDY+KCIGLKVKPR
Sbjct: 181 YDAFNPDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCIGLKVKPR 240

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           +GDGLLFYSL PNG ID+TSLHGSCPVIKGEKWVATKWI D+EQH
Sbjct: 241 QGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285


>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 297

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 192/282 (68%), Positives = 230/282 (81%), Gaps = 4/282 (1%)

Query: 9   STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
           + KL  P   L+C FFFLAGF+GST  S     D   +RP+ R L+  ++   + +  GE
Sbjct: 15  TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
            GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK  LKPS LALR+GET E+TK
Sbjct: 75  FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           G RTSSG F+SAS DKT  LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y+SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFN 194

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
           P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y Y+ C+GL+VKPR+GDGL
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDCVGLRVKPRQGDGL 254

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
           LFYSL PNGTID+TSLHGSCPVIKGEKWVATKWIR+ +Q +D
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQEDD 296


>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
          Length = 286

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 194/252 (76%), Positives = 220/252 (87%)

Query: 34  FLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASA 93
           F  +DVPS+RP+ R LE V +E+   LP GETG+ S+ SIPFQVLSW PRALYFPNFAS 
Sbjct: 35  FSHQDVPSVRPRPRFLESVYQEDFDPLPIGETGEHSLISIPFQVLSWFPRALYFPNFASI 94

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIAR 153
           EQCQSII  AK  ++PS LALR GET E+TKG RTSSGTFISASEDKTGIL+LIE KIA+
Sbjct: 95  EQCQSIIKMAKANMEPSSLALRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIAK 154

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
           ATM+P+THGEAFNVLRYEIGQ+Y SHYDAF+PA+YGPQ SQR ASFLLYLSDVEEGGET+
Sbjct: 155 ATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETV 214

Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
           FP+ENG  +D+ YD+ KCIGLKVKPRRGDGLLFYSLFPNGTID TSLHGSCPVI+GEKWV
Sbjct: 215 FPYENGQNMDASYDFSKCIGLKVKPRRGDGLLFYSLFPNGTIDLTSLHGSCPVIRGEKWV 274

Query: 274 ATKWIRDQEQHE 285
           ATKWIR+Q+Q +
Sbjct: 275 ATKWIRNQDQDD 286


>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
          Length = 282

 Score =  406 bits (1043), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 199/285 (69%), Positives = 234/285 (82%), Gaps = 3/285 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
           MKGK   S  KL LP   L  S FFLAG +GS   S+DV  +R   R LE VE+E  S +
Sbjct: 1   MKGKGGWSP-KLELPLLFLSWSLFFLAGLFGSMLFSQDVNGVRSPPRLLESVEEEY-SSM 58

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           PHGETG++S+  IPFQVLSW+PRA YFP+FA+AEQCQSII  AK  L PS L LR+GET 
Sbjct: 59  PHGETGESSVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGETE 118

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
           ESTKG RTSSGTFISASEDKTGIL+ IE KIA+ATM+P+ HGE FN+LRYEIGQ+Y+SHY
Sbjct: 119 ESTKGIRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHY 178

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
           DA +PAEYG Q SQR+ASFLLYLSDVEEGGETMFPFE+ + +++ ++ +KCIGLKVKPRR
Sbjct: 179 DAISPAEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNINT-FNSRKCIGLKVKPRR 237

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           GDGLLFYS+FPNGTID TS+HGSCPVI+GEKWVATKWIRD++Q +
Sbjct: 238 GDGLLFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWIRDEQQED 282


>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
          Length = 297

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 191/282 (67%), Positives = 228/282 (80%), Gaps = 4/282 (1%)

Query: 9   STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
           + KL  P   L+C FFFLAGF+GST  S     D   +RP+ R L+  ++   + +  GE
Sbjct: 15  TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
            GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK  LKPS LALR+GET E+TK
Sbjct: 75  FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           G RTSSG F+SAS DKT  LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFN 194

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
           P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y Y+  +GL+VKPR+GDGL
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYEDRVGLRVKPRQGDGL 254

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
           LFYSL PNGTID+TSLHGSCPVIKGEKWVATKWIR+ +Q +D
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIRNLDQEDD 296


>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 290

 Score =  399 bits (1025), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 194/285 (68%), Positives = 230/285 (80%), Gaps = 1/285 (0%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
           MKGK +RS  KLG+PT   LC+ FF  GF+ S  L +D+  + P+ R L+   K+    L
Sbjct: 1   MKGKVRRSKLKLGVPTFFFLCALFFFVGFFVSPLLFQDLGDVGPRSRILQESVKKAYEPL 60

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
            HGE+G+  + SIPFQ+LSWRPRA+YFPNF S E CQ II  AK +L+PS+LALR+GET 
Sbjct: 61  EHGESGEPFLNSIPFQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 120

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
           ESTK TRTSSGTFISASEDK+GIL+ +E KIA+ TM+P+THGE FN+L+YE+ QKYDSHY
Sbjct: 121 ESTKDTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHY 180

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG-YDYKKCIGLKVKPR 239
           DAFNP EYG   SQR+ASFLLYLS+VE GGETMFP+E G+ +D G YDYKKCIGLKVKPR
Sbjct: 181 DAFNPDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKPR 240

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           +GDGLLFYSL PNG ID+TSLHGSCPVIKGEKWVATKWI D+EQH
Sbjct: 241 QGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDREQH 285


>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
          Length = 318

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 202/293 (68%), Positives = 227/293 (77%), Gaps = 12/293 (4%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIR----PKL-RTLE 50
           ++G       K+  P+  LLC FFFLAGF+GST         D   +R    P+  R LE
Sbjct: 6   VRGNWSWRMNKIAFPSVFLLCIFFFLAGFFGSTLFFQYHSQEDEHGLRQIQRPRTTRLLE 65

Query: 51  -VVEKENESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
            + EKE ES L P GETGD  I +IPFQVLSW P ALYFPNFA+AEQC+SII TAK+ LK
Sbjct: 66  NLTEKETESHLLPAGETGDNFITTIPFQVLSWNPHALYFPNFATAEQCESIIETAKEGLK 125

Query: 109 PSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
           PS L LR GET EST G RTSSG FISA EDKTG+L++IE KIARAT +P+THGEAFNVL
Sbjct: 126 PSTLVLRVGETDESTTGIRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVL 185

Query: 169 RYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDY 228
           RY++GQKY SHYDA +P  YGPQ SQR+ASFLLYLSDV EGGETMFPFENG+ +D  Y Y
Sbjct: 186 RYKVGQKYSSHYDALHPDIYGPQKSQRMASFLLYLSDVPEGGETMFPFENGLNMDGSYYY 245

Query: 229 KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +KCIGLKVKPR+GDGLLFYSLFPNGTID  SLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 246 EKCIGLKVKPRKGDGLLFYSLFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQ 298


>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
 gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
          Length = 294

 Score =  396 bits (1018), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 190/275 (69%), Positives = 223/275 (81%), Gaps = 7/275 (2%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D        P+ R LE    E    +P+GE+G
Sbjct: 22  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLEAAWPE----MPYGESG 77

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +A+   IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG 
Sbjct: 78  EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA+ED T  L  IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +YGPQ +QR+ASFLLYL+DVEEGGETMFP+EN   +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLF 257

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           YSL  NGTIDRTSLHGSCPVIKGEKWVATKWIRD 
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292


>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
          Length = 293

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 198/285 (69%), Positives = 229/285 (80%), Gaps = 3/285 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
           +KG     + KL  P   L+C FFFLAGF+G T  S    D   +RP+ R L+   +   
Sbjct: 6   VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII  AK  LKPS LALRQG
Sbjct: 66  NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ET ++TKG RTSSG F+SASEDKT  L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y Y+ CIGLKVK
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVK 245

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           PR+GDGLLFYSL  NGTID TSLHGSCPVIKGEKWVATKWIRDQE
Sbjct: 246 PRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290


>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
 gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
          Length = 294

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 189/275 (68%), Positives = 223/275 (81%), Gaps = 7/275 (2%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D        P+ R LE    E    +P+GE+G
Sbjct: 22  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLETAWPE----MPYGESG 77

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +A+   IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG 
Sbjct: 78  EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA+ED T  L  IE KIARATM+P+ HGE FNVLRY IGQ+Y SHYDAF+P 
Sbjct: 138 RTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPV 197

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +YGPQ SQR+ASFLLYL++VEEGGETMFP+ENG  +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 198 QYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLF 257

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           YSL  NGTIDRTSLHGSCPVIKGEKWVATKWIRD 
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292


>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
 gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
 gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
          Length = 294

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 189/275 (68%), Positives = 223/275 (81%), Gaps = 7/275 (2%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D        P+ R L+    E    +P+GE+G
Sbjct: 22  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLDAAWPE----MPYGESG 77

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +A+   IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG 
Sbjct: 78  EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA+ED T  L  IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +YGPQ +QR+ASFLLYL+DVEEGGETMFP+EN   +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYENSENMDIGYDYEKCIGLKVKPRKGDGLLF 257

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           YSL  NGTIDRTSLHGSCPVIKGEKWVATKWIRD 
Sbjct: 258 YSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292


>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
          Length = 293

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 198/285 (69%), Positives = 229/285 (80%), Gaps = 3/285 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
           +KG     + KL  P   L+C FFFLAGF+G T  S    D   +RP+ R L+   +   
Sbjct: 6   VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII  AK  LKPS LALRQG
Sbjct: 66  NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ET ++TKG RTSSG F+SASEDKT  L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y Y+ CIGLKVK
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEGCIGLKVK 245

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           PR+GDGLLFYSL  NGTID TSLHGSCPVIKGEKWVATKWIRDQE
Sbjct: 246 PRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290


>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
 gi|255641811|gb|ACU21174.1| unknown [Glycine max]
          Length = 293

 Score =  392 bits (1007), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 199/288 (69%), Positives = 229/288 (79%), Gaps = 3/288 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
           +KG     + KL  P   L+C FFFLAGF+G T  S    D   +RP+ R LE   +   
Sbjct: 6   VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLEPGNEAEY 65

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + +P  + GD SI SIPFQVLSWRPRALYFPNFA+AEQC++II  AK  LKPS LALRQG
Sbjct: 66  NLMPVRDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQG 125

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ET E+TKG RTSSG F+SAS DKTG L +IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEENTKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYN 185

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFENG+ +D  Y Y+ CIGLKVK
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYEDCIGLKVK 245

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           PR+GDGLLFYSL  NGTID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 246 PRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 293


>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
          Length = 285

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 200/288 (69%), Positives = 232/288 (80%), Gaps = 6/288 (2%)

Query: 1   MKGKAK---RSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE 57
           MK KAK   R  TKLGLP   L C FFFLAG +GS  L ++  S  P+     + E+ + 
Sbjct: 1   MKSKAKGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEPRRL---IREETDY 57

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
             L HGE+G+ S+ SIPFQVLSWRPRALYFPNFA++EQCQSII  AK  L PS +ALR G
Sbjct: 58  DPLAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVG 117

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           E   +T+G RTSSG FISASEDKTG L+LIE KIAR  M+P+THGEAFNVLRYEIGQ+Y+
Sbjct: 118 EIRGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYN 177

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETMFPFENG+ +D  YD+++CIGLKVK
Sbjct: 178 SHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCIGLKVK 237

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           P +GDGLLFYS+FPNGTID TSLHGSCPVIKGEKWVATKWIRDQEQ +
Sbjct: 238 PHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQEQDD 285


>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
          Length = 280

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 185/277 (66%), Positives = 225/277 (81%), Gaps = 4/277 (1%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D P    +L T    E+  E+   G+ +GE+G
Sbjct: 3   TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 61

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +     IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG 
Sbjct: 62  EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 121

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+S+ ED TG L  +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 122 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 181

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG  +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 182 QYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLF 241

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           YSL  NGTID TSLHGSCPVIKGEKWVATKWIRD+ +
Sbjct: 242 YSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 278


>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
 gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
 gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
          Length = 310

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 185/277 (66%), Positives = 225/277 (81%), Gaps = 4/277 (1%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D P    +L T    E+  E+   G+ +GE+G
Sbjct: 33  TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 91

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +     IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG 
Sbjct: 92  EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 151

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+S+ ED TG L  +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 152 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 211

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG  +D GYDY+KCIGLKVKPR+GDGLLF
Sbjct: 212 QYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCIGLKVKPRKGDGLLF 271

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           YSL  NGTID TSLHGSCPVIKGEKWVATKWIRD+ +
Sbjct: 272 YSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIRDKSK 308


>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 347

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 185/278 (66%), Positives = 224/278 (80%), Gaps = 10/278 (3%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSI----RP--KLRTLEVVEKENESGLPHG 63
           T+L LP  LL CS FFLAGF GS   ++D        RP  + R +E V  E    + +G
Sbjct: 72  TRLRLPVVLLSCSLFFLAGFLGSLLFTQDPQGEEDPERPLRRERVMEAVWPE----MAYG 127

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           E+GD +   IP+Q+LSW+PRALYFP FA+AEQC++++ TAK RL+PS LALR+GE+ E+T
Sbjct: 128 ESGDPAPSLIPYQILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGESEETT 187

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
           KG RTSSGTF+SA ED TG L  IE KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF
Sbjct: 188 KGIRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAF 247

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
           +PA+YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG  ++ GYDY++CIGLKVKPR+GDG
Sbjct: 248 DPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCIGLKVKPRKGDG 307

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           LLFYSL  NGTID TSLHGSCPV++GEKWVATKWIRD+
Sbjct: 308 LLFYSLMVNGTIDPTSLHGSCPVVRGEKWVATKWIRDK 345


>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 295

 Score =  382 bits (982), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 183/275 (66%), Positives = 223/275 (81%), Gaps = 4/275 (1%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLP---HGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D P    +L      E+  E+  P   +G++G
Sbjct: 20  TRLRLPVVLLSCSLFFLAGFFGSLLFTQD-PQGEEELERPLSRERLMEAVWPEMAYGDSG 78

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           D +   IP+Q+LSW+PRALYFP FA++EQC++++ TAK RL+PS LALR+GET E+TKG 
Sbjct: 79  DPAPSLIPYQILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETTKGI 138

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA ED T  L  +E KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF+PA
Sbjct: 139 RTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDPA 198

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +YGPQ SQR+ASFLLYL+DVEEGGETMFP+ENG  +D GYDY++CIGLKVKPR+GDGLLF
Sbjct: 199 QYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCIGLKVKPRKGDGLLF 258

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           YSL  NGTID TSLHGSCPVIKGEKWVATKWIR++
Sbjct: 259 YSLMVNGTIDLTSLHGSCPVIKGEKWVATKWIRNK 293


>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  375 bits (964), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 179/273 (65%), Positives = 215/273 (78%), Gaps = 6/273 (2%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASI 70
           KLG P  LL C+FFFL GFYGS  LS++   +      ++ V  E +    HG TGD+S+
Sbjct: 17  KLGFPLVLLCCTFFFLVGFYGSNSLSKEEKHV-----VIDPVTNE-KLVFEHGRTGDSSV 70

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
             IPFQVLSW+PRAL +PNFAS EQC++II  A+ RL PS LALR+GE+  +TK  RTSS
Sbjct: 71  TDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRTSS 130

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           GTF+ ASEDKT  L  +E K+ARATM+P+ +GEAFNVLRY  GQKYD HYD F+PAEYGP
Sbjct: 131 GTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGP 190

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           Q SQR+ASFLLYLSDVEEGGETMFPFEN   +++GY+YK CIGLKVKPR+GD LLFYS+ 
Sbjct: 191 QPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCIGLKVKPRQGDALLFYSMH 250

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           PNGT D+T+LHGSCPVIKGEKWVATKWIR+ ++
Sbjct: 251 PNGTFDKTALHGSCPVIKGEKWVATKWIRNTDK 283


>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
 gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
          Length = 297

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 188/297 (63%), Positives = 222/297 (74%), Gaps = 15/297 (5%)

Query: 1   MKGKAKRSSTK------LGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIRPKLRTL 49
           M+GKA + + +      LG P  +L CSFFF+AG + S  L     S D   +R + R L
Sbjct: 1   MRGKACKENWRFYKKPNLGFPALILSCSFFFIAGLFASNLLLSQGTSSDERWLRARARQL 60

Query: 50  EVVEKENESG---LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
           + VE+E  S    LP GE+GD  I  IPFQVLSWRPRALY+P F +AEQCQ II  AK  
Sbjct: 61  QSVEEEIISKYDLLPSGESGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPS 120

Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
           L+PS LALR+GET E+TKG RTSSG F+ +SED+ G+L++IE KIARATM+P THGEAFN
Sbjct: 121 LQPSTLALRKGETAETTKGIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFN 180

Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
           VLRYEIGQKYD+HYDAFNPAEYGPQ SQR+A+FLLYLS+ EEGGET FP EN    + GY
Sbjct: 181 VLRYEIGQKYDAHYDAFNPAEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFE-GY 239

Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           D +KC GL+VKP +GD +LFYS+FPN TID  SLH SC VIKGEKWVATKWIRDQ Q
Sbjct: 240 DAQKCNGLRVKPHQGDAILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWIRDQVQ 296


>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 299

 Score =  372 bits (955), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 184/282 (65%), Positives = 215/282 (76%), Gaps = 13/282 (4%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPS--------IRPKLRTLE--VVEKENESG 59
            KL LP+  L+C FFFLAGF  S+  S   PS        + P+ R LE    EK     
Sbjct: 17  NKLDLPSVFLICIFFFLAGFSASSLFS---PSQEHEYDLRLIPRARLLEKSTHEKTEHHL 73

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           L  G++GD  I  IPFQVLSW PRALYFPNF SAEQC++II  A+  LKPS L LR+GET
Sbjct: 74  LKAGDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGET 133

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTKG RTS G F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+GQKY  H
Sbjct: 134 EESTKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPH 193

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
           YDAF+ AE+GP  SQR ASFLLYL+DV EGGET+FP+ENG   D  YD++ CIGL+V+PR
Sbjct: 194 YDAFDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYENGFNRDGSYDFEDCIGLRVRPR 253

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GDGLLFYSL PNGTID+TS+HGSCPVIKGEKWVATKWIRDQ
Sbjct: 254 KGDGLLFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQ 295


>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 294

 Score =  367 bits (943), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 189/295 (64%), Positives = 221/295 (74%), Gaps = 12/295 (4%)

Query: 1   MKGKAKRS-----STKLGLPTALLLC---SFFFLAGFYGSTFLSRDVPSIRPKLRTLE-V 51
           MK K  ++     + K  LP+  LLC         GF        D   +R  +R LE  
Sbjct: 1   MKAKTVKANWSLRTKKFSLPSVFLLCIFFFLAGFFGFSFFHHSQEDEYGVR--VRLLEKS 58

Query: 52  VEKENESGLPH-GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS 110
           V+ E E  L H G++GD  I SIPFQVLSW PRALYFPNFASAEQC  II  AK  L PS
Sbjct: 59  VKDETEHRLLHAGKSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPS 118

Query: 111 QLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRY 170
           +L LR+GET E TKG RTSSG FISASEDKTG+LE+I+ KIARA  +P+THG A+N+LRY
Sbjct: 119 RLMLREGETEEGTKGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRY 178

Query: 171 EIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
           ++GQKY+SHYDAFNPAEYGPQ SQR+ASFLLYL+DV EGGETMFPFENG  +DS Y+++ 
Sbjct: 179 KVGQKYNSHYDAFNPAEYGPQESQRVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFED 238

Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           CIGLK+KP +GDGLLFYSLFPNGTID TSLHGSCPVIKGEKWVATKWIR+Q  ++
Sbjct: 239 CIGLKIKPLKGDGLLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWIREQLHYD 293


>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 297

 Score =  367 bits (941), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 183/293 (62%), Positives = 217/293 (74%), Gaps = 13/293 (4%)

Query: 4   KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR------DVPSIRPKLRTLEVVEKENE 57
           K  R   KL LP+  L+C FFFLAGF  S+  S       D+           ++EK  +
Sbjct: 7   KGNRRMNKLDLPSVFLICIFFFLAGFSASSLFSHSQKHEYDLRLRPRPRPRARLLEKSTQ 66

Query: 58  SG-----LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
                  L  G++GD  +  IPFQVLSW PRALYFPNFASAEQC+SII  A+  LK S L
Sbjct: 67  EKTEYHLLKAGDSGDDYVTLIPFQVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTL 126

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
           ALR+GET ESTKG RTSSG F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+
Sbjct: 127 ALRKGETEESTKGIRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEV 186

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
           GQKY+SHYDAF+ AEYGP  SQR+ASFLLYL+DV EGGETMFP+ENG   D   + + CI
Sbjct: 187 GQKYNSHYDAFDEAEYGPLQSQRVASFLLYLTDVPEGGETMFPYENGFNRDG--NVEDCI 244

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           GL+V+PR+GD LLFYSL PNGTID+TS HGSCPVIKGEKWVATKWIR+Q Q +
Sbjct: 245 GLRVRPRKGDALLFYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWIRNQVQDD 297


>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
 gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
          Length = 292

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 173/290 (59%), Positives = 212/290 (73%), Gaps = 9/290 (3%)

Query: 1   MKGKAK--RSSTKLGLPTALLLCSFFFLAGFYGSTF------LSRDVPSIRPKLRTLEVV 52
           MKG+++      ++  P  L+LC   FL G++   F          +PS          +
Sbjct: 1   MKGRSRGRVCGHQVSAPLVLVLCVLAFLVGYFIPEFQQVILVTKHSIPSFSSFANQRHEL 60

Query: 53  EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
             E+ +   HG TGD  +  IPFQVLSW PRAL FP FAS  QC++II+ AK +L PS L
Sbjct: 61  -LEDVTVAEHGVTGDDQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSL 119

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
           ALR+GET   T+  RTS G F+S+ +DKTG L  +E K+A+ATM+P++HGEAFNVLRYEI
Sbjct: 120 ALRKGETATETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEI 179

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
           GQKY+SHYD FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN   ++  YDYK+CI
Sbjct: 180 GQKYNSHYDVFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECI 239

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           GLKVKP++GD LLFYS+FPNGT D+T+LHGSCPVIKGEKWVATKWIRD+E
Sbjct: 240 GLKVKPKQGDALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKE 289


>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score =  354 bits (909), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 172/282 (60%), Positives = 209/282 (74%), Gaps = 9/282 (3%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVP--SIRPKLRTLEVVEKENESG-------LP 61
           KL LP  LL C+   LAG+  S F  + +   +I    R     E E   G       L 
Sbjct: 17  KLSLPQVLLWCTICLLAGYAASNFFPQKIEEEAIYQPYRKSAQQEGEFPFGEFSEKVVLD 76

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           HG TGD  I  IPFQVLSW+PRAL +P FAS EQC++I+  A+ RL PS LALR+GE+ +
Sbjct: 77  HGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALRKGESED 136

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           STK  RTSSGTF+ A ED T  LE +E K+A+ATM+P+ +GEAFNVL+Y +GQKYD HYD
Sbjct: 137 STKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYD 196

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
            F+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN   ++ G+DYKKCIG+KVKPR+G
Sbjct: 197 VFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCIGMKVKPRQG 256

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           D LLFYS+ PNGT D+++LHGSCPVIKGEKWVATKWIR+ ++
Sbjct: 257 DALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWIRNTDK 298


>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
 gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
          Length = 231

 Score =  349 bits (895), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 158/221 (71%), Positives = 186/221 (84%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           HG TG+  +  IPFQVLSW PRAL FP FAS  QC++II+ AK +L PS LALR+GET  
Sbjct: 8   HGVTGEDQLAFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 67

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            T+  RTS G F+S+ +DKTG L  +E K+A+ATM+P++HGEAFNVLRYEIGQKY+SHYD
Sbjct: 68  ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 127

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
            FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFEN   ++  YDYK+CIGLKVKP++G
Sbjct: 128 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECIGLKVKPKQG 187

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           D LLFYS+FPNGT D+T+LHGSCPVIKGEKWVATKWIRD+E
Sbjct: 188 DALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKE 228


>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
 gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
          Length = 257

 Score =  343 bits (880), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 158/206 (76%), Positives = 182/206 (88%)

Query: 54  KENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           +E  S +PHG TG+ SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LA
Sbjct: 6   EEEASSMPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALA 65

Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
           LR+GET E+TKGTRTSSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+G
Sbjct: 66  LRKGETAENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELG 125

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
           QKYDSHYD FNP EYGPQ SQR+ASFLLYLSDVEEGGETMFPFENG  +  GYDYK+CIG
Sbjct: 126 QKYDSHYDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCIG 185

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTS 259
           LKVKPR+GDGLLFYS+FPNGTID+ +
Sbjct: 186 LKVKPRKGDGLLFYSVFPNGTIDQVN 211


>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score =  317 bits (812), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 157/272 (57%), Positives = 196/272 (72%), Gaps = 12/272 (4%)

Query: 11  KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
           KLG P   L C FF  + GF     LS+ +     ++ T   V +E ES L HG     S
Sbjct: 9   KLGFPYVFLACCFFLAIFGFCFFNLLSQIISF--SEISTRRSVNEETES-LDHG-----S 60

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           + +IPF  LSW PR  Y PNFA+ +QC+++I  AK +LKPS LALR+GET E+T+  RT 
Sbjct: 61  VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTR 120

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
                   ED++GIL  IE KIA AT +P  + E+FN+LRY++GQKYDSHYDAF+PAEYG
Sbjct: 121 ---LKKTDEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYG 177

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
           PQ+SQR+ +F+L+LS VEEGGETMFPFENG  ++  YDY+ CIGL+VKPR+GD + FY+L
Sbjct: 178 PQISQRVVTFILFLSSVEEGGETMFPFENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNL 237

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            PN TID+TSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 238 LPNRTIDQTSLHGSCPVIKGEKWVATKWIRDQ 269


>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 274

 Score =  311 bits (798), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 154/272 (56%), Positives = 194/272 (71%), Gaps = 10/272 (3%)

Query: 11  KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
           KL  P   + C FF  + GF      S+ + S      T   V  E +S L HG    +S
Sbjct: 9   KLVFPYVFIACCFFLAIFGFCFFNLFSQGI-SFSEIPTTRRSVNDETDS-LDHG----SS 62

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           + +IPF  LSW PR  Y PNFA+ +QC+++I  AK +LKPS LALR+GET E+T+  R+ 
Sbjct: 63  VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSL 122

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
                   ED++G+L  IE KIA AT  P+ + E+FN+LRY++GQKYDSHYDAF+ AEYG
Sbjct: 123 HQ---HTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYG 179

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
           P +SQR+ +FLL+LS VEEGGETMFPFENG  ++  YDY+KC+GLKVKPR+GD + FY+L
Sbjct: 180 PLISQRVVTFLLFLSSVEEGGETMFPFENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNL 239

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           FPNGTID+TSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 240 FPNGTIDQTSLHGSCPVIKGEKWVATKWIRDQ 271


>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
 gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
          Length = 175

 Score =  300 bits (767), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 142/179 (79%), Positives = 156/179 (87%), Gaps = 9/179 (5%)

Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
           AK +LKPS LALR+GET EST         FI  SEDKTG L+ IE KIA+ATM+PQ+HG
Sbjct: 2   AKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSHG 52

Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL 222
           EAFN+LRYEIGQKYDSHYDAFNP EYGPQ SQR+ASFLLYLS VEEGGETMFPFENG  +
Sbjct: 53  EAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAV 112

Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            SG++YK+C+GLKVKPR+GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ
Sbjct: 113 SSGFEYKQCVGLKVKPRQGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 171


>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
          Length = 341

 Score =  276 bits (707), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 129/222 (58%), Positives = 168/222 (75%), Gaps = 3/222 (1%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G  GD  +  + FQ+LS  PR++ + NFAS   C +I+  A+ RL  S LAL++GET+E+
Sbjct: 116 GALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLET 175

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
           TK  RTSSGTF+++  +++G L+ +E K+ARAT +P THGEA+N+LRYEIGQKYDSHYD 
Sbjct: 176 TKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDM 235

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC-IGLKVKPR 239
           F+P++YGPQ SQR+ASFLLYL+  +EGGET+FP E  NG++   G DY  C  GLKVKPR
Sbjct: 236 FDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSCEAGLKVKPR 295

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GD LLF+S+ PN T DR+SLHG CPVI G K+VATKWI D 
Sbjct: 296 KGDALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIHDN 337


>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
          Length = 282

 Score =  264 bits (674), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 126/192 (65%), Positives = 144/192 (75%), Gaps = 33/192 (17%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE----------------------- 163
           R  SG FISASEDKTG L+LIE KIAR  M+P+THGE                       
Sbjct: 91  RLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVMK 150

Query: 164 ----------AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
                     AFN+LRYEIGQ+Y+SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETM
Sbjct: 151 RFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETM 210

Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
           FPFENG+ +D  YD+++CIGLKVKP +GDGLLFYS+FPNGTID TSLHGSCPVIKGEKWV
Sbjct: 211 FPFENGLNMDKDYDFQRCIGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGEKWV 270

Query: 274 ATKWIRDQEQHE 285
           ATKWIRDQEQ +
Sbjct: 271 ATKWIRDQEQDD 282


>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 119/223 (53%), Positives = 151/223 (67%), Gaps = 5/223 (2%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           E+G+      P Q++SW PR + +P F   E+C+  +  AK RL PS LALR  E  + T
Sbjct: 98  ESGNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQET 157

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
           +  RTS GTF+S  +D  G++  +E K A+ T LP +HGE FNVLRY+ GQ YDSHYD F
Sbjct: 158 ENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIF 217

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD----SGYDYKKC-IGLKVKP 238
            P  YGPQ SQR+A+ L YL+DVEEGGET+FP E     D    +G++YK C  G K KP
Sbjct: 218 EPESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTTGFKYKP 277

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           R GD L+FYS+ PNGT D+ +LHG CPV+ GEKWVATKWIRD+
Sbjct: 278 RMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDK 320


>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
 gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
          Length = 231

 Score =  234 bits (597), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 117/226 (51%), Positives = 154/226 (68%), Gaps = 5/226 (2%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP  E+G  ++  IPFQ+LSW PR + FP F    + + +I  A K + PS LA R GET
Sbjct: 2   LPAAESGSDNVYVIPFQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGET 61

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
           V+ ++ TRTS+GTF++A+ D  G+L  +E +IA AT+LP  +GEAFNVL YE  Q YDSH
Sbjct: 62  VDPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSH 121

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLK 235
           YD F+P E+GPQ SQR+A+ LLYLS+V EGGET+F  E G+  ++    D++ C     K
Sbjct: 122 YDTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKRE-GVDGENRVIGDWRNCDDGSFK 180

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
             PR GD +LF+   PNG ID  +LHG CPV +GEKWVATKWIR +
Sbjct: 181 YMPRMGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWIRSR 226


>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
          Length = 334

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 118/225 (52%), Positives = 148/225 (65%), Gaps = 15/225 (6%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           L HGE+G +     P Q+LS  PRA   P F S +QC  +IA A++RL PS LA + G+T
Sbjct: 116 LEHGESGHSFYTVQPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDT 175

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            E+T+             ED  G+L  IE K+A  TM+P  HGE FNVLRYE  Q YDSH
Sbjct: 176 AENTR------------DEDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSH 223

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC-IGLKV 236
           YD+F+  EYGPQ SQR+A+ LLYL+DVEEGGET+F  E   G+      DYK C  G+KV
Sbjct: 224 YDSFSEEEYGPQFSQRIATVLLYLADVEEGGETVFLLEGKGGLARLERIDYKACDTGIKV 283

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           KPR+GD LLF+S+  NGT+D+ SLHG CPV+ G KW  TKWIR++
Sbjct: 284 KPRQGDALLFFSVSVNGTLDKHSLHGGCPVVAGTKWAMTKWIRNR 328


>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 231

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 117/225 (52%), Positives = 147/225 (65%), Gaps = 3/225 (1%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP   +G      IPFQ+LSW PR + FP F    + + I+  A K + PS LA R GE 
Sbjct: 2   LPAAASGSDVTYIIPFQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQ 61

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
           VES++ TRTS+GTF+S+  D  G+L  +E +IA AT+LP  +GEAFNVL YE  Q YDSH
Sbjct: 62  VESSQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSH 121

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY-DYKKCI--GLKV 236
            D+F+P ++GPQ SQR+A+ LLYLS+V EGGET+F  E     D    D++ C     K 
Sbjct: 122 MDSFDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKY 181

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            PR GD +LF+   PNG ID  SLHG CPV KGEKWVATKWIR +
Sbjct: 182 APRMGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWIRSR 226


>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 120

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 100/120 (83%), Positives = 112/120 (93%), Gaps = 1/120 (0%)

Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD 223
           A+NVLRYE+GQKY+SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+EN   +D
Sbjct: 1   AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDN-ID 59

Query: 224 SGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           S YDY +CIGLKVKPR+GDGLLFYSLF NGTID TS+HGSCPVIKGEKWVATKWIR++EQ
Sbjct: 60  SNYDYVQCIGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNEEQ 119


>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
 gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
          Length = 429

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 106/224 (47%), Positives = 145/224 (64%), Gaps = 7/224 (3%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G++G+A  ++IPFQ+LS  PR   FPNF    + + IIA A K + PS LA R GE VE+
Sbjct: 203 GDSGEAFYRTIPFQILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEA 262

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
            +  RTS GTF+    D +  L  +E KIA  T +P+ +GE +NVL Y+  Q YDSH D+
Sbjct: 263 EQQVRTSKGTFLGG--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDS 320

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFENGIFLDSGY-DYKKCI---GLKVK 237
           F+P EYG Q SQR+A+ ++ LSD    GGET+F  E    +D    ++  C    GL+ K
Sbjct: 321 FDPKEYGQQYSQRIATVIVVLSDEGLVGGETVFKREGKANIDKPITNWTDCDADGGLRYK 380

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           PR GD +LF+S FP+G +D+ +LHGSCPV+ G KWVA KWIR++
Sbjct: 381 PRAGDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWIRNK 424


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score =  190 bits (483), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 102/222 (45%), Positives = 140/222 (63%), Gaps = 14/222 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           G+ GD  ++ I     SW PRA  + NF S E+C  +I+ AK  ++ S +   + G+ VE
Sbjct: 32  GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 86

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            +   RTSSG F++  +DK  I+  IE +IA  T +P  HGE   +L YE+GQKYD+HYD
Sbjct: 87  DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 142

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVK 237
            F+      ++ QR+A+ L+YLSDVEEGGET+FP   G F    +  +  KC   GL VK
Sbjct: 143 FFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 202

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ GD LLF+S+ P+ T+D TSLHG+CPVI+G KW  TKWI 
Sbjct: 203 PKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIH 244


>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 278

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 138/212 (65%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           Q++SW PRA  + NF + ++C+ +I TAK  ++ S +   + G++ +S+   RTSSGTF+
Sbjct: 68  QIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSS--VRTSSGTFL 125

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  I+  IE +IA  T +P  +GE+FNVLRYE+GQKYD H D F          Q
Sbjct: 126 DRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNTVNGGQ 183

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G      +  +   C   GL +KP+ GD LLF+S+ 
Sbjct: 184 RIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSIKPKMGDALLFWSMK 243

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+GT+D +SLHG+CPVIKG+KW  TKW+R  E
Sbjct: 244 PDGTLDPSSLHGACPVIKGDKWSCTKWMRINE 275


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/212 (47%), Positives = 132/212 (62%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 79  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE KIA  T +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YL+DVEEGGET+FP   G F +  +  +   C   GL +KP+RGD LLF+S+ 
Sbjct: 195 RIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMK 254

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ T+D +SLHG CPVIKG KW +TKWIR  E
Sbjct: 255 PDATLDASSLHGGCPVIKGNKWSSTKWIRVNE 286


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score =  188 bits (477), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 104/219 (47%), Positives = 139/219 (63%), Gaps = 10/219 (4%)

Query: 67  DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           D S QSI  P +V  LSW+PRA  +  F SA +C  ++  AK +L+ S +A  + G++V 
Sbjct: 30  DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 89

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG F+S  +D+  ++  IE +IA  T LP+ +GEA  VLRYE G+KY+ HYD
Sbjct: 90  SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 145

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF-ENGIFLDSGYDYKKCIGLKVKPRR 240
            F+          R+A+ L+YLSDV +GGET+FP  E+    D  +      G+ VKPR+
Sbjct: 146 YFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRK 205

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           GD LLFYSL P+ T D +SLHG CPVI+GEKW ATKWI 
Sbjct: 206 GDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIH 244


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 138/222 (62%), Gaps = 14/222 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           G+ GD  ++ I     SW PRA  + NF S E+C  +I+ AK  ++ S +   + G+ VE
Sbjct: 67  GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 121

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            +   RTSSG F++  +DK  I+  IE +IA  T +P  HGE   +L YE+GQKYD+HYD
Sbjct: 122 DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVK 237
            F       +  QR+A+ L+YLSDVEEGGET+FP   G F    +  +  KC   GL VK
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 237

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ GD LLF+S+ P+ T+D TSLHG+CPVI+G KW  TKW+ 
Sbjct: 238 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 114/285 (40%), Positives = 163/285 (57%), Gaps = 19/285 (6%)

Query: 4   KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR-PKLRTLEVVEKENESGLPH 62
           K K S+  L  P +L+ C   FL  F   TFL   + ++R PKL  L        + + H
Sbjct: 3   KFKHSNVGLRKP-SLITCWTLFLTLFVTFTFLILIILTLRIPKLNHL--------NSITH 53

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVE 121
             T          Q++SW PRA  + NF + E+C+ +I  AK  +  S+ +  + G+++ 
Sbjct: 54  SNTLRNDDNKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLN 113

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S+   RTSSGTF+    D+  I+  IE +IA  T +P  HGE+FNVL YE+GQKY+ HYD
Sbjct: 114 SS--IRTSSGTFLDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYD 169

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVK 237
            F          QR+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL +K
Sbjct: 170 YFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIK 229

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ G+ +LF+S+ P+ T+D +SLHG+CPVIKG+KW   KW+   E
Sbjct: 230 PKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADE 274


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 101/222 (45%), Positives = 139/222 (62%), Gaps = 14/222 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVE 121
           G+ GD  ++ I     SW PRA  + NF S E+C  +I+ AK  ++ S +   + GE+V+
Sbjct: 67  GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVD 121

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG F++  +DK  I+  IE +IA  T +P  HGE   +L YE+GQKYD+HYD
Sbjct: 122 SR--VRTSSGMFLNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVK 237
            F       +  QR+A+ L+YLSDVEEGGET+FP   G F    +  +  +C   GL VK
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVK 237

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ GD LLF+S+ P+ T+D TSLHG+CPVI+G KW  TKW+ 
Sbjct: 238 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMH 279


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score =  187 bits (474), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 136/212 (64%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           ++LSW PRA  + NF S E+C+ +I  AK ++ K S +  + G++ ES    RTSSG F+
Sbjct: 80  EILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESR--VRTSSGMFL 137

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I++ IE +IA  T +P+ +GE   +L YE+GQKY+ HYD F          Q
Sbjct: 138 KRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYFLDEFNTKNGGQ 195

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP  N  F    +  D  +C   GL VKP+ GD LLF+S+ 
Sbjct: 196 RIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLFWSMR 255

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ T+D +SLHG CPVIKG KW +TKW+  +E
Sbjct: 256 PDATLDPSSLHGGCPVIKGNKWSSTKWMHLRE 287


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score =  186 bits (473), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 103/220 (46%), Positives = 138/220 (62%), Gaps = 11/220 (5%)

Query: 67  DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           D S QSI  P +V  LSW+PRA  +  F SA +C  ++  AK +L+ S +A  + G++V 
Sbjct: 16  DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 75

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG F+S  +D+  ++  IE +IA  T LP+ +GEA  VLRYE G+KY+ HYD
Sbjct: 76  SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 131

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF--ENGIFLDSGYDYKKCIGLKVKPR 239
            F+          R+A+ L+YLSD  +GGET+FP   E+    D  +      G+ VKPR
Sbjct: 132 YFHDKYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPR 191

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           +GD LLFYSL P+ T D +SLHG CPVI+GEKW ATKWI 
Sbjct: 192 KGDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIH 231


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score =  186 bits (472), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 101/208 (48%), Positives = 129/208 (62%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +VLSW PRA  + NF S E+C+ +I+ AK  +  S +     ET +S     RTSSGTF+
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I++ IE +IA  T +P  HGE   VL YE GQKY+ HYD F          Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP  N  F    +  +  +C   GL VKPR GD LLF+S+ 
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D TSLHG CPVI+G KW +TKWI
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWI 280


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 96/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 97  EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSR-VRTSSGMFLR 155

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I++ IE +IA  T +P  HGE   VL YE+GQKY+ H+D F+         QR
Sbjct: 156 RGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQR 213

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVE+GGET+FP        S +  +  +C   GL VKP+ GD LLF+S+ P
Sbjct: 214 IATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWSMKP 273

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +G++D TSLHG CPVIKG KW +TKW+R  E
Sbjct: 274 DGSMDSTSLHGGCPVIKGNKWSSTKWMRVHE 304


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 135/217 (62%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +       V+S  G       RTS
Sbjct: 96  EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF+   +DK  ++  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+     
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP    N   +    +  +C   G+ VKP+ GD LL
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 266

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+GT+D TSLHG CPVIKG+KW +TKWIR  E
Sbjct: 267 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 303


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/208 (48%), Positives = 129/208 (62%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +VLSW PRA  + NF S E+C+ +I+ AK  +  S +     ET +S     RTSSGTF+
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I++ IE +IA  T +P  HGE   VL YE GQKY+ HYD F          Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP  N  F    +  +  +C   GL VKPR GD LLF+S+ 
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D TSLHG CPVI+G KW +TKW+
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWM 280


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 96/212 (45%), Positives = 133/212 (62%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I+ AK  ++ S +     ET +S     RTSSGTF+
Sbjct: 77  EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DKT  +  IE +++  + +P  HGE   VL YE+GQKY+ H+D F          Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL VKP+RGD LLF+S+ 
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ ++D +SLHG CPVIKG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWVRVEE 284


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 100/217 (46%), Positives = 135/217 (62%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +       V+S  G       RTS
Sbjct: 90  EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF+   +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+     
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP    N   +    +  +C   G+ VKP+ GD LL
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALL 260

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+GT+D TSLHG CPVIKG+KW +TKWIR  E
Sbjct: 261 FWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHE 297


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 98/206 (47%), Positives = 133/206 (64%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S  +C  ++  AK RL+ S +A    G++V S    RTSSGTF++ 
Sbjct: 34  LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQ--VRTSSGTFLNK 91

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VL YE+GQKYD+H+D F+          R+
Sbjct: 92  HEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKLGGHRV 149

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+DV++GGET+FP   G  L    + + +C   GL VKPR+GD LLF+SL  N 
Sbjct: 150 ATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINA 209

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D +SLHGSCPVI+GEKW ATKWI 
Sbjct: 210 TTDPSSLHGSCPVIEGEKWSATKWIH 235


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 99/208 (47%), Positives = 129/208 (62%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +VLSW PRA  + NF S E+C+ +I+ AK  +  S +     ET +S     RTSSGTF+
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I++ IE +IA  T +P  HGE   +L YE GQKY+ HYD F          Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP  N  F    +  +  +C   GL VKPR GD LLF+S+ 
Sbjct: 193 RMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMR 252

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D TSLHG CPVI+G KW +TKW+
Sbjct: 253 PDATLDPTSLHGGCPVIRGNKWSSTKWM 280


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score =  184 bits (467), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 132/212 (62%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 80  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE +IA  + +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YL+DVEEGGET+FP   G F    +  +  +C   GL +KP+RGD LLF+S+ 
Sbjct: 196 RIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMK 255

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ T+D +SLHG CPVIKG KW +TKW+R  E
Sbjct: 256 PDATLDPSSLHGGCPVIKGNKWSSTKWMRVSE 287


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 96/212 (45%), Positives = 133/212 (62%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I+ AK  ++ S +     ET +S     RTSSGTF+
Sbjct: 77  EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DKT  +  IE +++  + +P  HGE   VL YE+GQKY+ H+D F          Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL VKP+RGD LLF+S+ 
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLFWSMK 252

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ ++D +SLHG CPVIKG KW ATKW+R +E
Sbjct: 253 PDASLDPSSLHGGCPVIKGNKWSATKWMRVEE 284


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + +SW+PRA  F NF S+E+C  +I  A+  +K S +   Q    + ++  RTSSGTF+ 
Sbjct: 47  ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR-VRTSSGTFLR 105

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  I+  IE +IA+ T +P+ HGE   VL YE+GQKYD+H+D F+         QR
Sbjct: 106 RGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQR 163

Query: 196 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP    N   +    +  +C   G+ VKPR+GD LLF+S+ P
Sbjct: 164 VATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSP 223

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +  +D  SLHG CPVIKG KW ATKW+  +E
Sbjct: 224 DAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 129/212 (60%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  ++ S +     ET  S     RTSSGTF+
Sbjct: 77  EVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVV--DSETGRSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S   DK   +  IE +IA  + +P  HGE   VL YE+GQKY+ H+D FN         Q
Sbjct: 135 SRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFNDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G F    +  +  +C   GL VKP  GD LLF+S+ 
Sbjct: 193 RVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLFWSMK 252

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ T+D +SLHG CPVI G KW ATKW+R  E
Sbjct: 253 PDATLDPSSLHGGCPVINGNKWSATKWMRVNE 284


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 95/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + +SW+PRA  F NF S+E+C  +I  A+  +K S +   Q    + ++  RTSSGTF+ 
Sbjct: 47  ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR-VRTSSGTFLR 105

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  I+  IE +IA+ T +P+ HGE   VL YE+GQKYD+H+D F+         QR
Sbjct: 106 RGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQR 163

Query: 196 LASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP    N   +    +  +C   G+ VKPR+GD LLF+S+ P
Sbjct: 164 VATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSP 223

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +  +D  SLHG CPVIKG KW ATKW+  +E
Sbjct: 224 DAELDPFSLHGGCPVIKGNKWSATKWMHLRE 254


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 95/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+         QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP        S +  +  +C   GL VKP+ GD LLF+S+ P
Sbjct: 228 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 287

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +G++D TSLHG CPVIKG KW +TKW+R  E
Sbjct: 288 DGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 318


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 95/211 (45%), Positives = 131/211 (62%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++SW PRA  + NF S E+C+ +I+ AK  +K S +   +    + ++  RTSSG F+ 
Sbjct: 78  EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR-VRTSSGMFLR 136

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              DK  I+  IE +IA  T +P  HGE   VL YE+GQKYD+HYD F          QR
Sbjct: 137 RGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP     F    +  +  +C   GL VKP+ GD LLF+S+ P
Sbjct: 195 IATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGDALLFWSMRP 254

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           + T+D +SLHG CPVIKG KW +TKW+  +E
Sbjct: 255 DATLDPSSLHGGCPVIKGNKWSSTKWMHVEE 285


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  ++ S +       V+S+ G       RTS
Sbjct: 78  EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF++  +DK  I+  IE +++  T LP  HGE   +L YE+GQKY+ HYD F      
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALL 248

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ ++D +SLHG CPVIKG KW +TKWIR  E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 133/217 (61%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  ++ S +       V+S+ G       RTS
Sbjct: 78  EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF++  +DK  I+  IE +++  T LP  HGE   +L YE+GQKY+ HYD F      
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL VKP+ GD LL
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALL 248

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ ++D +SLHG CPVIKG KW +TKWIR  E
Sbjct: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 96/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +SWRPRA  + NF + E+C   I  AK +L+ S +A  + G++VES    RTSSG F   
Sbjct: 65  ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  ++  +E +IA  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDS-GYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLSDVE+GGET+FP        + G D+  C   G  VKPR+GD LLF+SL P+ 
Sbjct: 181 ATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGDALLFFSLHPDA 240

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
           T D  SLHGSCPVI+GEKW ATKWI
Sbjct: 241 TTDPLSLHGSCPVIEGEKWSATKWI 265


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/211 (45%), Positives = 132/211 (62%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 12  EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 70

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+         QR
Sbjct: 71  RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 128

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP        S +  +  +C   GL VKP+ GD LLF+S+ P
Sbjct: 129 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 188

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +G++D TSLHG CPVIKG KW +TKW+R  E
Sbjct: 189 DGSLDATSLHGGCPVIKGNKWSSTKWMRVHE 219


>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 326

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 132/208 (63%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           Q++SW PRA  + NF + E+C+ +I  AK  +  S +   + G  V+S +  RTSSG F+
Sbjct: 116 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAFL 173

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  I++ IE +IA  T +P  HGE FNVL YE+GQKY+ HYD F          Q
Sbjct: 174 KRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQ 231

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL +KP+ G+ +LF+S+ 
Sbjct: 232 RIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMK 291

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D +SLHG+CPVIKG+KW+  KW+
Sbjct: 292 PDATLDPSSLHGACPVIKGDKWLCAKWM 319


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 131/217 (60%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +       V+S  G       RTS
Sbjct: 97  EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+   +DK  I+  IE +IA  T +P   GE   VL YE+GQKY+ H+D F+     
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVE+GGET+FP        S +  +  +C   GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 267

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+G++D TSLHG CPVIKG KW +TKW+R  E
Sbjct: 268 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 304


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 132/212 (62%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 79  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE KI+  T +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 137 ARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENG--IFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YL+DVEEGGET+FP   G   F+    +  +C   GL +KP+RGD LLF+S+ 
Sbjct: 195 RIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMK 254

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ ++D +SLHG CPVIKG KW +TKW+R  E
Sbjct: 255 PDASLDPSSLHGGCPVIKGNKWSSTKWMRVSE 286


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 96/206 (46%), Positives = 129/206 (62%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S  +C  +I  AK  ++ S +A    G+++ S    RTSSG F++ 
Sbjct: 38  LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQ--VRTSSGAFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYEIGQKYD+H+D F+         QR 
Sbjct: 96  HEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRF 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+DV++GGET+FP   G  L   D  +      GL VKP++GD LLF+ L  N 
Sbjct: 154 ATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLNA 213

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D +SLHGSCPVI+GEKW ATKWI 
Sbjct: 214 TTDTSSLHGSCPVIEGEKWSATKWIH 239


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 131/206 (63%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S ++C  ++  AK R++ S +A    G+++ S    RTSSGTF+S 
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+  IE ++A  T LP+ + E+  +L YE+GQKYD+H+D F+      +   R+
Sbjct: 98  HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+DV++GGET+FP   G  L   D  +      GL VKP++GD LLF+SL  N 
Sbjct: 156 ATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNA 215

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWIH 241


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 98/217 (45%), Positives = 131/217 (60%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +       V+S  G       RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+   +DK  I+  IE +IA  T +P   GE   VL YE+GQKY+ H+D F+     
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVE+GGET+FP        S +  +  +C   GL VKP+ GD LL
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALL 324

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+G++D TSLHG CPVIKG KW +TKW+R  E
Sbjct: 325 FWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHE 361


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 95/205 (46%), Positives = 131/205 (63%), Gaps = 8/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S ++C  ++  AK R++ S +A    G+++ S    RTSSGTF+S 
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+  IE ++A  T LP+ + E+  +L YE+GQKYD+H+D F+      +   R+
Sbjct: 98  HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+DV++GGET+FP   G  L   D  +      GL VKP++GD LLF+SL  N 
Sbjct: 156 ATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDALLFFSLHVNA 215

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
           T D  SLHGSCPVI+GEKW ATKWI
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWI 240


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score =  180 bits (456), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 133/212 (62%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +++SW PRA  + NF + E+C+ +I  AK  ++ S +   + G++ +S    RTSSGTF+
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD F          Q
Sbjct: 137 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G +    +  +  +C   GL VKP+ GD LLF+S+ 
Sbjct: 195 RIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMT 254

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ T+D +SLHG C VIKG KW +TKW+R  E
Sbjct: 255 PDATLDPSSLHGGCAVIKGNKWSSTKWLRVHE 286


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 129/217 (59%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK R++ S +       V+ST G       RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP  N     L    +  +C   GL VKP+ GD LL
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALL 278

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ T+D  SLHG CPVI+G KW +TKW+   E
Sbjct: 279 FWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGE 315


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 128/206 (62%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PRA  +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T +P+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G       D + +C   G  VKPR+GD LLF++L PN 
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPV++GEKW AT+WI 
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 130/206 (63%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PRA  +  F S E+C  +I  A+ +L+ S +A  + G+++ES    RTSSG FI+ 
Sbjct: 52  LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  I+  IE +IA  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G       D +  C   G  VKP +GD LLF+SL P+ 
Sbjct: 168 ATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDA 227

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 228 TTDSDSLHGSCPVIEGEKWSATKWIH 253


>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 279

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 129/209 (61%), Gaps = 9/209 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFI 134
           +++SW PR   + NF + E+C+ +I  AK  ++ S +     G++V S+   RTSSGTFI
Sbjct: 71  EIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSS--ARTSSGTFI 128

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  IL  IE +IA  T +P  HGE  N+L YE+GQKYD H D F          +
Sbjct: 129 DRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHTDYFEDEVNTKHGGE 186

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G F    +  +   C   GL +KP+ G+ +LF+ + 
Sbjct: 187 RIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSIKPKMGNAILFWGMK 246

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ T+D  S+HG+CPVIKG+KW  TKW+R
Sbjct: 247 PDATVDPLSVHGACPVIKGDKWSCTKWMR 275


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score =  177 bits (450), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 95/205 (46%), Positives = 127/205 (61%), Gaps = 8/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 75  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G       D + +C   G  VKPR+GD LLF++L PN 
Sbjct: 191 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 250

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
           T D  SLHGSCPV++GEKW AT+WI
Sbjct: 251 TTDSNSLHGSCPVVEGEKWSATRWI 275


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score =  177 bits (449), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G       D + +C   G  VKPR+GD LLF++L PN 
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPV++GEKW AT+WI 
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G       D + +C   G  VKPR+GD LLF++L PN 
Sbjct: 175 ATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNLHPNA 234

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPV++GEKW AT+WI 
Sbjct: 235 TTDSNSLHGSCPVVEGEKWSATRWIH 260


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 96/207 (46%), Positives = 126/207 (60%), Gaps = 9/207 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  +  F + E+C  +I+ AK  LK S +A  +    ++++  RTSSG FI  +
Sbjct: 36  ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSE-VRTSSGMFIPKA 94

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRYE GQKY+ HYD F       +   RLA
Sbjct: 95  KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152

Query: 198 SFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
           + L+YL++VE+GGET+FP          +  D         G+ VKPR+GD LLFYSL P
Sbjct: 153 TVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPRKGDALLFYSLHP 212

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
           N T D  SLHG CPVI+GEKW ATKWI
Sbjct: 213 NATPDPLSLHGGCPVIQGEKWSATKWI 239


>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
          Length = 308

 Score =  177 bits (448), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 96/207 (46%), Positives = 131/207 (63%), Gaps = 10/207 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PRA  +  F S E+C  ++  A+ +L+ S +A  + G+++ES    RTSSG FI  
Sbjct: 49  LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           S+D+  I++ IE +IA  T LPQ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL----DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
            + L+YLS+V +GGET+FP   G  +    DS  D  K  G  VKP++GD LLF+SL P+
Sbjct: 165 VTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCAK-NGYAVKPQKGDALLFFSLHPD 223

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
            T D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 224 ATTDTNSLHGSCPVIEGEKWSATKWIH 250


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score =  177 bits (448), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 97/217 (44%), Positives = 127/217 (58%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C  +I  AK  +  S +       V+ST G       RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP    N   L    +  +C   GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ T+D  SLHG CPVIKG KW +TKW+  +E
Sbjct: 271 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVRE 307


>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 289

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 129/207 (62%), Gaps = 9/207 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFIS 135
           LSW PRA  +  F S E+C  +I  AK +L+ S +      GE+++S +  RTSSG F++
Sbjct: 35  LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D   I+  +E K+A  T LP+ +GEA  +L YE GQKYD H+D +   E       R
Sbjct: 93  KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 252
           +A+ L+YLS+V +GGET+FP   G       D + +C   G  VKPR+GD LLF++L PN
Sbjct: 151 IATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDALLFFNLHPN 210

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
            T D TSLHGSCPVI+GEKW AT+WI 
Sbjct: 211 ATTDPTSLHGSCPVIEGEKWSATRWIH 237


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score =  176 bits (447), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 96/205 (46%), Positives = 130/205 (63%), Gaps = 8/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  I++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+         QR+
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+DV++GGET+FP   G  L   D  +      GL VKP++GD LLF++L  N 
Sbjct: 154 ATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNA 213

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
           T D  SLHGSCPVI+GEKW ATKWI
Sbjct: 214 TADTGSLHGSCPVIEGEKWSATKWI 238


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score =  176 bits (447), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 90/211 (42%), Positives = 130/211 (61%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++SW PRA    NF + ++C  +I  A   ++ S +   Q      ++  RTSSG F++
Sbjct: 3   EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 61

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE KIA+ T +P+ HGE   VL YE GQKYD+H+D F          QR
Sbjct: 62  RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 119

Query: 196 LASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YL+DVEEGGET+FP   +N   L       +C   G+ V+P+RGD LLF+S+ P
Sbjct: 120 IATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSP 179

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +  +D +SLHG CPVIKG+KW ATKW+R  E
Sbjct: 180 DAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 210


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score =  176 bits (447), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 90/211 (42%), Positives = 130/211 (61%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++SW PRA    NF + ++C  +I  A   ++ S +   Q      ++  RTSSG F++
Sbjct: 15  EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 73

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE KIA+ T +P+ HGE   VL YE GQKYD+H+D F          QR
Sbjct: 74  RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 131

Query: 196 LASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YL+DVEEGGET+FP   +N   L       +C   G+ V+P+RGD LLF+S+ P
Sbjct: 132 IATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSP 191

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +  +D +SLHG CPVIKG+KW ATKW+R  E
Sbjct: 192 DAQLDHSSLHGGCPVIKGDKWSATKWMRVSE 222


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 97/222 (43%), Positives = 131/222 (59%), Gaps = 7/222 (3%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           + +   +++ +   + +SW+PRA  +  F S E+C  +I+ AK  LK S +A        
Sbjct: 24  YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            ++  RTSSG FI   +D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+HYD
Sbjct: 84  LSE-VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYD 140

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVK 237
            F       +   R+A+ L+YLSDV +GGET+FP    +   L +  D  +C   G+ VK
Sbjct: 141 YFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECARKGIAVK 200

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           PR+GD LLF+SL P    D  SLHG CPVI+GEKW ATKWI 
Sbjct: 201 PRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIH 242


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 95/208 (45%), Positives = 126/208 (60%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF S E+C+ +I  AK R+  S +     ET +S     RTSSG F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVV--DSETGKSKDSRVRTSSGMFL 162

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F          Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220

Query: 195 RLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSD+EEGGET+FP  N     L    +  +C   GL VKP+ GD LLF+S+ 
Sbjct: 221 RMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMK 280

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D  SLHG CPVIKG KW +TKW+
Sbjct: 281 PDATLDPLSLHGGCPVIKGNKWSSTKWL 308


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 128/217 (58%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 98  EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++ +IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP  N     L    +  +C   GL VKP+ GD LL
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALL 268

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ T+D  SLHG CPVI+G KW +TKW+   E
Sbjct: 269 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 305


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 91/211 (43%), Positives = 130/211 (61%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+SW+PRA  + NF +  +C+ +I  AK R++ S +        + +K  RTSSGTF+ 
Sbjct: 81  EVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSK-VRTSSGTFLP 139

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              DK  I+  IE +IA  + +P  HGE   +L YE+GQ+Y+ H+D F          QR
Sbjct: 140 RGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYFMDEYNTKNGGQR 197

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP   G      +  +  +C   GL VKP+ GD LLF+S+ P
Sbjct: 198 IATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLFWSMNP 257

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +G+ D +SLHG CPVI+G KW +TKW+R  E
Sbjct: 258 DGSPDPSSLHGGCPVIRGNKWSSTKWMRVNE 288


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 131/212 (61%), Gaps = 9/212 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           ++LSW PRA  + NF S E+C+ +I  AK  L K S +  + G++ ES    RTSSG F+
Sbjct: 70  EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I++ IE +IA  T +P  +GE   VL Y +G+KY+ HYD F          Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP     F    +  D  +C   GL +KP+ GD LLF+S+ 
Sbjct: 186 RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 245

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P+ T+D +SLHG CPVI G KW +TKW+  +E
Sbjct: 246 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 277


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/208 (45%), Positives = 129/208 (62%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +++SW PRA  + NF S E+C+ +IA AK  + K + +  + G + +S    RTSSG F+
Sbjct: 79  EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I+  IE +IA  + +P  HGE   VL YE+GQKY++HYD F          Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           R A+ L+YLSDVEEGGET+FP    N   + S  +  +C   GL VKP+ G+ LLF+S  
Sbjct: 195 RTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWSTR 254

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D  SLHGSCPVI+G KW ATKW+
Sbjct: 255 PDATLDPASLHGSCPVIRGNKWSATKWM 282


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 127/217 (58%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP  N     L    +  +C   GL VKP+ GD LL
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALL 267

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ T+D  SLHG CPVI+G KW +TKW+   E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 304


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 129/206 (62%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  +++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+         QR+
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+DV +GGET+FP   G  L   D  +      GL VKP++GD LLF++L  N 
Sbjct: 154 ATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNA 213

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 214 TADTGSLHGSCPVIEGEKWSATKWIH 239


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 96/208 (46%), Positives = 127/208 (61%), Gaps = 9/208 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW PRA  +  F +  +C  +I+ AK  LK S +A       + ++  RTSSG FI  +
Sbjct: 44  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE-VRTSSGAFIHKA 102

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+H+D F       +   R+A
Sbjct: 103 KDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMA 160

Query: 198 SFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
           + L+YLSDVE+GGET+FP     +     ++  D   C   G+ VKPR+GD LLF+SL P
Sbjct: 161 TVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHP 220

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           N   D +SLHG CPVI+GEKW ATKWIR
Sbjct: 221 NAIPDTSSLHGGCPVIEGEKWSATKWIR 248


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 100/220 (45%), Positives = 132/220 (60%), Gaps = 9/220 (4%)

Query: 65  TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
            G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  
Sbjct: 28  AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKL 87

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG FI  ++D   I+  +E KI+  T+LP+ +GE   VLRYE GQKYD HYD
Sbjct: 88  SE--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYD 143

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPR 239
            F       +   R+A+ L+YL+DV +GGET+FP       ++  D  +C   G+ VKPR
Sbjct: 144 YFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPR 203

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           RGD LLF+SL+PN   D  SLH  CPVI+GEKW ATKWI 
Sbjct: 204 RGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 243


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 102/253 (40%), Positives = 145/253 (57%), Gaps = 10/253 (3%)

Query: 33  TFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNF 90
            FL  ++     +L  L+   K     +     G +S++  P +V  LSW PRA  +  F
Sbjct: 12  CFLCFEISVSSIRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGF 71

Query: 91  ASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEH 149
            S E+C  +I  AK +L+ S +A  + G+++ S    RTSSG F++ ++D+  I+  IE 
Sbjct: 72  LSDEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLNKAQDE--IVAGIEA 127

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
           +IA  T LP  +GE+  +L YE GQKY+ H+D F+          R+A+ L+YLSDVE+G
Sbjct: 128 RIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKG 187

Query: 210 GETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 266
           GET+FP      L   D  +      G  VKPR+GD LLF+SL  + + D  SLHGSCPV
Sbjct: 188 GETIFPNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPV 247

Query: 267 IKGEKWVATKWIR 279
           I+GEKW ATKWI 
Sbjct: 248 IEGEKWSATKWIH 260


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 134/208 (64%), Gaps = 11/208 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
           +SWRPRA  +  F +  +C  ++A A++  L+ S +  RQ G++V S    RTSSGTF++
Sbjct: 41  VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSE--VRTSSGTFLA 98

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG--PQMS 193
             +D+  ++  IE +IA  T+LPQ +GE+  VLRYE GQKY+ H D    A  G   +  
Sbjct: 99  KKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGG 156

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-KCI--GLKVKPRRGDGLLFYSLF 250
            R+A+ L+YLSDV+ GGET+FP  +   L    D + +C   G  VKP +GD +LF+SL 
Sbjct: 157 HRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSECARRGYAVKPVKGDAVLFFSLH 216

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           PNGT DR SLHG CPVI+GEKW ATKWI
Sbjct: 217 PNGTTDRDSLHGGCPVIEGEKWSATKWI 244


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 130/208 (62%), Gaps = 10/208 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  +  F + E+C  +I+ AK  LK S +A  +    + ++  RTSSG FIS +
Sbjct: 40  ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSE-VRTSSGAFISKA 98

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I++ IE K+A  T LP  +GE   VLRYE GQKY++H+D F+      +   R A
Sbjct: 99  KD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYA 156

Query: 198 SFLLYLSDVEEGGETMFPF-----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           + L+YLS+VE+GG+T+FP           + +  D  +C   G+ VKPR+GD LLF+SL 
Sbjct: 157 TVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGISVKPRKGDALLFFSLT 216

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P  T D+ SLHG CPVI+GEKW ATKWI
Sbjct: 217 PTATPDQLSLHGGCPVIEGEKWSATKWI 244


>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
 gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
          Length = 309

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 93/205 (45%), Positives = 127/205 (61%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA     F S E+C+ IIA AK R+ K S +    G++V+S    RTS+G +++ 
Sbjct: 57  LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
            ED+  I+  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P    P+   Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+ VEEGGET+ P  +      G+      GL VKP +GD L+FYSL P+G+
Sbjct: 173 RVVTVLMYLTTVEEGGETVLPHADQKVSGEGWSECAKRGLAVKPVKGDALMFYSLKPDGS 232

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHGSCP +KG+KW ATKWI 
Sbjct: 233 NDPASLHGSCPTLKGDKWSATKWIH 257


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 104/276 (37%), Positives = 159/276 (57%), Gaps = 18/276 (6%)

Query: 16  TALLLCSF--FFLAGFYGSTFLSRDVPSIRPKLRT--LEVVEKENESGLPHGETGDASIQ 71
           T L+L  F    L G    +  + +  S RP   T  ++ +E+    G      GD  ++
Sbjct: 22  TVLILGLFVILILVGLGILSLPNTNKSSSRPMDLTTIVQTIEERESYGDEEDGNGDRWLE 81

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSS 130
                V+SW PRA  + NF + E+C+ +I+ AK  +  S++  ++ G++++S    RTSS
Sbjct: 82  -----VISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSS 134

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           GTF+    D+  I+E IE++I+  T +P  +GE   VL YE+GQKY+ H+D F       
Sbjct: 135 GTFLKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVR 192

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLF 246
           +  QR+A+ L+YLSDV+EGGET+FP   G   D  +  +  +C   GL V P++ D LLF
Sbjct: 193 KGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSVLPKKRDALLF 252

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +S+ P+ ++D +SLHG CPVIKG KW +TKW    E
Sbjct: 253 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 126/217 (58%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP  N     L    +   C   GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ T+D  SLHG CPVIKG KW +TKW+   E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 129/206 (62%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F + E+C  +I  AK +L+ S +A  + G+++ S    RTSSG F+  
Sbjct: 58  LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSE--VRTSSGMFLQK 115

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   ++  IE +IA  T LP  +GEA  +L YE GQKY+ H+D F+          R+
Sbjct: 116 AQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRI 173

Query: 197 ASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VEEGGET+FP  E  + L +      C   G  VKP++GD LLF+SL P+ 
Sbjct: 174 ATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFSLHPDA 233

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           + D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 234 STDSLSLHGSCPVIEGEKWSATKWIH 259


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 99/224 (44%), Positives = 132/224 (58%), Gaps = 11/224 (4%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETV 120
           + +   +++ +   + +SW+PRA  +  F S E+C  +I+ AK  LK S +A    G++ 
Sbjct: 24  YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
            S    RTSSG FI   +D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+HY
Sbjct: 84  LSE--VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHY 139

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF----LDSGYDYKKCI--GL 234
           D F       +   R+A+ L+YLSDV +GGET+FP          L +  D  +C   G+
Sbjct: 140 DYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSECARKGI 199

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
            VKPR+GD LLF+SL P    D  SLHG CPVI+GEKW ATKWI
Sbjct: 200 AVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWI 243


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 89/213 (41%), Positives = 137/213 (64%), Gaps = 9/213 (4%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 133
            +V+SW PRA  + NF + E+C+ +I+ AK  +  S++  ++ G++++S    RTSSGTF
Sbjct: 80  LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTF 137

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           ++   D+  I+E IE++I+  T +P  +GE   VL YE+GQ+Y+ H+D F       +  
Sbjct: 138 LNRGHDE--IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGG 195

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSL 249
           QR+A+ L+YLSDV+EGGET+FP   G   D  +  +  +C   GL V P++ D LLF+S+
Sbjct: 196 QRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSM 255

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
            P+ ++D +SLHG CPVIKG KW +TKW    E
Sbjct: 256 KPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHE 288


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score =  173 bits (439), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 133/209 (63%), Gaps = 9/209 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VLSW PRA  + NF + ++C+ +I  AK  +  S +   + G +++S    RTSSG F+
Sbjct: 92  EVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWFL 149

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +  +DK  I+  IE +IA  + +P  HGE  +VL YE+ QKYD+HYD F+         Q
Sbjct: 150 NRGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQ 207

Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLF 250
           R A+ L+YLSDVE+GGET+FP    N   +    +  +C   GL V+P+ GD LLF+S+ 
Sbjct: 208 RGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVK 267

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ ++D +SLHGSCPVI+G KW ATKW+R
Sbjct: 268 PDASLDPSSLHGSCPVIQGNKWSATKWMR 296


>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +I  AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 43  LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ 
Sbjct: 159 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDA 218

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 219 TTDSESLHGSCPVIEGQKWSATKWIH 244


>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
          Length = 233

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 25  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 83  GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+ VEEGGET+ P         G+      GL VKP +GD L+FYSL P+G+
Sbjct: 141 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 200

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHGSCP +KG+KW ATKWI 
Sbjct: 201 NDPASLHGSCPTLKGDKWSATKWIH 225


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 96/217 (44%), Positives = 125/217 (57%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP  N     L    +   C   GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P  T+D  SLHG CPVIKG KW +TKW+   E
Sbjct: 268 FWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 93/205 (45%), Positives = 129/205 (62%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  + NF S  +C  +I+ AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 9   LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMFLMK 66

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE +IA  T LP+ +GEA  VLRY+ G+KY+ H+D F+          R+
Sbjct: 67  GQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRI 124

Query: 197 ASFLLYLSDVEEGGETMFPF--ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           A+ L+YLSDV +GGET+FP   + G   D  +      G+ VKPR+GD LLF+SL P+  
Sbjct: 125 ATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSAV 184

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D +SLH  CPVI+GEKW ATKWI 
Sbjct: 185 PDESSLHTGCPVIEGEKWSATKWIH 209


>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 308

 Score =  173 bits (438), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 125/204 (61%), Gaps = 5/204 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW PRA  +P+F S ++   +++ A+  LK S +A       + ++  RTSSGTFIS  
Sbjct: 54  ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSE-VRTSSGTFISKG 112

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F  +        R+A
Sbjct: 113 KDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVA 170

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 255
           + LLYL+DV EGGET+FP   G          +C   G+ VKPR+GD LLF++L P+   
Sbjct: 171 TVLLYLTDVAEGGETVFPLAKGRKGSHHKGLSECAQKGIAVKPRKGDALLFFNLRPDAAT 230

Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
           D TSLHG C VIKGEKW ATKWIR
Sbjct: 231 DPTSLHGGCEVIKGEKWSATKWIR 254


>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
 gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
          Length = 224

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 16  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 74  GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+ VEEGGET+ P         G+      GL VKP +GD L+FYSL P+G+
Sbjct: 132 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 191

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHGSCP +KG+KW ATKWI 
Sbjct: 192 NDPASLHGSCPTLKGDKWSATKWIH 216


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 146/260 (56%), Gaps = 12/260 (4%)

Query: 32  STFLSRDVPSIRPKLRTLEVVEKENESGLPHG----ETGDASIQSIPF-QVLSWRPRALY 86
           S   +R  PS     ++L V    + S + H     E+GD   ++  + + +SW PRA  
Sbjct: 7   SRISTRKSPSSTLIRKSLNVHFPNDLSSIAHNSKIHESGDDEGKAEQWVEAISWEPRAFI 66

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + NF +  +C  +I  AK  ++ S +        + ++  RTSSGTF+    DK  I+  
Sbjct: 67  YHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSR-VRTSSGTFLPRGRDK--IIRD 123

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDV 206
           IE +IA  + +P  HGE   +L YE+GQKY+ H+D F          QR+A+ L+YLSDV
Sbjct: 124 IEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFDYFMDDYNTENGGQRIATVLMYLSDV 183

Query: 207 EEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           EEGGET+FP   G      +  +  +C   GL VKP+ GD LLF+S+ P+ ++D +SLHG
Sbjct: 184 EEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLFWSMKPDASLDPSSLHG 243

Query: 263 SCPVIKGEKWVATKWIRDQE 282
            CPVI+G KW +TKW+R  E
Sbjct: 244 GCPVIRGNKWSSTKWMRVNE 263


>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
          Length = 225

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 17  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 75  GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+ VEEGGET+ P         G+      GL VKP +GD L+FYSL P+G+
Sbjct: 133 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 192

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHGSCP +KG+KW ATKWI 
Sbjct: 193 NDPASLHGSCPTLKGDKWSATKWIH 217


>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 97/220 (44%), Positives = 137/220 (62%), Gaps = 17/220 (7%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK-GTRTS 129
           +S P  + + RP+A  F NF + E+C+ +IA AK +L PS +    G+  +STK G RTS
Sbjct: 225 RSPPLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGK--KSTKSGIRTS 282

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NP 185
           +G F++  + +T  + ++E ++A A  LP+ +GE   +LRYE GQKYD HYD F    NP
Sbjct: 283 AGMFLT--KGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINP 340

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI--FLDSGYD--YKKCI--GLKVKPR 239
           +    +  QR+A+ L+YL D EEGGET+FP       F D   D  +  C   GL VK +
Sbjct: 341 SPN--RGGQRMATMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGAFSDCAKRGLPVKSK 398

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           RGD +LF+SL  +  +D  SLHG+CPV++GEKW A KWIR
Sbjct: 399 RGDAVLFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIR 438


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 105/272 (38%), Positives = 150/272 (55%), Gaps = 22/272 (8%)

Query: 14  LPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI 73
           L ++L LC            FL  ++     +L  L+   K     +     G +S++  
Sbjct: 6   LASSLFLC------------FLCFEISVSSLRLPGLDQDAKATHGSVLRLNRGGSSVKFD 53

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSS 130
           P +V  LSW PRA  +  F S E+C  +I  AK +L+ S +A    G+++ S    RTSS
Sbjct: 54  PTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSD--IRTSS 111

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           G F++ ++D+  I+  IE +IA  T LP  +GE+  +L YE GQKY+ H+D F+      
Sbjct: 112 GMFLNKAQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYFHDKANQV 169

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFY 247
               R+A+ L+YLSDVE+GGET+FP      L   D  +      G  VKP++GD LLF+
Sbjct: 170 MGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQKGDALLFF 229

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           SL  + + D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 230 SLHLDASTDTKSLHGSCPVIEGEKWSATKWIH 261


>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 297

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 125/205 (60%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 45  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+ VEEGGET+ P         G+      GL VKP +GD L+FYSL P+G+
Sbjct: 161 RVVTMLMYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALMFYSLKPDGS 220

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHGSCP +KG+KW ATKWI 
Sbjct: 221 NDPASLHGSCPTLKGDKWSATKWIH 245


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 96/220 (43%), Positives = 132/220 (60%), Gaps = 9/220 (4%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
           T  + I     + +SW+PRA  +  F +  +C  +I+ AK  LK S +A  +    + ++
Sbjct: 29  TSSSIINPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE 88

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTSSG FI+ ++D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD HYD F+
Sbjct: 89  -VRTSSGMFITKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFS 145

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKP 238
                 +   R+A+ L+YL+DVE+GGET+FP    +       S  D  +C   G+ VKP
Sbjct: 146 DKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKP 205

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           RRGD LLF+SL+P    D +S+H  CPVI+GEKW ATKWI
Sbjct: 206 RRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWI 245


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 94/206 (45%), Positives = 128/206 (62%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S E+C  +I  AK +L+ S +A  + G+++ S    RTSSG F+  
Sbjct: 57  LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLLK 114

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  I+  IE +IA  T LP  +GE+  +L YE G+KY+ H+D F+          R+
Sbjct: 115 AQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRI 172

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YL+ VEEGGET+FP   G F     D +  C   G  V P++GD LLF+SL P+ 
Sbjct: 173 ATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDALLFFSLHPDA 232

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D +SLHGSCPVI GEKW ATKWI 
Sbjct: 233 TTDPSSLHGSCPVIAGEKWSATKWIH 258


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 93/206 (45%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
           LSWRPRA     F    +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+
Sbjct: 96  KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ 
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 214 TTDSDSLHGSCPVIEGQKWSATKWIH 239


>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 319

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 99/222 (44%), Positives = 141/222 (63%), Gaps = 11/222 (4%)

Query: 64  ETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           +TG +++   P +V  LS +PRA  +  F SAE+CQ +I +AK +L  S +A   G++V 
Sbjct: 47  KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVT 106

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S +  RTS+G F+  ++D+  I+  IE +IA  T LP  +GE   +LRYE GQKY+ H+D
Sbjct: 107 SKE--RTSTGMFLHKAQDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFD 162

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL--DSGYDYKKC--IGLKVK 237
            F           R+A+ L+YLS+VE+GGET+FP  + + L  +   D  +C  +G  V+
Sbjct: 163 FFQDPGNIAIGGHRIATILMYLSNVEKGGETVFP-NSPVKLSEEEKADLSECGKVGYGVR 221

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ GD LLF+S+ PN T D TS HGSCPVI+GEKW ATKWI 
Sbjct: 222 PKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIH 263


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 129/206 (62%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R+
Sbjct: 97  KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G  L   D+ +      G  VKP +GD LLF+SL P+ 
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 214

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 215 TTDSDSLHGSCPVIEGQKWSATKWIH 240


>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 300

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 98/230 (42%), Positives = 135/230 (58%), Gaps = 11/230 (4%)

Query: 57  ESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ 116
           ES   +  +  A++     + +SW+PRA  +  F +  +C  +++ A+  LK S++A   
Sbjct: 21  ESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADND 80

Query: 117 -GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
            G++  ST   RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQK
Sbjct: 81  SGKSKLST--VRTSSGMFISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQK 136

Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY----DYKKC 231
           Y+SHYD F           RLA+ L+YLS+V +GGET+FP          Y    D  +C
Sbjct: 137 YESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSEC 196

Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
              G+ VKP++GD LLF+SL PN   D  SLHG CPV++GEKW ATKWI 
Sbjct: 197 AKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIH 246


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 91/226 (40%), Positives = 132/226 (58%), Gaps = 9/226 (3%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETV 120
           H E  +    ++  +V+SW+PRA  + NF + E+C+ +I  A   ++ S +A  Q G++V
Sbjct: 61  HIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV 120

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
                 R S+G F+   +D+  I+  IE +IA  T +P  +GE   V+ YE+GQ YD HY
Sbjct: 121 --VHDVRKSTGAFLDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHY 176

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKV 236
           D F          QR+A+ L+YLS+VEEGGETMFP     F    +  +   C  +GL +
Sbjct: 177 DYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSI 236

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           KP+ GD LLF+S+ PN T+D  +LH +CPVIKG KW  TKW+   E
Sbjct: 237 KPKMGDALLFWSMKPNATLDALTLHSACPVIKGNKWSCTKWMHPTE 282


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 97/206 (47%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F S E+C  +I  AK +L  S +A  + GE++ES +  RTSSG FI  
Sbjct: 21  LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           +ED+  I+  IE +IA  T LP+ +GE   +LRYE GQKY++H D F       +   R 
Sbjct: 79  TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136

Query: 197 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLSDV++GGET+FP    E     D  +      G  VKP +GD LLF+SL P+ 
Sbjct: 137 ATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCAKKGYAVKPNKGDALLFFSLHPDA 196

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLH SCPVI+GEKW ATKWI 
Sbjct: 197 TPDPGSLHASCPVIEGEKWSATKWIH 222


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 95/217 (43%), Positives = 126/217 (58%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    +K  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEN--GIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP  N     L    +   C   GL VKP+ GD LL
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALL 267

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           F+S+ P+ T+D  SLHG CPVIKG KW +TKW+   E
Sbjct: 268 FWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 304


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 101/253 (39%), Positives = 144/253 (56%), Gaps = 10/253 (3%)

Query: 33  TFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNF 90
            FL  ++     +L  L+   K     +     G +S++  P +V  LSW PRA  +  F
Sbjct: 12  CFLCFEISVSSIRLPGLDQDAKATHGSVLRLNRGGSSVKFDPTRVTQLSWSPRAFLYKGF 71

Query: 91  ASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEH 149
            S E+C  +I  AK +L+ S +A  + G+++ S    RTSSG F++ ++D+  I+  IE 
Sbjct: 72  LSDEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLNKAQDE--IVAGIEA 127

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
           +IA  T LP  +GE+  +L YE GQKY+ H+D F+          R+A+ L+YLSDVE+G
Sbjct: 128 RIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKG 187

Query: 210 GETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 266
           GET+F       L   D  +      G  VKPR+GD LLF+SL  + + D  SLHGSCPV
Sbjct: 188 GETIFSNAKAKLLQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPV 247

Query: 267 IKGEKWVATKWIR 279
           I+GEKW ATKWI 
Sbjct: 248 IEGEKWSATKWIH 260


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 95/209 (45%), Positives = 125/209 (59%), Gaps = 10/209 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW PRA  +  F +  +C  +I+ AK  LK S +A       + ++  RTSSG FI  +
Sbjct: 44  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE-VRTSSGAFIHKA 102

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+H+D F       +   R+A
Sbjct: 103 KDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMA 160

Query: 198 SFLLYLSDVEEGGETMFPFENG-----IFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           + L+YLSDVE+GGET+F             ++  D   C   G+ VKPR+GD LLF+SL 
Sbjct: 161 TVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLH 220

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           PN   D +SLHG CPVI+GEKW ATKWIR
Sbjct: 221 PNAIPDTSSLHGGCPVIEGEKWSATKWIR 249


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F +  +C+ +I+ AK +L+ S +A  + G++V S    RTSSG F+  
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLSDV +GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ 
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDA 223

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 224 TTDSDSLHGSCPVIEGQKWSATKWIH 249


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 127/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F +  +C+ +I+ AK +L+ S +A  + G++V S    RTSSG F+  
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLSDV +GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ 
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDA 223

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 224 TTDSDSLHGSCPVIEGQKWSATKWIH 249


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 127/207 (61%), Gaps = 9/207 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS 135
           LSW PRA  +  F S E+C  +I  AK +L+ S +   +  GE+ +S    RTSSG F++
Sbjct: 35  LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D   I+  +E K+A  T LP+ +GEA  +L YE GQKYD H+D F   +       R
Sbjct: 93  KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 252
           +A+ L+YLS+V +GGET+FP   G       D + KC   G  VKPR+GD LLF++L  N
Sbjct: 151 IATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLN 210

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
           GT D  SLHGSCPVI+GEKW AT+WI 
Sbjct: 211 GTTDPNSLHGSCPVIEGEKWSATRWIH 237


>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
          Length = 287

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/205 (44%), Positives = 131/205 (63%), Gaps = 9/205 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
           +SWRPRA  + NF S E+C+ +   A+KRL K + +  + G++++ST   RTSSGTF++ 
Sbjct: 41  VSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTFLAR 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            ED+  ++  IE +I+  TM+P+ +GEA  +L+Y  GQKY+ H D F+  +Y  +     
Sbjct: 99  GEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTENGG 155

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           QR+A+ L+YLS  EEGGET+FP+        G+      GL VK  +G  LLFYSL PNG
Sbjct: 156 QRVATILMYLSTPEEGGETVFPYAEKKVEGEGWSECARKGLAVKAVKGSALLFYSLKPNG 215

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
             D+ S HGSCP + GEKW AT+WI
Sbjct: 216 EEDQASTHGSCPTLAGEKWSATRWI 240


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
           LSWRPRA     F    +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+
Sbjct: 96  KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ 
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCP I+G+KW ATKWI 
Sbjct: 214 TTDSDSLHGSCPAIEGQKWSATKWIH 239


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 101/224 (45%), Positives = 131/224 (58%), Gaps = 13/224 (5%)

Query: 65  TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
            G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  
Sbjct: 28  AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKL 87

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG FI  ++D   I+  +E KI+  T+LP+ +GE   VLRYE GQKYD HYD
Sbjct: 88  SE--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYD 143

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY----DYKKCI--GLK 235
            F       +   R+A+ L+YL+DV +GGET+FP         G     D  +C   G+ 
Sbjct: 144 YFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIA 203

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           VKPRRGD LLF+SL+PN   D  SLH  CPVI+GEKW ATKWI 
Sbjct: 204 VKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIH 247


>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
 gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
          Length = 308

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 93/205 (45%), Positives = 126/205 (61%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +SW+PR   + +F S ++   +I+ A+  LK S +A    G++  S    RTSSGTF+  
Sbjct: 54  ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSD--VRTSSGTFLRK 111

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R 
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRY 169

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 254
           A+ LLYL+DV EGGET+FP    +       + +C   G+ VKPR+GD LLF++L P+GT
Sbjct: 170 ATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQKGIAVKPRKGDALLFFNLKPDGT 229

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHG C VI+GEKW ATKWIR
Sbjct: 230 TDPVSLHGGCAVIRGEKWSATKWIR 254


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score =  170 bits (431), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 128/208 (61%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           ++LSW PRA  + NF S E+C+ +I  AK   +K + +  + G + +S    RTSSG F+
Sbjct: 78  EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++  IE +IA  + +P  HGE   VL YE+GQKY++H+D F          Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLF 250
           R A+ L+YLSDVEEGGET+FP  N       +  +  +C   GL +KP+ G+ LLF+S  
Sbjct: 194 RTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTR 253

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D +SLHGSCPVI+G KW ATKW+
Sbjct: 254 PDATLDPSSLHGSCPVIRGNKWSATKWM 281


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score =  170 bits (430), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 99/240 (41%), Positives = 138/240 (57%), Gaps = 13/240 (5%)

Query: 49  LEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
           + ++ + + +G  +  +  A I     + +SW+PRA  +  F +  +C  +I+ AK  LK
Sbjct: 15  VSLICESHGAGSSYAGSASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELK 74

Query: 109 PSQLALR-QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
            S +A    G++  S    RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   V
Sbjct: 75  RSAVADNLSGDSKLSD--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQV 130

Query: 168 LRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP------FENGIF 221
           LRYE GQKYD HYD F       +   R+A+ L+YL++V  GGET+FP      F     
Sbjct: 131 LRYEHGQKYDPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRG 190

Query: 222 LDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            ++  D  +C   G+ VKPRRGD LLF+SL+PN   D  SLH  CPVI+GEKW ATKWI 
Sbjct: 191 SETIDDLSECAKKGIAVKPRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIH 250


>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 253

 Score =  169 bits (429), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 94/206 (45%), Positives = 126/206 (61%), Gaps = 9/206 (4%)

Query: 79  SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFISA 136
           SW PRA  +  F S E+C  +I  AK +L+ S +   +  GE+ +S    RTSSG F++ 
Sbjct: 1   SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GEA  +L YE GQKYD H+D F   +       R+
Sbjct: 59  RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+V +GGET+FP   G       D + KC   G  VKPR+GD LLF++L  NG
Sbjct: 117 ATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKGDALLFFNLHLNG 176

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+GEKW AT+WI 
Sbjct: 177 TTDPNSLHGSCPVIEGEKWSATRWIH 202


>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 296

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 91/213 (42%), Positives = 129/213 (60%), Gaps = 9/213 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           +++SW PR   + NF + E+C+ +I  AK  ++ S +   + G ++ES    RTSSGTF+
Sbjct: 86  EIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTFL 143

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE++IA  T +P  +GE   VL Y++G+KY  H+D F           
Sbjct: 144 ARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGD 201

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G F  +    +   C   GL +KP+  + LLF+S+ 
Sbjct: 202 RIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCGKKGLSIKPKMRNALLFWSIK 261

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           P+ T D  SLHGSCPVIKG KW +TKWIR  E 
Sbjct: 262 PDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEH 294


>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
           C-169]
          Length = 347

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 93/210 (44%), Positives = 126/210 (60%), Gaps = 19/210 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +SW PRA     F    +C+ +I+ AK  +  S +     G++++ST   RTS+GTF   
Sbjct: 86  VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGP 190
            ED+  +++ IE +I+  T LP+ +GE   +L YE GQKY++H+D F+      P   G 
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGG- 200

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 248
              QR+A+ L+YL+  EEGGET+FP        +G  + +C   G  VK RRGD LLFYS
Sbjct: 201 ---QRIATVLMYLTTAEEGGETVFPMAANKV--TGPQWSECARGGAAVKSRRGDALLFYS 255

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           L PNG  D TSLHGSCP  KGEKW ATKWI
Sbjct: 256 LLPNGETDPTSLHGSCPTTKGEKWSATKWI 285


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 98/219 (44%), Positives = 128/219 (58%), Gaps = 11/219 (5%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
           A I     + +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S    
Sbjct: 33  AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F   
Sbjct: 91  RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFEN----GIFLDSGYDYKKC--IGLKVKPRR 240
               +   R+A+ L+YL++V +GGET+FP           ++  D  +C   G+ VKPRR
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGKKGVAVKPRR 208

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           GD LLF+SL PN   D  SLH  CPVI+GEKW ATKWI 
Sbjct: 209 GDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 247


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/231 (41%), Positives = 137/231 (59%), Gaps = 10/231 (4%)

Query: 55  ENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
           E  + +  G    A ++  P +V  LSW PRA  + NF + E+C  +I  +K +L+ S +
Sbjct: 29  EEGNKITRGSVFGAKVKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMV 88

Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
           A  + G++++S    RTSSG F++  +D+  I+  IE +IA  T LP  +GE+  VL Y 
Sbjct: 89  ADNESGKSIQSE--VRTSSGMFLNKQQDE--IVSGIEARIAAWTFLPVENGESMQVLHYM 144

Query: 172 IGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF---LDSGYDY 228
            G+KY+ H+D F+          R+A+ L+YLS+VE+GGET+FP   G      D  +  
Sbjct: 145 NGEKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSE 204

Query: 229 KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
               G  VKPR+GD LLF+SL  + T D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 205 CAHKGYAVKPRKGDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATKWIH 255


>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
          Length = 564

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 94/216 (43%), Positives = 135/216 (62%), Gaps = 14/216 (6%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P  + + +P+A  F NF SAE+C  ++  AK  L PS +    G +V ST   RTS+G F
Sbjct: 278 PLVLSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAGMF 335

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM 192
           +  + DKT  LE IE++IA A+  P+ +GE   +LRY++GQKYD H+D F+ A    P+ 
Sbjct: 336 LRKAADKT--LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKR 393

Query: 193 S-QRLASFLLYLSDVEEGGETMFP----FENGIFLDSG--YDYKKCI--GLKVKPRRGDG 243
             QR+A+ L+YL + +EGGET+FP     E     + G  +++ +C   GL VK  +GD 
Sbjct: 394 GGQRMATMLIYLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECTKHGLPVKSVKGDA 453

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           LLF+SL  +  +D  SLHG+CPV+KG+KW A KWIR
Sbjct: 454 LLFWSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIR 489


>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
          Length = 839

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 130/222 (58%), Gaps = 13/222 (5%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 567 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 626

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FI  ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 627 E--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 682

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI------GLKV 236
           F       +   R+A+ L+YL+DV +GGET+FP         G +  + +      G+ V
Sbjct: 683 FADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAV 742

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           KPRRGD LLF+SL+PN   D  SLH  CPVI+GEKW ATKWI
Sbjct: 743 KPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWI 784


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 93/207 (44%), Positives = 128/207 (61%), Gaps = 9/207 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F +  +C+ +I+ AK +L+ S +A  + G++V S    RTSSG F+  
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163

Query: 197 ASFLLYLSDVEEGGETMFP-FENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPN 252
           A+ L+YLSDV +GGET+FP  E G  L    D +  C   G  VKP +GD LLF+SL P+
Sbjct: 164 ATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPD 223

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIR 279
            T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 224 ATTDSDSLHGSCPVIEGQKWSATKWIH 250


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/206 (44%), Positives = 126/206 (61%), Gaps = 8/206 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +I  AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 38  LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 96  RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   G       +   +C   G  VKP +GD LLF+SL P+ 
Sbjct: 154 ATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDALLFFSLHPDA 213

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
           T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 214 TTDPDSLHGSCPVIEGQKWSATKWIH 239


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 98/221 (44%), Positives = 128/221 (57%), Gaps = 13/221 (5%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
           A I     + +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S    
Sbjct: 33  AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F   
Sbjct: 91  RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFEN------GIFLDSGYDYKKC--IGLKVKP 238
               +   R+A+ L+YL++V +GGET+FP             ++  D  +C   G+ VKP
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKP 208

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           RRGD LLF+SL PN   D  SLH  CPVI+GEKW ATKWI 
Sbjct: 209 RRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIH 249


>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
          Length = 319

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PR   + +F S ++   +++ A+  LK S +A       E +   RTSSGTFI  S
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F+      +   R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLF 250
           + L+YL+DV EGGET+FP     F +SG + +     +C   G+ VKPR+GD LLF++L 
Sbjct: 178 TVLMYLTDVAEGGETVFPLAEE-FTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLS 236

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ + D  SLH  CPVIKGEKW ATKWIR
Sbjct: 237 PDASKDSLSLHAGCPVIKGEKWSATKWIR 265


>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
           Group]
 gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
 gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 93/209 (44%), Positives = 129/209 (61%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PR   + +F S ++   +++ A+  LK S +A       E +   RTSSGTFI  S
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F+      +   R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLF 250
           + L+YL+DV EGGET+FP     F +SG + +     +C   G+ VKPR+GD LLF++L 
Sbjct: 178 TVLMYLTDVAEGGETVFPLAEE-FTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLS 236

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+ + D  SLH  CPVIKGEKW ATKWIR
Sbjct: 237 PDASKDSLSLHAGCPVIKGEKWSATKWIR 265


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 101/223 (45%), Positives = 131/223 (58%), Gaps = 13/223 (5%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 25  GSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 84

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   V RYE GQKYD HYD 
Sbjct: 85  D--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDY 140

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENG----IFLDSGYDYKKCI--GLKV 236
           F       +   R+A+ L+YL+DV +GGET+FP           ++  D  +C   G+ V
Sbjct: 141 FTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAV 200

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           KPRRGD LLF+SL  N T D +SLH  CPVI+GEKW ATKWI 
Sbjct: 201 KPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 243


>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
          Length = 303

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 91/217 (41%), Positives = 130/217 (59%), Gaps = 16/217 (7%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVES---------TKG 125
           +VLSW PRA+ + NF + E+C+ +I  AK  + K + +    G++ +S            
Sbjct: 82  EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F++  +DKT  +  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F  
Sbjct: 142 VRTSSGMFLNRGQDKT--IRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRG 241
                   QR+A+ L+YLSDVE+GGET+FP    N   +    +  +C   G+ V+PR G
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMG 259

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           D LLF+S+ P+  +D +SLH  CPVI+G+KW ATKWI
Sbjct: 260 DALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWI 296


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 128/208 (61%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +VLSW PRA  + +F + E+C  +I  A+  L K + +    G++ +S    RTSSGTF+
Sbjct: 4   EVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTFL 61

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   +++ IE +IA  T +P   GE   VL+Y+  +KY+ HYD F+ A       Q
Sbjct: 62  MRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQ 119

Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLS+VEEGGET+FP    N   +       +C   GL V+PR GD LLF+S+ 
Sbjct: 120 RIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMK 179

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+ T+D TSLHG CPVIKG KW ATKW+
Sbjct: 180 PDATLDSTSLHGGCPVIKGTKWSATKWL 207


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 101/225 (44%), Positives = 132/225 (58%), Gaps = 17/225 (7%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 26  GSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 85

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 86  D--VRTSSGMFISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 141

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF------ENGIFLDSGYDYKKCI--GL 234
           F       +   R+A+ L+YL++V +GGET+FP         G   ++  D  +C   G+
Sbjct: 142 FTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGT--ETSSDLSECAKKGI 199

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            VKP RGD LLF+SL  N T D +SLH  CPVI+GEKW ATKWI 
Sbjct: 200 AVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIH 244


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 94/208 (45%), Positives = 123/208 (59%), Gaps = 9/208 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  +  F +  +C  +I+ AK  LK S +A  +    + ++  RTSSG FI+  
Sbjct: 39  VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE-VRTSSGMFIAKG 97

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F       +   R+A
Sbjct: 98  KDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGHRMA 155

Query: 198 SFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
           + L+YLSDV +GGET+FP           +S  D  +C   G+ VKPRRGD LLF+SL P
Sbjct: 156 TVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALLFFSLHP 215

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
               D  SLH  CPVI+GEKW ATKWI 
Sbjct: 216 TAIPDPNSLHAGCPVIEGEKWSATKWIH 243


>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii
          Length = 233

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 91/205 (44%), Positives = 122/205 (59%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 25  LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMS-Q 194
            ED   ++  IE ++A+ T +P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 83  GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L YL+ VEEGGET+ P         G+      GL VKP +GD L FYSL P+G+
Sbjct: 141 RVVTXLXYLTTVEEGGETVLPNAEQKVTGDGWSECAKRGLAVKPIKGDALXFYSLKPDGS 200

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHGSCP +KG+KW ATKWI 
Sbjct: 201 NDPASLHGSCPTLKGDKWSATKWIH 225


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 91/193 (47%), Positives = 120/193 (62%), Gaps = 8/193 (4%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIE 148
           F S E+C  +IA  K +L+ S +A  + G++V S    RTSSG F+   +D+T  +  IE
Sbjct: 3   FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSE--IRTSSGMFLERRQDET--ITRIE 58

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
            +IA  T LP+ +GE   +L YE GQKYD+HYD F+          R+A+ L+YLSDV++
Sbjct: 59  KRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKK 118

Query: 209 GGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 265
           GGET+FP   G  L    D +  C   G  VKPR+GD LLF+S  PN T D  SLH SCP
Sbjct: 119 GGETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCP 178

Query: 266 VIKGEKWVATKWI 278
           VI+GEKW AT+WI
Sbjct: 179 VIEGEKWSATRWI 191


>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score =  166 bits (421), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 99/224 (44%), Positives = 130/224 (58%), Gaps = 13/224 (5%)

Query: 65  TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
            G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  
Sbjct: 28  AGSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKL 87

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG FI  ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD
Sbjct: 88  SE--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYD 143

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI------GLK 235
            F       +   R+A+ L+YL+DV +GGET+FP         G +  + +      G+ 
Sbjct: 144 YFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIA 203

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           VKPRRGD LLF+SL+PN   D  SLH  CPVI+GEKW AT+WI 
Sbjct: 204 VKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIH 247


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 85/211 (40%), Positives = 129/211 (61%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+SW PRA+ + NF S E+C+ +I  AK  +  S +   +    + ++  RTSSGTF+ 
Sbjct: 81  EVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              D+  ++E+IE +I+  T +P  +GE   VL Y++GQKY+ HYD F          QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDV++GGET+FP   G      +  +  KC   GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           + ++D +SLHG CPV+KG KW +TKW    E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288


>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 330

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/225 (40%), Positives = 133/225 (59%), Gaps = 22/225 (9%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           ++ P  + + +P+A    NF SAE+C  ++  AK+ L PS +    G++V S    RTS+
Sbjct: 38  ETPPLVLSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSA 95

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPA 186
           G F+   +DK  I++ IE +IAR +  P  +GE   +LRY++GQKYD H+D F    NPA
Sbjct: 96  GMFLRKGQDK--IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPA 153

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPF----------ENGIFLDSGYDYKKCI--GL 234
               +  QRLA+ L+YL D ++GGET FP           E      S  ++  C   G+
Sbjct: 154 PK--RGGQRLATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGI 211

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            VK  RGD +LF+S+  +G +DR SLHG+CPVI+G+KW A KWIR
Sbjct: 212 PVKSVRGDAILFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIR 256


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 84/211 (39%), Positives = 130/211 (61%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+SW PRA+ + NF + E+C+ +I+ AK  +  S +   +    + ++  RTSSGTF+ 
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              D+  ++E+IE +I+  T +P  +GE   VL Y++GQKY+ HYD F          QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDV++GGET+FP   G      +  +  KC   GL V P++ D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRDALLFWNMRP 257

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           + ++D +SLHG CPV+KG KW +TKW    E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288


>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
 gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
          Length = 310

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 128/208 (61%), Gaps = 10/208 (4%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
           ++SW+PR  ++  F S ++C  ++   K++LK S +A  + G++V S    RTSSG F+ 
Sbjct: 50  IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSE--VRTSSGMFLD 107

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D   ++  IE +IA  T+LPQ + E   +LRYE GQKYD H+D F       Q   R
Sbjct: 108 KQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGGHR 165

Query: 196 LASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
            A+ L YLS VE+GGET+FP    +E+    DS  D  K  GL VK  +GD +LF++L P
Sbjct: 166 YATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCAK-KGLAVKAVKGDSVLFFNLQP 224

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           +GT D  SLHGSCPVI+GEKW A KWI 
Sbjct: 225 DGTPDPLSLHGSCPVIEGEKWSAPKWIH 252


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 93/205 (45%), Positives = 124/205 (60%), Gaps = 7/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PR   + +F S ++   +I+ A+  LK S +A    G++  S    RTSSGTF+  
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R 
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGT 254
           A+ LLYL+DV EGGET+FP              +C   G+ V+PR+GD LLF++L P+GT
Sbjct: 170 ATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGT 229

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLHG CPVIKGEKW ATKWIR
Sbjct: 230 TDSVSLHGGCPVIKGEKWSATKWIR 254


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 91/210 (43%), Positives = 127/210 (60%), Gaps = 12/210 (5%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
           +VL+W PR +    F SAE+C  +IA A  RL  S +        R G  +ES    RTS
Sbjct: 61  EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 116

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           +G F+S  + +  +++ IE +IA  +M+P  +GE   VLRYE  Q Y  H+D F+     
Sbjct: 117 TGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 176

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
            +  QR+A+ L+YLSDVEEGGET+FP       + G + +K  GL VKPR+GD +LF+S 
Sbjct: 177 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 234

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             +G +D  SLHG C V++GEKW ATKW+R
Sbjct: 235 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 264


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 92/210 (43%), Positives = 126/210 (60%), Gaps = 12/210 (5%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
           +VL+W PR      F SAE+C  +IA A  RL  S +        R G  +ES    RTS
Sbjct: 60  EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 115

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           +G F+S  + +  ++E IE +IA  +M+P  +GE   VLRYE  Q Y  H+D F+     
Sbjct: 116 TGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 175

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
            +  QR+A+ L+YLSDVEEGGET+FP       + G + +K  GL VKPR+GD +LF+S 
Sbjct: 176 KRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRK--GLCVKPRKGDAILFWSA 233

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             +G +D  SLHG C V++GEKW ATKW+R
Sbjct: 234 ALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263


>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 186

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 86/184 (46%), Positives = 120/184 (65%), Gaps = 7/184 (3%)

Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
           + PS LA R GE  E+ +  RTS GTF+    D +  L  +E KIA  T+LP+T+GE +N
Sbjct: 1   MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58

Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFENGIFLDSG 225
           VL Y+  Q YDSH D+F+P EYGPQ SQR+A+ ++ LSD    GGET+F  E    ++  
Sbjct: 59  VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKP 118

Query: 226 Y-DYKKCI---GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
             ++  C    GLK KPR GD +LF+S  P+G +D  +LHGSCPV+ G KWVA KW+R++
Sbjct: 119 ISNWTDCDADGGLKYKPRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNK 178

Query: 282 EQHE 285
            +++
Sbjct: 179 GEYD 182


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 84/211 (39%), Positives = 129/211 (61%), Gaps = 7/211 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+SW PRA+ + NF + E+C+ +I+ AK  +  S +   +    + ++  RTSSGTF+ 
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR-VRTSSGTFLR 139

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              D+  ++E+IE +I+  T +P  +GE   VL Y++GQKY+ HYD F          QR
Sbjct: 140 RGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQR 197

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDV++GGET+FP   G      +  +  KC   GL V P+  D LLF+++ P
Sbjct: 198 IATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRDALLFWNMRP 257

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           + ++D +SLHG CPV+KG KW +TKW    E
Sbjct: 258 DASLDPSSLHGGCPVVKGNKWSSTKWFHVHE 288


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 36  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 96  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP   G       D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211

Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
              GL VKP +GD +LF+SL  +G  D  SLHGSCPVI+GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261


>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
          Length = 487

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 36  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 96  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP   G       D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211

Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
              GL VKP +GD +LF+SL  +G  D  SLHGSCPVI+GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261


>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 36  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 96  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP   G       D + +C
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 211

Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
              GL VKP +GD +LF+SL  +G  D  SLHGSCPVI+GEKW A KWIR
Sbjct: 212 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 261


>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
 gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
          Length = 313

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 94/230 (40%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 30  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 89

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 90  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 145

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKC 231
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP   G       D + +C
Sbjct: 146 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSEC 205

Query: 232 I--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
              GL VKP +GD +LF+SL  +G  D  SLHGSCPVI+GEKW A KWIR
Sbjct: 206 AQKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIR 255


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 124/207 (59%), Gaps = 8/207 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           + +SW PR   +  F S  +C  ++  AKK+++ S +A  + G++V+S    RTSSG F+
Sbjct: 46  KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   ++  IE +IA  T LPQ + E   VLRYE GQKY+ H+D F+      +   
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIF---LDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
           R A+ L+YLS V EGGET+FP   G      D+ +      GL VKP +GD +LF+SL  
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHA 221

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +GT D  SLHGSCPVI+GEKW A KWI
Sbjct: 222 DGTPDPLSLHGSCPVIRGEKWSAPKWI 248


>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
          Length = 267

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 5/208 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  + + A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S+ E K  +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YL+D  EGGET FP         G    K  GL VKP +GD +LF+S+  +G 
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVK--GLCVKPNKGDAVLFWSMGLDGE 237

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQE 282
            D  S+HG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265


>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
 gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
 gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
 gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score =  163 bits (412), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 5/208 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  + + A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S+ E K  +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YL+D  EGGET FP         G    K  GL VKP +GD +LF+S+  +G 
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGKMVK--GLCVKPNKGDAVLFWSMGLDGE 237

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQE 282
            D  S+HG CPV++GEKW ATKW+R +E
Sbjct: 238 TDSNSIHGGCPVLEGEKWSATKWMRQKE 265


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score =  163 bits (412), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 95/208 (45%), Positives = 122/208 (58%), Gaps = 11/208 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG FIS 
Sbjct: 39  VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSE--VRTSSGMFISK 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F       +   R+
Sbjct: 97  KKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGGHRM 154

Query: 197 ASFLLYLSDVEEGGETMFPFENGI----FLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ LLYL++V  GGET+FP          L++  D  +C   G+ VKPRRGD LLF+SL 
Sbjct: 155 ATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGDALLFFSLH 214

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
                D  SLH  CPVI+GEKW ATKWI
Sbjct: 215 TTAIPDTDSLHAGCPVIEGEKWSATKWI 242


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 127/208 (61%), Gaps = 11/208 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +++ AK  LK S +A    GE+  S    RTSSGTFIS 
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 98  GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155

Query: 197 ASFLLYLSDVEEGGETMFPF----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ L+YLS+V +GGET+FP        +  ++  D   C   G+ VKPR+GD LLF++L 
Sbjct: 156 ATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLH 215

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+   D  SLHG CPVI+GEKW ATKWI
Sbjct: 216 PDAIPDPLSLHGGCPVIEGEKWSATKWI 243


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/225 (41%), Positives = 132/225 (58%), Gaps = 13/225 (5%)

Query: 64  ETGDASIQSIPF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-G 117
           + GD    + PF     + +SW+PR   +  F S  +C  ++  AKK+++ S +A  Q G
Sbjct: 28  DGGDVVAPAPPFNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSG 87

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ++V S    RTSSG F++  +D   ++  IE +IA  T LPQ + E   +LRYE GQKY+
Sbjct: 88  KSVMSE--VRTSSGMFLNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRYEHGQKYE 143

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GL 234
            H+D F+      +   R A+ L+YLS V++GGET+FP   G       D + +C   GL
Sbjct: 144 PHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGL 203

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            VKP +GD +LF+SL  +G  D  SLHGSCPVI+GEKW A KWI 
Sbjct: 204 AVKPVKGDAVLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIH 248


>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
          Length = 322

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 94/222 (42%), Positives = 134/222 (60%), Gaps = 23/222 (10%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG----ETVESTKGTRTSS 130
            +++SW+PRAL    F +  +C  +I+ A+ RL+PS++  R G    ++V + +G  +SS
Sbjct: 15  IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGL-SSS 73

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-- 188
           GTF++  +D   ++  +E +I  AT LP +H E   VL+YE+GQKY +HYD     E   
Sbjct: 74  GTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQ 131

Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKC--IGLKVK 237
                G Q   R A+ L+YLSDVEEGGET FP  +G ++D G      Y +C   G+ VK
Sbjct: 132 LAIRRGEQGGSRYATMLMYLSDVEEGGETSFP--HGRWIDEGAQAQPPYSECGSRGVAVK 189

Query: 238 PRRGDGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 278
           PR+GD +LFYSL  +G + D  SLH  CPV KG K+ AT WI
Sbjct: 190 PRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWI 231


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 127/208 (61%), Gaps = 11/208 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +++ AK  LK S +A    GE+  S    RTSSGTFIS 
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 98  GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155

Query: 197 ASFLLYLSDVEEGGETMFPF----ENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ L+YLS+V +GGET+FP        +  ++  D   C   G+ VKPR+GD LLF++L 
Sbjct: 156 ATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLH 215

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+   D  SLHG CPVI+GEKW ATKWI
Sbjct: 216 PDAIPDPLSLHGGCPVIEGEKWSATKWI 243


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 125/205 (60%), Gaps = 8/205 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F S  +C  +I  AK +L+ S +A    G++V S    RTSSG F+  
Sbjct: 56  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  ++  +E +IA  T+LP  +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171

Query: 197 ASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ L+YLS+VE+GGET+FP   F+     D  +      G  VK ++GD LLF+SL  + 
Sbjct: 172 ATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDA 231

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
           T D  SLHGSCPVI GEKW ATKWI
Sbjct: 232 TTDERSLHGSCPVIAGEKWSATKWI 256


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 126/208 (60%), Gaps = 9/208 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +VLSW PRA  + +F +  +C  +I  AK  L K + +    G++ +S    RTSSGTF+
Sbjct: 3   EVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTFL 60

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   I++ IE +IA  T +P   GE   VL+Y   +KY+ HYD F+ A       Q
Sbjct: 61  VRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQ 118

Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVE+GGET+FP    N   +       +C   GL V+PR GD LLF+S+ 
Sbjct: 119 RIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSMK 178

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+  +D TSLHG+CPVI+G KW ATKW+
Sbjct: 179 PDAKLDPTSLHGACPVIQGTKWSATKWL 206


>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 324

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 91/210 (43%), Positives = 125/210 (59%), Gaps = 8/210 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES--TKGTRTSSGTFI 134
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES  +      S +FI
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118

Query: 135 SA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           +   S +   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+        
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSL 249
             R+A+ L+YLS+VE+GGET+FP   G       D + +C   G  VKPR+GD LLF++L
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDALLFFNL 238

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            PN T D  SLHGSCPV++GEKW AT+WI 
Sbjct: 239 HPNATTDSNSLHGSCPVVEGEKWSATRWIH 268


>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 84/207 (40%), Positives = 130/207 (62%), Gaps = 5/207 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW+PR +   NF SA++C  +I  A+ RL K + +    G+ +ES    RTS+G F+
Sbjct: 79  EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESK--VRTSTGMFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           + ++ +   ++ IE +IA  +M+P  +GE   VLRYE  Q Y +H+D F+      +  Q
Sbjct: 137 NGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQ 196

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YL++  EGGET+FP         G + K  IG+ VKP+RGD +LF+S+  +G 
Sbjct: 197 RVATMLMYLTEGVEGGETIFPQAGDKECSCGGEMK--IGVCVKPKRGDAVLFWSIKLDGQ 254

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +D TSLHG C V+ GEKW +TKW+R +
Sbjct: 255 VDPTSLHGGCKVLSGEKWSSTKWMRQR 281


>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
 gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
          Length = 263

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/219 (39%), Positives = 133/219 (60%), Gaps = 5/219 (2%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVES 122
           E G   ++ +  +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S
Sbjct: 46  EAGLLRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKS 105

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D 
Sbjct: 106 D--VRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDY 163

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
           F+      +  QR+A+ L+YL+D  EGGET F          G +  K  GL VKP +GD
Sbjct: 164 FSDTFNLKRGGQRVATMLMYLTDGVEGGETHFLQAGDGECSCGGNVVK--GLCVKPNKGD 221

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            +LF+S+  +G  D  S+H  CPV+KGEKW ATKW+R +
Sbjct: 222 AVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWMRQK 260


>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 263

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 85/207 (41%), Positives = 128/207 (61%), Gaps = 5/207 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YL+D   GGET FP         G +  K  GL VKP +GD +LF+S+  +G 
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQAGDGECSCGGNVVK--GLCVKPNKGDAVLFWSMGLDGN 233

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  S+H  CPV+KGEKW ATKW+R +
Sbjct: 234 TDPNSIHSGCPVLKGEKWSATKWMRQK 260


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 123/208 (59%), Gaps = 7/208 (3%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTF 133
            +VLSW PRA  + NF +  +   ++   K  ++ S++     ET +S     RTSSG F
Sbjct: 1   MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVV--DNETGKSAPSKVRTSSGMF 58

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           ++  ED   ++E IE +IA+ T +P+ +GE   +L Y+  ++Y  H+D F+         
Sbjct: 59  LNRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGG 116

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC--IGLKVKPRRGDGLLFYSLFP 251
           QR+A+ L+YLSDVE+GGET+FP  +         + +C   G   KP++GD L FYSL P
Sbjct: 117 QRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYSLTP 176

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           +G +D  SLH  CPV+KG+KW ATKW+R
Sbjct: 177 DGRMDEKSLHAGCPVMKGDKWSATKWLR 204


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/223 (44%), Positives = 128/223 (57%), Gaps = 13/223 (5%)

Query: 65  TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
            G AS    P +V  +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  
Sbjct: 25  AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQL 84

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG FIS ++D   I+  IE +I+  T LP+ +GE   VLRYE GQKYD HYD
Sbjct: 85  SD--VRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYD 140

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLK 235
            F       Q   RLA+ L+YL++V +GGET+FP         G     D  +C   G+ 
Sbjct: 141 YFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIA 200

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           VKPRRGD LLF+SL  N   D  SLH  CPV++GEKW ATKWI
Sbjct: 201 VKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 100/223 (44%), Positives = 128/223 (57%), Gaps = 13/223 (5%)

Query: 65  TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
            G AS    P +V  +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  
Sbjct: 25  AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQL 84

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG FIS ++D   I+  IE +I+  T LP+ +GE   VLRYE GQKYD HYD
Sbjct: 85  SD--VRTSSGMFISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYD 140

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLK 235
            F       Q   RLA+ L+YL++V +GGET+FP         G     D  +C   G+ 
Sbjct: 141 YFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIA 200

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           VKPRRGD LLF+SL  N   D  SLH  CPV++GEKW ATKWI
Sbjct: 201 VKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243


>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
 gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
          Length = 454

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 92/221 (41%), Positives = 130/221 (58%), Gaps = 19/221 (8%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           S P  + +  P+A  F NF +  +C+ ++  AKK+L PS +   +G     +K  RTS+G
Sbjct: 167 SPPLVLSNHEPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSK-IRTSAG 225

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
            F+   +D T  +  IE +IA A+ LP+ +GE   +LRYE GQKYD H+D F +     P
Sbjct: 226 MFLGRGQDPT--VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSP 283

Query: 191 QMS-QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD---------YKKCI--GLKVKP 238
           +   QR+A+ L+YL D  EGGET+FP  NG+  +  +D         +  C   G+ VK 
Sbjct: 284 RRGGQRMATMLIYLEDTTEGGETIFP--NGVRPED-WDADEPGNHNSWSDCAKKGIPVKS 340

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            RGD +LF+SL  + T+D  SLHG+CPVI GEKW A KWIR
Sbjct: 341 HRGDAVLFWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIR 381


>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
 gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/219 (40%), Positives = 127/219 (57%), Gaps = 21/219 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTF 133
            + +SW PRA  + NF + E+C  ++  AK     +   L++    ++ T GT   SG F
Sbjct: 2   IEQISWEPRAFVYHNFLTPEECAHLVNLAKA----TDGGLKRATVADARTGGTFPGSGAF 57

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM- 192
           +  + D   I+  IE +I+   M+P  HGE   +LRY  G+KYD H+D F+  +   +  
Sbjct: 58  LLRNHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLD----------SGYDYKKCI--GLKVKPRR 240
            QR+A+ L+YLSDVE GGET+FP ++G +++          S  D  KC    L VKPRR
Sbjct: 116 GQRVATVLMYLSDVESGGETVFP-KHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRR 174

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           GD LLF++   NG  D TSLH  CPV++GEKW ATKW+R
Sbjct: 175 GDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWMR 213


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 9/214 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
           +SW PR   +  F S  +C+ +IA AK+ R++ S +   + GE+V S   TRTSSG F+ 
Sbjct: 40  VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE +IA  TM P  +GE+  +LRY  G+KY+ H+D     +   +   R
Sbjct: 98  RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155

Query: 196 LASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 252
           +A+ L+YLS+V+ GGET+FP  E  +       +  C   G  VKP +G  +LF+SL+PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
            T D  SLHGSCPVI+GEKW ATKWI  +   E+
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDEN 249


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 126/209 (60%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +++ AK  LK S +A    GE+  S    RTSSGTFI  
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFIPK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 98  GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRI 155

Query: 197 ASFLLYLSDVEEGGETMFPFEN----GIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ L+YLS+V +GGET+FP        +  ++  D   C   G+ VKPR+GD LLF++L 
Sbjct: 156 ATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLH 215

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           P+   D  SLHG CPVI+GEKW ATKWI 
Sbjct: 216 PDAIPDPLSLHGGCPVIEGEKWSATKWIH 244


>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
          Length = 280

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 90/204 (44%), Positives = 127/204 (62%), Gaps = 12/204 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 140
           P    + NF +  +C  +I  A+ +L+ S +A  + G++V S    RTSSG F++ ++D+
Sbjct: 28  PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSE--IRTSSGMFLNKAQDE 85

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             I+  +E +IA  T LP  +GEA  VL YE+GQKY+ H+D F+          R+A+ L
Sbjct: 86  --IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVL 143

Query: 201 LYLSDVEEGGETMFPFENGIFLDS---GYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 255
           +YLSDV +GGET+FP  N    DS      + +C   G  VKP +GD LLF+SL P+ T 
Sbjct: 144 MYLSDVVKGGETVFP--NAETKDSQPKDDSWSECAKGGYSVKPNKGDALLFFSLRPDATT 201

Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
           D++SLHGSCPVI+GEKW ATKWI 
Sbjct: 202 DQSSLHGSCPVIEGEKWSATKWIH 225


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score =  159 bits (403), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 129/214 (60%), Gaps = 9/214 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
           +SW PR   +  F S  +C+ +IA AK+ R++ S +   + GE+V S   TRTSSG F+ 
Sbjct: 40  VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE +IA  TM P  +GE+  +LRY  G+KY+ H+D     +   +   R
Sbjct: 98  RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155

Query: 196 LASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 252
           +A+ L+YLS+V+ GGET+FP  E  +       +  C   G  VKP +G  +LF+SL+PN
Sbjct: 156 IATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPN 215

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
            T D  SLHGSCPVI+GEKW ATKWI  +   E+
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDEN 249


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 91/210 (43%), Positives = 127/210 (60%), Gaps = 11/210 (5%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFI 134
           ++LS  PRA  + NF S E+C+ +I  AK  +  S +     GE  ES+  +RTSSG F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESS--SRTSSGMFL 170

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I++ IE +IA  T +P  +GE  +V+ Y +GQK + HYD  +          
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYSLFPN 252
           R+A+ L+YLSDVEEGGET+FP     F        KC G  L VKP+ GD LLF+S+ P+
Sbjct: 229 RVATVLMYLSDVEEGGETVFPDAQPNFTS----VSKCSGDGLSVKPKMGDALLFWSMKPD 284

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           GT+D +SLHG  PVI+G KW +TKW+  +E
Sbjct: 285 GTLDTSSLHGGSPVIRGNKWASTKWLHLRE 314



 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 108/195 (55%), Gaps = 24/195 (12%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLAL-RQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           F S E+C+ +I  AK  +  S +     G+  ES+   RTSSG F+   +DK  I++ IE
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESS--ARTSSGRFLERGKDK--IVQNIE 427

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
            +IA  T +P+    A + + +  G               GP    R+A+ L+YLSDVEE
Sbjct: 428 QRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYLSDVEE 472

Query: 209 GGETMFPFEN-GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 267
           GGET+FP     I   S Y  K   GL VKP+ GD LLF S+ P+GT+D +SLHG  PVI
Sbjct: 473 GGETVFPNAKPNINSVSKYPEK---GLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVI 529

Query: 268 KGEKWVATKWIRDQE 282
           +G KW +TKW+   E
Sbjct: 530 RGNKWASTKWLHLTE 544


>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 162

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/161 (57%), Positives = 118/161 (73%), Gaps = 9/161 (5%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE---------SGLP 61
           KLGLP+   LC FFFL GF+GS+ +S+D  + R + R LE    E +           LP
Sbjct: 2   KLGLPSVFFLCFFFFLLGFFGSSLISQDYANARLRARILESSNGEEDITGGSSFGFDLLP 61

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
            G+TGD SI SIPFQVLSWRPR LYFP FA+A+ C++II+ A+ +LKPS+LALR+GET++
Sbjct: 62  SGDTGDDSISSIPFQVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLALRKGETLD 121

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
           ST+  RTSSG FISA EDKTGIL+ I+ KIARATM+P+ +G
Sbjct: 122 STREIRTSSGMFISADEDKTGILDFIDEKIARATMIPRANG 162


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/209 (42%), Positives = 124/209 (59%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F S  +C  +I  AK +L+ S +A    G++V S    RTSSG F+  
Sbjct: 35  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  ++  +E +IA  T+LP  +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 93  AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150

Query: 197 ASFLLYLSDVEEGGETMFPFENGIF------LDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           A+ L+YLS+VE+GGET+FP     +       D  +      G  VK ++GD LLF+SL 
Sbjct: 151 ATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKGDALLFFSLN 210

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + T D  SLHGSCPVI GEKW ATKWI 
Sbjct: 211 LDATTDERSLHGSCPVIAGEKWSATKWIH 239


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/223 (44%), Positives = 127/223 (56%), Gaps = 13/223 (5%)

Query: 65  TGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVE 121
            G AS    P +V  +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  
Sbjct: 25  AGSASSIINPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQL 84

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG  IS ++D   I+  IE +I+  T LP+ +GE   VLRYE GQKYD HYD
Sbjct: 85  SD--VRTSSGMLISKNKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYD 140

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSG----YDYKKCI--GLK 235
            F       Q   RLA+ L+YL++V +GGET+FP         G     D  +C   G+ 
Sbjct: 141 YFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIA 200

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           VKPRRGD LLF+SL  N   D  SLH  CPV++GEKW ATKWI
Sbjct: 201 VKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWI 243


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 94/218 (43%), Positives = 124/218 (56%), Gaps = 10/218 (4%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
           D +I+    + +SWRP A  +  F + E+C  + A A   L  S +     G +V S   
Sbjct: 48  DRAIEFDRIERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSD-- 105

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+   ED   ++  IE +IA  T +P++HGE F VLRYE GQ+Y  H+D F  
Sbjct: 106 IRTSSGMFLLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQD 163

Query: 186 A--EYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIG--LKVKPRR 240
              +   +  QR+A+ L+YL+DVEEGGET+FP  E G     G D   C    L VKPR+
Sbjct: 164 EFNQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSCAAGKLAVKPRK 223

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           GD L F SL  NGT D  S H  CPV+KG K+ ATKW+
Sbjct: 224 GDALFFRSLHHNGTSDAMSSHAGCPVVKGVKFSATKWM 261


>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
 gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
          Length = 299

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 90/209 (43%), Positives = 126/209 (60%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+       RTSSGTFIS 
Sbjct: 41  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ LLYLS+V +GGET+FP    F      ++  D   C   G+ VKP++G+ LLF++L 
Sbjct: 157 ATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 216

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            +   D  SLHG CPVI+GEKW ATKWI 
Sbjct: 217 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 245


>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 313

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/210 (43%), Positives = 125/210 (59%), Gaps = 13/210 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PR   + +F S ++   +++ A+  LK S +A    G++  S    RTS GTFIS 
Sbjct: 55  ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSE--VRTSYGTFISK 112

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KIA  T LP+ +GE   VLRY+ G+K +  +D F       +   R+
Sbjct: 113 GKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRV 170

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSL 249
           A+ LLYL+DV EGGET+FP     F D+G   K     +C   G+ VKPR+GD LLF++L
Sbjct: 171 ATVLLYLTDVAEGGETVFPLAKD-FTDTGLHDKDTTLSECAQKGIAVKPRKGDALLFFNL 229

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            P+   D  SLHG C VIKGEKW ATKWIR
Sbjct: 230 RPDAATDPLSLHGGCTVIKGEKWTATKWIR 259


>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 297

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 90/209 (43%), Positives = 126/209 (60%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+       RTSSGTFIS 
Sbjct: 39  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 97  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 154

Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ LLYLS+V +GGET+FP    F      ++  D   C   G+ VKP++G+ LLF++L 
Sbjct: 155 ATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 214

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            +   D  SLHG CPVI+GEKW ATKWI 
Sbjct: 215 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 243


>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
 gi|255644463|gb|ACU22735.1| unknown [Glycine max]
          Length = 285

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 122/206 (59%), Gaps = 9/206 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I TA    LK   +    GE +E++   RTS+   +
Sbjct: 84  EVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETS--YRTSTEYVV 141

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I+  IE +IA  T +P  HGE  +V+RY +GQ Y+ H D F          Q
Sbjct: 142 ERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQ 199

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLS+VE GGET+FP  N  F    +  +  +C   GL +KP+ GD LLF+S+ 
Sbjct: 200 RIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSECGQTGLSIKPKMGDALLFWSMK 259

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATK 276
           P+ T+D  +LH +CPVIKG KW  TK
Sbjct: 260 PDATLDPLTLHRACPVIKGNKWSCTK 285


>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
          Length = 299

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 91/209 (43%), Positives = 127/209 (60%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+  S    RTSSGTFIS 
Sbjct: 41  VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ LLYLS+V +GGET+FP    F      ++  D   C   G+ VKP++G+ LLF++L 
Sbjct: 157 ATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 216

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            +   D  SLHG CPVI+GEKW ATKWI 
Sbjct: 217 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 245


>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
          Length = 327

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 125/218 (57%), Gaps = 20/218 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            +V++W+PRAL    F S  +C  II  A   L+ S +   +G ++      RTSSG FI
Sbjct: 42  VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D   ++  +E ++A  T LP +H E   VLRYE+GQKY +H+D  +  E   QM  
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157

Query: 195 -------RLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKCI--GLKVKPRRG 241
                  R A+ L+YLSDVEEGGET FP  +G +LD G      Y +C   G+ VKPR+G
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFP--HGRWLDEGVQAAPPYTECASKGVVVKPRKG 215

Query: 242 DGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWI 278
           D +LF+SL  NG   D  SLH  CPV++G K+ ATKW+
Sbjct: 216 DAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWV 253


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 91/218 (41%), Positives = 127/218 (58%), Gaps = 15/218 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PRA    +  S E+C+ I+  AK  +K S +     GE    T   RTS  TF++ 
Sbjct: 83  ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFLAR 140

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQMS-- 193
              K  ++  +E +++R TMLP  +GE   +L Y +G+KY +H+D      + G Q+S  
Sbjct: 141 G--KYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198

Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK---CI--GLKVKPRRGDGLLF 246
             QR+A+ LLYL D EEGGET FP    I  +S Y  +K   C   G+  KP+RGDGLLF
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGLLF 258

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           +S+ P G ID+ S+H  CPV+KG KW ATKWI  +  H
Sbjct: 259 FSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPFH 296


>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 89/209 (42%), Positives = 126/209 (60%), Gaps = 11/209 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+       RTSSGTFIS 
Sbjct: 41  VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ LLYLS+V +GGET+FP    +      ++  D   C   G+ VKP++G+ LLF++L 
Sbjct: 157 ATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQ 216

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            +   D  SLHG CPVI+GEKW ATKWI 
Sbjct: 217 QDAIPDPFSLHGGCPVIEGEKWSATKWIH 245


>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
           Group]
          Length = 343

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 123/205 (60%), Gaps = 11/205 (5%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+         QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFP 251
           +A+ L+YLSDVEEGGET+FP        S +  +  +C   GL VKP+ GD LLF+S+ P
Sbjct: 228 IATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWSMRP 287

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATK 276
           +G++D TSLHG  P++    W+ T 
Sbjct: 288 DGSLDATSLHGEIPIL----WLLTN 308


>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/202 (45%), Positives = 118/202 (58%), Gaps = 11/202 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL--ALRQGETVESTKGTRTSSGTFISASED 139
           PRA  +  F + E+C  I+A +K  L  S +  A   G T   T   RTS+GTFIS + D
Sbjct: 1   PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
            T  +  IE +I   + +P  HGEA  VLRYE GQ+Y +H+D F     G + + R+A+ 
Sbjct: 58  PT--ITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYF--FHKGGKRNNRIATV 113

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDR 257
           LLYLSDVEEGGET+FP  +         Y +C   G  VK R+GD LLF+S+ P G +D 
Sbjct: 114 LLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDP 173

Query: 258 TSLHGSCPVIKGEKWVATKWIR 279
            S H  CPVIKG KW ATKW+ 
Sbjct: 174 GSSHAGCPVIKGVKWTATKWMH 195


>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 226

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 122/221 (55%), Gaps = 14/221 (6%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           +++  + LS  P  L    F S ++C  I  TA+  ++ S++ L   +        RTS 
Sbjct: 3   RAVTLETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQ 62

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
             FI A +D   IL  I+++ A    +P+ H E   VLRY++ +KYDSH D F+PA Y  
Sbjct: 63  SAFIRAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTK 120

Query: 191 Q----------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
                         R+A+   YLSDVE+GGET+FP  NG    S  D K   GLKVKP +
Sbjct: 121 DKRTLALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMKDCK--TGLKVKPEK 178

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           G  ++FYS+ P+G +D  SLHG+CPV KG KW A KW+ ++
Sbjct: 179 GKVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWVWNE 219


>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
 gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
          Length = 369

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 94/232 (40%), Positives = 131/232 (56%), Gaps = 27/232 (11%)

Query: 67  DASIQSIPFQVLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
           +A   S P  VLS + P+A    NF S ++C  ++  AK+ L PS +    G +V S   
Sbjct: 74  EAGFPSTPPLVLSSKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE-- 131

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-- 183
            RTS+G F+  S+D T  +  IE +IAR + +P  +GE   +LRY+ GQKYD H+D F  
Sbjct: 132 IRTSAGMFLRKSQDDT--VREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHD 189

Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLD------------SGYDYK 229
             NPA    +  QR+A+ L+YL D EEGGET FP  NG   +            +   + 
Sbjct: 190 KVNPAPK--RGGQRVATVLIYLVDTEEGGETTFP--NGRLPENFEEDEPDNPFAAHIKHT 245

Query: 230 KCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            C   G+ VK  RGD +LF+S+  +G +D  SLHG+CPVI G+KW A KW+R
Sbjct: 246 DCAKNGIPVKSVRGDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLR 297


>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 541

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 7/199 (3%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  + NF S ++C+ ++A +K +L  S +   Q     S    RTS+GTFIS   D  
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQ-TGGSSLSEVRTSTGTFISRKYDD- 322

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  +E +I   + +PQ+H EAF +LRYE GQ+Y +H+D F         + R+A+ LL
Sbjct: 323 -IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYF--FHKSGMRNNRIATVLL 379

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           YLSDVEEGGET+FP  +     +   Y +C   G  +K R+GD LLF+S+ P G +D  S
Sbjct: 380 YLSDVEEGGETVFPNTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDAGS 439

Query: 260 LHGSCPVIKGEKWVATKWI 278
            H  CPVIKGEKW ATKW+
Sbjct: 440 SHAGCPVIKGEKWTATKWM 458


>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
 gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/207 (43%), Positives = 123/207 (59%), Gaps = 11/207 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
           LS +P+A  +  F    +C  I   AK +L+ S +   + G++V S    RTS G F   
Sbjct: 8   LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            ED   I+E IE +IA  T +P  +GE   VLRYE+GQKY+ H DAF+  ++  + S   
Sbjct: 66  HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSD-KFNTEESKGG 122

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFP 251
           QR+A+ L+YLSDVEEGGET+FP            + +C   G+ VK R+GD LLF+SL  
Sbjct: 123 QRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECAQRGVAVKARKGDALLFWSLDI 182

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +  +D  SLHG CPVIKG KW ATKW+
Sbjct: 183 DSNVDELSLHGGCPVIKGTKWSATKWM 209


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 88/219 (40%), Positives = 122/219 (55%), Gaps = 27/219 (12%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
           +SW PRA +  NF S E+C  I+  A+ R+       R+   ++S  G       RTS  
Sbjct: 1   VSWYPRAFHLHNFMSHEECDRILEIARPRV-------RRSTVIDSVTGQSKVDPIRTSEQ 53

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
           TF++       I+  +E ++A  T LP  HGE   +L+Y +GQKYD+H+D     +  G 
Sbjct: 54  TFLNRG--TWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASGK 111

Query: 191 QMS----QRLASFLLYLSDVEEGGETMFPFENGIFLD-----SGYDYKKCI--GLKVKPR 239
           Q++     R+A+ LLYLSDVEEGGET FP    +  +      G  +  C    + VKPR
Sbjct: 112 QLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPR 171

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +GDGLLF+S+     ID  S+H  CPVI+GEKW ATKWI
Sbjct: 172 KGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWI 210


>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
 gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
          Length = 343

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 93/227 (40%), Positives = 127/227 (55%), Gaps = 20/227 (8%)

Query: 63  GET---GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           GET   G+   Q     VLSW PR   +    + E+C  ++  ++ RL+ S ++      
Sbjct: 52  GETFNAGEHRAQDSRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS------ 105

Query: 120 VESTKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
            ++T G       RTSSG F    E  T +++ IE+++A  TMLP  +GE   VLRYE  
Sbjct: 106 -DATTGAGAVSDIRTSSGMFYERGE--TELVKRIENRLAMWTMLPVENGEGIQVLRYEKT 162

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENG--IFLDSGYDYKKC 231
           QKYD H+D F+          R+A+ L+YL+  EEGGET+FP   G  + L +       
Sbjct: 163 QKYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPCR 222

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
            GL VKP +GD +LF+S+ P+G  D  SLHGSCPVIKG KW ATKWI
Sbjct: 223 QGLAVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATKWI 269


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 95/231 (41%), Positives = 132/231 (57%), Gaps = 21/231 (9%)

Query: 61  PHGETGDA-----SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR 115
           P    GDA     +I +   + LSW P A  +  F +  +C+ I   A   LKPS + + 
Sbjct: 7   PTTTAGDARGAGGTIDAGAIERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VD 65

Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
                +++   RTSSG F+  +ED   ++E IE +IA  T +P++HGE F VLRYE  Q+
Sbjct: 66  ASTGGDASSEIRTSSGMFLGRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQE 123

Query: 176 YDSHYDAFNPA-----EYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYK 229
           Y +HYD F+       E G    QR+ + L+YLSDVEEGGET+FP FE+G    +G +  
Sbjct: 124 YRAHYDYFHDKFNVKREKG---GQRMGTVLMYLSDVEEGGETVFPKFEDGT--PAGSEAS 178

Query: 230 KCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +C    L V+PR+GD L F SL  +G  D  S H  CPVI+G K+ ATKW+
Sbjct: 179 ECARNKLAVRPRKGDALFFRSLRHDGVPDTFSEHAGCPVIRGVKFSATKWM 229


>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 266

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 86/218 (39%), Positives = 129/218 (59%), Gaps = 7/218 (3%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTK 124
            D  +  +  +V+SW PR + F NF S+E+C  +   A+ RL+ S +  +  G+ V+S  
Sbjct: 51  ADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD- 109

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTSSG F+++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+
Sbjct: 110 -VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFS 168

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE-NGIFLDSGYDYKKCIGLKVKPRRGDG 243
                 +  QR+A+ L+YL+D  EGGET FP   +G  +  G   +   GL VKP +GD 
Sbjct: 169 DTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECICGG---RLVRGLCVKPNKGDA 225

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +LF+S+  +G  D  SLH  C V+KGEKW ATKW+R +
Sbjct: 226 VLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWMRQK 263


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 85/189 (44%), Positives = 115/189 (60%), Gaps = 9/189 (4%)

Query: 99  IIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATML 157
           +I  AK  +  S +   Q G++V S    RTSSG F+   +DK  +++ IE +IA    +
Sbjct: 1   MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56

Query: 158 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           P  +GE   VL YE+GQKY+ HYD F          QR+A+ L+YLSDVEEGGET+FP  
Sbjct: 57  PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAA 116

Query: 218 NGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
              F    +  D   C   GL VKP+RGD LLF+S+ P+ T+D +SLHG CPVI+G KW 
Sbjct: 117 KANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWS 176

Query: 274 ATKWIRDQE 282
           +TKW+  +E
Sbjct: 177 STKWMHLEE 185


>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 97/286 (33%), Positives = 150/286 (52%), Gaps = 29/286 (10%)

Query: 12  LGLPTALLLCSFFFLAGF-------YGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHG- 63
           L   TA ++   FF   F       YG+ F         P  + +  ++ +    LP G 
Sbjct: 12  LAFVTAGMIVGAFFQLAFILKLEDSYGTKF---------PSFKRVRKLQSDAYLQLPRGI 62

Query: 64  ----ETGDASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LR 115
                  +A++  I +   +++SW PR +   +F S+E+C  + A AK RL+ S +  ++
Sbjct: 63  SHWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVK 122

Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
            G+ +ES    RTSSG F+S+ E    +++ IE +I+  + +P  +GE   VLRYE  Q 
Sbjct: 123 TGKGIESK--VRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQY 180

Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
           Y  H+D F+      +  QR+A+ L+YLSD  EGGET FP         G   K   GL 
Sbjct: 181 YKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCG--GKVVDGLS 238

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           VKP +G+ +LF+S+  +G  D +S+HG C V+ G KW ATKW+R +
Sbjct: 239 VKPIKGNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSATKWMRQR 284


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 89/222 (40%), Positives = 126/222 (56%), Gaps = 24/222 (10%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
           +SWRPRA  +  F S  +C  +I+ AK+  K  +  +  GE+ ES T   RTSSG F+  
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103

Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
            +D+  ++  IE +IA  TMLP                   +GE+  +LRY  G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCI--GLKV 236
           +D  +  +   +   R+A+ L+YLS+V+ GGET+FP  E  +       +  C   G  V
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAV 221

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           KP +G  +LF+SL PN T+D  SLHGSCPVI+GEKW ATKWI
Sbjct: 222 KPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWI 263


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 126/208 (60%), Gaps = 14/208 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSSGTFI 134
           +VLS  PRA  + NF S E+C+ +I  AK  ++ S +    G T +      RTSSGTF+
Sbjct: 86  EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVV--DGVTGQGILNSVRTSSGTFL 143

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFN---PAEYGP 190
              +DK  I++ +E +IA  T +P  +GE   ++ YE+GQK++ HYD  FN       GP
Sbjct: 144 ERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNGGP 201

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               R+A+ L+YLSDVEEGGET+FP     F +S   Y    GL VKP+ GD LLF+S+ 
Sbjct: 202 ----RVATVLMYLSDVEEGGETVFPNAKPNF-NSVSKYHPGKGLVVKPKMGDALLFWSVK 256

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           P+G++D  SLHG  PVI+G KW + K +
Sbjct: 257 PDGSLDTASLHGGSPVIRGSKWASNKLL 284


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/204 (43%), Positives = 117/204 (57%), Gaps = 18/204 (8%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +S +PR   + +F S ++   +I+ A+  LK S +A       ++  G  T        S
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKST-------LS 99

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R A
Sbjct: 100 EDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTI 255
           + LLYL+DV EGGET+FP              +C   G+ V+PR+GD LLF++L P+GT 
Sbjct: 158 TVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALLFFNLNPDGTT 217

Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
           D  SLHG CPVIKGEKW ATKWIR
Sbjct: 218 DSVSLHGGCPVIKGEKWSATKWIR 241


>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
 gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
          Length = 329

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 127/222 (57%), Gaps = 10/222 (4%)

Query: 63  GETGDASIQSIP---FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE 118
           GET  +    +P     VLSW+PR   +    + E+C  +I  A+ RL+ S ++    GE
Sbjct: 35  GETYRSGRTDVPDSRMVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGE 94

Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
                   RTSSG F +  E+   +++ IE ++A  TMLP  +GE   VLRYE  QKYD 
Sbjct: 95  G--GVSDIRTSSGMFYTRGEND--VVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDP 150

Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC--IGLKV 236
           H+D F+          R+A+ L+YL+  EEGGET+FP        +  ++ +C   GL V
Sbjct: 151 HHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFPKIPVPAGQTRANFSECGMKGLAV 210

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           KP +GD +LF+S+ P+G  +  SLHGSCPVI+G KW ATKWI
Sbjct: 211 KPVKGDAVLFWSIRPDGRFEPGSLHGSCPVIRGVKWSATKWI 252


>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 266

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/267 (38%), Positives = 147/267 (55%), Gaps = 20/267 (7%)

Query: 5   AKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRP---KLRTL-EVVEKENESGL 60
           + R S++  L  A+L+ S F +        LS  VPS      K   L  +V K  + G+
Sbjct: 10  SARKSSRSTLVFAVLIMSTFVILILLAFGILS--VPSNNAGSSKANDLTSIVRKTLQRGV 67

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGET 119
                 D S      +++SW PRA  + NF + E+C+ +I  AK  ++ S +   + G++
Sbjct: 68  E-----DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKS 122

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            +S    RTSSGTF++   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ H
Sbjct: 123 TDSR--VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPH 178

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLK 235
           YD F          QR+A+ L+YLSDVEEGGET+FP   G +    +  +  +C   GL 
Sbjct: 179 YDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLS 238

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           VKP+ GD LLF+S+ P+ T+D +SLHG
Sbjct: 239 VKPKMGDALLFWSMTPDATLDPSSLHG 265


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 122/203 (60%), Gaps = 20/203 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+       RTSSGTFIS 
Sbjct: 41  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGES--QVSDVRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ LLYLS+V +GGET+FP           D + C+    KP++G+ LLF++L  +   D
Sbjct: 157 ATVLLYLSNVTKGGETVFP-----------DAQVCL----KPKKGNALLFFNLQQDAIPD 201

Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
             SLHG CPVI+GEKW ATKWI 
Sbjct: 202 PFSLHGGCPVIEGEKWSATKWIH 224


>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 483

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/222 (38%), Positives = 126/222 (56%), Gaps = 16/222 (7%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           S + +  + LS RP  +    F S E+C  I   A  ++K S ++L+  +  + +   RT
Sbjct: 255 SQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSSEWRT 314

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           S   F+SA +D+  +L  I+H++A  T +P+ H E   VLRY  G+KYDSH+D F+P+ Y
Sbjct: 315 SQSAFLSARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAY 372

Query: 189 GPQMS----------QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-IGLKVK 237
               S           R A+   YL+DV +GGET+FP   G    +   +K C IGLKVK
Sbjct: 373 RSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGA--PAPRSHKDCSIGLKVK 430

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE-KWVATKWI 278
           P++G  ++FYSL  +G +D  SLHG+CPV +   KW A KWI
Sbjct: 431 PQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472


>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 266

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/207 (40%), Positives = 123/207 (59%), Gaps = 5/207 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  +   A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 119 NSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQ 178

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YL+D  EGGET FP         G    +  GL VKP +GD +LF+S+  +G 
Sbjct: 179 RVATMLMYLTDGVEGGETHFPQAGDGECSCGGRIVR--GLCVKPNKGDAVLFWSMGLDGN 236

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  S+H  C V+KGEKW ATKW+R +
Sbjct: 237 TDSNSIHSGCAVLKGEKWSATKWMRQK 263


>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/192 (43%), Positives = 119/192 (61%), Gaps = 9/192 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +++SW PRA  + NF + E+C+ +I  AK  ++ S +   + G++ +S    RTSSGTF+
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD F          Q
Sbjct: 137 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLF 250
           R+A+ L+YLSDVEEGGET+FP   G +    +  +  +C   GL VKP+ GD LLF+S+ 
Sbjct: 195 RIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLFWSMT 254

Query: 251 PNGTIDRTSLHG 262
           P+ T+D +SLHG
Sbjct: 255 PDATLDPSSLHG 266


>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
 gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
          Length = 283

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 84/213 (39%), Positives = 124/213 (58%), Gaps = 5/213 (2%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
           I ++  +V+SW PR +   +F S E+C+ + A A+ RL+ S +  ++ G+ V+S    RT
Sbjct: 72  IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSG F++  E    I++ IE +IA  + +P  +GE   VLRYE  Q Y  H+D F     
Sbjct: 130 SSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFN 189

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
             +  QR+A+ L+YL+D  EGGET FP         G    K  G+ VKP +GD +LF+S
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMK--GISVKPTKGDAVLFWS 247

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +  +G  D  S+HG C V+ GEKW ATKW+R +
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280


>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
          Length = 300

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 116/208 (55%), Gaps = 5/208 (2%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  +VLSW PR   +    + E+C  ++  A  RL  S +        ES    RTS G 
Sbjct: 14  ILLKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGM 73

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F    ED+  ++  +E +++  +++P  HGE   VLRYE G++Y  H+D F         
Sbjct: 74  FFDRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNG 131

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGI---FLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
             RLA+ L+YL++ E GGET+FP         L++GY      GL VKPR+GD +LF+SL
Sbjct: 132 GNRLATILMYLAEPEFGGETVFPNVKAPPEQTLEAGYSECATQGLAVKPRKGDAVLFFSL 191

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKW 277
              GT+D+ SLHGSCP +KG K+ ATKW
Sbjct: 192 RTEGTLDKGSLHGSCPTLKGFKFAATKW 219


>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 274

 Score =  150 bits (378), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 119/214 (55%), Gaps = 16/214 (7%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           Q +   PRA YF NF +  +   ++  A  +LK S +    GE V      RTS G FI 
Sbjct: 2   QQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGV--VDNIRTSYGMFIR 59

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQ 194
             +D   ++  IE +I+  T LP  H E   VLRY  GQ Y +HYD+ + + E GP+   
Sbjct: 60  RLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW-- 115

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDS------GYDYKKCI--GLKVKPRRGDGLLF 246
           RLA+FL+YLSDVEEGGET FP  N ++ D       G  +  C    +  KP+ GD +LF
Sbjct: 116 RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVLF 174

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           YS +PN T+D  ++H  CPVIKG KW A  W+ D
Sbjct: 175 YSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHD 208


>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
          Length = 180

 Score =  149 bits (377), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 81/174 (46%), Positives = 105/174 (60%), Gaps = 12/174 (6%)

Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
            V+ST G       RTSSG F+    DK  ++ +IE +IA  T +P  HGE   VL YE+
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEV 63

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKK 230
           GQKY+ H+D F          QR+A+ L+YLSDVEEGGET+FP  N     L    +  +
Sbjct: 64  GQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSE 123

Query: 231 CI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           C   GL VKP+ GD LLF+S+ P+ T+D  SLHG CPVI+G KW +TKW+   E
Sbjct: 124 CAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177


>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 84/213 (39%), Positives = 124/213 (58%), Gaps = 5/213 (2%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
           I ++  +V+SW PR +   +F S E+C+ + A A+ RL+ S +  ++ G+ V+S    RT
Sbjct: 72  IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSG F++  E    I++ IE +IA  + +P  +GE   VLRYE  Q Y  H+D F     
Sbjct: 130 SSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFN 189

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
             +  QR+A+ L+YL+D  EGGET FP         G    K  G+ VKP +GD +LF+S
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGDGDCTCGGKIMK--GISVKPTKGDAVLFWS 247

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +  +G  D  S+HG C V+ GEKW ATKW+R +
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMRQK 280


>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
          Length = 180

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 82/174 (47%), Positives = 103/174 (59%), Gaps = 12/174 (6%)

Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
            V+ST G       RTSSG F+    DK  ++  IE +IA  T +P  HGE   VL YE+
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEV 63

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKK 230
           GQKY+ H+D F          QR+A+ L+YLSDVEEGGET+FP    N   L    +   
Sbjct: 64  GQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSD 123

Query: 231 CI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           C   GL VKP+ GD LLF+S+ P+ T+D  SLHG CPVIKG KW +TKW+   E
Sbjct: 124 CAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHE 177


>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 84/205 (40%), Positives = 118/205 (57%), Gaps = 5/205 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +  E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YLSD  EGGET FP         G    K  GL VKP +G+ +LF+S+  +G 
Sbjct: 200 RIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVK--GLSVKPIKGNAVLFWSMGLDGQ 257

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  S+HG C VI GEKW ATKW+R
Sbjct: 258 SDPNSVHGGCEVISGEKWSATKWMR 282


>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 281

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 5/205 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VLSW PR +   NF S E+C  +   A  RLK S +     G+ ++S    RTSSG F+
Sbjct: 76  EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S  E K  ++  IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YL D  EGGET FP         G    K  GL VKP +G+ +LF+S+  +G 
Sbjct: 194 RIATMLMYLGDNVEGGETHFPSAGSDECSCGGKLTK--GLCVKPVKGNAVLFWSMGLDGQ 251

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  S+HG CPV+ GEKW ATKW+R
Sbjct: 252 SDPDSVHGGCPVLAGEKWSATKWMR 276


>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  + NF S ++C  +I  AK R++ S ++ +           RTSSG F++  +++ 
Sbjct: 83  PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            ++  IE +IA  T +P  +GE  ++L YE+GQK++ H+D  +P  +    + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200

Query: 201 LYLSDVEEGGETMFPFENGI------FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           +YLS V+EGG T+FP           +     +Y K  GL VKP+ GD LLF+S+ P+GT
Sbjct: 201 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 260

Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
           +D TSLH S PV+KG+KWV  K
Sbjct: 261 LDPTSLHASSPVVKGDKWVGVK 282



 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 49/73 (67%), Gaps = 6/73 (8%)

Query: 205 DVEEGGETMFPFENGIFLDSGYDYKKCI-----GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++EEGGET+FP  N       + +KK       GL +KP+ GD L F+S+ P+GT+D TS
Sbjct: 11  NIEEGGETVFPAANQCVSSVPW-WKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTS 69

Query: 260 LHGSCPVIKGEKW 272
           LHGS PVI+G++W
Sbjct: 70  LHGSYPVIRGDEW 82


>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 323

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 78/202 (38%), Positives = 120/202 (59%), Gaps = 9/202 (4%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  + NF S ++C  +I  AK R++ S ++ +           RTSSG F++  +++ 
Sbjct: 74  PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 132

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            ++  IE +IA  T +P  +GE  ++L YE+GQK++ H+D  +P  +    + QR A+ +
Sbjct: 133 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 191

Query: 201 LYLSDVEEGGETMFPFENGI------FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           +YLS V+EGG T+FP           +     +Y K  GL VKP+ GD LLF+S+ P+GT
Sbjct: 192 MYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGT 251

Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
           +D TSLH S PV+KG+KWV  K
Sbjct: 252 LDPTSLHASSPVVKGDKWVGVK 273



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 39/63 (61%), Gaps = 6/63 (9%)

Query: 205 DVEEGGETMFPFENGIFLDSGYDYKKC-----IGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++EEGGET+FP  N       + +KK       GL +KP+ GD L F+S+ P+GT+D TS
Sbjct: 11  NIEEGGETVFPAANKCVSSVPW-WKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTS 69

Query: 260 LHG 262
           LH 
Sbjct: 70  LHA 72


>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
          Length = 287

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 85/222 (38%), Positives = 126/222 (56%), Gaps = 8/222 (3%)

Query: 67  DASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
           DA I  I +   ++L+W PR +   +F S+E+C  + A A+  L+ S +   Q G+ ++S
Sbjct: 70  DADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQS 129

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+S  +    I+  IE +I+  + +P  +GE   VLRY+  Q Y  H+D 
Sbjct: 130 D--VRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDY 187

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
           F+ +    +  QR+A+ L+YLSD  EGGET FP     F   G   K   GL V P +G+
Sbjct: 188 FSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCG--GKSVRGLSVAPVKGN 245

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +LF+S+  +G  D  S+HG C V+ GEKW ATKW+R +  H
Sbjct: 246 AVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRSTH 287


>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
 gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
          Length = 180

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 80/176 (45%), Positives = 105/176 (59%), Gaps = 16/176 (9%)

Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
            V+ST G       RTSSG F+    DK  ++ +IE +I   T +P  HGE   VL YE+
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLHYEV 63

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDY---- 228
           GQKY+ H+D F          QR+A+ L++LSDVEEGGET+FP  N    DS   +    
Sbjct: 64  GQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDAN--VNDSSLPWYNEL 121

Query: 229 KKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
            +C   GL VKP+ GD LLF+S+ P+ T+D  SLHG CPVI+G KW +TKW+   E
Sbjct: 122 SECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHE 177


>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
          Length = 387

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 85/201 (42%), Positives = 113/201 (56%), Gaps = 19/201 (9%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C  +I  AK  +  S +       V+ST G       RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP    N   L    +  +C   GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270

Query: 246 FYSLFPNGTIDRTSLHGSCPV 266
           F+S+ P+ T+D  SLH +  V
Sbjct: 271 FWSMKPDATLDPLSLHDTLRV 291


>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
 gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
          Length = 275

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 83/211 (39%), Positives = 118/211 (55%), Gaps = 21/211 (9%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTSS 130
           + LSW+PR   +  F S ++C  ++  AKK           G  V   +      TRTSS
Sbjct: 49  KALSWQPRIFVYKGFLSDDECDHLVTLAKK-----------GTMVAHNRSSYYRQTRTSS 97

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           G F+   +D   ++  IE +IA  T+LP+ + E   + RY+ GQKYD H+D F+   +  
Sbjct: 98  GMFLRKRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHT 155

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFY 247
           +   R A+ L+YLS V++GGET+FP   G       D + +C   GL VKP +GD +LF+
Sbjct: 156 RGGPRYATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECAHKGLAVKPVKGDAVLFF 215

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           SL  +G  D  +LHGSCPVI+GEKW A  WI
Sbjct: 216 SLHVDGGPDPLTLHGSCPVIQGEKWSAPNWI 246


>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
          Length = 376

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 84/196 (42%), Positives = 111/196 (56%), Gaps = 19/196 (9%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C  +I  AK  +  S +       V+ST G       RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCI--GLKVKPRRGDGLL 245
               QR+A+ L+YLSDVEEGGET+FP    N   L    +  +C   GL VKP+ GD LL
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALL 270

Query: 246 FYSLFPNGTIDRTSLH 261
           F+S+ P+ T+D  SLH
Sbjct: 271 FWSMKPDATLDPLSLH 286


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 115/206 (55%), Gaps = 5/206 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           QVL    R   + NF + E+C  II  A+  +  S + +            RTS GTF++
Sbjct: 64  QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGV-VETDSGKSKIDNVRTSKGTFLN 122

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              D   ++  IE +IA+ T++P  +GE   VL+YE GQ+Y+ HYD F           R
Sbjct: 123 RGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGGNR 180

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG--LKVKPRRGDGLLFYSLFPNG 253
             + L+YL+DVEEGGET FP       D+G ++ +C    L  KP++G+ +LF+S+ P G
Sbjct: 181 YLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKPTG 240

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIR 279
            ++R SLH +CPVIKG KW A KW+ 
Sbjct: 241 ELERRSLHTACPVIKGVKWSAPKWVH 266


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 87/204 (42%), Positives = 115/204 (56%), Gaps = 26/204 (12%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  F NF + E+   I+A AK  +K S +    G +VE     RTS GTF+   
Sbjct: 35  VSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFLKRL 92

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  +E ++A  T L  +H E   +LRY IGQKY +HYD+ +        S R+ 
Sbjct: 93  QDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSPRVC 144

Query: 198 SFLLYLSDV--EEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + LLYLSDV  + GGET FP   G+   + Y           P++GD LLFYSL P+GT 
Sbjct: 145 TVLLYLSDVPADGGGETAFP---GVRRQALY-----------PKKGDALLFYSLKPDGTS 190

Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
           D  SLH  CP+I G KW ATKWI 
Sbjct: 191 DAYSLHTGCPIISGVKWTATKWIH 214


>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
          Length = 488

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 124/230 (53%), Gaps = 18/230 (7%)

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           P   TG++  +++  + LS +P  L    F + E+C  I+  A   +K S ++L+  +  
Sbjct: 255 PSTITGNS--KNVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKG 312

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
                 RTS  TF++A  D   IL  IE + A  T +P TH E   VLRY + +KYD+H+
Sbjct: 313 RPASDWRTSQSTFVAAMGDP--ILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHH 370

Query: 181 DAFNPAEY--GPQMSQ--------RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
           D F+P+ Y   P   Q        R A+   YL+DV  GGET FP   G       D+  
Sbjct: 371 DFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFPRHGGA--PPPRDFSM 428

Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE--KWVATKWI 278
           C GLKVKP++G  ++FYSL  +G +D  SLHG+CPV+  E  KW A KW+
Sbjct: 429 CTGLKVKPQKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 114/205 (55%), Gaps = 8/205 (3%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL    R   +  F + E+C  I   A+KRL+ S + +  G         RTS G F   
Sbjct: 38  VLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFER 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+E +E ++A  TM P   GE+  VLRY   QKYDSH+D F   +       R 
Sbjct: 97  GED--AIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154

Query: 197 ASFLLYLSDVEEGGETMF---PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           A+ LLYL++ EEGGET+F   P  NGI  + G+       L VKP +GD LLF+S+ P G
Sbjct: 155 ATVLLYLTETEEGGETVFPKIPAPNGI--NVGFSECAKYNLAVKPHKGDALLFHSMKPTG 212

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWI 278
            ++  S+HG+CPVI+GEK+  TKWI
Sbjct: 213 ELEERSMHGACPVIRGEKFSMTKWI 237


>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
          Length = 328

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 80/166 (48%), Positives = 105/166 (63%), Gaps = 7/166 (4%)

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE+ +S    RTSSG F++  +D   I+  +E K+A  T LP+ +GEA  +L YE GQKY
Sbjct: 9   GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64

Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--G 233
           D H+D F   +       R+A+ L+YLS+V +GGET+FP   G       D + KC   G
Sbjct: 65  DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQG 124

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             VKPR+GD LLF++L  NGT D  SLHGSCPVI+GEKW AT+WI 
Sbjct: 125 YAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIH 170


>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
 gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
          Length = 241

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 74/157 (47%), Positives = 98/157 (62%), Gaps = 5/157 (3%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+   +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+ 
Sbjct: 15  VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGD 242
                    R+A+ L+YLSDV +GGET+FP   G  L    D +  C   G  VKP +GD
Sbjct: 73  KNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGD 132

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            LLF+SL P+ T D  SLHGSCPVI+G+KW ATKWI 
Sbjct: 133 ALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIH 169


>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 196

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 79/194 (40%), Positives = 116/194 (59%), Gaps = 21/194 (10%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
             + E+C+ +I  AK  +  S +    G++V+++   RTSSGTFI+   DK  IL  IE 
Sbjct: 12  ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNS--ARTSSGTFINRGHDK--ILRNIEQ 67

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
           +IA  T +P  +GE+ N+L YE+GQKY+ H D F       +++ +           E+G
Sbjct: 68  RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFTD-----EINTKNGG--------EQG 114

Query: 210 GETMFPFENGIFLDSGY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 265
           GET+FPF  G F    +  +   C   GL +KP+ GD LLF+S+ P+GT+D  S+HG+CP
Sbjct: 115 GETVFPFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACP 174

Query: 266 VIKGEKWVATKWIR 279
           VIKG+KW  TKW+R
Sbjct: 175 VIKGDKWSCTKWMR 188


>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 254

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 81/216 (37%), Positives = 116/216 (53%), Gaps = 25/216 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
           LSW PRA    +  +  QC++++   + R+       R+   V+S  G       RTS  
Sbjct: 6   LSWYPRAFALRDALTEAQCEAVLRATRARV-------RRSTVVDSVTGESKVDPIRTSKQ 58

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPA 186
           TF++  E+   ++  I   ++  TMLP TH E   VL Y +G+KYD+H D     + +  
Sbjct: 59  TFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGR 115

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIF--LDSGYDYKKCIGLKV--KPRRGD 242
           E      +R+A+ LLYL + E GGET FP    I   +  G  + KC   +V  KPRRGD
Sbjct: 116 ELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPRRGD 175

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           GL+F+S+ PNG ID  +LH  CPV+ G KW AT W+
Sbjct: 176 GLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWV 211


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 87/200 (43%), Positives = 112/200 (56%), Gaps = 10/200 (5%)

Query: 83  RALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKT 141
           R   +  F + E+C  I   A+KRL+ S +     G +V S    RTS G F    ED  
Sbjct: 44  RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSD--IRTSDGMFFERGED-- 99

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ILE +E ++A  TM P   GEA  VLRY   QKYDSH + F   E       R A+ L 
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159

Query: 202 YLSDVEEGGETMF---PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           YL+D EEGGET+F   P   G+  + G+       L VKPR+GD +LF+S+  NG ++  
Sbjct: 160 YLTDTEEGGETVFPKIPAPGGV--NVGFSECAKYNLAVKPRKGDAILFHSMKTNGQLEER 217

Query: 259 SLHGSCPVIKGEKWVATKWI 278
           SLHG+CPVIKGEK+  TKWI
Sbjct: 218 SLHGACPVIKGEKFSMTKWI 237


>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
 gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
          Length = 797

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 81/216 (37%), Positives = 115/216 (53%), Gaps = 12/216 (5%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PRA  + NF ++ +C  ++    +R+  S L +            RTS G   
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRS-LVVDSQTGQSKLDDIRTSYGAAF 551

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM- 192
              ED   ++  IE +IA  T LP  HGE   +LRY  GQKYD+H+D F +P  +   + 
Sbjct: 552 GRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609

Query: 193 -SQRLASFLLYLSDVEEGGETMFPFENGI-----FLDSGYDYKKCIGLKVKPRRGDGLLF 246
              R A+ LLYLS+VE GGET  P  + I      +++       +GL ++PR+GD LLF
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRKGDALLF 669

Query: 247 YSLFPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           Y +   G   DR +LH SCP +KG KW ATKWI  +
Sbjct: 670 YDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSK 705


>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
 gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
          Length = 364

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 86/207 (41%), Positives = 116/207 (56%), Gaps = 15/207 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  F NF +  +   ++  A  +LK S +   +GE V      RTS G FI    D  
Sbjct: 55  PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGV--VDNIRTSFGMFIRRLSDP- 111

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQRLASFL 200
            I+  IE +I+  T LP  H E   VLRY  GQ Y +HYD+   +++ GP+   RLA+FL
Sbjct: 112 -IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPKW--RLATFL 168

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCIG--LKVKPRRGDGLLFYSLFPNG 253
           +YLSDVEEGGET FP +N ++ D     +     +C    +  KP+ GD +LFYS  PN 
Sbjct: 169 MYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFLPNN 227

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRD 280
           T+D  ++H  CPVIKG KW A  W+ D
Sbjct: 228 TMDPAAMHTGCPVIKGIKWAAPVWMHD 254


>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 336

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 116/214 (54%), Gaps = 16/214 (7%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           Q +   PRA YF NF +  +   ++  A  +LK S +   +          RTS G FI 
Sbjct: 20  QQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGK--GEGVVDDIRTSYGMFIR 77

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQ 194
              D   ++  IE +I+  T LP  H E   +LRY  GQ Y +HYD+   +++ GP+   
Sbjct: 78  RLSDP--VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPKW-- 133

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDS------GYDYKKCI--GLKVKPRRGDGLLF 246
           RLA+FL+YLSDVEEGGET FP  N ++ D       G  +  C    +  KP+ GD +LF
Sbjct: 134 RLATFLMYLSDVEEGGETAFP-HNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLF 192

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           YS +PN T+D  S+H  CPVIKG KW A  W+ D
Sbjct: 193 YSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMHD 226


>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
 gi|194706408|gb|ACF87288.1| unknown [Zea mays]
 gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
 gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
          Length = 217

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 72/150 (48%), Positives = 95/150 (63%), Gaps = 3/150 (2%)

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
            +S  + K  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+        
Sbjct: 10  MLSPPQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLG 69

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSL 249
            QR+A+ L+YL+DV +GGET+FP   G  L   D  +      GL VKP++GD LLF++L
Sbjct: 70  GQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNL 129

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             N T D  SLHGSCPVI+GEKW ATKWI 
Sbjct: 130 HVNATADTGSLHGSCPVIEGEKWSATKWIH 159


>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/207 (39%), Positives = 118/207 (57%), Gaps = 5/207 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR +   NF S ++C  +   A  RL+ S +   + G+ V+S    RTSSG F+
Sbjct: 83  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S  E    +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ L+YLS+  EGGET FP         G   K   GL VKP +GD +LF+S+  +G 
Sbjct: 201 RIATMLMYLSENIEGGETYFPKAGSGECSCG--GKTVPGLSVKPAKGDAVLFWSMGLDGQ 258

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  S+HG C V+ GEKW ATKW+R +
Sbjct: 259 SDPKSIHGGCEVLSGEKWSATKWMRQK 285


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 87/213 (40%), Positives = 117/213 (54%), Gaps = 21/213 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTS 129
            QVL    R   F NF + E+C  I+A AK  L+      R G    +T G+     RTS
Sbjct: 34  MQVLDAEAR--IFINFLTEEECDHIVALAKPHLE------RSGVVDTATGGSEISDIRTS 85

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY-DAFNPAEY 188
            G F+    D T  +  IE +IAR T+LP  +GE   VL Y  G+KYD ++ D  N    
Sbjct: 86  KGMFLERGHDDT--VAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESN 143

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLF 246
           G     R A+ L+YL+ VEEGGET+FP       D+G  + +C    L  KP +G  +LF
Sbjct: 144 G---GNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLF 200

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           +S+ P+G ++R SLH +CPV+KGEKW A KWI 
Sbjct: 201 HSIKPSGDLERRSLHTACPVVKGEKWSAPKWIH 233


>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
 gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
          Length = 201

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/201 (37%), Positives = 116/201 (57%), Gaps = 6/201 (2%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           L F    S ++C  +I  A  RL+ S +   +    + ++  RTS G F+    D   I+
Sbjct: 1   LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSR-NRTSWGAFLRRDHDN--IV 57

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
             IE +I+  T +P+ +GE+  V+RY+ GQK++ H D +   E       R+ + LLYL+
Sbjct: 58  SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117

Query: 205 DVEEGGETMFPFE-NGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           +VE GGET+FP     +  D   +  +C   G+ ++PRRGDGLLF+   P+G ID  S H
Sbjct: 118 NVENGGETVFPRALANVINDYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFH 177

Query: 262 GSCPVIKGEKWVATKWIRDQE 282
           G CPV+KGEKW+ATK++ + E
Sbjct: 178 GGCPVVKGEKWLATKFLHEHE 198


>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 87/214 (40%), Positives = 120/214 (56%), Gaps = 14/214 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY----DSHYDAFNPAEYGP 190
           ++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y    D  +D FN    G 
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGG- 198

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              Q +A+ L+YLSD  EGGET FP         G    K  GL VKP +G+ +LF+S+ 
Sbjct: 199 ---QGIATMLMYLSDNIEGGETYFPLAGSGECSCGGKLVK--GLSVKPIKGNAVLFWSMG 253

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +G  D  S+HG C VI GEKW ATKW+R Q  H
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWLR-QTTH 286


>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Ectocarpus siliculosus]
          Length = 404

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 80/217 (36%), Positives = 117/217 (53%), Gaps = 8/217 (3%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
           G  ++ SI  + LS  P      NF   E+C+ I   A   +KPS ++L   +  +    
Sbjct: 184 GLETLGSIDMKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTN 243

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTS+  F+ ++ D   +L+ I+ ++   T +P++H E   VL+Y+ GQ+Y +H+D  + 
Sbjct: 244 WRTSTTYFMPSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDE 301

Query: 186 A---EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRG 241
                       R+ +   YLSDVEEGGET+FP   G       D+  C  GLKVKP  G
Sbjct: 302 RTMRNMDGGRKNRMITVFWYLSDVEEGGETIFPRYGG--RTGRVDFSDCTTGLKVKPVEG 359

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
              +FYSL P+G  D  SLHG+CPVI G+KW A KW+
Sbjct: 360 KVAMFYSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396


>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
           C-169]
          Length = 285

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/210 (38%), Positives = 115/210 (54%), Gaps = 7/210 (3%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSS 130
           S+  + +SW PRA  +    S ++C  II  A+  + K + L  +  + V +    R + 
Sbjct: 50  SLMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNK--LRNNK 107

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
             +I  S D   +++ IE +IAR T LP  HGE F++++Y  GQ Y  H D  +   +  
Sbjct: 108 EAYIDGSADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPR 165

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYS 248
             ++R+A+ ++YLSDV EGGET+FP            Y KC   G+ VKP +GD LL Y+
Sbjct: 166 LGNERIATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQGIAVKPVKGDALLLYN 225

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           L  NG  D  SLH  CPVI+G KW ATK I
Sbjct: 226 LLENGRNDGESLHQGCPVIRGVKWTATKRI 255


>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/218 (37%), Positives = 122/218 (55%), Gaps = 20/218 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +S  PRA  + NF + E+ ++ IA A++ ++ S++ + + +    T   RTSSG ++
Sbjct: 78  IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S  + +  ++  IE ++A  TMLP+  GE   V+RYE GQ+Y +H D F+         Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG---------------LKVKPR 239
           R A+ L+YLSDVEEGGET+FP   G  L      K  +                L VKPR
Sbjct: 195 RAATVLMYLSDVEEGGETVFP--RGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPR 252

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
           RGD LLF+++  NG +D  + H  CPV++G KW AT+W
Sbjct: 253 RGDALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRW 290


>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
 gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 253

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 86/228 (37%), Positives = 124/228 (54%), Gaps = 18/228 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           ++GD     I  + +SW PRA  +  F S  +C  +I  A  +L+ S +   + + V+  
Sbjct: 29  DSGDVGAPWI--ETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDPI 86

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    I  +E  T ++  IE +IAR T LP++H E   VLRY  GQKYD+H+D F
Sbjct: 87  ---RTSYSASIGYNE--TDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWF 141

Query: 184 N-PAEYGPQMSQRLASFLLYLSDVE--EGGETMFPFENGIFLD----SGYDYKKC---IG 233
           +     G     R+A+ L+YLSD+E   GGET  P    +  +     G  Y +C   +G
Sbjct: 142 DETETGGTGGGNRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGRGYSECASKMG 201

Query: 234 LKVKPRRGDGLLFYSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIRD 280
           + V+P++GD LLF+ + P G   DR +LH SCP   G KW ATKWI +
Sbjct: 202 ISVRPKKGDVLLFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHN 249


>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 259

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 122/223 (54%), Gaps = 32/223 (14%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
           +SW PRA +  N  +  +C  ++  A+ R+       R+   V+ST G       RTS  
Sbjct: 4   ISWHPRAFHLHNIMTDAECDEVLELARTRV-------RRSTVVDSTTGESKVDPIRTSEQ 56

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFN-----VLRYEIGQKYDSHYDAFN-P 185
            F++       I+ +IE ++ R TMLP  +GE        VL+Y  GQKYD+H+D     
Sbjct: 57  CFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELD 114

Query: 186 AEYGPQMS----QRLASFLLYLSDVEE--GGETMFPFENGI--FLDSGYDYKKCI--GLK 235
              G Q++     R+A+ LLYLSDV++  GGET FP    I    D G  + +C    + 
Sbjct: 115 TASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVA 174

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           VKP++GDGLLF+S+ P G ID+ S+H  CPV+ G+ W ATKWI
Sbjct: 175 VKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWI 216


>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
 gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 251

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 116/219 (52%), Gaps = 18/219 (8%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PR   + NF S  +C+ I  TA   +K S +    G +V  T   RTS GTFI
Sbjct: 2   IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D   ++E +  ++A  T  P  + E   VLRY  GQKY +H D+          S 
Sbjct: 60  RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLD-----SGYDYKKCIGLKV--KPRRGDGLLFY 247
           R+A+ LLYL D E GGET FP ++G +LD     S   + +C    V  +P++GD L+F+
Sbjct: 112 RMATVLLYLHDTEYGGETAFP-DSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFW 170

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
           S+ P+GT D  SLH  CPV+ G KW AT W+     + D
Sbjct: 171 SIKPDGTHDPLSLHTGCPVVTGVKWTATSWVHSMPYNYD 209


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 88/248 (35%), Positives = 126/248 (50%), Gaps = 29/248 (11%)

Query: 43  RPKLRTLEVVEKENESGLPHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSI 99
           RPK      V     +G P  E+ +A   S + IP       PR + F +F S E+C  +
Sbjct: 56  RPKPEPAATVPSPAPAG-PEPESSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDQL 114

Query: 100 IATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 158
           IA  + RLK S +   + GE  E+    RTS G      E    ++  IE +IA+AT +P
Sbjct: 115 IALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP--LVARIEARIAQATGVP 170

Query: 159 QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETM 213
             HGE F VL Y+ G +Y  H+D FNP   G         QR+A+ ++YL+ V  GG T 
Sbjct: 171 VEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATG 230

Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
           FP                +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+
Sbjct: 231 FP---------------KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWI 275

Query: 274 ATKWIRDQ 281
           ATKW+R++
Sbjct: 276 ATKWLRER 283


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 25/198 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C+++I  +K ++K S++ + +      T   RTSSG F+  SE   
Sbjct: 42  PLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGAFLEESE--- 93

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I   IE +IA    +P  HGE   +L+Y +GQ+Y +HYD F         + R+++ ++
Sbjct: 94  -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+ VEEGGET FP                + L V P++G  + F   + + +I++ +LH
Sbjct: 152 YLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQDESINKLTLH 196

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PVIKGEKWVAT+W+R
Sbjct: 197 GGAPVIKGEKWVATQWMR 214


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score =  136 bits (343), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 80/219 (36%), Positives = 116/219 (52%), Gaps = 25/219 (11%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
            G         QR+A+ ++YL+ V+ GG T FP                +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 244

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score =  136 bits (343), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 80/219 (36%), Positives = 116/219 (52%), Gaps = 25/219 (11%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
            G         QR+A+ ++YL+ V+ GG T FP                +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 244

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score =  136 bits (343), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 80/219 (36%), Positives = 116/219 (52%), Gaps = 25/219 (11%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
            G         QR+A+ ++YL+ V+ GG T FP                +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 244

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 109/198 (55%), Gaps = 25/198 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C+++I  +K ++K S++ + +      T   RTSSG F+  SE   
Sbjct: 42  PLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGAFLEESE--- 93

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I   IE +IA    +P  HGE   +L+Y +GQ+Y +HYD F         + R+++ ++
Sbjct: 94  -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+ VEEGGET FP                + L V P++G  + F   + + +I++ +LH
Sbjct: 152 YLNHVEEGGETFFP---------------KLNLSVSPKKGMAVYFEYFYQDESINKLTLH 196

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PVIKGEKWVAT+W+R
Sbjct: 197 GGAPVIKGEKWVATQWMR 214


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 73  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188

Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP                
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 233

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 234 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 76  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 135

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 136 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 191

Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP                
Sbjct: 192 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 236

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 237 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 286


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 64  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 123

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 124 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 179

Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP                
Sbjct: 180 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 224

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 225 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 274


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 119/230 (51%), Gaps = 28/230 (12%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 73  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188

Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP                
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 233

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 234 LGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 283


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 77/206 (37%), Positives = 117/206 (56%), Gaps = 11/206 (5%)

Query: 83  RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
           R     +F + E+   I+  +++RL+ S +    G + ES    RTS G F+   ED   
Sbjct: 1   RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           +++ +E +I+  T++P  +GE   VLRY+  QKYD+H+D F   +       R A+ L+Y
Sbjct: 57  VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116

Query: 203 LSDVEEGGETMFPFENGIFLDSGYD--YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRT 258
           L D EEGGET+FP    I    G +  + +C    L  KP++G  +LF+S+ P G ++R 
Sbjct: 117 LVDTEEGGETVFP---NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQH 284
           SLH +CPVIKG KW A KWI  + Q+
Sbjct: 174 SLHTACPVIKGIKWSAAKWIHVKPQN 199


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 114/209 (54%), Gaps = 8/209 (3%)

Query: 75  FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
            QV+S   PRA     F S  +C  ++  A+  +  S + +       S    RTS+G+F
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGV-VDASNGGSSFSNIRTSTGSF 216

Query: 134 ISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
           +          ++  IE +IA  T +P  HGE   VLRY+IGQ+Y SH+D F     G  
Sbjct: 217 VPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYF--FHEGGM 274

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSL 249
            + R+A+ L+YLSDV++GGET+FP    + +     +  C   G+ V P++GD +LF+++
Sbjct: 275 KNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAILFWNM 334

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
              G +D  S H  CPV+ GEKW ATKW+
Sbjct: 335 KVGGDLDGGSTHAGCPVVLGEKWTATKWL 363


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/207 (35%), Positives = 113/207 (54%), Gaps = 23/207 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P  + F NF +  +C +++  ++  L PS++   Q    E  K +RTS GT  +  E  
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHFARGE-- 158

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
           T ++  IE +IA    +P+ HGE   +L Y +  +Y  HYD F+P + G Q       QR
Sbjct: 159 TPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQR 218

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + + ++YLSDVE GG T+FP                +GL+V+P++G  L F  +  +G +
Sbjct: 219 VGTLIMYLSDVESGGATVFP---------------RVGLEVQPQKGAALFFSYVGEHGKL 263

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
           D  SLHG  PV+ GEKW+ATKW+R  E
Sbjct: 264 DLQSLHGGSPVLAGEKWIATKWLRAAE 290


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 80/219 (36%), Positives = 115/219 (52%), Gaps = 25/219 (11%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 83  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISAR 140

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y  H+D FNP  
Sbjct: 141 TSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGR 198

Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
            G         QR+A+ ++YL+ V+ GG T FP                +GL+V P +G+
Sbjct: 199 SGEARQLDVGGQRVATLVIYLNSVQAGGATGFP---------------KLGLEVAPVKGN 243

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 244 AVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRER 282


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 114/222 (51%), Gaps = 24/222 (10%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 133
            + +S  PRA  F  F +  +C  +I  A   ++ S++     GE        R+S G +
Sbjct: 68  IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEA--RPDDARSSIGGW 125

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           +S  +D+  ++  IE + +   MLP   GE   VLRYE GQKYD+H D F+         
Sbjct: 126 VSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGG 183

Query: 194 QRLASFLLYLSDVEEGGETMFPF----------------ENGIFLDSGYDYKKCIGLKVK 237
           QR+A+ L+YLSDVEEGGET+FP                 +N   L S  D +    L VK
Sbjct: 184 QRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRV---LAVK 240

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           PRRGD LLF++   +G +D  + H  CPV +G KW  T+W R
Sbjct: 241 PRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHR 282


>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
 gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
          Length = 307

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 125/222 (56%), Gaps = 36/222 (16%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGT------- 126
            +V+SW PRA  + NF + E+C+ +I+ AK  +  S++  ++ G++++S   T       
Sbjct: 80  LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139

Query: 127 ----------------------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
                                 RTSSGTF++   D+  I+E IE++I+  T +P  +GE 
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197

Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDS 224
             VL YE+GQ+Y+ H+D F       +  QR+A+ L+YLSDV+EGGET+FP   G   D 
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDV 257

Query: 225 GY--DYKKC--IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
            +  +  +C   GL V P++ D LLF+S+ P+ ++D +SLHG
Sbjct: 258 PWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHG 299


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/205 (35%), Positives = 111/205 (54%), Gaps = 24/205 (11%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P  + F    S ++C+ +I TA  RLK S+L  +           RTS G F   
Sbjct: 25  VLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNK------VVSDIRTSRGMFFE- 77

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            E+++  +  IE +IA+   +P  H E   VL Y  GQ+Y +H+D F P     + + R+
Sbjct: 78  -EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAAR-NNRI 135

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           ++ ++YL+DVEEGGET+FP                +G+ +KP+RG  L F   + N  ++
Sbjct: 136 STLIVYLNDVEEGGETVFPL---------------LGIAMKPKRGAALYFEYFYRNQALN 180

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
             +LH S PV++GEKWVAT+W+R Q
Sbjct: 181 DLTLHSSVPVVRGEKWVATQWMRRQ 205


>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 156

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 70/154 (45%), Positives = 96/154 (62%), Gaps = 6/154 (3%)

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F+   +DK  I++ IE +IA  T +P  +GE   VL Y +G+KY+ HYD F         
Sbjct: 2   FLKRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNG 59

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY--DYKKCI--GLKVKPRRGDGLLFYS 248
            QR+A+ L+YLSDVEEGGET+FP     F    +  D  +C   GL +KP+ GD LLF+S
Sbjct: 60  GQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWS 119

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           + P+ T+D +SLHG CPVI G KW +TKW+  +E
Sbjct: 120 MRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEE 153


>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 369

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 87/207 (42%), Positives = 112/207 (54%), Gaps = 21/207 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           PRA  +  F +  +C   IA A  +L  S +     GE V S    RTS G F    ED 
Sbjct: 83  PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEYGPQMSQ 194
             +++ +E +I+  T LP  +GE   VLRY  GQKYD+H DAF      + A  G    Q
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGG----Q 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPN 252
           R+A+ L+YL+DV++GGET+FP            Y  C   G+ VKPRRGD LLF+S+  +
Sbjct: 195 RVATVLMYLNDVDDGGETVFPETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSM--D 252

Query: 253 GTIDRTSLHGSCPV-IKGEKWVATKWI 278
            T  R SLHG CPV   G KW  TKWI
Sbjct: 253 ETFTR-SLHGGCPVGAGGVKWSMTKWI 278


>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 210

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 3/140 (2%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+A+ L+Y
Sbjct: 12  VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 71

Query: 203 LSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           LS+VE+GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ T D  S
Sbjct: 72  LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 131

Query: 260 LHGSCPVIKGEKWVATKWIR 279
           LHGSCP I+G+KW ATKWI 
Sbjct: 132 LHGSCPAIEGQKWSATKWIH 151


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 82/237 (34%), Positives = 118/237 (49%), Gaps = 31/237 (13%)

Query: 50  EVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKP 109
           E VE  N     H   GD     IP       PR + F +F S ++C  +IA  + RLK 
Sbjct: 72  EAVENSNAV---HTADGD-----IPILFAIETPRIVLFQHFLSDQECDELIAIGRNRLKR 123

Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
           S + +      E+    RTS G      E    ++  IE +IA+A  +P  HGE F VL 
Sbjct: 124 SPV-VNPDTGEENLISARTSQGGMFQVGEHP--LIAKIEARIAQAVGVPVEHGEGFQVLN 180

Query: 170 YEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDS 224
           Y+ G +Y  H+D FNP   G         QR+A+ ++YL+ V+ GG T FP         
Sbjct: 181 YQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP--------- 231

Query: 225 GYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
                  +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 232 ------KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282


>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 204

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 66/140 (47%), Positives = 88/140 (62%), Gaps = 3/140 (2%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+A+ L+Y
Sbjct: 6   VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 65

Query: 203 LSDVEEGGETMFPFENGIFLDSGYD-YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           LS+VE+GGET+FP   G  L    D +  C   G  VKP +GD LLF+SL P+ T D  S
Sbjct: 66  LSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTDSDS 125

Query: 260 LHGSCPVIKGEKWVATKWIR 279
           LHGSCP I+G+KW ATKWI 
Sbjct: 126 LHGSCPAIEGQKWSATKWIH 145


>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
          Length = 394

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 5/146 (3%)

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           A++D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           +A+ L+YLS+VE+GGET+FP   G  L   D+ +      G  VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
            T D  SLHGSCPVI+G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334


>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
          Length = 369

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 5/146 (3%)

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           A++D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFL---DSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           +A+ L+YLS+VE+GGET+FP   G  L   D+ +      G  VKP +GD LLF+SL P+
Sbjct: 249 IATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPD 308

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
            T D  SLHGSCPVI+G+KW ATKWI
Sbjct: 309 ATTDSDSLHGSCPVIEGQKWSATKWI 334


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 75/212 (35%), Positives = 110/212 (51%), Gaps = 33/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTSSGTFIS 135
           +PRA+ F N  S ++C  +IA +K +L      LR G     T  T     RTSSGTF  
Sbjct: 99  KPRAILFGNVLSHDECDQLIALSKTKL------LRSGVVDHQTGNTKLHEHRTSSGTFFH 152

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGP 190
                T  + +I+ ++A    +P++HGE   +L Y++G +Y  HYD F P     A++  
Sbjct: 153 --RGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLA 210

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +  QR A+ ++YL+DV+ GGET+FP                 GL + P +G  + F    
Sbjct: 211 RGGQRTATLIIYLNDVDGGGETIFPRN---------------GLSIVPAKGSAIYFSYTN 255

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
               +D  S HG  PVI+GEKW+ATKW+R  E
Sbjct: 256 AENQLDSLSFHGGSPVIEGEKWIATKWVRQNE 287


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 79/219 (36%), Positives = 114/219 (52%), Gaps = 25/219 (11%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +I   + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
            G         QR+A+ ++YL+ V  GG T FP                +GL+V P +G+
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVPAGGATGFP---------------KLGLEVAPVKGN 244

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 245 AVFFVYKRPDGTLDDKTLHAGLPVERGEKWIATKWLRER 283


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 116/217 (53%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C+ +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECEELIELSKNKMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
          Length = 458

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/213 (36%), Positives = 121/213 (56%), Gaps = 16/213 (7%)

Query: 75  FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGT 132
            Q++S   PRA  +  F + E+C  +I  +K R+  S+  +   ET  + K   RTS+G+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRM--SKSGVVDAETGGTAKSDIRTSTGS 234

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F+    +   +++ +E ++A  +MLP  H EA  VLRYE+ Q+Y +HYD F     G   
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYF--FHKGGMA 290

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDS-----GYDYKKC--IGLKVKPRRGDGLL 245
           + R+ + L+YL + E GGET+FP    + L+      G ++ +C   G     R+GD L+
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFP-NTEVPLERAEKGWGKNFSECGNRGRAAVVRKGDALI 349

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           F+S+ P G +D  S H  CPV++GEKW ATKWI
Sbjct: 350 FWSMKPGGELDPGSSHAGCPVVRGEKWTATKWI 382


>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
          Length = 325

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 81/213 (38%), Positives = 120/213 (56%), Gaps = 20/213 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           F+ +SW PRA    NFAS E+   +I  A+ +L+ S +   +GE+V      RTS G FI
Sbjct: 32  FEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VDNYRTSYGMFI 89

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++  +E ++A  T    TH E   VLRY   Q+Y +H+D+ +        S 
Sbjct: 90  RRHHDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD------DDSP 141

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGY-----DYKKCI--GLKVKPRRGDGLLFY 247
           R A+ L+YLSDVE GGET FP  N  ++D         + +C    + +KP+RGD ++F+
Sbjct: 142 RTATVLIYLSDVESGGETTFP--NSEWIDPALPKALGPFSECAQGHVAMKPKRGDAIVFH 199

Query: 248 SLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIR 279
           SL P+G + D+ +LH +CPVI G K+VA  WI 
Sbjct: 200 SLNPDGRSHDQHALHTACPVIVGVKYVAIFWIH 232


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 118/230 (51%), Gaps = 28/230 (12%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 67  PGAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 126

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 127 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 182

Query: 177 DSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP                
Sbjct: 183 QPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGFP---------------K 227

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GL+V P +G+ + F    P+G +D  +LH   PV +GEKW+ATKW+R++
Sbjct: 228 LGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKWIATKWLRER 277


>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
 gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
          Length = 299

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 85/248 (34%), Positives = 128/248 (51%), Gaps = 39/248 (15%)

Query: 50  EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
           EV  + +ESG+P  +          GD  +  +    +S R PR + F N  S E+C +I
Sbjct: 74  EVAVQVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 129

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
           IA A+ R++ S L +      E+    RTS+G F    E++  ++ L+E +IAR    P 
Sbjct: 130 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGENE--LISLVEQRIARLLNWPL 186

Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
            +GE   VL Y  G +Y  HYD F P E G P +     QR+ + ++YL++   GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246

Query: 215 PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVA 274
           P                +GL+V PRRG+ + F    P+      +LHG  PV++GEKW+A
Sbjct: 247 P---------------DVGLQVVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIA 289

Query: 275 TKWIRDQE 282
           TKW+R++E
Sbjct: 290 TKWLRERE 297


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKNKMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 117/226 (51%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   V    PR + F N  S E+C +IIA A+ R+  S L +      E  
Sbjct: 92  DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS+G F    E  TGI+  +E +IAR    P  HGE   VL Y  G +Y  H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205

Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P E G P +     QR+ + ++YL++ E GG T+FP                + L+V P
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVP 250

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           RRG+ + F    P+ +    +LHG  PV+ GEKW+ATKW+R++E H
Sbjct: 251 RRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREFH 294


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 117/226 (51%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   V    PR + F N  S E+C +IIA A+ R+  S L +      E  
Sbjct: 92  DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS+G F    E  TGI+  +E +IAR    P  HGE   VL Y  G +Y  H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205

Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P E G P +     QR+ + ++YL++ E GG T+FP                + L+V P
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP---------------EVPLQVVP 250

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           RRG+ + F    P+ +    +LHG  PV+ GEKW+ATKW+R++E H
Sbjct: 251 RRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREFH 294


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 113/199 (56%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
           P  +   N  S E+C+S+I  +K  +K S++ A R+ + +      RTSSGTF+  +E  
Sbjct: 43  PLIVVLENVLSDEECESLIELSKDSMKRSKIGASREVDNI------RTSSGTFLEENE-- 94

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
              + +IE +++    +P  HGE  ++L+Y  GQ+Y +HYD F       + + R+++ +
Sbjct: 95  --TVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRAAE-NNRISTLV 151

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L + P++G  + F   + + +++  +L
Sbjct: 152 MYLNDVEEGGETFFP---------------KLNLSIAPKKGSAVYFEYFYNDKSLNELTL 196

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PVIKGEKWVAT+W++
Sbjct: 197 HGGAPVIKGEKWVATQWMK 215


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 109/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+    ++ R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
 gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
          Length = 274

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 122/223 (54%), Gaps = 17/223 (7%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTK 124
           GDA +  I  + LSW PRA    N     + ++I+A A+ R+ + + +    G++V    
Sbjct: 2   GDARV--IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSV--VN 57

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS  TF+S ++    ++  +  +++  T LP  H E   VL Y  G+KYD+H D   
Sbjct: 58  PIRTSKQTFLSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGE 114

Query: 185 PA-EYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYD--YKKCIGLKV- 236
              + G Q+S+    R+A+ LLYL + EEGGET FP    I  +      + KC   +V 
Sbjct: 115 EGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVA 174

Query: 237 -KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
            KP RGDGL+F+S+ P+GTID  +LH  CP  +G KW AT W+
Sbjct: 175 MKPTRGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWV 217


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 80/232 (34%), Positives = 117/232 (50%), Gaps = 29/232 (12%)

Query: 55  ENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
           EN + + H   GD     IP       PR + F +F S  +C  +IA  + RLK S + +
Sbjct: 75  ENSNAV-HTADGD-----IPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV-V 127

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
                 E+    RTS G      E    ++  IE +IA+A  +P  HGE F VL Y+ G 
Sbjct: 128 NPDTGEENLISARTSQGGMFQVGEHP--LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGG 185

Query: 175 KYDSHYDAFNPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYK 229
           +Y  H+D FNP   G         QR+A+ ++YL+ V+ GG T FP              
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP-------------- 231

Query: 230 KCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
             +GL+V P +G+ + F    P+GT+D  +LH   PV +GEKW+ATKW+R++
Sbjct: 232 -KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRER 282


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 109/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+    ++ R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
          Length = 244

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 117/232 (50%), Gaps = 25/232 (10%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           S +++  + LS  PR     NF SAE+C+ II TA   L PS + L+QG+     +  + 
Sbjct: 14  SSRTVEVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKD 72

Query: 129 SSGTFISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
              T  +A   + K  I+  I  ++     +P ++ E   VL+Y   Q Y  HYD F+P 
Sbjct: 73  EVRTSETAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPK 132

Query: 187 EYGPQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-----------I 232
            Y  + S    RL +   YL+ VE+GGET+FPF N     S  ++ K             
Sbjct: 133 MYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPFGN----TSAEEHHKIQSWGPCENAVES 188

Query: 233 GLKVKPRRGDGLLFYSLFPNG----TIDRTSLHGSCPVIKGEKWVATKWIRD 280
            +KVKP RG  ++FY + P+G     +D TSLHG C  I GEKW A  WIR+
Sbjct: 189 SIKVKPVRGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWIRN 240


>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1036

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/150 (45%), Positives = 92/150 (61%), Gaps = 11/150 (7%)

Query: 47  RTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
           RTL  ++  +  GL H    +        Q LSW PR  Y PNFA+ +QC+++I  AK +
Sbjct: 776 RTLITLDSSDSQGLVHVLMWND-------QGLSWNPRVFYLPNFATKQQCEAVIDMAKPK 828

Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
           LKPS LALR+    E+               ED++G+L  IE KIA AT  P+ + E+FN
Sbjct: 829 LKPSTLALRK----ETKHFQMQYRSLHQHTDEDESGVLAAIEEKIALATRFPKDYYESFN 884

Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           +LRY++GQKYDSHYDAF+ AEYGP +SQR+
Sbjct: 885 ILRYQLGQKYDSHYDAFHSAEYGPLISQRV 914


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 68/199 (34%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K  +K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDKLIELSKNNMKRSKVG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 116/217 (53%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 281

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 74/194 (38%), Positives = 112/194 (57%), Gaps = 11/194 (5%)

Query: 93  AEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
           AE+   I+  +++RL+  +  +  G+    T   RTS G F+   ED+  I++ +E +IA
Sbjct: 7   AEEADHIVKVSERRLE--RSGVVGGDGGSETSNIRTSYGVFLDRGEDE--IVKRVEERIA 62

Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGET 212
             T++P  +GE   VLRY+  QKYD+H+D F   +       R A+ L+YL D EEGGET
Sbjct: 63  AWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGET 122

Query: 213 MFPFENGIFLDSGYD--YKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
           +FP    +    G +  + +C    L  KP++G  +LF+S+ P G ++R SLH +CPVI+
Sbjct: 123 VFP---NVAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLHTACPVIR 179

Query: 269 GEKWVATKWIRDQE 282
           G KW A KWI   E
Sbjct: 180 GIKWSAAKWIHHAE 193


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 109/200 (54%), Gaps = 29/200 (14%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  SE   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 90

Query: 142 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 199
             L L IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ 
Sbjct: 91  --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 146

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +
Sbjct: 147 VMYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELT 191

Query: 260 LHGSCPVIKGEKWVATKWIR 279
           LHG  PV KGEKW+AT+W+R
Sbjct: 192 LHGGAPVTKGEKWIATQWVR 211


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 116/218 (53%), Gaps = 34/218 (15%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 57  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDV 107

Query: 124 KGTRTSSGTFISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
              RTSSG F+  SE     L L IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 108 NDIRTSSGAFLEDSE-----LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 162

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           F  AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G
Sbjct: 163 F--AEHSRSAANNRISTLVMYLNDVEEGGETYFP---------------KLNLSVHPRKG 205

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 206 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 243


>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
 gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
          Length = 216

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIKRSKIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKWVAT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWVR 211


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSSGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
 gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
          Length = 298

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 81/223 (36%), Positives = 115/223 (51%), Gaps = 30/223 (13%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PR   + NF +  +C+ I  TA   +K S +  + G +V  T   RTS GTFI
Sbjct: 2   IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFN------------VLRYEIGQKYDSHYDA 182
               D   ++E I  ++A  T  P  + E               VLRY IGQKY +H D+
Sbjct: 60  RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY-----DYKKCIGLKV- 236
                     S R+A+ LLYL D EEGGET FP ++  +L          + +C    V 
Sbjct: 118 LI------DDSPRMATVLLYLHDTEEGGETAFP-DSSSWLTPDLATRMGPFSECAQGHVA 170

Query: 237 -KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
            +P++GD L+F+S+ P+GT D  S+H  CPV+KG KW AT W+
Sbjct: 171 FRPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWV 213


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 115/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
 gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
          Length = 306

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 84/248 (33%), Positives = 126/248 (50%), Gaps = 39/248 (15%)

Query: 50  EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
           EV    +ESG+P  +          GD  +  +    +S R PR + F N  S E+C +I
Sbjct: 81  EVAVLVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 136

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
           IA A+ R++ S L +      E+    RTS+G F    E+   ++ L+E +IAR    P 
Sbjct: 137 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISLVEQRIARLLNWPL 193

Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
            +GE   VL Y  G +Y  HYD F P E G P +     QR+ + ++YL++   GG T F
Sbjct: 194 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 253

Query: 215 PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVA 274
           P                +GL++ PRRG+ + F    P+      +LHG  PV++GEKW+A
Sbjct: 254 P---------------DVGLQIVPRRGNAVFFSYNRPDPAT--KTLHGGAPVLEGEKWIA 296

Query: 275 TKWIRDQE 282
           TKW+R++E
Sbjct: 297 TKWLRERE 304


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 109/200 (54%), Gaps = 29/200 (14%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  SE   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 122

Query: 142 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 199
             L L IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ 
Sbjct: 123 --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 178

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +
Sbjct: 179 VMYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELT 223

Query: 260 LHGSCPVIKGEKWVATKWIR 279
           LHG  PV KGEKW+AT+W+R
Sbjct: 224 LHGGAPVTKGEKWIATQWVR 243


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score =  130 bits (326), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 259

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 77/209 (36%), Positives = 117/209 (55%), Gaps = 15/209 (7%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           ++W+PR   + NF +  + + +I  A  ++K S +    G++VE     RTS GTF+   
Sbjct: 4   VAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFLKRY 61

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D+  I+E IE+++A  T +P  H E   +LRY +GQ+Y  H D     E G     R+A
Sbjct: 62  QDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----VRVA 115

Query: 198 SFLLYLSDVEEGGETMFP---FENGIFLDS-GYDYKKCIGLKV--KPRRGDGLLFYSLFP 251
           + L+YL++ + GGET FP   + N     + G ++  C    V   P+RGD LLF+S+ P
Sbjct: 116 TVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSINP 175

Query: 252 NG-TIDRTSLHGSCPVIKGEKWVATKWIR 279
           +G T D  + H  CPV+ G KW ATKWI 
Sbjct: 176 DGNTEDTHASHTGCPVLSGVKWTATKWIH 204


>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
 gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
          Length = 216

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------QLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
 gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
          Length = 216

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
 gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
          Length = 216

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECGELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECGELIEMSKNKIKRSTIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
 gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
          Length = 216

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           B4264]
 gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           B4264]
          Length = 216

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +I+  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
 gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
          Length = 216

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
 gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
          Length = 216

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F+  +E    + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSKGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 110/203 (54%), Gaps = 21/203 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSGTFISASED 139
           +P  L+   F S E+C  +I  +++RLKPS  +  + GE  E     RTS G      E+
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGE--EKAATGRTSKGMSFYLQEN 165

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLAS 198
           +   ++ +E +IA     P  +GE   VL Y IG++Y SH+D F  ++  P+   QR+ +
Sbjct: 166 E--FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGT 223

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL+YL+DV  GGET+FP                 G+ + P++G  + F      G +DR 
Sbjct: 224 FLIYLNDVPAGGETVFP---------------KAGVSIVPKKGSAVYFQYGNSKGEVDRM 268

Query: 259 SLHGSCPVIKGEKWVATKWIRDQ 281
           SLH S PV +GEKWVATKWIR +
Sbjct: 269 SLHSSIPVSEGEKWVATKWIRQE 291


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
 gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
          Length = 216

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
 gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
          Length = 263

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/220 (35%), Positives = 116/220 (52%), Gaps = 31/220 (14%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PRA  +  F +  +C  +I  A  +L+ S +     + ++  + + ++S  + 
Sbjct: 59  VETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDDIRTSFSASIMY- 117

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-S 193
                +T I+  IE +IAR T           VLRY  GQKYD+H+D F+  E      S
Sbjct: 118 ----GETSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGS 162

Query: 194 QRLASFLLYLSDVE--EGGETMFPFENGIFLD------SGYDYKKC---IGLKVKPRRGD 242
            R+A+ L+YLSDV+   GGET  P      LD       G  Y +C   +G+ ++PR+GD
Sbjct: 163 NRMATVLMYLSDVDPAAGGETALPLAEP--LDPHKQSVDGQGYSQCAARMGISIRPRKGD 220

Query: 243 GLLFYSLFPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQ 281
            LLF+ + P G I DR +LH SCP   G KW ATKWI ++
Sbjct: 221 VLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNK 260


>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
 gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
          Length = 248

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 108/199 (54%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +I+  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSINELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG   V KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWVR 211


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
 gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
          Length = 220

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 75/212 (35%), Positives = 108/212 (50%), Gaps = 31/212 (14%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
           LS +PR    P F + E+C  +I T+K +L+P           E + G  R+  G F+  
Sbjct: 28  LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            E++  + + I +K+     +  +  E   ++RY  G++  +HYD FNP      M    
Sbjct: 79  GEEEHPVTKNIFNKMKNFVNISDS-CEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137

Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
             QR+ + L+YL DVEEGGET FP                +G+KVKP RGD +LFY+  P
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFPE---------------VGIKVKPIRGDAVLFYNCKP 182

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           NG +D  SLH   PV KG KWVA K I  + +
Sbjct: 183 NGDVDPLSLHQGDPVTKGTKWVAIKLINQKSK 214


>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
          Length = 352

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/220 (35%), Positives = 108/220 (49%), Gaps = 24/220 (10%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + LSW PRA  + NF S E+ + ++   + R+  S +   Q   V      RTS GTFI
Sbjct: 68  IEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSDI---RTSFGTFI 124

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  +LE IE + A  + +P  H E   +LRY  GQKY  H D       G    +
Sbjct: 125 PKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISENGG----K 178

Query: 195 RLASFLLYLSDVEEGGETMFPFENGI-------------FLDSGYDYKKCIGLKVKPRRG 241
           R+A+ L++L +  EGGET F   N +             F D GY   K  G  VKP+ G
Sbjct: 179 RIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGK--GFAVKPKVG 236

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D +LF+S    G  D  S+H SCP + G KW AT WI ++
Sbjct: 237 DAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHER 276


>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
 gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
 gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 272

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 75/197 (38%), Positives = 115/197 (58%), Gaps = 31/197 (15%)

Query: 75  FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
            +V++  PRA  + NF         + E+C  +I+ AK  +  S++  R   T +     
Sbjct: 88  LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKV--RNALTGLGEESS 145

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           +RTSSGTFI +  DK  I++ IE +I+  T +PQ +GE   V+ YE+GQK++ H+D F  
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
                   QR+A+ L+YLSDV++GGET+FP   GI        K   G+ V+P++GD LL
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFPEAKGI--------KSKKGVSVRPKKGDALL 245

Query: 246 FYSLFPNGTIDRTSLHG 262
           F+S+ P+G+ D +S HG
Sbjct: 246 FWSMRPDGSRDPSSKHG 262


>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
 gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
          Length = 216

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKWVAT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWVR 211


>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
 gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
          Length = 216

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVISDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F+  +E    + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSKGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSSGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+ T+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWITTQWVR 227


>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           AH820]
 gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
 gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH820]
 gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
          Length = 216

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F+  +E    + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSKGAFLDDNE----LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 121/228 (53%), Gaps = 24/228 (10%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           S LP         ++I  +VL   P  + F N  S E+CQ++I  A  RL+ S+LA ++ 
Sbjct: 6   SILPIQSVYSIDNRTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEI 65

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
            ++      RTSSG F    E++  ++  IE +I+    LP  H E   VL YE GQ++ 
Sbjct: 66  SSI------RTSSGMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFK 117

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           +H+D F P  +    + R+++ ++YL+DVEEGG T FP                +G+   
Sbjct: 118 AHFDFFGP-NHPSSSNNRISTLVVYLNDVEEGGVTTFP---------------NLGIVNV 161

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           P++G  + F   + +  ++  +LH   PVI+GEKWVAT+W+R ++  E
Sbjct: 162 PKKGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWMRKKQIRE 209


>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
          Length = 280

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 120/225 (53%), Gaps = 30/225 (13%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  SAE+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 78  DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E++  I+  +E +IA     P   GE   +LRY  G +Y  HYD 
Sbjct: 135 VD--RTSDGMFFERGENE--IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDY 190

Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           F+P+E G P +     QR+A+ ++YL + E GG T FP                +GL+V 
Sbjct: 191 FDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEVA 235

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P RG G+ F    P+  + RT LHG  PV+ GEKWVATKW+R++E
Sbjct: 236 PARGCGVFFSYDRPD-PVTRT-LHGGAPVLAGEKWVATKWLRERE 278


>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
 gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
          Length = 216

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
           anthracis str. CI]
 gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
           anthracis str. CI]
          Length = 216

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDRKIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
 gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
          Length = 216

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
 gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
          Length = 216

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + +  ++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
 gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
          Length = 216

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVIYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
 gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
          Length = 216

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   +   +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQGQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
 gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
          Length = 311

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 79/212 (37%), Positives = 111/212 (52%), Gaps = 17/212 (8%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           V+SW+PRA    NF +  +C  I   A+  ++ S +    G +V      RTS GTFI+ 
Sbjct: 3   VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +  T ++  +E ++A  T  P  + E   VLRY +GQ Y  H D+          S R+
Sbjct: 61  YQ--TPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSPRM 112

Query: 197 ASFLLYLSDVEEGGETMFP----FENGIFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLF 250
           A+ LLYLS+ E GGET FP    + +         +  C+   +  KPRRGD LLF+S+ 
Sbjct: 113 ATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVK 172

Query: 251 PNG-TIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+G T D  S H  CPVI+G KW AT W+  Q
Sbjct: 173 PDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQ 204


>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
 gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
          Length = 248

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEISKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 280

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 83/225 (36%), Positives = 120/225 (53%), Gaps = 30/225 (13%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  S E+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 78  DAGDRQVQVL--QTMR-HPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLN 134

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E++  I+  +E ++A     P  +GE   +LRY  G +Y  HYD 
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190

Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           F+P E G P +     QR+A+ ++YL + E+GG T FP                +GL+V 
Sbjct: 191 FDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFP---------------DVGLEVA 235

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P RG G+ F    P+  + RT LHG  PV+ GEKWVATKW+R++E
Sbjct: 236 PVRGTGVFFSYDRPD-PVTRT-LHGGAPVLAGEKWVATKWLRERE 278


>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
 gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
          Length = 248

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
 gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
          Length = 147

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/137 (48%), Positives = 87/137 (63%), Gaps = 4/137 (2%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           I+  IE +IA  T +P  +GE   VL Y +GQK++ H+D  +          R A+FL+Y
Sbjct: 10  IVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMY 69

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           LSDVEEGGET+FP  N     S    K   G+ VKP+ GD LLF+S+ P+G++D  SLHG
Sbjct: 70  LSDVEEGGETVFP--NATAKGSAPSAKS--GISVKPKMGDALLFWSMKPDGSLDPKSLHG 125

Query: 263 SCPVIKGEKWVATKWIR 279
           + PVIKG+KW ATKWI 
Sbjct: 126 ASPVIKGDKWSATKWIH 142


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
                IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 180 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 224

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 225 HGGAPVTKGEKWIATQWVR 243


>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
 gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
          Length = 232

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWVR 227


>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
 gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
          Length = 299

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/248 (33%), Positives = 125/248 (50%), Gaps = 39/248 (15%)

Query: 50  EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
           EV  + +ESG+P  +          GD  +  +    +S R PR + F N  S E+C +I
Sbjct: 74  EVAVQIHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSNEECDAI 129

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
           IA A+ R++ S L +      E+    RTS+G F    E+   ++  +E +IAR    P 
Sbjct: 130 IAAARPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISRVEQRIARLLNWPL 186

Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
            +GE   VL Y  G +Y  HYD F P E G P +     QR+ + ++YL++   GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246

Query: 215 PFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVA 274
           P                +GL+V PRRG+ + F    P       +LHG  PV++GEKW+A
Sbjct: 247 P---------------DVGLQVVPRRGNAVFFSYNRPEPATK--TLHGGAPVLEGEKWIA 289

Query: 275 TKWIRDQE 282
           TKW+R++E
Sbjct: 290 TKWLRERE 297


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S +      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG   V KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWVR 211


>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
 gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
 gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CNEVA-9066]
 gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A1055]
 gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Western North America USA6153]
 gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Kruger B]
 gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Vollum]
 gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Australia 94]
 gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
 gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Ames]
 gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. 'Ames Ancestor']
 gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
          Length = 216

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 32/218 (14%)

Query: 64  ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           +T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +   
Sbjct: 24  KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 74

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 75  VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 130

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           F  AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G
Sbjct: 131 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKG 173

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 174 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
 gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
          Length = 216

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKWVAT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWMR 211


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 76/207 (36%), Positives = 112/207 (54%), Gaps = 27/207 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
           +PR + F N  S E+C +II  A+ R+  S  +A R G   E     RTS+G F    E+
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 178

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
              ++  +E +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     Q
Sbjct: 179 P--VVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ ++YL+D E+GG T FP                + L+V PRRG+ + F    P+ +
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 281

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
               +LHG  PV+ G+KW+ATKW+R++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRER 306


>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus AH187]
 gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           Q1]
 gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus NC7401]
 gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
 gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH187]
 gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           Q1]
 gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NC7401]
 gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
          Length = 216

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
 gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
          Length = 232

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 32/218 (14%)

Query: 64  ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           +T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +   
Sbjct: 40  KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRD 90

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 91  VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           F  AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKG 189

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 190 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
          Length = 277

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 79/216 (36%), Positives = 119/216 (55%), Gaps = 25/216 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
           +SWRPRA  +  F S  +C  +I+ AK+  K  +  +  GE+ ES T   RTSSG F+  
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103

Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
            +D+  ++  IE +IA  TMLP                   +GE+  +LRY  G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI--GLKVK 237
           +D  +  +   +   R+A+ L+YLS+V+ G +++ P +  +       +  C   G  VK
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMG-DSLLP-QARLSQPKDETWSDCAEQGFAVK 219

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
           P +G  +LF+SL PN T+D  SLHGSCPVI+GEK V
Sbjct: 220 PAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKVV 255


>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 280

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 120/225 (53%), Gaps = 30/225 (13%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  SAE+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 78  DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E++  I+  +E ++A     P  +GE   +LRY  G +Y  HYD 
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190

Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           F+P E G P +     QR+A+ ++YL + E GG T FP                +GL+V 
Sbjct: 191 FDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP---------------DVGLEVA 235

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           P RG G+ F    P+  + RT LHG  PV+ GEKWVATKW+R++E
Sbjct: 236 PVRGCGVFFSYDRPD-PVTRT-LHGGAPVLAGEKWVATKWLRERE 278


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 79/234 (33%), Positives = 119/234 (50%), Gaps = 24/234 (10%)

Query: 48  TLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL 107
           T EV E   +  + H    D  + S      + +P  L+     S+E+C  +I+ ++ RL
Sbjct: 67  TYEVPEIGRKGNILHTSDKDVKVLS-----RNEKPFVLHLDQVLSSEECDELISLSRSRL 121

Query: 108 KPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
           +PS L + +G   E     RTS        E++  ++E IE +IA  T  P  +GE   +
Sbjct: 122 QPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE--LVERIETRIAELTGYPAENGEGLQI 178

Query: 168 LRYEIGQKYDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
           L Y +G++Y  H+D F P      +  QR+ +FL+YL+DVE+GGET+F            
Sbjct: 179 LNYGLGEEYKPHFDFFPPHMADASKGGQRVGTFLIYLNDVEDGGETVF------------ 226

Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                 GL   P++G  + F+     G +DR S+H S PV KGEKW ATKWIR+
Sbjct: 227 ---SKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEKWAATKWIRE 277


>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
          Length = 216

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDRSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 245

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 74/197 (37%), Positives = 116/197 (58%), Gaps = 31/197 (15%)

Query: 75  FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
            +V++  PRA  + NF         + E+C+ +I+ AK  +  S++  R   T +     
Sbjct: 55  LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKV--RNAITGLGEESS 112

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           +RTSSGTF+    DK  I++ IE +I+  T +P+ +GEA  V+ YE+GQK++ H+D F  
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
                   QR+A+ L+YLSDV++GGET+FP   GI        K   G+ V+P++GD LL
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFPEAKGI--------KSKKGVSVRPKKGDALL 212

Query: 246 FYSLFPNGTIDRTSLHG 262
           F+S+ P+G+ D +S HG
Sbjct: 213 FWSMRPDGSQDPSSKHG 229


>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
           Hakam]
 gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
           Hakam]
          Length = 232

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVIYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+   E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDDE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
 gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
          Length = 221

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 109/212 (51%), Gaps = 31/212 (14%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
           LS  PR    P F + E+C+ +I T+K +L+P           E + G  R+  G F+  
Sbjct: 28  LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            E+   I + I +K+     + ++  E   V+RY  G++  SH+D FNP      M    
Sbjct: 79  GEEDHQITKNIFNKMKSFVNISES-CEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137

Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
             QR+ + L+YL DVEEGGET FP                +G+KVKP +GD +LFY+  P
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFPE---------------VGIKVKPIKGDAVLFYNCKP 182

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           NG +D  SLH   PV+KG KWVA K I  + +
Sbjct: 183 NGDVDPLSLHQGDPVLKGNKWVAIKLINQKSK 214


>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
 gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
 gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Sterne]
 gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
          Length = 232

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 32/218 (14%)

Query: 64  ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           +T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +   
Sbjct: 40  KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 90

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 91  VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           F  AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKG 189

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 190 MAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
 gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
          Length = 232

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWMR 227


>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
 gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
          Length = 216

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 114/218 (52%), Gaps = 32/218 (14%)

Query: 64  ETGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           +T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +   
Sbjct: 24  KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 74

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 75  VNDIRTSSGAFLDDNE----LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDY 130

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           F  AE+     + R+++ + YL+DVEEGGET FP                + L V PR+G
Sbjct: 131 F--AEHSRSAANNRISTLVXYLNDVEEGGETFFP---------------KLNLSVHPRKG 173

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
             + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 174 XAVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
 gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           ATCC 10987]
          Length = 216

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWMR 211


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
 gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
          Length = 232

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 65/199 (32%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+    +  
Sbjct: 55  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFL----EDN 105

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 106 KLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 164 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 208

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 209 HGGAPVTKGEKWIATQWVR 227


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 74/229 (32%), Positives = 115/229 (50%), Gaps = 23/229 (10%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           +G P     D   + +P       P+   F    S ++C +++A ++ RL  S + +   
Sbjct: 80  AGAPTVYAADGGDRRVPILFRLASPQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPD 138

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
              E+    RTS G     +E    ++  IE +IA  T +P  HGE   +L Y+ G +Y 
Sbjct: 139 TGDENLIDARTSMGAMFQVAEH--ALIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQ 196

Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
            H+D FNP   G   Q+S   QR+A+ ++YL+  E GG T FP                +
Sbjct: 197 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RV 241

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           GL+V P +G+ + F  L P+GT+D  +LH   PV  GEKW+ATKW+R++
Sbjct: 242 GLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKWIATKWLRER 290


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 119/228 (52%), Gaps = 24/228 (10%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           S LP         ++I  +VL   P  + F N  S E+CQ++I  A  RL+ S+LA ++ 
Sbjct: 6   SILPIQSVYSIDNRTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEI 65

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
            ++      RTSSG F    E++  ++  IE +I+    LP  H E   VL YE GQ++ 
Sbjct: 66  SSI------RTSSGMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFK 117

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
            H+D F P  +    + R+ + ++YL+DVEEGG T FP                +G+   
Sbjct: 118 PHFDFFGP-NHPSSSNNRICTLVVYLNDVEEGGVTTFP---------------NLGIVNV 161

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           P++G  + F   + +  ++  +LH   PVI+GEKWVAT+W+R ++  E
Sbjct: 162 PKKGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWMRKKQIRE 209


>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
 gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
          Length = 284

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 114/221 (51%), Gaps = 25/221 (11%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           DA  + +   +    PR + F N  S E+CQ++I  A+ R+  S L ++     E     
Sbjct: 82  DAGDRRVEVLMAMANPRVVLFGNLLSPEECQAVIEAARTRMARS-LTVQAASGGEEVNKD 140

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTS G F    E++   +  +E +IAR    P  +GE   VL Y  G +Y  HYD F+PA
Sbjct: 141 RTSDGMFFQRGENEA--VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPA 198

Query: 187 EYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           E G P++     QR+A+ ++YL+D   GG T FP                + L++ PR+G
Sbjct: 199 EPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFP---------------DVPLEIGPRQG 243

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           + + F   +        +LHG  PVI+GEKW+ATKW+R++E
Sbjct: 244 NAVFFS--YGRAHPSSRTLHGGAPVIEGEKWIATKWLRERE 282


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 107/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +  +    RTSSG F+  +E   
Sbjct: 77  PLIVVLANVLSDEECDELIELSKNKMERSKIG-----SSRNVNDIRTSSGAFLEENE--- 128

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
                IE +I+  T +P  HGE  ++L Y + Q+Y +HYD F  AE+     + R+++ +
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF--AEHSRSAANNRISTLV 185

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 186 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 230

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 231 HGGAPVTKGEKWIATQWMR 249


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 52  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 103

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 160

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 161 MYLNDVEEGGETFFP---------------KLNLSVNPRKGMAVYFEYFYQDQSLNELTL 205

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 206 HGGAPVTKGEKWIATQWVR 224


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 73/229 (31%), Positives = 115/229 (50%), Gaps = 23/229 (10%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           +G P     D   + +P       P+   F    + ++C +++A ++ RL  S + +   
Sbjct: 81  AGTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPD 139

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
              E+    RTS G     +E    ++  IE +IA  T +P  HGE   +L Y+ G +Y 
Sbjct: 140 TGDENLIDARTSMGAMFQVAEH--ALIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQ 197

Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
            H+D FNP   G   Q+S   QR+A+ ++YL+  E GG T FP                +
Sbjct: 198 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RV 242

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           GL+V P +G+ + F  L P+GT+D  +LH   PV  GEKW+ATKW+R++
Sbjct: 243 GLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASGEKWIATKWLRER 291


>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 251

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 115/221 (52%), Gaps = 32/221 (14%)

Query: 68  ASIQSIPFQV-LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +S  +IP  + +S +PR    P F + E+C+ +I T+K +LKP           E + G 
Sbjct: 49  SSTDNIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGV 99

Query: 127 -RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            R+  G F+   E+   + + I +++     L ++  E   V+RY  G++  +H+D FNP
Sbjct: 100 HRSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLTES-SEVMQVIRYNPGEETSAHFDYFNP 158

Query: 186 AEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
                 M      QR+ + L+YL+DVEEGGET FP                + +KVKP +
Sbjct: 159 LTTNGAMKIGLYGQRICTILMYLADVEEGGETSFPE---------------VNVKVKPIK 203

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           GD +LFY+  PNG +D  SLH   PVIKG KW+A K +  +
Sbjct: 204 GDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVNQK 244


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F     + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFHQDQSLNELTLHGGAPVTKGEKWIATQWVR 211


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
 gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
          Length = 216

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 114/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLGNVLSDEECGELIELSKNKLARSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 174

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 175 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWMR 211


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETYFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/217 (33%), Positives = 113/217 (52%), Gaps = 32/217 (14%)

Query: 65  TGDASIQSIP-FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +    
Sbjct: 41  TEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDV 91

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 92  NDIRTSKGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF 147

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             AE+     + R+++ ++YL+DVEEGGET FP                + L V PR+G 
Sbjct: 148 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP---------------KLNLSVHPRKGM 190

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + F   + + +++  +LHG  PV KGEKW+AT+W+R
Sbjct: 191 AVYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWVR 227


>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 277

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 112/208 (53%), Gaps = 29/208 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 140
           P    F N  SA +C+++IA A+ RL  S  + +R G   E     RTS G F +  E++
Sbjct: 90  PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGG--EELNHDRTSHGMFYTRGENE 147

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
             ++  IE +IAR    P  +GE   VLRY  G +Y  HYD F+P E G         QR
Sbjct: 148 --VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQR 205

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF-YSLFPNGT 254
           +AS ++YL +  EGG T+FP                IGLKV+P++G  + F Y+L    +
Sbjct: 206 VASLIMYLREPGEGGATVFP---------------DIGLKVRPQQGSAVFFSYALAHPAS 250

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +   +LHG  PV  GEKW+ATKW+R++E
Sbjct: 251 L---TLHGGEPVKSGEKWIATKWLRERE 275


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/210 (34%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P  + F    + ++C+ +I  A  RL+ S+L  +           RTS G F   
Sbjct: 25  VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNK------VVSEIRTSRGMFFE- 77

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ-R 195
            E++   +  IE +I+    +P  H E   VL Y  GQ+Y +HYD F P    P  S  R
Sbjct: 78  -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +++ ++YL+DVE GGET+FP                + L+VKP RG  L F   +    +
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL---------------LDLEVKPERGSALYFEYFYRQQEL 179

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           +  +LH S PV++GEKWVAT+W+R Q   E
Sbjct: 180 NNLTLHSSVPVVRGEKWVATQWMRRQRVRE 209


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
           fasciculatum]
          Length = 244

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 108/210 (51%), Gaps = 31/210 (14%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
           +S  PR    P+F S  +C+ +I  +K +L+P           E + G  R+  G F+  
Sbjct: 29  MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            E+   +++ I  ++     L + + E   V+RY  G++  +HYD FNP      M    
Sbjct: 80  GEEDHDVVKKIFQRMKMLVNLTE-NCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKIGL 138

Query: 194 --QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
             QR+ + L+YLS+VEEGGET FP                +G+KVKP +GD +LFY+  P
Sbjct: 139 YGQRVCTILMYLSEVEEGGETSFPE---------------VGVKVKPVKGDAVLFYNCKP 183

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           NG +D  SLH   PVIKG KWVA K I  +
Sbjct: 184 NGEVDPLSLHQGDPVIKGTKWVAIKLINQK 213


>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
 gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
          Length = 216

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/199 (33%), Positives = 106/199 (53%), Gaps = 27/199 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E  T
Sbjct: 39  PLIVVLGNVLSDEECGELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNELTT 93

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            I    E +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 94  KI----EKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                + L V PR+G  + F   + + +++  +L
Sbjct: 148 MYLNDVEEGGETFFP---------------KLNLSVHPRKGMAVYFEYFYQDQSLNELTL 192

Query: 261 HGSCPVIKGEKWVATKWIR 279
           HG  PV KGEKW+AT+W+R
Sbjct: 193 HGGAPVTKGEKWIATQWVR 211


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/206 (37%), Positives = 109/206 (52%), Gaps = 25/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +   N  SAE+C +II +AK +L  S L ++     E     RTSSG F +    +T
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
             +  +E +IAR    P  +GE   VL Y  G +Y  HYD F+P E G P +     QR+
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL++   GG T FP                +GL+V P +G  + F    P+ T  
Sbjct: 226 ATLVMYLNEPARGGGTTFP---------------DVGLEVAPVKGSAVFFSYDRPHPTT- 269

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
             SLHG  PV++GEKWVATKW+R++E
Sbjct: 270 -RSLHGGAPVLEGEKWVATKWLRERE 294


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 111/230 (48%), Gaps = 35/230 (15%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G   DA+ + IP  +   RP+ + F N    ++C  +I  +  +L+       Q  TV +
Sbjct: 77  GNVIDAADRRIPVLIRCERPQIVVFGNVLDQDECDEMIQRSMHKLE-------QSTTVNA 129

Query: 123 TKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
             GT      RTS GT+    ED   ++  IE ++A     P  +GE   VLRY  G +Y
Sbjct: 130 ETGTQEVIRHRTSHGTWFQNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEY 187

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC 231
            SHYD F P   G         QR+A+ ++YL+DV  GGET+FP                
Sbjct: 188 RSHYDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFPEA-------------- 233

Query: 232 IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            G+ V PRRGD + F  +     +D  +LH   PV  GEKW+ TKW+R++
Sbjct: 234 -GISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKWIMTKWVRER 282


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 111/207 (53%), Gaps = 27/207 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
           +PR + F N  S E+C +II  A+ R+  S  +A R G   E     RTS+G F    E+
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 167

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
              ++  +E +IAR    P  +GE   VL Y  G +Y  HYD F+P E G P +     Q
Sbjct: 168 P--MVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A+ ++YL+D E+GG T FP                + L+V PRRG+ + F    P+ +
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP---------------DVHLEVAPRRGNAVFFSYERPHPS 270

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
               +LHG  PV+ G+KW+ATKW+R++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRER 295


>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 363

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 76/219 (34%), Positives = 115/219 (52%), Gaps = 27/219 (12%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           +  LSW PRA  + NF + ++C+ +IA  +K+L+ S +   +G+  +     RTS GTFI
Sbjct: 91  WTTLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFI 149

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +     T  L  +E ++A  + +P  H E   +LRYE GQ+Y +               +
Sbjct: 150 T--RRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGNG-------------EK 194

Query: 195 RLASFLLYLSDVEEGGETMFPFENGI------FLDSGYDYKKC-----IGLKVKPRRGDG 243
           R+A+ L++L + E GGET FP    +      FL S      C      G  V PR+GD 
Sbjct: 195 RIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDA 254

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +LF+S   NGT D  + H SCP ++G K+ ATKWI ++E
Sbjct: 255 ILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKE 293


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 102/205 (49%), Gaps = 23/205 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR L   N     +C +++A A+ RL+ S + +      E+    RTS G      E   
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +L+ IE +IA  T  P  HGE F VL Y+ G +Y  H+D FNP   G         QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL+    GG T FP                IGL+V P +G+ +LF    P+G +D
Sbjct: 218 ATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGALD 262

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
             +LH   PV  GEKW+ATKW+R+ 
Sbjct: 263 ERTLHAGLPVEAGEKWIATKWLREH 287


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 102/205 (49%), Gaps = 23/205 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR L   N     +C +++A A+ RL+ S + +      E+    RTS G      E   
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +L+ IE +IA  T  P  HGE F VL Y+ G +Y  H+D FNP   G         QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL+    GG T FP                IGL+V P +G+ +LF    P+G +D
Sbjct: 218 ATMVIYLNSPASGGATAFP---------------RIGLEVAPVKGNAVLFSYGLPDGALD 262

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
             +LH   PV  GEKW+ATKW+R+ 
Sbjct: 263 ERTLHAGLPVEAGEKWIATKWLREH 287


>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
 gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
          Length = 299

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F N  S E+C +IIA A  R++ S L +      E+    RTS+G F    E+  
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 169

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
            ++  +E +IAR    P  +GE   VL Y  G +Y  HYD F P E G P +     QR+
Sbjct: 170 -LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
            + ++YL++   GG T FP                +GL+V PRRG+ + F    P+    
Sbjct: 229 GTLVMYLNEPARGGATTFPD---------------VGLQVVPRRGNAVFFSYNRPDPAT- 272

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
             +LHG  PV++GEKW+ATKW+R++E
Sbjct: 273 -KTLHGGAPVLEGEKWIATKWLRERE 297


>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
          Length = 208

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/148 (46%), Positives = 86/148 (58%), Gaps = 10/148 (6%)

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
           K  I+  IE KIA  T LP+ +GE   VLRYE G+KYD H+D F       +   R+A+ 
Sbjct: 7   KDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGHRVATV 66

Query: 200 LLYLSDVEEGGETMFP---------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           L+YL+DV +GGET+FP           + I  D+  D  K  G  VKP+RGD LLF+SL 
Sbjct: 67  LMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAK-RGTAVKPKRGDALLFFSLT 125

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
                D  SLH  CPVI+GEKW  TKWI
Sbjct: 126 TQAKPDTRSLHAGCPVIEGEKWSVTKWI 153


>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 279

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/276 (32%), Positives = 128/276 (46%), Gaps = 53/276 (19%)

Query: 39  VPSIRPKLRTL-------EVVEKENESGLPHGETGDASIQSIP----------------- 74
           VP++R  +R         ++   + E+G PH E        +P                 
Sbjct: 23  VPALRASMRAAGWQDAATDLALAQLEAGFPHVEVARPRRTEMPGPDLEGAPLYIDAGDRR 82

Query: 75  FQVLSW--RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSG 131
            QVL     PR + F N  S E+C+ +IA A+ RL  S  +  R G  V +    RTS G
Sbjct: 83  VQVLQTMRHPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVD--RTSEG 140

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-P 190
            F    E+   I+  +E +IA     P   GE   +LRY  G +Y  HYD F+P E G P
Sbjct: 141 MFFERGEND--IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTP 198

Query: 191 QM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
            +     QR+A+ ++YL +  +GG T FP                +GL+V P RG G+ F
Sbjct: 199 TILKRGGQRVATLVMYLQEPGQGGATTFP---------------DVGLEVAPVRGTGVFF 243

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
               P+      +LHG  PV+ GEKWVATKW+R++E
Sbjct: 244 SYEEPDPAT--RTLHGGAPVLAGEKWVATKWLRERE 277


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 74/228 (32%), Positives = 113/228 (49%), Gaps = 28/228 (12%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + LP G+  D     +P       PR   F      ++C+++IA ++ RL  S + +   
Sbjct: 108 NALPAGDGRD-----VPVLFAIESPRIALFQRLLMPDECEALIALSRGRLARSPV-VNPD 161

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
              E+    RTS G      E    ++E +E +IA  T +P  HGE   +L Y+ G +Y 
Sbjct: 162 TGDENLIDARTSMGAMFQVGEHP--LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQ 219

Query: 178 SHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
            HYD FNP   G         QR+A+ ++YL+DV  GG T FP                +
Sbjct: 220 PHYDFFNPQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP---------------KL 264

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           GL+V P +G+ + F  L  +G++D  +LH   PV +GEKW+ATKW+R+
Sbjct: 265 GLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKWIATKWLRE 312


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 73/233 (31%), Positives = 114/233 (48%), Gaps = 23/233 (9%)

Query: 59  GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGE 118
           G P     D   + +P       P+   F    + ++C +++A ++ RL  S + +    
Sbjct: 82  GTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDT 140

Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
             E+    RTS G     +E    ++  IE +IA  T +P  HGE   +L Y+ G +Y  
Sbjct: 141 GDENLIDARTSMGAMFQVAEHP--LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQP 198

Query: 179 HYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
           H+D FNP   G   Q+S   QR+A+ ++YL+  E GG T FP                +G
Sbjct: 199 HFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP---------------RVG 243

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
           L+V P +G+ + F  L P+G +D  +LH   PV  GEKW+ATKW+R++    D
Sbjct: 244 LEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFGEKWIATKWLRERPYRSD 296


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 72/241 (29%), Positives = 118/241 (48%), Gaps = 32/241 (13%)

Query: 53  EKENESGLPH---------GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
           E  N++ LP+         G   D   +S+   +   RP  +    F S E+C+ +I  +
Sbjct: 58  EPSNKAKLPYQYETSLVAAGNNIDLFDRSVKVSLRVSRPDIVVVDEFMSGEECEQLIEQS 117

Query: 104 KKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 163
           +++L PS +   Q    +     R+S GT+    E  + ++  ++ +I+     P+ HGE
Sbjct: 118 RRKLTPSAIVDPQTGKFQVI-ADRSSEGTYFQRGE--SPLISRLDRRISELMNWPEDHGE 174

Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEN 218
              +L Y +G +Y  H+D F   E G      Q  QR+A+ ++YL++V EGGET+FP   
Sbjct: 175 GIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATLVMYLNEVTEGGETVFPD-- 232

Query: 219 GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
                        +G+ + P+RG    F      G +D  +LHG  PV+ GEKW+ATKW+
Sbjct: 233 -------------VGISITPKRGSAAYFAYCNSLGQVDPATLHGGAPVLTGEKWIATKWM 279

Query: 279 R 279
           R
Sbjct: 280 R 280


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 110/207 (53%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F N  S E+C ++IA A+ RL  S L +      E     RTS G F    +  
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 170

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
           + +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     QR
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + + ++YL+  E+GG T FP                + L+V P+RG+ + F    P+ + 
Sbjct: 231 VGTLVMYLNTPEKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 275

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
              +LHG  PVI GEKW+ATKW+R++E
Sbjct: 276 --RTLHGGAPVIAGEKWIATKWLRERE 300


>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 252

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/207 (38%), Positives = 109/207 (52%), Gaps = 15/207 (7%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           V+SW PRA    NF + ++   I   A+  ++ S +    G +V      RTS GTFI+ 
Sbjct: 3   VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
               T ++  +E ++A  T +P  + E   VLRY  GQ Y  H D+          S RL
Sbjct: 61  Y--ATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSPRL 112

Query: 197 ASFLLYLSDVEEGGETMFP--FENGIFLDSGYDYKKCIGLKV--KPRRGDGLLFYSLFPN 252
           A+ LLYLSD E GGET FP  + +         + +C+   V  KPR+GD LLF+S+ P+
Sbjct: 113 ATVLLYLSDPELGGETAFPLAWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFWSVKPD 172

Query: 253 G-TIDRTSLHGSCPVIKGEKWVATKWI 278
           G T D  S H  CPVI+G KW AT W+
Sbjct: 173 GKTEDPLSEHEGCPVIRGVKWTATVWV 199


>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 324

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/219 (35%), Positives = 108/219 (49%), Gaps = 17/219 (7%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
            +  + LS  P       F   ++   I+A + + LKPS + L  G    +    RTS+ 
Sbjct: 105 DVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTT 164

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
            F+S+S  K   L+ I+ ++A  T +P  H E   VLRYE  QKYD H D F P E+   
Sbjct: 165 YFLSSS--KHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKN 221

Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKC-IGLKVKPR 239
            P +          R+ +   Y+SDV +GG T+FP   G         K C  GLKV P+
Sbjct: 222 SPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGA--PRPQSMKDCSTGLKVSPK 279

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +   ++FYS+ PNG  D  SLHG CPV  G K+   KW+
Sbjct: 280 KRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 65/198 (32%), Positives = 108/198 (54%), Gaps = 24/198 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K R+  S++A        +    RTSS TFI   E++ 
Sbjct: 39  PLIVVLGNVLSDEECDELIRLSKDRINRSKIA------NANVDNMRTSSSTFIE--ENEN 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  IE +I++   +P  +GE   +L Y++GQ+Y SH+D F+ + +    + R+++ ++
Sbjct: 91  IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YLSDVE+GGET FP                +   V P++G  + F   + + T++  +LH
Sbjct: 150 YLSDVEQGGETYFP---------------KLHFSVSPQKGMAVYFEYFYNDQTLNELTLH 194

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PVI G+KW AT+W+R
Sbjct: 195 GGAPVIVGDKWAATQWMR 212


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 117/222 (52%), Gaps = 27/222 (12%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKG 125
           DA  +++   +   +PR + F N  SAE+C ++IA A  R+  S  +A + G   E    
Sbjct: 97  DAGDRAVNILLAIAKPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGG--EEVND 154

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTS G F    E+   +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+P
Sbjct: 155 DRTSDGMFFQRGENP--VVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDP 212

Query: 186 AEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
            E G P +     QR+ + ++YL+  E+GG T FP                + ++V P+R
Sbjct: 213 GEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQR 257

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           G+ + F+S        RT LHG  PVI GEKW+ATKW+R++E
Sbjct: 258 GNAV-FFSYERAHPATRT-LHGGAPVIAGEKWIATKWLRERE 297


>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
 gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
          Length = 289

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F N  S E+CQ+II  A+ R+  S L ++     E     RTS G F    E  T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +++ +E +IAR    P  +GE   VL Y  G +Y  HYD F+P + G         QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL++  +GG T FP                + L+V PR+G+ + F    P+ +  
Sbjct: 219 ATLVIYLNNPRKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
             +LHG   VI+GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 74/206 (35%), Positives = 108/206 (52%), Gaps = 25/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + F N  S  +C++++  A+ RL  S L +      E     RTS G F +  E+  
Sbjct: 90  PDLVVFGNLLSDSECEALMEVAQPRLARS-LTVNIKTGGEERNRDRTSQGMFFARGENP- 147

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
            +++ +E +IAR    P   GE   VLRY  G +Y  HYD F+PAE G P +     QR+
Sbjct: 148 -LVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRV 206

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL++ E+GG T+FP                IGL+V PRRG  + F   +P     
Sbjct: 207 ATLIMYLNEPEQGGATVFP---------------DIGLQVTPRRGTAVFFS--YPAANPA 249

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
             + HG  PV  GEKW+ATKW+R++E
Sbjct: 250 SLTRHGGEPVKAGEKWIATKWLRERE 275


>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 317

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 112/217 (51%), Gaps = 19/217 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + LSW PR     NF S E+C+ +I   +K+L+ S +         ST   RTS GTF++
Sbjct: 37  ETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNSDESGAVST--ARTSFGTFVT 94

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
               +T  L+ +E ++A+ + +P  H E   +LRY  GQ+Y +H+D       G    +R
Sbjct: 95  RRLTET--LQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISENGG----KR 148

Query: 196 LASFLLYLSDVEEGGETMFP------FENGIFLDSGYDYKKC-----IGLKVKPRRGDGL 244
           +A+ L++L +   GGET FP           FL +     +C      G  V P++G+ +
Sbjct: 149 IATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECGWNDGNGFSVIPKKGEAV 208

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           LF+S   NGT D  + H SCP + G K+ ATKWI + 
Sbjct: 209 LFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHEN 245


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 108/200 (54%), Gaps = 24/200 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  A  ++K S++    G T E  +  RTSS  FI   +D+ 
Sbjct: 33  PLIVVLGNVLSDEECDELIQLAGDKVKRSKI----GTTREENE-LRTSSSMFIE--DDEN 85

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  ++ +I+    +P  HGE   +LRY  GQ+Y +H+D F+        + R+++ ++
Sbjct: 86  LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVE+GGET FP                +   V PR+G  + F   + + T++  +LH
Sbjct: 144 YLNDVEQGGETFFPH---------------LKFSVSPRKGMAVYFEYFYSDQTLNDFTLH 188

Query: 262 GSCPVIKGEKWVATKWIRDQ 281
           G  PV++GEKWVAT+W+R Q
Sbjct: 189 GGAPVVEGEKWVATQWMRKQ 208


>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 245

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/193 (39%), Positives = 106/193 (54%), Gaps = 15/193 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
            PRA  F NF +  +   ++  A  +LK S +    GE V      RTS G FI    D 
Sbjct: 60  HPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGV--VDEIRTSYGMFIRRLADP 117

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQRLASF 199
             ++  IE +I+  T LP  H E   VLRY  GQ Y +HYD+ + + E GP+   RLA+F
Sbjct: 118 --VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW--RLATF 173

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYK-----KCI--GLKVKPRRGDGLLFYSLFPN 252
           L+YLSDVEEGGET FP +N ++ D     +     +C    +  KP+ GD +LFYS +PN
Sbjct: 174 LMYLSDVEEGGETAFP-QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYPN 232

Query: 253 GTIDRTSLHGSCP 265
            T+D  ++H  CP
Sbjct: 233 LTMDPAAMHTGCP 245


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 27/198 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C ++I  +K +LK S++   + E        RTSS TF+   E + 
Sbjct: 37  PLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN-----DMRTSSSTFMEEGESE- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  +E +I++   +P  +GE   +L Y+IGQ+Y +H+D F  A      + R+++ ++
Sbjct: 91  -VVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNAS-----NPRISTLVM 144

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGGET FP                +   V P++G  + F   + N  ++  +LH
Sbjct: 145 YLNDVEEGGETYFP---------------KLNFSVSPQKGMAVYFEYFYDNQELNDLTLH 189

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PVI G+KW AT+W+R
Sbjct: 190 GGAPVIIGDKWAATQWMR 207


>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
 gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
          Length = 304

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 73/208 (35%), Positives = 115/208 (55%), Gaps = 15/208 (7%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           ++W+PR   + NF +  + + +I  A  ++K S +    G++VE +  T  ++G  +   
Sbjct: 4   VAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAG--VRRY 61

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   ++E IE+++A  T +   H E   +LRY IGQ+Y  H D     E G     R+A
Sbjct: 62  QDD--VVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----VRVA 115

Query: 198 SFLLYLSDVEEGGETMFP---FENGIFLDS-GYDYKKCIGLKV--KPRRGDGLLFYSLFP 251
           + L+YL++ E GGET FP   + N    ++ G ++  C    V   P+RGD LLF+S+ P
Sbjct: 116 TVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIGP 175

Query: 252 NGTI-DRTSLHGSCPVIKGEKWVATKWI 278
           +GT  D  + H  CPV+ G KW ATKWI
Sbjct: 176 DGTTEDYHASHTGCPVLSGVKWTATKWI 203


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 74/207 (35%), Positives = 109/207 (52%), Gaps = 25/207 (12%)

Query: 80  WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
           + PR + F +  S ++C+ +I  AK RL  S L +      E     RTSSG F    E+
Sbjct: 97  YNPRVVVFGSLLSDQECEQLIGLAKPRLARS-LTVATKTGGEEVNEDRTSSGMFFQRGEN 155

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
           +  ++  IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     Q
Sbjct: 156 E--LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQ 213

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + ++YL + E+GG T FP                + L+V P+RG G+ F    P+ +
Sbjct: 214 RVGTLVMYLGEPEKGGGTTFP---------------DVHLEVAPKRGHGVFFSYERPHPS 258

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
               +LHG  PV+ GEKW+ATKW+R++
Sbjct: 259 T--RTLHGGAPVLAGEKWIATKWLRER 283


>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
 gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
          Length = 289

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 25/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F N  S E+CQ+II  A+ R+  S L ++     E     RTS G F    E  T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +++ +E +IAR    P  +GE   VL Y  G +Y  HYD F+P + G         QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL++  +GG T FP                + L+V PR+G+ + F    P+ +  
Sbjct: 219 ATLVIYLNNPLKGGGTTFP---------------DVPLEVAPRQGNAVFFSYERPHPST- 262

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQE 282
             +LHG   VI+GEKW+ATKW+R++E
Sbjct: 263 -RTLHGGASVIEGEKWIATKWLRERE 287


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 110/207 (53%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F N  S E+C ++IA A+ R+  S L +      E     RTS G F    +  
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARS-LTVATKTGGEEINADRTSDGMFFQRGQ-- 171

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
           + +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + + ++YL+  ++GG T FP                + L+V P+RG+ + F    P+ + 
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP---------------DVHLEVAPQRGNAVFFSYERPHPST 276

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
              +LHG  PVI G+KW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVIAGDKWIATKWLRERE 301


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 112/224 (50%), Gaps = 23/224 (10%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G T  A  + +  ++   RP+ + F +  S E+C  +I  A+ RLK S     +  + E 
Sbjct: 77  GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 135

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G +    ED    +E ++H+I+     P  HGE   +L Y  G +Y  H+D 
Sbjct: 136 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 193

Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           F P + G  +      QR+A+ ++YLSDVE GGET+FP       D+G        L V 
Sbjct: 194 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP-------DAG--------LAVM 238

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            R+G  + F  +     +D  +LHG  PV  G+KW+ TKW+R++
Sbjct: 239 ARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 282


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 112/224 (50%), Gaps = 23/224 (10%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G T  A  + +  ++   RP+ + F +  S E+C  +I  A+ RLK S     +  + E 
Sbjct: 74  GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 132

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G +    ED    +E ++H+I+     P  HGE   +L Y  G +Y  H+D 
Sbjct: 133 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 190

Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           F P + G  +      QR+A+ ++YLSDVE GGET+FP       D+G        L V 
Sbjct: 191 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP-------DAG--------LAVM 235

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            R+G  + F  +     +D  +LHG  PV  G+KW+ TKW+R++
Sbjct: 236 ARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRER 279


>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
           sativus]
          Length = 164

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/155 (42%), Positives = 91/155 (58%), Gaps = 2/155 (1%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSG F+S  E    +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+  
Sbjct: 7   RTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDT 66

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
               +  QR+A+ L+YLS+  EGGET FP         G   K   GL VKP +GD +LF
Sbjct: 67  FNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCG--GKTVPGLSVKPAKGDAVLF 124

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +S+  +G  D  S+HG C V+ GEKW ATKW+R +
Sbjct: 125 WSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQK 159


>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 165

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 68/156 (43%), Positives = 92/156 (58%), Gaps = 4/156 (2%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+S+ E K+ +   IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+ 
Sbjct: 11  VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
                +  QR+A+ L+YLSD  EGGET FP         G    K  GL VKP +GD +L
Sbjct: 69  TFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGECSCGGKIVK--GLSVKPIKGDAVL 126

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           F+S+  +G  D  S+HG C V+ GEKW ATKW+R +
Sbjct: 127 FWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162


>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
          Length = 327

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 73/198 (36%), Positives = 101/198 (51%), Gaps = 22/198 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR  +FP F S E+C  +  TA+  L+PS   L            RTS G  I  + +  
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPS-FVLDPNSGRPIPHPIRTSDGGAIGPTNENL 198

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  I  +IA AT      GE+  VLRY  GQ+Y  H D    AE     +QR+A+F++
Sbjct: 199 -VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFIV 252

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+D  EGGET FP  N               ++V+PR GD + F ++ P+GT D   +H
Sbjct: 253 YLNDGFEGGETHFPLLN---------------IQVRPRIGDAIRFDTIRPDGTPDPRLVH 297

Query: 262 GSCPVIKGEKWVATKWIR 279
              PV  G KW+AT+WIR
Sbjct: 298 AGQPVRNGVKWIATRWIR 315


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 106/198 (53%), Gaps = 25/198 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      N  S E+C  +I+ +K R+  S++A  Q   +      RTS+  F+   ED +
Sbjct: 33  PFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI------RTSTSVFLP--EDAS 84

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            +++ +E +I++   +P  HGE   +L Y+IGQ+Y +H+D F+P +     + R+++ +L
Sbjct: 85  EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIE--NPRISTLVL 142

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGG+T FP                + L V P +G  + F   + +  ++  +LH
Sbjct: 143 YLNDVEEGGDTYFP---------------NLKLSVSPHKGMAVYFEYFYDDPMLNELTLH 187

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PV  G+KW AT W+R
Sbjct: 188 GGAPVTIGDKWAATMWMR 205


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 81/207 (39%), Positives = 106/207 (51%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+   +F SA +C+ +I+ A+ RL  S +     G  V    G R+S G F    E 
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
            T ++  +E +IA  T LP  +GE   +L YE+G +   H D   A NPA  E   +  Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+DVE GGETMFP                 G  V PRRG  L F      G 
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFEYGNRFGL 261

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D +SLH S P+  GEKWVATKWIR +
Sbjct: 262 ADPSSLHTSTPLRVGEKWVATKWIRTR 288


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 82/207 (39%), Positives = 105/207 (50%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+   +F SA +C+ +IA A+ RL  S +     G  V    G R+S G F    E 
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
            T ++  +E +IA  T LP  +GE   +L YE G +   H D   A NPA  E   +  Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+DVE GGETMFP                 G  V PRRG  L F      G 
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP---------------QTGWSVVPRRGQALYFEYGNRFGL 261

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D +SLH S P+  GEKWVATKWIR +
Sbjct: 262 ADPSSLHTSTPLRAGEKWVATKWIRTR 288


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 122/242 (50%), Gaps = 29/242 (11%)

Query: 50  EVVEKENESGLPHGETGDASIQ----SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKK 105
           E++ K +   +PH    +  ++     +    + ++P      +F S E+C ++I+ A +
Sbjct: 43  ELISKSSNQKIPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQ 102

Query: 106 RLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
           +LK S++     G  VE +  T TS+G        +  I++ IE +IA     P  HGE 
Sbjct: 103 KLKASRVVDPEDGSFVEHSARTSTSTGY----HRGEIDIIKTIEARIADLINWPVDHGEG 158

Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENG 219
             VLRYE G +Y  H+D F+PA+   ++      QR+ +FL+YLS+V+ GG T FP    
Sbjct: 159 LQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFP---- 214

Query: 220 IFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
                       +  +++P +G  L F +      I+  +LH   PV +G K++ATKW+R
Sbjct: 215 -----------NLNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKYLATKWLR 263

Query: 280 DQ 281
           ++
Sbjct: 264 EK 265


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 110/202 (54%), Gaps = 25/202 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
           P  L   N  S E+C  +I  +K +++ S++ A R+  ++      RTSSG F   SE++
Sbjct: 39  PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAREVNSI------RTSSGMFFEESENE 92

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             ++  IE ++++       + E   VL+Y   Q+Y +H+D F  A    + + R+++ +
Sbjct: 93  --LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                +GL V P +G  + F   + +  ++  +L
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194

Query: 261 HGSCPVIKGEKWVATKWIRDQE 282
           HG  PVIKGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216


>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
          Length = 187

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 61/122 (50%), Positives = 79/122 (64%), Gaps = 3/122 (2%)

Query: 161 HGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENG 219
           +GE+  +L YE G+KY+ HYD F+          R+A+ L+YLSDV +GGET+FP  E+ 
Sbjct: 8   NGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAESK 67

Query: 220 IFLDSGYDYKKCI--GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
           +       + +C   G  VKPR+GD LLF+SL  N T D  SLHGSCPVI+GEKW ATKW
Sbjct: 68  LSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWSATKW 127

Query: 278 IR 279
           I 
Sbjct: 128 IH 129


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 109/207 (52%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F N  S E+C ++IA A  R+  S L +      E     RTS G F    +  
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 171

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
           + +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     QR
Sbjct: 172 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQR 231

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + + ++YL+  E+GG T FP                + ++V P+RG+ + F    P+ + 
Sbjct: 232 VGTLVMYLNTPEKGGGTTFP---------------DVHVEVAPQRGNAVFFSYERPHPST 276

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQE 282
              +LHG  PV+ GEKW+ATKW+R++E
Sbjct: 277 --RTLHGGAPVLAGEKWIATKWLRERE 301


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/198 (30%), Positives = 105/198 (53%), Gaps = 24/198 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I+ +K R++ S+++ +      S    RTSS  F   +E+  
Sbjct: 44  PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEN-- 95

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  +E ++++   +P  HGE   +L Y IGQ+Y +HYD F+        + R+++ ++
Sbjct: 96  DVVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKVN-NPRISTLVM 154

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVE GGET FP                +   V P++G  + F   + + T++  +LH
Sbjct: 155 YLNDVEAGGETYFP---------------KLNFYVAPKKGMAVYFEYFYNDTTLNELTLH 199

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PV+ G+KW AT+W+R
Sbjct: 200 GGAPVVIGDKWAATQWMR 217


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 108/198 (54%), Gaps = 23/198 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K R++ S++A     ++E  +  RTSS TF    E++ 
Sbjct: 38  PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA----NSLEVDE-LRTSSSTFFHEGENE- 91

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  IE +I++   +P  HGE   +L Y+IGQ+Y +H+D F+        + R+++ ++
Sbjct: 92  -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAAS-NPRISTLVM 149

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVE+GGET FP                +   V P++G  + F   + +  ++  +LH
Sbjct: 150 YLNDVEQGGETYFP---------------KLNFSVSPQKGMAVYFEYFYNDQNLNDLTLH 194

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PV+ G+KW AT+W+R
Sbjct: 195 GGAPVVMGDKWAATQWMR 212


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 110/202 (54%), Gaps = 25/202 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
           P  L   N  S E+C  +I  +K +++ S++ A R+  ++      RTSSG F   SE++
Sbjct: 39  PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAREVNSI------RTSSGMFFDESENE 92

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             ++  IE ++++       + E   +L+Y   Q+Y +H+D F  A    + + R+++ +
Sbjct: 93  --LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DVEEGGET FP                +GL V P +G  + F   + +  ++  +L
Sbjct: 150 MYLNDVEEGGETYFP---------------KLGLSVSPTKGMAVYFEYFYSDAELNDRTL 194

Query: 261 HGSCPVIKGEKWVATKWIRDQE 282
           HG  PVIKGEKWVAT+W+R Q+
Sbjct: 195 HGGAPVIKGEKWVATQWMRKQK 216


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 76/225 (33%), Positives = 111/225 (49%), Gaps = 23/225 (10%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP      AS +S P    S +P      +     +C  +I   ++ ++ S + +     
Sbjct: 72  LPQARVLTASDRSAPKVFSSEQPVIALVADVLDDTECDRLIEIGREHVQRSSV-VDPDSG 130

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            E T   R S G F++AS D   ++E I+ +IA     P  +GE  ++LRY +G +Y  H
Sbjct: 131 KEITIEERRSEGAFVNASTD--ALVETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPH 188

Query: 180 YDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
           YD F   + G +       QR+A+ +LYL++VE+GG+T FP                IGL
Sbjct: 189 YDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQGGDTTFP---------------DIGL 233

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            + PRRG  L F  +   G  D  +LH   PV KGEKW+ATKWIR
Sbjct: 234 AIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKWIATKWIR 278


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 106/201 (52%), Gaps = 23/201 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  L   N  S E+C  +I  +K +++ S++   +          RTSSG F   SE++ 
Sbjct: 39  PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  IE ++++       + E   +L+Y   Q+Y +H+D F  A    + + R+++ ++
Sbjct: 93  -LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLVM 150

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGGET FP                +GL + P +G  + F   + +  ++  +LH
Sbjct: 151 YLNDVEEGGETYFP---------------KLGLSISPTKGMAVYFEYFYSDAELNDRTLH 195

Query: 262 GSCPVIKGEKWVATKWIRDQE 282
           G  PVIKGEKWVAT+W+R Q+
Sbjct: 196 GGAPVIKGEKWVATQWMRKQK 216


>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
          Length = 178

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S ++C  ++  AK R++ S +A    G+++ S    RTSSGTF+S 
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+  IE ++A  T LP+ + E+  +L YE+GQKYD+H+D F+      +   R+
Sbjct: 98  HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 62/198 (31%), Positives = 104/198 (52%), Gaps = 23/198 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C+ +I  ++ +LK S++         +    RTSS  F    E++ 
Sbjct: 39  PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG-----NTRTVDDIRTSSSMFFEEGENE- 92

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  IE ++++   +P  HGE   +L Y IGQ+Y +H+D F+ +      + R+++ ++
Sbjct: 93  -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSR-AASNPRISTLVM 150

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGGET FP                +   V P++G  + F   + N  ++  +LH
Sbjct: 151 YLNDVEEGGETYFP---------------KLNFSVNPQKGSAVYFEYFYDNQDLNDLTLH 195

Query: 262 GSCPVIKGEKWVATKWIR 279
           G  PVIKG KW AT+W+R
Sbjct: 196 GGAPVIKGSKWAATQWMR 213


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score =  116 bits (291), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 70/200 (35%), Positives = 101/200 (50%), Gaps = 25/200 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K+RL+ S++    GE   S    RTSSG F   +E   
Sbjct: 36  PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEE-RSVNQIRTSSGVFCEENE--- 87

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
             +  IE +I++   +P  HG+   VL Y  GQ+Y  H+D F         + R+++ ++
Sbjct: 88  -TVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRASA-NNRISTLVM 145

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGGET FP  N               L V P +G  + F   + N  ++  +LH
Sbjct: 146 YLNDVEEGGETTFPMLN---------------LSVFPSKGMAVYFEYFYSNHELNERTLH 190

Query: 262 GSCPVIKGEKWVATKWIRDQ 281
              PV KGEKWVAT W+R Q
Sbjct: 191 AGAPVRKGEKWVATMWMRRQ 210


>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
 gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
          Length = 318

 Score =  116 bits (291), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 75/224 (33%), Positives = 112/224 (50%), Gaps = 28/224 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD S++ +   +    PR + F N  S E+C+++IA A  R+  S L +      E  
Sbjct: 116 DCGDRSVEVL---LTMAHPRVVVFGNLLSPEECEALIAAAAPRMARS-LTVATQTGGEEV 171

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F    E    +++ IE +IA     P  +GE   VL Y  G +Y  HYD F
Sbjct: 172 NDDRTSHGMFFQRGESP--LVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYF 229

Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
           +PAE G P +     QR+ + ++YL+  E+GG T FP                  ++V P
Sbjct: 230 DPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFPDAQ---------------IEVAP 274

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +RG+   F    P  T    +LHG  PV+ G+KW+ATKW+R++E
Sbjct: 275 QRGNAAFFSYERP--TPSTRTLHGGAPVLAGDKWIATKWLRERE 316


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 25/200 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S  +C  +I  +++RL+ S++    GE   S    RTSSG F   +E  T
Sbjct: 36  PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-RSVNSIRTSSGVFCEQTETIT 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I    E +I++   +P  HG+   VLRY  GQ+Y  HYD F         + R+++ ++
Sbjct: 91  RI----EKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 145

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVE+GGET+FP                + L V P +G  + F   + N  ++  +LH
Sbjct: 146 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYRNQEVNEFTLH 190

Query: 262 GSCPVIKGEKWVATKWIRDQ 281
               VI GEKWVAT W+R Q
Sbjct: 191 AGAQVIHGEKWVATMWMRRQ 210


>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 243

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/147 (43%), Positives = 90/147 (61%), Gaps = 7/147 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +++SW PRA  + NF   E+C+ +I  AK  ++ S +   + G++ +S    RTSSGTF+
Sbjct: 79  EIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD F          Q
Sbjct: 135 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIF 221
           R+A+ L+YLSDVEEGGET+FP   G +
Sbjct: 193 RIATVLMYLSDVEEGGETVFPAAKGNY 219


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 101/207 (48%), Gaps = 32/207 (15%)

Query: 83  RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT------RTSSGTFISA 136
           R   F NFASA++C  +    +K+L            V  T G       R S+  ++  
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKL---------SRAVAWTDGAFRPVEFRISTAAWLQP 355

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
             D   ++  +  +IA AT L     EA  V  Y IG  Y++HYD     E       R+
Sbjct: 356 DHDD--VVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRI 413

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+F++YL+ VE+GG T FP                +G  V+P  GD + +Y+L P+G  D
Sbjct: 414 ATFMIYLNQVEQGGYTAFPR---------------LGAAVEPGHGDAVFWYNLLPDGESD 458

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             +LHG+CPV++G KWVA KWI +++ 
Sbjct: 459 NNTLHGACPVLQGSKWVANKWIHEKKN 485


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 99/194 (51%), Gaps = 25/194 (12%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  S E+C+ +I  +K ++  S++  +           RTSS TF+   +    +   IE
Sbjct: 41  NVVSEEECEELIFLSKNKMNRSKIGSQH-----EVSDIRTSSSTFLPEDD----LTNRIE 91

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
            ++A+   +P  HGE  ++L Y+ GQ+Y +HYD F         + R+++ +LYL+DVEE
Sbjct: 92  KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150

Query: 209 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
           GGET FP  N               L + P +G  + F   + +  I+  +LHG  PV  
Sbjct: 151 GGETYFPHMN---------------LSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTS 195

Query: 269 GEKWVATKWIRDQE 282
           GEKW AT W+R ++
Sbjct: 196 GEKWAATMWVRRKQ 209


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 100/207 (48%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+    F SA +C+ +I  A+ RL  S +     G  +    G R+S G F    E 
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQ 194
            T ++  IE +IA  T  P  +GE   +L YE G +   H D   P     AE   +  Q
Sbjct: 158 -TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+DVE GGET+FP                +G  V PRRG    F     +G 
Sbjct: 217 RVGTLLMYLNDVESGGETLFP---------------QVGCSVVPRRGQAFYFEYGNGSGR 261

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  SLH S P+  G+KWVATKWIR +
Sbjct: 262 SDPASLHASSPIGSGDKWVATKWIRTR 288


>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
 gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
          Length = 262

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 74/215 (34%), Positives = 106/215 (49%), Gaps = 18/215 (8%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P+A  +  F SAE+C  +I      LK S +   + +T       RTS GTF+   
Sbjct: 4   LSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDT-GVLDDVRTSFGTFLPKK 62

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
            D   +L  IE ++   + +   + E   +L+Y  GQ+Y  H D        P   +R+A
Sbjct: 63  YDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGLT----SPNGGRRIA 116

Query: 198 SFLLYLSDVEEGGETMFPFENGI---------FLDSGYD--YKKCIGLKVKPRRGDGLLF 246
           + L++L + E+GGET FP    +           D   D  ++   GL VKPRRGD +LF
Sbjct: 117 TVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGDAVLF 176

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +S   NG  D  S H SCP + G KW ATKWI ++
Sbjct: 177 FSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEK 211


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 102/201 (50%), Gaps = 25/201 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+     N  S E+C+++I  +K ++  S++      +       RTSS  F+   E   
Sbjct: 34  PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFLPDDE--- 85

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            +   IE ++A+   +P  HGE  ++L Y+ GQ+Y +H+D F       + + R+++ +L
Sbjct: 86  -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVL 143

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGGET FP  N               L V P +G  + F   + +  I+  +LH
Sbjct: 144 YLNDVEEGGETYFPEMN---------------LTVSPHKGMAVYFEYFYNDPAINERTLH 188

Query: 262 GSCPVIKGEKWVATKWIRDQE 282
           G  PV  GEKW AT W+R Q+
Sbjct: 189 GGSPVTAGEKWAATMWVRRQQ 209


>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 276

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/187 (36%), Positives = 108/187 (57%), Gaps = 7/187 (3%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVEST 123
             D  +  +  +V+SW PR + F NF S+E+C  +   A+ RL+ S +  +  G+ V+S 
Sbjct: 50  AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD 109

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F
Sbjct: 110 --VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYF 167

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE-NGIFLDSGYDYKKCIGLKVKPRRGD 242
           +      +  QR+A+ L+YL+D  EGGET FP   +G  +  G   +   GL VKP +GD
Sbjct: 168 SDTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAGDGECICGG---RLVRGLCVKPNKGD 224

Query: 243 GLLFYSL 249
            +LF+S+
Sbjct: 225 AVLFWSM 231


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 70/219 (31%), Positives = 107/219 (48%), Gaps = 23/219 (10%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
           A  + +  +V   RP+ + F +  S ++C  +I  A+ RLK S   +      E     R
Sbjct: 102 AHDRDVAVRVRFERPQVIVFDDVLSRDECDELIERARHRLKRS-TTVNPESGREDVIQLR 160

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G +    ED    +E ++ +I+     P  HGE   +L Y  G +Y  H+D F P++
Sbjct: 161 TSEGFWFQRCED--AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQ 218

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
            G  +      QR+A+ ++YLSDV  GGET+FP                 GL V  R+G 
Sbjct: 219 SGSVLHTSRGGQRVATLIVYLSDVAGGGETVFP---------------NAGLAVMARQGG 263

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            + F  L  +  +D  +LHG  PV  GEKW+ TKW+R++
Sbjct: 264 AIYFRYLNGHRQLDPLTLHGGAPVTNGEKWIMTKWMRER 302


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 25/200 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S  +C  +I  +++RL+ S++    GE   S    RTSSG F   +E  T
Sbjct: 41  PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTETIT 95

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I    E +I++   +P  HG+   VLRY  GQ+Y  HYD F         + R+++ ++
Sbjct: 96  RI----EKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 150

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVE+GGET+FP                + L V P +G  + F   + N  ++  +LH
Sbjct: 151 YLNDVEQGGETVFPL---------------LHLSVFPTKGMAVYFEYFYSNQELNDFTLH 195

Query: 262 GSCPVIKGEKWVATKWIRDQ 281
               VI GEKWVAT W+R Q
Sbjct: 196 AGTQVIHGEKWVATMWMRRQ 215


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 68/212 (32%), Positives = 107/212 (50%), Gaps = 21/212 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++   P  + F +  S ++   +   A+  LK + + +  G+ V  ++  RTS G 
Sbjct: 308 LKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGA 364

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQ 191
           ++    D   +   IE ++   T L     EA+N++ Y +G  Y +HYD FN   +   +
Sbjct: 365 WLE--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSE 422

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A+ L YLSDVE+GG T+FP                + L V P RG  L +Y+L  
Sbjct: 423 TGDRIATVLFYLSDVEQGGATVFP---------------NLKLAVSPERGMALFWYNLLD 467

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           NGT D  +LHG CPV+ G KWV T WI ++ Q
Sbjct: 468 NGTGDTRTLHGGCPVLVGSKWVMTLWIHERAQ 499


>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
          Length = 417

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 71/225 (31%), Positives = 110/225 (48%), Gaps = 17/225 (7%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
            +  + LS  P       F   ++   I+  + + LKPS + L  G    +    RTS+ 
Sbjct: 198 DVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTT 257

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
            F+ +  D    ++ I+ +++  T +P  H E   VLRYE  QKYD H D F P E+   
Sbjct: 258 YFLPS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKN 314

Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPR 239
            P +          R+ +   Y+SDV +GG T+FP   G    +    K C  GL V P+
Sbjct: 315 APHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRPTSM--KDCTTGLNVPPK 372

Query: 240 RGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           +   ++FYS+ PNG  D  SLHG CPV +G K+   KW+ ++ ++
Sbjct: 373 KRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWVWNKARY 417


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 72/210 (34%), Positives = 103/210 (49%), Gaps = 37/210 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR + F    S E+C  ++A A+ RL  S       ETV+++ G       RTS G F  
Sbjct: 92  PRVVVFGGLLSDEECDELVALARPRLARS-------ETVDNSTGGSEVNAARTSDGMFFE 144

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----- 190
             E    ++E IE +IA     P   GE   VLRY  G +Y  H+D F+PA  G      
Sbjct: 145 RGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILR 202

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +  QR+ + ++YL+    GG T FP                +GL+V+P +G+ + F    
Sbjct: 203 RGGQRVGTVVMYLNTPAGGGATTFP---------------EVGLEVQPVKGNAVFFSYER 247

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           P  +    +LHG  PV+ GEKWVATKW+R+
Sbjct: 248 PLAST--RTLHGGAPVLDGEKWVATKWMRE 275


>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
          Length = 315

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 140/281 (49%), Gaps = 35/281 (12%)

Query: 15  PTALLLCSFFFLAGFYGS--TFLSRDVPSIRPKLRTLEVVEKENESGLPH--GETGDASI 70
           P A +LC  F L    G   TF++  V S R ++  LEV  ++  SG+ +  G +   + 
Sbjct: 15  PPAAVLCGIFVLGIGSGVILTFIAAQVVSRRSEI--LEVKPQKLRSGMLNAPGTSLSPAD 72

Query: 71  QSIPFQVLSWR----PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTK 124
           +   F + +      PR     N  + E+C+S+ +          L +  G  E VES+ 
Sbjct: 73  KKNEFWIETISDLPGPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESS- 131

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
            TRT++  ++   +    ++  +E+ +A+ T     +GE   +L Y+  Q++  H+D F+
Sbjct: 132 -TRTNTAAWLEYHQGP--VVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFD 188

Query: 185 PAEYGPQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
           PA   P+       RLA+ ++YL + EEGGET              D+ K I  KVKP  
Sbjct: 189 PATDPPENFEPGGNRLATAIIYLQNAEEGGET--------------DFMK-IDTKVKPEA 233

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           G  +LFY L P+G++D+ ++H   P   GEKWVATKWI ++
Sbjct: 234 GSAVLFYDLKPDGSVDKLTIHSGNPPKGGEKWVATKWIHER 274


>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
 gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
          Length = 478

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 75/262 (28%), Positives = 123/262 (46%), Gaps = 41/262 (15%)

Query: 39  VPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPN--------- 89
           V + RP +   +++     +  P     + ++Q+         PR ++ PN         
Sbjct: 43  VMNFRPTVPVDQLINPFKRAASPQQTVNNQNLQTAANHGAPTDPRKIFIPNALKLNSDKL 102

Query: 90  -------FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
                  F + E+C+ IIA  + +L+PS+L+     + ES K  RTS    +   +D   
Sbjct: 103 EMYALGEFLTTEECERIIANIRSKLRPSELS-----SQESDKTYRTSRTCDLGTIDDP-- 155

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYGPQMSQRLASF 199
            +  ++ +I +   +  ++ E      YE+GQ++ +H D F      E+G  M QR  + 
Sbjct: 156 FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEIKEMPEHGAVMGQRTYTV 215

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++YL+DVEEGGET FP  +G                +KPR G  L++ SL  NG  +  S
Sbjct: 216 MIYLNDVEEGGETDFPAADG---------------AIKPRAGLALIWNSLQSNGAPNPHS 260

Query: 260 LHGSCPVIKGEKWVATKWIRDQ 281
           +H + PV+KG K V TKW R Q
Sbjct: 261 MHQAYPVLKGHKAVITKWFRSQ 282


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 71/225 (31%), Positives = 114/225 (50%), Gaps = 28/225 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +    PR + F N  +AE+C ++IA A++++K S +        +  
Sbjct: 81  QAGDRQVQVLASLL---HPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQDQQ 136

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F     +   +   +E +IA     P  +GE   VLRY  G +Y+ HYD F
Sbjct: 137 HQARTSEGMFFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYF 194

Query: 184 NPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
           +PA  G +++     QR+AS ++YL+   +GG T FP  +               L+V P
Sbjct: 195 DPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAH---------------LEVAP 239

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +G+ + F    P+      +LHG  PV++GEKWVATKW+R++  
Sbjct: 240 IKGNAVYFSYDRPHPMTG--TLHGGAPVVEGEKWVATKWLRERRH 282


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 101/205 (49%), Gaps = 23/205 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +   N  S ++C +I A ++ R   S   +     +     +RTS    I   E  T
Sbjct: 92  PRIVVLGNVLSDDECDAIAAMSRTRFARST-TIDNASGINRFDDSRTSESAHIQRGE--T 148

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQRL 196
            ++  I+ ++A  +  P  HGE   + +Y+ G +Y  H+D F+PA  G      +  QRL
Sbjct: 149 ELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRL 208

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ +LYL+DVEEGG T FP                IGL V P++G  L F +  P G  D
Sbjct: 209 ATIILYLTDVEEGGGTSFP---------------GIGLDVHPQKGGALFFRNTTPYGVPD 253

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
           R + H   PV KG K +A KW+R++
Sbjct: 254 RKTQHAGLPVEKGTKIIANKWLREK 278


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 100/200 (50%), Gaps = 25/200 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  L   N  S  +C  +I  A  R++ +++      +       RTSS  F   SE++ 
Sbjct: 32  PLILILDNVLSWAECDLLIDLASARMQRAKIG-----SSHDVSEVRTSSSMFFEESENEC 86

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
             +  +E ++A    +P +H E   VLRY+ G++Y  H+D F     G  M+ R+++ ++
Sbjct: 87  --IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVEEGGET FP                +   V P++G  + F   + +  ++  +LH
Sbjct: 142 YLNDVEEGGETYFP---------------SLHFSVTPKKGSAVYFEYFYNDTRLNELTLH 186

Query: 262 GSCPVIKGEKWVATKWIRDQ 281
              PV  GEKWVAT+W+R Q
Sbjct: 187 AGHPVEAGEKWVATQWMRRQ 206


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 77/205 (37%), Positives = 104/205 (50%), Gaps = 25/205 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A++  NF SA++C+ +IA A+ RL  S +     G  V +T   R+S G F    E 
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
            T ++  IE +IA  T  P  +GE   +L YE G +   H D     N A  E   +  Q
Sbjct: 158 -TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL DVE GGET+FP                +G  + P+RG  L F      G 
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP---------------QVGWSIVPQRGHALYFEYGNRYGM 261

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D +SLH S P+  G+KWVATKWIR
Sbjct: 262 CDPSSLHASTPLRTGDKWVATKWIR 286


>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
 gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 75/213 (35%), Positives = 109/213 (51%), Gaps = 24/213 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPS----QLALRQGETVESTKGTRTSSGTFISAS 137
           PR  Y  NF SA++   ++A +   + PS      A  QG +  +   TRTS   F   +
Sbjct: 2   PRVFYVHNFLSADEADELVAFS---MAPSTGGTHKAWNQGGS-NAKLTTRTSMNAFDITT 57

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQM 192
           +    I +    ++ R     +   +   +LRYE+GQ Y +H+D F     N   + P  
Sbjct: 58  KLSFRI-KRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSK 116

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG-----LKVKPRRGDGLL 245
             S R A+  LYLSDVE GG+T+   E    +D+G    K +      L V PRRGD +L
Sbjct: 117 GGSNRFATIFLYLSDVEVGGQTL---EKDAGVDAGSWEDKLVDQCYSKLAVPPRRGDAIL 173

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           FYS +P+G +D  SLHG+CP++KG KW A  W+
Sbjct: 174 FYSQYPDGHLDPNSLHGACPILKGTKWGANLWV 206


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 75/205 (36%), Positives = 100/205 (48%), Gaps = 25/205 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+    F +  +C  +IA A+ RL  S +     G  V +  G R+S GTF   +E 
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAA--GHRSSDGTFFRLAE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQ 194
            T ++  +E +IA  T L   +GE   +LRY+ G +   H D          E   +  Q
Sbjct: 158 -TPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL+DVE GGET+FP                +G  V PRRG  L F      G 
Sbjct: 217 RVGTLLMYLNDVEGGGETVFP---------------QVGCSVVPRRGQALYFEYCNRAGV 261

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIR 279
            D  SLH S P+  GEKWVATKWIR
Sbjct: 262 CDPASLHASTPLRSGEKWVATKWIR 286


>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
 gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
          Length = 282

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 108/220 (49%), Gaps = 30/220 (13%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           D     IP  V +  P         S  +C  +I  A+ RL+ +      G+  +     
Sbjct: 80  DGRTIGIPLSVDA--PALRVLDGLLSERECADLIELARPRLQRALTVDSDGK--QQIDQR 135

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTS G F  A E  T ++  IE ++A+   +P +HGE   +L Y  GQ+Y+ HYD F+PA
Sbjct: 136 RTSEGMFFRAGE--TPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPA 193

Query: 187 EYG-----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
             G      +  QR+AS ++YL+  E GG T FP                IGL V  RRG
Sbjct: 194 LPGYDKLTARAGQRIASVVMYLNTPERGGGTAFP---------------EIGLTVTARRG 238

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
             + F   +  G  D++SLH   PV++GEKW+AT W+R++
Sbjct: 239 AAVYF--AYEGG--DQSSLHAGLPVLQGEKWIATHWLRER 274


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 75/235 (31%), Positives = 113/235 (48%), Gaps = 30/235 (12%)

Query: 52  VEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ 111
           V ++N++ L   + GD  ++ +   +L   PR +   +F S  +C ++IA A+ RL  S+
Sbjct: 71  VLQDNDASL--LDLGDRQVRVLVSLLL---PRVVVLGDFLSDAECDALIALAQPRLARSR 125

Query: 112 LALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
             +      +     RTS    +   +D   + + IE +IAR    P  HGE   VLRY 
Sbjct: 126 -TVDNDNGAQIVHAARTSDSMCLQLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYA 182

Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
            G +Y  HYD F+P   G  +      QRLAS ++YL+  E GG T FP           
Sbjct: 183 TGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERGGATRFPD---------- 232

Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
                + L V   +G+ + F    P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 233 -----VHLDVAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRER 280


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 103/206 (50%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  SA +C  +I  ++ RLK S   +      E     RTS G +    ED+
Sbjct: 96  RPQLVVFADVLSAAECAELIERSRHRLKRST-TVNPLTGREDVIRNRTSEGVWYRRGEDQ 154

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQR 195
             ++  +E +IA  T  P  +GE   VL Y    +Y  H+D F P + G      Q  QR
Sbjct: 155 --LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQR 212

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 GL V  + G  + F  +     +
Sbjct: 213 VATLIIYLNDVADGGETVFP---------------TAGLSVAAQAGGAVYFRYMNAERQL 257

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D ++LHG  PV+ G+KW+ TKW+R++
Sbjct: 258 DPSTLHGGAPVLAGDKWIMTKWMRER 283


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 104/207 (50%), Gaps = 25/207 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A++  +F SA++C+ +IA A+ RL  S +     G  V    G R+S G F    E 
Sbjct: 101 RPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
            T ++  IE +IA  T  P  +GE   +L YE G +   H D     N A  E   +  Q
Sbjct: 158 -TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+ + L+YL DVE GGET+FP                IG  V P+RG  L F      G 
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP---------------QIGWSVAPQRGHALYFEYGNRFGL 261

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D +SLH S P+  G+KWVATKWIR +
Sbjct: 262 CDPSSLHASTPLRVGDKWVATKWIRTR 288


>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
          Length = 151

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 63/153 (41%), Positives = 88/153 (57%), Gaps = 3/153 (1%)

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F++  E K  ++  IE +I+  + +P  +GE   VLRYE  Q Y  H+D F       + 
Sbjct: 2   FLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRG 61

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
            QR+A+ L+YLSD  EGGET FP         G   K   GL VKP +G+ +LF+S+  +
Sbjct: 62  GQRIATMLMYLSDNVEGGETYFPNIGSGQCSCG--GKTVEGLSVKPTKGNAVLFWSMGLD 119

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           G  D  S+HG C V+ GEKW ATKW+R Q+ H+
Sbjct: 120 GQSDPLSVHGGCEVLAGEKWSATKWMR-QKAHQ 151


>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
 gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
          Length = 208

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 61/146 (41%), Positives = 90/146 (61%), Gaps = 5/146 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  +++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+         QR+
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL 222
           A+ L+YL+DV++GGE +FP   G  L
Sbjct: 154 ATVLMYLTDVKKGGEAVFPDAEGSHL 179


>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
 gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
          Length = 204

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/146 (41%), Positives = 90/146 (61%), Gaps = 5/146 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R+
Sbjct: 97  KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFL 222
           A+ L+YLS+VE+GGET+FP   G  L
Sbjct: 155 ATVLMYLSNVEKGGETIFPNAEGKLL 180


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 99/205 (48%), Gaps = 23/205 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P    + +  S  +C +++  A+ RL  S + +      E+    RTS G      E   
Sbjct: 90  PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRL 196
            +++ IE +IA    +P  HGE   +L Y+ G +Y  H+D FNP   G         QR 
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ ++YL+  + GG T FP                IGL+V P +G+ + F  L P+G +D
Sbjct: 207 ATLVIYLNTPQAGGATAFP---------------RIGLEVAPVKGNAVYFSYLQPDGKLD 251

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
             +LH   PV  GEKW+ATKW+R+ 
Sbjct: 252 ERTLHAGLPVQSGEKWIATKWLREH 276


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score =  110 bits (275), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 75/227 (33%), Positives = 109/227 (48%), Gaps = 40/227 (17%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
            GD  +  +    L   PR + F    S ++C  ++A A+ RL      LR  ETV+++ 
Sbjct: 23  AGDREVHVLATMAL---PRVVVFGGLLSEQECDELVALAQPRL------LRS-ETVDNST 72

Query: 125 G------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
           G       RTS G F    E  T ++E IE +IA     P   GE   VL Y  G +Y  
Sbjct: 73  GGSEVNAARTSDGMFFERGE--TPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQYKP 130

Query: 179 HYDAFNPAEYGP-----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
           H+D F+PA  G      +  QR+ + ++YL+    GG T FP                +G
Sbjct: 131 HHDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFP---------------EVG 175

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           L+V+P +G+ + F    P  +    +LHG  PV+ GEKWVATKW+R+
Sbjct: 176 LEVQPIKGNAVFFSYERPLAST--RTLHGGAPVLDGEKWVATKWLRE 220


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 99/203 (48%), Gaps = 23/203 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      +  S  +C  +I   ++R++ S + +      E     R S G F++ S D  
Sbjct: 91  PVVALLADVLSPRECDRLIEIGRERVRRSSV-VDPDSGGEVLIDARKSEGAFVNGSTDP- 148

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            ++  I+ +IA     P  +GE  ++LRY  G +Y  H+D F   + G +       QR+
Sbjct: 149 -LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRI 207

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ +LYL+ VEEGG+T FP                IGL + PRRG  L F  +   G  D
Sbjct: 208 ATLILYLNQVEEGGDTTFPD---------------IGLTIHPRRGAALYFEYVNALGQTD 252

Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
             +LH   PV +GEKW+ATKW+R
Sbjct: 253 PRTLHAGMPVERGEKWIATKWMR 275


>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 286

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 72/211 (34%), Positives = 101/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS G  + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDGMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 280


>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
 gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
          Length = 300

 Score =  110 bits (274), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 66/217 (30%), Positives = 111/217 (51%), Gaps = 24/217 (11%)

Query: 73  IPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTS 129
           +P+ +  +S  P  + + N  S  + +S+ A A K+L+P+ +         + +G TR +
Sbjct: 93  MPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIA 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              FI   ++++ +   I  ++   T L     E   V+ Y I  +Y  HYD F PA+ G
Sbjct: 153 KMAFIL--DEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSG 209

Query: 190 PQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
            +      RLA+ +LYLSDVE GG T+F                 I ++V PR+G+ +++
Sbjct: 210 DRSHPSHDRLATAILYLSDVERGGATVF---------------TNINVRVLPRKGNVIIW 254

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           Y+  P+G +   +LH  CPV+ G KW+A KWI+ + Q
Sbjct: 255 YNYLPDGNLHPGTLHAGCPVLVGSKWIANKWIQSKGQ 291


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P+ +   N  S E+C +IIA    R   S +      +    +G RTS   FI   E +
Sbjct: 82  QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEG-RTSEMAFIQRGEAE 140

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
             + E IE ++A     P    E F + +Y+  Q+Y  HYD  +P   G      +  QR
Sbjct: 141 --VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQR 198

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           LA+F+LYLSDVE+GG T+FP                +GL+V P++G  L F +   N   
Sbjct: 199 LATFILYLSDVEQGGGTVFP---------------GLGLEVYPKKGSALWFLNTDINHQP 243

Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
           D+ +LHG  PV++G K +A KW+R
Sbjct: 244 DKRTLHGGAPVVRGTKIIANKWLR 267


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/227 (29%), Positives = 111/227 (48%), Gaps = 23/227 (10%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP G    A+ + +   +   +P         S E+C  +I  A  +L+ S + +     
Sbjct: 73  LPPGNRIIAADREVQVALRVEQPVLAVLDGVLSHEECDELIRRAAAKLQRSTI-VDPTTG 131

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
              T   R+S GTF   + D    +  ++ +I+    LP  HGE   +L Y  G +Y  H
Sbjct: 132 KHETIADRSSEGTFFEINADD--FIARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPH 189

Query: 180 YDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
           +D F P + G   QM+   QR+++ ++YL++VE+GG T+FP                +GL
Sbjct: 190 FDFFPPGDPGSAVQMATGGQRVSTLVMYLNEVEDGGATIFP---------------ELGL 234

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            V P++G  + F      G +D  +LHG  PV++GEKW+ TKW+R +
Sbjct: 235 SVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRGEKWIVTKWMRQR 281


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/209 (34%), Positives = 103/209 (49%), Gaps = 37/209 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR + F  F S ++C +++A A+ RL  S       ETV++  G       RTS G F  
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARS-------ETVDNDTGGSEVNEARTSQGMFFM 152

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 192
             E +  ++  IE +IA     P  +GE   VL Y  G +Y  HYD F+PA+ G P +  
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+ + ++YL+  E GG T FP  N               L+V P +G+ + F   +
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFPDVN---------------LEVAPIKGNAVFFS--Y 253

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
                   SLHG  PV+ GEKWVATKW+R
Sbjct: 254 ERAHPSTRSLHGGAPVLAGEKWVATKWLR 282


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S E+C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGED- 167

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
              +E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 168 -AFIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 GL V  ++G  + F  +     +
Sbjct: 227 VATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQRQL 271

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV  G+KW+ TKW+R++
Sbjct: 272 DPLTLHGGAPVHAGDKWIMTKWMRER 297


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 114/235 (48%), Gaps = 43/235 (18%)

Query: 71  QSIPFQVLSWRPR------------ALYFPNFA------SAEQCQSIIATAKKRLKPSQL 112
           Q+ P +++S  PR             +  PN A      S E+C  +I  ++ ++K SQ+
Sbjct: 92  QASPIRLISQLPRFTVADREVELAAVMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQV 151

Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
             R+ G + ES+   R S G+     E++  ++  IE +++    LP   GE   +L Y 
Sbjct: 152 VDRESGGSYESS--VRKSEGSHFERGENE--LVRRIEARLSALVDLPVNRGEPLQILHYG 207

Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGY 226
            G +Y +H D F P + G  +      QR+ + ++YL+DV EGGET FP           
Sbjct: 208 PGGEYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFP----------- 256

Query: 227 DYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
                IG   KP +G  + F     +G +D   LH   PVI+G+KW+ TKW+R++
Sbjct: 257 ----DIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKWIMTKWLRER 307


>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
 gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
          Length = 284

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 102/198 (51%), Gaps = 28/198 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  SA +C  +IA A+ RL+ +     +G   +     RTS G F +   D+  ++  IE
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTL--DEVPLVGRIE 157

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
            ++A    +P +HGE   +L Y  GQ Y+ H+D F+P + G +       QR+AS ++YL
Sbjct: 158 RRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYL 217

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +    GG T FP                +GL V  RRG  + F   +  G  D +SLH  
Sbjct: 218 NTPARGGGTAFP---------------ALGLTVTARRGAAVYF--AYEGG--DCSSLHAG 258

Query: 264 CPVIKGEKWVATKWIRDQ 281
            PV++GEKW+ATKW+R++
Sbjct: 259 LPVLEGEKWIATKWLRER 276


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S E+C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGED- 167

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
              +E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 168 -AFIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 GL V  ++G  + F  +     +
Sbjct: 227 VATLVVYLNDVADGGETIFP---------------AAGLSVAAKQGGAVYFRYMNGQRQL 271

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV  G+KW+ TKW+R++
Sbjct: 272 DPLTLHGGAPVRAGDKWIMTKWMRER 297


>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
 gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
          Length = 284

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 103/198 (52%), Gaps = 28/198 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  +AE+C+ +IA A+ RLK +      G      +  RTS G F + +E    ++  IE
Sbjct: 102 NLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR--RTSEGMFFTLNE--LPLVGRIE 157

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
            ++A    +P +HGE   +L Y  GQ+Y+ H+D F+P + G         QR+AS ++YL
Sbjct: 158 QRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASVVMYL 217

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +   +GG T FP                +GL V  RRG  + F   +  G  D+ SLH  
Sbjct: 218 NTPAQGGGTAFP---------------ELGLTVTARRGAAVYF--AYEGG--DQQSLHAG 258

Query: 264 CPVIKGEKWVATKWIRDQ 281
            PV +GEKW+ATKW+R++
Sbjct: 259 LPVQRGEKWIATKWLRER 276


>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
          Length = 190

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 96/190 (50%), Gaps = 20/190 (10%)

Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
           S +A    E        RTSS  ++S + D   ++  I  ++A    LP    E   VL 
Sbjct: 4   STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61

Query: 170 YEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSG 225
           Y   Q Y +H+D F+P  Y   ++     R  +   YLSDVEEGGET+FPF NG      
Sbjct: 62  YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRV- 120

Query: 226 YDYKKCI-GLKVKPRRGDGLLFYSLF---------PNG---TIDRTSLHGSCPVIKGEKW 272
            D+  C  GLKVKP+ G+ ++FYS+          P+     +D  SLHG C VIKG+KW
Sbjct: 121 TDFADCSRGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDKW 180

Query: 273 VATKWIRDQE 282
            A  WI +++
Sbjct: 181 AANYWIANKK 190


>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
 gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
          Length = 295

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 73/212 (34%), Positives = 107/212 (50%), Gaps = 26/212 (12%)

Query: 77  VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +LS R PR + F    S E+C +++  A+ RL  S+  +  G         RTS G F  
Sbjct: 102 LLSMRNPRVMVFGGLLSDEECDAMVDLARPRLARSE-TVHNGSGGSEVNAARTSDGMFFD 160

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 192
             E    +   IE +IA     P  +GE   VLRY  G +Y +H+D F+PA+ G P +  
Sbjct: 161 RGEFP--LCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+ + ++YL+    GG T FP                +GL+V P +G+  +F+S  
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP---------------DVGLEVAPFKGNA-VFFSYD 262

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
               + RT LH   PV++GEKWVATKW+R+ E
Sbjct: 263 RAHPMTRT-LHAGTPVLEGEKWVATKWVREGE 293


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 109/212 (51%), Gaps = 25/212 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           +VL+  P    + + AS  +   +I  AK R+  S+  +R  GE        RTS   ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRI--SRATIRDDGEP--QVSNARTSQNAWL 358

Query: 135 SASEDKTGILELIEHKIARATM-LPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEY-GPQ 191
            A +D+  ++  ++ ++   T  L Q   E   V  Y +G  Y +H+D A     Y G +
Sbjct: 359 DAGDDR--VVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLR 416

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
           +  R+A+ + YLSDVE GG T+FP                +GL V PR+G  +L+Y+L+ 
Sbjct: 417 VGNRIATVMFYLSDVEIGGATVFP---------------QLGLAVFPRKGSAILWYNLYR 461

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           NG  DR +LH +CPV+ G KWVA +WI +  Q
Sbjct: 462 NGKGDRRTLHAACPVLSGSKWVANQWIHEYHQ 493


>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
           campestris pv. vesicatoria str. 85-10]
          Length = 296

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/211 (33%), Positives = 101/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S E+C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 159 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 216

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ G+KWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 290


>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
 gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
          Length = 212

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 97/200 (48%), Gaps = 23/200 (11%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           ++F    S E+C  +IA      KPS++     +    T G R+   T  S S DK  I+
Sbjct: 15  VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRS---TVASPSADKYPII 71

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLL 201
           + +  +I+    + + + E   VL Y  G +YD HYD+F   E  PQ+     R+ + LL
Sbjct: 72  KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YL+DVE+GG T FP                I   + P  G G+LF +        R SLH
Sbjct: 130 YLNDVEQGGWTQFPH---------------IMANIVPNVGTGILFRNTDAQNLQLRESLH 174

Query: 262 GSCPVIKGEKWVATKWIRDQ 281
              PVI GEKW+A+ WIR++
Sbjct: 175 AGLPVIDGEKWIASIWIREK 194


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 103/206 (50%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S ++C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTI-VDPATGREDVIRNRTSEGIWYQRGED- 180

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
             ++E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 181 -ALIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQR 239

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 GL V  ++G  + F  +     +
Sbjct: 240 VATLVVYLNDVPDGGETIFPEA---------------GLSVAAQQGGAVYFRYMNGRRQL 284

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV+ G+KW+ TKW+R++
Sbjct: 285 DPLTLHGGAPVLSGDKWIMTKWVRER 310


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/216 (30%), Positives = 102/216 (47%), Gaps = 20/216 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ I  ++ S +PR + + N  + E+ ++    A+ RL+ S +        E TK  R +
Sbjct: 330 IQPIKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIA 388

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              F+  SE    +   +  +I   T L  T  E   V  Y IG  Y+ HYD     E  
Sbjct: 389 KAAFLQNSEHDHIVK--MTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQ 446

Query: 190 PQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
                  R+A+++ Y+SDVE GG T+FP                I L + P++G    ++
Sbjct: 447 KDFGWGNRIATWMFYMSDVEAGGATVFP---------------QINLALWPQKGSAAFWF 491

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L PNG  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 492 NLHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQ 527


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 101/206 (49%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S E+C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEDVIRNRTSEGIWYQRGEDA 175

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
              +E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 176 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQR 233

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 GL V  ++G  + F  +     +
Sbjct: 234 VATLVIYLNDVPDGGETIFPEA---------------GLSVAAKQGGAVYFRYMNGQRQL 278

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV  G+KW+ TKW+R++
Sbjct: 279 DPLTLHGGAPVRAGDKWIMTKWMRER 304


>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
 gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
          Length = 284

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/198 (33%), Positives = 104/198 (52%), Gaps = 28/198 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  S ++C+ +IA A+ RL+ +     +G   +     RTS G F + +E    ++  IE
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTLNE--VPLVGRIE 157

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS---QRLASFLLYL 203
            ++A    +P +HGE   +L Y  GQ+Y+ H+D F+P +  YG   +   QR+AS ++YL
Sbjct: 158 QRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYL 217

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +    GG T FP                +GL V  RRG  + F   +  G  D +SLH  
Sbjct: 218 NTPARGGGTAFP---------------ELGLTVTARRGSAVYFA--YEGG--DPSSLHAG 258

Query: 264 CPVIKGEKWVATKWIRDQ 281
            PV+ GEKW+ATKW+R++
Sbjct: 259 LPVLDGEKWIATKWLRER 276


>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 216

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 26  PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDSMCLR 78

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 79  VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 136

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 137 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 181

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 182 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 210


>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
 gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/208 (32%), Positives = 106/208 (50%), Gaps = 19/208 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            Q +SW+PRA+ + NF S ++ + II  A +++K S +   + E V      RTS GTF+
Sbjct: 41  IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV--VDDIRTSYGTFL 98

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
             ++D   ++  IE ++A  + +P +H E   VLRY    KY  H D            +
Sbjct: 99  RRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHIDGL----------E 146

Query: 195 RLASFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCIG-LKVKPRRGDGLLFYSLFP 251
           R+A+ L+YL   E  G  + P      ++ +         G +  KP+RGD L+F+ + P
Sbjct: 147 RVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYKPKRGDALMFFDVKP 205

Query: 252 N-GTIDRTSLHGSCPVIKGEKWVATKWI 278
           +  T D  S+H  CPV+ G KW A KWI
Sbjct: 206 DYTTTDGHSMHTGCPVVAGVKWNAVKWI 233


>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 294

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 119/268 (44%), Gaps = 29/268 (10%)

Query: 14  LPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI 73
           +P A+ +      +   G T LS D P I  +L  ++   K++   LP  +        +
Sbjct: 36  VPMAVRIVDAVLRSEGEGGT-LSYDPPHITRRLPAIDTSAKQSVLHLPGADA------VV 88

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
            F+ L+  PR +   NF S+E+C  +   A+    P+ +     + V +       S   
Sbjct: 89  TFEQLA--PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQL 146

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
            +A  +   ++  +E +I R T  P    E   + RY  GQ Y  HYD F       Q  
Sbjct: 147 PAAGSE---LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGG 203

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           QRLA+ +LYL   E GG T F                 +G+++ PR+G  L F   +P+ 
Sbjct: 204 QRLATLILYLRAPEAGGATYF---------------ANLGMRIAPRKGSALFF--TYPDP 246

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
             +  +LHG   V+ GEKW+AT+W RD+
Sbjct: 247 GNNSGTLHGGEAVLAGEKWIATQWFRDR 274


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 102/201 (50%), Gaps = 35/201 (17%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILE 145
           SAE+C+++IA A+ RL PS        +V+   G       R+S G F    E+    + 
Sbjct: 109 SAEECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVA 159

Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFL 200
            ++ +++    LP  +GE   VL Y  G +   H+D   P+    Q S     QR+++ +
Sbjct: 160 RLDERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLV 219

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
            YL++VEEGGET+FP       ++G+         V P+RG  + F      G +D  SL
Sbjct: 220 AYLNEVEEGGETVFP-------ETGW--------SVSPQRGGAVYFEYCNSLGQVDHASL 264

Query: 261 HGSCPVIKGEKWVATKWIRDQ 281
           H   PV+ GEKWVATKW+R +
Sbjct: 265 HAGAPVLSGEKWVATKWMRQR 285


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 107/206 (51%), Gaps = 25/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           P  +   NF +AE+C  +IA A+ +++ + +     GE V+     RTS     + +E  
Sbjct: 91  PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QR 195
             ++  +E +IA A   P  +GE   VLRY  G +Y +H+D F+    G + +     QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + +FL+YL DV+ GG T FP                +  +++P++G  L F +  PNG  
Sbjct: 207 VGTFLVYLCDVDAGGATRFP---------------ALNFEIRPKKGMALFFANTLPNGEG 251

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +  +LH   PV+ G K++A+KW+R++
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREK 277


>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 285

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 75/230 (32%), Positives = 110/230 (47%), Gaps = 31/230 (13%)

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           PH   G+ S++ +   + +  P    F    S ++C ++I  AK RL+ ++     G   
Sbjct: 77  PHAVIGERSVRVM---LAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDG--A 131

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
           +     RTS G F    E    ++E IE +IA    +P  HGE   VL Y  GQ+Y+ H 
Sbjct: 132 QQIDEHRTSDGMFFGLGEQP--LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQ 189

Query: 181 DAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
           D F+P + G         QR+AS ++YL+  + GG T FP                IGL 
Sbjct: 190 DWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFPE---------------IGLT 234

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           V   RG  + F   + +G  D  SLH   PV +GEKW+ATKW+R++   E
Sbjct: 235 VTALRGSAVCF--TYESG--DVFSLHAGLPVTRGEKWIATKWLRERPYRE 280


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 99/199 (49%), Gaps = 25/199 (12%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
           N   A +C+++I  AK RL PS L     G  V S K  R S G F    E+   ++  +
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDK--RASWGMFFRLCEND--LVARL 162

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
           + +++    LP  +GE  ++L Y  G   + H+D   P     + S     QR+++ + Y
Sbjct: 163 DRRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTY 222

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           L+D  EGG+T+FP                +GL V P RG+   F     NG +D  SLH 
Sbjct: 223 LNDAPEGGQTVFPQ---------------LGLAVSPIRGNACYFEYCDGNGRVDARSLHA 267

Query: 263 SCPVIKGEKWVATKWIRDQ 281
           S PV +G+KWV TKW+R++
Sbjct: 268 SAPVTRGDKWVMTKWMRER 286


>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 244

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 57/143 (39%), Positives = 90/143 (62%), Gaps = 3/143 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175

Query: 195 RLASFLLYLSDVEEGGETMFPFE 217
           R+A+ L+YL+D   GGET FP E
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQE 198


>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
 gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
          Length = 280

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 108/205 (52%), Gaps = 25/205 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           R + +   NF +A++C++++A  K +L+PS++  R+G+     KG RTSS   +  ++D 
Sbjct: 84  RVQMIKIDNFLTAQECEALVALTKSKLRPSEIPEREGDQY---KGFRTSSTCDLPFTKDP 140

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQR 195
             +   I+ KI  A  L     E      Y IGQ++ +H D F P       Y     QR
Sbjct: 141 --LAHEIDQKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQR 198

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
             +F++YL+++ EGGET F                 +G+K KP++G  L++ +L  +G+I
Sbjct: 199 TWTFMIYLNELCEGGETEFV---------------KLGIKFKPKQGTALVWNNLHEDGSI 243

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRD 280
           +  +LH + P+  GEK V TKW R+
Sbjct: 244 NEDTLHHAHPIESGEKVVITKWFRE 268


>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
 gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
          Length = 286

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 73/226 (32%), Positives = 108/226 (47%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      S ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 81  DLGDRQVQVLVSLML---PRVVVLGGLLSDDECDALIALARPQLARSR-TVDNRDGSEIV 136

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F
Sbjct: 137 HAARTSHSMALQPGQD--ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 194

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P   G  +      QR+AS ++YL+  E GG T FP                + L V  
Sbjct: 195 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAA 239

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +G+ + F    P+  + RT LH   PV+ GEKWVATKW+R++  H
Sbjct: 240 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 283


>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 306

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 168

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 169 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 226

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 227 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 271

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 272 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 300


>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 286

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 280


>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
          Length = 322

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/228 (31%), Positives = 111/228 (48%), Gaps = 31/228 (13%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVEST 123
           G  + +S   + +S  PR     N  + E+C  +++ A ++   + L    G  + VEST
Sbjct: 66  GGMAKKSTWIETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVEST 125

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
             TRT+   ++   +D   +++ +E KIA+ T      GE   VL Y   Q++  H+D F
Sbjct: 126 --TRTNKQAWLDFQQDD--VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYF 181

Query: 184 NPAEYGPQM----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPR 239
           +PA   P+       RL + ++YL   EEGGET F   N               LK+   
Sbjct: 182 DPATDPPENYEKGGNRLITVIVYLQAAEEGGETHFGAAN---------------LKLTAA 226

Query: 240 RGDGLLFYSL------FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +GD ++FY+L           +D+ +LH   P IKGEKWVATKWI ++
Sbjct: 227 KGDAVMFYNLKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHER 274


>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
 gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
          Length = 286

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S E+C ++IA A+  L  S+       TV++  G       RTS    + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALARPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ G+KWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 280


>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 207

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 85/145 (58%), Gaps = 5/145 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           + +SW PR   +  F S  +C  ++  AKK+++ S +A  + G++V+S    RTSSG F+
Sbjct: 46  KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSE--VRTSSGMFL 103

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   ++  IE +IA  T LPQ + E   VLRYE GQKY+ H+D F+      +   
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161

Query: 195 RLASFLLYLSDVEEGGETMFPFENG 219
           R A+ L+YLS V EGGET+FP   G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186


>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 248

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 23/193 (11%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
           + E CQ++IA  +  L+P+ +   Q    E   G R S   +     D   IL+ +   I
Sbjct: 73  TPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDYPILQSLAEGI 129

Query: 152 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYLSDVEE 208
           A+ T +P    E   +L Y  G +Y  HYDAF  A   P + Q   R A+ +LYL+ VEE
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQATLILYLNAVEE 187

Query: 209 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
           GGET FP                +GL+V P  G G+ F +L   G     SLH   PV K
Sbjct: 188 GGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRK 232

Query: 269 GEKWVATKWIRDQ 281
           GEKW+AT+WIR +
Sbjct: 233 GEKWIATQWIRQE 245


>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 286

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S E+C ++IA A+  L  S+       TV++  G       RTS    + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALAQPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 251

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ G+KWVATKW+R++
Sbjct: 252 PHPMT--RSLHAGAPVLAGDKWVATKWLRER 280


>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 409

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 82/282 (29%), Positives = 120/282 (42%), Gaps = 44/282 (15%)

Query: 38  DVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQ 97
           +VP       T E    ++ S    G T D  +     + LS  PRA  F  F + E+C 
Sbjct: 46  EVPRADATTTTREAPRADDASASSLGPTRDIGVGDARVEKLSDSPRAYLFREFLTKEECA 105

Query: 98  SIIATAKKRLKPSQLALRQ---GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
            +I  +   LK S +       GE        RTS+G F+    D   ++  +E ++   
Sbjct: 106 HLIEISTPHLKRSTVVGDDALLGEADGRRSDYRTSTGAFLPKLYDD--VVTRVERRVEAF 163

Query: 155 TMLP---QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGE 211
           + LP   Q   +A ++LRYE+GQ+Y  H D F     G    +R+A+ L++L++ EEGGE
Sbjct: 164 SRLPFENQEQLQARSLLRYELGQEYRDHVDGFATENGG----KRVATVLMFLAEPEEGGE 219

Query: 212 TMFP---------------FENGIFLDSGYDYKKCI----------GLKVKPRRGDGLLF 246
           T FP                  G   D  +                G  VKPR GD +LF
Sbjct: 220 TAFPNGEPSEAVAARVAAQRARGELSDCAWRGGGGGTAGGGRGNLRGFAVKPRLGDAVLF 279

Query: 247 YSLFPNGT-------IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +S   +         +   S H SCP  +G KW ATKWI ++
Sbjct: 280 FSYDADDDGGYDGAEVSHASTHASCPTTRGVKWTATKWIHER 321


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 104/208 (50%), Gaps = 23/208 (11%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  +VL   P  + F +  S+ +   +   A+  L+ S + ++    V+     R S+GT
Sbjct: 310 IKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERS-MVVKYRANVQGKH--RISAGT 366

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    +   +   IE +IA    L     E F V+ Y IG +Y +H+D F         
Sbjct: 367 WVERKYN--NLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVE--- 421

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             RLA+ L Y++DVE+GG T+FP                +G  V+ +RG+ L +Y++  N
Sbjct: 422 DNRLATVLFYMNDVEQGGATVFP---------------RLGQTVRAKRGNALFWYNMQHN 466

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           GT+D  +LHG CP++ G KW+ T+WI D
Sbjct: 467 GTVDDRTLHGGCPILVGSKWIFTQWISD 494


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++  +S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +L+ H +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YLSDVE GG T FPF               + L V P RG  L +Y+
Sbjct: 434 GDLHGNRIATAIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 110/212 (51%), Gaps = 22/212 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  +  +P+ + F +  S  + + +   AK  L+ + +A +Q    E +K   + S 
Sbjct: 319 PFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSS 378

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            F    ++    +  I  ++A  T L     E   V+ Y +G +YD H+D F+  +   +
Sbjct: 379 WF---PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKL--K 433

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A+ L Y+SDV  GG T+FP                +G+ ++ R+G    +Y+L  
Sbjct: 434 EVNRIATVLFYMSDVSIGGATVFP---------------KLGVTLEARKGTAAFWYNLHS 478

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G +D ++LHG+CPV+ GEKWVA KWIR++ Q
Sbjct: 479 SGELDYSTLHGACPVLIGEKWVANKWIRERGQ 510


>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
 gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
          Length = 303

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 106/204 (51%), Gaps = 21/204 (10%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LSW PR   +  F S  +C  +I+ A  +        +Q   V        S G  I   
Sbjct: 62  LSWHPRVFLYEGFLSDMECDHLISMAHGK--------KQSSLVVGGSAGNNSQGASI--- 110

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   I+  IE +I+  + LP+  GE+  +L+YE+ +   ++Y++ + + +      RL 
Sbjct: 111 EDT--IVSTIEDRISVWSFLPKDFGESMQILKYEVNKSDYNNYESQSSSGH-----DRLV 163

Query: 198 SFLLYLSDVEEGGETMFPFE--NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           + L+YLSDV+ GGET FP     G  ++      +C G  V+P RG+ +L ++L P+G I
Sbjct: 164 TVLMYLSDVKRGGETAFPRSELKGTKVELAAP-SECAGYAVQPVRGNAILLFNLKPDGVI 222

Query: 256 DRTSLHGSCPVIKGEKWVATKWIR 279
           D+ S +  C V++GE+W+A K I 
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHIH 246


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/197 (32%), Positives = 98/197 (49%), Gaps = 35/197 (17%)

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
           E+C  +I  +  +L+ S        TV+   G       R+S GTF   + D    +  +
Sbjct: 109 EECDELIRRSADKLQRST-------TVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
           + +IA     P  +GE   VL Y  G +Y  H+D F+P + G +       QR+++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           L+DV +GG T+FP                +GL+V PR+G  + F     +G +D  +LHG
Sbjct: 220 LNDVAQGGATVFP---------------TLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHG 264

Query: 263 SCPVIKGEKWVATKWIR 279
             PV KGEKW+ TKW+R
Sbjct: 265 GEPVEKGEKWIITKWMR 281


>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
           [Brachypodium distachyon]
          Length = 314

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 104/202 (51%), Gaps = 13/202 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           L+W PR   +  F S  +C  ++  A+  ++ S L       +  T+ +  +   F   +
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI--TQNSTDARFKF-QLA 119

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           + K  ++  IE +I+  + +P+ HGE+  +L+Y   Q  D + D    +  G     RL 
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 174

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYD---YKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           + L+YLSDV++GGET+FP       D+        +C G  VKP +GD +L ++L P+G 
Sbjct: 175 TILMYLSDVKQGGETVFPRSE--LKDTQAKEGALSECAGYAVKPVKGDAILLFNLRPDGV 232

Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
            D  S +  C V++GEKW+A K
Sbjct: 233 TDSDSHYEDCSVLEGEKWLAIK 254


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 98/199 (49%), Gaps = 35/199 (17%)

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
           ++C+ +IA A+ RL PS        TV+   G       R+S G F    E+    +  +
Sbjct: 111 QECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIARL 161

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
           + +++    LP  +GE   VL Y  G +   H+D   P+    + S     QR+++ + Y
Sbjct: 162 DQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSY 221

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           L++VEEGGET+FP              +C G  V PRRG  + F      G +D  SLH 
Sbjct: 222 LNEVEEGGETIFP--------------EC-GWSVPPRRGSAVYFEYCNSLGQVDHASLHA 266

Query: 263 SCPVIKGEKWVATKWIRDQ 281
             PV+ GEKWVATKW+R +
Sbjct: 267 GGPVLHGEKWVATKWMRQR 285


>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
 gi|194704960|gb|ACF86564.1| unknown [Zea mays]
 gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
          Length = 207

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/145 (40%), Positives = 84/145 (57%), Gaps = 5/145 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           + +SW PR   +  F S  +C  ++  AKK+ + S +A  + G++V+S    RTSSG F+
Sbjct: 46  KAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSE--VRTSSGMFL 103

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   ++  IE +IA  T LPQ + E   VLRYE GQKY+ H+D F+      +   
Sbjct: 104 DKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGH 161

Query: 195 RLASFLLYLSDVEEGGETMFPFENG 219
           R A+ L+YLS V EGGET+FP   G
Sbjct: 162 RYATVLMYLSTVREGGETVFPNAKG 186


>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
          Length = 190

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 5/136 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGET 212
           A+ L+YLS+VE+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 101/206 (49%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  S ++C  +I  ++ RLK S   +      E     RTS G +    ED 
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
              +E ++ +I+     P  +GE   +LRY    +Y  H+D F P + G      Q  QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQR 218

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 G+ V   +G  + F  +     +
Sbjct: 219 VATLVIYLNDVPDGGETIFPEA---------------GMSVAASQGGAVYFRYMNGRRQL 263

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV+ G+KW+ TKW+R++
Sbjct: 264 DPLTLHGGAPVLSGDKWIMTKWMRER 289


>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 548

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 78/243 (32%), Positives = 112/243 (46%), Gaps = 44/243 (18%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TR 127
           +  + LS  PR     NF   E+  SII  A   L  +Q A R   +   TKG     TR
Sbjct: 205 VVLETLSHSPRVFSLYNFMDMEEADSIIEDA---LGMTQEAYRLKRSSTGTKGKAISKTR 261

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTH---GEAFNVLRYEIGQKYDSHYDAFN 184
           TS   F++     T   + ++ +I +   + + H    +   VLRY   Q Y +H+D   
Sbjct: 262 TSDNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLE 317

Query: 185 PAEYGPQMSQ-----RLASFLLYLSDVEEGGETMFPFENGI-----------------FL 222
            AE     S+     R A+ +LY +DV EGGET+F    GI                  L
Sbjct: 318 SAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENL 377

Query: 223 D---SGYDYKKCIGLK----VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVAT 275
           D   SG++ K  +  +    V P+RG  +LFY+  P+G  D +S HG+CPVI G+KW A 
Sbjct: 378 DLPRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAAN 437

Query: 276 KWI 278
            W+
Sbjct: 438 LWV 440


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 101/206 (49%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  S ++C  +I  ++ RLK S   +      E     RTS G +    ED 
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRS-TTVNPETGKEDVIRNRTSEGIWYQRGED- 162

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
              +E ++ +I+     P  +GE   +L Y    +Y  H+D F P + G      Q  QR
Sbjct: 163 -AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 221

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 G+ V  R+G  + F  +     +
Sbjct: 222 VATLVIYLNDVPDGGETIFPEA---------------GISVAARQGGAVYFRYMNGQRQL 266

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV+ G+KW+ TKW+R++
Sbjct: 267 DPLTLHGGAPVLGGDKWIMTKWMRER 292


>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 196

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 95/198 (47%), Gaps = 23/198 (11%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +    + E CQ++IA  +  L+P+ +   Q    E   G R S   +     D   IL+ 
Sbjct: 16  WAGLLTPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDHPILQS 72

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYL 203
           +   IA+ T +P    E   +L Y  G +Y  HYDAF  A   P + Q   R  + +LYL
Sbjct: 73  LAEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQGTLILYL 130

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           + VEEGGET FP                +GL+V P  G G+ F +L   G     SLH  
Sbjct: 131 NAVEEGGETAFPE---------------LGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAG 175

Query: 264 CPVIKGEKWVATKWIRDQ 281
            PV KGEKW+AT+WIR +
Sbjct: 176 LPVRKGEKWIATQWIRQE 193


>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 296

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 99/211 (46%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           P  +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 290


>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 288

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 71/226 (31%), Positives = 108/226 (47%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 83  DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P   G  +      QR+AS ++YL+  E GG T FP                + L V  
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP---------------DVHLDVAA 241

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +G+ + F    P+  + RT LH   PV+ GEKWVATKW+R++  H
Sbjct: 242 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 285


>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 296

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 99/211 (46%), Gaps = 37/211 (17%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           P  +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QR+AS ++YL+  E GG T FP  +               L V   +G+ + F    
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVAAVKGNAVFFSYDR 261

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 262 PHPMT--RSLHAGAPVLAGEKWVATKWLRER 290


>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 308

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 71/226 (31%), Positives = 108/226 (47%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P   G  +      QR+AS ++YL+  E GG T FP                + L V  
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPD---------------VHLDVAA 261

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +G+ + F    P+  + RT LH   PV+ GEKWVATKW+R++  H
Sbjct: 262 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 305


>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
 gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
          Length = 535

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 104/215 (48%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       A+  +S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  ++   +L+ H +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YLSDVE GG T FPF               + L V P +G  L +Y+
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 107/207 (51%), Gaps = 34/207 (16%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASE 138
           R R   F  FAS E+C+ +    K+RL+ + +A   G  + VE     R S+  ++    
Sbjct: 337 RQRLQVFRQFASPEECRHLQHAGKRRLERA-VAWTDGRFQPVE----FRISTAAWLQPDH 391

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----AFNPAEYGPQMSQ 194
           D   I++ I  +I  AT +   + EA  +  Y +G  Y+ H+D      NP        +
Sbjct: 392 D--AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDHSSRGTNPD------GE 443

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           RLA+F++YL+ V++GG T FP                +G  V+P  GD + +Y+L P+G 
Sbjct: 444 RLATFMIYLNPVKQGGFTAFPR---------------LGAAVQPGYGDAVFWYNLQPSGV 488

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  +LHG+CPV++G KWVA KWI ++
Sbjct: 489 GDPLTLHGACPVLRGSKWVANKWIHER 515


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 100/206 (48%), Gaps = 23/206 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  S ++C  +I  ++ RLK S   +      E     RTS G +    ED 
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
              +E ++ +I+     P  +GE   +L Y    +Y  H+D F P + G      Q  QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 218

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ ++YL+DV +GGET+FP                 G+ V   +G  + F  +     +
Sbjct: 219 VATLVIYLNDVPDGGETIFPEA---------------GMSVAASQGGAVYFRYMNDRRQL 263

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQ 281
           D  +LHG  PV+ G+KW+ TKW+R++
Sbjct: 264 DPLTLHGGAPVLAGDKWIMTKWMRER 289


>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
 gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
           adhaerens]
          Length = 495

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 102/208 (49%), Gaps = 21/208 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSI--IATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           IP + +S  P  + + +  +  Q ++I  I+ +K    P+   L  G   E+T+ +    
Sbjct: 290 IPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVSIFCC 349

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
            T++  + D   ++E I       T L   + E   V  Y IG  Y  HYD+   A   P
Sbjct: 350 STWLEDAYDP--VVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDP 407

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
              QRLA+ + YLS+VE GG T+FP                +G+ V+P++G  L + +L 
Sbjct: 408 L--QRLATMMFYLSNVEIGGATIFPR---------------LGVAVRPQKGSALFWINLK 450

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
            NG  +R +LH +CPV+ G KW+A KWI
Sbjct: 451 RNGLTNRQTLHAACPVVIGSKWIANKWI 478


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 25/205 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +    F S  +C ++IA A+ RL  S+  +            RTS    +   +D  
Sbjct: 96  PRVMVLGGFLSDAECDAMIALAQPRLARSR-TVDNANGAHVVHAARTSDSMCLQLGQD-- 152

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            + + IE +IAR    P  +GE   VLRY  G +Y  HYD F+P   G  +      QR+
Sbjct: 153 ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRV 212

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           AS ++YL+  + GG T FP                + L +   +G+ + F    P+    
Sbjct: 213 ASLVMYLNTPDRGGATRFPD---------------VHLDIAAIKGNAVFFSYDRPHPMT- 256

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQ 281
             SLH   PV+ GEKWVATKW+R++
Sbjct: 257 -RSLHAGAPVLAGEKWVATKWLRER 280


>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
 gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
          Length = 303

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 105/202 (51%), Gaps = 20/202 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LSW PR   +  F S  +C  +++  +  ++ S LA         T G R SS   I   
Sbjct: 63  LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   ++  IE +I+  + LP+ +GE+  VL+Y + +       +          + RLA
Sbjct: 111 EDI--VVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           + L+YLSDV++GGET+FP        +      +C G  V+P +G+ +L ++L P+G  D
Sbjct: 164 TILMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETD 223

Query: 257 RTSLHGSCPVIKGEKWVATKWI 278
           + S +  CPV++GEKW+A K I
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHI 245


>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
          Length = 549

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 104/202 (51%), Gaps = 20/202 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LSW PR   +  F S  +C  +++T +  +  S LA         T G R SS   I   
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMD-SSLAF--------TDGDRNSSYNNI--- 356

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   ++  IE +I+  + LP+ +GE   VL+Y + ++      +             LA
Sbjct: 357 EDI--VVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDY-KKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           + L+YLSDV++GGET+FP        +      +C G  V+P +G+ LL ++L P+G ID
Sbjct: 410 TILIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCSGYAVRPAKGNALLLFNLRPDGEID 469

Query: 257 RTSLHGSCPVIKGEKWVATKWI 278
           + S +  CPV++GEKW+A K I
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHI 491


>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
           [Brachypodium distachyon]
          Length = 303

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 100/202 (49%), Gaps = 24/202 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           L+W PR   +  F S  +C  ++  A+  ++ S L            G R      I+ +
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLV---------NAGARN-----ITQN 108

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
                ++  IE +I+  + +P+ HGE+  +L+Y   Q  D + D    +  G     RL 
Sbjct: 109 STDDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 163

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYD---YKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           + L+YLSDV++GGET+FP       D+        +C G  VKP +GD +L ++L P+G 
Sbjct: 164 TILMYLSDVKQGGETVFPRSE--LKDTQAKEGALSECAGYAVKPVKGDAILLFNLRPDGV 221

Query: 255 IDRTSLHGSCPVIKGEKWVATK 276
            D  S +  C V++GEKW+A K
Sbjct: 222 TDSDSHYEDCSVLEGEKWLAIK 243


>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
          Length = 239

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 66/208 (31%), Positives = 105/208 (50%), Gaps = 23/208 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P   +  +FA  + C+ +I  A+  L  +++  R+G    +    R +S  +++A 
Sbjct: 31  LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYE--IGQKYDSHYDAFNPAEYGPQMS-Q 194
            D+   LE +E  I      P+   E F+V+RY    G++Y +H DAF       +   Q
Sbjct: 89  GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           RL + LLYLSDV  GG T+FP                +GL V PRRG  L+F ++  + T
Sbjct: 147 RLTTALLYLSDVGAGGATVFP---------------ALGLSVAPRRGRLLVFANVADDTT 191

Query: 255 IDRTSLHGSCPVI-KGEKWVATKWIRDQ 281
           +D  ++H   P+    EKW+A KW+R++
Sbjct: 192 VDARTVHAGEPIAGDTEKWIANKWVRER 219


>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 68/228 (29%), Positives = 107/228 (46%), Gaps = 27/228 (11%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           H  T    I  +  Q ++  P  + + +  S ++  +II+ +K  +  S +     + V 
Sbjct: 296 HNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVS 355

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            T   RTSS  ++   +    ++  +  +    T L  T  E   V  Y IG  Y  HYD
Sbjct: 356 KT---RTSSNAWLD--DVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYD 410

Query: 182 AFNPAEYGPQM------SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
            +  AE G ++        R+A+ + YLSDV  GG T+FP                +GL 
Sbjct: 411 -YAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP---------------QLGLG 454

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           V P++G  + +Y+L  NGT+D  +LHG+CPV  G KWV  KWI ++ Q
Sbjct: 455 VFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIHERGQ 502


>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
 gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
          Length = 535

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       +   +S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +L+ H +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YLSDVE GG T FPF               + L V P +G  L +Y+
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 110/215 (51%), Gaps = 23/215 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++   P  + + +  S  +   +   AK +LK +++      T + +K TRT+   
Sbjct: 279 LKMELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSK-TRTAKLA 337

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           +   + ++  + E +  +I   T       E   V+ Y +G  Y  H+D FN  + GP +
Sbjct: 338 WFLDTFNQ--LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPHI 394

Query: 193 SQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           +Q    R+A+ L YL+DVE+GG T+FP           + KK     V P+RG  +++Y+
Sbjct: 395 TQINGDRIATVLFYLNDVEQGGATVFP-----------EIKKA----VFPKRGSAIMWYN 439

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L  +G  +R +LH  CPVI G KWV  KWIR++EQ
Sbjct: 440 LKDDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQ 474


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 67/205 (32%), Positives = 104/205 (50%), Gaps = 26/205 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR   F +  S  +C ++IA ++ RL+ S++   +G   E    TRTS G + +  E+  
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSG-EFVDDTRTSYGAYFNKGENS- 183

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMS--QRL 196
            ++  I+ +IA  T  P TH E   +L Y +G +Y  H+D F P + G   P  S  QR+
Sbjct: 184 -LVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRI 242

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF-YSLFPNGTI 255
           A+ ++YL+DVE GG T+FP  N               L+ +PR+G  + F Y L    +I
Sbjct: 243 ATVVMYLNDVEAGGGTIFPHLN---------------LETRPRKGGAIYFSYQLAVARSI 287

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRD 280
               +  +   I   KW+AT+W RD
Sbjct: 288 RSRCM--AARRIARRKWIATQWFRD 310


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 93/203 (45%), Gaps = 25/203 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F    S  +C  I+A A  RL  S   +            RTS G F +  E   
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLARSH-TVDTATGASEVNAARTSDGMFFTRGEHP- 159

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
            +    E +IA     P  +GE   VL Y  G +Y  HYD F+P + G P +     QR+
Sbjct: 160 -VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRV 218

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ + YL+    GG T FP                IGL+V P +G  + F    P+ +  
Sbjct: 219 ATLVTYLNTPTRGGGTTFP---------------DIGLEVTPLKGHAVFFSYDRPHPST- 262

Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
             SLHG  PV++G+KWVATKW+R
Sbjct: 263 -RSLHGGAPVLEGDKWVATKWLR 284


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 103/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + F    S E+ +++   AK RL  + +   Q   + +T   R S   ++S  E+ 
Sbjct: 342 RPRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKL-TTAHYRVSKSAWLSGYENP 400

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             I+  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 401 --IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 455

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V PR+G  + +Y+LFP+
Sbjct: 456 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFPS 499

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 500 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 530


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 31/199 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+ + + +  S  + +++   A+  L  SQ     G  V S    RTS   F+    ++ 
Sbjct: 308 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 357

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           G +  I  +IA  T L     E  +V  Y IG +Y  H+D       G ++++R A+FL+
Sbjct: 358 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 411

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           Y+SDVE GG T+F                 +G+ VKP +G  + +Y+L  NG +D  + H
Sbjct: 412 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 456

Query: 262 GSCPVIKGEKWVATKWIRD 280
             CPV+ G KWVA KWI +
Sbjct: 457 AGCPVLVGNKWVANKWIHE 475


>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
 gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 71/216 (32%), Positives = 108/216 (50%), Gaps = 21/216 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSII-ATAKKRLKPSQLALRQGETVE---STKGTRT 128
           +  +VLS  PRA    NF S  +   I+  T   +L  S  A     T +   ST+ TRT
Sbjct: 1   MTLKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRT 60

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTH-GEAFNVLRYEIGQKYDSHYDAFNP-- 185
           S  T++    +K+ I++ I  + A   ++ +    EA  ++ Y++GQ+Y +H+D  +P  
Sbjct: 61  SLNTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDI 118

Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
             EY P    R  + LLYL++  EGG T FP           + +   GL V+P+ G  +
Sbjct: 119 DNEYQPA---RYCTLLLYLNEGMEGGATQFP--------RWVNAETRNGLDVEPKIGKAV 167

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           LFYS  P+G +D  S H + PV  GEKW+   W  D
Sbjct: 168 LFYSQLPDGNMDDWSHHAAMPVRVGEKWLMNLWTWD 203


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 98/199 (49%), Gaps = 31/199 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+ + + +  S  + +++   A+  L  SQ     G  V S    RTS   F+    ++ 
Sbjct: 326 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 375

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           G +  I  +IA  T L     E  +V  Y IG +Y  H+D       G ++++R A+FL+
Sbjct: 376 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 429

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           Y+SDVE GG T+F                 +G+ VKP +G  + +Y+L  NG +D  + H
Sbjct: 430 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWYNLHKNGELDLKTKH 474

Query: 262 GSCPVIKGEKWVATKWIRD 280
             CPV+ G KWVA KWI +
Sbjct: 475 AGCPVLVGNKWVANKWIHE 493


>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 308

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 107/226 (47%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P   G  +      QR+AS ++YL+  E GG T  P                + L V  
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAA 261

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +G+ + F    P+  + RT LH   PV+ GEKWVATKW+R++  H
Sbjct: 262 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 305


>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 288

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/226 (30%), Positives = 107/226 (47%), Gaps = 28/226 (12%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 83  DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            P   G  +      QR+AS ++YL+  E GG T  P                + L V  
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPD---------------VHLDVAA 241

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
            +G+ + F    P+  + RT LH   PV+ GEKWVATKW+R++  H
Sbjct: 242 VKGNAVFFSYDRPH-PMTRT-LHAGAPVLAGEKWVATKWLRERPLH 285


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/222 (28%), Positives = 104/222 (46%), Gaps = 30/222 (13%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ I  +    +P  + + +  S ++ +++   AK R K +   +R  +T E        
Sbjct: 324 IQPIKMEEALLKPMIVVYHDVMSDDEIETVKKMAKPRFKRA--TIRNSKTGELEPANYRI 381

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
           S +    SE+   IL+ +  ++   T L  +  E   V+ Y IG  Y+ H+D        
Sbjct: 382 SKSAWLKSEEHDHILK-VTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTE 440

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           AF    +G     R+A++L Y+SDVE GG T+FP                 G  V PR+G
Sbjct: 441 AFKELGWG----NRIATWLFYMSDVEAGGATVFP---------------PTGAAVWPRKG 481

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
               +Y+L+PNG  +  + H +CPV+ G KWV+ +WI +  Q
Sbjct: 482 SAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIHEHRQ 523


>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 215

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/218 (30%), Positives = 101/218 (46%), Gaps = 19/218 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P       F   ++   I+  +   L PS + L+ G         RTS+  ++ +S    
Sbjct: 3   PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP- 61

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------------EYG 189
            +++ I+ + A    +P +H E+  VLRYE  Q YD H D F+              EYG
Sbjct: 62  -VVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG 120

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDGLLFYS 248
                R+ +   Y+SDV +GG T F    G+   S    K C  G+ V P++   ++FYS
Sbjct: 121 --YKNRMITVFWYMSDVAKGGHTNFARSGGLPRPSSN--KDCSQGISVAPKKRKVVVFYS 176

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHED 286
           + PNG  D  SLH  CPV +G K    KWI ++ + +D
Sbjct: 177 MLPNGEGDPMSLHAGCPVEEGIKLSGNKWIWNKPRSDD 214


>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 750

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/229 (31%), Positives = 112/229 (48%), Gaps = 48/229 (20%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           F +F SA +C  ++A A   L+ S++    G+  E     RTSS TF++  + +  ++  
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVT--DGKLSEG----RTSSSTFLTGCKQEEPLVRA 587

Query: 147 IEHKIARA----TMLP---------QTHG--------------------EAFNVLRYEIG 173
           IE ++ RA    T++          + HG                    E   V+RY  G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
           Q Y +HYD            +R A+F++YL+DV  GG T FP    + +  G       G
Sbjct: 648 QMYTAHYDNKQGC------LRRTATFMMYLTDVHSGGATHFPRAVPVSMRDGC--GDAAG 699

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           +++ P+RG  L+F+S+   G  D  SLH + PVI+GEKW+ATKW+R+ E
Sbjct: 700 IRIWPKRGRALVFWSV-SGGIEDVRSLHEAEPVIEGEKWIATKWLREDE 747


>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 331

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 65/217 (29%), Positives = 102/217 (47%), Gaps = 23/217 (10%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTS 129
           + +  Q +S  PRA    +  S ++C ++I  A+ RL  S  +    G+ V +      S
Sbjct: 121 RQVSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCS 180

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----N 184
             +F    E+ + + + I  + AR    P  H E     RY  G+++  H D F     N
Sbjct: 181 CASF--PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLN 238

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
             +       R+A+ LLYL++VE GG T FP                 G +V+P++G  L
Sbjct: 239 NDKIMGSSGHRIATVLLYLNEVEAGGATFFPNP---------------GFEVRPQKGGAL 283

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            F     +G++D TSLH  C V +GEKW+AT W R++
Sbjct: 284 YFAYQQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320


>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
 gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
          Length = 473

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 101/211 (47%), Gaps = 21/211 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI   AK  L  +    + G   E     RT+ GT
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDP--ARTTKGT 331

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    + + +++ +       T L     + F VL Y IG  Y +H+D     E G   
Sbjct: 332 WLV---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NF 387

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           S R+A+ + YLSDV +GG T+FP                +GL V P++G  LL+Y+L   
Sbjct: 388 SNRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 432

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CP I G +WV TKWI ++EQ
Sbjct: 433 GDGDNRTAHSACPTIVGSRWVMTKWINEREQ 463


>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
          Length = 207

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 55/132 (41%), Positives = 77/132 (58%), Gaps = 5/132 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 80  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE +IA  + +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195

Query: 195 RLASFLLYLSDV 206
           R+A+ L+YL+DV
Sbjct: 196 RIATVLMYLTDV 207


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 95/192 (49%), Gaps = 27/192 (14%)

Query: 100 IATAKKRLKPSQLALRQGETVESTKGT----RTSSGTFISASEDKTGILELIEHKIARAT 155
           IA  ++  KP    L++     STK T    +T +       +    + E +  +I   T
Sbjct: 339 IAELQEMAKPE---LKRATVYNSTKNTNQFVKTRTAKVAWFLDTFNQLTERLNQRIMDMT 395

Query: 156 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGE 211
                  E   V+ Y +G  Y  H+D FN     P +SQ    R+A+ L YL+DVE+GG 
Sbjct: 396 NFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHISQINGDRIATVLFYLNDVEQGGA 454

Query: 212 TMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEK 271
           T+FP           + KK     V P+RG  +++Y+L  +G  +R +LH +CPVI G K
Sbjct: 455 TVFP-----------EIKKA----VFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSK 499

Query: 272 WVATKWIRDQEQ 283
           WV  KWIR++EQ
Sbjct: 500 WVCNKWIREREQ 511


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 71/228 (31%), Positives = 109/228 (47%), Gaps = 40/228 (17%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      S E+C +++  ++ RL       R+  TV++ 
Sbjct: 87  DAGDRQVQLLASLML---PRVVVLGGLLSDEECDALVELSRPRL-------RRSTTVDAQ 136

Query: 124 KG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
            G       RTS GTF         +   IE +IAR    P  +GE   VL Y  G ++ 
Sbjct: 137 TGGSQVHADRTSRGTFFERGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFR 194

Query: 178 SHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI 232
            HYD F+P E G ++      QR+A+ ++YL+    GG T FP  +              
Sbjct: 195 PHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPARGGATTFPDAH-------------- 240

Query: 233 GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
            L+V   +G+ + F    P+  + RT LHG  PV +GEKW+ATKW+R+
Sbjct: 241 -LEVAAVKGNAVFFSYDRPH-PMTRT-LHGGAPVTEGEKWIATKWLRE 285


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/215 (28%), Positives = 99/215 (46%), Gaps = 28/215 (13%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           P +V  LS  P  + F +     +  +++  AK ++  + +    G         RTS  
Sbjct: 324 PLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVT---GHNSSVVSNARTSQF 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
           TF+  +  K  +L  I+ ++A  T L   + E   +  Y IG  Y  H D F P  +   
Sbjct: 381 TFLPKTRHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438

Query: 189 ---GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
               P+M  R+ + L YLSDVE+GG T FP                +   ++P++     
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFP---------------ALKQLLRPKKHAAAF 483

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           +Y+L  +G  D  ++HG+CP+I G KWV  +WIR+
Sbjct: 484 WYNLHASGVGDARTMHGACPIIVGSKWVLNRWIRE 518


>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
          Length = 535

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++   S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +L+   +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YLSDVE GG T FPF               + L V P RG  L +Y+
Sbjct: 434 GDLHGNRMATGIYYLSDVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513


>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
 gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
          Length = 539

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 72/244 (29%), Positives = 109/244 (44%), Gaps = 21/244 (8%)

Query: 43  RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
           R +LR      +E    L  G    ++      + L   P  +   +  SA     +   
Sbjct: 292 REELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDTAELQHL 351

Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
           A+  L+ SQ+  R G     +   RTS GT    ++    I++ + H +A  + L     
Sbjct: 352 ARPELQRSQVYSRTGHE-HISANFRTSQGTTFEYTDHP--IMQKMSHHVAEISGLDMRSA 408

Query: 163 EAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM--SQRLASFLLYLSDVEEGGETMFPFENG 219
           E   +  Y IG  Y+ H D+F +  +Y   M  + RLA+ + YLS+VE GG T FPF   
Sbjct: 409 EPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPF--- 465

Query: 220 IFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
                       + L V P RG  L +Y+L P+G  D  + H +CPV++G KW+A  WIR
Sbjct: 466 ------------LPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVWIR 513

Query: 280 DQEQ 283
              Q
Sbjct: 514 LSNQ 517


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 97/199 (48%), Gaps = 31/199 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+ + + +  S  + +++   A+  L  SQ     G  V S    RTS   F+    D+ 
Sbjct: 53  PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVISE--IRTSQSVFL----DEV 102

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           G +  I  +IA  T L     E  +V  Y IG +Y  H+DA      G  +++R A+FL+
Sbjct: 103 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA------GGDVNERTATFLI 156

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           Y+SDVE GG T+F                 +G+ VKP +G  + + +L  NG +D  + H
Sbjct: 157 YMSDVEVGGATVF---------------TNVGVAVKPEKGSAVFWNNLHKNGELDLKTKH 201

Query: 262 GSCPVIKGEKWVATKWIRD 280
             CPV+ G KWVA KWI +
Sbjct: 202 AGCPVLVGNKWVANKWIHE 220


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/226 (28%), Positives = 104/226 (46%), Gaps = 24/226 (10%)

Query: 62  HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +  T  A ++  P   ++L   P  + + +  S  + + +   A  RLK +++ + Q   
Sbjct: 299 YNSTNSAFLRLAPLKMELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKV-VDQVTH 357

Query: 120 VESTKGTRTSSGTFISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
                  RTS  T++    D T    + +  +I   +       E   V+ Y +G  Y S
Sbjct: 358 RNMMVKERTSKVTWLG---DATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGGHYAS 414

Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVK 237
           HYD  N          R+A+ + YLSDVE+GG T+FP  +  +F                
Sbjct: 415 HYDFLNATSKTRLNGDRIATVMFYLSDVEQGGATVFPKIQKAVF---------------- 458

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           P+RG  +++Y+L  NG  D  ++H +CPVI G KWV  KWIR+ EQ
Sbjct: 459 PQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKWVCNKWIRENEQ 504


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 63/211 (29%), Positives = 104/211 (49%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + F +  S E+ + +   +K RL+ + ++      +E T   R S   ++S  E+ 
Sbjct: 343 RPRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYENP 401

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 402 --VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 456

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP+
Sbjct: 457 -NRIATWLFYMSDVAAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 500

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 501 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 531


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 65/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V PR+G  + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFP 492

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 129/244 (52%), Gaps = 24/244 (9%)

Query: 42  IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
           IR +LR  +VV +E    L H    +    S   Q LSW+PRA  +  F S E+C  +I+
Sbjct: 25  IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83

Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
            A    K  +LA   G++  V   +  ++S G      E    +   IE +I+  T LP+
Sbjct: 84  LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137

Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP- 215
            + E   V++Y+     QKY+ ++   + +++G  +   +A+ LL+LS+V  GGE  FP 
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFPE 193

Query: 216 --FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
              ++GI  D     +   GL+  P +G+ +LF+++ PN + D++S +  CPV++GE W 
Sbjct: 194 SESKSGILSDCT---ESSSGLR--PVKGNAILFFNVHPNASPDKSSSYARCPVLEGEMWC 248

Query: 274 ATKW 277
           ATK+
Sbjct: 249 ATKF 252


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 80/158 (50%), Gaps = 18/158 (11%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
           RTS GT+I    D   + + IE +I     L   + E F V+ Y +G  Y +H D   + 
Sbjct: 345 RTSKGTWIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDT 402

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
                +   R+A+ L YL+DVE+GG T+F   N                 V P+RG  L 
Sbjct: 403 WADKKEEDDRIATVLFYLTDVEQGGATVFTILNQ---------------AVSPKRGTALF 447

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L  NGT D  +LHG CPV+ G KW+ T WIR++ Q
Sbjct: 448 WYNLHRNGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQ 485


>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
 gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
          Length = 535

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       +   +S+  +A+  +K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  K    +L+ H +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--KNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YLSDVE GG T FPF               + L V P +G  L +Y+
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
           E   ++ Y +G  YD HYD FN   +        R+A+ L YL+DVE+GG T+FP     
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461

Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                      I   V P+RG  +++Y+L  NG ID  +LH +CPVI G KWV  KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511

Query: 281 QEQ 283
           +EQ
Sbjct: 512 REQ 514


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
           E   ++ Y +G  YD HYD FN   +        R+A+ L YL+DVE+GG T+FP     
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461

Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                      I   V P+RG  +++Y+L  NG ID  +LH +CPVI G KWV  KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511

Query: 281 QEQ 283
           +EQ
Sbjct: 512 REQ 514


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 105/217 (48%), Gaps = 28/217 (12%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + + +  +  +  ++   +K  +K   +   + +        RTS+ 
Sbjct: 375 PFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 434

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  H D F      PQ
Sbjct: 435 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TPQ 488

Query: 192 MSQR-----LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           +  R     +A+ L YLSDV +GG T+FP  N               + V+PR+GD LL+
Sbjct: 489 LEHRGGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLW 533

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           Y+L   G  +  ++H SCP+IKG KW   KWI +  Q
Sbjct: 534 YNLNDRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 570


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 51/126 (40%), Positives = 70/126 (55%), Gaps = 18/126 (14%)

Query: 161 HG-EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           HG E   ++ Y +G  YD HYD FN   +        R+A+ L YL+DVE+GG T+FP  
Sbjct: 407 HGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDVEQGGATVFP-- 464

Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
                         I   V P+RG  +++Y+L  +G ID  +LH +CPVI G KWV  KW
Sbjct: 465 -------------NIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWVCNKW 511

Query: 278 IRDQEQ 283
           IR++EQ
Sbjct: 512 IREREQ 517


>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
          Length = 535

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++   S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +L+   +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YL+DVE GG T FPF               + L V P RG  L +Y+
Sbjct: 434 GDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 479 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 513


>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
          Length = 409

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 64/215 (29%), Positives = 101/215 (46%), Gaps = 22/215 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++   S+  TA+ R+K S +    G    +    RTS G
Sbjct: 190 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 249

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +L+   +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 250 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQE 307

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YL+DVE GG T FPF               + L V P RG  L +Y+
Sbjct: 308 GDLHGNRMATGIYYLADVEAGGGTAFPF---------------LPLLVTPERGSLLFWYN 352

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 353 LHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 387


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
           E   ++ Y +G  YD HYD FN   +        R+A+ L YL+DVE+GG T+FP     
Sbjct: 407 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 461

Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                      I   V P+RG  +++Y+L  NG ID  +LH +CPVI G KWV  KWIR+
Sbjct: 462 ----------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 511

Query: 281 QEQ 283
           +EQ
Sbjct: 512 REQ 514


>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 309

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/244 (28%), Positives = 122/244 (50%), Gaps = 29/244 (11%)

Query: 41  SIRPKLRTLEVVEKENESGLPHGETGDAS-IQSIPFQVLSWRPRALYFPNFASAEQCQSI 99
           SIR +LR  EV   ++E+ +  G +   + I  +    LSWRPR   +  F + E+C  +
Sbjct: 26  SIRKELRDKEV---KHETIIQLGSSVQTNRISLLQVVQLSWRPRVFLYKGFLTDEECDRL 82

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI----LELIEHKIARAT 155
           I+ A              + +   KG  + +   +++SE ++ I    L  IE +I+  T
Sbjct: 83  ISLA-----------HGAKEISKGKGDGSRNNIQLASSESRSHIYDDLLARIEERISAWT 131

Query: 156 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            +P+ + +   V+ Y I +  + H+D F+       +S  +A+ +LYLS+V  GGE +FP
Sbjct: 132 FIPKENSKPLQVMHYGIEEARE-HFDYFDNKTLISNVSL-MATLVLYLSNVTRGGEILFP 189

Query: 216 ---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
               ++ ++ D   D        ++P +G+ +L ++   N + D  S HG CPV++GE W
Sbjct: 190 KSELKDKVWSDCTKDSSI-----LRPVKGNAVLIFNAHLNASADSRSTHGRCPVLEGEMW 244

Query: 273 VATK 276
            ATK
Sbjct: 245 CATK 248


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 105/235 (44%), Gaps = 29/235 (12%)

Query: 55  ENESGLPHG--ETGDASI----QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
           E+ SGL H   + G  +     + IP       PR   F    +  +C +++A A+ RL 
Sbjct: 99  EHASGLGHAGAQAGRNAAHFAGREIPILFTLAAPRVTLFQQLLTDAECDALVALARGRLA 158

Query: 109 PSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
            S + +      E+    RTS G      E    ++E IE  IA  T +    GE   +L
Sbjct: 159 RSPV-INPDTGDENLIEARTSLGAMFQVGEHP--LIERIEDCIAAVTGIAAERGEGLQIL 215

Query: 169 RYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLD 223
            Y+ G +Y  HYD FNP   G         QR+ + ++YL+    GG T FP        
Sbjct: 216 NYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRVGTLVIYLNSPLAGGATAFPK------- 268

Query: 224 SGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
                   +GL+V P +G+ + F     +G +D  +LH   PV  GEKW+ATKW+
Sbjct: 269 --------LGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKWIATKWL 315


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 100/203 (49%), Gaps = 24/203 (11%)

Query: 84  ALYFPNFASAE--QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
            +Y    +SAE  + + +   + KR    + +L + E V+    TRTS   +   S +  
Sbjct: 330 VIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVK----TRTSKVAWFPDSYNSL 385

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFL 200
            +   +  +I   T    +  E   ++ Y +G  YD HYD FN  E    ++  R+A+ L
Sbjct: 386 TLR--LNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIATVL 443

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
            Y+SDVE+GG T+FP                I   V P+RG  +++Y+L  +G  D  +L
Sbjct: 444 FYMSDVEQGGATVFP---------------NIYKTVYPQRGTAVMWYNLKDDGQPDEQTL 488

Query: 261 HGSCPVIKGEKWVATKWIRDQEQ 283
           H +CPV+ G KWV  KWIR++ Q
Sbjct: 489 HAACPVLVGSKWVCNKWIRERAQ 511


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 80/160 (50%), Gaps = 19/160 (11%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           TRTS   +   S +   +   +  +IA  T       E   ++ Y +G  YD HYD FN 
Sbjct: 372 TRTSKVAWFPDSYNPLTVR--LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNT 429

Query: 186 --AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
             +        R+A+ L YL+DVE+GG T+FP                I   V P+RG  
Sbjct: 430 INSNLTAMSGDRIATVLFYLTDVEQGGATVFP---------------NIRKAVFPQRGSV 474

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +++Y+L  NG  D  +LH +CPVI G KWV  KWIR++EQ
Sbjct: 475 IMWYNLQDNGQTDNKTLHAACPVIVGSKWVCNKWIREREQ 514


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 104/213 (48%), Gaps = 22/213 (10%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + + +  +  +  ++   +K  +K   +   + +        RTS+ 
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 371

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
            ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  H D F  P   G 
Sbjct: 372 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRGG 429

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               R+A+ L YLSDV +GG T+FP  N               + V+PR+GD LL+Y+L 
Sbjct: 430 --GDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWYNLN 472

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G  +  ++H SCP+I+G KW   KWI +  Q
Sbjct: 473 DRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 505


>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
 gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
          Length = 235

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 101/217 (46%), Gaps = 26/217 (11%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P  +L   P    +    +  +C  +I  A+  L+PS++    G + + T   RTS    
Sbjct: 39  PSNLLHDNPFIGCYEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDT-- 93

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----Y 188
           I      T +   I  +IA    LP  + E   + RY++G K+++H+D FNP+      Y
Sbjct: 94  IGFQHHLTELTLQICKRIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMY 153

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
             +  QR+ + LLYL++V  GGET FP  N               ++V P  G  L+F +
Sbjct: 154 LSENGQRIITALLYLNNVSAGGETSFPLLN---------------IQVAPSEGTLLVFEN 198

Query: 249 LFPNGTIDRT-SLHGSCPVIKGEKWVATKWIRDQEQH 284
              N       S+H  C V +GEKW+AT W  ++ Q+
Sbjct: 199 CKKNSNERHALSIHEGCAVHEGEKWIATLWFHEKSQY 235


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/215 (29%), Positives = 104/215 (48%), Gaps = 23/215 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++S  P  + + +  S  +   + + A   LK + +  +Q       K TRTS  T
Sbjct: 321 LKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVK-TRTSKVT 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYG 189
           ++  + ++  I   +  +I   T       E   V+ Y +G  YD HYD FN    A+  
Sbjct: 380 WLLDTLNQLTIR--LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLT 437

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
                R+A+ L YL+DVE+GG T+FP  E  +F                P+ G  +++Y+
Sbjct: 438 RLNGDRIATVLFYLTDVEQGGATVFPNIEKAVF----------------PKSGTAVVWYN 481

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L  +G  D  +LH +CPVI G KWV  KWIR+++Q
Sbjct: 482 LRHDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQ 516


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 68/123 (55%), Gaps = 17/123 (13%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
           E   ++ Y +G  YD HYD FN   +        R+A+ L YL+DVE+GG T+FP     
Sbjct: 198 EMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFP----- 252

Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                      I   V P+RG  +++Y+L  NG ID  +LH +CPVI G KWV  KWIR+
Sbjct: 253 ----------NIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIRE 302

Query: 281 QEQ 283
           +EQ
Sbjct: 303 REQ 305


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 86/176 (48%), Gaps = 24/176 (13%)

Query: 115 RQGETVESTKGT---RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
           R G  + ST      RTS   FI+A+  K  +L  I+ ++A  T L   + E   +  Y 
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419

Query: 172 IGQKYDSHYDAFNPAEYG----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 227
           IG  Y  H+D F  ++       +M  R+A+ L YLSDV +GG T FP    +       
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILKQL------- 472

Query: 228 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
                   +KP++     +Y+L  +G  D  +LHG CP+I G KWV  +WIR+ +Q
Sbjct: 473 --------LKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWIREYDQ 520


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 100/211 (47%), Gaps = 22/211 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++L   P  + + +  S  +   I+  A++R+  +    +   T   +  TRT+ G 
Sbjct: 310 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTAMGA 366

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++  S +   +   I  ++   + L     E   V+ Y IG  Y  H D F   ++   M
Sbjct: 367 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 422

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             RLA+ L YL+DVE+GG TMF                    KV PRRG  L +Y+L  +
Sbjct: 423 GNRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLHTD 467

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CP+I G KWV T+WIR++ Q
Sbjct: 468 GEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 498


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
          Length = 264

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 117/263 (44%), Gaps = 37/263 (14%)

Query: 26  LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASI----QSIPFQVLSWR 81
           LA F  ++   RD P              +++     GE G A      Q     +LS  
Sbjct: 16  LAAFLAASTTLRDTP-------FSGTASTQHQQPFKPGEWGYADADWLRQHYNITMLSED 68

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF----ISAS 137
           P  + F NF S E+  +I+  AK +   S   + +          RTSS  +    +  +
Sbjct: 69  PPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-----EVSNYRTSSTAWMLPDVLGN 123

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +     L+ +E +IAR   LP  + E F VL+Y+  Q Y  H D        P    R+A
Sbjct: 124 DPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQRQQP-CGIRVA 182

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-ID 256
           +F LYL+DVEEGG T FP                + L V+P +G+ +L+YS +PN T +D
Sbjct: 183 TFFLYLNDVEEGGGTRFP---------------NLNLTVQPAKGNAVLWYSAYPNTTRMD 227

Query: 257 RTSLHGSCPVIKGEKWVATKWIR 279
             + H + PV KG K+ A KWI 
Sbjct: 228 SRTDHEAMPVAKGMKYGANKWIH 250


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 492

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
          Length = 311

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 108/207 (52%), Gaps = 19/207 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           +SWRPR   +  F S E+C  +I+ A      PS+ +   G TV +      SSG  ++ 
Sbjct: 59  VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE+++A  T+LP+ H   F +++Y  G++    Y   N +   P     +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK-----VKPRRGDGLLFYSLFP 251
           A+ +LYLSD   GGE +FP       +S    K   G +     ++P +G+ +LF+S+  
Sbjct: 173 ATVVLYLSDSASGGEILFP-------ESKVKSKFWSGRRKKNNFLRPVKGNAILFFSVHL 225

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
           N + D++S H   P+  GE WVATK++
Sbjct: 226 NASPDKSSYHIRSPIRDGELWVATKFL 252


>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 532

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/221 (26%), Positives = 107/221 (48%), Gaps = 25/221 (11%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           GETG   +Q I  +  + +P  +   +         +IA AK RL+ S+      +    
Sbjct: 307 GETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363

Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
              +RTSS T+++  +      + + ++  +   T+  +   E + +  Y IG  Y  H+
Sbjct: 364 GPPSRTSSNTWLNDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHH 423

Query: 181 DAF----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 236
           D F     P++ G +   R+A+ ++Y+SDVEEGG T+FP                +G++V
Sbjct: 424 DYFEEFQTPSK-GNRFGNRVATLMIYMSDVEEGGATVFP---------------SLGVRV 467

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
            P++GD + ++++  +   +  + H  CPV+ G KW+A KW
Sbjct: 468 SPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSKWIANKW 508


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 372

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 373 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 429

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP
Sbjct: 430 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 472

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 473 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 504


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/212 (30%), Positives = 105/212 (49%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V PR+G  + +Y+LFP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPRKGTAVFWYNLFP 492

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV  KW+ ++ Q
Sbjct: 493 SGEGDYSTRHAACPVLVGNKWVFNKWLHERGQ 524


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 106/212 (50%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 345

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 346 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 402

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP
Sbjct: 403 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFP 445

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 446 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 477


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 100/211 (47%), Gaps = 22/211 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++L   P  + + +  S  +   I+  A++R+  +    +   T   +  TRT+ G 
Sbjct: 335 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTALGA 391

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++  S +   +   I  ++   + L     E   V+ Y IG  Y  H D F   ++   M
Sbjct: 392 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             RLA+ L YL+DVE+GG TMF                    KV PRRG  L +Y+L  +
Sbjct: 448 GNRLATVLFYLTDVEQGGATMFNKAEH---------------KVLPRRGTALFWYNLHTD 492

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CP+I G KWV T+WIR++ Q
Sbjct: 493 GEGDWSTTHAACPIIVGSKWVLTQWIRERNQ 523


>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
          Length = 561

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/225 (29%), Positives = 106/225 (47%), Gaps = 30/225 (13%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +  S  P  +   +  +  Q + +    + +L  S     +G+ V S    RTS
Sbjct: 337 LQPIKVEEQSLDPMIVVLHDLITERQTEILRQLGEPKLATSLHRGGEGKFVRSM--IRTS 394

Query: 130 SGTFISASEDKTGILELIEHKIARATML---PQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
              ++   E+ +  L  I H++  AT L   P+T  E F +  Y IG  Y +H D     
Sbjct: 395 KNAWLQEHENAS--LPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHP 452

Query: 187 EYGPQ--------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
           +  P+        +  R+A+ ++YLSDVE GG T+FP                 G+   P
Sbjct: 453 DVRPEDQDPWNLYVGDRIATLMVYLSDVEAGGATVFPRA---------------GVTCWP 497

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           R+G    +++L+ +G  D T+ HG+CPV+ G KWV+ KWIR  +Q
Sbjct: 498 RKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKWVSNKWIRQYDQ 542


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S E+ +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S E+ +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S E+ +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPS 493

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 524


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 104/211 (49%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + + +  S  + + +   AK RL+ + ++      +E T   R S   +++A ED 
Sbjct: 345 RPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLE-TAHYRISKSAWLTAYEDP 403

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             +++ I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 404 --VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 458

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L+Y+SDV  GG T+F                 +G  V P++G  + +Y+LFP+
Sbjct: 459 -NRIATWLIYMSDVPSGGATVFT---------------DVGAAVWPKKGSAVFWYNLFPS 502

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 503 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 533


>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
 gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
 gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
 gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
          Length = 386

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 63/211 (29%), Positives = 98/211 (46%), Gaps = 21/211 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI    K +L  +    + G   E     RT+ GT
Sbjct: 187 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 244

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    +   +++ +       T       + F VL Y IG  Y  H+D    AE     
Sbjct: 245 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 300

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           S R+A+ + YLSDV +GG T+FP                +GL V P++G  LL+Y+L   
Sbjct: 301 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 345

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CP + G +WV TKWI ++EQ
Sbjct: 346 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 376


>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
 gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
          Length = 509

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 98/211 (46%), Gaps = 21/211 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI   AK  L  +    + G   E     RT+ GT
Sbjct: 310 LKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGNDKEDP--ARTTKGT 367

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    + + +++ +       T       + F VL Y IG  Y +H+D     E G   
Sbjct: 368 WLV---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HF 423

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           S R+A+ + YLSDV +GG T FP                +GL V P +G  LL+Y+L   
Sbjct: 424 SDRIATAVFYLSDVPQGGATTFP---------------DLGLSVFPEKGAALLWYNLDHK 468

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CP I G +WV TKWI ++EQ
Sbjct: 469 GVGDNRTAHSACPTIVGSRWVMTKWINEREQ 499


>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 77/249 (30%), Positives = 129/249 (51%), Gaps = 29/249 (11%)

Query: 42  IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
           IR +LR  +VV +E    L H    +    S   Q LSW+PRA  +  F S E+C  +I+
Sbjct: 25  IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83

Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
            A    K  +LA   G++  V   +  ++S G      E    +   IE +I+  T LP+
Sbjct: 84  LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137

Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP- 215
            + E   V++Y+     QKY+ ++   + +++G  +   +A+ LL+LS+V  GGE  FP 
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFPE 193

Query: 216 -------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
                   ++GI  D     +   GL+  P +G+ +LF+++ PN + D++S +  CPV++
Sbjct: 194 SELKNSQSKSGILSDCT---ESSSGLR--PVKGNAILFFNVHPNASPDKSSSYARCPVLE 248

Query: 269 GEKWVATKW 277
           GE W ATK+
Sbjct: 249 GEMWCATKF 257


>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
           anophagefferens]
          Length = 182

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 100/198 (50%), Gaps = 29/198 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF + E+C ++I +AK  + P+ +    G        +RTSS  ++ A ED    L  + 
Sbjct: 8   NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
            K+   T  P  H E   V RY  G+ Y  HYDAF+ +      +     QR+A+ L+YL
Sbjct: 60  TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +DVE GGET F                 +G+++KPR+G+ L+F+    +G +D+  LH +
Sbjct: 120 NDVERGGETSF---------------SKLGVRIKPRKGNALIFFPATLDGVLDQNYLHAA 164

Query: 264 CPVIKGEKWVATKWIRDQ 281
            P +   KWV+  WIR +
Sbjct: 165 EPAVD-PKWVSQIWIRQR 181


>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 239

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 83/139 (59%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PR   + +F S ++   +I+ A+  LK S +A    G++  S    RTSSGTF+  
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R 
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYL+DV EGGET+FP
Sbjct: 170 ATVLLYLTDVPEGGETVFP 188


>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 222

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 15/134 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +       V+S  G       RTS
Sbjct: 97  EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+   +DK  I+  IE +IA  T +P   GE   VL YE+GQKY+ H+D F+     
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207

Query: 190 PQMSQRLASFLLYL 203
               QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221


>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
          Length = 480

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 106/248 (42%), Gaps = 55/248 (22%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKG-------TRTSSGTF 133
           PR  Y  NF SA +    +  +     P ++A   G T ++  +G       TRTS   F
Sbjct: 205 PRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSENAF 264

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---------- 183
              ++    + +    ++ R     +   +   +LRY++GQ Y +H+D F          
Sbjct: 265 DITTKQSFDVKKRA-FRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKDFNW 323

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP---------------------------- 215
           +P   G   S R A+  LYLSDV  GG+T+FP                            
Sbjct: 324 DPLSGG---SNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASELKE 380

Query: 216 -FENGIFLDSGYD---YKKCI-GLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE 270
              N   ++  ++     KC     V PRRGD +LFYS  P+G +D  SLHG+CP++ G 
Sbjct: 381 FVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGACPILNGT 440

Query: 271 KWVATKWI 278
           KW A  W+
Sbjct: 441 KWGANLWV 448


>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
 gi|255628535|gb|ACU14612.1| unknown [Glycine max]
          Length = 238

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/146 (37%), Positives = 83/146 (56%), Gaps = 3/146 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199

Query: 195 RLASFLLYLSDVEEGGETMFPFENGI 220
           R+A+ L+YLSD  E GET FP    +
Sbjct: 200 RIATMLMYLSDNIERGETYFPLAGSV 225


>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
 gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
          Length = 509

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/211 (29%), Positives = 98/211 (46%), Gaps = 21/211 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI    K +L  +    + G   E     RT+ GT
Sbjct: 310 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 367

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    +   +++ +       T       + F VL Y IG  Y  H+D    AE     
Sbjct: 368 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 423

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
           S R+A+ + YLSDV +GG T+FP                +GL V P++G  LL+Y+L   
Sbjct: 424 SDRIATAVFYLSDVPQGGATIFP---------------KLGLSVFPKKGSALLWYNLDHK 468

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CP + G +WV TKWI ++EQ
Sbjct: 469 GDGDNRTAHSACPTVVGSRWVMTKWINEREQ 499


>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 204

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 74/129 (57%), Gaps = 5/129 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 79  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE KIA  T +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194

Query: 195 RLASFLLYL 203
           R+A+ L+YL
Sbjct: 195 RIATVLMYL 203


>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
          Length = 212

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 110/218 (50%), Gaps = 29/218 (13%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSS 130
           PF++   S  P  + + N  S ++ + II  +K  LK S +    GE+  +     RTS 
Sbjct: 3   PFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMV----GESFSKEVSNERTSQ 58

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEY 188
             +++  + +  +++++  +    T L +   E+  V  Y IG  Y  H+D    N  E 
Sbjct: 59  NAWLADYDFE--LVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEE 116

Query: 189 GPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
             +   +  R+A+ + YLSDVE+GG T+FP                IG+ V P++G  + 
Sbjct: 117 PYKDMGLGNRIATLMYYLSDVEQGGATVFP---------------QIGVGVFPKKGSAIF 161

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L P+GT D  +LHG+CPV+ G KWVA KWI    Q
Sbjct: 162 WYNLLPDGTGDERTLHGACPVLLGSKWVANKWIHQYHQ 199


>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
          Length = 305

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 64/202 (31%), Positives = 103/202 (50%), Gaps = 29/202 (14%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
           F  F + ++C  +I+  +  L+P+  +  R G  +      RTS G  F  A ED   ++
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
           + I  +IA A+    + GE   +LRY +GQ+Y  H+D        P + +QR  + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           ++   GGET+FP                +GL VK R+GD LLF +    G     ++H  
Sbjct: 236 NEGYAGGETIFPR---------------LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLG 280

Query: 264 CPVIKGEKWVATKWIRDQEQHE 285
            PV+ G+KW+ T+WIR  ++H+
Sbjct: 281 APVMAGQKWLCTRWIR-HDRHD 301


>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
          Length = 190

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 97/211 (45%), Gaps = 31/211 (14%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED-- 139
           PR        S  +C  II    K ++ S +    G+    T  TRTS   ++  S    
Sbjct: 1   PRVFLVREMLSEFECDHIIELGTKVVRKSMV----GQGGGFTSKTRTSENGWLRRSASPI 56

Query: 140 ------KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
                 + G +  I+H + R+      + E   V+RY+  Q+Y  H+D  +     PQ  
Sbjct: 57  LENIYKRFGDVLGIDHDLLRSG----KNAEELQVVRYDRSQEYAPHHDFGDDGT--PQ-- 108

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
           QR  + LLY+   EEGG T FP  N             +G++V P RGD +LFYS+ P+G
Sbjct: 109 QRFLTLLLYIQLPEEGGATSFPKAN-----------DGMGVQVVPARGDAVLFYSMLPDG 157

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
             D  +LH   PV KG+KWV   W+ D  +H
Sbjct: 158 NADDLALHAGMPVRKGQKWVCNLWVWDPHRH 188


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 103/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + +    + ++ + I   +K RL+ + ++      +E T   R S   +++A E  
Sbjct: 341 RPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLAAYEHP 399

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             +++ I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 400 --VVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 454

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  VKP +G  + +Y+LFP+
Sbjct: 455 -NRIATWLFYMSDVAAGGATVFP---------------EVGAAVKPLKGTAVFWYNLFPS 498

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 499 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 529


>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
           CCMP1335]
 gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
           CCMP1335]
          Length = 248

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 67/226 (29%), Positives = 98/226 (43%), Gaps = 34/226 (15%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  + +S  PR     +F S  +   I+    +  +  + +            TRTS  T
Sbjct: 35  ITLRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMNT 94

Query: 133 FISASEDKTGILELIEHKIARATML---------PQTH---------GEAFNVLRYEIGQ 174
           +I   E  T I++ I  ++A    +         P  H          E   ++ Y+ G+
Sbjct: 95  WIYREE--TAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPGE 152

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
           +Y +H+D        P    R  + LLYL+DVEEGGET FP              +  GL
Sbjct: 153 EYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP--------------RWGGL 198

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
            VKP +G  +LFY L  +G  D  S H + PVIKGEKW++  WI D
Sbjct: 199 DVKPVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWIWD 244


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 80/170 (47%), Gaps = 23/170 (13%)

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G         RTS  TFI  +  K  +L  I+ ++A  T L     E   +  Y IG  Y
Sbjct: 362 GNNASVVSNARTSQFTFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419

Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
             H D F+P  +        +M  R+A+ L YL+DVE+GG T FP    +          
Sbjct: 420 AQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFPVLKQL---------- 469

Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                +KP++     +Y+L  +G  D  ++HG+CP+I G KWV  +WIR+
Sbjct: 470 -----LKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKWVLNRWIRE 514


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 67/228 (29%), Positives = 108/228 (47%), Gaps = 25/228 (10%)

Query: 62  HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +  T  A ++  P   +++   P  + + +  SA +   +   A   LK + +    G  
Sbjct: 308 YNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRR 367

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            E  K TRTS   +   + ++  + E +  +IA  T       E    + Y +G  YD H
Sbjct: 368 SEVVK-TRTSKVAWFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKH 424

Query: 180 YDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
           YD FN A     ++Q    R+A+ L YL+DVE+GG T+FP                I   
Sbjct: 425 YDFFN-ASTATNLTQMNGDRIATVLFYLTDVEQGGATVFP---------------NIRKA 468

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           V P+RG  +++Y+L  +G  +  +LH +CPV+ G KWV  KWIR++ Q
Sbjct: 469 VFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQ 516


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 104/206 (50%), Gaps = 24/206 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + N  S ++ + I   AK RL  ++  +R  +T V +T   R S   ++   +D 
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRL--ARATVRDPKTGVLTTAPYRVSKSAWLEGEDDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             +++ +  +I   T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FL Y+SDVE GG T+FP           D+    G  + PR+G  + +Y+LF +G  D 
Sbjct: 450 TFLNYMSDVEAGGATVFP-----------DF----GASIWPRKGTAVFWYNLFRSGEGDY 494

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 RTRHAACPVLVGSKWVSNKWIHERGQ 520


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 103/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S ++ +++   AK RL+ + ++      +E T   R S   +++  E  
Sbjct: 348 RPYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLE-TASYRISKSAWLTGYEHP 406

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E+I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 407 --VIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 461

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF N
Sbjct: 462 -NRIATWLFYMSDVAAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFAN 505

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 506 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 536


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 102/218 (46%), Gaps = 22/218 (10%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q    +V+  +P+ + F +     + + + A A  RL+ + +       +E  +  R S
Sbjct: 321 LQPAKEEVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAE-YRIS 379

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              ++S  ED   ++  + H+I + T L     E   V  Y +G  Y+ H+D     E  
Sbjct: 380 KSAWLS--EDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEIN 437

Query: 190 P----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
                    R+A+FL Y+SDVE GG T+FP                +G ++ P +G    
Sbjct: 438 AFKSLNTGNRIATFLFYMSDVEAGGATVFP---------------QVGARLIPEKGSAAF 482

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L  NG  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 483 WYNLLKNGEGDYSTRHAACPVLVGSKWVSNKWIHERGQ 520


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 67/228 (29%), Positives = 108/228 (47%), Gaps = 25/228 (10%)

Query: 62  HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +  T  A ++  P   +++   P  + + +  SA +   +   A   LK + +    G  
Sbjct: 273 YNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRR 332

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            E  K TRTS   +   + ++  + E +  +IA  T       E    + Y +G  YD H
Sbjct: 333 SEVVK-TRTSKVAWFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKH 389

Query: 180 YDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK 235
           YD FN A     ++Q    R+A+ L YL+DVE+GG T+FP                I   
Sbjct: 390 YDFFN-ASTAANLTQMNGDRIATVLFYLTDVEQGGATVFP---------------NIRKA 433

Query: 236 VKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           V P+RG  +++Y+L  +G  +  +LH +CPV+ G KWV  KWIR++ Q
Sbjct: 434 VFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQ 481


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 63/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + +    S  + +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 RPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLE-TAPYRISKSAWLSGYEHS 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           T  +E I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 395 T--IERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+F                 +G  V P++G  + +Y+LFP+
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFT---------------DVGAAVWPKKGTAVFWYNLFPS 493

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 524


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 59/209 (28%), Positives = 103/209 (49%), Gaps = 26/209 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + + +  S E+   +   AK RL+ + ++      +E+ +  R +   ++S  ED 
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP 384

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ------MSQ 194
             ++  +  +I   T L  +  E   V  Y IG +Y+ H+D     +Y P          
Sbjct: 385 --VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR--KYEPDAFKKLGTGN 440

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A++L Y+SDVE GG T+FP                +G  V P++G  + +Y+L  +G 
Sbjct: 441 RVATWLFYMSDVEAGGATVFPE---------------VGAAVYPKKGTAVFWYNLLESGE 485

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 486 GDYSTRHAACPVLVGNKWVSNKWIHERGQ 514


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 102/210 (48%), Gaps = 21/210 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V++  P    + + AS  +   +I   + ++  S +     + V  +   RTS  ++++
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMVGDAAKKEVSKS---RTSQNSWLT 370

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
              D   +  L       A  L +T  E+  V  Y IG  Y  HYD        P+++  
Sbjct: 371 -DYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTG 429

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
            R+A+ + YLSDVEEGG T+FP                +G+ V P++G  + +Y+L  +G
Sbjct: 430 NRIATLMFYLSDVEEGGATVFPH---------------LGVGVFPKKGTAIFWYNLRASG 474

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             D  +LHG+CPV+ G KWVA KWI ++ Q
Sbjct: 475 KGDEKTLHGACPVLIGSKWVANKWIHERHQ 504


>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
 gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
          Length = 535

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 63/216 (29%), Positives = 99/216 (45%), Gaps = 22/216 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  LS  P         S++  + I   A+ ++K S +    G         RTS G
Sbjct: 316 PFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +++   +   + L     E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSFPENHIYDE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G     R+A+ + YLSDVE GG T FPF               + L V P +G  L +Y+
Sbjct: 434 GDDRGNRIATGIYYLSDVEAGGGTAFPF---------------LPLLVTPEKGSLLFWYN 478

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           L  +G  D  + H +CPV++G KW+A  WIR++ QH
Sbjct: 479 LHESGDQDYRTKHAACPVLQGSKWIANVWIRERNQH 514


>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
          Length = 262

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 75/251 (29%), Positives = 120/251 (47%), Gaps = 48/251 (19%)

Query: 44  PKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
           P LR LE ++    S LP+ E  +AS            PR     N  + ++C+ ++  A
Sbjct: 42  PSLRGLESLK----SNLPYLEQINAS------------PRVFRIRNLLTKQECEHLMLLA 85

Query: 104 KKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
            ++     + +  G  + VEST  TRT+ G ++   +D   ++  +E  + + T      
Sbjct: 86  FRKGLSKTMIMPYGTHKLVEST--TRTNDGAWLDFLQDD--VVRRLEETLGKLTKTTPQQ 141

Query: 162 GEAFNVLRYEIG-QKYDSHYDAFNPAEYGP----QMSQRLASFLLYLSDVEEGGETMFPF 216
           GE   VL Y  G Q +  HYD F+PA   P    Q   R  + ++YL    EGGET FP 
Sbjct: 142 GENLQVLHYSNGAQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFP- 200

Query: 217 ENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL--FPNGT----IDRTSLHGSCPVIKGE 270
                          +GLK+  + GD L+FY+L    +GT    +++ ++H + P ++GE
Sbjct: 201 --------------ELGLKLTAQPGDALMFYNLKEHCSGTDPDCVEKKTIHAALPPVRGE 246

Query: 271 KWVATKWIRDQ 281
           KWVA KWI ++
Sbjct: 247 KWVAVKWIHEK 257


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 101/215 (46%), Gaps = 25/215 (11%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +V++  P  + +    S  +   +I  A+  +K S +   + E +      R S   
Sbjct: 315 LKLEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAVGDTRSEQISKI---RISQNA 371

Query: 133 FISASEDKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           +     D   I+E +  + AR  A  L +   E   V  Y +G  Y  HYD    A   P
Sbjct: 372 WFENEHDP--IVETLNQR-ARDMAGGLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFP 428

Query: 191 Q--MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
              M  R+A+ + YLSDV+EGG T+FP  N               L V+PR+G  + +Y+
Sbjct: 429 NKGMGNRIATLMFYLSDVQEGGSTVFPRLN---------------LAVRPRKGTAIFWYN 473

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L  NG  ++ +LH +CPV+ G KWVA KWI ++ Q
Sbjct: 474 LHRNGKGNKKTLHAACPVLIGSKWVANKWIHERHQ 508


>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
          Length = 226

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 98/202 (48%), Gaps = 30/202 (14%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK-TGI 143
            Y P+F  A  C  ++A      + S +        ES +  RTS     S   D+ +  
Sbjct: 43  FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSD----SCDMDRWSPD 93

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLAS 198
           +   +  IA    +   HGE     RY +GQ + +H+D FN A+ Y P+M     QR  +
Sbjct: 94  VRPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWT 153

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
            ++YL+DVEEGG T FP                IG++V P++G  L + ++ P+G  +  
Sbjct: 154 AMIYLNDVEEGGATWFP---------------TIGIRVAPKKGLLLTWNNMKPDGDRNTA 198

Query: 259 SLHGSCPVIKGEKWVATKWIRD 280
           +LH   PV++G K++ TKW R+
Sbjct: 199 TLHEGMPVVQGTKYIVTKWFRE 220


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 105/212 (49%), Gaps = 22/212 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V      P  L F N  S  + +++   A+ RL  +       + +E     R S  
Sbjct: 317 PFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKV 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +   L ++  ++A  T L  +  E F V+ Y IG  Y+ H+D  +  +  P 
Sbjct: 376 AWLEDQEHQH--LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PA 431

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
           +  R+ + L YLSDVE+GG T+FP                I + V P++G  +++++L P
Sbjct: 432 IGSRIETVLFYLSDVEQGGATVFP---------------EIQVSVWPQKGSAVVWFNLHP 476

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D+ + H  CPV+ G KW+ATKWI ++ Q
Sbjct: 477 SGDGDQRTKHAGCPVLIGSKWIATKWIHERGQ 508


>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
          Length = 222

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 53/134 (39%), Positives = 74/134 (55%), Gaps = 15/134 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 98  EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++ +IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208

Query: 190 PQMSQRLASFLLYL 203
               QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 99/204 (48%), Gaps = 21/204 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           P  + + +  SA++ + +   A  R++ S +  L  G+  +S    R S   +++     
Sbjct: 331 PYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKS--AFRVSKNAWLAYESHP 388

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASF 199
           T  +E +   +  AT L  T+ E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ 
Sbjct: 389 T--MEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATA 446

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           + YLSDVE+GG T FPF               +   VKP+ G+ L +Y+L  +  +D  +
Sbjct: 447 IFYLSDVEQGGATAFPF---------------LDFAVKPQLGNVLFWYNLHRSLDMDYRT 491

Query: 260 LHGSCPVIKGEKWVATKWIRDQEQ 283
            H  CPV+KG KW+   WI D  Q
Sbjct: 492 KHAGCPVLKGSKWIGNVWIHDMTQ 515


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/216 (27%), Positives = 105/216 (48%), Gaps = 24/216 (11%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSS 130
           PF  ++LS  P  + + +  +  +  ++   +K  +K   + +     V       RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 371

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
             ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  H D F   +  P
Sbjct: 372 SVWLASHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQ-AP 428

Query: 191 QMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
           +      R+A+ L YLSDV +GG T+FP  N               + V+PR+GD LL+Y
Sbjct: 429 EHRGGGDRIATVLFYLSDVPQGGATLFPRLN---------------ISVQPRQGDALLWY 473

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L   G  +  ++H SCP+I+G KW   KWI +  Q
Sbjct: 474 NLNDRGQGEIGTVHTSCPIIQGSKWALVKWIDELSQ 509


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 101/202 (50%), Gaps = 20/202 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  L F N  S  + +++   A+ RL  +       + +E     R S   ++   E + 
Sbjct: 321 PDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKVAWLEDQEHQH 379

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
             L ++  ++A  T L  +  E F V+ Y IG  Y+ H+D  +  +  P +  R+ + L 
Sbjct: 380 --LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PAIGSRIETVLF 435

Query: 202 YLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLH 261
           YLSDVE+GG T+FP                I + V P++G  +++++L P+G  D+ + H
Sbjct: 436 YLSDVEQGGATVFPE---------------IQVSVWPQKGSAVVWFNLHPSGDGDQRTKH 480

Query: 262 GSCPVIKGEKWVATKWIRDQEQ 283
             CPV+ G KW+ATKWI ++ Q
Sbjct: 481 AGCPVLIGSKWIATKWIHERGQ 502


>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
           [Cucumis sativus]
          Length = 311

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/207 (30%), Positives = 107/207 (51%), Gaps = 19/207 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           +SWRPR   +  F S E+C  +I+ A      PS+ +   G TV +      SSG  ++ 
Sbjct: 59  VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE+++A  T+LP+ H   F +++Y  G++    Y   N +   P     +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLK-----VKPRRGDGLLFYSLFP 251
           A+ +LYLSD   GGE +FP       +S    K   G +     ++P +G+ +L +S+  
Sbjct: 173 ATVVLYLSDSASGGEILFP-------ESKVKSKFWSGRRKKNNFLRPVKGNAILXFSVHL 225

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWI 278
           N + D++S H   P+  GE WVATK++
Sbjct: 226 NASPDKSSYHIRSPIRDGELWVATKFL 252


>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
 gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
          Length = 485

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 97/211 (45%), Gaps = 32/211 (15%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +VL  +P  + F +  S  +   +   A   LK + +         S KGTRTS G 
Sbjct: 295 LKMEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGI 354

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++S S +   + + I  +I+  T        +  V+ Y +   Y  H D FN AE     
Sbjct: 355 WLSRSHN--NLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE----- 407

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
                     LSDVE+GG+T+FP     F               KP RG  LL+Y+L  N
Sbjct: 408 ----------LSDVEQGGDTVFPRIEQAF---------------KPERGKALLWYNLHRN 442

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           GT D+ + HG+CPV+ G KW+ T+WI ++ Q
Sbjct: 443 GTGDKRTEHGACPVLVGSKWIMTQWINERPQ 473


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 28/211 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   QG  V      RT+ G +     ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L YL+DVE+GG T       +F D GY         V P+ G  + +Y+L  N
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFADVGYS--------VYPQAGTAIFWYNLDTN 486

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 28/209 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK RL  ++  +R  +T V +T   R S   ++   ED 
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             +++ +  +I   T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 450

Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
             A+FL Y+SDVE GG T+FP           D+    G  + PR+G  + +Y+LF +G 
Sbjct: 451 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTSVFWYNLFRSGE 495

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 496 GDYRTRHAACPVLVGSKWVSNKWIHERGQ 524


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 98/211 (46%), Gaps = 29/211 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I+ A + +K +++     ET   T   RT+ G ++    ++ 
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ---- 194
            +   I  +I   T       E F V+ Y IG  Y  H D F+ A     GP+  Q    
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441

Query: 195 --RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L YLSDVE+GG T+F                 +G  V P+ G  + +Y+L  +
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDTD 486

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H SCPVI G KWV T+WIR+  Q
Sbjct: 487 GNGDPLTRHASCPVIVGSKWVMTEWIRESRQ 517


>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 322

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 103/202 (50%), Gaps = 29/202 (14%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
           F  F + ++C  +I+  +  L+P+  +  R G  +      RTS G  F  A ED   ++
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
           + I  +IA A+    + GE   +LRY +GQ+Y  H+D        P + +QR  + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           ++   GGET+FP                +GL VK R+G+ LLF +    G     ++H  
Sbjct: 253 NEGYAGGETIFPR---------------LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLG 297

Query: 264 CPVIKGEKWVATKWIRDQEQHE 285
            PV+ G+KW+ T+WIR  ++H+
Sbjct: 298 APVMAGQKWLCTRWIR-HDRHD 318


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 97/211 (45%), Gaps = 28/211 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   QG  V      RT+ G +     ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L YL+DVE+GG T       +F D GY         V P+ G  + +Y+L  N
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFADVGY--------SVYPQAGTAIFWYNLDTN 486

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GKGDPRTKHAACPVIVGSKWVMTEWIREKRQ 517


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/209 (31%), Positives = 103/209 (49%), Gaps = 28/209 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +F S E+ + I   AK +L  ++  +R  ++ V +T   R S   ++   ED 
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKL--ARATVRDPKSGVLTTASYRVSKSAWLEGEEDP 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             I+  +  +I   T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 397 --IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 452

Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
             A+FL Y+SDVE GG T+FP           D+    G  + PR+G  + +Y+LF +G 
Sbjct: 453 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFKSGE 497

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 498 GDYRTRHAACPVLVGNKWVSNKWIHERGQ 526


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S  +   I   AK RL+ + ++      +E T   R S   +++A ED 
Sbjct: 350 RPYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLE-TAPYRISKSAWLTAYEDP 408

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 409 --VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 463

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 464 -NRIATWLFYMSDVSAGGATVFP---------------DVGASVGPQKGTAVFWYNLFAS 507

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 508 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 538


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 103/210 (49%), Gaps = 28/210 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----- 194
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D     E  P   Q     
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTG 447

Query: 195 -RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
            R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +G
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASG 492

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 493 EGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/217 (29%), Positives = 99/217 (45%), Gaps = 26/217 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           I  +  + LS  P  + + +     +  +I      +LK + +     E+V S    RTS
Sbjct: 290 IAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATIT-STNESVVS--NVRTS 346

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY- 188
             TF+  +EDK  +L  I+ ++A  T     + E      Y IG  Y  H D F    + 
Sbjct: 347 QFTFLPVTEDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFD 404

Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
                 P+M  R+A+ L YLSDV +GG T FP                + + +KP++   
Sbjct: 405 AGLVSSPEMGNRIATVLFYLSDVTQGGGTAFPH---------------LRVLLKPKKYAA 449

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
             +Y+L  +G  D  + HG+CP+I G KWV  +WIR+
Sbjct: 450 AFWYNLHASGVGDPRTQHGACPIISGSKWVQNRWIRE 486


>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
          Length = 478

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/293 (29%), Positives = 122/293 (41%), Gaps = 68/293 (23%)

Query: 39  VPS----IRPKLRTLEVVEKENESGLPHGETG-DASIQSIPFQVLSWRPRALYFPNFASA 93
           VPS    + P +RT      +    L   ETG DA  +S+    LS RP+      F   
Sbjct: 114 VPSGVHFVWPTVRTGHATVTD----LADAETGVDAGHRSV-VTTLSMRPQVFRISQFMMG 168

Query: 94  EQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
            + + +I   K R+KPS++ L  R G+       TRTS+  + +AS        +    I
Sbjct: 169 HETEKLIERNKPRIKPSEVGLVGRSGDK------TRTSTNAWDTASP-------VARDVI 215

Query: 152 ARATMLPQTHG-----EAFNVLRYEIGQKYDSHYDAFNPAEYG----------------P 190
            RA  L +        +   VL YE  Q Y  H D F     G                 
Sbjct: 216 GRAFRLLKIDAHRKLEDGLQVLHYERPQWYKPHVDYFTSRNAGGGGASEDAFSNAIPTAN 275

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP-------FENGIFLDSGYDYK-------------- 229
             + R A+  LYL++   GGET+FP       ++ G    +G +                
Sbjct: 276 NGTNRFATVFLYLNNAGSGGETVFPLSTTHEIYQGGRLTQAGTNRTPGFIRDADAAWVCD 335

Query: 230 -KCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            K   L+V PR GD +LFYS   + ++D  SLHGSCP+  GEKW A  W+ ++
Sbjct: 336 TKSEALRVTPRTGDSVLFYSQRGDASLDGYSLHGSCPMGDGEKWAANLWVWNR 388


>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
 gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
          Length = 314

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 78/251 (31%), Positives = 117/251 (46%), Gaps = 39/251 (15%)

Query: 35  LSRDVPSIRPKLRTLEVVEKE-NESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASA 93
           L+ D P  + ++R L   E+E N    P  ET   SI+ +         R L    F+SA
Sbjct: 97  LAVDDPVAQQQMRLLNAAEEEENWRTEPVSET--PSIRMV---------RHL----FSSA 141

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIAR 153
           E C  +   +  RL+PS + L            RTS G  +S  E+   ++ ++  +IA 
Sbjct: 142 E-CAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVEEDL-VVGMLNRRIAA 198

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
           AT   +  GE  ++LRY   Q+Y  H+DA    E     +QR  + ++YL+   EGGET 
Sbjct: 199 ATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEGGETA 253

Query: 214 FPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
           FP                +G +++ R+GD LLF +L  +G  D    H   P   G KW+
Sbjct: 254 FPE---------------LGFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSGAKWI 298

Query: 274 ATKWIRDQEQH 284
           AT+WIR +  H
Sbjct: 299 ATRWIRTRPYH 309


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 103/217 (47%), Gaps = 20/217 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +  I  +V   +P+   F +  +  + + +   A  +L  + +    GE + +T   R S
Sbjct: 314 LSPIKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRIS 371

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-- 187
              ++S S+D  G ++ I+ +I   T L  +  E   V+ Y IG +Y+ HYD     E  
Sbjct: 372 KSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDT 431

Query: 188 -YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
                   R+++ L+Y+SDVE+GG T+FP                +G ++ P +     +
Sbjct: 432 FTSLGSGNRISTLLIYMSDVEKGGATVFP---------------GVGARLVPIKRAAAYW 476

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           ++L  +G  D ++ H  CPV+ G KWV  KWI ++ Q
Sbjct: 477 WNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQ 513


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 105/213 (49%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLS RP  + + +F S  + + I   A+  L+ S +A   G+  ++T   R S   ++ 
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVA--TGDK-QATAEYRISKSAWLK 392

Query: 136 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            S   T  +  ++ KI+  T L     HGE   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 393 GSAHST--VSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V   +   + +++L 
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 495

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            NG  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 496 RNGEGDADTLHAGCPVLIGDKWVANKWIHEYGQ 528


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F    S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 242

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 243 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 298

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 299 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 341

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 342 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 373


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 85/163 (52%), Gaps = 24/163 (14%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           TRTS  T+++ S +   +   +  +I+  T       E   V+ Y +G  YD H+D FN 
Sbjct: 372 TRTSKVTWLTDSLNPLTVR--LNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFN- 428

Query: 186 AEYGPQMSQ----RLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRR 240
           A     +++    R+A+ L YL+DVE+GG T+FP  +  IF                P++
Sbjct: 429 ATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFPNIKQAIF----------------PKK 472

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  +++Y+L  N   D  +LH +CPVI G KWV  KWIR+ +Q
Sbjct: 473 GTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQ 515


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + +    S  + +++   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 350

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 351 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 406

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 407 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 449

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 450 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 481


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 383

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 384 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 439

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 440 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 482

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 483 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 514


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F    S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 394

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 395 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 450

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 451 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 493

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 494 SGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 525


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 311

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 312 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 367

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 368 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 410

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 411 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 442


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 102/209 (48%), Gaps = 26/209 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D     E  P   Q      
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 448

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +G 
Sbjct: 449 RIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGE 493

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 494 GDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 215

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 93/201 (46%), Gaps = 24/201 (11%)

Query: 85  LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
           ++F    S ++C  +IA       KPS +     +    T G R S  T ++ S D   I
Sbjct: 17  VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 73

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
           +  I  +I   + + Q + E   +L Y  G KYD HYDAF  ++  PQ+     RL + L
Sbjct: 74  ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 131

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           LYL+DVE GG T FP                I   + P  G G+LF +        R SL
Sbjct: 132 LYLNDVEYGGWTQFPH---------------IMANIVPNAGSGILFRNTDAQNRQLRESL 176

Query: 261 HGSCPVIKGEKWVATKWIRDQ 281
           H   PV  GEKW+A+ WIR+ 
Sbjct: 177 HAGLPVTHGEKWIASIWIREN 197


>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 213

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 93/201 (46%), Gaps = 24/201 (11%)

Query: 85  LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
           ++F    S ++C  +IA       KPS +     +    T G R S  T ++ S D   I
Sbjct: 15  VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 71

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
           +  I  +I   + + Q + E   +L Y  G KYD HYDAF  ++  PQ+     RL + L
Sbjct: 72  ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 129

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           LYL+DVE GG T FP                I   + P  G G+LF +        R SL
Sbjct: 130 LYLNDVEYGGWTQFPH---------------IMANIVPNAGSGILFRNTDAQNRQLRESL 174

Query: 261 HGSCPVIKGEKWVATKWIRDQ 281
           H   PV  GEKW+A+ WIR+ 
Sbjct: 175 HAGLPVTHGEKWIASIWIREN 195


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 101/211 (47%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F    S  + + +   AK RL+ + ++      +E T   R S   ++S  ED 
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYEDP 395

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 396 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 450

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 451 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 494

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 495 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 525


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
 gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
          Length = 310

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 106/205 (51%), Gaps = 19/205 (9%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT---SSGTFI 134
           +SW+PR   +  F + E+C  +I+ A+   + S+     G+  +S +  R    +S T +
Sbjct: 61  VSWQPRVFVYKGFLTDEECDHLISLAQGTKETSE-----GKDDDSGRIERNRLFASSTSL 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   IL  IE +++  T+LP+ + +   V+ Y I +   +++D F           
Sbjct: 116 LNMDD--NILSRIEERVSAWTLLPKENSKPLQVMHYGI-EDAKNYFDYFGNKSAIISSEP 172

Query: 195 RLASFLLYLSDVEEGGETMFP---FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
            +A+ + YLS+V +GGE  FP    +N I+ D        I   ++P +G+ +LF+++ P
Sbjct: 173 LMATLVFYLSNVTQGGEIFFPKSEVKNKIWSDCTK-----ISDSLRPIKGNAILFFTVHP 227

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATK 276
           N + D  S H  CPV++GE W ATK
Sbjct: 228 NTSPDMGSSHSRCPVLEGEMWYATK 252


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 101/213 (47%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K  + PS+      + V S    RTS   
Sbjct: 322 IKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFVVSK--FRTSKSV 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYGP 190
           ++    ++  +   +  ++A AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 380 WLDRDANEATVK--LTQRLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVG 437

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               R+A+ L YL+DV +GG T FP  N               + V PR G  L +Y+L 
Sbjct: 438 GFIDRIATTLFYLNDVPQGGATHFPGLN---------------ITVFPRLGAALFWYNLD 482

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 483 TQGMLQVRTMHTGCPVIVGSKWVVSKWIDDKGQ 515


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 102/209 (48%), Gaps = 26/209 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E T   R S   ++S  ED 
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLE-TVHYRISKSAWLSGYEDP 365

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D     E  P   Q      
Sbjct: 366 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 421

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +G 
Sbjct: 422 RIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFASGE 466

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 467 GDYSTRHAACPVLVGNKWVSNKWLHERGQ 495


>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 221

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 91/197 (46%), Gaps = 34/197 (17%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           + + NF S  +C+ II  A  ++K S +   +   V      RTS GTF+    D   ++
Sbjct: 1   MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
             IEH++A  + LP +H E   VLRY    KY  H D            +R+A+ L+YL 
Sbjct: 57  AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106

Query: 205 DVEEGGETMFPFENGIFLDSGYDYKKCIGLKV--KPRRGDGLLFYSLFPN-GTIDRTSLH 261
             E                   +  +C   +V  KP+RGD L+F+   P+    D  S+H
Sbjct: 107 QAERA-----------------NLSQCARGRVAYKPKRGDALMFFDTMPDYKQTDVHSMH 149

Query: 262 GSCPVIKGEKWVATKWI 278
             CPV++G KW A KW+
Sbjct: 150 TGCPVVEGVKWNAVKWL 166


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 102/211 (48%), Gaps = 32/211 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + N  S  + + I   AK RL  ++  +R  +T V +T   R S   ++   ED 
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E +  +I   T L     E   +  Y +G +Y+ H+D        AF     G   
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTG--- 451

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +
Sbjct: 452 -NRVATFLNYMSDVEAGGATVFP-----------DF----GAAIYPKKGTAVFWYNLFRS 495

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 496 GEGDYRTRHAACPVLVGCKWVSNKWIHERGQ 526


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 80/173 (46%), Gaps = 23/173 (13%)

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G         RTS  TFI AS  K  +L  I+ ++A  T L   + E      Y IG  Y
Sbjct: 362 GHNESLVSNVRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHY 419

Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
             H D F    +       P+M  R+A+ L YLSDV +GG T FP    +          
Sbjct: 420 GQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRTL---------- 469

Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
                +KP++     +++L  +G  D  + HG+CP+I G KWV  +WIR+ +Q
Sbjct: 470 -----LKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIREFDQ 517


>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 258

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 76/151 (50%), Gaps = 6/151 (3%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  + +SW PRA  + NF S  +C  +     KR+  S L +            RTS G 
Sbjct: 6   IRIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGA 64

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQ 191
                ED   ++  +E +IA  T LP  +GE   +LRY  GQKYD+H+D F +P  +   
Sbjct: 65  AFGRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAY 122

Query: 192 MSQ--RLASFLLYLSDVEEGGETMFPFENGI 220
           + +  R A+ LLYLS VE GGET  P  + I
Sbjct: 123 LHEGNRYATVLLYLSGVEGGGETNLPLADPI 153


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 59/219 (26%), Positives = 100/219 (45%), Gaps = 23/219 (10%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ +  +    +P  + + +    E+ +++   A  R K + +       +E+ K  R S
Sbjct: 335 IQPLKMEEAFLKPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAK-YRIS 393

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY- 188
              F+   E    +L++   ++   T L  +  E   V  Y IG  Y+ H+D     E  
Sbjct: 394 KAAFLKNKEHHH-VLKM-SRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETI 451

Query: 189 ----GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
                     R+A++L Y+SDVE GG T+FP                + + + P++G   
Sbjct: 452 GFNKDSGWRNRIATWLFYMSDVEAGGATVFP---------------ALNVALWPQKGSAA 496

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +Y+LFPNG  +  + H +CPV+ G KWVA KWI ++ Q
Sbjct: 497 FWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKNQ 535


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 78/161 (48%), Gaps = 21/161 (13%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           R +   F+  SE    ++  +  ++   T L     E   V  Y IG  Y  H+D     
Sbjct: 373 RIAKAAFLKDSEH--NLIVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQG 430

Query: 187 E-YGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
           E +GP+      R+A++L Y+SDVE GG T+FP                +G  + P++G 
Sbjct: 431 EIHGPRDLDWGNRIATWLFYMSDVEAGGATVFP---------------AVGAALWPQKGS 475

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
              +Y+L PNG  D  +LH  CPV+ G KWV+ KWI ++ Q
Sbjct: 476 AAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQ 516


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 44  KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 101

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 102 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 157

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 158 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 200

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 201 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 232


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 345

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 346 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 401

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 402 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 444

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 445 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 476


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 508

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 105/220 (47%), Gaps = 26/220 (11%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQGETVESTKGTRTSS 130
           ++   VLS  PR     +F S  + + ++  A KR LK S +         +   TRTS+
Sbjct: 278 NMTMTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTST 337

Query: 131 GTFISASED---------KTGILELIEH--KIARATMLPQ---TH---GEAFNVLRYEIG 173
             +I   +D            +L++ E   +  R + +P+   +H    E   ++ Y++G
Sbjct: 338 NDWIPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVG 397

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
           Q+Y  H+D   P     Q S R A+ L YL+D  +GGET FP       + G        
Sbjct: 398 QQYTPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFPRWLHADEEGG-------S 449

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWV 273
           LKVKP +G  +LFY+L P+G  D  S H + PV +GEKW+
Sbjct: 450 LKVKPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKWL 489


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 359

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 360 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 415

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 416 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 458

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 459 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 490


>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
 gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
          Length = 575

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 66/217 (30%), Positives = 104/217 (47%), Gaps = 26/217 (11%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + LS  P  +Y   F    +C+++I  A+ R+K + ++L     V  ++G RT S  ++
Sbjct: 50  METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEYG 189
              E+   +   I  ++A+    P  + E   V+ Y   Q+Y  HYDA++          
Sbjct: 107 RYQEEP--LARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
            Q  QR+ + LLYL++VEEGG T FP                 G++V PR+G   +F ++
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP---------------NAGVEVAPRKGRIAIFNNV 209

Query: 250 FPN-GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
             + G     SLHG  PV  GEKW A+ W R +  HE
Sbjct: 210 GADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAHE 246


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 96/209 (45%), Gaps = 24/209 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +++ N    E+ ++I   A+ R K + +   +   +E     R S   ++   E K 
Sbjct: 208 PRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 267 --VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + PR+G    +Y+L PNG  D 
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 369

Query: 258 TSLHGSCPVIKGEKWVATKWI--RDQEQH 284
            + H +CPV+ G KWVA KW+  R QE H
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQEFH 398


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S  + +++   AK RL+ + +   Q   + +T   R S   ++ A E  
Sbjct: 331 RPHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL-TTAPYRVSKSAWLGAFEHP 389

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             +++ I  +I   T L  +  E   V  Y +G +Y+ HYD        AF     G   
Sbjct: 390 --VVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTG--- 444

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++LLY+S+V+ GG T+F                 IG  V P++G  + +Y+L P+
Sbjct: 445 -NRIATWLLYMSEVQAGGATVFT---------------DIGASVSPKKGSAVFWYNLHPS 488

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 489 GDGDYRTRHAACPVLLGNKWVSNKWIHERGQ 519


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
          Length = 538

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D     E  +  
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL+DV +GG T FP                + + V P+ G  L++Y+L 
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 481

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 482 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 236

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 237 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 292

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 293 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 335

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 336 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 367


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 159

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 160 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 215

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 216 --NRIATWLFYMSDVSAGGATVFPE---------------VGASVWPKKGTAVFWYNLFA 258

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 259 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 290


>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
          Length = 538

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D     E  +  
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL+DV +GG T FP                + + V P+ G  L++Y+L 
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 481

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 482 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514


>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
          Length = 538

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D     E  +  
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL+DV +GG T FP                + + V P+ G  L++Y+L 
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 481

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 482 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 514


>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
           4-dioxygenase (proline 4-hydroxylase), alpha 1
           polypeptide [Ciona intestinalis]
          Length = 195

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 79/144 (54%), Gaps = 18/144 (12%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP---QMSQRLASF 199
           +++ +  +I+  T L     E   +  Y +G +Y+ H+D    +++G    ++  R+A+F
Sbjct: 50  VIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGNRIATF 109

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           L Y+S+VE+GG T+F                  G+ V+P +G  + +Y+L P+G  D  +
Sbjct: 110 LTYMSNVEQGGSTVFLHP---------------GIAVRPIKGSAVFWYNLLPSGAGDERT 154

Query: 260 LHGSCPVIKGEKWVATKWIRDQEQ 283
            H +CPV+ G KWV+ KWI +++Q
Sbjct: 155 RHAACPVLTGVKWVSNKWIHERDQ 178


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 95/208 (45%), Gaps = 25/208 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKP--SQLALRQGETVESTKGTRTSSGTFISASE 138
           +P+     N  S  + + I   A+ RL+P  +Q     G  + S    R S   ++   E
Sbjct: 320 KPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSY---RISKNAWLYYWE 376

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQR 195
            +  ++  ++ ++  AT L     E   V+ Y IG  Y+ H+D     E     P    R
Sbjct: 377 HR--LINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDR 434

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ L Y+SDVE GG T+FP                +G +V P +G G  +Y+L  +G  
Sbjct: 435 IATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNLLKSGEG 479

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           D  + H  CPV+ G KWV+  WI ++ Q
Sbjct: 480 DMLTEHAGCPVLVGSKWVSNMWIHERGQ 507


>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
 gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
          Length = 535

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 318 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 376

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D     E  +  
Sbjct: 377 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 433

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL+DV +GG T FP                + + V P+ G  L++Y+L 
Sbjct: 434 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 478

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 479 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 511


>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
          Length = 150

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 83/156 (53%), Gaps = 29/156 (18%)

Query: 135 SASEDKTGILELIEHKIARATML-PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----- 188
           SAS DK      +  +++ AT L  + + E F V  Y IG  Y+ H+D F+  +Y     
Sbjct: 8   SASADK------LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPV 60

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
              QM  R+A+F++YL+DVE GG T+FP  N               L ++P +   + ++
Sbjct: 61  LNEQMGDRIATFMIYLNDVEAGGRTVFPRLN---------------LVIEPIKNSAVFWH 105

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L  +G  D  ++HG+CPV+ G KWVA KWI +  Q
Sbjct: 106 NLLDDGQQDDRTIHGACPVVLGRKWVANKWIHEYGQ 141


>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
           12442]
 gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
           12442]
          Length = 195

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 98/218 (44%), Gaps = 39/218 (17%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P    +    +  +CQ +I  +KK ++P+Q     GE        R S  T++  
Sbjct: 7   VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
                G++  +   IA A  LP  H E     RYE+G K+D+H D +        N  E 
Sbjct: 59  YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G    QRL + +LYL+ V  GGET FP                + L V P  G  L+F +
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGKLLVFEN 158

Query: 249 LFPNGTID--RTSLHGSCPVIKGEKWVATKWIRDQEQH 284
               GT +    SLH  C V +GEKW+AT W R++ Q+
Sbjct: 159 C-KRGTNEPHPLSLHEGCAVHEGEKWIATLWFREKPQY 195


>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
 gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
          Length = 536

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 104/213 (48%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 319 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 377

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D     E  +  
Sbjct: 378 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 434

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL+DV +GG T FP                + + V P+ G  L++Y+L 
Sbjct: 435 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 479

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 480 TEGMLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 512


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 99/216 (45%), Gaps = 20/216 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ I  +    +PR + + +  S E+ ++I   A+ R + + +  ++    E ++  R +
Sbjct: 334 IQPIKMEEALLKPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSR-YRIA 392

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              ++   E     +  I  ++   T L     E   V  Y IG  Y+ HYD     E  
Sbjct: 393 KSAWLKHEEHD--YVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQ 450

Query: 190 PQMSQ--RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
                  R+A++L Y+SDVE GG T+FP  N               L + P++G    ++
Sbjct: 451 QDFGWGGRIATWLFYMSDVEAGGATVFPKLN---------------LSLWPQKGSAAFWF 495

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L+PNG  +  + H  CPV+ G KWVA  WI ++ Q
Sbjct: 496 NLYPNGEGNEMTQHAGCPVLTGSKWVANYWIHERGQ 531


>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
          Length = 421

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 104/252 (41%), Gaps = 44/252 (17%)

Query: 60  LPHGETGDASIQSIP-------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
            P  E GD     +P        +V S  PR L   +F + E+C  +I++AK  +  S +
Sbjct: 172 FPTCEVGDTFDIPLPSANRNSIAKVRSISPRVLEVEDFLTPEECHELISSAKPLMSRSTV 231

Query: 113 ALRQGETVESTKGTRTSSGTFISASE--------DKTGILELI-----EHKIARATMLPQ 159
           +      V   + +RTSS  ++            D+   L  I     EH +        
Sbjct: 232 SAEGDSAVSLQESSRTSSTAWLPPHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAID 291

Query: 160 THG----EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGE 211
             G     A+ VLRYE+ Q Y  H+D F+P  +   +      R  +   YL+DVE G  
Sbjct: 292 KRGGSSVTAWQVLRYEVNQHYHIHHDYFDPVLHRGFLQGDGRNRFITAFFYLTDVERGDP 351

Query: 212 TMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDGLLFYSLFPNGT----IDRTSLHGSCPV 266
                          DY  C  GL+V P+RG  ++FYSL  +G     +D  S HG C V
Sbjct: 352 RPIT-----------DYSDCNRGLRVPPKRGKAIIFYSLLADGQRSGGLDVASWHGGCDV 400

Query: 267 IKGEKWVATKWI 278
             G KW A  WI
Sbjct: 401 HNGTKWAANYWI 412


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 95/194 (48%), Gaps = 30/194 (15%)

Query: 99  IIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASEDKTGILELIEHKIARATML 157
           ++ TA  R+   +  +   +T + +T   R S   +++A E    +++ I  +I   T L
Sbjct: 338 VLETAHYRISKRRATVHDPQTGKLTTAQYRVSKSAWLAAYEHP--VVDRINQRIEDITGL 395

Query: 158 PQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMSQRLASFLLYLSDVEEG 209
                E   V  Y +G +Y+ H+D        AF     G     R+A++L Y+SDV  G
Sbjct: 396 NVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG----NRIATWLFYMSDVAAG 451

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T+FP                +G  VKP +G  + +Y+LFP+G  D ++ H +CPV+ G
Sbjct: 452 GATVFPE---------------VGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVG 496

Query: 270 EKWVATKWIRDQEQ 283
            KWV+ KWI ++ Q
Sbjct: 497 NKWVSNKWIHERGQ 510


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 533

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 57/221 (25%), Positives = 102/221 (46%), Gaps = 24/221 (10%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           GETG   +Q I  +  + +P  +   +         +IA AK RL+ S+      +    
Sbjct: 307 GETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363

Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
               RTSS T++   +      + + ++  +   T+  +   E + +  Y IG  Y  H+
Sbjct: 364 GPPPRTSSNTWLDDDDAPVAARVNQYLQSLLGLGTLYGKDEAEKYQLANYGIGGHYVPHH 423

Query: 181 D----AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 236
           D    +   ++       R+A+ ++Y+SDVEEGG T+FP                +G++V
Sbjct: 424 DYLEESLTSSKKHRLFGDRVATLMIYMSDVEEGGATVFP---------------SLGVRV 468

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
            PR+GD + ++++  +   D  + H  CPV+ G KW+A KW
Sbjct: 469 SPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGSKWIANKW 509


>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 594

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 100/210 (47%), Gaps = 30/210 (14%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + + N  S +  + +   AK RL+ + ++      +E T   R S   ++ A E   
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATISNPVTGVLE-TAHYRISKSAWLGAYEHP- 452

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 193
            +++ I   I   T L     E   V  Y +G +Y+ H+D        AF     G    
Sbjct: 453 -VVDKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTG---- 507

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
            R+A++LLY++DV+ GG T+F                 IG  VKP++G  + +Y+L+P+G
Sbjct: 508 NRIATWLLYMTDVQAGGATVFT---------------DIGAAVKPKKGTAVFWYNLYPSG 552

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 553 EGDYRTRHAACPVLLGNKWVSNKWIHERGQ 582


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL+  +  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLR--RATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 152

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 78/164 (47%), Gaps = 24/164 (14%)

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS    +   +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD 
Sbjct: 2   VHAARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDY 59

Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           F+P   G  +      QR+AS ++YL+  E GG T FP  +               L V 
Sbjct: 60  FDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAH---------------LDVA 104

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
             +G+ + F    P+      SLH   PV+ GEKWVATKW+R++
Sbjct: 105 AVKGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRER 146


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------DVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 28/211 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   +   V      RT+ G ++    ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L YL+DVE+GG T       +F D GY         V P+ G  + +Y+L  +
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTD 486

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517


>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 193

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 103/209 (49%), Gaps = 23/209 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + LS  PRA    NF +  +   I+   +K+       +++  T      TRTSS T+++
Sbjct: 2   KALSCAPRAFQVENFLTDVEADHIVGLVQKKND-----MQRSSTNGHISETRTSSTTWLA 56

Query: 136 ASEDKTGILELIEHKIARA-----TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
              D   +++ I  ++A        ML +   E   ++ Y +GQ+Y +H+D F   +  P
Sbjct: 57  RHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGDP 113

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP-FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
               R  +F +YL+DV  GG+T FP + N          +    L V P++G  ++FY +
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFPRWRNA---------ETNGALNVVPKKGTAMIFYMV 164

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
            P+G +D  + H + PVI+GEK+ +  WI
Sbjct: 165 NPDGNLDDLTHHAALPVIEGEKFFSNLWI 193


>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
           laterosporus GI-9]
 gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
           laterosporus GI-9]
          Length = 201

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 107/217 (49%), Gaps = 25/217 (11%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P Q+L+ +P    +P+  S+E CQS+I  A+ +L P+ +  + G  V      R S   +
Sbjct: 4   PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSHV---RISELAW 60

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
              + ++  +++ I  +IA     P  + E   V  Y  G K+++H D ++  E    + 
Sbjct: 61  FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFL 118

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
               QRL + +LYL+DV  GGET FP                + ++V P  G  L+F + 
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163

Query: 250 FPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
            P+ +I D  SLHGS  +  GEKW+ T W  ++ Q++
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/212 (27%), Positives = 96/212 (45%), Gaps = 29/212 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P  + +    SA +   ++  A + +K +++   Q E   +T   RT+ G ++    ++
Sbjct: 325 KPYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE 381

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQ 191
             +   I  +I   T       E F V+ Y IG  Y  H+D F  A          +   
Sbjct: 382 --MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIV 439

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
           +  R+A+ L YL+DVE+GG T+F                 +G  V P+ G  + +Y+L  
Sbjct: 440 LGDRIATVLFYLTDVEQGGATVF---------------GNVGYSVYPQAGTAIFWYNLDT 484

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D  + H SCPV+ G KWV T+WI +  Q
Sbjct: 485 DGNGDPLTRHASCPVVVGSKWVMTEWIHEARQ 516


>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
 gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
          Length = 518

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 103/213 (48%), Gaps = 21/213 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S  +   I +++K ++ PS+  +      E  K   + S  
Sbjct: 301 IKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVW 359

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D     E  +  
Sbjct: 360 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 416

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL+DV +GG T FP                + + V P+ G  L++Y+L 
Sbjct: 417 GYIDRLATTLFYLNDVPQGGATHFP---------------GLNITVFPKFGTVLMWYNLH 461

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 462 TEGLLHVRTMHTGCPVIVGSKWVVSKWIDDKGQ 494


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/212 (29%), Positives = 103/212 (48%), Gaps = 32/212 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
              R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF 
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFA 490

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 491 SGEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 28/211 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   +   V      RT+ G ++    ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L YL+DVE+GG T       +F D GY         V P+ G  + +Y+L  +
Sbjct: 442 GDRIATVLFYLTDVEQGGAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTD 486

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPVI G KWV T+WIR++ Q
Sbjct: 487 GNGDPRTRHAACPVIVGSKWVMTEWIREKRQ 517


>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
 gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
          Length = 201

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 107/217 (49%), Gaps = 25/217 (11%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P Q+L+ +P    +P+  S+E CQS+I  A+ +L P+ +  + G  V      R S   +
Sbjct: 4   PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSHV---RISELAW 60

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
              + ++  +++ I  +IA     P  + E   V  Y  G K+++H D ++  E    + 
Sbjct: 61  FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFL 118

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
               QRL + +LYL+DV  GGET FP                + ++V P  G  L+F + 
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFPN---------------LKIEVSPTTGTLLVFENC 163

Query: 250 FPNGTI-DRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
            P+ +I D  SLHGS  +  GEKW+ T W  ++ Q++
Sbjct: 164 QPDTSIPDLRSLHGSKILQSGEKWIGTLWFCERPQYQ 200


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
          Length = 546

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 23/165 (13%)

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS  TFI  +  K  +L  I+ ++A  T L   + E      Y IG  Y  H D F 
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430

Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
              +       P+M  R+A+ L YLSDV +GG T FP                +   +KP
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFP---------------QLRTLLKP 475

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           ++     +++L  +G  D  + HG+CP+I G KWV  +WIR+ +Q
Sbjct: 476 KKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQ 520


>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 615

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/211 (28%), Positives = 101/211 (47%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S  + + +   AK RL+ + ++      +E T   R S   +++  +D 
Sbjct: 415 RPYIVRYLDIISDAEIERVKQLAKPRLRRATISNPITGVLE-TASYRISKSAWLTEYDDP 473

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 474 --MIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 528

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 529 -NRIATWLFYMSDVSAGGATVFP---------------DVGAAVWPQKGTAVFWYNLFAS 572

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 573 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 603


>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
 gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
          Length = 547

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 23/165 (13%)

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS  TFI  +  K  +L  I+ ++A  T L   + E      Y IG  Y  H D F 
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430

Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
              +       P+M  R+A+ L YLSDV +GG T FP                +   +KP
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFP---------------QLRTLLKP 475

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           ++     +++L  +G  D  + HG+CP+I G KWV  +WIR+ +Q
Sbjct: 476 KKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWIRENDQ 520


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 267

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/231 (30%), Positives = 108/231 (46%), Gaps = 35/231 (15%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q    +VLS  PR + FP+F +  +C+   + ++++L  +++ L  G         RT+
Sbjct: 46  LQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL-GGPPEGGFSLRRTN 104

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH--YDAFNPAE 187
              ++S  +D   +L  +  +IA AT L  T  E + V  Y +G  Y  H  Y  F  A+
Sbjct: 105 KVAWMS--DDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQ 162

Query: 188 YGPQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
                S   RLA+ L+YL+DV  GG T F                 + L VKP  G  L 
Sbjct: 163 GDIYKSSGNRLATMLIYLADVAGGGATAF---------------INMRLAVKPTLGTALF 207

Query: 246 FYSLFP-NGTI------------DRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L P +G I            D  + H  CPV+ G KW+ TKWI ++EQ
Sbjct: 208 WYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIHEREQ 258


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 94/209 (44%), Gaps = 26/209 (12%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P  + + +     +   I      R+  + + L    TV +    RTS  TFI+ +
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSNV---RTSQITFIAKT 380

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 191
           E +  +L+ I+ ++A  T L   + E      Y IG  Y  H D F    +        +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
           M  R+A+ L YLSDV +GG T FP+               +   ++P++     +++L  
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY---------------LKQHLRPKKYAAAFWHNLHA 483

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
            G  D  + HG+CP+I G KWV  +WIR+
Sbjct: 484 AGRGDARTQHGACPIIAGSKWVLNRWIRE 512


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 294

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 96/202 (47%), Gaps = 31/202 (15%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
           +F S  +C ++IA A   +  S +     GE  ES    RTSS  F+ A ED    L  +
Sbjct: 108 DFFSGPECDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTV 158

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLY 202
            HK+   T  P  H E   V RY   QKY +H+DAF     +   +     QR+ + L+Y
Sbjct: 159 CHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVY 218

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           L+DV  GG T FP                +G+KV+PR+G  ++F+    +G +D   LH 
Sbjct: 219 LNDVPSGGCTAFPQ---------------LGMKVQPRKGMAVVFFPATLDGVLDSRLLHA 263

Query: 263 SCPVIKGEKWVATKWIRDQEQH 284
           + P I   KWV+  WIR    H
Sbjct: 264 AEPAID-TKWVSQIWIRQGAYH 284


>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
 gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
          Length = 508

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/223 (27%), Positives = 112/223 (50%), Gaps = 36/223 (16%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  +S  P  + + +    +  Q +IA A+ RL+P+++   + +  E+    R++ G
Sbjct: 295 PFKMEEISLEPYIVVYHDILPDKDMQQLIALAEPRLRPTEVF--EEDKSEARTSDRSALG 352

Query: 132 TFISASE-DKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE- 187
           TF+   + + +G  +L+ +  ++   T +   H   FN+++Y  G +Y +++D FN    
Sbjct: 353 TFLPFKDMNPSGGPLLDRLTQRMRDITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNS 412

Query: 188 ----YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
               YG     R+A+ L YL+D   GG T+FP                I +KV   RG  
Sbjct: 413 EMEGYG----DRMATVLFYLNDAPNGGATVFP---------------RIDVKVTAERGKV 453

Query: 244 LLFYSLFPNG---TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L +++L  NG    ++  +LH +CPV +G KWV   WI + +Q
Sbjct: 454 LFWHNL--NGETHDVEPNTLHAACPVFQGSKWVMAAWIHEYDQ 494


>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
 gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
          Length = 496

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 94/199 (47%), Gaps = 32/199 (16%)

Query: 84  ALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS--EDKT 141
           ALY    ++AEQ   ++      L  SQL  ++G   +  +       TF SAS   + T
Sbjct: 312 ALYHEVVSAAEQRHLML------LSESQLQRQRGHQYDKIR-------TFASASVAANAT 358

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ--RLASF 199
             +E +  ++   T L     E   +L Y IG +Y  H D   P  +     +  RLA+ 
Sbjct: 359 PTVEQLHRRLEDITGLDLAESEPLRILNYGIGGQYYIHVDCEQPQTHVEPYPKEYRLATV 418

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           LLYLSDV  GG T FP                +GL ++P RG  L++++    G  D  +
Sbjct: 419 LLYLSDVRLGGFTSFP---------------ALGLGIRPNRGSALVWHNANNAGNCDYRA 463

Query: 260 LHGSCPVIKGEKWVATKWI 278
           LH +CPV+ G +WVA+KWI
Sbjct: 464 LHAACPVLLGTRWVASKWI 482


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 24/209 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    E+ ++I   A+ R K + +   +   +E     R S   ++   E K 
Sbjct: 269 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 327

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 328 --VAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 385

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + PR+G    +Y+L PNG  D 
Sbjct: 386 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWYNLKPNGEGDF 430

Query: 258 TSLHGSCPVIKGEKWVATKWI--RDQEQH 284
            + H +CPV+ G KWVA KW+  R QE H
Sbjct: 431 KTRHAACPVLTGSKWVANKWLHERGQEFH 459


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 83/173 (47%), Gaps = 28/173 (16%)

Query: 119 TVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
           TV   KG+     RTS  TFI  +  K  +L+ I+ ++A  + L   + E      Y IG
Sbjct: 359 TVIGAKGSEVSKVRTSQFTFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIG 416

Query: 174 QKYDSHYDAF------NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD 227
             Y  H D F      N     P+M  R+A+ L YLSDV +GG T FP    +       
Sbjct: 417 GHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLKQL------- 469

Query: 228 YKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
                   ++P++     +++L  +G  D  +LHG+CP+I G KWV  +WIR+
Sbjct: 470 --------LQPKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKWVQNRWIRE 514


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 26/208 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + + +     +   I    + RLK + +    G         RTS  TFI  S  K 
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVT---GHNESVVSNVRTSQFTFIPVSAHK- 357

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQMSQR 195
            +L  I+ ++A  T L   + E      Y IG  Y  H D F            P+M  R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ L YLSDV +GG T FP                +   +KP++     +++L  +G  
Sbjct: 417 IATVLFYLSDVSQGGGTAFP---------------QLRTLLKPKKYAAAFWHNLHASGVG 461

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           D  + HG+CP+I G KWV  +WIR+ +Q
Sbjct: 462 DVRTQHGACPIIAGSKWVQNRWIREVDQ 489


>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
 gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
          Length = 536

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 20/160 (12%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTS G   + S+  T   + +   +A  + L   + E   +  Y IG  Y+ H+D+F   
Sbjct: 372 RTSQGASFNYSQYATT--QRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEH 429

Query: 187 EYGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
              P+      RLA+ + YLSDV  GG T FPF               + L V P RG  
Sbjct: 430 HEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGSL 474

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L +Y+L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 475 LFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514


>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
 gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
          Length = 539

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 99/219 (45%), Gaps = 21/219 (9%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-R 127
           S + +  + L   P  +   N  S +    +   A+  ++ SQ+  +     E+     R
Sbjct: 316 SYEVLELEELHQDPFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETVAAAYR 375

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPA 186
           TS G      E ++  +EL+   +A  + L     E   +  Y IG  Y+ H+D F +  
Sbjct: 376 TSKGATFEYFEHRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHH 433

Query: 187 EYGP--QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
            Y P  +   R+A+ + YLS+VE GG T FPF               + L V P RG  +
Sbjct: 434 VYLPDDRDGNRIATGIYYLSEVEAGGGTAFPF---------------LPLLVTPERGSLV 478

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +Y+L  +G  D  + H +CPV++G KW+A  WIR   Q
Sbjct: 479 FWYNLHRSGDQDYRTKHAACPVLQGSKWIANVWIRQSNQ 517


>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
 gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
          Length = 536

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 79/160 (49%), Gaps = 20/160 (12%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTS G   + S+  T   + +   +A  + L   + E   +  Y IG  Y+ H+D+F   
Sbjct: 372 RTSQGASFNYSQYATT--QRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEH 429

Query: 187 EYGPQ---MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
              P+      RLA+ + YLSDV  GG T FPF               + L V P RG  
Sbjct: 430 HEYPEDDLYGNRLATAIYYLSDVVAGGGTAFPF---------------LPLLVTPERGSL 474

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L +Y+L P+G  D  + H +CPV++G KW+A  WIR++ Q
Sbjct: 475 LFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQ 514


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
 gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
          Length = 540

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 109/246 (44%), Gaps = 24/246 (9%)

Query: 43  RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
           R +L+     +++    L  G    +S Q    + L   P  +   +  SAE+   +   
Sbjct: 292 REELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQL 351

Query: 103 AKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
           A+  L+ S + +L   E +  +   R S GTF    E    I++ +   +   + L    
Sbjct: 352 ARPELQRSMVYSLSNSEHI--STNFRISQGTFFEYHEHP--IMQRMSQHLENISGLDMRS 407

Query: 162 GEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM---SQRLASFLLYLSDVEEGGETMFPFE 217
            E   V  Y IG  Y+ H D+F+    YG      + R+A+ + YLS+VE GG T FPF 
Sbjct: 408 AEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF- 466

Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
                         + L V+P RG  L +Y+L  +G +D  + H  CPV+ G KW+A  W
Sbjct: 467 --------------LPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVW 512

Query: 278 IRDQEQ 283
           IR   Q
Sbjct: 513 IRLSNQ 518


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + +     + + I   A+ RLK + +   +   +E     R S   ++   ED  
Sbjct: 44  PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 101

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
            ++  +  ++   T L     E   V+ Y +G  YD HYD     E     S     R+A
Sbjct: 102 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDV +GG T+FP+               +G+ ++P +G   ++++L+P+G  D 
Sbjct: 161 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 205

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV++G KWV  KW+ +  Q
Sbjct: 206 RTRHAACPVLQGSKWVCNKWLHEAGQ 231


>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
 gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
 gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
 gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
          Length = 195

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 97/218 (44%), Gaps = 39/218 (17%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P    +    +  +CQ +I  +KK ++P+Q     GE        R S  T++  
Sbjct: 7   VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
                G++  +   IA A  LP  H E     RYE+G K+D+H D +        N  E 
Sbjct: 59  YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G    QRL + +LYL+ V  GGET FP                + L V P  G  L+F +
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP---------------SLNLTVTPSEGKLLVFEN 158

Query: 249 LFPNGTID--RTSLHGSCPVIKGEKWVATKWIRDQEQH 284
               GT +    SLH  C V +GEKW+ T W R++ Q+
Sbjct: 159 C-KRGTNEPHPLSLHEGCAVHEGEKWIVTLWFREKPQY 195


>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
 gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
          Length = 242

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/221 (29%), Positives = 100/221 (45%), Gaps = 24/221 (10%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGT 126
           +S Q    + L   P  +   +  SAE+   +   A+  L+ S + +L   E + +    
Sbjct: 19  SSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHISTN--F 76

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           R S GTF    E    I++ +   +   + L     E   V  Y IG  Y+ H D+F+  
Sbjct: 77  RISQGTFFEYHEHP--IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSEN 134

Query: 187 E-YGPQM---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             YG      + R+A+ + YLS+VE GG T FPF               + L V+P RG 
Sbjct: 135 HNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF---------------LPLLVEPERGS 179

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            L +Y+L  +G +D  + H  CPV+ G KW+A  WIR   Q
Sbjct: 180 LLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRLSNQ 220


>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
 gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
          Length = 422

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 91/198 (45%), Gaps = 30/198 (15%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE-----L 146
           SA++C+ ++  A+  L+ SQ+      +   T   RTS G  +        ILE      
Sbjct: 242 SADECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRGATLDP------ILEDFAARA 294

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLASFLLYL 203
            + ++A    LP TH EA +VL Y  G+ Y +H D   P       P    RL +  +YL
Sbjct: 295 AQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYL 354

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +DV+ GGET FP                 G++V+PR G  + F +L  +G  D  SLH  
Sbjct: 355 NDVDAGGETEFPVA---------------GIRVQPRAGSVVCFDNLQADGCPDPDSLHAG 399

Query: 264 CPVIKGEKWVATKWIRDQ 281
            PV  G KW+ T W R Q
Sbjct: 400 LPVTTGSKWLGTLWFRQQ 417


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 103/207 (49%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           PR + + +  S E+ + I   AK RL  ++  +R  +T V +    R S   ++   +D 
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTVANYRVSKSAWLEEYDDP 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++   T L +   E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 394 --VIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGNRLA 450

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           ++L Y+SDVE GG T+FP           D+    G  + PR+G  + +Y+LF +G  D 
Sbjct: 451 TYLNYMSDVEAGGATVFP-----------DF----GAAIWPRKGTAVFWYNLFRSGEGDY 495

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQH 284
            + H +CPV+ G KWV+ KW  ++ Q 
Sbjct: 496 RTRHAACPVLVGSKWVSNKWFHERGQE 522


>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
 gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
          Length = 480

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 55/194 (28%), Positives = 101/194 (52%), Gaps = 25/194 (12%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F   ++CQ++I   ++  +PS +      +    +  RTSS   +   +D   ++  I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT-----SENPDQQFRTSSTCHLGNMQDP--VIRKID 155

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSD 205
            +I +   +  ++ E      Y++GQ++  H D F P   A YG    QR  +F++YL++
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215

Query: 206 VEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCP 265
           VE+GG+T+FP             +  IG K K  +G  +++ ++ P+G+++  +LH   P
Sbjct: 216 VEQGGDTVFP-------------ELAIGFKAK--KGMAVIWNNINPDGSVNYQTLHQGMP 260

Query: 266 VIKGEKWVATKWIR 279
           V KGEK + TKW R
Sbjct: 261 VQKGEKLIITKWFR 274


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVLAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 102/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  + H++   T L     E   V  Y +G +Y+ H+D +  P + G +    R+A+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVAT 456

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 457 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 501

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 502 TRHAACPVLVGCKWVSNKWFHERGQ 526


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + +     + + I   A+ RLK + +   +   +E     R S   ++   ED  
Sbjct: 329 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFA-DYRISKSAWLKEHEDV- 386

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
            ++  +  ++   T L     E   V+ Y +G  YD HYD     E     S     R+A
Sbjct: 387 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 445

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDV +GG T+FP+               +G+ ++P +G   ++++L+P+G  D 
Sbjct: 446 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 490

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV++G KWV  KW+ +  Q
Sbjct: 491 RTRHAACPVLQGSKWVCNKWLHEAGQ 516


>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
          Length = 522

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 83/154 (53%), Gaps = 18/154 (11%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           R S   ++   +  T  +E    +I+R T L   + E   +  Y IG +Y+ HYD ++  
Sbjct: 367 RVSKSAWLKDEDSDT--VEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYD-YSRR 423

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           E+    ++R+A++L YL+ VE+GG T+F                 +GL ++  +G  + +
Sbjct: 424 EWDIYNNRRIATWLSYLTTVEQGGGTVFT---------------ELGLHIRSIKGSAVFW 468

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           Y+L PNG+ D  + H +CPV++G KWV+ KWI +
Sbjct: 469 YNLLPNGSGDERTRHAACPVLRGNKWVSNKWIHE 502


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/216 (27%), Positives = 100/216 (46%), Gaps = 24/216 (11%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V     RP    F +  +  +  +I   A+ R K + +       +E  +  R S  
Sbjct: 322 PFKVEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQ-YRISKS 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E K   +  +  +++  T L  +  E   V+ Y IG  Y+ H+D     E    
Sbjct: 381 AWLKEEEHKH--IADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAF 438

Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
            S     R+A+ L Y+SDVE+GG T+FP                I + + P++G    +Y
Sbjct: 439 KSLGTGNRIATVLFYMSDVEQGGATVFP---------------SIQVSLWPQKGSAAFWY 483

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L P+G  D+ + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQ 519


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 55/206 (26%), Positives = 97/206 (47%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + +     + + I   A+ RLK + +   +   +E     R S   ++   ED  
Sbjct: 347 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFA-DYRISKSAWLKEHEDV- 404

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
            ++  +  ++   T L     E   V+ Y +G  YD HYD     E     S     R+A
Sbjct: 405 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 463

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDV +GG T+FP+               +G+ ++P +G   ++++L+P+G  D 
Sbjct: 464 TVLFYMSDVAQGGATVFPW---------------LGVALQPVKGTAAVWFNLYPSGNGDL 508

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV++G KWV  KW+ +  Q
Sbjct: 509 RTRHAACPVLQGSKWVCNKWLHEAGQ 534


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    E+ ++I   A+ R K + +   +   +E     R S   ++   E K 
Sbjct: 343 PRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 401

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T L     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 402 --VAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 459

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + PR+G    +++L PNG  D 
Sbjct: 460 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWFNLKPNGEGDL 504

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWVA KW+ ++ Q
Sbjct: 505 RTRHAACPVLTGSKWVANKWLHERGQ 530


>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
 gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
          Length = 182

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 62/179 (34%), Positives = 89/179 (49%), Gaps = 27/179 (15%)

Query: 123 TKGTRT--SSGTFISASED-KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
           TK T +  SS  ++   ED K  IL  I     + +       E   + +Y++GQKY  H
Sbjct: 1   TKETESGFSSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLH 60

Query: 180 YDAFNPAEYGPQMSQ----RLASFLLYLSDVEEGGETMFPFENGI-------------FL 222
           YD+      G  M+Q    R A+FL+YL+DV+ GGET+FP    I              L
Sbjct: 61  YDS------GFLMNQRRVKRTATFLVYLNDVKSGGETIFPLATNISSIQLKKENVDKPSL 114

Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           DS    +  + +KV P     LLF++      +D  SLHGSCPV+ GEKW+A  W+ ++
Sbjct: 115 DSICGKENNM-VKVSPEAQSCLLFWNHVDGDDVDAFSLHGSCPVVSGEKWIAQIWLHNE 172


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 93/214 (43%), Gaps = 37/214 (17%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P+     N  +  + + I   A+ RL+ ++        VES     T  G   S    K
Sbjct: 321 KPKLWVLHNILTDPEMEVIKKLAQPRLRRAR--------VESPT---TGEGELASYRISK 369

Query: 141 TGILELIEHKIAR--------ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 189
           +  L   EH++ R         T L     E   V+ Y IG  Y+ H+D     E     
Sbjct: 370 SAWLYDWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALD 429

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
           P    R+A+ L Y+SDVE GG T+FP                +G +V P +G G  +Y+L
Sbjct: 430 PNEGDRIATMLFYMSDVEAGGATVFP---------------QVGARVVPEKGAGAFWYNL 474

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             +G  D  + H  CPV+ G KWV+ KWI ++ Q
Sbjct: 475 LKSGEGDMLTEHAGCPVLVGSKWVSNKWIHERGQ 508


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 101/211 (47%), Gaps = 30/211 (14%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  +   +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  +  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDV  GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFAS 491

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 492 GEGDYSTRHAACPVLVGNKWVSNKWLHERGQ 522


>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
 gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
          Length = 281

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 54/158 (34%), Positives = 77/158 (48%), Gaps = 21/158 (13%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFN 184
           R S   ++   +D+  I+  +  +I   T L  T    E   VL Y +G +Y+ H+D   
Sbjct: 126 RISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMT 183

Query: 185 PAE--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
             E  +G  +  R+A+FL+YLSDV  GG T+FP  N               + V   +  
Sbjct: 184 AEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVAN---------------VTVPVVKNA 228

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           GLLF  L  +G  D  SLH  CPV+ G KW+A KWI +
Sbjct: 229 GLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIHE 266


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 73/143 (51%), Gaps = 17/143 (11%)

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--SQR 195
           ++++ ++  +  ++A  T L  T  E   V+ Y IG  YD H+D     E   +     R
Sbjct: 75  DEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEKFNGNR 134

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTI 255
           +A+ L Y+SDV +GG T+F                 +GL V PRRG  + + +L P+G  
Sbjct: 135 IATVLFYMSDVAQGGATVF---------------TELGLSVFPRRGSAVFWLNLHPSGEG 179

Query: 256 DRTSLHGSCPVIKGEKWVATKWI 278
           D  + H +CPV++G KWV  KWI
Sbjct: 180 DLATRHAACPVLRGSKWVCNKWI 202


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 99/205 (48%), Gaps = 26/205 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      NF S ++C++ +   K +++ +++ +   E+      +RT+   ++  S   +
Sbjct: 17  PIVYVVNNFLSDDECEAFVEMGKGKMERAKV-ISDDES--EFHASRTNDFCWLEHS--AS 71

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRL 196
            ++  +  + +    +P  + E F ++ Y  G +Y  H+DAF+      Q +     QR+
Sbjct: 72  DVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRM 131

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-I 255
            + L YL+DVEEGG T FP                I + VKP +GD ++F++     T I
Sbjct: 132 VTALAYLNDVEEGGATDFP---------------KINVSVKPNKGDVVVFHNCIEGTTEI 176

Query: 256 DRTSLHGSCPVIKGEKWVATKWIRD 280
           +  +LHG  PV+ GEKW    W R+
Sbjct: 177 NPQALHGGSPVVAGEKWAVNLWFRE 201


>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 418

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 90/204 (44%), Gaps = 22/204 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
           PR   +    SA++C+ ++  A+  L+ S++ +   +        RTS G T     ED 
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLA 197
                  + ++A    LP  H E  +VL Y  G++Y +H D   P       P    R  
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +  +YL+DV  GGET FP                 G++V+PR G  + F +L  +G  D 
Sbjct: 345 TVCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDA 389

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
            SLH   PV  G KW+ T W R Q
Sbjct: 390 DSLHAGLPVTAGSKWLGTLWFRQQ 413


>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 418

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 90/204 (44%), Gaps = 22/204 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
           PR   +    SA++C+ ++  A+  L+ S++ +   +        RTS G T     ED 
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMSQRLA 197
                  + ++A    LP  H E  +VL Y  G++Y +H D   P       P    R  
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +  +YL+DV  GGET FP                 G++V+PR G  + F +L  +G  D 
Sbjct: 345 TVCVYLNDVGAGGETEFPVA---------------GVRVRPRPGTLVCFDNLHADGRPDA 389

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
            SLH   PV  G KW+ T W R Q
Sbjct: 390 DSLHAGLPVTAGSKWLGTLWFRQQ 413


>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 225

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 59/187 (31%), Positives = 98/187 (52%), Gaps = 24/187 (12%)

Query: 95  QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
           +C  +++  +  ++ S LA         T G R SS   I   ED   ++  IE +I+  
Sbjct: 2   ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI---EDI--VVSKIEDRISLW 47

Query: 155 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           + LP+ +GE+  VL+Y + +       +          + RLA+ L+YLSDV++GGET+F
Sbjct: 48  SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102

Query: 215 PFENGIFLDSGYDY---KKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEK 271
           P       D+        +C G  V+P +G+ +L ++L P+G  D+ S +  CPV++GEK
Sbjct: 103 PRSE--MKDAQAKEGAPSQCSGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEK 160

Query: 272 WVATKWI 278
           W+A K I
Sbjct: 161 WLAIKHI 167


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           ++ AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF N                 VKP+ G+ L +Y++  +  +D  + H  CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKG 498

Query: 270 EKWVATKWIRDQEQ 283
            KW+   WI +  Q
Sbjct: 499 SKWIGNVWIHEATQ 512


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           ++ AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF N                 VKP+ G+ L +Y++  +  +D  + H  CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKG 498

Query: 270 EKWVATKWIRDQEQ 283
            KW+   WI +  Q
Sbjct: 499 SKWIGNVWIHEATQ 512


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-QMSQRLASFLLYLSDVEEG 209
           ++ AT L  T+ E   V  Y +G  Y+ H+D F  +++ P +   R+A+ + YLSDVE+G
Sbjct: 398 LSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQG 457

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF N                 V+P+ G+ L +Y+L  +  +D  + H  CPV+KG
Sbjct: 458 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKG 502

Query: 270 EKWVATKWIRDQEQ 283
            KW+A  WI +  Q
Sbjct: 503 SKWIANIWIHEATQ 516


>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
           roenbergensis virus BV-PW1]
 gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
           roenbergensis virus BV-PW1]
          Length = 210

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/212 (31%), Positives = 94/212 (44%), Gaps = 29/212 (13%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
            +LS  P   Y  N  + ++C  II     +LKP   AL  G +       RT +  ++S
Sbjct: 4   HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
              D+  I   I  KI      P  + E F VL Y   QKY+ HYDAF P +   +    
Sbjct: 61  HKNDE--ITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAF-PIDNSEKAKRC 117

Query: 194 -----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
                QRL + L+YL++V +GGET F               K + +K+ P+ G  L+F +
Sbjct: 118 LKKGGQRLLTALIYLNNVTKGGETEF---------------KNLNIKITPKIGRILVFEN 162

Query: 249 LFPNGTIDR-TSLHGSCPVIKGEKWVATKWIR 279
              N       SLH    VI+GEK+V   W R
Sbjct: 163 TLQNSLNKHPDSLHSGKQVIEGEKYVINLWFR 194


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 95/209 (45%), Gaps = 24/209 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    E+ ++I   A+ R K + +   +   +E     R S   ++   E K 
Sbjct: 208 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 267 --VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + PR+G    +++L PNG  D 
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINISLWPRKGSAAFWHNLKPNGEGDF 369

Query: 258 TSLHGSCPVIKGEKWVATKWI--RDQEQH 284
            + H +CPV+ G KWVA KW+  R QE H
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQEFH 398


>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
 gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
          Length = 543

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/214 (29%), Positives = 99/214 (46%), Gaps = 21/214 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
           I  ++LS  P  L   +    ++   I A++K+ L  S++      + E      RTS  
Sbjct: 324 IKTEILSLDPFVLLLHDMVRQKESTLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKS 383

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYG 189
            + S+  + T   + I  ++A AT L     E F V+ Y +G  + +H D    +   + 
Sbjct: 384 VWYSSDFNDTT--KKITERLADATGLDMHFTEYFQVINYGLGGFFATHLDMLLSDKTRFN 441

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
              S R+A+ + YL+ V +GG T FP  N               L V P+ G  L +Y+L
Sbjct: 442 GT-SDRIATTVFYLNGVRQGGATHFPLLN---------------LTVFPQPGSALFWYNL 485

Query: 250 FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
              G   R+++H  CPVI G KWV TKW+ DQ Q
Sbjct: 486 DTKGNDQRSTMHTGCPVIVGSKWVMTKWVGDQGQ 519


>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
 gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
          Length = 498

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 103/220 (46%), Gaps = 25/220 (11%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
           T  A I  +  + LS  P  + F +     +   ++   +  LK S +   Q   V ++K
Sbjct: 286 TAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLVGQYQYSPVRTSK 343

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
                   F+  ++  T +++ +  ++   T L     +A  ++ Y +G  YD HYD+ N
Sbjct: 344 EQH-----FVEYND--TAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHN 396

Query: 185 PAEYGPQ-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
            +E     +  R+A+ L Y+ +V+ GG T FP+               I + V P++G  
Sbjct: 397 YSEANRLILGDRIATVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSA 441

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L+Y+L   G ++  ++H  CPVI G K+V TKWI +  Q
Sbjct: 442 VLWYNLDNAGQMNPKAIHAGCPVIVGSKYVLTKWINEIPQ 481


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/222 (25%), Positives = 103/222 (46%), Gaps = 30/222 (13%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ +  +    +P  + + N  + ++ +++   A+ R K + +       +E     R S
Sbjct: 332 IQPVKMEEAFHKPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPA-NYRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
              ++  SE+   + + +  ++   T L     E   V+ Y IG  Y+ H+D        
Sbjct: 391 KSAWLK-SEEHDHVFK-VTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVN 448

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           AF    +G     R+A++L Y+S+VE GG T+FP                + L + P++G
Sbjct: 449 AFKDLGWG----NRVATWLFYMSEVEAGGATVFP---------------KLNLALWPQKG 489

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
               +Y+L PNG  +  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 490 SAAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIHERNQ 531


>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
 gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 100/227 (44%), Gaps = 41/227 (18%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS---------------QLALRQGETVESTKG 125
           +P  L F NF +  + + I   A  RLK +               +++ R+        G
Sbjct: 309 KPEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHPVTG 368

Query: 126 T------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
                  R S   ++   ED+  +++ I +++   + L  T  E   V+ Y IG  Y+ H
Sbjct: 369 KLEFANYRISKSGWLRDEEDE--LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPH 426

Query: 180 YDAFNPAE---YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKV 236
           YD     E          R+A+FL YLSDVE GG T+F                 +G  V
Sbjct: 427 YDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT---------------RVGATV 471

Query: 237 KPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            P++GD   +Y+L  +G  D ++ H +CPV+ G KWVA KWI +  Q
Sbjct: 472 WPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQ 518


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 106/213 (49%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLS +P  + + NF +  + + I   A+  L+ S +A   GE  ++T   R S   ++ 
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369

Query: 136 ASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            SE  + I+  ++ +I+  T L     + E   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 370 GSE--SCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V   +   + +++L 
Sbjct: 428 KTGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVLKKAAIFWWNLH 472

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            NG  D  +LH  CPV+ G+KWVA KW+ +  Q
Sbjct: 473 RNGRGDAETLHAGCPVLIGDKWVANKWVHEYGQ 505


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE  +     R S
Sbjct: 334 LQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEK-QLQVDYRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  NG  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRNGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   R   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 535

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/258 (27%), Positives = 106/258 (41%), Gaps = 48/258 (18%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATA------KKRLKPSQLALRQGETVESTK 124
           + +  + +S  PR     NF S E+   +I           +L+ S +     +  +   
Sbjct: 177 EPVIIESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKS 236

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--- 181
             RTS   F + SE    I + +   +           +   +LRY+  Q Y +H D   
Sbjct: 237 KHRTSENAFDTVSEAAVDIRKRV-FDVLSLGEFQADMADGLQLLRYQQKQAYIAHEDYFP 295

Query: 182 -------AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF---------ENGIFLDSG 225
                   F+P + G   S R A+  LYLSDV  GG+T+FP          E     +S 
Sbjct: 296 VGAAKDFNFDPHKGG---SNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNSA 352

Query: 226 YDY------------------KKC-IGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPV 266
            DY                  +KC   L   P +G  +LFYS  PNG +D  SLHG CPV
Sbjct: 353 QDYEAIGAELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCPV 412

Query: 267 IKGEKWVATKWIRDQEQH 284
           ++G KW A  W+ ++ +H
Sbjct: 413 LEGTKWGANLWVWNRRRH 430


>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
 gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 101/212 (47%), Gaps = 25/212 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
           LS  P  + + N  S  +    IA  ++  +P   ++  GE   S K   RT+ G +I  
Sbjct: 322 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 377

Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
                    +++ I  +I   T L   HG+   +++Y  G  YD+H+D  N +    Q +
Sbjct: 378 KNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 437

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF-P 251
             R+A+ L YL+DV+ GG T+FP                + LKV   RG  L++Y++   
Sbjct: 438 GDRMATVLFYLNDVKHGGSTVFP---------------VLKLKVPSERGKVLVWYNMHGE 482

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
              +D  +LHGSCPVI G K V + WI + +Q
Sbjct: 483 THDLDSRTLHGSCPVIDGAKTVLSCWIHEWDQ 514


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   R   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
 gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
          Length = 229

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 103/220 (46%), Gaps = 25/220 (11%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
           T  A I  +  + LS  P  + F +     +   ++   +  LK S +   Q   V ++K
Sbjct: 17  TAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQ--LKASLVGQYQYSPVRTSK 74

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
                   F+  ++  T +++ +  ++   T L     +   ++ Y +G  YD HYD+ N
Sbjct: 75  EQH-----FVEYND--TAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYDVHYDSHN 127

Query: 185 PAEYGPQ-MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
            +E     +  R+A+ L Y+ +V+ GG T FP+               I + V P++G  
Sbjct: 128 YSEANRLILGDRIATVLFYVGEVDSGGATTFPY---------------INVSVTPKKGSA 172

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L+Y+L  +G ++  ++H  CPVI G K+V TKWI +  Q
Sbjct: 173 VLWYNLDNSGQMNPKAIHAGCPVIVGSKYVLTKWINEIPQ 212


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 362 LQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVA--SGEKQLQVE-YRIS 418

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L   H   E   V+ Y IG  Y+ H+D A +P+
Sbjct: 419 KSAWLKDTADP--VLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPS 476

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 477 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 521

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 522 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 560


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 209
           ++ AT L  T  E   V  Y +G  Y+ H+D F  + + P     R+A+ + YLSDVE+G
Sbjct: 394 VSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 453

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF N                 V+P+ G+ L +Y+L  +  +D  + H  CPV+KG
Sbjct: 454 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKG 498

Query: 270 EKWVATKWIRDQEQ 283
            KW+A  WI +  Q
Sbjct: 499 SKWIANIWIHEATQ 512


>gi|302849869|ref|XP_002956463.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
           nagariensis]
 gi|300258161|gb|EFJ42400.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
           nagariensis]
          Length = 965

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 96/206 (46%), Gaps = 38/206 (18%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGT 132
            +VL+  P  +    F SA +C  I+ +A     +K S + +  G  V+ T+  RTSS  
Sbjct: 744 LRVLNIDPPVITVEGFLSAPECDGIVRSAADSGLMKQSGVGV-SGYQVKDTENVRTSSTL 802

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
             +A   +T            A  LPQ       V RY+ GQ + +H DAF  A  G + 
Sbjct: 803 AATAEPGQT------------AFELPQ-------VARYQPGQHFLTHEDAFPAAVVGSKG 843

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
            QR A+ L+YL+D E+GG T F                 + + V+PR+G  LLF+  F N
Sbjct: 844 YQRRATLLVYLNDCEQGGATKF---------------DILDIAVQPRKGTALLFFPAFAN 888

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
           G  DR +LH +   +  EKWV   W+
Sbjct: 889 GMPDRRTLHTAQDAVS-EKWVTQLWL 913


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 109/225 (48%), Gaps = 31/225 (13%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
            D  ++  PF+V  L+  P    + +  +  + + +   +  +++ S L       ++  
Sbjct: 306 ADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELKLMSSPQIRRS-LLYNHTLDIDQA 364

Query: 124 KGTRTSSGTFISASEDKTGI--LELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSH 179
              RTS+  F+    ++TGI  LE I  + A  T L  T    E   V+ Y +G +Y  H
Sbjct: 365 DVDRTSNSVFM----EETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQYTPH 420

Query: 180 YDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
            D F+  AE G     RLA+ L YL+DV++GG T+FPF               + L   P
Sbjct: 421 CDYFDENAENG----DRLATVLFYLTDVQQGGATVFPF---------------LRLSYFP 461

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           ++G  L+F +L    + D+ S H +CPV+ G KWVATKWI   +Q
Sbjct: 462 KKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIYHFDQ 506


>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
          Length = 173

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 72/141 (51%), Gaps = 22/141 (15%)

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYL 203
           L+ IE +I+  + +P  +GE                   FN    G    QR+A+ L+YL
Sbjct: 55  LQAIEKRISVYSQVPVENGELIQ----------------FNLKRGG----QRVATMLIYL 94

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           SD  EGGET FP     F   G   K   GL V P +G+ +LF+S+  +G  D  S+HG 
Sbjct: 95  SDNVEGGETYFPMAGSGFCRCGG--KSVRGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGG 152

Query: 264 CPVIKGEKWVATKWIRDQEQH 284
           C V+ GEKW ATKW+R +  H
Sbjct: 153 CEVLAGEKWSATKWMRQRSTH 173


>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 538

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/227 (26%), Positives = 108/227 (47%), Gaps = 30/227 (13%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G+ G  S+Q I  + ++ +P  +   +    +  + ++A A+ RL+ S       E V S
Sbjct: 306 GQDGFFSLQPIKLEEINLKPYIIVMHDVVQDKDIKDLMAYAEPRLERST-TYTGSEMVPS 364

Query: 123 TKGTRTSSGTFISASEDKTGIL--ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
               RTSS  +++  E    +     +   +   T       EA+ +  Y  G ++  H+
Sbjct: 365 P--VRTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTNEEAEAYQLANYGTGGQFLPHH 422

Query: 181 D-------AFNP-AEYGPQ--MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
           D       ++N  A+Y  Q     RLA+ ++Y++DVEEGG T+FP               
Sbjct: 423 DFLQDSLHSYNSSADYYLQYGTGDRLATLMIYMTDVEEGGATVFP--------------- 467

Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
            +G+++ P++GD   +++L  +G  DR + H  CPV+ G KW+A KW
Sbjct: 468 SLGIRLTPKKGDAAFWWNLKASGEGDRLTTHAGCPVLYGSKWIANKW 514


>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/230 (26%), Positives = 105/230 (45%), Gaps = 46/230 (20%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS---------------QLALRQGETVESTKG 125
           RP  + + +  S ++ + +   AK RL+ +               +++ R+    +   G
Sbjct: 373 RPYIVRYLDIISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTG 432

Query: 126 TRTSSGTFISASEDKTG----ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
             T++   +S S   TG    ++E I  +I   T L     E   V  Y +G +Y+ H+D
Sbjct: 433 KLTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFD 492

Query: 182 --------AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIG 233
                   AF     G     R+A++L Y+SDV  GG T+FP                +G
Sbjct: 493 FGRKDEPDAFKELGTG----NRIATWLFYMSDVAAGGATVFP---------------DVG 533

Query: 234 LKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             V P++G  + +Y+LF +G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 534 AAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 583


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 68/134 (50%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 209
           +  AT L  T+ E   V  Y +G  Y+ H+D F  + + P     R+A+ + YLSDVE+G
Sbjct: 393 VGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 452

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF N                 V+P+ G+ L +Y+L  +   D  + H  CPV+KG
Sbjct: 453 GATAFPFLN---------------FAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKG 497

Query: 270 EKWVATKWIRDQEQ 283
            KW+A  WI +  Q
Sbjct: 498 SKWIANIWIHEATQ 511


>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
           pulchellus]
          Length = 548

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/229 (24%), Positives = 101/229 (44%), Gaps = 29/229 (12%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           +G  G   +Q +  +  + +P  + F +         ++A A  RL  S      GE   
Sbjct: 316 YGRNGFLRLQPVKIEEANLKPYIITFHDIIGDRDINDLLAYATPRLFRST---HYGEHGT 372

Query: 122 STKGTRTSSGTFISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            T   RTSS  ++   +    T +   +E  +   +   +   E + +  Y +G +Y +H
Sbjct: 373 ETSLIRTSSTAWLGDQDAPVATRLNRFVESLLGLGSQYLKGEAEYYQLANYGVGGQYIAH 432

Query: 180 YDAFNPAEYGPQ---------MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKK 230
           +D        P             R+A+ + YLSDVEEGG T+FP               
Sbjct: 433 HDFLADIYADPNRKLDDFERSAGDRIATLMFYLSDVEEGGATVFPH-------------- 478

Query: 231 CIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            +G+++ P++G+   +++L  +G  ++ + HG CPV+ G KW+A KW R
Sbjct: 479 -LGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKWIANKWFR 526


>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
          Length = 239

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 60/199 (30%), Positives = 95/199 (47%), Gaps = 30/199 (15%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF + E+C  I+   + +L  S++   +       K  R S   ++S  +    +++ + 
Sbjct: 61  NFINKEKCGEIMNNTQSKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 203
            KI++   +P  + E   V+RY  GQ Y+ H+DA         E+  +  QR  + L+YL
Sbjct: 112 QKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIYL 171

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-IDRTSLHG 262
           ++  EGG T F               K +GLKVKP  GD ++FY L  N +     SLH 
Sbjct: 172 NNEFEGGHTFF---------------KNLGLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216

Query: 263 SCPVIKGEKWVATKWIRDQ 281
             PV  GEKW+A  W R++
Sbjct: 217 GMPVTNGEKWIANLWFRER 235


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 98/223 (43%), Gaps = 32/223 (14%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           I  I  ++L + P A+ F N  S  + + I   A  +LK +        TV+++K     
Sbjct: 313 IAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRA--------TVQNSKTGELE 364

Query: 130 SGTF-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             T+ IS S     D   +++ +  +I   T L Q   E   V  Y +G  YD H+D   
Sbjct: 365 HATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFAR 424

Query: 185 PAEYGP----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
             E           R+A+ L Y+S  E GG T+F                 +G  V P +
Sbjct: 425 KEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSK 469

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            D L +Y+L  +G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 470 NDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQ 512


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    ++ ++I   A+ R K + +   +   +E     R S   ++   E K 
Sbjct: 208 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 266

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 267 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 324

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + P++G    +Y+L PNG  D 
Sbjct: 325 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 369

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWVA KW+ ++ Q
Sbjct: 370 KTRHAACPVLTGSKWVANKWLHERGQ 395


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           ++  T L  T+ E   V  Y +G  Y+ H+D F NP  Y  +   R+A+ + YLS+VE+G
Sbjct: 396 LSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYYLSEVEQG 455

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF N                 V+P+ G+ L +Y+L  +  +D  + H  CPV+KG
Sbjct: 456 GATAFPFLN---------------FAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKG 500

Query: 270 EKWVATKWIRDQEQ 283
            KW+   WI +  Q
Sbjct: 501 SKWIGNVWIHEVTQ 514


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +VL   P  + + +F S  + Q I   A   L+ S +A   GE  +  +  R S
Sbjct: 349 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 405

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 406 KSAWLKDTVDP--MLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 463

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y      R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 464 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVVKNAAL 508

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 509 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 547


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 97/216 (44%), Gaps = 24/216 (11%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A++F +  + E+   I   A  RL+ + +       +E T   RTS  
Sbjct: 323 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 381

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +  I+  I  +I   T L Q   E   V  Y IG  YD H+D     E    
Sbjct: 382 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 439

Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
            S     RLA+ L Y++  E GG T+F                 +   V P + D L +Y
Sbjct: 440 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 484

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L  +G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 485 NLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 520


>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 188

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 96/208 (46%), Gaps = 29/208 (13%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL+  P      NF +  +C+ +I  A+    P+ +    G+       +RTSS  ++S 
Sbjct: 1   VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLS- 56

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQ 191
            ED   ++     K++  T  P  H E   V RY   Q+Y  HYDAF+        +   
Sbjct: 57  REDLPDLMR----KVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFP 251
             QR  + LLYL+DV  GG T FP                + L V+PR+G  L+F+    
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFP---------------ALNLDVQPRQGMALVFFPATI 157

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           +G +DR +LH + P +   K+V+  WIR
Sbjct: 158 DGMLDRMALHAAMPAVD-TKYVSQVWIR 184


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    E+ ++I   A+ R K + +   +   +E     R S   ++   E + 
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + P++G    +Y+L PNG  D 
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537


>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
          Length = 535

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 20/146 (13%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPAEYGPQMSQRLA 197
           +++ ++++I   T L     +A  V  Y IG  YD HYD       + +E   +   R+A
Sbjct: 393 VVDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIA 452

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FLLY++DV+ GG T+FP                I ++V P++G  + +Y+L  +G    
Sbjct: 453 TFLLYMTDVDAGGATVFPI---------------IDVRVLPKKGTAVFWYNLRRSGKGIM 497

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWV+ KWIR + Q
Sbjct: 498 ETRHAACPVLVGTKWVSNKWIRTRGQ 523


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 97/216 (44%), Gaps = 24/216 (11%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A++F +  + E+   I   A  RL+ + +       +E T   RTS  
Sbjct: 322 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +  I+  I  +I   T L Q   E   V  Y IG  YD H+D     E    
Sbjct: 381 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438

Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
            S     RLA+ L Y++  E GG T+F                 +   V P + D L +Y
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 483

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L  +G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQ 519


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  I  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 346

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 347 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 406

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 407 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 451

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 452 TRHAACPVLVGCKWVSNKWFHERGQ 476


>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
 gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
          Length = 546

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/218 (29%), Positives = 102/218 (46%), Gaps = 24/218 (11%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE---TVESTKGT-R 127
           I  ++LS  P  +   +  S E+   +   +K  + PS+ A L   E     E   G+ R
Sbjct: 322 IKTEILSIDPFIVLLHDMVSVEEGALLRTFSKNMISPSETAELSDSEEKSIFEFEVGSFR 381

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NP 185
           TS   ++    ++  +   +  ++  AT L  +H E F V+ Y IG  ++SH+D    + 
Sbjct: 382 TSKSVWLDNDANEATLK--LTQRLGDATGLDISHSEPFQVINYGIGGIFESHFDTSLQDE 439

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
             +      RLA+ L YL+DV +GG T FP  N               + V P+ G  L 
Sbjct: 440 NRFLDGYMDRLATTLFYLNDVPQGGATHFPGLN---------------ITVFPKFGTALF 484

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L   G +   ++H  CPVI G KWV +KWI D+ Q
Sbjct: 485 WYNLDTKGLLRLRTMHTGCPVIVGSKWVVSKWIDDKGQ 522


>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
          Length = 525

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 70/269 (26%), Positives = 123/269 (45%), Gaps = 45/269 (16%)

Query: 43  RPKLRTLEVVE---KENESGLPHGETGDASIQS------IPF-----QVLSWRPRALYFP 88
           RPKL++ +  E   +  +  LP  ++   +++       +PF     ++L+ +P  + F 
Sbjct: 263 RPKLKSTKAYEALCRGEQLKLPDVDSEQQALKCRYKPGILPFVRYKEEMLNRKPHIVLFH 322

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +  S  + +++   A  +L+ + +A  + +   S    R S  +++        I +L  
Sbjct: 323 DVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLWDDHANKTIHQL-S 381

Query: 149 HKIARATMLPQT-------HGEAFNVLRYEIGQKYDSHYDAF-------NPAEYGPQMSQ 194
            ++A  T L QT         E F +L Y IG +Y+ H D F       +  E+      
Sbjct: 382 RRVADITGL-QTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHSSLPEHVRASGN 440

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           RLA+F+ YL+DV  GG T+FP             K  +G  + P +     +Y++  NG 
Sbjct: 441 RLATFMFYLNDVHAGGATVFP-------------KLKVG--IPPTKNGAAFWYNIGLNGD 485

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +D  + H  CPV+ G+KWVA KWI +  Q
Sbjct: 486 VDPLTEHAGCPVLLGQKWVANKWIHEHGQ 514


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    ++ ++I   A+ R K + +   +   +E     R S   ++   E K 
Sbjct: 330 PRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHKH 388

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 389 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 446

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + P++G    +Y+L PNG  D 
Sbjct: 447 TVLYYMSDVEQGGGTVFT---------------AINIALWPKKGSAAFWYNLKPNGEGDF 491

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWVA KW+ ++ Q
Sbjct: 492 KTRHAACPVLTGSKWVANKWLHERGQ 517


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q+I   A+  L+ S +A   GE     +  R S
Sbjct: 268 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 325 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 382

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y      R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 383 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 427

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH +CPV+ G+KWVA KWI +  Q
Sbjct: 428 FWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 466


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 94/206 (45%), Gaps = 22/206 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + N    E+ ++I   A+ R K + +   +   +E     R S   ++   E + 
Sbjct: 350 PRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALE-IANYRISKSAWLQEHEHEH 408

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
             +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+A
Sbjct: 409 --VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIA 466

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           + L Y+SDVE+GG T+F                 I + + P++G    +Y+L PNG  D 
Sbjct: 467 TVLYYMSDVEQGGGTVFT---------------AINISLWPKKGSAAFWYNLKPNGEGDF 511

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQ 283
            + H +CPV+ G KWVA KW+ ++ Q
Sbjct: 512 KTRHAACPVLTGSKWVANKWLHERGQ 537


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 101/204 (49%), Gaps = 24/204 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK RL  ++  +R  +T V +    R S  +++   ED 
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 397

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++   T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 398 DPVVAKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 456

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G  D 
Sbjct: 457 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 501

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
            + H +CPV+ G KWV+ KW  ++
Sbjct: 502 RTRHAACPVLVGCKWVSNKWFHER 525


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 484 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 529 TRHAACPVLVGCKWVSNKWFHERGQ 553


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q+I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y      R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 449 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH +CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 532


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 484 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 529 TRHAACPVLVGCKWVSNKWFHERGQ 553


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 273 LQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 329

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 330 KSAWLKDTADP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 387

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 388 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 432

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH +CPV+ G+KWVA KWI +  Q
Sbjct: 433 FWWNLHRSGEGDSDTLHAACPVLVGDKWVANKWIHEYGQ 471


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +VL   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 324 LQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 380

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 381 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 438

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   +   L
Sbjct: 439 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAAL 483

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G KWVA KWI +  Q
Sbjct: 484 FWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQ 522


>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
 gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
          Length = 193

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 95/198 (47%), Gaps = 28/198 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F    QC ++IA  +   +PS +A   G+ V      RTSS   +S      G +  + 
Sbjct: 16  DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPD---VGAVAALA 67

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 203
            K+   + +   H E     RYE+GQ++ +H D F P      +Y     QR  +F++YL
Sbjct: 68  RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +DV+ GG T F               K I   ++P RG  + + +  P+G+++  +LH +
Sbjct: 128 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 172

Query: 264 CPVIKGEKWVATKWIRDQ 281
             V +G K+V T+W R++
Sbjct: 173 MKVRQGRKYVVTQWFRER 190


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 474

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 475 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 519

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 520 TRHAACPVLVGCKWVSNKWFHERGQ 544


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 493

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 494 FLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 538

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 539 TRHAACPVLVGCKWVSNKWFHERGQ 563


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   R   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
          Length = 487

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/222 (28%), Positives = 103/222 (46%), Gaps = 15/222 (6%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +S  P       F   ++   ++  +   L PS + L+ G         RTS+  ++
Sbjct: 268 METISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWL 327

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEY 188
            +S     +++ I+ + A    +P +H E+  VLRYE  Q YD H D F      N A+ 
Sbjct: 328 ESSSHP--VVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNSADV 385

Query: 189 GPQMSQ----RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCI-GLKVKPRRGDG 243
             ++      R+ +   Y+SDV +GG T F    G  L      K C  GL V P++   
Sbjct: 386 LKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGG--LPPPPTNKGCTQGLSVVPKKRKV 443

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           ++FYS+ PNG  D  SLH  CPV +G K    KW+ ++ + +
Sbjct: 444 VVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWVWNKPRSD 485


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 98/213 (46%), Gaps = 19/213 (8%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
           I  ++LS  P  L F +  S ++   I +++K+ + PS          E    T RTS  
Sbjct: 328 IKTEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKS 387

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
            + S++ + T   + I  ++  AT L     E F V+ Y +G  +++H D   +      
Sbjct: 388 VWYSSTSNDTT--KRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFN 445

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               RLA+ L YL++V +GG T FP  N               L V P+ G  L +Y+L 
Sbjct: 446 GTRDRLATTLFYLNEVRQGGGTHFPRLN---------------LTVFPQPGSALFWYNLD 490

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             G    ++LH  CPVI G KWV +KW+ D  Q
Sbjct: 491 TRGNDHTSTLHTGCPVIVGSKWVMSKWVEDAGQ 523


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 422

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 423 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 467

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 468 TRHAACPVLVGCKWVSNKWFHERGQ 492


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 103/210 (49%), Gaps = 30/210 (14%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + + + AS ++ +++   AK RL+ + +   Q   + +T   R S   ++ + E   
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKL-TTAQYRVSKSAWLGSHEHP- 381

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 193
            I++ I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G    
Sbjct: 382 -IVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436

Query: 194 QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNG 253
            R+A++LLY+SDV+ GG       N +F D        IG  V P++G  + +Y+L  +G
Sbjct: 437 NRIATWLLYMSDVQAGG-------NTVFTD--------IGAVVWPKKGTAVFWYNLHRSG 481

Query: 254 TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 482 EGDYRTRHAACPVLVGNKWVSNKWIHERGQ 511


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 103/205 (50%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +T   R S  +++  ++D 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTTASYRVSKSSWLEETDDP 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 307 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 363

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 364 KSAWLKDTVDPK--LVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 421

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   R   L
Sbjct: 422 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 466

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 467 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 505


>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 591

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 107/231 (46%), Gaps = 38/231 (16%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKG----TRTSS 130
           +V+++ PR   F +  S    + + + A K    S + L   G     T G     R S 
Sbjct: 361 EVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQ 420

Query: 131 GTFISASEDKTGILELIEHKIARATMLP------QTHGEAFNVLRYEIGQKYDSHYDAF- 183
            +++    D+   L  +E++I   T L       ++H E F VL Y +G  Y  HYD   
Sbjct: 421 TSWLGT--DEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTG 478

Query: 184 -------NPAEYGPQMS--QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGL 234
                  NP +     +  +R+A+++ YL+DV+ GG T+FP                +  
Sbjct: 479 YMLGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFP---------------EVKT 523

Query: 235 KVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
           ++   +G    +Y++ P+G  D  +LHG CPV+ G KWV+ KWIR++ Q +
Sbjct: 524 RIPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVSNKWIREEGQMD 574


>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
 gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
          Length = 193

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 91/194 (46%), Gaps = 28/194 (14%)

Query: 99  IIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 158
           +I  A + +K +++   +   V      RT+ G ++    ++  + + I  +I   T   
Sbjct: 2   LIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE--LTKRITRRIMDMTGFD 57

Query: 159 QTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQMSQRLASFLLYLSDVEEG 209
               E F V+ Y IG  Y  H D F+ A          Y   +  R+A+ L YL+DVE+G
Sbjct: 58  LADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQG 117

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T       +F D GY         V P+ G  + +Y+L  +G  D  + H +CPVI G
Sbjct: 118 GAT-------VFGDVGY--------YVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVG 162

Query: 270 EKWVATKWIRDQEQ 283
            KWV T+WIR++ Q
Sbjct: 163 SKWVMTEWIREKRQ 176


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 318 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 374

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 375 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 432

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   R   L
Sbjct: 433 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 477

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 478 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 516


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 451

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 452 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 511

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 512 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 556

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 557 TRHAACPVLVGCKWVSNKWFHERGQ 581


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + +    S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  + H++ + T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 394 --VVARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFD-FSRRPFDITLKTEGNRLA 450

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G  D 
Sbjct: 451 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 495

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
            + H +CPV+ G KWV+ KW  ++
Sbjct: 496 RTRHAACPVLVGCKWVSNKWFHER 519


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
 gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
           ATCC 15444]
          Length = 298

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 104/207 (50%), Gaps = 29/207 (14%)

Query: 82  PRA-LY-FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
           P+A LY +PNF + E C ++IA   +RL+ S           +    RTS  + I  +  
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRASTTT-----DAFADPKIRTSRSSDI-GTMG 153

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----Q 194
              +++L E  IA A  +  ++ +A    RY++ Q+Y +HYD F P     Q+      Q
Sbjct: 154 HNLVMQLDE-LIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFTGQ 212

Query: 195 RLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
           R  +F++YL+DVEEGG T F               + +   + P +G  +++ +L P+G+
Sbjct: 213 RTWTFMIYLNDVEEGGGTRF---------------RRLEKTIMPEKGKAVIWNNLNPDGS 257

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           ++  ++H    V  G K+V TKW R++
Sbjct: 258 VNPYTIHHGMKVRSGAKYVITKWFRER 284


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                 G  + P++G  + +Y+LF +G  D  
Sbjct: 453 FLNYMSDVEAGGATVFP---------------DFGATIWPKKGTSVFWYNLFRSGEGDYR 497

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 498 TRHAACPVLVGSKWVSNKWFHERGQ 522


>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Strongylocentrotus purpuratus]
          Length = 121

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 64/121 (52%), Gaps = 16/121 (13%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFL 222
           E   +  Y +G  Y  H+D F       +   R+AS L YLSDV +GG+T       +F+
Sbjct: 5   EFLQIANYGLGGHYLPHFD-FTRDVATHKNGNRIASMLFYLSDVAKGGDT-------VFI 56

Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
           D+G         K+KP +G  + +Y+LF NG +D  + H SCPVI G KWVA  W+ +  
Sbjct: 57  DAG--------AKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMHEHG 108

Query: 283 Q 283
           Q
Sbjct: 109 Q 109


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 98/207 (47%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  + H++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRV 456

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 457 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 501

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 502 YRTRHAACPVLVGCKWVSNKWFHERGQ 528


>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
 gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
          Length = 535

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/179 (32%), Positives = 85/179 (47%), Gaps = 25/179 (13%)

Query: 113 ALRQGETVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
            L++ E    T G+     RTS GT      D+  I+E +   +   + L     E   +
Sbjct: 354 VLQRSEVYSPTNGSTAATFRTSQGTVFEY--DEHPIIEKLSQHMTLISGLDMGFAEPLQI 411

Query: 168 LRYEIGQKYDSHYDAFNPA-EYGPQM--SQRLASFLLYLSDVEEGGETMFPFENGIFLDS 224
             Y IG  Y+ H D+F  + +Y  Q   + R+A+ + YLS+VE GG T FPF        
Sbjct: 412 ANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAGGATAFPF-------- 463

Query: 225 GYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
                  + L VKP +G  L +Y+L  +G  D  + H  CPV++G KW+A  WIR   Q
Sbjct: 464 -------LPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKWIANVWIRLSHQ 515


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
           [Ornithorhynchus anatinus]
          Length = 495

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 88/195 (45%), Gaps = 34/195 (17%)

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---HGEAFNVLRYEIGQ 174
           + V+ +   R S  T++   E    ++  I+ ++ R T LPQ    H E   V+RY+ G 
Sbjct: 259 QKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRLPQEIVEHSEPLQVVRYDQGG 318

Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP----- 215
            Y +H D+             F   E  P + S R  + L YL++V  GGET FP     
Sbjct: 319 HYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVLFYLNNVTGGGETTFPVADNR 378

Query: 216 -------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-----IDRTSLHGS 263
                   +N I L     +     L+VKP++G  + +Y+   +G      +D  SLHG 
Sbjct: 379 TYDEMSLIQNDIDLRDTRKHCDKGNLRVKPKQGTAVFWYNYLSDGQGWVGDLDEYSLHGG 438

Query: 264 CPVIKGEKWVATKWI 278
           C V +G KW+A  WI
Sbjct: 439 CLVTQGTKWIANNWI 453


>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
 gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
 gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
          Length = 239

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 95/199 (47%), Gaps = 30/199 (15%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF + E+C+ I+   + +L  S++   +       K  R S   ++S  +    +++ + 
Sbjct: 61  NFINKEKCKEIMNNTQNKLFDSEVISGK------NKAIRNSQQCWVSKYDP---MVKSMF 111

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 203
            KI++   +P  + E   V+RY  GQ Y+ H+DA         E+  +  QR  + L+YL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT-IDRTSLHG 262
           ++  EGG T F               K + LKVKP  GD ++FY L  N +     SLH 
Sbjct: 172 NNEFEGGHTFF---------------KNLNLKVKPETGDAIVFYPLAKNTSKCHPLSLHA 216

Query: 263 SCPVIKGEKWVATKWIRDQ 281
             PV  GEKW+A  W R++
Sbjct: 217 GMPVTSGEKWIANLWFRER 235


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 453 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 497

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 498 TRHAACPVLVGCKWVSNKWFHERGQ 522


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S E+ Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 256

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 257 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 314

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V   +   L +++L 
Sbjct: 315 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 359

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 360 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 392


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 105/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 304 LQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 360

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 361 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 418

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y      R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 419 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 463

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH +CPV+ G+KWVA KWI +  Q
Sbjct: 464 FWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQ 502


>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
 gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
          Length = 340

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 103/221 (46%), Gaps = 22/221 (9%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G+ G  S+Q I  + ++ +P  +   +    +  + ++A A+ RL+ S       E + S
Sbjct: 12  GQDGFFSLQPIKLEEINLKPYVIVMHDVVQDKDIEDLMAFAEPRLERST-TYTGNEMMPS 70

Query: 123 TKGTRTSSGTFISASEDKTGIL--ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
            +  RTSS  +++  E    +     +   +   T       EA+ +  Y  G  +  H+
Sbjct: 71  PE--RTSSTAWLNEDEAPIAVRMNSYLRALLGMGTSDTDEEAEAYQLANYGTGGHFLPHH 128

Query: 181 DAFNPAEYGPQ--MSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
           D    +          RLA+ ++Y++DVEEGG T+FP                +G+++ P
Sbjct: 129 DFLQDSLQADNSVTGDRLATLMIYMTDVEEGGTTVFP---------------NLGIRLTP 173

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
           ++GD   +++L  +G  +R + H  CPV+ G KW+A KW R
Sbjct: 174 KKGDAAFWWNLKASGDGERLTTHAGCPVLYGSKWIANKWFR 214


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 61/216 (28%), Positives = 102/216 (47%), Gaps = 22/216 (10%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           ++ +  +++S  P+   F N  S  + + ++  A+ RL+ +++   +   +E     R S
Sbjct: 314 LKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVD-YRIS 372

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              ++S S+    I+  I  ++   T L    GE   V  Y +G  Y+ H+D     E  
Sbjct: 373 QIAWLSDSDGD--IVRRINRRVGFITGLNTNTGECLQVNNYGVGGHYEPHFDHSLDMENS 430

Query: 190 PQMS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
           P  S     R+A+F+ YLS+VE GG T+F                  G+K  P +G  + 
Sbjct: 431 PIASLGQGNRIATFMFYLSEVEAGGSTVFI---------------KTGVKTNPFKGGAVF 475

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQ 281
           +Y+L  +G  D  SLH  CPV+ G KWVA KW+ + 
Sbjct: 476 WYNLKKSGEGDWDSLHAGCPVLIGNKWVANKWLHEH 511


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 338 LQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 394

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 395 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 452

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 453 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 497

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 498 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 536


>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
 gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
           'English Channel 673']
 gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
 gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
           'English Channel 673']
          Length = 263

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 59/203 (29%), Positives = 96/203 (47%), Gaps = 31/203 (15%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + +F S+++C  I+A  K +L PS+LA        S    RTSS   ++   +K  +++ 
Sbjct: 85  YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
           ++++I     L    GE      Y +G+ Y  HYD F P    PQ         QR  + 
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++YL+D  +GG T F                 + + VKP++G  L + +L P+G  +  S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNS 240

Query: 260 LHGSCPVIKGEKWVATKWIRDQE 282
           +H + PV +G K V TKW R + 
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263


>gi|323456313|gb|EGB12180.1| hypothetical protein AURANDRAFT_61447 [Aureococcus anophagefferens]
          Length = 317

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 98/219 (44%), Gaps = 33/219 (15%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSSGT 132
           P +VLS  P A    +FA+  +C  IIA A  RL     AL  G+   E    +R++   
Sbjct: 101 PCEVLSTAPLAFCVRDFATGAECDRIIAEATPRLS---AALVAGDGAGEQAGSSRSAQVA 157

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------- 183
           ++  S D       +  ++A    +P +H E+  V++Y  G +Y  H+DAF         
Sbjct: 158 WVPRSPDDP----WLARRVAELIDVPLSHAESLQVVKYGAGGEYKPHFDAFPLDAARGRR 213

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
                     QR  + +LYL+DVE+GG T F  E                  V+PRRG  
Sbjct: 214 AAVRGRTYAGQRRVTAILYLNDVEKGGGTAFHSETP------------AEFVVRPRRGSL 261

Query: 244 LLFYSLFPNGTIDR--TSLHGSCPVIK-GEKWVATKWIR 279
            +FY+ + + T DR   SLH   PV   G KW+A  W R
Sbjct: 262 FVFYNCYEDST-DRHPMSLHAGLPVAPGGTKWIANIWWR 299


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 104/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 325 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 381

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 382 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 439

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   +   L
Sbjct: 440 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAAL 484

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 485 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 523


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 394 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 452

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G  D 
Sbjct: 453 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 497

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
            + H +CPV+ G KWV+ KW  ++
Sbjct: 498 RTRHAACPVLVGCKWVSNKWFHER 521


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 102/204 (50%), Gaps = 24/204 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 393 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 451

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G  D 
Sbjct: 452 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 496

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQ 281
            + H +CPV+ G KWV+ KW  ++
Sbjct: 497 RTRHAACPVLVGCKWVSNKWFHER 520


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S E+ Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V   +   L +++L 
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 497

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 498 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 530


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 103/213 (48%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S E+ Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V   +   L +++L 
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAALFWWNLH 497

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 498 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 530


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 96/216 (44%), Gaps = 24/216 (11%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A+ F +  + E+   I   A  RL+ + +       +E T   RTS  
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +  ++  I  +I   T L Q   E   V  Y IG  YD H+D     E    
Sbjct: 381 AWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438

Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
            S     RLA+ L Y++  E GG T+F                 +   V P + D L +Y
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 483

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L  +G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519


>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
 gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
          Length = 144

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 71/141 (50%), Gaps = 15/141 (10%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           +++ I +++   + L  T  E   V+ Y IG  Y+ HYD             R+A+FL Y
Sbjct: 13  LVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIATFLSY 72

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           LSDVE GG T+F                 +G  V P++GD   +Y+L  +G  D ++ H 
Sbjct: 73  LSDVEAGGGTVFTR---------------VGATVWPQKGDAAFWYNLKRSGDGDSSTRHA 117

Query: 263 SCPVIKGEKWVATKWIRDQEQ 283
           +CPV+ G KWVA KWI +  Q
Sbjct: 118 ACPVLVGSKWVANKWIHEVGQ 138


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 99/218 (45%), Gaps = 21/218 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
           ++ +  +VL   P    +    + ++ + II  AK  L+ + +  +  G+ + +    R 
Sbjct: 321 LKPLKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYA--DYRV 378

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNP 185
           S  T+I+  ED   I   I  ++   T L   + E   V  Y I  +Y+ H+D      P
Sbjct: 379 SKNTWIA--EDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRP 436

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
             +      R+A+ LLYLSDV+ GG T+F                  G+   P +G G+ 
Sbjct: 437 KHFDRWGGNRIATMLLYLSDVDWGGRTVFT-------------NTAPGVGTDPIKGAGVF 483

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L  NG  +  + H  CPV+ G+KWVA  WI +  Q
Sbjct: 484 WYNLLRNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQ 521


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  ++L   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   +   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVKNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQ 532


>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
 gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
          Length = 495

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 80/162 (49%), Gaps = 20/162 (12%)

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD- 181
           T   R S   ++S  E    +++ +E +IA  T L     E F V  Y +  +YD H+D 
Sbjct: 335 TAHYRISKNCWLSGREHGE-VIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDF 393

Query: 182 ---AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
                N +        R+A+ L+++S VE GG T+FP+               +G ++ P
Sbjct: 394 SRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPY---------------VGARILP 438

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
           ++GD + +++L  +G  D  + H  CPV+ G KWVA KWI +
Sbjct: 439 QKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIHE 480


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/216 (29%), Positives = 96/216 (44%), Gaps = 24/216 (11%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A+ F +  + E+   I   A  RL+ + +       +E T   RTS  
Sbjct: 322 PFKVEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +  ++  I  +I   T L Q   E   V  Y IG  YD H+D     E    
Sbjct: 381 AWLKDEEHE--VVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438

Query: 192 MS----QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
            S     RLA+ L Y++  E GG T+F                 +   V P + D L +Y
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF---------------TEVKTTVMPSKNDALFWY 483

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +L  +G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 484 NLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQ 519


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 27/215 (12%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSS 130
           PF+V  LS  P   YF +  S ++ + II   K ++  S++    G+T  ST    RTS 
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSDIRTSQ 391

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
            T++    +    L  I+ ++   T L     E   ++ Y IG +Y+ H+D  + AE  +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G +   RL + L YL+DV  GG T FPF               + L V P +G  L++Y+
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYN 493

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L  +   D  + H  CPV+KG KW+  +W  +  Q
Sbjct: 494 LHRSLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQ 528


>gi|159474434|ref|XP_001695330.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158275813|gb|EDP01588.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1887

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 59/211 (27%), Positives = 94/211 (44%), Gaps = 25/211 (11%)

Query: 79   SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 138
            S  PR L    F     C ++ A A  RL      +R   +  +   +R S  TF +   
Sbjct: 1687 SLSPRVLVVDGFLPPGLCDALCAVAAPRL------IRSRVSTGAETPSRVSQSTFFTGDS 1740

Query: 139  DKTGILELIEHKIARATMLPQT---------HGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
             +   +  +E ++      P+            EA  V+ Y++G  Y  HYD     + G
Sbjct: 1741 ARLPEVVAVEARLQALMERPEVTAGGRPTLVKSEALQVVSYDVGGFYSEHYDN----KTG 1796

Query: 190  PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL 249
              +S R A+ ++YL D + GG T FP +    +          GL+V P +G  L+F+S 
Sbjct: 1797 GVIS-RAATIIIYLQDTQAGGSTHFPNQQLRLMRVARP-----GLRVYPAKGRALIFWSR 1850

Query: 250  FPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
             P+G+ D  SLH + PV  G KW+ T+W ++
Sbjct: 1851 LPDGSEDLASLHSAEPVRAGSKWICTRWFKE 1881


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +VL   P  + + +F S  + Q I   A   L+ S +A   GE  +  +  R S
Sbjct: 311 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 367

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   IL  ++ +IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 368 KSAWLKDTVDP--ILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 425

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y      R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 426 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIYAN---------------FSVPVVKNAAL 470

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 471 FWWNLHRSGQGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 509


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 454 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 16/134 (11%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           +  AT L  T+ E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+G
Sbjct: 397 LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAIFYLSEVEQG 456

Query: 210 GETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKG 269
           G T FPF               + + VKP+ G+ L +Y+L  +   D  + H  CPV+KG
Sbjct: 457 GATAFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKG 501

Query: 270 EKWVATKWIRDQEQ 283
            KW+   WI +  Q
Sbjct: 502 SKWIGNVWIHEVTQ 515


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 406 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 451 TRHAACPVLVGCKWVSNKWFHERGQ 475


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 26/209 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           PR + + N  S E+   I   AK +L  ++  +R  +T V S    R S   ++  ++D 
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKL--ARATVRDPKTGVLSVANYRVSKSAWLEENDDP 395

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++   T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 396 --VIARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTDGNRLA 452

Query: 198 SFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDR 257
           +FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G  D 
Sbjct: 453 TFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGEGDY 497

Query: 258 TSLHGSCPVIKGEKWVATKWIRDQEQHED 286
            + H +CPV+ G KW   KW   Q+ H D
Sbjct: 498 RTRHAACPVLVGSKW--GKWTHTQDHHFD 524


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
           domestica]
          Length = 534

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   +K +L  S+  +R  +T        R S  +++   ED 
Sbjct: 337 PHIVRYYDVLSDEEIEKIKEISKPKL--SRATVRDPKTGHLIVVSYRISKSSWLK--EDD 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             I+  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 393 DPIIAQVNRRMQYITGLSVKTAELLQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G  D  
Sbjct: 453 FLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTSVFWYNLFRSGECDYR 497

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 498 TRHAACPVLVGSKWVSNKWFHERGQ 522


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 100/213 (46%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + LS +P  + + +F S  + + I   A+  L+ S +A R  +    T   R S   ++ 
Sbjct: 81  ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137

Query: 136 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            S      +  ++ +I+  T L     HGE   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+ ++YLS VE GG T F + N                 V   +   + +++L 
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIYAN---------------FSVPVMKNAAIFWWNLH 240

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            NG  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 241 RNGRGDPDTLHAGCPVLIGDKWVANKWIHEYGQ 273


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 101/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 406 FLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 451 TRHAACPVLVGCKWVSNKWFHERGQ 475


>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
 gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 44/118 (37%), Positives = 65/118 (55%), Gaps = 18/118 (15%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEY--GPQMSQRLASFLLYLSDVEEGGETMFPFENGI 220
           E  NV  Y +G  +  HYD + P  Y  G  M   L + L Y+SD+++GG T+FP     
Sbjct: 409 EELNVANYGLGTIFGPHYD-YTPENYDIGWFMGGPLGTILFYVSDLQQGGATIFP----- 462

Query: 221 FLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
                      I + V PR+G  LL+++L+ +G  D  +LH SCPVI+G++W  TKW+
Sbjct: 463 ----------SINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRWTLTKWV 510


>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
           'Balearic Sea AD45']
 gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
           'Balearic Sea AD45']
          Length = 263

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 59/203 (29%), Positives = 95/203 (46%), Gaps = 31/203 (15%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + +F S+++C  I+A  K +L PS+LA        S    RTSS   ++   +K  +++ 
Sbjct: 85  YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
           ++ +I     L    GE      Y +G+ Y  HYD F P    PQ         QR  + 
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++YL+D  +GG T F                 + + VKP++G  L + +L P+G  +  S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVKPKKGMALFWNNLLPSGDPNLNS 240

Query: 260 LHGSCPVIKGEKWVATKWIRDQE 282
           +H + PV +G K V TKW R + 
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 104/213 (48%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++S +P  + + +F +  + + I + A   L+ S +A   GE  ++T   R S   ++ 
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVA--AGEK-QATADYRISKSAWLK 369

Query: 136 ASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            S     I+  ++ +I+  T L   H  GE   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 370 GS--AQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V       + +++L 
Sbjct: 428 KTGNRVATFMIYLSPVEAGGSTAFIYAN---------------FSVPVVEKAAIFWWNLH 472

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            NG  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 473 RNGEGDDDTLHAGCPVLIGDKWVANKWIHEYGQ 505


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 102/222 (45%), Gaps = 30/222 (13%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q    +V+  RP    F +  S ++ Q++I  +  RLK + +   +   +E     R S
Sbjct: 314 LQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELE-VANYRIS 372

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
              ++   + +  ++E +  +    T L     E   V+ Y IG  Y++H+D        
Sbjct: 373 KSAWLKNHDHE--VVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKD 430

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
           AF     G     R+A+++ Y+SDV+ GG T+FP                +GL V P +G
Sbjct: 431 AFKQLGTG----NRIATWINYMSDVKAGGATVFPR---------------LGLTVWPEKG 471

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
               +++L  +G  D  + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 472 SAAFWWNLHRSGEGDILTRHAACPVLAGSKWVSNKWFHERGQ 513


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 96/220 (43%), Gaps = 32/220 (14%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++L + P A+ F N     + + I   A  +LK +        TV+++K       T
Sbjct: 316 IKVEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRA--------TVQNSKTGELEHAT 367

Query: 133 F-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           + IS S     D   +++ +  +I   T L Q   E   V  Y +G  YD H+D     E
Sbjct: 368 YRISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEE 427

Query: 188 YGP----QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
                      R+A+ L Y+S  E GG T+F                 +G  V P + D 
Sbjct: 428 KNAFKTLNTGNRIATVLFYMSQPERGGATVF---------------NHLGTAVFPSKNDA 472

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L +Y+L  +G  D  + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 473 LFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQ 512


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/205 (27%), Positives = 102/205 (49%), Gaps = 22/205 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLEEEDDP 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 --VVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRT 258
           FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D  
Sbjct: 452 FLNYMSDVEAGGATVFP---------------DLGAALWPKKGTAVFWYNLLRSGEGDYR 496

Query: 259 SLHGSCPVIKGEKWVATKWIRDQEQ 283
           + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGQ 521


>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
          Length = 544

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + +    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVNPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N               L V   R   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------LSVPVVRNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 504

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 97/211 (45%), Gaps = 56/211 (26%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + + +  S E+   +   AK RL+ + +                        S   
Sbjct: 330 KPRIVRYHDIISDEEISKVKELAKPRLRRATI------------------------SNPI 365

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           TG+LE  +++I++   + +       V  Y +G +Y+ H+D        AF     G   
Sbjct: 366 TGVLETAQYRISKRWAIME-----LEVANYGMGGQYEPHFDFARKDEPDAFKELGTG--- 417

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A++L Y+SDVE GG T+FP                +G  V P++G  + +Y+LF +
Sbjct: 418 -NRVATWLFYMSDVEAGGATVFP---------------EVGAAVYPKKGTAVFWYNLFES 461

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D ++ H +CPV+ G KWV+ KWI ++ Q
Sbjct: 462 GEGDYSTRHAACPVLVGNKWVSNKWIHERGQ 492


>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 169

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 71/110 (64%), Gaps = 3/110 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           ++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFS 165


>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 95/210 (45%), Gaps = 26/210 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           PR + + +    ++ ++I   A+ R K + +   + GE        R S   ++   E K
Sbjct: 349 PRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGEL--EIANYRISKSAWLQEHEHK 406

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 196
              +  +  ++   T +     E   V+ Y IG  Y+ H+D     E     S     R+
Sbjct: 407 H--VRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRI 464

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ L Y+SDVE+GG T+F                 I + + P++G    +Y+L PNG  D
Sbjct: 465 ATVLYYMSDVEQGGGTVFT---------------KINISLWPKKGSAAFWYNLKPNGEGD 509

Query: 257 RTSLHGSCPVIKGEKWVATKWI--RDQEQH 284
             + H +CPV+ G KWVA KW+  R QE H
Sbjct: 510 YKTRHAACPVLTGSKWVANKWLHERGQEFH 539


>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
 gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
          Length = 454

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 98/215 (45%), Gaps = 19/215 (8%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           I  +  ++LS  P  +   +     + + +   + KRL+ ++ AL Q +        RTS
Sbjct: 249 IAPVKMEILSLNPYIVLCHDVILPSEQEFLKTQSSKRLEGAR-ALDQVKNEVVFNFIRTS 307

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-Y 188
             T++  + D   +   + H I   + L    G+ + ++ Y +G  +++H D     E  
Sbjct: 308 KATWLKKNSD--NVTRRLSHWIEDVSNLDSNIGDLYQIINYGVGGLFEAHSDTMRKDEDR 365

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
              +  R+A+F+ YL DV +GG T+F                 + L V P+ G  L +++
Sbjct: 366 WKVLYDRIATFIFYLQDVPQGGATLF---------------NNLNLTVFPKAGAALFWFN 410

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L   G  D  ++H  CPVI G KW+ TKW+ D  Q
Sbjct: 411 LDNAGDTDLFTVHTGCPVIVGSKWIMTKWVYDLGQ 445


>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
           pv. vasculorum NCPPB 702]
          Length = 301

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 89/194 (45%), Gaps = 22/194 (11%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDKTGILELIEHK 150
           SA++C+ ++  A+  L+ SQ+ +   +        RTS G T     ED      + + +
Sbjct: 121 SADECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA--RVAQAR 177

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSDVE 207
           +A    L  TH E  +VL Y  G++Y +H D   P   A   P    R  +  +YL+ V+
Sbjct: 178 LAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVD 237

Query: 208 EGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVI 267
            GGET FP                 G++V+PR G  + F +L  +G  +  SLH   PV 
Sbjct: 238 AGGETEFPLA---------------GVRVQPRPGALVCFDNLHADGRPNADSLHAGLPVT 282

Query: 268 KGEKWVATKWIRDQ 281
            G KW+ T W R Q
Sbjct: 283 AGSKWLGTLWFRQQ 296


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 103/215 (47%), Gaps = 27/215 (12%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSS 130
           PF+V  LS  P   YF +  S ++ + II   K ++  S++    G+T  ST    RTS 
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSEIRTSQ 391

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
            T++    +    L  I+ ++   T L     E   ++ Y IG +Y+ H+D  + AE  +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
           G +   RL + L YL+DV  GG T FPF               + L V P +G  L++Y+
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF---------------LHLAVPPVKGSLLVWYN 493

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L  +   D  + H  CPV+KG KW+  +W  +  Q
Sbjct: 494 LHRSLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQ 528


>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
 gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
          Length = 498

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 63/218 (28%), Positives = 104/218 (47%), Gaps = 31/218 (14%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + F +     + + +  TA+  L  S +     E+V S    RT+ G
Sbjct: 281 PFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVKKSNNESVVSK--VRTAKG 338

Query: 132 TFISA---SEDKTGILELIEHKIARATMLPQTHGEAFN---VLRYEIGQKYDSHYDAFNP 185
            F+ A   S +   +++ ++ ++   + L     E +N    L Y+ G  Y  H D FN 
Sbjct: 339 AFMHADRLSPESAQVVQRLKQRMGDLSDL-NIKREGYNEMQYLNYDFGDHYLLHMDYFNI 397

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
           +     M+ R+A+FL+YL+DV  GG T+FP                +   V P +G  +L
Sbjct: 398 S-----MNDRIATFLIYLNDVTRGGGTIFP---------------QVKQAVHPEKGKLIL 437

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y++  N   +  SLHG+CPV+ G K     WIR+ +Q
Sbjct: 438 WYNMNSNLDYELASLHGACPVLIGRKIAIVYWIREHDQ 475


>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
 gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
          Length = 542

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 96/213 (45%), Gaps = 19/213 (8%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
           I  ++LS  P  L   +  S ++   I  ++K+ + PS          E+   T RTS  
Sbjct: 323 IKTEILSVDPFVLLLHDMISQKESTLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKS 382

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
            + S+  + T   + I  ++  AT L     E + V+ Y +G  +++H D   +      
Sbjct: 383 VWYSSDFNDTT--KKITERLGDATGLDTNFTEFYQVINYGLGGFFETHLDMLLSEKNRFN 440

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               R+A+ L YL++V +GG T FP                I L V P+ G  L +Y+L 
Sbjct: 441 GTRDRIATTLFYLNEVRQGGGTYFPR---------------INLTVFPQPGSALFWYNLD 485

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            NG     SLH  CPVI G KWV +KWI D  Q
Sbjct: 486 TNGNDHMGSLHTGCPVIVGSKWVMSKWINDMGQ 518


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/241 (27%), Positives = 100/241 (41%), Gaps = 50/241 (20%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           I  I  ++L + P A+ F N  S  + + I   A  +LK +        TV+++K     
Sbjct: 329 IAPIKVEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRA--------TVQNSKTGELE 380

Query: 130 SGTF-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             T+ IS S     D   ++E +  +I   T L Q   E   V  Y +G  YD H+D   
Sbjct: 381 HATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGGHYDPHFDFAR 440

Query: 185 PAEYGP----------------------QMSQRLASFLLYLSDVEEGGETMFPFENGIFL 222
            A YG                           R+A+ L Y+S  E GG T+F        
Sbjct: 441 IANYGLGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF-------- 492

Query: 223 DSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
                    +G  V P + D L +Y+L  +G  D  + H +CPV+ G KWV+ KWI ++ 
Sbjct: 493 -------NHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERG 545

Query: 283 Q 283
           Q
Sbjct: 546 Q 546


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/226 (29%), Positives = 102/226 (45%), Gaps = 30/226 (13%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +  I  +  S+ P    F +  S E+ ++I   AK  L  S +  + G   E +   RTS
Sbjct: 307 LMPIKIEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVS-NVRTS 365

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP----QTHGEAFNVLRYEIGQKYDSHYDAF-- 183
              ++   E    +L  +  +I   T L     +   E   V  Y IG  Y  H+D    
Sbjct: 366 KTAWLP--EGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMK 423

Query: 184 NPAEYG------PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           + A++        Q   R+A+F+ YL+DVE GG T FP                 G+ VK
Sbjct: 424 DKADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFP---------------RAGVAVK 468

Query: 238 PRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           P +G    +++L  +G  D  +LHG+CPV+ G KWV+ KWIR+  Q
Sbjct: 469 PVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWIRETAQ 514


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 95/207 (45%), Gaps = 22/207 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S  + + I   A+ R + + +   +   +E     R S   ++  +ED+
Sbjct: 348 RPYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFA-NYRISKSAWLKDAEDE 406

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRL 196
             ++  I  ++   T L     E   V+ Y IG  Y+ H+D     E     S     R+
Sbjct: 407 --MIRTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRI 464

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+ L Y+SDV +GG T+FP                + L + PR+G    +++L  +G  D
Sbjct: 465 ATVLFYMSDVTQGGATVFP---------------SLNLALWPRKGTAAFWFNLHASGRGD 509

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KWI ++ Q
Sbjct: 510 YATRHAACPVLTGTKWVSNKWIHERGQ 536


>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
 gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
          Length = 415

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 102/225 (45%), Gaps = 35/225 (15%)

Query: 62  HGETGDASIQSIPF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +  T    ++  PF  ++LS  P  + + +  +  +  ++   +K  +K   + +     
Sbjct: 215 YNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLK 274

Query: 120 VES-TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
           V       RTS+  ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  
Sbjct: 275 VRPFIDSGRTSNSVWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKP 332

Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKP 238
           H D F      PQ           LSDV +GG T+FP  N               + V+P
Sbjct: 333 HTDHFE----TPQ-----------LSDVPQGGATLFPRLN---------------ISVQP 362

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           R+GD LL+Y+L   G  +  ++H SCP+IKG KW   KWI +  Q
Sbjct: 363 RQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWALVKWIDELSQ 407


>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
 gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
          Length = 376

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 98/219 (44%), Gaps = 41/219 (18%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
           +S+P +V         + +  S  +C+ +IA     LKPS +       V+   G     
Sbjct: 174 KSLPIEV---------YESILSEYECRYLIAKFSALLKPSMV-------VDPVTGRGKID 217

Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS    I  +     I   ++  I++ T   + +GEA N+LRY  GQ+Y  HYD  N
Sbjct: 218 SVRTSYVAVIEPTHCD-WITRKLDKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276

Query: 185 PAEYGPQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
                       QR+ + L+YL+ + EGGET+FP                + +++ P+ G
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTINEGGETLFPK---------------LDIRIAPKSG 321

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
             ++F +   NG +   S H   P +   KW+ TKWIR+
Sbjct: 322 TMVVFSNSDENGKLLLNSYHAGAPTVSENKWLVTKWIRE 360


>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 263

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/203 (28%), Positives = 95/203 (46%), Gaps = 31/203 (15%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + +F S+++C  I+A  K +L PS+LA        S    RTSS   ++   +K  +++ 
Sbjct: 85  YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
           ++ +I     L    GE      Y +G+ Y  HYD F P    PQ         QR  + 
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPG--SPQYKTHCLSRGQRTWTC 195

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           ++YL+D  +GG T F                 + + V+P++G  L + +L P+G  +  S
Sbjct: 196 MIYLNDECDGGHTRF---------------TKLDIAVRPKKGMALFWNNLLPSGDPNLNS 240

Query: 260 LHGSCPVIKGEKWVATKWIRDQE 282
           +H + PV +G K V TKW R + 
Sbjct: 241 IHFAEPVTRGHKTVITKWFRTKN 263


>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
          Length = 491

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/128 (36%), Positives = 66/128 (51%), Gaps = 22/128 (17%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEY----GPQMSQ---RLASFLLYLSDVEEGGETMFP 215
           EA  V+ Y IG +Y+ H D +   E      P +     R+++FL YLS V  GG T+FP
Sbjct: 364 EAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSRVHLGGATVFP 423

Query: 216 FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVAT 275
             N               ++V P +     +Y+  PNG  D+ +LH  CPV+ GEKWVA 
Sbjct: 424 KLN---------------VRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVAN 468

Query: 276 KWIRDQEQ 283
           KWIR++ Q
Sbjct: 469 KWIRERGQ 476


>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
          Length = 538

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 95/211 (45%), Gaps = 20/211 (9%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+   P    + N  + ++   I   +K +L  S +    G   +  +  RTS   +I 
Sbjct: 333 EVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSGGNQKPVQDYRTSKSAWIE 392

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQM 192
             E    ++  +  + +  T L     E F V+ Y IG  Y+ H+D   P E   + P++
Sbjct: 393 DEEHP--MIRRVSERTSALTDLSLDTVELFQVVNYGIGGHYEPHFDFARPNEIATFDPEV 450

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+ + + Y++  E GG T+FP                +G+K+ P +G   ++++L  N
Sbjct: 451 GNRIITVIFYVAAPEAGGATVFP---------------DLGVKLWPEKGSCAVWWNLMRN 495

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H  CP I G KW+A KW  ++ Q
Sbjct: 496 GEGDYRTKHAGCPTITGSKWIANKWYHERGQ 526


>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
          Length = 539

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/223 (30%), Positives = 104/223 (46%), Gaps = 31/223 (13%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQG 117
           HG T + + +  P+ V+        F NF + E+   ++   +    +R      A   G
Sbjct: 324 HGATWEGAEEGSPWVVV--------FDNFLTDEEVADLVKGGELEGYERSTDQGAANAYG 375

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           E  +    TRTSS  +     ++   +     KI   T +PQ + E+F +L+Y+ GQ Y 
Sbjct: 376 EQEKVVSRTRTSSNAWCMHKCERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYR 435

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVK 237
           SH+D+ +  +  P    R+ +F LYLSDVEEGGET F                 +G+ VK
Sbjct: 436 SHHDS-SSVDDSP-AGHRILTFFLYLSDVEEGGETYF---------------SKLGIAVK 478

Query: 238 PRRGDGLLFYSLFPNGTI--DRTSLHGSCPVIKGEKWVATKWI 278
           P++G  L++ S+        D+   H +  VIKGEK  A  WI
Sbjct: 479 PKKGRALVWPSVLDEDPTYWDKRMYHEAKDVIKGEKKAANHWI 521


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
            FPF               + + VKP+ G+ L +Y+L  +   D  + H  CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504

Query: 273 VATKWIRDQEQ 283
           +   WI +  Q
Sbjct: 505 IGNVWIHEVTQ 515


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/195 (29%), Positives = 97/195 (49%), Gaps = 22/195 (11%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHK 150
           S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED   ++  +  +
Sbjct: 2   SDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRR 57

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLASFLLYLSDVEE 208
           +   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+FL Y+SDVE 
Sbjct: 58  MQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEA 117

Query: 209 GGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIK 268
           GG T+FP                +G  + P++G  + +Y+L  +G  D  + H +CPV+ 
Sbjct: 118 GGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLV 162

Query: 269 GEKWVATKWIRDQEQ 283
           G KWV+ KW  ++ Q
Sbjct: 163 GCKWVSNKWFHERGQ 177


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
            FPF               + + VKP+ G+ L +Y+L  +   D  + H  CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504

Query: 273 VATKWIRDQEQ 283
           +   WI +  Q
Sbjct: 505 IGNVWIHEVTQ 515


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
            FPF               + + VKP+ G+ L +Y+L  +   D  + H  CPV+KG KW
Sbjct: 460 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504

Query: 273 VATKWIRDQEQ 283
           +   WI +  Q
Sbjct: 505 IGNVWIHEVTQ 515


>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
 gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 96/211 (45%), Gaps = 21/211 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + + +   A++  S++    +R     +     E   S    RT+   
Sbjct: 310 LKMELLSLDPYVVLYHDVL-ADREMSLLKLMAQRDLVRAVTYNATEKKHSEDPNRTTKAG 368

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++  S +    + ++   ++    L     E F VL Y IG  Y  H D F  +   P++
Sbjct: 369 WLDPSHNLIRRMGILTEDMSN---LDLERSEDFQVLNYGIGGHYAVHPDFFEGS--NPEL 423

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L YLSDV  GG T+FP                + L V P++G  L++Y+L   
Sbjct: 424 PDRVATLLFYLSDVPLGGATVFPL---------------LDLSVFPKKGAVLMWYNLDHK 468

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G     ++H +CPV+ G +WV TKW+  Q Q
Sbjct: 469 GQGMEKTIHSACPVVVGSRWVMTKWVNQQPQ 499


>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
 gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
          Length = 536

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 100/212 (47%), Gaps = 25/212 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
           LS  P  + + N  S  +    IA  ++  +P   ++  GE   S K   RT+ G +I  
Sbjct: 328 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 383

Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
                    +++ I  +I   T L    G+   +++Y  G  YD+H+D  N +    Q +
Sbjct: 384 ENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 443

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF-P 251
             R+A+ L YL+DV+ GG T+FP                + LKV   RG  L++Y++   
Sbjct: 444 GDRMATVLFYLNDVKHGGSTVFP---------------VLQLKVPSERGKVLVWYNMHGE 488

Query: 252 NGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
              +D  +LHGSCPVI G K V + WI + +Q
Sbjct: 489 THDLDSRTLHGSCPVIDGAKTVLSCWIHEWDQ 520


>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
 gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
          Length = 228

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 98/200 (49%), Gaps = 28/200 (14%)

Query: 86  YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE 145
           Y  +F +  QC ++IA      +PS L      +     G RTS    ++    +   ++
Sbjct: 47  YQADFLTPAQCDALIAMIDANRRPSTLL-----SDRPDYGFRTSESCDMNRWSPE---VQ 98

Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQMS----QRLASFL 200
            I+  IA+   +P   GE     RY  GQ++ +H+D F+ +E Y  ++     QR  + +
Sbjct: 99  PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158

Query: 201 LYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSL 260
           +YL+DV EGG T FP                 G++V PRRG  L + ++  +G+ +  +L
Sbjct: 159 IYLNDVPEGGATWFP---------------QAGIRVAPRRGLLLAWNNMLLDGSPNDATL 203

Query: 261 HGSCPVIKGEKWVATKWIRD 280
           H   PV++G K+V TKW R+
Sbjct: 204 HEGMPVVEGVKYVITKWFRE 223


>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
 gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
          Length = 533

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/218 (29%), Positives = 102/218 (46%), Gaps = 26/218 (11%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P    F +   A +   +I   +  LK + +              RT++G
Sbjct: 313 PFKLELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQNITQNVDTYISKDRTATG 372

Query: 132 TFI---SASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFNPA 186
           ++I   + ++ +  ++  I+ +I   T L  T    +   +L Y  G  Y SHYD FN  
Sbjct: 373 SWILNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQDLQLLNYVFGGHYQSHYDFFNCP 432

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
            +      R+A+ L+YL+DV  GG T+FP                + L V+P RG  L +
Sbjct: 433 SFP---HDRIATTLIYLNDVVRGGATVFP---------------KLDLVVQPERGKVLHW 474

Query: 247 YSLFPNG-TIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           Y++ P+    DR SLHG CPV+ GEK   T WI + +Q
Sbjct: 475 YNMLPDTFDYDRRSLHGGCPVLIGEKLALTNWIYEWDQ 512


>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
           carolinensis]
          Length = 554

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 100/207 (48%), Gaps = 28/207 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + N  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 412

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 413 --VVAKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEE--PDAFKRLGTGN 468

Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
             A+FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G 
Sbjct: 469 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 513

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  + H +CPV+ G KWV+ KW  ++
Sbjct: 514 GDYRTRHAACPVLVGCKWVSNKWFHER 540


>gi|219113765|ref|XP_002186466.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583316|gb|ACI65936.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 537

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 116/274 (42%), Gaps = 58/274 (21%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKK------RLKPSQLA 113
           LPH   G+   Q +  + LS  PR     NF   E+ Q ++A   K      RLK S   
Sbjct: 264 LPHVMGGNPE-QPVYLRTLSLSPRVFDIVNFFRLEESQDLVARVLKEERELWRLKRST-- 320

Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
              G    +    RTS   F ++SE     L++   +        +T+ +   +LRY + 
Sbjct: 321 --TGPDSSNVNKRRTSESGFDTSSESALK-LKMRGFQALGFDDYIETYADGLQMLRYNLT 377

Query: 174 QKYDSHYD------------AFNPAEYGPQMSQRLASFLLYLSDVEE--GGETMF----P 215
             Y++H+D             FN A  G   + R A+ LLY+++  E  GGET+F    P
Sbjct: 378 TAYNAHHDFYGDISAVTGHHNFNSAGVG---ADRFATILLYMTEHGERGGGETVFTEAWP 434

Query: 216 FE--------------------NGIFLDSGYDYKKCIG-----LKVKPRRGDGLLFYSLF 250
            E                    +   L +G   +  +      L V+P  G  +LFYS  
Sbjct: 435 TELAPGKRVELPDAIESLRASGDASMLKTGSWEETLVAKCRSRLAVQPHPGRAVLFYSQL 494

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQH 284
           PNG  D TSLHG CPV+ G+K+ A  W+ +  +H
Sbjct: 495 PNGKPDYTSLHGGCPVLAGQKYAANLWVWNSHRH 528


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 80/161 (49%), Gaps = 22/161 (13%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AF 183
           R S   ++   ED   ++  I  + +  T L  T  E   V+ Y IG +Y+ H+D     
Sbjct: 380 RISKSGWLRDEEDP--LIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRS 437

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
            P  +      R+ + + Y++DVE GG T       +FLD+G        +KV P +G  
Sbjct: 438 EPTAFEKWRGNRILTVIYYMTDVEAGGAT-------VFLDAG--------VKVYPEKGSA 482

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI--RDQE 282
            ++++L P+G  D  + H +CPV+ G KWVA KW   RDQE
Sbjct: 483 AVWHNLLPSGEGDMRTRHAACPVLTGSKWVANKWFHERDQE 523


>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 354

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/212 (29%), Positives = 96/212 (45%), Gaps = 24/212 (11%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSG 131
           P +VL        + +  S  +C  +I      L+PS +   L     V++    RTS  
Sbjct: 146 PTEVLDQTLPVELYVDVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNV---RTSYV 202

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
             I+ S     I   ++  I++ T  P+ +GEA N+LRY  GQ+Y  HYDA N    G  
Sbjct: 203 AIIAPSYCD-WITRKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSM 261

Query: 192 M---SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
                QR+ + L+YL+ V +GGET FP                + + V P  G+ ++F +
Sbjct: 262 YKDGKQRIKTALVYLNTVRQGGETRFPK---------------LDISVSPTLGNMVVFSN 306

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
              +G +   S H   P     KW+ TKWIR+
Sbjct: 307 SDESGKLLLNSYHLGAPTFSENKWLVTKWIRE 338


>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
          Length = 187

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 77/158 (48%), Gaps = 18/158 (11%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
           R S   +++     T +  L +  +  AT L  T  E   V  Y +G  Y+ H+D F +P
Sbjct: 25  RVSKNAWLAYESHPTMVGMLRD--LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDP 82

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
             Y  +   R+A+ + YLS+VE+GG T FPF               + + VKP+ G+ L 
Sbjct: 83  NHYPAEEGNRIATAIFYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLF 127

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L  +   D  + H  CPV+KG KW+   WI +  Q
Sbjct: 128 WYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQ 165


>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
 gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
          Length = 209

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/198 (28%), Positives = 93/198 (46%), Gaps = 28/198 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F    QC ++IA  +   +PS +A   G+ V      RTSS   +S        L    
Sbjct: 32  DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPDVPAVAALA--- 83

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 203
            K+   + +   H E     RYE+GQ++ +H D F P      +Y     QR  +F++YL
Sbjct: 84  RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143

Query: 204 SDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGS 263
           +DV+ GG T F               K I   ++P RG  + + +  P+G+++  +LH +
Sbjct: 144 NDVDAGGATRF---------------KVINKLIQPERGKLVAWNNRRPDGSLNPATLHHA 188

Query: 264 CPVIKGEKWVATKWIRDQ 281
             V +G K+V T+W R++
Sbjct: 189 MKVRQGRKYVVTQWFRER 206


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 16/131 (12%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 330 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 389

Query: 213 MFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKW 272
            FPF               + + VKP+ G+ L +Y+L  +   D  + H  CPV+KG KW
Sbjct: 390 AFPF---------------LDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 434

Query: 273 VATKWIRDQEQ 283
           +   WI +  Q
Sbjct: 435 IGNVWIHEVTQ 445


>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
          Length = 538

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 28/207 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 395

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 396 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDE--PDAFKRLGTGN 453

Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
             A+FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G 
Sbjct: 454 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 498

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  + H +CPV+ G KWV+ KW  ++
Sbjct: 499 GDYRTRHAACPVLVGCKWVSNKWFHER 525


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 102/213 (47%), Gaps = 25/213 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S  + Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 361

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 362 DTVDP--MLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 419

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
           +   R+A+F++YLS VE GG T F + N                 V   +   L +++L 
Sbjct: 420 KSGNRVATFMIYLSAVEAGGATAFIYAN---------------FSVPVVKNAALFWWNLH 464

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 465 RSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 497


>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
 gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
          Length = 446

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 19/157 (12%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           R+    FI    +K  +++ IE ++   + L     +  +++ Y IG  Y  H+D+F+  
Sbjct: 296 RSGKNVFIEL--EKGELVKTIEMRVTDMSGLSMEGSDDLSLINYGIGGHYIPHHDSFSEE 353

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           E   +   R+A+ L YLSDVE GG T FP  N               L + P +G  +L+
Sbjct: 354 E--NKTEDRIATALFYLSDVELGGATTFPLLN---------------LTISPEKGTAVLW 396

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           ++L  +GT    ++H +CPVI G K+V TKWI + +Q
Sbjct: 397 HNLKDSGTPHPKTVHAACPVIVGSKYVMTKWIYNMDQ 433


>gi|397644755|gb|EJK76534.1| hypothetical protein THAOC_01697 [Thalassiosira oceanica]
          Length = 475

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 26/206 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRL--KPSQLALRQGE-TVESTKGT-RTSSGTFISAS 137
           P  + F NF + ++C  +I   +K    +   +   Q + + +S + T RTS   + S  
Sbjct: 273 PWVITFENFLTEDECTHMIEQGRKAEYERSEDVGEVQADGSYDSVRSTGRTSENAWCSFR 332

Query: 138 ED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           +      I+EL+  +IA+ T +   H E F +L+YE GQ Y  H+D +   +   +   R
Sbjct: 333 DGCRNDTIVELVHDRIAKVTGIGANHSEDFQILKYEPGQFYRQHHD-YIEHQRDRRCGPR 391

Query: 196 LASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF---PN 252
           + +F LYLSDVEEGG T FP                +G+ VKP+ G  LL+ S+    P 
Sbjct: 392 VLTFFLYLSDVEEGGATNFPK---------------LGIAVKPKVGRALLWPSVLNSEPR 436

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWI 278
               RT  H +  VI G K+ A  WI
Sbjct: 437 NKDGRTD-HEAQDVIAGVKYGANAWI 461


>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
          Length = 538

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 101/207 (48%), Gaps = 28/207 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 395 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDE--PDAFKRLGTGN 452

Query: 197 --ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT 254
             A+FL Y+SDVE GG T+FP           D+    G  + P++G  + +Y+LF +G 
Sbjct: 453 RVATFLNYMSDVEAGGATVFP-----------DF----GAAIWPKKGTAVFWYNLFRSGE 497

Query: 255 IDRTSLHGSCPVIKGEKWVATKWIRDQ 281
            D  + H +CPV+ G KWV+ KW  ++
Sbjct: 498 GDYRTRHAACPVLVGCKWVSNKWFHER 524


>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
 gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 99/211 (46%), Gaps = 26/211 (12%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSS 130
           +  ++ S  P  + + +     + Q +I + ++R+  S  Q  +RQ E  E     RTS 
Sbjct: 141 LKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSK 196

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH---YDAFNPAE 187
               +   D   +L+ I  ++   T       E  ++L Y+ G  +D H   +D +   E
Sbjct: 197 EAPFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYE 255

Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
           Y P    R AS + YL+DVE+GGET+FP                + L + P +G  L+++
Sbjct: 256 YHP-FGDRQASVVFYLNDVEDGGETVFP---------------KLQLVIPPTKGSALMWH 299

Query: 248 SLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +L P G  D  + H SCPV+ G K VA +WI
Sbjct: 300 NLRPWGEGDPRTQHASCPVLSGYKQVAIQWI 330


>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
           precursor (4-PH alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 527

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 48/138 (34%), Positives = 67/138 (48%), Gaps = 19/138 (13%)

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YGPQMSQRLASFLLY 202
           I  +I+  T L     E   V  Y +G +Y  H+D     E       Q  +R+A+FL+Y
Sbjct: 381 ITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQDGERIATFLIY 440

Query: 203 LSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHG 262
           LSDVE GG T F                  G+  KP +G  + +Y++FP+G  D  + HG
Sbjct: 441 LSDVEVGGRTAFV---------------NAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHG 485

Query: 263 SCPVIKGEKWVATKWIRD 280
           +CPV  G KW   KWIR+
Sbjct: 486 ACPVAFGNKWAGNKWIRE 503


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/223 (28%), Positives = 99/223 (44%), Gaps = 31/223 (13%)

Query: 72  SIPFQVL--------SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           S P+++L        S  P    F +  S  +   +   A  R++ S +  R G   + +
Sbjct: 313 SHPYRILAPLKLEEHSLDPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKS 372

Query: 124 KGTRTSSGTFIS--ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
              R S   +++  A     G+L      +  AT L  T  E   V  Y +G  Y+ H+D
Sbjct: 373 -AFRVSKNAWLAYEAHPTMAGMLR----DLKDATGLDTTFCEQLQVANYGVGGHYEPHWD 427

Query: 182 AF-NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRR 240
            F +P+ Y      R+A+ + YLS+VE+GG T FPF               +   VKP+ 
Sbjct: 428 FFRDPSHYPAAEGNRIATAIFYLSEVEQGGATAFPF---------------LDFAVKPQL 472

Query: 241 GDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G+ L +Y+L  +   D  + H  CPV+KG KW+   WI +  Q
Sbjct: 473 GNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQ 515


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 77/158 (48%), Gaps = 18/158 (11%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
           R S   +++     T +  L +  +  AT L  T  E   V  Y +G  Y+ H+D F +P
Sbjct: 124 RVSKNAWLAYESHPTMVGMLRD--LKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDP 181

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLL 245
             Y  +   R+A+ + YLS+VE+GG T FPF               + + VKP+ G+ L 
Sbjct: 182 NHYPAEEGNRIATAIFYLSEVEQGGATAFPF---------------LDIAVKPQLGNVLF 226

Query: 246 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           +Y+L  +   D  + H  CPV+KG KW+   WI +  Q
Sbjct: 227 WYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQ 264


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 268 LQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+I   T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 325 KSAWLKDTVDP--LLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 382

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 383 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 427

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 428 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 466


>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
 gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
          Length = 376

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 41/219 (18%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
           +S+P +V         + +  S  +C+ +I      LKPS +       V+   G     
Sbjct: 174 KSLPIEV---------YESILSEYECRYLITKFNALLKPSMV-------VDPVTGRGKID 217

Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS    I  +     I   ++  I++ T   + +GEA N+LRY  GQ+Y  HYD  N
Sbjct: 218 SVRTSYVAVIEPAHCD-WITRKLDKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276

Query: 185 PAEYGPQMS---QRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
                       QR+ + L+YL+ + EGGET+FP                + +++ P+ G
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTISEGGETLFPK---------------LDIRIAPKSG 321

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRD 280
             ++F +   NG +   S H   P +   KW+ TKWIR+
Sbjct: 322 TMVVFSNSDENGKLLLNSYHAGAPTVSENKWLVTKWIRE 360


>gi|397615311|gb|EJK63351.1| hypothetical protein THAOC_15991 [Thalassiosira oceanica]
          Length = 463

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 92/185 (49%), Gaps = 32/185 (17%)

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHK------IARATMLPQTHGE-------- 163
           E     + TRTS  T++   +D   I++ I  +      I  A + P++ GE        
Sbjct: 289 EARHDIRETRTSLNTWVYREKDL--IIDAIYRRAADLLRIDEALLRPRSAGEVPEMKNTR 346

Query: 164 ----AFNVLRYEIGQKYDSHYD-AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP-FE 217
               A  ++ YE+GQ+Y +H+D  + P +   Q + R A+ LLYL++   GGET FP + 
Sbjct: 347 GLAEALQLVHYEVGQEYTAHHDFGYAPFDRKDQPA-RFATLLLYLNEGMVGGETQFPRWA 405

Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
           N          +   GL V+P+ G  +LFYS  P+G +D  S H + PV  GEKW+   W
Sbjct: 406 NA---------ETRAGLDVEPKIGKAVLFYSQLPDGNMDDLSQHAARPVKIGEKWLMNLW 456

Query: 278 IRDQE 282
           + D E
Sbjct: 457 VWDPE 461


>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Sarcophilus harrisii]
          Length = 536

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRV 452

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                 G  + P++G  + +Y+LF +G  D
Sbjct: 453 ATFLNYMSDVEAGGATVFP---------------DFGATIWPKKGTSVFWYNLFRSGEGD 497

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWFHERGQ 524


>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
 gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
          Length = 499

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 98/210 (46%), Gaps = 27/210 (12%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  + LS  P  + + +   A + + I+  AK  L+ + +   +  +       R +   
Sbjct: 296 LKLEQLSLDPYMVLYHDVVQANEREHIMQLAKPHLRRALVGAARAHS------QRFAMNA 349

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPAEY 188
             S ++ + G  + +  ++   +    T+     VL Y IG +Y  HYD +    + A+ 
Sbjct: 350 GFSYNDSRQG--QRLRQRLEDMSGFDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQV 407

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYS 248
                 R+A+ LLYL+DV+ GG T FP                +GL V+P  G  L++++
Sbjct: 408 ASIKDNRIATILLYLTDVQLGGLTSFP---------------ALGLAVQPSPGSALIWHN 452

Query: 249 LFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           +      DR +LH +CP++ G +WVAT+WI
Sbjct: 453 MNNAAECDRRTLHAACPLLLGTRWVATQWI 482


>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
          Length = 557

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 72/241 (29%), Positives = 112/241 (46%), Gaps = 46/241 (19%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
            +S  P    F NF    +C+ +   A K LK S++      T       RTSS  F+  
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRV------TDGKLSNGRTSSSCFLIG 371

Query: 137 SEDKTGILELIEHKI---ARATMLPQTH---------GEAFNVLRYEIGQKYDSHYDAFN 184
           ++ K  +++ IE ++    R+T +  T           E   ++RY   +KY SH+D  N
Sbjct: 372 AKGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD--N 429

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----------YKKCI-- 232
            A       +R+A+F+ YLSD  EGG T FP    +FL+  +D           KK +  
Sbjct: 430 KA----GSFRRVATFMCYLSDQCEGGCTNFPKAEPLFLEPSFDEHGAFKPFGRKKKTVAS 485

Query: 233 ---GLKVKPRRGDGLLFYSL----FPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQHE 285
              G+K+ P+ G  +LF+S+    F    +   SLH    V KGEK++ TKW+   E+ E
Sbjct: 486 EQHGVKIHPKLGRAILFFSISEEPFRENPL---SLHEGQTVRKGEKFICTKWLTRTEESE 542

Query: 286 D 286
           +
Sbjct: 543 N 543


>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
 gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
          Length = 521

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 85/185 (45%), Gaps = 24/185 (12%)

Query: 105 KRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 163
           KRL P  Q     G     TK T  ++   ++   + T  LE +  +I   T        
Sbjct: 344 KRLSPQMQNGYIHGYKANQTKVTDIAAR--VNWLVENTPFLERMNQRITDMTGFDLKEFP 401

Query: 164 AFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEN 218
           +  V  + IG  +++HYD          + G  +  RLAS + Y SDV  GG T+FP   
Sbjct: 402 SVQVANFGIGNNFEAHYDYIFGKRVRKEDVG-DLGDRLASIIFYSSDVPLGGATVFP--- 457

Query: 219 GIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 278
                        I + V+P++G+ LL+Y+LF +GT D  SLH  CPV+ G +W  TKW+
Sbjct: 458 ------------DIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRWTLTKWL 505

Query: 279 RDQEQ 283
               Q
Sbjct: 506 HTSPQ 510


>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 3 [Oryctolagus
           cuniculus]
          Length = 535

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  I  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 96/193 (49%), Gaps = 22/193 (11%)

Query: 94  EQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
           E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED   ++  +  ++ 
Sbjct: 2   EEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRRMQ 57

Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLASFLLYLSDVEEGG 210
             T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+FL Y+SDVE GG
Sbjct: 58  HITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGG 117

Query: 211 ETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGE 270
            T+FP                +G  + P++G  + +Y+L  +G  D  + H +CPV+ G 
Sbjct: 118 ATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGC 162

Query: 271 KWVATKWIRDQEQ 283
           KWV+ KW  ++ Q
Sbjct: 163 KWVSNKWFHERGQ 175


>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 490

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 66/214 (30%), Positives = 101/214 (47%), Gaps = 34/214 (15%)

Query: 78  LSWRPRALYFPNFASAEQCQSII-----ATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +S  P  + F NF + E+C  +I     A  ++     ++ +        +KG RTS   
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENA 342

Query: 133 FISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--- 187
           + S  +    T   +LI  +I+  T +P  H E F +L+YE GQ Y SH+D     E   
Sbjct: 343 WCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRR 402

Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFY 247
            GP    R+ +F LYLSDVEEGG+T FP                + + VKP++G  +L+ 
Sbjct: 403 CGP----RVLTFFLYLSDVEEGGDTNFPK---------------LSIAVKPKKGSAVLWP 443

Query: 248 SLF---PNGTIDRTSLHGSCPVIKGEKWVATKWI 278
           S+    P+    RT  H +  V+ G K+ A  W+
Sbjct: 444 SVLDSNPSMKDPRTD-HEAQEVVNGTKFGANAWL 476


>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
 gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
          Length = 210

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/220 (27%), Positives = 101/220 (45%), Gaps = 32/220 (14%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
           A +Q  P    S R       NF +AEQC  ++A  +   +PS +A   G+        R
Sbjct: 16  AGVQRFP----SSRLDLFILKNFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFR 66

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
           TSS   +S       ++  +   ++R + +   H E     RYE+GQ++ +H D F P  
Sbjct: 67  TSSTCDLSTD---VPVVANLAAALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGN 123

Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
               +Y     QR  +F++YL++VE GG T F               + I   ++P  G 
Sbjct: 124 ADYDKYCAVPGQRTWTFMIYLNEVEAGGATRF---------------RVIDKMIQPEIGK 168

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQE 282
            + + +  P+GT +  +LH +  V KG K+V T+W R++ 
Sbjct: 169 LIAWNNRRPDGTPNAATLHHAMKVRKGYKYVITQWYRERH 208


>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
           boliviensis boliviensis]
 gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
           boliviensis boliviensis]
          Length = 535

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  +  ++   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTG--- 448

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +
Sbjct: 449 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 492

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 493 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 64/219 (29%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 329 LQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 385

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+I   T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 386 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 443

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 444 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 488

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 489 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 527


>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
          Length = 534

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 96/219 (43%), Gaps = 30/219 (13%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++L + P  + F    S  + + I   A  +LK + +   +   +E     R S   
Sbjct: 316 IKVEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDLEYA-NYRISKSA 374

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFN 184
           ++  ++     ++ I  +I   T L Q   E      Y IG  YD H+D        AF 
Sbjct: 375 WLKGTDHPA--IDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAFK 432

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
               G     R+A+ L+Y+SDVE GG T+F                 +G  V P + D L
Sbjct: 433 TLNTG----NRIATILIYMSDVESGGATVF---------------NHLGNAVFPSKYDAL 473

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +Y+L  +G  D  + H +CPV+ G KWV+ KWI D+ Q
Sbjct: 474 FWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQ 512


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 105/219 (47%), Gaps = 24/219 (10%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P  + + +F +  + Q I   A+  +  +++ +  GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWVT-AEILVASGEKQLPVE-YRIS 391

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 392 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 449

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 450 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYGN---------------FSVPVVKNAAL 494

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 495 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 533


>gi|326928035|ref|XP_003210190.1| PREDICTED: WD repeat-containing protein 6-like [Meleagris
           gallopavo]
          Length = 900

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 89/195 (45%), Gaps = 34/195 (17%)

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP---QTHGEAFNVLRYEIGQ 174
           + V+++   R S  T++   E    ++  I  ++ R T LP     H E   V+RY+ G 
Sbjct: 134 QKVKTSDAVRNSQHTWLYQGEGAHQVMRAIRQRVMRLTRLPPEIVEHSEPLQVVRYDQGG 193

Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP----- 215
            Y +H D+                 E  P + S R  + L YL++V  GGET+FP     
Sbjct: 194 HYHAHMDSGPVFPETACSHTKLVANESAPFETSCRYVTVLFYLNNVTGGGETVFPIADNR 253

Query: 216 -FENGIFLDSGYDY----KKCI--GLKVKPRRGDGLLFYSLFPNGT-----IDRTSLHGS 263
            +E    + +  D     K C    L+VKP++G  + +Y+   +G      +D  +LHG 
Sbjct: 254 TYEEMSLIQNDVDLRDTRKNCDKGNLRVKPQQGTAVFWYNYLSDGEGWVGELDDFALHGG 313

Query: 264 CPVIKGEKWVATKWI 278
           C V +G KW+A  WI
Sbjct: 314 CLVTQGTKWIANNWI 328


>gi|400602974|gb|EJP70572.1| 2OG-Fe(II) oxygenase family Oxidoreductase [Beauveria bassiana
           ARSEF 2860]
          Length = 269

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 102/219 (46%), Gaps = 16/219 (7%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           Q+   ++LS  P A+Y  NF +  + + ++A  +   KPS++A   G  V +T   R+S 
Sbjct: 43  QNYTVEILSIDPLAIYLNNFLNDAEIRYLLALGENIYKPSEVASHSGIIVNTT--VRSSE 100

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG-QKYDSHYDAFNPAEYG 189
             F+   ED      LI    +    +   H E+  +++Y  G  +Y  H D    A+  
Sbjct: 101 SAFLL--EDDAVCNCLISRMKSLLGNVQHEHVESLQMVKYAAGGDRYRLHTDWSVAAKNN 158

Query: 190 P-------QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYD----YKKCIGLKVKP 238
                   + S+RL +  +YL D   GGET FP   G+  D+  +     K+  GL V+P
Sbjct: 159 TDEASGKLRQSRRLGTIFVYLEDSCAGGETYFPLLTGVSDDADGEKFAVAKQGGGLLVRP 218

Query: 239 RRGDGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKW 277
           +RG+G+ + ++  NGT D   +H   P+  G K     W
Sbjct: 219 KRGNGVFWNNIHSNGTGDDRVVHAGLPIKSGVKIGLNMW 257


>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
          Length = 197

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/217 (29%), Positives = 104/217 (47%), Gaps = 35/217 (16%)

Query: 70  IQSIPFQVLSW-------RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           I  + F ++ W       +PR L   N  S ++C+ I   A K+L+ S ++    ++ + 
Sbjct: 5   ILFVSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQNIASKKLQTSTVS----KSRDI 58

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
            +  R S   ++ ASED   +++ +  K    T  P  + E   VL+Y+ G  Y  H D 
Sbjct: 59  DESIRKSETAWLKASED--PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDT 116

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
           F P +     ++R+ +F++ L+D  EGGET FP           + KK   L+    +GD
Sbjct: 117 F-PDD----KNKRMYTFIIALNDEYEGGETEFP-----------NIKKSYRLE----KGD 156

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            L F +L     I + +LHG  PV  GEKWV   W+R
Sbjct: 157 ALFFNTLNNYECITKKALHGGTPVKSGEKWVCNLWVR 193


>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Cricetulus griseus]
          Length = 535

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
 gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
          Length = 197

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 100/217 (46%), Gaps = 35/217 (16%)

Query: 70  IQSIPFQVLSW-------RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           I  + F ++ W       +PR L   N  S ++C+ I   A K+L+ S +++ +    + 
Sbjct: 5   ILFVSFLLIIWFFIPIYEKPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DI 58

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
            +  R S   ++ ASED   +++ +  K    T  P  + E   VL+Y+ G  Y  H D 
Sbjct: 59  DEKIRKSETAWLKASED--PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDC 116

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGD 242
           F   +     ++R+ +F++ L+D  EGGET FP                I  + +  +GD
Sbjct: 117 FKNDK-----NKRMYTFIIALNDEYEGGETEFPN---------------IKRRYRLEKGD 156

Query: 243 GLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 279
            L F +L       + +LHG  PV  GEKWV   WIR
Sbjct: 157 ALFFNTLNNYECTTKQALHGGAPVKSGEKWVCNLWIR 193


>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
           musculus]
 gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
           musculus]
          Length = 537

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 453

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 454 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFHERGQ 525


>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
           leucogenys]
          Length = 558

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 474

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 475 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 519

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 520 YRTRHAACPVLVGCKWVSNKWFHERGQ 546


>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
          Length = 535

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_f
           [Homo sapiens]
          Length = 567

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 483

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 484 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 528

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 529 YRTRHAACPVLVGCKWVSNKWFHERGQ 555


>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           troglodytes]
 gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
           troglodytes]
 gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
           paniscus]
 gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
           paniscus]
 gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 535

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
 gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
 gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
 gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
          Length = 535

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/217 (29%), Positives = 104/217 (47%), Gaps = 27/217 (12%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  +V+   P  + + +F +  + Q I   A+  L+ S +A   GE  +     R S  
Sbjct: 336 PFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVA--SGEK-QLQVDYRISKS 392

Query: 132 TFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPAE- 187
            ++  S D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+  
Sbjct: 393 AWLKDSVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 450

Query: 188 -YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
            Y  +   R+A+F++YLS VE GG T F + N                 +   +   L +
Sbjct: 451 LYRMKSGNRVATFMIYLSAVEAGGATAFIYAN---------------FSMPVVKNAALFW 495

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           ++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 496 WNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
           leucogenys]
          Length = 537

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 453

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 454 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFHERGQ 525


>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
           abelii]
          Length = 535

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 577

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRV 493

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 494 ATFLNYMSDVEAGGATVFPD---------------LGAAIWPKKGTAVFWYNLLRSGEGD 538

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 539 YRTRHAACPVLVGCKWVSNKWFHERGQ 565


>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 506

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRV 422

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 423 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFHERGQ 494


>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
           [Meleagris gallopavo]
          Length = 518

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 100/204 (49%), Gaps = 34/204 (16%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
            + ++  I  +I   T L  +  E          QK +   DAF     G     R+A++
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEEL--------QKDEP--DAFKELGTG----NRIATW 437

Query: 200 LLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTIDRTS 259
           L Y+SDV  GG T+FP                +G  V P++G  + +Y+LFP+G  D ++
Sbjct: 438 LFYMSDVSAGGATVFP---------------EVGASVWPKKGTAVFWYNLFPSGEGDYST 482

Query: 260 LHGSCPVIKGEKWVATKWIRDQEQ 283
            H +CPV+ G KWV+ KW+ ++ Q
Sbjct: 483 RHAACPVLVGNKWVSNKWLHERGQ 506


>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
 gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 535

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
 gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
          Length = 472

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 78/162 (48%), Gaps = 29/162 (17%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTS  ++I  SE        +  ++   T       + F+++ Y +G  Y  HYD    
Sbjct: 318 VRTSKDSYIVDSES-------LNERVTDMTGFSMEMSDPFSLINYGLGGHYMLHYDFH-- 368

Query: 186 AEYG----PQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRG 241
            EY     P+   R+A+ L YL +V+ GG T+FP                I + V P++G
Sbjct: 369 -EYTNTTRPKQGDRIATVLFYLGEVDSGGATIFP---------------KINIAVTPKKG 412

Query: 242 DGLLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + +Y+L  +G ++  SLH +CPVI G K+V TKWI +  Q
Sbjct: 413 SAVFWYNLHNSGAMNLKSLHSACPVISGSKYVLTKWINELPQ 454


>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
           catus]
          Length = 535

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 96/217 (44%), Gaps = 23/217 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +VLS +P  + + N  +  +   +   A   LK + +  +  +        R S   
Sbjct: 343 LNVEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTA 402

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP------A 186
           ++   ED   + + I   I     L     E   +  Y IG  Y+ H D          +
Sbjct: 403 WLD-KEDHPAV-KRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALS 460

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLF 246
           EY  ++  R+A+ L+YLS+VE GG T+FP                 G++V+PR+G    +
Sbjct: 461 EYTSRIGNRIATVLIYLSNVEAGGATVFP---------------KAGVRVEPRQGSAAFW 505

Query: 247 YSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           Y++  NG  ++ S+H +CPV+ G KW A  W R+  Q
Sbjct: 506 YNMHRNGEGNKLSVHAACPVLIGSKWAANLWFREVGQ 542


>gi|395516293|ref|XP_003762325.1| PREDICTED: transmembrane prolyl 4-hydroxylase [Sarcophilus
           harrisii]
          Length = 517

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 90/208 (43%), Gaps = 40/208 (19%)

Query: 111 QLALRQ------GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---H 161
           QL LR        + V+ +   R S  T++   E    ++  I  ++ R T LP+    H
Sbjct: 267 QLNLRDFHKYMGSQKVKVSDLVRNSQHTWLYQGEGAHQVMRSIRQRVLRLTRLPRQIVEH 326

Query: 162 GEAFNVLRYEIGQKYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVE 207
            E   V+RY+ G  Y +H D+                 E  P + S R  + L YL++V 
Sbjct: 327 SEPMQVVRYDQGGHYHAHMDSGPVFPETACSHTKLIANETSPFETSCRYVTVLFYLNNVT 386

Query: 208 EGGETMFP------------FENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT- 254
            GGET FP             +N + L     +     L+VKPR+G  + +Y+   +G  
Sbjct: 387 GGGETTFPVADNRTYDELSLIQNDVDLRDTRKHCDKGNLRVKPRQGTAVFWYNYLSDGQG 446

Query: 255 ----IDRTSLHGSCPVIKGEKWVATKWI 278
               +D  SLHG C V +G KW+A  WI
Sbjct: 447 WVGELDEYSLHGGCLVTQGTKWIANNWI 474


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/220 (29%), Positives = 100/220 (45%), Gaps = 32/220 (14%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A+ F +  S E+ + I   A  RLK + +   +   +E T   R S  
Sbjct: 319 PFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKS 377

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AF 183
            ++   + +  +++ I  +I   T L Q   E   +  Y +G  YD H+D        AF
Sbjct: 378 AWLKGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAF 435

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDG 243
                G     RLA+ L YL++ E GG T+F                 +   V P +   
Sbjct: 436 ESLGTG----NRLATVLFYLTEPEIGGGTVF---------------TELRTAVMPSKNGA 476

Query: 244 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           L +Y+L+ +G  D  + H +CPV+ G KWVA KWI ++ Q
Sbjct: 477 LFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQ 516


>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 506

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRV 422

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 423 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFHERGQ 494


>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Loxodonta africana]
          Length = 536

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRV 452

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 453 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 497

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 498 YRTRHAACPVLVGCKWVSNKWFHERGQ 524


>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
           ANT32C12]
          Length = 186

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 57/193 (29%), Positives = 95/193 (49%), Gaps = 29/193 (15%)

Query: 100 IATAKKRLKPSQLALRQGETVESTK----GTRTSSGTFISASEDKTGILELIEHKIARAT 155
           + +A+  L+  Q  + +   +  ++     +RT+S  +I    D + I+  +  + +   
Sbjct: 9   LMSARPLLRLDQARVERATVITDSEHQFHDSRTNSYAWIQ--HDASEIIHEVSKRFSILV 66

Query: 156 MLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM----SQRLASFLLYLSDVEEGG 210
            +P  + E F ++ Y  G +Y  H+DAF+ + E G        QR+ + L YL+DVE+GG
Sbjct: 67  KMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTEEGRNNWFPGGQRMVTALAYLNDVEDGG 126

Query: 211 ETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGT--IDRTSLHGSCPVIK 268
            T FP                I + VKP +GD ++F++   +GT  I+  SLHG  PVI 
Sbjct: 127 ATDFP---------------DIHVSVKPNKGDVVVFHNC-KDGTSDINPNSLHGGSPVIS 170

Query: 269 GEKWVATKWIRDQ 281
           GEKW    W R +
Sbjct: 171 GEKWAVNLWFRQE 183


>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 531

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 91/200 (45%), Gaps = 31/200 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS-ASE 138
           P  L F    + E  + I   A  RL+PS++   + Q      T   R S   F   A E
Sbjct: 342 PDVLVFHEMITEEVAEKIRDVANPRLRPSEVIDPIIQKHV---TASYRVSKNVFFDDAFE 398

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------EYGPQM 192
           ++  I   +   +  AT L     E   V  Y +G +Y+ H D  +P       E+G   
Sbjct: 399 EELEISRKLRPLVEDATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHG--- 455

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+ L+YLSDVE GG+T+F                 +GL +KP+ GD   +++L+ N
Sbjct: 456 -NRIATLLIYLSDVERGGDTVFT---------------RLGLSLKPKLGDAAFWHNLYKN 499

Query: 253 GTIDRTSLHGSCPVIKGEKW 272
           G+    + H SCPV+ G KW
Sbjct: 500 GSGIYATEHASCPVVSGSKW 519


>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
 gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
 gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
           mulatta]
          Length = 535

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 24/207 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  +  ++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRV 451

Query: 197 ASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPNGTID 256
           A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +G  D
Sbjct: 452 ATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 257 RTSLHGSCPVIKGEKWVATKWIRDQEQ 283
             + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFHERGQ 523


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F +  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPVRKEVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+I   T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 391 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 493

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 494 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 532


>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
 gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
 gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
          Length = 537

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 99/211 (46%), Gaps = 32/211 (15%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  +  ++   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTG--- 450

Query: 193 SQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLFPN 252
             R+A+FL Y+SDVE GG T+FP                +G  + P++G  + +Y+L  +
Sbjct: 451 -NRVATFLNYMSDVEAGGATVFP---------------DLGAAIWPKKGTAVFWYNLLRS 494

Query: 253 GTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
           G  D  + H +CPV+ G KWV+ KW  ++ Q
Sbjct: 495 GEGDYRTRHAACPVLVGCKWVSNKWFHERGQ 525


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 98/213 (46%), Gaps = 23/213 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L+  P   YF    ++ + + II      ++ S++   Q  T   T   RTS+ 
Sbjct: 329 PFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNAT---TSEIRTSAN 385

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
           T++  +E+    L  I+ ++   T L     E   ++ Y IG +Y+ H+D    P +   
Sbjct: 386 TWLWYNENPW--LSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFG 443

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSLF 250
               R+ + L Y++DV  GG T FPF               + L V P +G  L++Y+L 
Sbjct: 444 WKGNRMLTALFYINDVALGGATAFPF---------------LQLAVPPVKGSLLVWYNLH 488

Query: 251 PNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +   D  + H  CPVIKG KW+  +W  +  Q
Sbjct: 489 RSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQ 521


>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 518

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 87/188 (46%), Gaps = 25/188 (13%)

Query: 107 LKPSQLALR--QGETVESTKGT------RTSSGTFISASE-DKTGILELIEHKIARATML 157
           LK   LAL   +  TV+S  G       +T SG     S+ D    L+ ++ +I   T  
Sbjct: 334 LKIKTLALENMKDATVKSVDGKGDSLIEKTRSGQVYWISKVDAVEYLDALDTRIESFTGF 393

Query: 158 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
                E + ++ Y +G  Y  H+D+F  A    Q   RL + L YL+DV+  G T FP  
Sbjct: 394 STKTAEQYQIVNYGLGGHYLPHHDSFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLL 453

Query: 218 NGIFLDSGYDYKKCIGLKVKPRRGDGLLFYSL-FPNGTIDRTSLHGSCPVIKGEKWVATK 276
           N I                   +G  L++ +L   NG     SLHGSCP++KG KW+ T+
Sbjct: 454 NII---------------APAEKGAALVWNNLHMSNGQKFYESLHGSCPLLKGNKWIMTR 498

Query: 277 WIRDQEQH 284
           W+ ++ QH
Sbjct: 499 WLYEEGQH 506


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 64/219 (29%), Positives = 103/219 (47%), Gaps = 25/219 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P  + + +F +  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 323 LQPIRKEVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 379

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+I   T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 380 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 437

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPFENGIFLDSGYDYKKCIGLKVKPRRGDGL 244
              Y  +   R+A+F++YLS VE GG T F + N                 V   +   L
Sbjct: 438 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIYAN---------------FSVPVVKNAAL 482

Query: 245 LFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIRDQEQ 283
            +++L  +G  D  +LH  CPV+ G+KWVA KWI +  Q
Sbjct: 483 FWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQ 521


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.135    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,646,853,308
Number of Sequences: 23463169
Number of extensions: 194179654
Number of successful extensions: 384605
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1423
Number of HSP's successfully gapped in prelim test: 591
Number of HSP's that attempted gapping in prelim test: 379607
Number of HSP's gapped (non-prelim): 2250
length of query: 286
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 145
effective length of database: 9,050,888,538
effective search space: 1312378838010
effective search space used: 1312378838010
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)