BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 027189
         (227 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 286

 Score =  337 bits (864), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 154/211 (72%), Positives = 185/211 (87%)

Query: 13  GLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQS 72
           GLP  +L+CS FF+AGFY ST +S+DVP I+P+LR LEV ++E    +P G TG++ I+S
Sbjct: 14  GLPAVILVCSVFFVAGFYASTLISQDVPVIKPRLRMLEVTDEEKHQAMPRGVTGESYIES 73

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           IPFQVLSW+PRA+YFP+FA+ EQC++II  AK RLKPS LALR+GET ESTKGTRTSSGT
Sbjct: 74  IPFQVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKGTRTSSGT 133

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F+SASED TG L+ IEHKIARATM+P++HGEAFN+LRYEIGQKYDSHYD+FNPAEYGPQM
Sbjct: 134 FLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQM 193

Query: 193 SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           SQR+ASFLLYLSDVE+GGETMFPFE  ++IS
Sbjct: 194 SQRVASFLLYLSDVEKGGETMFPFENGVKIS 224


>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 288

 Score =  328 bits (842), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 152/209 (72%), Positives = 179/209 (85%), Gaps = 2/209 (0%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
           KLGL T ++ CS  FL GFYGST LS++VP ++P+LR LE+VE  +E+   +PHG TG+ 
Sbjct: 12  KLGLATVIVFCSLCFLVGFYGSTLLSQNVPRVKPRLRMLEMVENGEEDTGSMPHGVTGEE 71

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           S+ SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LALR+GET E+TKGTRT
Sbjct: 72  SVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSGTFISASED TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           GPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFE 220


>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score =  325 bits (834), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 154/209 (73%), Positives = 180/209 (86%), Gaps = 2/209 (0%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEK-ENESG-LPHGETGDA 68
           KLGL T ++ CS  FL GFYGST LS++VP ++P+LR LE+VE  E E+G +PHG TG+ 
Sbjct: 12  KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLEMVENGEEEAGSMPHGVTGEE 71

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LALR+GET E+TKGTRT
Sbjct: 72  SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           GPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFE 220


>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
 gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
 gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
 gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 288

 Score =  325 bits (833), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 152/209 (72%), Positives = 179/209 (85%), Gaps = 2/209 (0%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVE--KENESGLPHGETGDA 68
           KLGL T ++ CS  FL GFYGST LS++VP ++P+LR L++VE  +E  S +PHG TG+ 
Sbjct: 12  KLGLATVIVFCSLCFLFGFYGSTLLSQNVPRVKPRLRMLDMVENGEEEASSMPHGVTGEE 71

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LALR+GET E+TKGTRT
Sbjct: 72  SIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRT 131

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+GQKYDSHYD FNP EY
Sbjct: 132 SSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEY 191

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           GPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 192 GPQSSQRIASFLLYLSDVEEGGETMFPFE 220


>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
 gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
          Length = 294

 Score =  323 bits (829), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 156/225 (69%), Positives = 182/225 (80%), Gaps = 9/225 (4%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR--------DVPSIRPKLRTLEVV 52
           MK K KRS  KLGLP A++LCS FFLAGFY S F           DVP  RP+LR LEV 
Sbjct: 1   MKIKTKRSKRKLGLPAAIILCSLFFLAGFYSSIFFPHVSASLSACDVPVTRPRLRMLEVA 60

Query: 53  EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
           E ++++ +PHG TG+AS++SIPFQVLSW+PRALYFP FA+ EQC+SII   + +LKPS L
Sbjct: 61  EGDHDT-MPHGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTL 119

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
           ALR+GET ESTK TRTSSG+F+S SED+TG L+ IE KIA+ATM+PQ+HGEAFN+LRYEI
Sbjct: 120 ALRKGETAESTKDTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEI 179

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           GQKYDSHYDAFNP EYG Q SQR ASFLLYLS+VEEGGETMFPFE
Sbjct: 180 GQKYDSHYDAFNPDEYGQQSSQRTASFLLYLSNVEEGGETMFPFE 224


>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
          Length = 284

 Score =  322 bits (824), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 160/223 (71%), Positives = 183/223 (82%), Gaps = 3/223 (1%)

Query: 1   MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
           MKGK K     KLGL    +  SFFFLAG +GS   S+DV  +R + R LE VE+   S 
Sbjct: 1   MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESVEEY--SP 58

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +PHGETG++S+  IPFQVLSW+PRALYFP FA+AEQCQSII  AK  L+PS LALRQGET
Sbjct: 59  MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 118

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 119 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 178

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE DL I
Sbjct: 179 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNI 221


>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 294

 Score =  310 bits (793), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 155/228 (67%), Positives = 181/228 (79%), Gaps = 6/228 (2%)

Query: 1   MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
           MKGK+ RS+    +KLGLP  + +LC F FLAGF+GST LS+DV   RP+ R L+     
Sbjct: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFCFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60

Query: 56  NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
            E  L   GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+  AK +L+PS LAL
Sbjct: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R+GET ESTKG RTSSG F SASED++G L +IE KIARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQ 180

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L +
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNM 228


>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score =  307 bits (787), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 158/223 (70%), Positives = 180/223 (80%), Gaps = 11/223 (4%)

Query: 1   MKGKAKRS-STKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESG 59
           MKGK K     KLGL    +  SFFFLAG +GS   S+      P+L  LE VE+   S 
Sbjct: 1   MKGKGKGVWRPKLGLLLLFISWSFFFLAGLFGSMLFSQ------PRL--LESVEEY--SP 50

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           +PHGETG++S+  IPFQVLSW+PRALYFP FA+AEQCQSII  AK  L+PS LALRQGET
Sbjct: 51  MPHGETGESSVDMIPFQVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGET 110

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTKGTRTSSGTFISASEDKTGIL+ +E KIA+ATM+P++HGEAFN+LRYEIGQ+Y+SH
Sbjct: 111 DESTKGTRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSH 170

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           YDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE DL I
Sbjct: 171 YDAFNPAEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNI 213


>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1-like [Cucumis sativus]
          Length = 294

 Score =  307 bits (787), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 154/228 (67%), Positives = 180/228 (78%), Gaps = 6/228 (2%)

Query: 1   MKGKAKRSS----TKLGLPTAL-LLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE 55
           MKGK+ RS+    +KLGLP  + +LC F FLAGF+GST LS+DV   RP+ R L+     
Sbjct: 1   MKGKSGRSNWSLRSKLGLPALIFVLCLFXFLAGFFGSTLLSQDVDDDRPRARLLQSASDV 60

Query: 56  NESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
            E  L   GE GD SI SIPFQVLSWRPRALYFP FA+AEQCQSI+  AK +L+PS LAL
Sbjct: 61  TEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLAL 120

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R+GET ESTKG RTSSG F SASED++G L +IE K ARATM+P+THGEA+N+LRYEIGQ
Sbjct: 121 RKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQ 180

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           KY+SHYDAF P+EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L +
Sbjct: 181 KYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNM 228


>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
          Length = 282

 Score =  305 bits (781), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 154/223 (69%), Positives = 176/223 (78%), Gaps = 2/223 (0%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
           MKGK   S  KL LP   L  S FFLAG +GS   S+DV  +R   R LE VE+E  S +
Sbjct: 1   MKGKGGWSP-KLELPLLFLSWSLFFLAGLFGSMLFSQDVNGVRSPPRLLESVEEEYSS-M 58

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           PHGETG++S+  IPFQVLSW+PRA YFP+FA+AEQCQSII  AK  L PS L LR+GET 
Sbjct: 59  PHGETGESSVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRKGETE 118

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
           ESTKG RTSSGTFISASEDKTGIL+ IE KIA+ATM+P+ HGE FN+LRYEIGQ+Y+SHY
Sbjct: 119 ESTKGIRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHY 178

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           DA +PAEYG Q SQR+ASFLLYLSDVEEGGETMFPFE DL I+
Sbjct: 179 DAISPAEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNIN 221


>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 290

 Score =  302 bits (774), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 150/229 (65%), Positives = 177/229 (77%), Gaps = 7/229 (3%)

Query: 1   MKGKAKRSS----TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKE- 55
           MK K  +      +KLGLP   L C FFFLAG + S  +S++V   + + R L+ V++E 
Sbjct: 1   MKAKGSKGKWSIKSKLGLPVVFLSCLFFFLAGLFASNLISQNVNGDKNR-RQLQWVKEEI 59

Query: 56  -NESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
                LP G+TGD  +  IPFQVLSW+PRALYFPNFA+AEQCQS+I  AK  L PS LAL
Sbjct: 60  IEYDLLPSGDTGDDYLTVIPFQVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLAL 119

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R+GET E+TKG RTSSG F+SASEDKTG+L+ IE KIARATMLP+ +GEAFN+LRYEIGQ
Sbjct: 120 RKGETEENTKGIRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQ 179

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           KY+SHYDAFNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE DL + 
Sbjct: 180 KYNSHYDAFNPAEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVD 228


>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  296 bits (759), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 144/224 (64%), Positives = 175/224 (78%), Gaps = 2/224 (0%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTL-EVVEKENESG 59
           MKGK K S  KLG+PT  +LC+ FF  GF+ S  L +D+  + P+ R L E V+KE E  
Sbjct: 2   MKGKVKSSKLKLGVPTLFILCALFFFVGFFVSPLLFQDLDDVGPRSRILQESVKKEYEP- 60

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           L HGE+G+  + SIP Q+LSWRPRA++FPNF S E CQ II  AK +L+PS+LALR+GET
Sbjct: 61  LEHGESGEPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGET 120

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTK TRTSSGTFISASEDK+GIL+L+E KIA+ TM+P+THGE FN+L+YE+GQKYDSH
Sbjct: 121 AESTKDTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSH 180

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           YDAFNP EYG   SQR+ASFLLYLS+VE GGETMFP+E  L I 
Sbjct: 181 YDAFNPDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNID 224


>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 297

 Score =  296 bits (759), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 142/219 (64%), Positives = 171/219 (78%), Gaps = 4/219 (1%)

Query: 9   STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
           + KL  P   L+C FFFLAGF+GST  S     D   +RP+ R L+  ++   + +  GE
Sbjct: 15  TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
            GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK  LKPS LALR+GET E+TK
Sbjct: 75  FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           G RTSSG F+SAS DKT  LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y+SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFN 194

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L + 
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMD 233


>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
          Length = 297

 Score =  295 bits (754), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 142/219 (64%), Positives = 170/219 (77%), Gaps = 4/219 (1%)

Query: 9   STKLGLPTALLLCSFFFLAGFYGSTFLSR----DVPSIRPKLRTLEVVEKENESGLPHGE 64
           + KL  P   L+C FFFLAGF+GST  S     D   +RP+ R L+  ++   + +  GE
Sbjct: 15  TNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDSTKETEYNLMTAGE 74

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
            GD SI SIPFQVLSW+PRALYFPNFA+AEQC++I++ AK  LKPS LALR+GET E+TK
Sbjct: 75  FGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTK 134

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           G RTSSG F+SAS DKT  LE IE KIARATM+P++HGEAFN+LRYE+GQ+Y SHYDAFN
Sbjct: 135 GIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFN 194

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           P EYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L + 
Sbjct: 195 PDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMD 233


>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
          Length = 286

 Score =  285 bits (730), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 139/184 (75%), Positives = 158/184 (85%)

Query: 34  FLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASA 93
           F  +DVPS+RP+ R LE V +E+   LP GETG+ S+ SIPFQVLSW PRALYFPNFAS 
Sbjct: 35  FSHQDVPSVRPRPRFLESVYQEDFDPLPIGETGEHSLISIPFQVLSWFPRALYFPNFASI 94

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIAR 153
           EQCQSII  AK  ++PS LALR GET E+TKG RTSSGTFISASEDKTGIL+LIE KIA+
Sbjct: 95  EQCQSIIKMAKANMEPSSLALRTGETEETTKGIRTSSGTFISASEDKTGILDLIEEKIAK 154

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
           ATM+P+THGEAFNVLRYEIGQ+Y SHYDAF+PA+YGPQ SQR ASFLLYLSDVEEGGET+
Sbjct: 155 ATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQRAASFLLYLSDVEEGGETV 214

Query: 214 FPFE 217
           FP+E
Sbjct: 215 FPYE 218


>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
 gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
          Length = 297

 Score =  285 bits (728), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 144/233 (61%), Positives = 172/233 (73%), Gaps = 14/233 (6%)

Query: 1   MKGKAKRSSTK------LGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIRPKLRTL 49
           M+GKA + + +      LG P  +L CSFFF+AG + S  L     S D   +R + R L
Sbjct: 1   MRGKACKENWRFYKKPNLGFPALILSCSFFFIAGLFASNLLLSQGTSSDERWLRARARQL 60

Query: 50  EVVEKENESG---LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
           + VE+E  S    LP GE+GD  I  IPFQVLSWRPRALY+P F +AEQCQ II  AK  
Sbjct: 61  QSVEEEIISKYDLLPSGESGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPS 120

Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
           L+PS LALR+GET E+TKG RTSSG F+ +SED+ G+L++IE KIARATM+P THGEAFN
Sbjct: 121 LQPSTLALRKGETAETTKGIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFN 180

Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
           VLRYEIGQKYD+HYDAFNPAEYGPQ SQR+A+FLLYLS+ EEGGET FP E D
Sbjct: 181 VLRYEIGQKYDAHYDAFNPAEYGPQTSQRVATFLLYLSNFEEGGETTFPIEND 233


>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
          Length = 318

 Score =  283 bits (723), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 149/235 (63%), Positives = 171/235 (72%), Gaps = 12/235 (5%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFL-----SRDVPSIR----PKL-RTLE 50
           ++G       K+  P+  LLC FFFLAGF+GST         D   +R    P+  R LE
Sbjct: 6   VRGNWSWRMNKIAFPSVFLLCIFFFLAGFFGSTLFFQYHSQEDEHGLRQIQRPRTTRLLE 65

Query: 51  -VVEKENESGL-PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
            + EKE ES L P GETGD  I +IPFQVLSW P ALYFPNFA+AEQC+SII TAK+ LK
Sbjct: 66  NLTEKETESHLLPAGETGDNFITTIPFQVLSWNPHALYFPNFATAEQCESIIETAKEGLK 125

Query: 109 PSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
           PS L LR GET EST G RTSSG FISA EDKTG+L++IE KIARAT +P+THGEAFNVL
Sbjct: 126 PSTLVLRVGETDESTTGIRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVL 185

Query: 169 RYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           RY++GQKY SHYDA +P  YGPQ SQR+ASFLLYLSDV EGGETMFPFE  L + 
Sbjct: 186 RYKVGQKYSSHYDALHPDIYGPQKSQRMASFLLYLSDVPEGGETMFPFENGLNMD 240


>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
          Length = 293

 Score =  282 bits (722), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 146/226 (64%), Positives = 174/226 (76%), Gaps = 3/226 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
           +KG     + KL  P   L+C FFFLAGF+G T  S    D   +RP+ R L+   +   
Sbjct: 6   VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII  AK  LKPS LALRQG
Sbjct: 66  NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ET ++TKG RTSSG F+SASEDKT  L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L + 
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMD 231


>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
          Length = 293

 Score =  282 bits (721), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 146/226 (64%), Positives = 174/226 (76%), Gaps = 3/226 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
           +KG     + KL  P   L+C FFFLAGF+G T  S    D   +RP+ R L+   +   
Sbjct: 6   VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLDSTNEAEY 65

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + +P GE GD SI SIPFQVLSWRPRA+YFPNFA+AEQC+SII  AK  LKPS LALRQG
Sbjct: 66  NLMPVGELGDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLALRQG 125

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ET ++TKG RTSSG F+SASEDKT  L++IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEDNTKGIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYN 185

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L + 
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMD 231


>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 290

 Score =  281 bits (720), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 140/223 (62%), Positives = 171/223 (76%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGL 60
           MKGK +RS  KLG+PT   LC+ FF  GF+ S  L +D+  + P+ R L+   K+    L
Sbjct: 1   MKGKVRRSKLKLGVPTFFFLCALFFFVGFFVSPLLFQDLGDVGPRSRILQESVKKAYEPL 60

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
            HGE+G+  + SIPFQ+LSWRPRA+YFPNF S E CQ II  AK +L+PS+LALR+GET 
Sbjct: 61  EHGESGEPFLNSIPFQILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 120

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
           ESTK TRTSSGTFISASEDK+GIL+ +E KIA+ TM+P+THGE FN+L+YE+ QKYDSHY
Sbjct: 121 ESTKDTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHY 180

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           DAFNP EYG   SQR+ASFLLYLS+VE GGETMFP+E  L I 
Sbjct: 181 DAFNPDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNID 223


>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
 gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
          Length = 294

 Score =  280 bits (716), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 136/211 (64%), Positives = 166/211 (78%), Gaps = 7/211 (3%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D        P+ R LE    E    +P+GE+G
Sbjct: 22  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLEAAWPE----MPYGESG 77

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +A+   IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG 
Sbjct: 78  EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA+ED T  L  IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +YGPQ +QR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYE 228


>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
 gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
 gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
          Length = 294

 Score =  279 bits (713), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 135/211 (63%), Positives = 166/211 (78%), Gaps = 7/211 (3%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D        P+ R L+    E    +P+GE+G
Sbjct: 22  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLDAAWPE----MPYGESG 77

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +A+   IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG 
Sbjct: 78  EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA+ED T  L  IE KIARATMLP+ HGE FNVLRY IGQ+Y SHYDAF+PA
Sbjct: 138 RTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPA 197

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +YGPQ +QR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 198 QYGPQKNQRVASFLLYLTDVEEGGETMFPYE 228


>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
 gi|255641811|gb|ACU21174.1| unknown [Glycine max]
          Length = 293

 Score =  277 bits (708), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 146/226 (64%), Positives = 172/226 (76%), Gaps = 3/226 (1%)

Query: 1   MKGKAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR---DVPSIRPKLRTLEVVEKENE 57
           +KG     + KL  P   L+C FFFLAGF+G T  S    D   +RP+ R LE   +   
Sbjct: 6   VKGNWSLRTNKLSFPYVFLVCIFFFLAGFFGFTLFSHSQGDGDGLRPRQRLLEPGNEAEY 65

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + +P  + GD SI SIPFQVLSWRPRALYFPNFA+AEQC++II  AK  LKPS LALRQG
Sbjct: 66  NLMPVRDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQG 125

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ET E+TKG RTSSG F+SAS DKTG L +IE KIARATM+P++HGEAFN+LRYE+ Q+Y+
Sbjct: 126 ETEENTKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYN 185

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           SHYDAFNPAEYGPQ SQR+ASFLLYL+DVEEGGETMFPFE  L + 
Sbjct: 186 SHYDAFNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMD 231


>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
 gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
          Length = 294

 Score =  276 bits (707), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 134/211 (63%), Positives = 165/211 (78%), Gaps = 7/211 (3%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR---PKLRTLEVVEKENESGLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D        P+ R LE    E    +P+GE+G
Sbjct: 22  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPEEADMPVPRERLLETAWPE----MPYGESG 77

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +A+   IP+Q+LSW+PRALYFP FA++EQC++I+ TAK+RLKPS LALR+GET ESTKG 
Sbjct: 78  EAAPSLIPYQILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGI 137

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+SA+ED T  L  IE KIARATM+P+ HGE FNVLRY IGQ+Y SHYDAF+P 
Sbjct: 138 RTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPV 197

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +YGPQ SQR+ASFLLYL++VEEGGETMFP+E
Sbjct: 198 QYGPQKSQRVASFLLYLTNVEEGGETMFPYE 228


>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
 gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
          Length = 257

 Score =  275 bits (703), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 127/164 (77%), Positives = 145/164 (88%)

Query: 54  KENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           +E  S +PHG TG+ SI SIPFQVLSWRPRA+YFPNFA+AEQCQ+II  AK  LKPS LA
Sbjct: 6   EEEASSMPHGVTGEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALA 65

Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
           LR+GET E+TKGTRTSSGTFISASE+ TG L+ +E KIARATM+P++HGE+FN+LRYE+G
Sbjct: 66  LRKGETAENTKGTRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELG 125

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           QKYDSHYD FNP EYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 126 QKYDSHYDVFNPTEYGPQSSQRIASFLLYLSDVEEGGETMFPFE 169


>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 347

 Score =  273 bits (697), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 135/214 (63%), Positives = 166/214 (77%), Gaps = 10/214 (4%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSI----RP--KLRTLEVVEKENESGLPHG 63
           T+L LP  LL CS FFLAGF GS   ++D        RP  + R +E V  E    + +G
Sbjct: 72  TRLRLPVVLLSCSLFFLAGFLGSLLFTQDPQGEEDPERPLRRERVMEAVWPE----MAYG 127

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           E+GD +   IP+Q+LSW+PRALYFP FA+AEQC++++ TAK RL+PS LALR+GE+ E+T
Sbjct: 128 ESGDPAPSLIPYQILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRKGESEETT 187

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
           KG RTSSGTF+SA ED TG L  IE KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF
Sbjct: 188 KGIRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAF 247

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +PA+YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 248 DPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYE 281


>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
          Length = 285

 Score =  271 bits (692), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/226 (65%), Positives = 173/226 (76%), Gaps = 6/226 (2%)

Query: 1   MKGKAK---RSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE 57
           MK KAK   R  TKLGLP   L C FFFLAG +GS  L ++  S  P  R L + E+ + 
Sbjct: 1   MKSKAKGKWRFGTKLGLPVVFLFCLFFFLAGLFGSGLLPQEFSSSEP--RRL-IREETDY 57

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
             L HGE+G+ S+ SIPFQVLSWRPRALYFPNFA++EQCQSII  AK  L PS +ALR G
Sbjct: 58  DPLAHGESGEDSVTSIPFQVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVG 117

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           E   +T+G RTSSG FISASEDKTG L+LIE KIAR  M+P+THGEAFNVLRYEIGQ+Y+
Sbjct: 118 EIRGNTEGIRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYN 177

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETMFPFE  L + 
Sbjct: 178 SHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMD 223


>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
 gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
 gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
          Length = 310

 Score =  270 bits (691), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 131/211 (62%), Positives = 166/211 (78%), Gaps = 4/211 (1%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D P    +L T    E+  E+   G+ +GE+G
Sbjct: 33  TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 91

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +     IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG 
Sbjct: 92  EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 151

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+S+ ED TG L  +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 152 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 211

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 212 QYGPQKSQRVASFLLYLTDVEEGGETMFPYE 242


>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
          Length = 280

 Score =  270 bits (691), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 131/211 (62%), Positives = 166/211 (78%), Gaps = 4/211 (1%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENES---GLPHGETG 66
           T+L LP  LL CS FFLAGF+GS   ++D P    +L T    E+  E+   G+ +GE+G
Sbjct: 3   TRLRLPVVLLSCSLFFLAGFFGSILFTQD-PQGEEELDTPMRRERLMEAAWPGMAYGESG 61

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +     IP+Q+LSW+PRALYFP FA+++QC++I+ TAK+RL PS LALR+GET ESTKG 
Sbjct: 62  EPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRKGETEESTKGI 121

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSGTF+S+ ED TG L  +E KIA+ATM+P+ HGE FN+LRYEIGQ+Y SHYDAF+PA
Sbjct: 122 RTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPA 181

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 182 QYGPQKSQRVASFLLYLTDVEEGGETMFPYE 212


>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 295

 Score =  270 bits (689), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 132/214 (61%), Positives = 166/214 (77%), Gaps = 10/214 (4%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSI----RP--KLRTLEVVEKENESGLPHG 63
           T+L LP  LL CS FFLAGF+GS   ++D        RP  + R +E V  E    + +G
Sbjct: 20  TRLRLPVVLLSCSLFFLAGFFGSLLFTQDPQGEEELERPLSRERLMEAVWPE----MAYG 75

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           ++GD +   IP+Q+LSW+PRALYFP FA++EQC++++ TAK RL+PS LALR+GET E+T
Sbjct: 76  DSGDPAPSLIPYQILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETT 135

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
           KG RTSSGTF+SA ED T  L  +E KIA+ATM+P++HGE FNVLRYEIGQKY SHYDAF
Sbjct: 136 KGIRTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAF 195

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           +PA+YGPQ SQR+ASFLLYL+DVEEGGETMFP+E
Sbjct: 196 DPAQYGPQKSQRVASFLLYLTDVEEGGETMFPYE 229


>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  268 bits (686), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 132/207 (63%), Positives = 157/207 (75%), Gaps = 6/207 (2%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASI 70
           KLG P  LL C+FFFL GFYGS  LS++   +      ++ V  E +    HG TGD+S+
Sbjct: 17  KLGFPLVLLCCTFFFLVGFYGSNSLSKEEKHV-----VIDPVTNE-KLVFEHGRTGDSSV 70

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
             IPFQVLSW+PRAL +PNFAS EQC++II  A+ RL PS LALR+GE+  +TK  RTSS
Sbjct: 71  TDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRTSS 130

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           GTF+ ASEDKT  L  +E K+ARATM+P+ +GEAFNVLRY  GQKYD HYD F+PAEYGP
Sbjct: 131 GTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGP 190

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFE 217
           Q SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 191 QPSQRMASFLLYLSDVEEGGETMFPFE 217


>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 297

 Score =  267 bits (683), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 136/225 (60%), Positives = 162/225 (72%), Gaps = 11/225 (4%)

Query: 4   KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSR------DVPSIRPKLRTLEVVEKENE 57
           K  R   KL LP+  L+C FFFLAGF  S+  S       D+           ++EK  +
Sbjct: 7   KGNRRMNKLDLPSVFLICIFFFLAGFSASSLFSHSQKHEYDLRLRPRPRPRARLLEKSTQ 66

Query: 58  SG-----LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
                  L  G++GD  +  IPFQVLSW PRALYFPNFASAEQC+SII  A+  LK S L
Sbjct: 67  EKTEYHLLKAGDSGDDYVTLIPFQVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTL 126

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
           ALR+GET ESTKG RTSSG F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+
Sbjct: 127 ALRKGETEESTKGIRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEV 186

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           GQKY+SHYDAF+ AEYGP  SQR+ASFLLYL+DV EGGETMFP+E
Sbjct: 187 GQKYNSHYDAFDEAEYGPLQSQRVASFLLYLTDVPEGGETMFPYE 231


>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 299

 Score =  259 bits (662), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 134/218 (61%), Positives = 158/218 (72%), Gaps = 13/218 (5%)

Query: 10  TKLGLPTALLLCSFFFLAGFYGSTFLSRDVPS--------IRPKLRTLE--VVEKENESG 59
            KL LP+  L+C FFFLAGF  S+  S   PS        + P+ R LE    EK     
Sbjct: 17  NKLDLPSVFLICIFFFLAGFSASSLFS---PSQEHEYDLRLIPRARLLEKSTHEKTEHHL 73

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           L  G++GD  I  IPFQVLSW PRALYFPNF SAEQC++II  A+  LKPS L LR+GET
Sbjct: 74  LKAGDSGDDYITLIPFQVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRKGET 133

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            ESTKG RTS G F+SASED+TGIL+ IE KIA+AT +P+THGEAFN+LRYE+GQKY  H
Sbjct: 134 EESTKGIRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPH 193

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           YDAF+ AE+GP  SQR ASFLLYL+DV EGGET+FP+E
Sbjct: 194 YDAFDEAEFGPLQSQRAASFLLYLTDVPEGGETLFPYE 231


>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 294

 Score =  254 bits (650), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 135/214 (63%), Positives = 156/214 (72%), Gaps = 7/214 (3%)

Query: 9   STKLGLPTALLLC---SFFFLAGFYGSTFLSRDVPSIRPKLRTLE-VVEKENESGLPH-G 63
           + K  LP+  LLC         GF        D   +R  +R LE  V+ E E  L H G
Sbjct: 14  TKKFSLPSVFLLCIFFFLAGFFGFSFFHHSQEDEYGVR--VRLLEKSVKDETEHRLLHAG 71

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           ++GD  I SIPFQVLSW PRALYFPNFASAEQC  II  AK  L PS+L LR+GET E T
Sbjct: 72  KSGDNFITSIPFQVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLREGETEEGT 131

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
           KG RTSSG FISASEDKTG+LE+I+ KIARA  +P+THG A+N+LRY++GQKY+SHYDAF
Sbjct: 132 KGIRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAF 191

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           NPAEYGPQ SQR+ASFLLYL+DV EGGETMFPFE
Sbjct: 192 NPAEYGPQESQRVASFLLYLTDVPEGGETMFPFE 225


>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score =  248 bits (634), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 126/216 (58%), Positives = 151/216 (69%), Gaps = 9/216 (4%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVP--SIRPKLRTLEVVEKENESG-------LP 61
           KL LP  LL C+   LAG+  S F  + +   +I    R     E E   G       L 
Sbjct: 17  KLSLPQVLLWCTICLLAGYAASNFFPQKIEEEAIYQPYRKSAQQEGEFPFGEFSEKVVLD 76

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           HG TGD  I  IPFQVLSW+PRAL +P FAS EQC++I+  A+ RL PS LALR+GE+ +
Sbjct: 77  HGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALALRKGESED 136

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           STK  RTSSGTF+ A ED T  LE +E K+A+ATM+P+ +GEAFNVL+Y +GQKYD HYD
Sbjct: 137 STKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYD 196

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
            F+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 197 VFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFE 232


>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
 gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
          Length = 292

 Score =  246 bits (627), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 124/225 (55%), Positives = 154/225 (68%), Gaps = 9/225 (4%)

Query: 1   MKGKAK--RSSTKLGLPTALLLCSFFFLAGFYGSTF------LSRDVPSIRPKLRTLEVV 52
           MKG+++      ++  P  L+LC   FL G++   F          +PS          +
Sbjct: 1   MKGRSRGRVCGHQVSAPLVLVLCVLAFLVGYFIPEFQQVILVTKHSIPSFSSFANQRHEL 60

Query: 53  EKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
             E+ +   HG TGD  +  IPFQVLSW PRAL FP FAS  QC++II+ AK +L PS L
Sbjct: 61  -LEDVTVAEHGVTGDDQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSL 119

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
           ALR+GET   T+  RTS G F+S+ +DKTG L  +E K+A+ATM+P++HGEAFNVLRYEI
Sbjct: 120 ALRKGETATETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEI 179

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           GQKY+SHYD FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 180 GQKYNSHYDVFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFE 224


>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
 gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
          Length = 231

 Score =  237 bits (604), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 109/156 (69%), Positives = 128/156 (82%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           HG TG+  +  IPFQVLSW PRAL FP FAS  QC++II+ AK +L PS LALR+GET  
Sbjct: 8   HGVTGEDQLAFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETAT 67

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            T+  RTS G F+S+ +DKTG L  +E K+A+ATM+P++HGEAFNVLRYEIGQKY+SHYD
Sbjct: 68  ETQDVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYD 127

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
            FNPAEYGPQ SQR+ASFLLYLSDVEEGGETMFPFE
Sbjct: 128 VFNPAEYGPQKSQRMASFLLYLSDVEEGGETMFPFE 163


>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 110/208 (52%), Positives = 141/208 (67%), Gaps = 12/208 (5%)

Query: 11  KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
           KLG P   L C FF  + GF     LS+ +     ++ T   V +E ES L HG     S
Sbjct: 9   KLGFPYVFLACCFFLAIFGFCFFNLLSQIISF--SEISTRRSVNEETES-LDHG-----S 60

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           + +IPF  LSW PR  Y PNFA+ +QC+++I  AK +LKPS LALR+GET E+T+  RT 
Sbjct: 61  VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTR 120

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
                   ED++GIL  IE KIA AT +P  + E+FN+LRY++GQKYDSHYDAF+PAEYG
Sbjct: 121 ---LKKTDEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYG 177

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE 217
           PQ+SQR+ +F+L+LS VEEGGETMFPFE
Sbjct: 178 PQISQRVVTFILFLSSVEEGGETMFPFE 205


>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 274

 Score =  197 bits (502), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 104/208 (50%), Positives = 136/208 (65%), Gaps = 10/208 (4%)

Query: 11  KLGLPTALLLCSFFF-LAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDAS 69
           KL  P   + C FF  + GF      S+ + S      T   V  E +S L HG    +S
Sbjct: 9   KLVFPYVFIACCFFLAIFGFCFFNLFSQGI-SFSEIPTTRRSVNDETDS-LDHG----SS 62

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           + +IPF  LSW PR  Y PNFA+ +QC+++I  AK +LKPS LALR+GET E+T+  R+ 
Sbjct: 63  VSNIPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSL 122

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
                   ED++G+L  IE KIA AT  P+ + E+FN+LRY++GQKYDSHYDAF+ AEYG
Sbjct: 123 HQ---HTDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYG 179

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFE 217
           P +SQR+ +FLL+LS VEEGGETMFPFE
Sbjct: 180 PLISQRVVTFLLFLSSVEEGGETMFPFE 207


>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
          Length = 341

 Score =  195 bits (495), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 88/155 (56%), Positives = 120/155 (77%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G  GD  +  + FQ+LS  PR++ + NFAS   C +I+  A+ RL  S LAL++GET+E+
Sbjct: 116 GALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLALKRGETLET 175

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
           TK  RTSSGTF+++  +++G L+ +E K+ARAT +P THGEA+N+LRYEIGQKYDSHYD 
Sbjct: 176 TKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDM 235

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           F+P++YGPQ SQR+ASFLLYL+  +EGGET+FP E
Sbjct: 236 FDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLE 270


>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
 gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
          Length = 175

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 88/121 (72%), Positives = 97/121 (80%), Gaps = 9/121 (7%)

Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
           AK +LKPS LALR+GET EST         FI  SEDKTG L+ IE KIA+ATM+PQ+HG
Sbjct: 2   AKSKLKPSTLALRKGETTEST---------FIGGSEDKTGTLDFIERKIAKATMIPQSHG 52

Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           EAFN+LRYEIGQKYDSHYDAFNP EYGPQ SQR+ASFLLYLS VEEGGETMFPFE    +
Sbjct: 53  EAFNILRYEIGQKYDSHYDAFNPDEYGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAV 112

Query: 223 S 223
           S
Sbjct: 113 S 113


>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
 gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
          Length = 231

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 84/155 (54%), Positives = 111/155 (71%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP  E+G  ++  IPFQ+LSW PR + FP F    + + +I  A K + PS LA R GET
Sbjct: 2   LPAAESGSDNVYVIPFQILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLAYRPGET 61

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
           V+ ++ TRTS+GTF++A+ D  G+L  +E +IA AT+LP  +GEAFNVL YE  Q YDSH
Sbjct: 62  VDPSQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSH 121

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           YD F+P E+GPQ SQR+A+ LLYLS+V EGGET+F
Sbjct: 122 YDTFDPKEFGPQPSQRIATVLLYLSEVLEGGETVF 156


>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 80/154 (51%), Positives = 103/154 (66%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           E+G+      P Q++SW PR + +P F   E+C+  +  AK RL PS LALR  E  + T
Sbjct: 98  ESGNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQET 157

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
           +  RTS GTF+S  +D  G++  +E K A+ T LP +HGE FNVLRY+ GQ YDSHYD F
Sbjct: 158 ENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIF 217

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
            P  YGPQ SQR+A+ L YL+DVEEGGET+FP E
Sbjct: 218 EPESYGPQPSQRMATILFYLTDVEEGGETIFPLE 251


>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 231

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 82/155 (52%), Positives = 107/155 (69%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP   +G      IPFQ+LSW PR + FP F    + + I+  A K + PS LA R GE 
Sbjct: 2   LPAAASGSDVTYIIPFQILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQ 61

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
           VES++ TRTS+GTF+S+  D  G+L  +E +IA AT+LP  +GEAFNVL YE  Q YDSH
Sbjct: 62  VESSQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSH 121

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
            D+F+P ++GPQ SQR+A+ LLYLS+V EGGET+F
Sbjct: 122 MDSFDPKDFGPQPSQRIATVLLYLSEVLEGGETVF 156


>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
          Length = 334

 Score =  163 bits (412), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 82/158 (51%), Positives = 102/158 (64%), Gaps = 12/158 (7%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           L HGE+G +     P Q+LS  PRA   P F S +QC  +IA A++RL PS LA + G+T
Sbjct: 116 LEHGESGHSFYTVQPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDT 175

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            E+T+             ED  G+L  IE K+A  TM+P  HGE FNVLRYE  Q YDSH
Sbjct: 176 AENTR------------DEDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSH 223

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFE 217
           YD+F+  EYGPQ SQR+A+ LLYL+DVEEGGET+F  E
Sbjct: 224 YDSFSEEEYGPQFSQRIATVLLYLADVEEGGETVFLLE 261


>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 162

 Score =  160 bits (404), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 93/161 (57%), Positives = 120/161 (74%), Gaps = 9/161 (5%)

Query: 11  KLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENE-SG--------LP 61
           KLGLP+   LC FFFL GF+GS+ +S+D  + R + R LE    E + +G        LP
Sbjct: 2   KLGLPSVFFLCFFFFLLGFFGSSLISQDYANARLRARILESSNGEEDITGGSSFGFDLLP 61

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
            G+TGD SI SIPFQVLSWRPR LYFP FA+A+ C++II+ A+ +LKPS+LALR+GET++
Sbjct: 62  SGDTGDDSISSIPFQVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLALRKGETLD 121

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
           ST+  RTSSG FISA EDKTGIL+ I+ KIARATM+P+ +G
Sbjct: 122 STREIRTSSGMFISADEDKTGILDFIDEKIARATMIPRANG 162


>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
          Length = 282

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 73/130 (56%), Positives = 84/130 (64%), Gaps = 33/130 (25%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE----------------------- 163
           R  SG FISASEDKTG L+LIE KIAR  M+P+THGE                       
Sbjct: 91  RLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVMK 150

Query: 164 ----------AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETM 213
                     AFN+LRYEIGQ+Y+SHYDAF+PAEYGPQ S R+A+FL+YLSDVEEGGETM
Sbjct: 151 RFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGETM 210

Query: 214 FPFEVDLQIS 223
           FPFE  L + 
Sbjct: 211 FPFENGLNMD 220


>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1036

 Score =  135 bits (339), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 68/150 (45%), Positives = 92/150 (61%), Gaps = 11/150 (7%)

Query: 47  RTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR 106
           RTL  ++  +  GL H       +     Q LSW PR  Y PNFA+ +QC+++I  AK +
Sbjct: 776 RTLITLDSSDSQGLVH-------VLMWNDQGLSWNPRVFYLPNFATKQQCEAVIDMAKPK 828

Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
           LKPS LALR+    E+               ED++G+L  IE KIA AT  P+ + E+FN
Sbjct: 829 LKPSTLALRK----ETKHFQMQYRSLHQHTDEDESGVLAAIEEKIALATRFPKDYYESFN 884

Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           +LRY++GQKYDSHYDAF+ AEYGP +SQR+
Sbjct: 885 ILRYQLGQKYDSHYDAFHSAEYGPLISQRV 914


>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
 gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
          Length = 429

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 75/161 (46%), Positives = 101/161 (62%), Gaps = 3/161 (1%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G++G+A  ++IPFQ+LS  PR   FPNF    + + IIA A K + PS LA R GE VE+
Sbjct: 203 GDSGEAFYRTIPFQILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEA 262

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
            +  RTS GTF+    D +  L  +E KIA  T +P+ +GE +NVL Y+  Q YDSH D+
Sbjct: 263 EQQVRTSKGTFLGG--DSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDS 320

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFEVDLQI 222
           F+P EYG Q SQR+A+ ++ LSD    GGET+F  E    I
Sbjct: 321 FDPKEYGQQYSQRIATVIVVLSDEGLVGGETVFKREGKANI 361


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 120/214 (56%), Gaps = 15/214 (7%)

Query: 4   KAKRSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIR-PKLRTLEVVEKENESGLPH 62
           K K S+  L  P +L+ C   FL  F   TFL   + ++R PKL  L        + + H
Sbjct: 3   KFKHSNVGLRKP-SLITCWTLFLTLFVTFTFLILIILTLRIPKLNHL--------NSITH 53

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVE 121
             T          Q++SW PRA  + NF + E+C+ +I  AK  +  S+ +  + G+++ 
Sbjct: 54  SNTLRNDDNKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLN 113

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S+   RTSSGTF+    D+  I+  IE +IA  T +P  HGE+FNVL YE+GQKY+ HYD
Sbjct: 114 SS--IRTSSGTFLDREGDE--IVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYD 169

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F          QR+A+ L+YLSDVEEGGET+FP
Sbjct: 170 YFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFP 203


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score =  129 bits (324), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 97/154 (62%), Gaps = 10/154 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           G+ GD  ++ I     SW PRA  + NF S E+C  +I+ AK  ++ S +   + G+ VE
Sbjct: 32  GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 86

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            +   RTSSG F++  +DK  I+  IE +IA  T +P  HGE   +L YE+GQKYD+HYD
Sbjct: 87  DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 142

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F+      ++ QR+A+ L+YLSDVEEGGET+FP
Sbjct: 143 FFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFP 176


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 69/154 (44%), Positives = 96/154 (62%), Gaps = 10/154 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVE 121
           G+ GD  ++ I     SW PRA  + NF S E+C  +I+ AK  ++ S +   + GE+V+
Sbjct: 67  GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVD 121

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG F++  +DK  I+  IE +IA  T +P  HGE   +L YE+GQKYD+HYD
Sbjct: 122 SR--VRTSSGMFLNRGQDK--IIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F       +  QR+A+ L+YLSDVEEGGET+FP
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFP 211


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 93/141 (65%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           + +SW+PRA  F NF S+E+C  +I  A+  +K S +   Q G++ +S    RTSSGTF+
Sbjct: 47  ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR--VRTSSGTFL 104

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D+  I+  IE +IA+ T +P+ HGE   VL YE+GQKYD+H+D F+         Q
Sbjct: 105 RRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQ 162

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 163 RVATVLMYLSDVEEGGETVFP 183


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 93/141 (65%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           + +SW+PRA  F NF S+E+C  +I  A+  +K S +   Q G++ +S    RTSSGTF+
Sbjct: 47  ETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSR--VRTSSGTFL 104

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D+  I+  IE +IA+ T +P+ HGE   VL YE+GQKYD+H+D F+         Q
Sbjct: 105 RRGQDE--IISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQ 162

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 163 RVATVLMYLSDVEEGGETVFP 183


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 95/154 (61%), Gaps = 10/154 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           G+ GD  ++ I     SW PRA  + NF S E+C  +I+ AK  ++ S +   + G+ VE
Sbjct: 67  GKRGDQWVEFI-----SWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVE 121

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            +   RTSSG F++  +DK  I+  IE +IA  T +P  HGE   +L YE+GQKYD+HYD
Sbjct: 122 DS--VRTSSGMFLNRGQDK--IVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F       +  QR+A+ L+YLSDVEEGGET+FP
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFP 211


>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 278

 Score =  127 bits (318), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 93/141 (65%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           Q++SW PRA  + NF + ++C+ +I TAK  ++ S +   + G++ +S+   RTSSGTF+
Sbjct: 68  QIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNETGKSKDSS--VRTSSGTFL 125

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  I+  IE +IA  T +P  +GE+FNVLRYE+GQKYD H D F          Q
Sbjct: 126 DRGGDE--IVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHLDYFADDYNTVNGGQ 183

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 184 RIATMLMYLSDVEEGGETVFP 204


>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 326

 Score =  126 bits (316), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 89/141 (63%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           Q++SW PRA  + NF + E+C+ +I  AK  +  S +   + G  V+S +  RTSSG F+
Sbjct: 116 QIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRE--RTSSGAFL 173

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  I++ IE +IA  T +P  HGE FNVL YE+GQKY+ HYD F          Q
Sbjct: 174 KRGSDR--IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQ 231

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 232 RIATMLMYLSDVEEGGETVFP 252


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 62/140 (44%), Positives = 87/140 (62%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 97  EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSR-VRTSSGMFLR 155

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I++ IE +IA  T +P  HGE   VL YE+GQKY+ H+D F+         QR
Sbjct: 156 RGQDK--IIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNGGQR 213

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVE+GGET+FP
Sbjct: 214 IATLLMYLSDVEDGGETVFP 233


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/141 (47%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +VLSW PRA  + NF S E+C+ +I+ AK  +  S +     ET +S     RTSSGTF+
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I++ IE +IA  T +P  HGE   VL YE GQKY+ HYD F          Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RMATMLMYLSDVEEGGETVFP 213


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 67/141 (47%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +VLSW PRA  + NF S E+C+ +I+ AK  +  S +     ET +S     RTSSGTF+
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I++ IE +IA  T +P  HGE   VL YE GQKY+ HYD F          Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RMATMLMYLSDVEEGGETVFP 213


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/140 (45%), Positives = 87/140 (62%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++SW PRA  + NF S E+C+ +I+ AK  +K S +   +    + ++  RTSSG F+ 
Sbjct: 78  EIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSR-VRTSSGMFLR 136

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              DK  I+  IE +IA  T +P  HGE   VL YE+GQKYD+HYD F          QR
Sbjct: 137 RGRDK--IIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVEEGGET+FP
Sbjct: 195 IATLLMYLSDVEEGGETVFP 214


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +VLSW PRA  + NF S E+C+ +I+ AK  +  S +     ET +S     RTSSGTF+
Sbjct: 77  EVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVV--DSETGKSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I++ IE +IA  T +P  HGE   +L YE GQKY+ HYD F          Q
Sbjct: 135 RRGRDK--IIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYDYFVDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RMATMLMYLSDVEEGGETVFP 213


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score =  123 bits (308), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 65/146 (44%), Positives = 87/146 (59%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK R++ S +       V+ST G       RTS
Sbjct: 108 EVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV-------VDSTTGKSKDSRVRTS 160

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 161 SGMFLRRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 218

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 219 KNGGQRMATILMYLSDVEEGGETIFP 244


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score =  123 bits (308), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 91/141 (64%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           ++LSW PRA  + NF S E+C+ +I  AK ++ K S +  + G++ ES    RTSSG F+
Sbjct: 80  EILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESR--VRTSSGMFL 137

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I++ IE +IA  T +P+ +GE   +L YE+GQKY+ HYD F          Q
Sbjct: 138 KRGKDK--IVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYFLDEFNTKNGGQ 195

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 196 RIATVLMYLSDVEEGGETVFP 216


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 72/166 (43%), Positives = 101/166 (60%), Gaps = 12/166 (7%)

Query: 67  DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           D S QSI  P +V  LSW+PRA  +  F SA +C  ++  AK +L+ S +A  + G++V 
Sbjct: 30  DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 89

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG F+S  +D+  ++  IE +IA  T LP+ +GEA  VLRYE G+KY+ HYD
Sbjct: 90  SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 145

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI---SW 224
            F+          R+A+ L+YLSDV +GGET+FP   D  +   SW
Sbjct: 146 YFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSW 191


>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 279

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 87/141 (61%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           +++SW PR   + NF + E+C+ +I  AK  ++ S +     G++V S+   RTSSGTFI
Sbjct: 71  EIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSS--ARTSSGTFI 128

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  IL  IE +IA  T +P  HGE  N+L YE+GQKYD H D F          +
Sbjct: 129 DRGYDK--ILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDFHTDYFEDEVNTKHGGE 186

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 187 RIATMLMYLSDVEEGGETVFP 207


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  ++ S +     ET  S     RTSSGTF+
Sbjct: 77  EVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVV--DSETGRSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S   DK   +  IE +IA  + +P  HGE   VL YE+GQKY+ H+D FN         Q
Sbjct: 135 SRGRDKK--IRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFNDEFNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RVATLLMYLSDVEEGGETVFP 213


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/141 (46%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 79  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE KIA  T +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YL+DVEEGGET+FP
Sbjct: 195 RIATVLMYLTDVEEGGETVFP 215


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/150 (43%), Positives = 91/150 (60%), Gaps = 5/150 (3%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
           D S      +++SW PRA  + NF + E+C+ +I  AK  ++ S +   + G++ +S   
Sbjct: 70  DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 127

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSGTF++   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD F  
Sbjct: 128 VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 185

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                   QR+A+ L+YLSDVEEGGET+FP
Sbjct: 186 EYNTRNGGQRIATVLMYLSDVEEGGETVFP 215


>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 266

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 82/214 (38%), Positives = 116/214 (54%), Gaps = 16/214 (7%)

Query: 7   RSSTKLGLPTALLLCSFFFLAGFYGSTFLSRDVPSIRP---KLRTL-EVVEKENESGLPH 62
           R S++  L  A+L+ S F +        LS  VPS      K   L  +V K  + G+  
Sbjct: 12  RKSSRSTLVFAVLIMSTFVILILLAFGILS--VPSNNAGSSKANDLTSIVRKTLQRGVE- 68

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVE 121
               D S      +++SW PRA  + NF + E+C+ +I  AK  ++ S +   + G++ +
Sbjct: 69  ----DDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTD 124

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSGTF++   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD
Sbjct: 125 SR--VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYD 180

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F          QR+A+ L+YLSDVEEGGET+FP
Sbjct: 181 YFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFP 214


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 89/146 (60%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +       V+S  G       RTS
Sbjct: 96  EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 148

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF+   +DK  ++  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+     
Sbjct: 149 SGTFLRRGQDK--VIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 206

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 207 KNGGQRIATLLMYLSDVEEGGETVFP 232


>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 89/141 (63%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +++SW PRA  + NF + E+C+ +I  AK  ++ S +   + G++ +S    RTSSGTF+
Sbjct: 79  EIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSR--VRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD F          Q
Sbjct: 137 ARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMDEYNTRNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 195 RIATVLMYLSDVEEGGETVFP 215


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 65/146 (44%), Positives = 89/146 (60%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +       V+S  G       RTS
Sbjct: 90  EVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 142

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF+   +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+     
Sbjct: 143 SGTFLRRGQDK--IVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNT 200

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 201 KNGGQRIATVLMYLSDVEEGGETVFP 226


>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
           Group]
          Length = 343

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+         QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVEEGGET+FP
Sbjct: 228 IATLLMYLSDVEEGGETIFP 247


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 91/154 (59%), Gaps = 16/154 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 98  EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++ +IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
               QR+A+ L+YLSDVEEGGET+FP + ++ +S
Sbjct: 209 KNGGQRMATLLMYLSDVEEGGETIFP-DANVNVS 241


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 12  EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 70

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+         QR
Sbjct: 71  RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 128

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVEEGGET+FP
Sbjct: 129 IATLLMYLSDVEEGGETIFP 148


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 61/140 (43%), Positives = 87/140 (62%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLSW PRA  + NF S E+C+ +I+ AK  +K S +        + ++  RTSSG F+ 
Sbjct: 111 EVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSR-VRTSSGMFLG 169

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +DK  I+  IE +I+  T +P  +GE   VL YE+GQKY+ H+D F+         QR
Sbjct: 170 RGQDK--IIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNGGQR 227

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVEEGGET+FP
Sbjct: 228 IATLLMYLSDVEEGGETIFP 247


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 86/146 (58%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +       V+S  G       RTS
Sbjct: 97  EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+   +DK  I+  IE +IA  T +P   GE   VL YE+GQKY+ H+D F+     
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVE+GGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEDGGETVFP 233


>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
          Length = 387

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 84/146 (57%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C  +I  AK  +  S +       V+ST G       RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFP 236


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/146 (43%), Positives = 89/146 (60%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  ++ S +       V+S+ G       RTS
Sbjct: 78  EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTV-------VDSSTGKSKDSRVRTS 130

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGTF++  +DK  I+  IE +++  T LP  HGE   +L YE+GQKY+ HYD F      
Sbjct: 131 SGTFLTRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNT 188

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 189 KNGGQRMATVLMYLSDVEEGGETVFP 214


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 80  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE +IA  + +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YL+DVEEGGET+FP
Sbjct: 196 RIATVLMYLTDVEEGGETVFP 216


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 79  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE KI+  T +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 137 ARGRDK--IVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YL+DVEEGGET+FP
Sbjct: 195 RIATVLMYLTDVEEGGETVFP 215


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 62/141 (43%), Positives = 89/141 (63%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           +V+SW PRA  + NF S ++C+ +I  AK  ++ S +     G++ +S    RTSSGTF+
Sbjct: 78  EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSR--VRTSSGTFL 135

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +  +DK  I+  IE +++  T LP  HGE   +L YE+GQKY+ HYD F          Q
Sbjct: 136 TRGQDK--IIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYDYFLDDYNTKNGGQ 193

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 194 RMATVLMYLSDVEEGGETVFP 214


>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
          Length = 376

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 84/146 (57%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C  +I  AK  +  S +       V+ST G       RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFP 236


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 60/142 (42%), Positives = 94/142 (66%), Gaps = 5/142 (3%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 133
            +V+SW PRA  + NF + E+C+ +I+ AK  +  S++  ++ G++++S    RTSSGTF
Sbjct: 80  LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKTGKSIDSR--VRTSSGTF 137

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           +    D+  I+E IE++I+  T +P  +GE   VL YE+GQKY+ H+D F       +  
Sbjct: 138 LKRGHDE--IVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHHDYFFDEFNVRKGG 195

Query: 194 QRLASFLLYLSDVEEGGETMFP 215
           QR+A+ L+YLSDV+EGGET+FP
Sbjct: 196 QRIATVLMYLSDVDEGGETVFP 217


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 68/154 (44%), Positives = 96/154 (62%), Gaps = 9/154 (5%)

Query: 67  DASIQSI--PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVE 121
           D S QSI  P +V  LSW+PRA  +  F SA +C  ++  AK +L+ S +A  + G++V 
Sbjct: 16  DKSDQSIVDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVL 75

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S    RTSSG F+S  +D+  ++  IE +IA  T LP+ +GEA  VLRYE G+KY+ HYD
Sbjct: 76  SN--IRTSSGMFLSKGQDE--VINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYD 131

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F+          R+A+ L+YLSD  +GGET+FP
Sbjct: 132 YFHDKYNQALGGHRIATVLMYLSDAVKGGETVFP 165


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 85/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF S E+C+ +I  AK R+  S +     ET +S     RTSSG F+
Sbjct: 105 EVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVV--DSETGKSKDSRVRTSSGMFL 162

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F          Q
Sbjct: 163 QRGRDK--VIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQ 220

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSD+EEGGET+FP
Sbjct: 221 RMATILMYLSDIEEGGETIFP 241


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 85/146 (58%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRMATLLMYLSDVEEGGETIFP 233


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 86/146 (58%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +       V+S  G       RTS
Sbjct: 154 EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 206

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+   +DK  I+  IE +IA  T +P   GE   VL YE+GQKY+ H+D F+     
Sbjct: 207 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 264

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVE+GGET+FP
Sbjct: 265 KNGGQRIATLLMYLSDVEDGGETVFP 290


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 64/146 (43%), Positives = 84/146 (57%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C  +I  AK  +  S +       V+ST G       RTS
Sbjct: 100 EVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 153 SGMFLQRGRDK--VIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNT 210

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 211 KNGGQRMATLLMYLSDVEEGGETIFP 236


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 63/141 (44%), Positives = 87/141 (61%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +++SW PRA  + NF S E+C+ +IA AK  + K + +  + G + +S    RTSSG F+
Sbjct: 79  EIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSR--VRTSSGMFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DK  I+  IE +IA  + +P  HGE   VL YE+GQKY++HYD F          Q
Sbjct: 137 RRGRDK--IIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHYDYFLDEFNTKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R A+ L+YLSDVEEGGET+FP
Sbjct: 195 RTATLLMYLSDVEEGGETVFP 215


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 59/142 (41%), Positives = 95/142 (66%), Gaps = 5/142 (3%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTF 133
            +V+SW PRA  + NF + E+C+ +I+ AK  +  S++  ++ G++++S    RTSSGTF
Sbjct: 80  LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSR--VRTSSGTF 137

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           ++   D+  I+E IE++I+  T +P  +GE   VL YE+GQ+Y+ H+D F       +  
Sbjct: 138 LNRGHDE--IVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGG 195

Query: 194 QRLASFLLYLSDVEEGGETMFP 215
           QR+A+ L+YLSDV+EGGET+FP
Sbjct: 196 QRIATVLMYLSDVDEGGETVFP 217


>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
          Length = 178

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S ++C  ++  AK R++ S +A    G+++ S    RTSSGTF+S 
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+  IE ++A  T LP+ + E+  +L YE+GQKYD+H+D F+      +   R+
Sbjct: 98  HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S  +C  ++  AK RL+ S +A    G++V S    RTSSGTF++ 
Sbjct: 34  LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQ--VRTSSGTFLNK 91

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VL YE+GQKYD+H+D F+          R+
Sbjct: 92  HEDE--IISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKLGGHRV 149

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 150 ATVLMYLTDVKKGGETVFP 168


>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 65/150 (43%), Positives = 95/150 (63%), Gaps = 11/150 (7%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK-GTRTS 129
           +S P  + + RP+A  F NF + E+C+ +IA AK +L PS +    G+  +STK G RTS
Sbjct: 225 RSPPLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGK--KSTKSGIRTS 282

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NP 185
           +G F++  +  T  + ++E ++A A  LP+ +GE   +LRYE GQKYD HYD F    NP
Sbjct: 283 AGMFLTKGQ--TPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINP 340

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           +    +  QR+A+ L+YL D EEGGET+FP
Sbjct: 341 SPN--RGGQRMATMLIYLKDTEEGGETIFP 368


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S ++C  ++  AK R++ S +A    G+++ S    RTSSGTF+S 
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+  IE ++A  T LP+ + E+  +L YE+GQKYD+H+D F+      +   R+
Sbjct: 98  HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 87/141 (61%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I+ AK  ++ S +     ET +S     RTSSGTF+
Sbjct: 77  EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DKT  +  IE +++  + +P  HGE   VL YE+GQKY+ H+D F          Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RIATVLMYLSDVEEGGETVFP 213


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 64/141 (45%), Positives = 88/141 (62%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           ++LSW PRA  + NF S E+C+ +I  AK  L K S +  + G++ ES    RTSSG F+
Sbjct: 70  EILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESR--VRTSSGMFL 127

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I++ IE +IA  T +P  +GE   VL Y +G+KY+ HYD F          Q
Sbjct: 128 KRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 185

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 186 RVATVLMYLSDVEEGGETVFP 206


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/140 (42%), Positives = 86/140 (61%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+SW+PRA  + NF +  +C+ +I  AK R++ S +        + +K  RTSSGTF+ 
Sbjct: 81  EVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSK-VRTSSGTFLP 139

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              DK  I+  IE +IA  + +P  HGE   +L YE+GQ+Y+ H+D F          QR
Sbjct: 140 RGRDK--IVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEPHFDYFMDEYNTKNGGQR 197

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVEEGGET+FP
Sbjct: 198 IATVLMYLSDVEEGGETVFP 217


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 87/141 (61%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I+ AK  ++ S +     ET +S     RTSSGTF+
Sbjct: 77  EVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVV--DSETGQSKDSRVRTSSGTFL 134

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               DKT  +  IE +++  + +P  HGE   VL YE+GQKY+ H+D F          Q
Sbjct: 135 PRGRDKT--VRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQ 192

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 193 RIATVLMYLSDVEEGGETVFP 213


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 90/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S ++C  ++  AK R++ S +A    G+++ S    RTSSGTF+S 
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQ--VRTSSGTFLSK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+  IE ++A  T LP+ + E+  +L YE+GQKYD+H+D F+      +   R+
Sbjct: 98  HEDD--IVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRV 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 156 ATVLMYLTDVKKGGETVFP 174


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/146 (43%), Positives = 85/146 (58%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFP 233


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score =  117 bits (294), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 88/139 (63%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S  +C  +I  AK  ++ S +A    G+++ S    RTSSG F++ 
Sbjct: 38  LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQ--VRTSSGAFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYEIGQKYD+H+D F+         QR 
Sbjct: 96  HEDE--IVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRF 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 154 ATVLMYLTDVKKGGETVFP 172


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/146 (43%), Positives = 85/146 (58%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFP 233


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 89/141 (63%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VLSW PRA  + NF + ++C+ +I  AK  +  S +   + G +++S    RTSSG F+
Sbjct: 92  EVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSN--VRTSSGWFL 149

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +  +DK  I+  IE +IA  + +P  HGE  +VL YE+ QKYD+HYD F+         Q
Sbjct: 150 NRGQDK--IIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQ 207

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R A+ L+YLSDVE+GGET+FP
Sbjct: 208 RGATMLMYLSDVEKGGETVFP 228


>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 243

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 65/150 (43%), Positives = 90/150 (60%), Gaps = 7/150 (4%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
           D S      +++SW PRA  + NF   E+C+ +I  AK  ++ S +   + G++ +S   
Sbjct: 70  DDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTGKSTDSR-- 125

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSGTF++   DKT  +  IE +I+  T +P  HGE   VL YEIGQKY+ HYD F  
Sbjct: 126 VRTSSGTFLARGRDKT--IREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYDYFMD 183

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                   QR+A+ L+YLSDVEEGGET+FP
Sbjct: 184 EYNTRNGGQRIATVLMYLSDVEEGGETVFP 213


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 62/146 (42%), Positives = 85/146 (58%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S ++C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 97  EVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    +K  ++  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 150 SGMFLQRGRNK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 207

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QR+A+ L+YLSDVEEGGET+FP
Sbjct: 208 KNGGQRIATLLMYLSDVEEGGETIFP 233


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 5/156 (3%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETV 120
           + +   +++ +   + +SW+PRA  +  F S E+C  +I+ AK  LK S +A    G++ 
Sbjct: 24  YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
            S    RTSSG FI   +D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+HY
Sbjct: 84  LSE--VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHY 139

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           D F       +   R+A+ L+YLSDV +GGET+FP 
Sbjct: 140 DYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPM 175


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           ++LSW PRA  + NF S E+C+ +I  AK  + K + +  + G + +S    RTSSG F+
Sbjct: 78  EILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTGRSKDSR--VRTSSGMFL 135

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++  IE +IA  + +P  HGE   VL YE+GQKY++H+D F          Q
Sbjct: 136 RRGRDR--VIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQ 193

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R A+ L+YLSDVEEGGET+FP
Sbjct: 194 RTATLLMYLSDVEEGGETVFP 214


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +SWRPRA  + NF + E+C   I  AK +L+ S +A  + G++VES    RTSSG F   
Sbjct: 65  ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESE--VRTSSGMFFRK 122

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  ++  +E +IA  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 123 AQDQ--VVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRV 180

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLSDVE+GGET+FP
Sbjct: 181 ATVLMYLSDVEKGGETVFP 199


>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
          Length = 564

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/144 (43%), Positives = 92/144 (63%), Gaps = 6/144 (4%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P  + + +P+A  F NF SAE+C  ++  AK  L PS +    G +V ST   RTS+G F
Sbjct: 278 PLVLSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAGGTSVPST--IRTSAGMF 335

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQM 192
           +  + DKT  LE IE++IA A+  P+ +GE   +LRY++GQKYD H+D F+ A    P+ 
Sbjct: 336 LRKAADKT--LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKR 393

Query: 193 -SQRLASFLLYLSDVEEGGETMFP 215
             QR+A+ L+YL + +EGGET+FP
Sbjct: 394 GGQRMATMLIYLENTKEGGETIFP 417


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/156 (41%), Positives = 92/156 (58%), Gaps = 5/156 (3%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETV 120
           + +   +++ +   + +SW+PRA  +  F S E+C  +I+ AK  LK S +A    G++ 
Sbjct: 24  YADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSR 83

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
            S    RTSSG FI   +D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+HY
Sbjct: 84  LSE--VRTSSGMFIGKGKDP--IVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHY 139

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           D F       +   R+A+ L+YLSDV +GGET+FP 
Sbjct: 140 DYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPM 175


>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
          Length = 322

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 62/152 (40%), Positives = 92/152 (60%), Gaps = 14/152 (9%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG----ETVESTKGTRTSS 130
            +++SW+PRAL    F +  +C  +I+ A+ RL+PS++  R G    ++V + +G  +SS
Sbjct: 15  IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGL-SSS 73

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-- 188
           GTF++  +D   ++  +E +I  AT LP +H E   VL+YE+GQKY +HYD     E   
Sbjct: 74  GTFLTKRQDS--VVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQ 131

Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFP 215
                G Q   R A+ L+YLSDVEEGGET FP
Sbjct: 132 LAIRRGEQGGSRYATMLMYLSDVEEGGETSFP 163


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 96/153 (62%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
           G +S++  P +V  LSW PRA  +  F S E+C  +I  AK +L+ S +A  + G+++ S
Sbjct: 45  GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMS 104

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F++ ++D+  I+  IE +IA  T LP  +GE+  +L YE GQKY+ H+D 
Sbjct: 105 E--VRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDY 160

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F+          R+A+ L+YLSDVE+GGET+FP
Sbjct: 161 FHDKANQVMGGHRIATVLMYLSDVEKGGETIFP 193


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/141 (41%), Positives = 90/141 (63%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW PRA+ + NF S E+C+ +I  AK  + K + +  + G + +S    RTSSGTF+
Sbjct: 81  EVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSR--VRTSSGTFL 138

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++E+IE +I+  T +P  +GE   VL Y++GQKY+ HYD F          Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDV++GGET+FP
Sbjct: 197 RIATVLMYLSDVDDGGETVFP 217


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 90/143 (62%), Gaps = 5/143 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  + NF S  +C  +I+ AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 9   LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSE--IRTSSGMFLMK 66

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE +IA  T LP+ +GEA  VLRY+ G+KY+ H+D F+          R+
Sbjct: 67  GQDD--IISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRI 124

Query: 197 ASFLLYLSDVEEGGETMFPFEVD 219
           A+ L+YLSDV +GGET+FP   D
Sbjct: 125 ATVLMYLSDVVKGGETVFPSSED 147


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/153 (41%), Positives = 95/153 (62%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
           G +S++  P +V  LSW PRA  +  F S E+C  +I  AK +L+ S +A    G+++ S
Sbjct: 46  GGSSVKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMS 105

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F++ ++D+  I+  IE +IA  T LP  +GE+  +L YE GQKY+ H+D 
Sbjct: 106 D--IRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDY 161

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F+          R+A+ L+YLSDVE+GGET+FP
Sbjct: 162 FHDKANQVMGGHRIATVLMYLSDVEKGGETIFP 194


>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
          Length = 287

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/143 (43%), Positives = 96/143 (67%), Gaps = 9/143 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
           +SWRPRA  + NF S E+C+ +   A+KRL K + +  + G++++ST   RTSSGTF++ 
Sbjct: 41  VSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDST--VRTSSGTFLAR 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---S 193
            ED+  ++  IE +I+  TM+P+ +GEA  +L+Y  GQKY+ H D F+  +Y  +     
Sbjct: 99  GEDE--VVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFH-DKYNSRTENGG 155

Query: 194 QRLASFLLYLSDVEEGGETMFPF 216
           QR+A+ L+YLS  EEGGET+FP+
Sbjct: 156 QRVATILMYLSTPEEGGETVFPY 178


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 57/140 (40%), Positives = 85/140 (60%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++SW PRA    NF + ++C  +I  A   ++ S +   Q      ++  RTSSG F++
Sbjct: 3   EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 61

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE KIA+ T +P+ HGE   VL YE GQKYD+H+D F          QR
Sbjct: 62  RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 119

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YL+DVEEGGET+FP
Sbjct: 120 IATLLMYLTDVEEGGETVFP 139


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 91/141 (64%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW PRA+ + NF + E+C+ +I+ AK  + K + +  + G + +S    RTSSGTF+
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR--VRTSSGTFL 138

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++E+IE +I+  T +P  +GE   VL Y++GQKY+ HYD F          Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDV++GGET+FP
Sbjct: 197 RIATVLMYLSDVDDGGETVFP 217


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 91/141 (64%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW PRA+ + NF + E+C+ +I+ AK  + K + +  + G + +S    RTSSGTF+
Sbjct: 81  EVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSR--VRTSSGTFL 138

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++E+IE +I+  T +P  +GE   VL Y++GQKY+ HYD F          Q
Sbjct: 139 RRGHDE--VVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFLDEFNTKNGGQ 196

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDV++GGET+FP
Sbjct: 197 RIATVLMYLSDVDDGGETVFP 217


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/139 (46%), Positives = 89/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F S E+C  +I  AK +L  S +A  + GE++ES +  RTSSG FI  
Sbjct: 21  LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQE--RTSSGMFIFK 78

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           +ED+  I+  IE +IA  T LP+ +GE   +LRYE GQKY++H D F       +   R 
Sbjct: 79  TEDE--IVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRA 136

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLSDV++GGET+FP
Sbjct: 137 ATVLMYLSDVKKGGETVFP 155


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 57/140 (40%), Positives = 85/140 (60%), Gaps = 3/140 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++SW PRA    NF + ++C  +I  A   ++ S +   Q      ++  RTSSG F++
Sbjct: 15  EIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSR-VRTSSGMFLN 73

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE KIA+ T +P+ HGE   VL YE GQKYD+H+D F          QR
Sbjct: 74  RGQDR--VISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQR 131

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YL+DVEEGGET+FP
Sbjct: 132 IATLLMYLTDVEEGGETVFP 151


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/189 (35%), Positives = 102/189 (53%), Gaps = 8/189 (4%)

Query: 32  STFLSRDVPSIRPKLRTLEVVEKENESGLPHG----ETGDASIQSIPF-QVLSWRPRALY 86
           S   +R  PS     ++L V    + S + H     E+GD   ++  + + +SW PRA  
Sbjct: 7   SRISTRKSPSSTLIRKSLNVHFPNDLSSIAHNSKIHESGDDEGKAEQWVEAISWEPRAFI 66

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + NF +  +C  +I  AK  ++ S +        + ++  RTSSGTF+    DK  I+  
Sbjct: 67  YHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSR-VRTSSGTFLPRGRDK--IIRD 123

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDV 206
           IE +IA  + +P  HGE   +L YE+GQKY+ H+D F          QR+A+ L+YLSDV
Sbjct: 124 IEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFDYFMDDYNTENGGQRIATVLMYLSDV 183

Query: 207 EEGGETMFP 215
           EEGGET+FP
Sbjct: 184 EEGGETVFP 192


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 90/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PRA  +  F S E+C  +I  A+ +L+ S +A  + G+++ES    RTSSG FI+ 
Sbjct: 52  LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESE--VRTSSGMFIAK 109

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  I+  IE +IA  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 110 AQDE--IVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRV 167

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 168 ATVLMYLSNVEKGGETVFP 186


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 61/138 (44%), Positives = 87/138 (63%), Gaps = 3/138 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  +  F + E+C  +I+ AK  LK S +A  +    ++++  RTSSG FI  +
Sbjct: 36  ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSE-VRTSSGMFIPKA 94

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRYE GQKY+ HYD F       +   RLA
Sbjct: 95  KDP--IVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYDYFVDKVNIARGGHRLA 152

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YL++VE+GGET+FP
Sbjct: 153 TVLMYLTNVEKGGETVFP 170


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score =  114 bits (284), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 63/151 (41%), Positives = 91/151 (60%), Gaps = 3/151 (1%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTK 124
           T  + I     + +SW+PRA  +  F +  +C  +I+ AK  LK S +A  +    + ++
Sbjct: 29  TSSSIINPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE 88

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTSSG FI+ ++D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD HYD F+
Sbjct: 89  -VRTSSGMFITKAKDP--IVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFS 145

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                 +   R+A+ L+YL+DVE+GGET+FP
Sbjct: 146 DKVNIARGGHRVATVLMYLTDVEKGGETVFP 176


>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 319

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 97/154 (62%), Gaps = 6/154 (3%)

Query: 64  ETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           +TG +++   P +V  LS +PRA  +  F SAE+CQ +I +AK +L  S +A   G++V 
Sbjct: 47  KTGGSAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVT 106

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S +  RTS+G F+  ++D+  I+  IE +IA  T LP  +GE   +LRYE GQKY+ H+D
Sbjct: 107 SKE--RTSTGMFLHKAQDE--IVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFD 162

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            F           R+A+ L+YLS+VE+GGET+FP
Sbjct: 163 FFQDPGNIAIGGHRIATILMYLSNVEKGGETVFP 196


>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
          Length = 303

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 89/150 (59%), Gaps = 12/150 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVES---------TKG 125
           +VLSW PRA+ + NF + E+C+ +I  AK  + K + +    G++ +S            
Sbjct: 82  EVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSR 141

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F++  +DKT  +  IE +IA  T +P  HGE   VL YE+GQKY+ H+D F  
Sbjct: 142 VRTSSGMFLNRGQDKT--IRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLD 199

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                   QR+A+ L+YLSDVE+GGET+FP
Sbjct: 200 EFNTKNGGQRIATVLMYLSDVEKGGETVFP 229


>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 300

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 65/161 (40%), Positives = 95/161 (59%), Gaps = 5/161 (3%)

Query: 57  ESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ 116
           ES   +  +  A++     + +SW+PRA  +  F +  +C  +++ A+  LK S++A   
Sbjct: 21  ESTCSYAGSASATVDPSKVKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADND 80

Query: 117 -GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
            G++  ST   RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQK
Sbjct: 81  SGKSKLST--VRTSSGMFISKNKDP--IVSGIEDKISAWTFLPKENGEDIQVLRYEHGQK 136

Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           Y+SHYD F           RLA+ L+YLS+V +GGET+FP 
Sbjct: 137 YESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPL 177


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 61/155 (39%), Positives = 91/155 (58%), Gaps = 5/155 (3%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETV 120
           H E  +    ++  +V+SW+PRA  + NF + E+C+ +I  A   ++ S +A  Q G++V
Sbjct: 61  HIEAEEDDQVALRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSV 120

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
                 R S+G F+   +D+  I+  IE +IA  T +P  +GE   V+ YE+GQ YD HY
Sbjct: 121 --VHDVRKSTGAFLDRGQDE--IVRNIEKRIADVTFIPIENGEPIYVIHYEVGQYYDPHY 176

Query: 181 DAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           D F          QR+A+ L+YLS+VEEGGETMFP
Sbjct: 177 DYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFP 211


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 88/138 (63%), Gaps = 3/138 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  +  F + E+C  +I+ AK  LK S +A  +    + ++  RTSSG FIS +
Sbjct: 40  ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSE-VRTSSGAFISKA 98

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I++ IE K+A  T LP  +GE   VLRYE GQKY++H+D F+      +   R A
Sbjct: 99  KD--AIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYA 156

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLS+VE+GG+T+FP
Sbjct: 157 TVLMYLSNVEKGGDTVFP 174


>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 330

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 91/149 (61%), Gaps = 10/149 (6%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           ++ P  + + +P+A    NF SAE+C  ++  AK+ L PS +    G++V S    RTS+
Sbjct: 38  ETPPLVLSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVVGEAGDSVPSD--IRTSA 95

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF----NPA 186
           G F+   +DK  I++ IE +IAR +  P  +GE   +LRY++GQKYD H+D F    NPA
Sbjct: 96  GMFLRKGQDK--IVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPA 153

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
               +  QRLA+ L+YL D ++GGET FP
Sbjct: 154 PK--RGGQRLATMLIYLVDTDKGGETTFP 180


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 89/139 (64%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  I++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+         QR+
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 154 ATVLMYLTDVKKGGETVFP 172


>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 296

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 87/141 (61%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           +++SW PR   + NF + E+C+ +I  AK  ++ S +   + G ++ES    RTSSGTF+
Sbjct: 86  EIISWEPRIFLYHNFLTKEECEHLINIAKPNMRKSTVIESETGMSIESR--VRTSSGTFL 143

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE++IA  T +P  +GE   VL Y++G+KY  H+D F           
Sbjct: 144 ARGRDK--IVRNIENRIADFTFIPVDNGEELQVLHYQVGEKYVPHHDYFMDDINTANGGD 201

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 202 RIATMLMYLSDVEEGGETVFP 222


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PRA  +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T +P+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 175 ATVLMYLSNVEKGGETVFP 193


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score =  112 bits (281), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 62/148 (41%), Positives = 92/148 (62%), Gaps = 6/148 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F + E+C  +I  AK +L+ S +A  + G+++ S    RTSSG F+  
Sbjct: 58  LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSE--VRTSSGMFLQK 115

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   ++  IE +IA  T LP  +GEA  +L YE GQKY+ H+D F+          R+
Sbjct: 116 AQDD--VVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRI 173

Query: 197 ASFLLYLSDVEEGGETMFP-FEVDLQIS 223
           A+ L+YLS+VEEGGET+FP  E  LQ++
Sbjct: 174 ATVLMYLSNVEEGGETVFPNAEAKLQLA 201


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/152 (41%), Positives = 95/152 (62%), Gaps = 7/152 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
           G +S++  P +V  LSW PRA  +  F S E+C  +I  AK +L+ S +A  + G+++ S
Sbjct: 45  GGSSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMS 104

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F++ ++D+  I+  IE +IA  T LP  +GE+  +L YE GQKY+ H+D 
Sbjct: 105 E--VRTSSGMFLNKAQDE--IVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDY 160

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           F+          R+A+ L+YLSDVE+GGET+F
Sbjct: 161 FHDKANQVMGGHRIATVLMYLSDVEKGGETIF 192


>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
          Length = 308

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 88/139 (63%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PRA  +  F S E+C  ++  A+ +L+ S +A  + G+++ES    RTSSG FI  
Sbjct: 49  LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESE--VRTSSGMFIGK 106

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           S+D+  I++ IE +IA  T LPQ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 107 SQDE--IVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRV 164

Query: 197 ASFLLYLSDVEEGGETMFP 215
            + L+YLS+V +GGET+FP
Sbjct: 165 VTVLMYLSNVGKGGETVFP 183


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 85/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 175 ATVLMYLSNVEKGGETVFP 193


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 88/139 (63%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  +++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+         QR+
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV +GGET+FP
Sbjct: 154 ATVLMYLTDVNKGGETVFP 172


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 85/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 175 ATVLMYLSNVEKGGETVFP 193


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/139 (45%), Positives = 85/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG FI  
Sbjct: 44  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE--VRTSSGAFIHK 101

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+H+D F       +   R+
Sbjct: 102 AKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRM 159

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLSDVE+GGET+FP
Sbjct: 160 ATVLMYLSDVEKGGETVFP 178


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/140 (43%), Positives = 85/140 (60%), Gaps = 5/140 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 75  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 132

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 133 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 190

Query: 197 ASFLLYLSDVEEGGETMFPF 216
           A+ L+YLS+VE+GGET+FP 
Sbjct: 191 ATVLMYLSNVEKGGETVFPM 210


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/149 (42%), Positives = 88/149 (59%), Gaps = 5/149 (3%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
           A I     + +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S    
Sbjct: 33  AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F   
Sbjct: 91  RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
               +   R+A+ L+YL++V +GGET+FP
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFP 177


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/149 (42%), Positives = 88/149 (59%), Gaps = 5/149 (3%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGT 126
           A I     + +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S    
Sbjct: 33  AIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSE--V 90

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F   
Sbjct: 91  RTSSGMFISKNKD--AIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADK 148

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
               +   R+A+ L+YL++V +GGET+FP
Sbjct: 149 VNIARGGHRVATVLMYLTNVTKGGETVFP 177


>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
 gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
          Length = 309

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/149 (42%), Positives = 91/149 (61%), Gaps = 8/149 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA     F S E+C+ IIA AK R+ K S +    G++V+S    RTS+G +++ 
Sbjct: 57  LSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSE--IRTSTGAWLAK 114

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
            ED+  I+  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P    P+   Q
Sbjct: 115 GEDE--IISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQ 172

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+ + L+YL+ VEEGGET+ P   D ++S
Sbjct: 173 RVVTVLMYLTTVEEGGETVLP-HADQKVS 200


>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 289

 Score =  111 bits (277), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/141 (41%), Positives = 86/141 (60%), Gaps = 6/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFIS 135
           LSW PRA  +  F S E+C  +I  AK +L+ S +      GE+++S +  RTSSG F++
Sbjct: 35  LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEE--RTSSGVFLT 92

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D   I+  +E K+A  T LP+ +GEA  +L YE GQKYD H+D +   E       R
Sbjct: 93  KRQDD--IVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHR 150

Query: 196 LASFLLYLSDVEEGGETMFPF 216
           +A+ L+YLS+V +GGET+FP 
Sbjct: 151 IATVLMYLSNVTKGGETVFPM 171


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 91/153 (59%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 26  GSASSIVNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 85

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 86  D--VRTSSGMFISKNKDP--IISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 141

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F       +   R+A+ L+YL++V +GGET+FP
Sbjct: 142 FTDKVNIARGGHRIATVLMYLTNVTKGGETVFP 174


>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
          Length = 233

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 25  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 83  GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L+YL+ VEEGGET+ P
Sbjct: 141 RVVTMLMYLTTVEEGGETVLP 161


>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
 gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
          Length = 208

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 88/139 (63%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  +++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQ--ARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED+  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+         QR+
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRV 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGE +FP
Sbjct: 154 ATVLMYLTDVKKGGEAVFP 172


>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
          Length = 225

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 17  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 74

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 75  GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 132

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L+YL+ VEEGGET+ P
Sbjct: 133 RVVTMLMYLTTVEEGGETVLP 153


>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
 gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
          Length = 310

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 58/140 (41%), Positives = 85/140 (60%), Gaps = 5/140 (3%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
           ++SW+PR  ++  F S ++C  ++   K++LK S +A  + G++V S    RTSSG F+ 
Sbjct: 50  IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSE--VRTSSGMFLD 107

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D   ++  IE +IA  T+LPQ + E   +LRYE GQKYD H+D F       Q   R
Sbjct: 108 KQQDP--VVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGGHR 165

Query: 196 LASFLLYLSDVEEGGETMFP 215
            A+ L YLS VE+GGET+FP
Sbjct: 166 YATVLTYLSTVEKGGETVFP 185


>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
 gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
          Length = 224

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 16  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 73

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 74  GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 131

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L+YL+ VEEGGET+ P
Sbjct: 132 RVVTMLMYLTTVEEGGETVLP 152


>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 297

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 86/141 (60%), Gaps = 7/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 45  LSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSE--IRTSTGTWFAK 102

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
            ED   ++  IE ++A+ TM+P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 103 GEDS--VISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 160

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L+YL+ VEEGGET+ P
Sbjct: 161 RVVTMLMYLTTVEEGGETVLP 181


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/153 (43%), Positives = 91/153 (59%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 29  GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 88

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FI  ++D   I+  +E KI+  T+LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 89  E--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDY 144

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F       +   R+A+ L+YL+DV +GGET+FP
Sbjct: 145 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 177


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score =  110 bits (274), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 66/153 (43%), Positives = 91/153 (59%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 29  GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 88

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FI  ++D   I+  +E KI+  T+LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 89  E--VRTSSGMFIPKNKDP--IVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDY 144

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F       +   R+A+ L+YL+DV +GGET+FP
Sbjct: 145 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 177


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 88/139 (63%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R+
Sbjct: 97  KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 155 ATVLMYLSNVEKGGETIFP 173


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 25  GSASSVINPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLS 84

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   V RYE GQKYD HYD 
Sbjct: 85  D--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDY 140

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F       +   R+A+ L+YL+DV +GGET+FP
Sbjct: 141 FTDKVNIARGGHRIATVLMYLTDVAKGGETVFP 173


>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
 gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
          Length = 204

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 88/139 (63%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLER 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R+
Sbjct: 97  KQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRI 154

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 155 ATVLMYLSNVEKGGETIFP 173


>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +I  AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 43  LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 100

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 101 KQDE--VVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 158

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 159 ATVLMYLSNVEKGGETIFP 177


>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 308

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 85/139 (61%), Gaps = 3/139 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW PRA  +P+F S ++   +++ A+  LK S +A       + ++  RTSSGTFIS  
Sbjct: 54  ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSE-VRTSSGTFISKG 112

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F  +        R+A
Sbjct: 113 KDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVA 170

Query: 198 SFLLYLSDVEEGGETMFPF 216
           + LLYL+DV EGGET+FP 
Sbjct: 171 TVLLYLTDVAEGGETVFPL 189


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score =  109 bits (272), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F S  +C  +I  AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 38  LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSE--VRTSSGMFLEK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 96  RQDE--VVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRI 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 154 ATVLMYLSNVEKGGETIFP 172


>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
 gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
          Length = 263

 Score =  109 bits (272), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 94/152 (61%), Gaps = 3/152 (1%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVES 122
           E G   ++ +  +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S
Sbjct: 46  EAGLLRLRYVKPEVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKS 105

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D 
Sbjct: 106 D--VRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDY 163

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           F+      +  QR+A+ L+YL+D  EGGET F
Sbjct: 164 FSDTFNLKRGGQRVATMLMYLTDGVEGGETHF 195


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/153 (40%), Positives = 89/153 (58%), Gaps = 11/153 (7%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
            +I +   + LSW P A  +  F +  +C+ I   A   LKPS + +      +++   R
Sbjct: 19  GTIDAGAIERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTV-VDASTGGDASSEIR 77

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
           TSSG F+  +ED   ++E IE +IA  T +P++HGE F VLRYE  Q+Y +HYD F+   
Sbjct: 78  TSSGMFLGRAEDD--VIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKF 135

Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
               E G    QR+ + L+YLSDVEEGGET+FP
Sbjct: 136 NVKREKG---GQRMGTVLMYLSDVEEGGETVFP 165


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 84/138 (60%), Gaps = 3/138 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  +  F +  +C  +I+ AK  LK S +A  +    + ++  RTSSG FI+  
Sbjct: 39  VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSE-VRTSSGMFIAKG 97

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F       +   R+A
Sbjct: 98  KDP--IIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGHRMA 155

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLSDV +GGET+FP
Sbjct: 156 TVLMYLSDVVKGGETVFP 173


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/138 (44%), Positives = 84/138 (60%), Gaps = 5/138 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG FI  
Sbjct: 44  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSE--VRTSSGAFIHK 101

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE KIA  T LP+ +GE   VLRYE GQKYD+H+D F       +   R+
Sbjct: 102 AKDP--IVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRM 159

Query: 197 ASFLLYLSDVEEGGETMF 214
           A+ L+YLSDVE+GGET+F
Sbjct: 160 ATVLMYLSDVEKGGETVF 177


>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 29  GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 88

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FI  ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 89  E--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 144

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F       +   R+A+ L+YL+DV +GGET+FP
Sbjct: 145 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 177


>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
 gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
          Length = 307

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/171 (35%), Positives = 96/171 (56%), Gaps = 32/171 (18%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGT------- 126
            +V+SW PRA  + NF + E+C+ +I+ AK  +  S++  ++ G++++S   T       
Sbjct: 80  LEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVF 139

Query: 127 ----------------------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
                                 RTSSGTF++   D+  I+E IE++I+  T +P  +GE 
Sbjct: 140 TFQLNLERFENSKFANPSLCRVRTSSGTFLNRGHDE--IVEEIENRISDFTFIPPENGEG 197

Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
             VL YE+GQ+Y+ H+D F       +  QR+A+ L+YLSDV+EGGET+FP
Sbjct: 198 LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFP 248


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
           LSWRPRA     F    +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+
Sbjct: 96  KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 154 ATVLMYLSNVEKGGETIFP 172


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISA 136
           LSWRPRA     F    +C  +IA AK +L+ S +A  + G++V+S    RTSSG F+  
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSE--VRTSSGMFLEK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+
Sbjct: 96  KQDE--VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 153

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 154 ATVLMYLSNVEKGGETIFP 172


>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 244

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 57/145 (39%), Positives = 92/145 (63%), Gaps = 3/145 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175

Query: 195 RLASFLLYLSDVEEGGETMFPFEVD 219
           R+A+ L+YL+D   GGET FP E++
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQEME 200


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/168 (38%), Positives = 96/168 (57%), Gaps = 5/168 (2%)

Query: 49  LEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLK 108
           + ++ + + +G  +  +  A I     + +SW+PRA  +  F +  +C  +I+ AK  LK
Sbjct: 15  VSLICESHGAGSSYAGSASAIIDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELK 74

Query: 109 PSQLALR-QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
            S +A    G++  S    RTSSG FIS ++D   I+  IE KI+  T LP+ +GE   V
Sbjct: 75  RSAVADNLSGDSKLSD--VRTSSGMFISKNKDP--IVAGIEDKISSWTFLPKENGEDIQV 130

Query: 168 LRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           LRYE GQKYD HYD F       +   R+A+ L+YL++V  GGET+FP
Sbjct: 131 LRYEHGQKYDPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFP 178


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/140 (42%), Positives = 85/140 (60%), Gaps = 6/140 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS 135
           LSW PRA  +  F S E+C  +I  AK +L+ S +   +  GE+ +S    RTSSG F++
Sbjct: 35  LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLT 92

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D   I+  +E K+A  T LP+ +GEA  +L YE GQKYD H+D F   +       R
Sbjct: 93  KRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHR 150

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLS+V +GGET+FP
Sbjct: 151 IATVLMYLSNVTKGGETVFP 170


>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
          Length = 839

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 90/153 (58%), Gaps = 7/153 (4%)

Query: 66  GDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVES 122
           G AS    P +V  +SW+PRA  +  F +  +C  +I+ AK  LK S +A    GE+  S
Sbjct: 567 GSASAIIDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLS 626

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG FI  ++D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD 
Sbjct: 627 E--VRTSSGMFIPKNKDL--IVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDY 682

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           F       +   R+A+ L+YL+DV +GGET+FP
Sbjct: 683 FADKVNIARGGHRVATVLMYLTDVTKGGETVFP 715


>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 120

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/56 (85%), Positives = 54/56 (96%)

Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
           A+NVLRYE+GQKY+SHYDAF+PAEYGPQ SQR+ASFLLYLSDVEEGGETMFP+E D
Sbjct: 1   AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYEND 56


>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
          Length = 487

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 36  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 96  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 194


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 36  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 96  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 194


>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 36  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 95

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 96  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 151

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP
Sbjct: 152 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 194


>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
 gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
          Length = 369

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/154 (42%), Positives = 90/154 (58%), Gaps = 11/154 (7%)

Query: 67  DASIQSIPFQVLS-WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
           +A   S P  VLS  +P+A    NF S ++C  ++  AK+ L PS +    G +V S   
Sbjct: 74  EAGFPSTPPLVLSSKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVVGDGGSSVASE-- 131

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-- 183
            RTS+G F+  S+D T  +  IE +IAR + +P  +GE   +LRY+ GQKYD H+D F  
Sbjct: 132 IRTSAGMFLRKSQDDT--VREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHD 189

Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
             NPA    +  QR+A+ L+YL D EEGGET FP
Sbjct: 190 KVNPAPK--RGGQRVATVLIYLVDTEEGGETTFP 221


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/142 (43%), Positives = 91/142 (64%), Gaps = 8/142 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-GETVESTKGTRTSSGTFIS 135
           +SWRPRA  +  F +  +C  ++A A++  L+ S +  RQ G++V S    RTSSGTF++
Sbjct: 41  VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSE--VRTSSGTFLA 98

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG--PQMS 193
             +D+  ++  IE +IA  T+LPQ +GE+  VLRYE GQKY+ H D    A  G   +  
Sbjct: 99  KKQDQ--VVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGG 156

Query: 194 QRLASFLLYLSDVEEGGETMFP 215
            R+A+ L+YLSDV+ GGET+FP
Sbjct: 157 HRVATVLMYLSDVKMGGETVFP 178


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F S  +C  +I  AK +L+ S +A    G++V S    RTSSG F+  
Sbjct: 35  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 92

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  ++  +E +IA  T+LP  +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 93  AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 150

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 151 ATVLMYLSNVEKGGETIFP 169


>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
 gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
          Length = 313

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 60/163 (36%), Positives = 93/163 (57%), Gaps = 14/163 (8%)

Query: 63  GETGDASIQSI----PF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           GE GD  + ++    PF     + +SWRPR   +  F S ++C  ++   K++++ S +A
Sbjct: 30  GEAGDDGVGAVAAAPPFNASRVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVA 89

Query: 114 -LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
             + G++V S    RTSSG F+   +D   ++  IE +IA  T LP+ + E   +LRYE 
Sbjct: 90  DNKSGKSVMSE--VRTSSGMFLDKRQDP--VVSRIEKRIAAWTFLPEENAENIQILRYEH 145

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           GQKY+ H+D F+          R A+ L+YLS VE+GGET+FP
Sbjct: 146 GQKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFP 188


>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/221 (32%), Positives = 113/221 (51%), Gaps = 27/221 (12%)

Query: 12  LGLPTALLLCSFFFLAGF-------YGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHG- 63
           L   TA ++   FF   F       YG+ F         P  + +  ++ +    LP G 
Sbjct: 12  LAFVTAGMIVGAFFQLAFILKLEDSYGTKF---------PSFKRVRKLQSDAYLQLPRGI 62

Query: 64  ----ETGDASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LR 115
                  +A++  I +   +++SW PR +   +F S+E+C  + A AK RL+ S +  ++
Sbjct: 63  SHWDNDTEAAVLRIGYVKPEIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVK 122

Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
            G+ +ES    RTSSG F+S+ E    +++ IE +I+  + +P  +GE   VLRYE  Q 
Sbjct: 123 TGKGIESK--VRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQY 180

Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           Y  H+D F+      +  QR+A+ L+YLSD  EGGET FP 
Sbjct: 181 YKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPM 221


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/146 (41%), Positives = 86/146 (58%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
           +VL+W PR +    F SAE+C  +IA A  RL  S +        R G  +ES    RTS
Sbjct: 61  EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 116

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           +G F+S  + +  +++ IE +IA  +M+P  +GE   VLRYE  Q Y  H+D F+     
Sbjct: 117 TGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 176

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
            +  QR+A+ L+YLSDVEEGGET+FP
Sbjct: 177 KRGGQRVATVLMYLSDVEEGGETIFP 202


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F +  +C+ +I+ AK +L+ S +A  + G++V S    RTSSG F+  
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLSDV +GGET+FP
Sbjct: 164 ATVLMYLSDVGKGGETIFP 182


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F +  +C+ +I+ AK +L+ S +A  + G++V S    RTSSG F+  
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLSDV +GGET+FP
Sbjct: 164 ATVLMYLSDVGKGGETIFP 182


>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 263

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 92/149 (61%), Gaps = 3/149 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQ 175

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+A+ L+YL+D   GGET FP   D + S
Sbjct: 176 RVATMLMYLTDGVVGGETHFPQAGDGECS 204


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/139 (44%), Positives = 83/139 (59%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG FIS 
Sbjct: 39  VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSE--VRTSSGMFISK 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD HYD F       +   R+
Sbjct: 97  KKDP--IVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGGHRM 154

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYL++V  GGET+FP
Sbjct: 155 ATVLLYLTNVTRGGETVFP 173


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA     F +  +C+ +I+ AK +L+ S +A  + G++V S    RTSSG F+  
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSE--VRTSSGMFLEK 105

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+          R+
Sbjct: 106 KQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRI 163

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLSDV +GGET+FP
Sbjct: 164 ATVLMYLSDVGKGGETIFP 182


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/146 (42%), Positives = 85/146 (58%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA------LRQGETVESTKGTRTS 129
           +VL+W PR      F SAE+C  +IA A  RL  S +        R G  +ES    RTS
Sbjct: 60  EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHG--IESK--VRTS 115

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           +G F+S  + +  ++E IE +IA  +M+P  +GE   VLRYE  Q Y  H+D F+     
Sbjct: 116 TGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNL 175

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
            +  QR+A+ L+YLSDVEEGGET+FP
Sbjct: 176 KRGGQRVATVLMYLSDVEEGGETIFP 201


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/139 (41%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW+PRA  +  F S  +C  +I  AK +L+ S +A    G++V S    RTSSG F+  
Sbjct: 56  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSE--VRTSSGMFLRK 113

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  ++  +E +IA  T+LP  +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 114 AQDE--VVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRI 171

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+VE+GGET+FP
Sbjct: 172 ATVLMYLSNVEKGGETIFP 190


>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
 gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
 gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
 gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 59/149 (39%), Positives = 91/149 (61%), Gaps = 3/149 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  + + A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S+ E K  +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+A+ L+YL+D  EGGET FP   D + S
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECS 208


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 98/164 (59%), Gaps = 7/164 (4%)

Query: 55  ENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
           E  + +  G    A ++  P +V  LSW PRA  + NF + E+C  +I  +K +L+ S +
Sbjct: 29  EEGNKITRGSVFGAKVKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMV 88

Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
           A  + G++++S    RTSSG F++  +D+  I+  IE +IA  T LP  +GE+  VL Y 
Sbjct: 89  ADNESGKSIQSE--VRTSSGMFLNKQQDE--IVSGIEARIAAWTFLPVENGESMQVLHYM 144

Query: 172 IGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            G+KY+ H+D F+          R+A+ L+YLS+VE+GGET+FP
Sbjct: 145 NGEKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFP 188


>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
          Length = 267

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 59/149 (39%), Positives = 91/149 (61%), Gaps = 3/149 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  + + A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSN--VRTSSGMFV 119

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S+ E K  +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 120 SSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQ 179

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+A+ L+YL+D  EGGET FP   D + S
Sbjct: 180 RVATMLMYLTDGVEGGETHFPQAGDGECS 208


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 59/139 (42%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSWRPRA  +  F S E+C  +I  AK +L+ S +A  + G+++ S    RTSSG F+  
Sbjct: 57  LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSE--VRTSSGMFLLK 114

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D+  I+  IE +IA  T LP  +GE+  +L YE G+KY+ H+D F+          R+
Sbjct: 115 AQDE--IVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRI 172

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+ VEEGGET+FP
Sbjct: 173 ATVLMYLATVEEGGETVFP 191


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/139 (44%), Positives = 84/139 (60%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG FIS 
Sbjct: 40  ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMFISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE +I+  T LP+ +GE   VLRYE GQKYD HYD F       Q   RL
Sbjct: 98  NKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRL 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL++V +GGET+FP
Sbjct: 156 ATVLMYLTNVTKGGETVFP 174


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score =  107 bits (267), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/139 (44%), Positives = 84/139 (60%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG FIS 
Sbjct: 40  ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMFISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE +I+  T LP+ +GE   VLRYE GQKYD HYD F       Q   RL
Sbjct: 98  NKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRL 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL++V +GGET+FP
Sbjct: 156 ATVLMYLTNVTKGGETVFP 174


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/141 (41%), Positives = 87/141 (61%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +VLSW PRA  + +F + E+C  +I  A+  L K + +    G++ +S    RTSSGTF+
Sbjct: 4   EVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSR--LRTSSGTFL 61

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   +++ IE +IA  T +P   GE   VL+Y+  +KY+ HYD F+ A       Q
Sbjct: 62  MRGQDP--VIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQ 119

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLS+VEEGGET+FP
Sbjct: 120 RIATVLMYLSNVEEGGETVFP 140


>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 253

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 84/139 (60%), Gaps = 6/139 (4%)

Query: 79  SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFISA 136
           SW PRA  +  F S E+C  +I  AK +L+ S +   +  GE+ +S    RTSSG F++ 
Sbjct: 1   SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSE--VRTSSGMFLTK 58

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GEA  +L YE GQKYD H+D F   +       R+
Sbjct: 59  RQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 116

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+V +GGET+FP
Sbjct: 117 ATVLMYLSNVTKGGETVFP 135


>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
 gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
          Length = 454

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 60/149 (40%), Positives = 88/149 (59%), Gaps = 5/149 (3%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           S P  + +  P+A  F NF +  +C+ ++  AKK+L PS +   +G     +K  RTS+G
Sbjct: 167 SPPLVLSNHEPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSK-IRTSAG 225

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
            F+   +D T  +  IE +IA A+ LP+ +GE   +LRYE GQKYD H+D F +     P
Sbjct: 226 MFLGRGQDPT--VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSP 283

Query: 191 QM-SQRLASFLLYLSDVEEGGETMFPFEV 218
           +   QR+A+ L+YL D  EGGET+FP  V
Sbjct: 284 RRGGQRMATMLIYLEDTTEGGETIFPNGV 312


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score =  107 bits (266), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 92/158 (58%), Gaps = 10/158 (6%)

Query: 64  ETGDASIQSIPF-----QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-G 117
           + GD    + PF     + +SW+PR   +  F S  +C  ++  AKK+++ S +A  Q G
Sbjct: 28  DGGDVVAPAPPFNSSRVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSG 87

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           ++V S    RTSSG F++  +D   ++  IE +IA  T LPQ + E   +LRYE GQKY+
Sbjct: 88  KSVMSE--VRTSSGMFLNKRQDP--VVSRIEERIAAWTFLPQENAENMQILRYEHGQKYE 143

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            H+D F+      +   R A+ L+YLS V++GGET+FP
Sbjct: 144 PHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFP 181


>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
 gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
          Length = 308

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 63/145 (43%), Positives = 87/145 (60%), Gaps = 7/145 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PR   + +F S ++   +I+ A+  LK S +A    G++  S    RTSSGTF+  
Sbjct: 54  ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSD--VRTSSGTFLRK 111

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R 
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRY 169

Query: 197 ASFLLYLSDVEEGGETMFPF--EVD 219
           A+ LLYL+DV EGGET+FP   EVD
Sbjct: 170 ATVLLYLTDVAEGGETVFPLAEEVD 194


>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
 gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 65/147 (44%), Positives = 86/147 (58%), Gaps = 11/147 (7%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS +P+A  +  F    +C  I   AK +L+ S +   + G++V S    RTS G F   
Sbjct: 8   LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSN--IRTSDGMFFDR 65

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPAEYGPQM 192
            ED   I+E IE +IA  T +P  +GE   VLRYE+GQKY+ H DA    FN  E   + 
Sbjct: 66  HEDD--IIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSDKFNTEE--SKG 121

Query: 193 SQRLASFLLYLSDVEEGGETMFPFEVD 219
            QR+A+ L+YLSDVEEGGET+FP  VD
Sbjct: 122 GQRMATVLMYLSDVEEGGETVFPRSVD 148


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+  S    RTSSGTFIS 
Sbjct: 41  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175


>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 297

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+  S    RTSSGTFIS 
Sbjct: 39  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 97  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 154

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYLS+V +GGET+FP
Sbjct: 155 ATVLLYLSNVTKGGETVFP 173


>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
 gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
          Length = 299

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+  S    RTSSGTFIS 
Sbjct: 41  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175


>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
          Length = 319

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 84/139 (60%), Gaps = 3/139 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PR   + +F S ++   +++ A+  LK S +A       E +   RTSSGTFI  S
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F+      +   R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177

Query: 198 SFLLYLSDVEEGGETMFPF 216
           + L+YL+DV EGGET+FP 
Sbjct: 178 TVLMYLTDVAEGGETVFPL 196


>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
           Group]
 gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
 gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 58/139 (41%), Positives = 84/139 (60%), Gaps = 3/139 (2%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PR   + +F S ++   +++ A+  LK S +A       E +   RTSSGTFI  S
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDA-RTSSGTFIRKS 119

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F+      +   R+A
Sbjct: 120 QDP--IVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIA 177

Query: 198 SFLLYLSDVEEGGETMFPF 216
           + L+YL+DV EGGET+FP 
Sbjct: 178 TVLMYLTDVAEGGETVFPL 196


>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
          Length = 299

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+  S    RTSSGTFIS 
Sbjct: 41  VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175


>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
          Length = 190

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 82/136 (60%), Gaps = 5/136 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES    RTSSG F+S 
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESE--VRTSSGMFLSK 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+          R+
Sbjct: 117 RQDD--IVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRI 174

Query: 197 ASFLLYLSDVEEGGET 212
           A+ L+YLS+VE+GGET
Sbjct: 175 ATVLMYLSNVEKGGET 190


>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 87/139 (62%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +I+ AK+ L+ S +A    GE+  S    RTSSGTFIS 
Sbjct: 41  VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSD--VRTSSGTFISK 98

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE K++  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 99  GKDP--IVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRI 156

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYLS+V +GGET+FP
Sbjct: 157 ATVLLYLSNVTKGGETVFP 175


>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 207

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 88/159 (55%), Gaps = 11/159 (6%)

Query: 64  ETGDASIQSIP------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           E G   +   P       + +SW PR   +  F S  +C  ++  AKK+++ S +A  + 
Sbjct: 28  ENGGGGVAPAPPFNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNES 87

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G++V+S    RTSSG F+   +D   ++  IE +IA  T LPQ + E   VLRYE GQKY
Sbjct: 88  GKSVKSE--VRTSSGMFLDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKY 143

Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           + H+D F+      +   R A+ L+YLS V EGGET+FP
Sbjct: 144 EPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFP 182


>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 62/136 (45%), Positives = 82/136 (60%), Gaps = 9/136 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL--ALRQGETVESTKGTRTSSGTFISASED 139
           PRA  +  F + E+C  I+A +K  L  S +  A   G T   T   RTS+GTFIS + D
Sbjct: 1   PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGST---TSDIRTSTGTFISRAHD 57

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
            T  +  IE +I   + +P  HGEA  VLRYE GQ+Y +H+D F     G + + R+A+ 
Sbjct: 58  PT--ITAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYF--FHKGGKRNNRIATV 113

Query: 200 LLYLSDVEEGGETMFP 215
           LLYLSDVEEGGET+FP
Sbjct: 114 LLYLSDVEEGGETVFP 129


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score =  106 bits (264), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 88/159 (55%), Gaps = 11/159 (6%)

Query: 64  ETGDASIQSIP------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           E G   +   P       + +SW PR   +  F S  +C  ++  AKK+++ S +A  + 
Sbjct: 28  ENGGGGVAPAPPFNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNES 87

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G++V+S    RTSSG F+   +D   ++  IE +IA  T LPQ + E   VLRYE GQKY
Sbjct: 88  GKSVKSE--VRTSSGMFLDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKY 143

Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           + H+D F+      +   R A+ L+YLS V EGGET+FP
Sbjct: 144 EPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFP 182


>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
          Length = 327

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 60/149 (40%), Positives = 82/149 (55%), Gaps = 11/149 (7%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            +V++W+PRAL    F S  +C  II  A   L+ S +   +G ++      RTSSG FI
Sbjct: 42  VEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLDE--IRTSSGMFI 99

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D   ++  +E ++A  T LP +H E   VLRYE+GQKY +H+D  +  E   QM  
Sbjct: 100 LKGHD--AVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRA 157

Query: 195 -------RLASFLLYLSDVEEGGETMFPF 216
                  R A+ L+YLSDVEEGGET FP 
Sbjct: 158 KGVLGGLRTATLLMYLSDVEEGGETAFPH 186


>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 56/149 (37%), Positives = 91/149 (61%), Gaps = 3/149 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW+PR +   NF SA++C  +I  A+ RL K + +    G+ +ES    RTS+G F+
Sbjct: 79  EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESK--VRTSTGMFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           + ++ +   ++ IE +IA  +M+P  +GE   VLRYE  Q Y +H+D F+      +  Q
Sbjct: 137 NGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQ 196

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+A+ L+YL++  EGGET+FP   D + S
Sbjct: 197 RVATMLMYLTEGVEGGETIFPQAGDKECS 225


>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii
          Length = 233

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/141 (41%), Positives = 84/141 (59%), Gaps = 7/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           LSW PRA    NF S E+C  I+  A+ K +K S +    G++V+S    RTS+GT+ + 
Sbjct: 25  LSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSE--IRTSTGTWFAK 82

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM-SQ 194
            ED   ++  IE ++A+ T +P  + E   VL Y  GQKY+ HYD F +P   GP+   Q
Sbjct: 83  GEDS--VISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGPEHGGQ 140

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L YL+ VEEGGET+ P
Sbjct: 141 RVVTXLXYLTTVEEGGETVLP 161


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 59/141 (41%), Positives = 85/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSSGTFI 134
           +VLSW PRA  + +F +  +C  +I  AK  L K + +    G++ +S    RTSSGTF+
Sbjct: 3   EVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSR--VRTSSGTFL 60

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   I++ IE +IA  T +P   GE   VL+Y   +KY+ HYD F+ A       Q
Sbjct: 61  VRGQDH--IIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQ 118

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVE+GGET+FP
Sbjct: 119 RIATVLMYLSDVEKGGETVFP 139


>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 186

 Score =  106 bits (264), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 56/112 (50%), Positives = 74/112 (66%), Gaps = 3/112 (2%)

Query: 107 LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFN 166
           + PS LA R GE  E+ +  RTS GTF+    D +  L  +E KIA  T+LP+T+GE +N
Sbjct: 1   MYPSGLAYRPGEKAEAEQQVRTSKGTFLGG--DSSPALRWLEDKIAAVTLLPRTNGEFWN 58

Query: 167 VLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE-EGGETMFPFE 217
           VL Y+  Q YDSH D+F+P EYGPQ SQR+A+ ++ LSD    GGET+F  E
Sbjct: 59  VLNYKHSQHYDSHMDSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVFKRE 110


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/152 (40%), Positives = 88/152 (57%), Gaps = 7/152 (4%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
           D +I+    + +SWRP A  +  F + E+C  + A A   L  S +     G +V S   
Sbjct: 48  DRAIEFDRIERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSD-- 105

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+   ED   ++  IE +IA  T +P++HGE F VLRYE GQ+Y  H+D F  
Sbjct: 106 IRTSSGMFLLRGEDD--VVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQD 163

Query: 186 --AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
              +   +  QR+A+ L+YL+DVEEGGET+FP
Sbjct: 164 EFNQKREKGGQRVATVLMYLTDVEEGGETIFP 195


>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
 gi|194704960|gb|ACF86564.1| unknown [Zea mays]
 gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
          Length = 207

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 60/159 (37%), Positives = 87/159 (54%), Gaps = 11/159 (6%)

Query: 64  ETGDASIQSIP------FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           E G   +   P       + +SW PR   +  F S  +C  ++  AKK+ + S +A  + 
Sbjct: 28  ENGGGGVAPAPPFNSSRVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNES 87

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G++V+S    RTSSG F+   +D   ++  IE +IA  T LPQ + E   VLRYE GQKY
Sbjct: 88  GKSVKSE--VRTSSGMFLDKRQDP--VVSRIEERIAAWTFLPQENAENMQVLRYEPGQKY 143

Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           + H+D F+      +   R A+ L+YLS V EGGET+FP
Sbjct: 144 EPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFP 182


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +++ AK  LK S +A    GE+  S    RTSSGTFIS 
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 98  GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+V +GGET+FP
Sbjct: 156 ATILMYLSNVTKGGETVFP 174


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 86/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +++ AK  LK S +A    GE+  S    RTSSGTFIS 
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 98  GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRM 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+V +GGET+FP
Sbjct: 156 ATILMYLSNVTKGGETVFP 174


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/139 (43%), Positives = 83/139 (59%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW PRA  +  F +  +C  +I+ AK  LK S +A    G++  S    RTSSG  IS 
Sbjct: 40  ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSD--VRTSSGMLISK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE +I+  T LP+ +GE   VLRYE GQKYD HYD F       Q   RL
Sbjct: 98  NKDP--IVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRL 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL++V +GGET+FP
Sbjct: 156 ATVLMYLTNVTKGGETVFP 174


>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 483

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 56/157 (35%), Positives = 86/157 (54%), Gaps = 12/157 (7%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           S + +  + LS RP  +    F S E+C  I   A  ++K S ++L+  +  + +   RT
Sbjct: 255 SQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSSEWRT 314

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           S   F+SA +D+  +L  I+H++A  T +P+ H E   VLRY  G+KYDSH+D F+P+ Y
Sbjct: 315 SQSAFLSARDDE--VLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDYFDPSAY 372

Query: 189 GPQMS----------QRLASFLLYLSDVEEGGETMFP 215
               S           R A+   YL+DV +GGET+FP
Sbjct: 373 RSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFP 409


>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
 gi|255644463|gb|ACU22735.1| unknown [Glycine max]
          Length = 285

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 57/142 (40%), Positives = 83/142 (58%), Gaps = 5/142 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I TA    LK   +    GE +E++   RTS+   +
Sbjct: 84  EVMSWEPRAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETS--YRTSTEYVV 141

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I+  IE +IA  T +P  HGE  +V+RY +GQ Y+ H D F          Q
Sbjct: 142 ERGKDK--IVRNIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQ 199

Query: 195 RLASFLLYLSDVEEGGETMFPF 216
           R+A+ L+YLS+VE GGET+FP 
Sbjct: 200 RIATMLMYLSNVEGGGETVFPI 221


>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 266

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 57/150 (38%), Positives = 89/150 (59%), Gaps = 3/150 (2%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKG 125
           D  +  +  +V+SW PR + F NF S+E+C  +   A+ RL+ S +  +  G+ V+S   
Sbjct: 52  DLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSD-- 109

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+ 
Sbjct: 110 VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSD 169

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                +  QR+A+ L+YL+D  EGGET FP
Sbjct: 170 TFNLKRGGQRVATMLMYLTDGVEGGETHFP 199


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 62/145 (42%), Positives = 88/145 (60%), Gaps = 13/145 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSSGTFI 134
           +VLS  PRA  + NF S E+C+ +I  AK  ++ S +    G T +      RTSSGTF+
Sbjct: 86  EVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVV--DGVTGQGILNSVRTSSGTFL 143

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFN---PAEYGP 190
              +DK  I++ +E +IA  T +P  +GE   ++ YE+GQK++ HYD  FN       GP
Sbjct: 144 ERGKDK--IVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNGGP 201

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               R+A+ L+YLSDVEEGGET+FP
Sbjct: 202 ----RVATVLMYLSDVEEGGETVFP 222


>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 276

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 59/162 (36%), Positives = 93/162 (57%), Gaps = 8/162 (4%)

Query: 55  ENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA- 113
           +N+ G       D  +  +  +V+SW PR + F NF S+E+C  +   A+ RL+ S +  
Sbjct: 45  DNDQG-----AADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVD 99

Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
           +  G+ V+S    RTSSG F+++ E K  +++ IE +I+  + +P  +GE   VLRYE  
Sbjct: 100 VATGKGVKSD--VRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPN 157

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           Q Y  H+D F+      +  QR+A+ L+YL+D  EGGET FP
Sbjct: 158 QYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP 199


>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 266

 Score =  103 bits (256), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 56/141 (39%), Positives = 86/141 (60%), Gaps = 3/141 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  +   A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSD--VRTSSGMFV 118

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 119 NSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQ 178

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YL+D  EGGET FP
Sbjct: 179 RVATMLMYLTDGVEGGETHFP 199


>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
 gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
          Length = 283

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 58/151 (38%), Positives = 88/151 (58%), Gaps = 3/151 (1%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
           I ++  +V+SW PR +   +F S E+C+ + A A+ RL+ S +  ++ G+ V+S    RT
Sbjct: 72  IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSG F++  E    I++ IE +IA  + +P  +GE   VLRYE  Q Y  H+D F     
Sbjct: 130 SSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFN 189

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
             +  QR+A+ L+YL+D  EGGET FP   D
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGD 220


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score =  103 bits (256), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/139 (43%), Positives = 85/139 (61%), Gaps = 5/139 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PRA  +  F +  +C  +++ AK  LK S +A    GE+  S    RTSSGTFI  
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSE--VRTSSGTFIPK 97

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KI+  T LP+ +GE   VLRYE GQKYD+H+D F+      +   R+
Sbjct: 98  GKDP--IVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRI 155

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YLS+V +GGET+FP
Sbjct: 156 ATVLMYLSNVTKGGETVFP 174


>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
          Length = 207

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 55/132 (41%), Positives = 77/132 (58%), Gaps = 5/132 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 80  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVV--DSETGKSKDSRVRTSSGTFL 137

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE +IA  + +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 138 ARGRDK--IVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDDFNTKNGGQ 195

Query: 195 RLASFLLYLSDV 206
           R+A+ L+YL+DV
Sbjct: 196 RIATVLMYLTDV 207


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 84/145 (57%), Gaps = 3/145 (2%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            +VLSW PRA  + NF +  +   ++   K  ++ S++   +      +K  RTSSG F+
Sbjct: 1   MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSK-VRTSSGMFL 59

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +  ED   ++E IE +IA+ T +P+ +GE   +L Y+  ++Y  H+D F+         Q
Sbjct: 60  NRGEDD--VIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGGQ 117

Query: 195 RLASFLLYLSDVEEGGETMFPFEVD 219
           R+A+ L+YLSDVE+GGET+FP   D
Sbjct: 118 RIATMLMYLSDVEDGGETVFPESSD 142


>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
           C-169]
          Length = 347

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/149 (37%), Positives = 87/149 (58%), Gaps = 15/149 (10%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +SW PRA     F    +C+ +I+ AK  +  S +     G++++ST   RTS+GTF   
Sbjct: 86  VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDST--VRTSTGTFFGR 143

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGP 190
            ED+  +++ IE +I+  T LP+ +GE   +L YE GQKY++H+D F+      P   G 
Sbjct: 144 EEDE--VIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGG- 200

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVD 219
              QR+A+ L+YL+  EEGGET+FP   +
Sbjct: 201 ---QRIATVLMYLTTAEEGGETVFPMAAN 226


>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 541

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 58/134 (43%), Positives = 82/134 (61%), Gaps = 5/134 (3%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  + NF S ++C+ ++A +K +L  S +   Q     S    RTS+GTFIS   D  
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQ-TGGSSLSEVRTSTGTFISRKYDD- 322

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  +E +I   + +PQ+H EAF +LRYE GQ+Y +H+D F         + R+A+ LL
Sbjct: 323 -IIAGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYF--FHKSGMRNNRIATVLL 379

Query: 202 YLSDVEEGGETMFP 215
           YLSDVEEGGET+FP
Sbjct: 380 YLSDVEEGGETVFP 393


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/127 (44%), Positives = 80/127 (62%), Gaps = 5/127 (3%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIE 148
           F S E+C  +IA  K +L+ S +A  + G++V S    RTSSG F+   +D+T  +  IE
Sbjct: 3   FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSE--IRTSSGMFLERRQDET--ITRIE 58

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
            +IA  T LP+ +GE   +L YE GQKYD+HYD F+          R+A+ L+YLSDV++
Sbjct: 59  KRIAAWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKK 118

Query: 209 GGETMFP 215
           GGET+FP
Sbjct: 119 GGETVFP 125


>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/151 (38%), Positives = 88/151 (58%), Gaps = 3/151 (1%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
           I ++  +V+SW PR +   +F S E+C+ + A A+ RL+ S +  ++ G+ V+S    RT
Sbjct: 72  IGNVKPEVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSD--VRT 129

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
           SSG F++  E    I++ IE +IA  + +P  +GE   VLRYE  Q Y  H+D F     
Sbjct: 130 SSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFN 189

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
             +  QR+A+ L+YL+D  EGGET FP   D
Sbjct: 190 LKRGGQRVATMLMYLTDDVEGGETYFPLAGD 220


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/141 (41%), Positives = 85/141 (60%), Gaps = 5/141 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFI 134
           ++LS  PRA  + NF S E+C+ +I  AK  +  S +     GE  ES+  +RTSSG F+
Sbjct: 113 EILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESS--SRTSSGMFL 170

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +DK  I++ IE +IA  T +P  +GE  +V+ Y +GQK + HYD  +          
Sbjct: 171 DRGKDK--IVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNGGP 228

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLSDVEEGGET+FP
Sbjct: 229 RVATVLMYLSDVEEGGETVFP 249



 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 67/127 (52%), Gaps = 20/127 (15%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           F S E+C+ +I  AK  +  S +     G+  ES+   RTSSG F+   +DK  I++ IE
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESS--ARTSSGRFLERGKDK--IVQNIE 427

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
            +IA  T +P+    A + + +  G               GP    R+A+ L+YLSDVEE
Sbjct: 428 QRIADITSIPRM---ARDFMLFTAG--------GVVTKNGGP----RVATVLMYLSDVEE 472

Query: 209 GGETMFP 215
           GGET+FP
Sbjct: 473 GGETVFP 479


>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
 gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 54/140 (38%), Positives = 81/140 (57%), Gaps = 8/140 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
           +SW PRA  + NF + E+C  ++  AK     +   L++    ++ T GT   SG F+  
Sbjct: 5   ISWEPRAFVYHNFLTPEECAHLVNLAKA----TDGGLKRATVADARTGGTFPGSGAFLLR 60

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQR 195
           + D   I+  IE +I+   M+P  HGE   +LRY  G+KYD H+D F+  +   +   QR
Sbjct: 61  NHDP--IVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFYGQR 118

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLSDVE GGET+FP
Sbjct: 119 VATVLMYLSDVESGGETVFP 138


>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/142 (39%), Positives = 82/142 (57%), Gaps = 3/142 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSD--VRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +  E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199

Query: 195 RLASFLLYLSDVEEGGETMFPF 216
           R+A+ L+YLSD  EGGET FP 
Sbjct: 200 RIATMLMYLSDNIEGGETYFPL 221


>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
 gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
 gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 272

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 57/150 (38%), Positives = 88/150 (58%), Gaps = 23/150 (15%)

Query: 75  FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
            +V++  PRA  + NF         + E+C  +I+ AK  +  S++  R   T +     
Sbjct: 88  LEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKV--RNALTGLGEESS 145

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           +RTSSGTFI +  DK  I++ IE +I+  T +PQ +GE   V+ YE+GQK++ H+D F  
Sbjct: 146 SRTSSGTFIRSGHDK--IVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                   QR+A+ L+YLSDV++GGET+FP
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFP 223


>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 239

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 62/147 (42%), Positives = 87/147 (59%), Gaps = 6/147 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           +S +PR   + +F S ++   +I+ A+  LK S +A    G++  S    RTSSGTF+  
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R 
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169

Query: 197 ASFLLYLSDVEEGGETMFPF-EVDLQI 222
           A+ LLYL+DV EGGET+FP  EV+  I
Sbjct: 170 ATVLLYLTDVPEGGETVFPLAEVNFFI 196


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 20/149 (13%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
           +SW PRA +  NF S E+C  I+  A+ R+       R+   ++S  G       RTS  
Sbjct: 1   VSWYPRAFHLHNFMSHEECDRILEIARPRV-------RRSTVIDSVTGQSKVDPIRTSEQ 53

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
           TF++       I+  +E ++A  T LP  HGE   +L+Y +GQKYD+H+D     +  G 
Sbjct: 54  TFLNRG--TWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHDVGELTSASGK 111

Query: 191 QMS----QRLASFLLYLSDVEEGGETMFP 215
           Q++     R+A+ LLYLSDVEEGGET FP
Sbjct: 112 QLAAEGGHRVATVLLYLSDVEEGGETAFP 140


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score =  100 bits (249), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 59/140 (42%), Positives = 83/140 (59%), Gaps = 5/140 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +S +PR   + +F S ++   +I+ A+  LK S +A    G++  S    RTSSGTF+  
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE--VRTSSGTFLRK 111

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R 
Sbjct: 112 GQDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRY 169

Query: 197 ASFLLYLSDVEEGGETMFPF 216
           A+ LLYL+DV EGGET+FP 
Sbjct: 170 ATVLLYLTDVPEGGETVFPL 189


>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 281

 Score = 99.8 bits (247), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 57/141 (40%), Positives = 79/141 (56%), Gaps = 3/141 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VLSW PR +   NF S E+C  +   A  RLK S +     G+ ++S    RTSSG F+
Sbjct: 76  EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSD--VRTSSGMFL 133

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S  E K  ++  IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 134 SHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQ 193

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YL D  EGGET FP
Sbjct: 194 RIATMLMYLGDNVEGGETHFP 214


>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
 gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
          Length = 329

 Score = 99.8 bits (247), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 60/157 (38%), Positives = 86/157 (54%), Gaps = 8/157 (5%)

Query: 63  GETGDASIQSIP---FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE 118
           GET  +    +P     VLSW+PR   +    + E+C  +I  A+ RL+ S ++    GE
Sbjct: 35  GETYRSGRTDVPDSRMVVLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGE 94

Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
                   RTSSG F +  E+   +++ IE ++A  TMLP  +GE   VLRYE  QKYD 
Sbjct: 95  G--GVSDIRTSSGMFYTRGEND--VVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDP 150

Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           H+D F+          R+A+ L+YL+  EEGGET+FP
Sbjct: 151 HHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFP 187


>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 204

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 74/129 (57%), Gaps = 5/129 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFI 134
           +V+SW PRA  + NF + E+C+ +I  AK  +  S +     ET +S     RTSSGTF+
Sbjct: 79  EVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVV--DSETGKSKDSRVRTSSGTFL 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           +   DK  I+  IE KIA  T +P  HGE   VL YE+GQKY+ HYD F          Q
Sbjct: 137 ARGRDK--IVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQ 194

Query: 195 RLASFLLYL 203
           R+A+ L+YL
Sbjct: 195 RIATVLMYL 203


>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 245

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/150 (37%), Positives = 89/150 (59%), Gaps = 23/150 (15%)

Query: 75  FQVLSWRPRALYFPNF--------ASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKG 125
            +V++  PRA  + NF         + E+C+ +I+ AK  +  S++  R   T +     
Sbjct: 55  LEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKV--RNAITGLGEESS 112

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           +RTSSGTF+    DK  I++ IE +I+  T +P+ +GEA  V+ YE+GQK++ H+D F  
Sbjct: 113 SRTSSGTFLRKGHDK--IVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                   QR+A+ L+YLSDV++GGET+FP
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFP 190


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/140 (39%), Positives = 86/140 (61%), Gaps = 6/140 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
           +SW PR   +  F S  +C+ +IA AK+ R++ S +   + GE+V S   TRTSSG F+ 
Sbjct: 40  VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE +IA  TM P  +GE+  +LRY  G+KY+ H+D     +   +   R
Sbjct: 98  RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLS+V+ GGET+FP
Sbjct: 156 IATVLMYLSNVKMGGETVFP 175


>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 222

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/134 (40%), Positives = 74/134 (55%), Gaps = 15/134 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +VLSW PRA  + NF S E+C  +I+ AK  +K S +       V+S  G       RTS
Sbjct: 97  EVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTV-------VDSATGGSKDSRVRTS 149

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+   +DK  I+  IE +IA  T +P   GE   VL YE+GQKY+ H+D F+     
Sbjct: 150 SGMFLRRGQDK--IIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNT 207

Query: 190 PQMSQRLASFLLYL 203
               QR+A+ L+YL
Sbjct: 208 KNGGQRIATLLMYL 221


>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
 gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
          Length = 343

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/162 (37%), Positives = 87/162 (53%), Gaps = 18/162 (11%)

Query: 63  GET---GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           GET   G+   Q     VLSW PR   +    + E+C  ++  ++ RL+ S ++      
Sbjct: 52  GETFNAGEHRAQDSRMVVLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVS------ 105

Query: 120 VESTKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
            ++T G       RTSSG F    E  T +++ IE+++A  TMLP  +GE   VLRYE  
Sbjct: 106 -DATTGAGAVSDIRTSSGMFYERGE--TELVKRIENRLAMWTMLPVENGEGIQVLRYEKT 162

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           QKYD H+D F+          R+A+ L+YL+  EEGGET+FP
Sbjct: 163 QKYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFP 204


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/140 (39%), Positives = 86/140 (61%), Gaps = 6/140 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKK-RLKPSQLAL-RQGETVESTKGTRTSSGTFIS 135
           +SW PR   +  F S  +C+ +IA AK+ R++ S +   + GE+V S   TRTSSG F+ 
Sbjct: 40  VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSK--TRTSSGMFLI 97

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
             +D+  ++  IE +IA  TM P  +GE+  +LRY  G+KY+ H+D     +   +   R
Sbjct: 98  RKQDE--VVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHR 155

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLS+V+ GGET+FP
Sbjct: 156 IATVLMYLSNVKMGGETVFP 175


>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
 gi|255628535|gb|ACU14612.1| unknown [Glycine max]
          Length = 238

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/142 (38%), Positives = 82/142 (57%), Gaps = 3/142 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           ++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQ 199

Query: 195 RLASFLLYLSDVEEGGETMFPF 216
           R+A+ L+YLSD  E GET FP 
Sbjct: 200 RIATMLMYLSDNIERGETYFPL 221


>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 274

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 61/141 (43%), Positives = 80/141 (56%), Gaps = 7/141 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           Q +   PRA YF NF +  +   ++  A  +LK S +    GE V      RTS G FI 
Sbjct: 2   QQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGV--VDNIRTSYGMFIR 59

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQ 194
             +D   ++  IE +I+  T LP  H E   VLRY  GQ Y +HYD+ + + E GP+   
Sbjct: 60  RLQDP--VVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW-- 115

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           RLA+FL+YLSDVEEGGET FP
Sbjct: 116 RLATFLMYLSDVEEGGETAFP 136


>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
          Length = 280

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 83/135 (61%), Gaps = 5/135 (3%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDK 140
           P    + NF +  +C  +I  A+ +L+ S +A  + G++V S    RTSSG F++ ++D+
Sbjct: 28  PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSE--IRTSSGMFLNKAQDE 85

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             I+  +E +IA  T LP  +GEA  VL YE+GQKY+ H+D F+          R+A+ L
Sbjct: 86  --IVASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVL 143

Query: 201 LYLSDVEEGGETMFP 215
           +YLSDV +GGET+FP
Sbjct: 144 MYLSDVVKGGETVFP 158


>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
          Length = 222

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 53/134 (39%), Positives = 74/134 (55%), Gaps = 15/134 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           +V+SW PRA  + NF S E+C+ +I  AK  +  S +       V+ST G       RTS
Sbjct: 98  EVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTV-------VDSTTGKSKDSRVRTS 150

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SG F+    DK  ++ +IE +IA  T +P  HGE   VL YE+GQKY+ H+D F      
Sbjct: 151 SGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 208

Query: 190 PQMSQRLASFLLYL 203
               QR+A+ L+YL
Sbjct: 209 KNGGQRMATLLMYL 222


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 58/140 (41%), Positives = 80/140 (57%), Gaps = 12/140 (8%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +SW+PRA  F NF + E+   I+A AK  +K S +    G +VE     RTS GTF+   
Sbjct: 35  VSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGASVEDQ--IRTSYGTFLKRL 92

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   I+  +E ++A  T L  +H E   +LRY IGQKY +HYD+ +        S R+ 
Sbjct: 93  QDP--IVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYDSLD------NDSPRVC 144

Query: 198 SFLLYLSDV--EEGGETMFP 215
           + LLYLSDV  + GGET FP
Sbjct: 145 TVLLYLSDVPADGGGETAFP 164


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 86/164 (52%), Gaps = 12/164 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   V    PR + F N  S E+C +IIA A+ R+  S L +      E  
Sbjct: 92  DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS+G F    E  TGI+  +E +IAR    P  HGE   VL Y  G +Y  H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205

Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
            P E G P +     QR+ + ++YL++ E GG T+FP EV LQ+
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP-EVPLQV 248


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score = 96.7 bits (239), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/164 (37%), Positives = 86/164 (52%), Gaps = 12/164 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   V    PR + F N  S E+C +IIA A+ R+  S L +      E  
Sbjct: 92  DVGDRQVQVL---VSMRNPRIVVFGNLLSHEECDAIIAAARPRMARS-LTVATQSGGEEI 147

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS+G F    E  TGI+  +E +IAR    P  HGE   VL Y  G +Y  H+D F
Sbjct: 148 NDDRTSNGMFFQRGE--TGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYF 205

Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
            P E G P +     QR+ + ++YL++ E GG T+FP EV LQ+
Sbjct: 206 APGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFP-EVPLQV 248


>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 324

 Score = 96.7 bits (239), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 57/143 (39%), Positives = 83/143 (58%), Gaps = 5/143 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES--TKGTRTSSGTFI 134
           LSW PR   +  F S E+C   I  AK +L+ S +A    GE+VES  +      S +FI
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFI 118

Query: 135 SA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           +   S +   I+  +E K+A  T LP+ +GE+  +L YE GQKY+ H+D F+        
Sbjct: 119 ANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELG 178

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A+ L+YLS+VE+GGET+FP
Sbjct: 179 GHRIATVLMYLSNVEKGGETVFP 201


>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 313

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 83/144 (57%), Gaps = 5/144 (3%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PR   + +F S ++   +++ A+  LK S +A    G++  S    RTS GTFIS 
Sbjct: 55  ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSE--VRTSYGTFISK 112

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +D   I+  IE KIA  T LP+ +GE   VLRY+ G+K +  +D F       +   R+
Sbjct: 113 GKDP--IVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRV 170

Query: 197 ASFLLYLSDVEEGGETMFPFEVDL 220
           A+ LLYL+DV EGGET+FP   D 
Sbjct: 171 ATVLLYLTDVAEGGETVFPLAKDF 194


>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 81/135 (60%), Gaps = 3/135 (2%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  + NF S ++C  +I  AK R++ S ++ +           RTSSG F++  +++ 
Sbjct: 83  PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 141

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            ++  IE +IA  T +P  +GE  ++L YE+GQK++ H+D  +P  +    + QR A+ +
Sbjct: 142 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 200

Query: 201 LYLSDVEEGGETMFP 215
           +YLS V+EGG T+FP
Sbjct: 201 MYLSGVKEGGATVFP 215


>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 323

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 81/135 (60%), Gaps = 3/135 (2%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  + NF S ++C  +I  AK R++ S ++ +           RTSSG F++  +++ 
Sbjct: 74  PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQ- 132

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            ++  IE +IA  T +P  +GE  ++L YE+GQK++ H+D  +P  +    + QR A+ +
Sbjct: 133 -LVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLV 191

Query: 201 LYLSDVEEGGETMFP 215
           +YLS V+EGG T+FP
Sbjct: 192 MYLSGVKEGGATVFP 206


>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 369

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 56/139 (40%), Positives = 77/139 (55%), Gaps = 11/139 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           PRA  +  F +  +C   IA A  +L  S +     GE V S    RTS G F    ED 
Sbjct: 83  PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSA--IRTSDGMFFDRGEDD 140

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPAEYGPQMSQRL 196
             +++ +E +I+  T LP  +GE   VLRY  GQKYD+H DA    FN  +      QR+
Sbjct: 141 --VVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDA--HGGQRV 196

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L+YL+DV++GGET+FP
Sbjct: 197 ATVLMYLNDVDDGGETVFP 215


>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
 gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
          Length = 275

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 79/145 (54%), Gaps = 18/145 (12%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTSS 130
           + LSW+PR   +  F S ++C  ++  AKK           G  V   +      TRTSS
Sbjct: 49  KALSWQPRIFVYKGFLSDDECDHLVTLAKK-----------GTMVAHNRSSYYRQTRTSS 97

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           G F+   +D   ++  IE +IA  T+LP+ + E   + RY+ GQKYD H+D F+   +  
Sbjct: 98  GMFLRKRQDP--VVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHT 155

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
           +   R A+ L+YLS V++GGET+FP
Sbjct: 156 RGGPRYATVLMYLSTVDKGGETVFP 180


>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
          Length = 287

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/154 (37%), Positives = 88/154 (57%), Gaps = 6/154 (3%)

Query: 67  DASIQSIPF---QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVES 122
           DA I  I +   ++L+W PR +   +F S+E+C  + A A+  L+ S +   Q G+ ++S
Sbjct: 70  DADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQS 129

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+S  +    I+  IE +I+  + +P  +GE   VLRY+  Q Y  H+D 
Sbjct: 130 D--VRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDY 187

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           F+ +    +  QR+A+ L+YLSD  EGGET FP 
Sbjct: 188 FSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPM 221


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 81/145 (55%), Gaps = 18/145 (12%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTSSGTFIS 135
           +PRA+ F N  S ++C  +IA +K +L      LR G     T  T     RTSSGTF  
Sbjct: 99  KPRAILFGNVLSHDECDQLIALSKTKL------LRSGVVDHQTGNTKLHEHRTSSGTFFH 152

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGP 190
                T  + +I+ ++A    +P++HGE   +L Y++G +Y  HYD F P     A++  
Sbjct: 153 --RGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLA 210

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
           +  QR A+ ++YL+DV+ GGET+FP
Sbjct: 211 RGGQRTATLIIYLNDVDGGGETIFP 235


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 5/143 (3%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTF 133
            + +S  PRA  F  F +  +C  +I  A   ++ S++     GE        R+S G +
Sbjct: 68  IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEA--RPDDARSSIGGW 125

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           +S  +D+  ++  IE + +   MLP   GE   VLRYE GQKYD+H D F+         
Sbjct: 126 VSGDDDE--VIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGG 183

Query: 194 QRLASFLLYLSDVEEGGETMFPF 216
           QR+A+ L+YLSDVEEGGET+FP 
Sbjct: 184 QRVATILMYLSDVEEGGETVFPL 206


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 84/145 (57%), Gaps = 9/145 (6%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           ++I  +VL   P  + F N  S E+CQ++I  A  RL+ S+LA ++  ++      RTSS
Sbjct: 19  RTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEISSI------RTSS 72

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           G F    E++  ++  IE +I+    LP  H E   VL YE GQ++ +H+D F P  +  
Sbjct: 73  GMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGP-NHPS 129

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
             + R+++ ++YL+DVEEGG T FP
Sbjct: 130 SSNNRISTLVVYLNDVEEGGVTTFP 154


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 81/142 (57%), Gaps = 11/142 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C+++I  +K ++K S++ + +      T   RTSSG F+  SE   
Sbjct: 42  PLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR-----KTNDIRTSSGAFLEESE--- 93

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I   IE +IA    +P  HGE   +L+Y +GQ+Y +HYD F         + R+++ ++
Sbjct: 94  -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151

Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
           YL+ VEEGGET FP +++L +S
Sbjct: 152 YLNHVEEGGETFFP-KLNLSVS 172


>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
          Length = 180

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 50/111 (45%), Positives = 68/111 (61%), Gaps = 9/111 (8%)

Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
            V+ST G       RTSSG F+    DK  ++ +IE +IA  T +P  HGE   VL YE+
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRIADYTFIPVDHGEGLQVLHYEV 63

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           GQKY+ H+D F          QR+A+ L+YLSDVEEGGET+FP + ++ +S
Sbjct: 64  GQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFP-DANVNVS 113


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 57/182 (31%), Positives = 95/182 (52%), Gaps = 17/182 (9%)

Query: 50  EVVEKENESGLPHGETGDASIQ----SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKK 105
           E++ K +   +PH    +  ++     +    + ++P      +F S E+C ++I+ A +
Sbjct: 43  ELISKSSNQKIPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQ 102

Query: 106 RLKPSQLAL-RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEA 164
           +LK S++     G  VE +  T TS+G        +  I++ IE +IA     P  HGE 
Sbjct: 103 KLKASRVVDPEDGSFVEHSARTSTSTGY----HRGEIDIIKTIEARIADLINWPVDHGEG 158

Query: 165 FNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP---F 216
             VLRYE G +Y  H+D F+PA+   ++      QR+ +FL+YLS+V+ GG T FP   F
Sbjct: 159 LQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFPNLNF 218

Query: 217 EV 218
           E+
Sbjct: 219 EI 220


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 50/142 (35%), Positives = 81/142 (57%), Gaps = 11/142 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C+++I  +K ++K S++ + +      T   RTSSG F+  SE   
Sbjct: 42  PLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR-----KTNDIRTSSGAFLEESE--- 93

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I   IE +IA    +P  HGE   +L+Y +GQ+Y +HYD F         + R+++ ++
Sbjct: 94  -ITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFF-VENSAAASNNRMSTLVM 151

Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
           YL+ VEEGGET FP +++L +S
Sbjct: 152 YLNHVEEGGETFFP-KLNLSVS 172


>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 226

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/155 (34%), Positives = 80/155 (51%), Gaps = 12/155 (7%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           +++  + LS  P  L    F S ++C  I  TA+  ++ S++ L   +        RTS 
Sbjct: 3   RAVTLETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPASDFRTSQ 62

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
             FI A +D   IL  I+++ A    +P+ H E   VLRY++ +KYDSH D F+PA Y  
Sbjct: 63  SAFIRAHDD--AILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTK 120

Query: 191 Q----------MSQRLASFLLYLSDVEEGGETMFP 215
                         R+A+   YLSDVE+GGET+FP
Sbjct: 121 DKRTLALIRNGHRNRMATVFWYLSDVEKGGETVFP 155


>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
 gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
          Length = 299

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 23/188 (12%)

Query: 50  EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
           EV  + +ESG+P  +          GD  +  +    +S R PR + F N  S E+C +I
Sbjct: 74  EVAVQVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 129

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
           IA A+ R++ S L +      E+    RTS+G F    E++  ++ L+E +IAR    P 
Sbjct: 130 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGENE--LISLVEQRIARLLNWPL 186

Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
            +GE   VL Y  G +Y  HYD F P E G P +     QR+ + ++YL++   GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246

Query: 215 PFEVDLQI 222
           P +V LQ+
Sbjct: 247 P-DVGLQV 253


>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
 gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
          Length = 797

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 54/152 (35%), Positives = 78/152 (51%), Gaps = 6/152 (3%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PRA  + NF ++ +C  ++    +R+  S L +            RTS G   
Sbjct: 493 IETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRS-LVVDSQTGQSKLDDIRTSYGAAF 551

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM- 192
              ED   ++  IE +IA  T LP  HGE   +LRY  GQKYD+H+D F +P  +   + 
Sbjct: 552 GRGEDP--VIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609

Query: 193 -SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
              R A+ LLYLS+VE GGET  P    + +S
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMS 641


>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 54/141 (38%), Positives = 81/141 (57%), Gaps = 3/141 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR +   NF S ++C  +   A  RL+ S +   + G+ V+S    RTSSG F+
Sbjct: 83  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSD--FRTSSGMFL 140

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S  E    +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+      +  Q
Sbjct: 141 SHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQ 200

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L+YLS+  EGGET FP
Sbjct: 201 RIATMLMYLSENIEGGETYFP 221


>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
          Length = 300

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 75/143 (52%), Gaps = 2/143 (1%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  +VLSW PR   +    + E+C  ++  A  RL  S +        ES    RTS G 
Sbjct: 14  ILLKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGM 73

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F    ED+  ++  +E +++  +++P  HGE   VLRYE G++Y  H+D F         
Sbjct: 74  FFDRGEDE--VVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNG 131

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             RLA+ L+YL++ E GGET+FP
Sbjct: 132 GNRLATILMYLAEPEFGGETVFP 154


>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
 gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
          Length = 284

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 57/162 (35%), Positives = 85/162 (52%), Gaps = 9/162 (5%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           DA  + +   +    PR + F N  S E+CQ++I  A+ R+  S L ++     E     
Sbjct: 82  DAGDRRVEVLMAMANPRVVLFGNLLSPEECQAVIEAARTRMARS-LTVQAASGGEEVNKD 140

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTS G F    E++   +  +E +IAR    P  +GE   VL Y  G +Y  HYD F+PA
Sbjct: 141 RTSDGMFFQRGENEA--VARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPA 198

Query: 187 EYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           E G P++     QR+A+ ++YL+D   GG T FP +V L+I 
Sbjct: 199 EPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFP-DVPLEIG 239


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 57/149 (38%), Positives = 84/149 (56%), Gaps = 11/149 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
           +PR + F N  S E+C +II  A+ R+  S  +A R G   E     RTS+G F    E+
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 178

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
              ++  +E +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     Q
Sbjct: 179 P--VVARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQ 236

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+A+ ++YL+D E+GG T FP +V L+++
Sbjct: 237 RVATIVIYLNDPEKGGGTTFP-DVHLEVA 264


>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 57/146 (39%), Positives = 82/146 (56%), Gaps = 11/146 (7%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSD--VRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY----DSHYDAFNPAEYGP 190
           ++ E K  +++ IE +I+  + +P  +GE   VLRYE  Q Y    D  +D FN    G 
Sbjct: 140 NSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGG- 198

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
              Q +A+ L+YLSD  EGGET FP 
Sbjct: 199 ---QGIATMLMYLSDNIEGGETYFPL 221


>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
          Length = 325

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 55/141 (39%), Positives = 78/141 (55%), Gaps = 10/141 (7%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           F+ +SW PRA    NFAS E+   +I  A+ +L+ S +   +GE+V      RTS G FI
Sbjct: 32  FEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSRGESV--VDNYRTSYGMFI 89

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  ++  +E ++A  T    TH E   VLRY   Q+Y +H+D+ +        S 
Sbjct: 90  RRHHDE--VVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLD------DDSP 141

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R A+ L+YLSDVE GGET FP
Sbjct: 142 RTATVLIYLSDVESGGETTFP 162


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 82/145 (56%), Gaps = 9/145 (6%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           ++I  +VL   P  + F N  S E+CQ++I  A  RL+ S+LA ++  ++      RTSS
Sbjct: 19  RTITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEISSI------RTSS 72

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
           G F    E++  ++  IE +I+    LP  H E   VL YE GQ++  H+D F P  +  
Sbjct: 73  GMFFE--ENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGP-NHPS 129

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
             + R+ + ++YL+DVEEGG T FP
Sbjct: 130 SSNNRICTLVVYLNDVEEGGVTTFP 154


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 56/156 (35%), Positives = 86/156 (55%), Gaps = 21/156 (13%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
           +SWRPRA  +  F S  +C  +I+ AK+  K  +  +  GE+ ES T   RTSSG F+  
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103

Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
            +D+  ++  IE +IA  TMLP                   +GE+  +LRY  G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           +D  +  +   +   R+A+ L+YLS+V+ GGET+FP
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFP 197


>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
 gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
          Length = 364

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 77/135 (57%), Gaps = 7/135 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PRA  F NF +  +   ++  A  +LK S +   +GE V      RTS G FI    D  
Sbjct: 55  PRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSKGEGV--VDNIRTSFGMFIRRLSDP- 111

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQRLASFL 200
            I+  IE +I+  T LP  H E   VLRY  GQ Y +HYD+   +++ GP+   RLA+FL
Sbjct: 112 -IIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVGPKW--RLATFL 168

Query: 201 LYLSDVEEGGETMFP 215
           +YLSDVEEGGET FP
Sbjct: 169 MYLSDVEEGGETAFP 183


>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
 gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
          Length = 306

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 95/188 (50%), Gaps = 23/188 (12%)

Query: 50  EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
           EV    +ESG+P  +          GD  +  +    +S R PR + F N  S E+C +I
Sbjct: 81  EVAVLVHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSDEECDAI 136

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
           IA A+ R++ S L +      E+    RTS+G F    E+   ++ L+E +IAR    P 
Sbjct: 137 IAAARPRMRRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISLVEQRIARLLNWPL 193

Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
            +GE   VL Y  G +Y  HYD F P E G P +     QR+ + ++YL++   GG T F
Sbjct: 194 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 253

Query: 215 PFEVDLQI 222
           P +V LQI
Sbjct: 254 P-DVGLQI 260


>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
          Length = 180

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 62/103 (60%), Gaps = 8/103 (7%)

Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
            V+ST G       RTSSG F+    DK  ++  IE +IA  T +P  HGE   VL YE+
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRAIEKRIADYTFIPVDHGEGLQVLHYEV 63

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           GQKY+ H+D F          QR+A+ L+YLSDVEEGGET+FP
Sbjct: 64  GQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFP 106


>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 258

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 74/147 (50%), Gaps = 6/147 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  + +SW PRA  + NF S  +C  +     KR+  S L +            RTS G 
Sbjct: 6   IRIETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRS-LVVDSKTGQSKLDDIRTSYGA 64

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQ 191
                ED   ++  +E +IA  T LP  +GE   +LRY  GQKYD+H+D F +P  +   
Sbjct: 65  AFGRGEDP--VIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAY 122

Query: 192 MSQ--RLASFLLYLSDVEEGGETMFPF 216
           + +  R A+ LLYLS VE GGET  P 
Sbjct: 123 LHEGNRYATVLLYLSGVEGGGETNLPL 149


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 55/139 (39%), Positives = 73/139 (52%), Gaps = 3/139 (2%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL    R   +  F + E+C  I   A+KRL+ S + +  G         RTS G F   
Sbjct: 38  VLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGV-VDTGSGGSVVSDIRTSDGMFFER 96

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            ED   I+E +E ++A  TM P   GE+  VLRY   QKYDSH+D F   +       R 
Sbjct: 97  GED--AIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYL++ EEGGET+FP
Sbjct: 155 ATVLLYLTETEEGGETVFP 173


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 15/146 (10%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTS 129
           QVL    R   + NF + E+C  II  A+  +  S +       VE+  G       RTS
Sbjct: 64  QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGV-------VETDSGKSKIDNVRTS 116

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
            GTF++   D   ++  IE +IA+ T++P  +GE   VL+YE GQ+Y+ HYD F      
Sbjct: 117 KGTFLNRGHDS--VIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGT 174

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
                R  + L+YL+DVEEGGET FP
Sbjct: 175 ANGGNRYLTVLMYLNDVEEGGETCFP 200


>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 245

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/136 (43%), Positives = 76/136 (55%), Gaps = 7/136 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
            PRA  F NF +  +   ++  A  +LK S +    GE V      RTS G FI    D 
Sbjct: 60  HPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGNDGEGV--VDEIRTSYGMFIRRLADP 117

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQMSQRLASF 199
             ++  IE +I+  T LP  H E   VLRY  GQ Y +HYD+ + + E GP+   RLA+F
Sbjct: 118 --VITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGDKSNEPGPKW--RLATF 173

Query: 200 LLYLSDVEEGGETMFP 215
           L+YLSDVEEGGET FP
Sbjct: 174 LMYLSDVEEGGETAFP 189


>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/141 (38%), Positives = 86/141 (60%), Gaps = 3/141 (2%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +S  PRA  + NF + E+ ++ IA A++ ++ S++ + + +    T   RTSSG ++
Sbjct: 78  IERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEV-VNEADGTSKTSDERTSSGGWV 136

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
           S  + +  ++  IE ++A  TMLP+  GE   V+RYE GQ+Y +H D F+         Q
Sbjct: 137 SGEDSE--VMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQ 194

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R A+ L+YLSDVEEGGET+FP
Sbjct: 195 RAATVLMYLSDVEEGGETVFP 215


>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
          Length = 488

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 85/165 (51%), Gaps = 14/165 (8%)

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           P   TG++  +++  + LS +P  L    F + E+C  I+  A   +K S ++L+  +  
Sbjct: 255 PSTITGNS--KNVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKG 312

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
                 RTS  TF++A  D   IL  IE + A  T +P TH E   VLRY + +KYD+H+
Sbjct: 313 RPASDWRTSQSTFVAAMGDP--ILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHH 370

Query: 181 DAFNPAEY--GPQMSQ--------RLASFLLYLSDVEEGGETMFP 215
           D F+P+ Y   P   Q        R A+   YL+DV  GGET FP
Sbjct: 371 DFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFP 415


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 84/153 (54%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C+ +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECEELIELSKNKMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/144 (38%), Positives = 82/144 (56%), Gaps = 10/144 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALR-QGETVESTKGTRTSSGTFISA 136
           +SW+PRA    +  S E+C+ I+  AK  +K S +     GE    T   RTS  TF++ 
Sbjct: 83  ISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEI--KTDPIRTSKQTFLAR 140

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQMS-- 193
              K  ++  +E +++R TMLP  +GE   +L Y +G+KY +H+D      + G Q+S  
Sbjct: 141 G--KYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHDVGEKNTKSGQQLSAD 198

Query: 194 --QRLASFLLYLSDVEEGGETMFP 215
             QR+A+ LLYL D EEGGET FP
Sbjct: 199 GGQRVATVLLYLQDTEEGGETAFP 222


>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
           C-169]
          Length = 285

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 81/145 (55%), Gaps = 5/145 (3%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTKGTRTSS 130
           S+  + +SW PRA  +    S ++C  II  A+  + K + L  +  + V +    R + 
Sbjct: 50  SLMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNK--LRNNK 107

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
             +I  S D   +++ IE +IAR T LP  HGE F++++Y  GQ Y  H D  +   +  
Sbjct: 108 EAYIDGSADD--VIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPR 165

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
             ++R+A+ ++YLSDV EGGET+FP
Sbjct: 166 LGNERIATMIIYLSDVVEGGETVFP 190


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 80/135 (59%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
           P  +   N  S E+C+S+I  +K  +K S++ A R+ + +      RTSSGTF+  +E  
Sbjct: 43  PLIVVLENVLSDEECESLIELSKDSMKRSKIGASREVDNI------RTSSGTFLEENE-- 94

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
              + +IE +++    +P  HGE  ++L+Y  GQ+Y +HYD F       + + R+++ +
Sbjct: 95  --TVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEHSRAAE-NNRISTLV 151

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 152 MYLNDVEEGGETFFP 166


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/118 (44%), Positives = 71/118 (60%), Gaps = 5/118 (4%)

Query: 99  IIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATML 157
           +I  AK  +  S +   Q G++V S    RTSSG F+   +DK  +++ IE +IA    +
Sbjct: 1   MINLAKPHMAKSSVVDSQTGKSVGSR--VRTSSGMFLKRGKDK--VIQTIEKRIADFAFI 56

Query: 158 PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           P  +GE   VL YE+GQKY+ HYD F          QR+A+ L+YLSDVEEGGET+FP
Sbjct: 57  PVENGEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFP 114


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 83/153 (54%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKNKMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/182 (35%), Positives = 90/182 (49%), Gaps = 14/182 (7%)

Query: 43  RPKLRTLEVVEKENESGLPHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSI 99
           RPK      V     +G P  E+ +A   S + IP       PR + F +F S E+C  +
Sbjct: 56  RPKPEPAATVPSPAPAG-PEPESSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDQL 114

Query: 100 IATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP 158
           IA  + RLK S +   + GE  E+    RTS G      E    ++  IE +IA+AT +P
Sbjct: 115 IALGRHRLKRSPVVNPETGE--ENLISARTSQGAMFQVGEHP--LVARIEARIAQATGVP 170

Query: 159 QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETM 213
             HGE F VL Y+ G +Y  H+D FNP   G         QR+A+ ++YL+ V  GG T 
Sbjct: 171 VEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATG 230

Query: 214 FP 215
           FP
Sbjct: 231 FP 232


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 10/153 (6%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 10/153 (6%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 10/153 (6%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECNELIEMSKNKIKRSTIG-----SARDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 77/135 (57%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+    ++ R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 77/135 (57%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+    ++ R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAVNNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 57/134 (42%), Positives = 71/134 (52%), Gaps = 5/134 (3%)

Query: 83  RALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKT 141
           R   +  F + E+C  I   A+KRL+ S +     G +V S    RTS G F    ED  
Sbjct: 44  RIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSD--IRTSDGMFFERGED-- 99

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ILE +E ++A  TM P   GEA  VLRY   QKYDSH + F   E       R A+ L 
Sbjct: 100 AILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLT 159

Query: 202 YLSDVEEGGETMFP 215
           YL+D EEGGET+FP
Sbjct: 160 YLTDTEEGGETVFP 173


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 56/149 (37%), Positives = 83/149 (55%), Gaps = 11/149 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
           +PR + F N  S E+C +II  A+ R+  S  +A R G   E     RTS+G F    E+
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGG--EEVNDDRTSNGMFFQREEN 167

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
              ++  +E +IAR    P  +GE   VL Y  G +Y  HYD F+P E G P +     Q
Sbjct: 168 P--MVAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQ 225

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+A+ ++YL+D E+GG T FP +V L+++
Sbjct: 226 RVATIVIYLNDPEKGGGTTFP-DVHLEVA 253


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 55/151 (36%), Positives = 81/151 (53%), Gaps = 6/151 (3%)

Query: 75  FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
            QV+S   PRA     F S  +C  ++  A+  +  S + +       S    RTS+G+F
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGV-VDASNGGSSFSNIRTSTGSF 216

Query: 134 ISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
           +          ++  IE +IA  T +P  HGE   VLRY+IGQ+Y SH+D F     G  
Sbjct: 217 VPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYF--FHEGGM 274

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
            + R+A+ L+YLSDV++GGET+FP    LQ+
Sbjct: 275 KNNRIATVLMYLSDVKDGGETVFPSAESLQV 305


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 83/153 (54%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 81/158 (51%), Gaps = 8/158 (5%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G T  A  + +  ++   RP+ + F +  S E+C  +I  A+ RLK S     +  + E 
Sbjct: 74  GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 132

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G +    ED    +E ++H+I+     P  HGE   +L Y  G +Y  H+D 
Sbjct: 133 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 190

Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
           F P + G  +      QR+A+ ++YLSDVE GGET+FP
Sbjct: 191 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP 228


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SARDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
 gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
          Length = 180

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/103 (45%), Positives = 62/103 (60%), Gaps = 8/103 (7%)

Query: 119 TVESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI 172
            V+ST G       RTSSG F+    DK  ++ +IE +I   T +P  HGE   VL YE+
Sbjct: 6   VVDSTTGKSKDSRVRTSSGMFLQRGRDK--VIRVIEKRITDYTFIPVDHGEGLQVLHYEV 63

Query: 173 GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           GQKY+ H+D F          QR+A+ L++LSDVEEGGET+FP
Sbjct: 64  GQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFP 106


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 79/134 (58%), Gaps = 10/134 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      N  S E+C  +I+ +K R+  S++A  Q   +      RTS+  F+   ED +
Sbjct: 33  PFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQENDI------RTSTSVFLP--EDAS 84

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            +++ +E +I++   +P  HGE   +L Y+IGQ+Y +H+D F+P +     + R+++ +L
Sbjct: 85  EVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIE--NPRISTLVL 142

Query: 202 YLSDVEEGGETMFP 215
           YL+DVEEGG+T FP
Sbjct: 143 YLNDVEEGGDTYFP 156


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 81/158 (51%), Gaps = 8/158 (5%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G T  A  + +  ++   RP+ + F +  S E+C  +I  A+ RLK S     +  + E 
Sbjct: 77  GNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGS-ED 135

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G +    ED    +E ++H+I+     P  HGE   +L Y  G +Y  H+D 
Sbjct: 136 VIQLRTSEGFWFQRCED--AFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDY 193

Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
           F P + G  +      QR+A+ ++YLSDVE GGET+FP
Sbjct: 194 FPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFP 231


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 73  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232


>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
 gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
          Length = 299

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 95/188 (50%), Gaps = 23/188 (12%)

Query: 50  EVVEKENESGLPHGE---------TGDASIQSIPFQVLSWR-PRALYFPNFASAEQCQSI 99
           EV  + +ESG+P  +          GD  +  +    +S R PR + F N  S E+C +I
Sbjct: 74  EVAVQIHESGVPEPDLLTDPCCVHAGDRDVHVL----MSVRHPRVVVFGNLLSNEECDAI 129

Query: 100 IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
           IA A+ R++ S L +      E+    RTS+G F    E+   ++  +E +IAR    P 
Sbjct: 130 IAAARPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND--LISRVEQRIARLLNWPL 186

Query: 160 THGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMF 214
            +GE   VL Y  G +Y  HYD F P E G P +     QR+ + ++YL++   GG T F
Sbjct: 187 ENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTF 246

Query: 215 PFEVDLQI 222
           P +V LQ+
Sbjct: 247 P-DVGLQV 253


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K  +K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDKLIELSKNNMKRSKVG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
          Length = 280

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 59/166 (35%), Positives = 89/166 (53%), Gaps = 14/166 (8%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  SAE+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 78  DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E++  I+  +E +IA     P   GE   +LRY  G +Y  HYD 
Sbjct: 135 VD--RTSDGMFFERGENE--IVARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDY 190

Query: 183 FNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           F+P+E G      +  QR+A+ ++YL + E GG T FP +V L+++
Sbjct: 191 FDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP-DVGLEVA 235


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 64  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 123

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 124 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 179

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 180 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 223


>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 280

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/166 (35%), Positives = 90/166 (54%), Gaps = 14/166 (8%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  SAE+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 78  DAGDRRVQVL--QTMR-HPRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLN 134

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E++  I+  +E ++A     P  +GE   +LRY  G +Y  HYD 
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190

Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           F+P E G P +     QR+A+ ++YL + E GG T FP +V L+++
Sbjct: 191 FDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFP-DVGLEVA 235


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  SE   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 56/154 (36%), Positives = 84/154 (54%), Gaps = 19/154 (12%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 57  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDV 107

Query: 124 KGTRTSSGTFISASEDKTGILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
              RTSSG F+  SE     L L IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 108 NDIRTSSGAFLEDSE-----LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 162

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
           F  AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 163 F--AEHSRSAANNRISTLVMYLNDVEEGGETYFP 194


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 56/164 (34%), Positives = 81/164 (49%), Gaps = 20/164 (12%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           G   DA+ + IP  +   RP+ + F N    ++C  +I  +  +L+       Q  TV +
Sbjct: 77  GNVIDAADRRIPVLIRCERPQIVVFGNVLDQDECDEMIQRSMHKLE-------QSTTVNA 129

Query: 123 TKGT------RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
             GT      RTS GT+    ED   ++  IE ++A     P  +GE   VLRY  G +Y
Sbjct: 130 ETGTQEVIRHRTSHGTWFQNGED--ALIRRIETRLAALMNCPVENGEGLQVLRYTPGGEY 187

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            SHYD F P   G         QR+A+ ++YL+DV  GGET+FP
Sbjct: 188 RSHYDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFP 231


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDKLIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 73  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 132

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 133 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 188

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 189 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 232


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 76  PEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 135

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 136 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 191

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 192 QPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 235


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 77/136 (56%), Gaps = 14/136 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  SE   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDSE--- 122

Query: 142 GILEL-IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASF 199
             L L IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ 
Sbjct: 123 --LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTL 178

Query: 200 LLYLSDVEEGGETMFP 215
           ++YL+DVEEGGET FP
Sbjct: 179 VMYLNDVEEGGETYFP 194


>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
 gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
          Length = 216

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
 gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
          Length = 216

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKNNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 84/153 (54%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P +HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
 gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
          Length = 216

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 277

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 81/147 (55%), Gaps = 11/147 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 140
           P    F N  SA +C+++IA A+ RL  S  + +R G   E     RTS G F +  E++
Sbjct: 90  PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGG--EELNHDRTSHGMFYTRGENE 147

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQR 195
             ++  IE +IAR    P  +GE   VLRY  G +Y  HYD F+P E G      +  QR
Sbjct: 148 --VIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAAILRRGGQR 205

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
           +AS ++YL +  EGG T+FP ++ L++
Sbjct: 206 VASLIMYLREPGEGGATVFP-DIGLKV 231


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 79/153 (51%), Gaps = 10/153 (6%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +IA  + RLK S +   + GE  E+    R
Sbjct: 83  SDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGE--ENLISAR 140

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y  H+D FNP  
Sbjct: 141 TSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGR 198

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 199 SGEARQLDVGGQRVATLVIYLNSVQAGGATGFP 231


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 81/147 (55%), Gaps = 9/147 (6%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P  + F NF +  +C +++  ++  L PS++   Q    E  K +RTS GT  +  E  
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFE-LKPSRTSGGTHFARGE-- 158

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
           T ++  IE +IA    +P+ HGE   +L Y +  +Y  HYD F+P + G Q       QR
Sbjct: 159 TPLIADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQR 218

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
           + + ++YLSDVE GG T+FP  V L++
Sbjct: 219 VGTLIMYLSDVESGGATVFP-RVGLEV 244


>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
 gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
          Length = 289

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 79/147 (53%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F N  S E+CQ+II  A+ R+  S L ++     E     RTS G F    E  T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +++ +E +IAR    P  +GE   VL Y  G +Y  HYD F+P + G         QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL++  +GG T FP +V L+++
Sbjct: 219 ATLVIYLNNPRKGGGTTFP-DVPLEVA 244


>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
 gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
          Length = 216

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
 gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
          Length = 216

 Score = 89.7 bits (221), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score = 89.7 bits (221), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 83/164 (50%), Gaps = 13/164 (7%)

Query: 61  PHGETGDA---SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ- 116
           P  E  +A   S + IP       PR + F +F S E+C  +IA  + RLK S +   + 
Sbjct: 67  PGAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPET 126

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE  E+    RTS G      E    ++  IE +IA+AT +P  HGE F VL Y  G +Y
Sbjct: 127 GE--ENLISARTSEGAMFQVGEHP--LVARIEARIAQATGVPVEHGEGFQVLHYHPGGEY 182

Query: 177 DSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
             H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 183 QPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGFP 226


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECGELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P THGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 78/134 (58%), Gaps = 9/134 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K R+  S++A        +    RTSS TFI   E++ 
Sbjct: 39  PLIVVLGNVLSDEECDELIRLSKDRINRSKIA------NANVDNMRTSSSTFIE--ENEN 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  IE +I++   +P  +GE   +L Y++GQ+Y SH+D F+ + +    + R+++ ++
Sbjct: 91  IIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFS-SPHNAINNPRISTLVM 149

Query: 202 YLSDVEEGGETMFP 215
           YLSDVE+GGET FP
Sbjct: 150 YLSDVEQGGETYFP 163


>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
 gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
          Length = 216

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K ++K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIKRSKIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 280

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 90/166 (54%), Gaps = 14/166 (8%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  S E+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 78  DAGDRQVQVL--QTMR-HPRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLN 134

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E++  I+  +E ++A     P  +GE   +LRY  G +Y  HYD 
Sbjct: 135 VD--RTSDGMFFERGENE--IVARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDY 190

Query: 183 FNPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           F+P E G P +     QR+A+ ++YL + E+GG T FP +V L+++
Sbjct: 191 FDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFP-DVGLEVA 235


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 83/153 (54%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 83/153 (54%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDELIELSKSKMERSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
 gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
          Length = 216

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECGELIEMSKNKMKRSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
 gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 253

 Score = 89.4 bits (220), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 58/156 (37%), Positives = 84/156 (53%), Gaps = 10/156 (6%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           ++GD     I  + +SW PRA  +  F S  +C  +I  A  +L+ S +   + + V+  
Sbjct: 29  DSGDVGAPWI--ETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDPI 86

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    I  +E  T ++  IE +IAR T LP++H E   VLRY  GQKYD+H+D F
Sbjct: 87  ---RTSYSASIGYNE--TDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWF 141

Query: 184 N-PAEYGPQMSQRLASFLLYLSDVE--EGGETMFPF 216
           +     G     R+A+ L+YLSD+E   GGET  P 
Sbjct: 142 DETETGGTGGGNRMATALMYLSDMEPAAGGETALPL 177


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 88/163 (53%), Gaps = 11/163 (6%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKG 125
           DA  +++   +   +PR + F N  SAE+C ++IA A  R+  S  +A + G   E    
Sbjct: 97  DAGDRAVNILLAIAKPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGG--EEVND 154

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTS G F    E+   +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+P
Sbjct: 155 DRTSDGMFFQRGENP--VVQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDP 212

Query: 186 AEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            E G P +     QR+ + ++YL+  E+GG T FP +V ++++
Sbjct: 213 GEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFP-DVHVEVA 254


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
 gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
          Length = 216

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K  +K S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECAELIELSKSNMKRSKVG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+  T +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
 gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
          Length = 216

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKMKRSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 77/134 (57%), Gaps = 12/134 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C ++I  +K +LK S++   + E        RTSS TF+   E + 
Sbjct: 37  PLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNEN-----DMRTSSSTFMEEGESE- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  +E +I++   +P  +GE   +L Y+IGQ+Y +H+D F  A      + R+++ ++
Sbjct: 91  -VVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNAS-----NPRISTLVM 144

Query: 202 YLSDVEEGGETMFP 215
           YL+DVEEGGET FP
Sbjct: 145 YLNDVEEGGETYFP 158


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178


>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
 gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
          Length = 216

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 336

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/141 (39%), Positives = 77/141 (54%), Gaps = 7/141 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           Q +   PRA YF NF +  +   ++  A  +LK S +   +          RTS G FI 
Sbjct: 20  QQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGK--GEGVVDDIRTSYGMFIR 77

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY-GPQMSQ 194
              D   ++  IE +I+  T LP  H E   +LRY  GQ Y +HYD+   +++ GP+   
Sbjct: 78  RLSDP--VVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVGPKW-- 133

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           RLA+FL+YLSDVEEGGET FP
Sbjct: 134 RLATFLMYLSDVEEGGETAFP 154


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            + E IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 9/140 (6%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P  + F    S ++C+ +I TA  RLK S+L  +           RTS G F   
Sbjct: 25  VLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNK------VVSDIRTSRGMFFE- 77

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            E+++  +  IE +IA+   +P  H E   VL Y  GQ+Y +H+D F P     + + R+
Sbjct: 78  -EEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPGSPAAR-NNRI 135

Query: 197 ASFLLYLSDVEEGGETMFPF 216
           ++ ++YL+DVEEGGET+FP 
Sbjct: 136 STLIVYLNDVEEGGETVFPL 155


>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
 gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 251

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 54/141 (38%), Positives = 72/141 (51%), Gaps = 10/141 (7%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PR   + NF S  +C+ I  TA   +K S +    G +V  T   RTS GTFI
Sbjct: 2   IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTNGSSVLDT--IRTSYGTFI 59

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D   ++E +  ++A  T  P  + E   VLRY  GQKY +H D+          S 
Sbjct: 60  RRRHDP--VVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLI------DDSP 111

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ LLYL D E GGET FP
Sbjct: 112 RMATVLLYLHDTEYGGETAFP 132


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 78/153 (50%), Gaps = 10/153 (6%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTR 127
           S + IP       PR + F +F S E+C  +I   + RLK S +   + GE  E+    R
Sbjct: 84  SDREIPILFAIETPRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGE--ENLISAR 141

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G      E    ++  IE +IA+AT +P  HGE F VL Y+ G +Y  H+D FNP  
Sbjct: 142 TSQGAMFQVGEHP--LIARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGR 199

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G         QR+A+ ++YL+ V  GG T FP
Sbjct: 200 SGEARQLEVGGQRVATLVIYLNSVPAGGATGFP 232


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K ++K S +      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVISDEECDELIEMSKNKIKRSTIG-----SSRDVNDIRTSSGAFLEENE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/147 (40%), Positives = 77/147 (52%), Gaps = 19/147 (12%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-----RTS 129
            QVL    R   F NF + E+C  I+A AK  L+      R G    +T G+     RTS
Sbjct: 34  MQVLDAEAR--IFINFLTEEECDHIVALAKPHLE------RSGVVDTATGGSEISDIRTS 85

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH-YDAFNPAEY 188
            G F+    D T  +  IE +IAR T+LP  +GE   VL Y  G+KYD + +D  N    
Sbjct: 86  KGMFLERGHDDT--VAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYDDYFFDKVNGESN 143

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFP 215
           G     R A+ L+YL+ VEEGGET+FP
Sbjct: 144 G---GNRYATVLMYLNTVEEGGETVFP 167


>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
 gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
          Length = 289

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 79/147 (53%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F N  S E+CQ+II  A+ R+  S L ++     E     RTS G F    E  T
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARS-LTVQTTTGGEEVNADRTSDGMFFQRGE--T 158

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +++ +E +IAR    P  +GE   VL Y  G +Y  HYD F+P + G         QR+
Sbjct: 159 PVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRV 218

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL++  +GG T FP +V L+++
Sbjct: 219 ATLVIYLNNPLKGGGTTFP-DVPLEVA 244


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/171 (34%), Positives = 82/171 (47%), Gaps = 16/171 (9%)

Query: 50  EVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKP 109
           E VE  N     H   GD     IP       PR + F +F S ++C  +IA  + RLK 
Sbjct: 72  EAVENSNAV---HTADGD-----IPILFAIETPRIVLFQHFLSDQECDELIAIGRNRLKR 123

Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
           S + +      E+    RTS G      E    ++  IE +IA+A  +P  HGE F VL 
Sbjct: 124 SPV-VNPDTGEENLISARTSQGGMFQVGEHP--LIAKIEARIAQAVGVPVEHGEGFQVLN 180

Query: 170 YEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
           Y+ G +Y  H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 181 YQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP 231


>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
           anthracis str. CI]
 gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
           anthracis str. CI]
          Length = 216

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 80/147 (54%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +   N  SAE+C +II +AK +L  S L ++     E     RTSSG F +    +T
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARS-LTVQTATGGEELNADRTSSGMFFT--RGQT 165

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
             +  +E +IAR    P  +GE   VL Y  G +Y  HYD F+P E G P +     QR+
Sbjct: 166 PEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTPTILKRGGQRV 225

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL++   GG T FP +V L+++
Sbjct: 226 ATLVMYLNEPARGGGTTFP-DVGLEVA 251


>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           AH820]
 gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
 gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH820]
 gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
          Length = 216

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
 gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
          Length = 216

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVISDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
 gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
          Length = 216

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 8/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR   F +  S  +C ++IA ++ RL+ S++   +G   E    TRTS G + +  E+  
Sbjct: 126 PRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSG-EFVDDTRTSYGAYFNKGENS- 183

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG---PQMS--QRL 196
            ++  I+ +IA  T  P TH E   +L Y +G +Y  H+D F P + G   P  S  QR+
Sbjct: 184 -LVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESGGQRI 242

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ ++YL+DVE GG T+FP
Sbjct: 243 ATVVMYLNDVEAGGGTIFP 261


>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
 gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
          Length = 216

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 IYLNDVEEGGETFFP 162


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 88/168 (52%), Gaps = 9/168 (5%)

Query: 48  TLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL 107
           T EV E   +  + H    D  + S      + +P  L+     S+E+C  +I+ ++ RL
Sbjct: 67  TYEVPEIGRKGNILHTSDKDVKVLS-----RNEKPFVLHLDQVLSSEECDELISLSRSRL 121

Query: 108 KPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNV 167
           +PS L + +G   E     RTS        E++  ++E IE +IA  T  P  +GE   +
Sbjct: 122 QPS-LVVDRGSGEERAGSGRTSKSMAFRLKENE--LVERIETRIAELTGYPAENGEGLQI 178

Query: 168 LRYEIGQKYDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           L Y +G++Y  H+D F P      +  QR+ +FL+YL+DVE+GGET+F
Sbjct: 179 LNYGLGEEYKPHFDFFPPHMADASKGGQRVGTFLIYLNDVEDGGETVF 226


>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 259

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/138 (35%), Positives = 79/138 (57%), Gaps = 8/138 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           ++W+PR   + NF +  + + +I  A  ++K S +    G++VE     RTS GTF+   
Sbjct: 4   VAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDN--YRTSYGTFLKRY 61

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D+  I+E IE+++A  T +P  H E   +LRY +GQ+Y  H D     E G     R+A
Sbjct: 62  QDE--IVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDEEAG----VRVA 115

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YL++ + GGET FP
Sbjct: 116 TVLIYLNEPDGGGETAFP 133


>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
          Length = 244

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 6/153 (3%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT 128
           S +++  + LS  PR     NF SAE+C+ II TA   L PS + L+QG+     +  + 
Sbjct: 14  SSRTVEVKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTV-LKQGDQSNGEEKVKD 72

Query: 129 SSGTFISA--SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
              T  +A   + K  I+  I  ++     +P ++ E   VL+Y   Q Y  HYD F+P 
Sbjct: 73  EVRTSETAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPK 132

Query: 187 EYGPQMS---QRLASFLLYLSDVEEGGETMFPF 216
            Y  + S    RL +   YL+ VE+GGET+FPF
Sbjct: 133 MYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPF 165


>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
 gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
          Length = 216

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
 gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
          Length = 232

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/154 (34%), Positives = 83/154 (53%), Gaps = 17/154 (11%)

Query: 64  ETGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           +T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +   
Sbjct: 40  KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRD 90

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 91  VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
           F  AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP 178


>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
 gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
          Length = 248

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEISKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
          Length = 216

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
 gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
          Length = 248

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 79/137 (57%), Gaps = 6/137 (4%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSGTFISASED 139
           +P  L+   F S E+C  +I  +++RLKPS  +  + GE   +T   RTS G      E+
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAAT--GRTSKGMSFYLQEN 165

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLAS 198
           +   ++ +E +IA     P  +GE   VL Y IG++Y SH+D F  ++  P+   QR+ +
Sbjct: 166 E--FIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGT 223

Query: 199 FLLYLSDVEEGGETMFP 215
           FL+YL+DV  GGET+FP
Sbjct: 224 FLIYLNDVPAGGETVFP 240


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 82/147 (55%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + F N  S  +C++++  A+ RL  S L +      E     RTS G F +  E+  
Sbjct: 90  PDLVVFGNLLSDSECEALMEVAQPRLARS-LTVNIKTGGEERNRDRTSQGMFFARGENP- 147

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
            +++ +E +IAR    P   GE   VLRY  G +Y  HYD F+PAE G P +     QR+
Sbjct: 148 -LVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEPGTPAILQRGGQRV 206

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL++ E+GG T+FP ++ LQ++
Sbjct: 207 ATLIMYLNEPEQGGATVFP-DIGLQVT 232


>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
 gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
 gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CNEVA-9066]
 gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A1055]
 gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Western North America USA6153]
 gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Kruger B]
 gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Vollum]
 gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Australia 94]
 gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
 gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Ames]
 gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. 'Ames Ancestor']
 gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
          Length = 216

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus AH187]
 gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           Q1]
 gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus NC7401]
 gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
 gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH187]
 gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           Q1]
 gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NC7401]
 gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
          Length = 216

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
                IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
 gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
          Length = 248

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 71  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 122

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 123 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 179

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 180 MYLNDVEEGGETYFP 194


>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           B4264]
 gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           B4264]
          Length = 216

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLANVLSDEECGELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 75/133 (56%), Gaps = 4/133 (3%)

Query: 83  RALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
           R     +F + E+   I+  +++RL+ S +    G + ES    RTS G F+   ED   
Sbjct: 1   RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQ--IRTSFGVFLERGEDP-- 56

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           +++ +E +I+  T++P  +GE   VLRY+  QKYD+H+D F   +       R A+ L+Y
Sbjct: 57  VVKGVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMY 116

Query: 203 LSDVEEGGETMFP 215
           L D EEGGET+FP
Sbjct: 117 LVDTEEGGETVFP 129


>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
 gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           ATCC 10987]
          Length = 216

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
 gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
          Length = 232

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETYFP 178


>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
           Hakam]
 gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
           Hakam]
          Length = 232

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 IYLNDVEEGGETFFP 178


>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
 gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
          Length = 216

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
 gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
          Length = 232

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFLEDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETYFP 178


>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
 gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
          Length = 216

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 82/153 (53%), Gaps = 17/153 (11%)

Query: 65  TGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           T D  IQ I  F+     P  +   N  S E+C  +I  +K +++ S++      +    
Sbjct: 25  TEDREIQIISKFE----EPLIVVLANVLSDEECDGLIELSKNKIERSKIG-----SSRDV 75

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTSSG F+  +E    +   IE +I+    +P  HGE  ++L YE+ Q+Y +HYD F
Sbjct: 76  NDIRTSSGAFLEENE----LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYF 131

Query: 184 NPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
             AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 132 --AEHSRSAANNRISTLVMYLNDVEEGGETFFP 162


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/148 (36%), Positives = 81/148 (54%), Gaps = 9/148 (6%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F N  S E+C ++IA A+ RL  S L +      E     RTS G F    +  
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARS-LTVATKTGGEEINDDRTSDGMFFQRGQ-- 170

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
           + +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     QR
Sbjct: 171 SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQR 230

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
           + + ++YL+  E+GG T FP +V L+++
Sbjct: 231 VGTLVMYLNTPEKGGGTTFP-DVHLEVA 257


>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
 gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
 gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Sterne]
 gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
          Length = 232

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/154 (34%), Positives = 83/154 (53%), Gaps = 17/154 (11%)

Query: 64  ETGDASIQSI-PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           +T D  IQ I  F+     P  +   N  S E+C  +I  +K +L  S++      +   
Sbjct: 40  KTEDREIQIISKFE----EPLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRD 90

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTSSG F+  +E    +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD 
Sbjct: 91  VNDIRTSSGAFLDDNE----LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDY 146

Query: 183 FNPAEYG-PQMSQRLASFLLYLSDVEEGGETMFP 215
           F  AE+     + R+++ ++YL+DVEEGGET FP
Sbjct: 147 F--AEHSRSAANNRISTLVMYLNDVEEGGETFFP 178


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+   E   
Sbjct: 39  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDDE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 80/149 (53%), Gaps = 9/149 (6%)

Query: 80  WRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
           + PR + F +  S ++C+ +I  AK RL  S L +      E     RTSSG F    E+
Sbjct: 97  YNPRVVVFGSLLSDQECEQLIGLAKPRLARS-LTVATKTGGEEVNEDRTSSGMFFQRGEN 155

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQ 194
           +  ++  IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G      +  Q
Sbjct: 156 E--LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQ 213

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+ + ++YL + E+GG T FP +V L+++
Sbjct: 214 RVGTLVMYLGEPEKGGGTTFP-DVHLEVA 241


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 81/153 (52%), Gaps = 15/153 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+   +F SA +C+ +I+ A+ RL  S +     G  V    G R+S G F    E 
Sbjct: 101 RPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
            T ++  +E +IA  T LP  +GE   +L YE+G +   H D   A NPA  E   +  Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLII 227
           R+ + L+YL+DVE GGETMFP     Q  W ++
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP-----QTGWSVV 244


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 54/139 (38%), Positives = 76/139 (54%), Gaps = 16/139 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +S +PR   + +F S ++   +I+ A+  LK S +A       ++  G  T        S
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVA-------DNMSGKST-------LS 99

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   I+E IE KIA  T LP+ +GE   VLRY+ G+KY+ HYD F       +   R A
Sbjct: 100 EDP--IVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYA 157

Query: 198 SFLLYLSDVEEGGETMFPF 216
           + LLYL+DV EGGET+FP 
Sbjct: 158 TVLLYLTDVPEGGETVFPL 176


>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
 gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
          Length = 232

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +       RTSSG F+    +  
Sbjct: 55  PLIVVLANVLSDEECDELIEMSKNKMERSKIG-----SSRDVNDIRTSSGAFL----EDN 105

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 106 KLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETYFP 178


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLEDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
 gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
          Length = 298

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 75/153 (49%), Gaps = 22/153 (14%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +SW PR   + NF +  +C+ I  TA   +K S +  + G +V  T   RTS GTFI
Sbjct: 2   IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQNGSSV--TDNIRTSYGTFI 59

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFN------------VLRYEIGQKYDSHYDA 182
               D   ++E I  ++A  T  P  + E               VLRY IGQKY +H D+
Sbjct: 60  RRRHDP--VIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDS 117

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                     S R+A+ LLYL D EEGGET FP
Sbjct: 118 LI------DDSPRMATVLLYLHDTEEGGETAFP 144


>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
 gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
          Length = 216

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECGELIELSKNKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
 gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
          Length = 299

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 77/146 (52%), Gaps = 9/146 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F N  S E+C +IIA A  R++ S L +      E+    RTS+G F    E+  
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRS-LTVDNQSGGEAVNDDRTSNGMFFQRGEND- 169

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
            ++  +E +IAR    P  +GE   VL Y  G +Y  HYD F P E G P +     QR+
Sbjct: 170 -LICRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRV 228

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQI 222
            + ++YL++   GG T FP +V LQ+
Sbjct: 229 GTLVMYLNEPARGGATTFP-DVGLQV 253


>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 279

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 57/166 (34%), Positives = 86/166 (51%), Gaps = 14/166 (8%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVES 122
           + GD  +Q +  Q +   PR + F N  S E+C+ +IA A+ RL  S  +  R G  V +
Sbjct: 77  DAGDRRVQVL--QTMR-HPRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLN 133

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS G F    E+   I+  +E +IA     P   GE   +LRY  G +Y  HYD 
Sbjct: 134 VD--RTSEGMFFERGEND--IVARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDY 189

Query: 183 FNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           F+P E G      +  QR+A+ ++YL +  +GG T FP +V L+++
Sbjct: 190 FDPGEPGTPTILKRGGQRVATLVMYLQEPGQGGATTFP-DVGLEVA 234


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 79/134 (58%), Gaps = 8/134 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K R++ S++A     ++E  +  RTSS TF    E++ 
Sbjct: 38  PLIVILGNVLSDEECDQLIQQSKDRMQRSKVA----NSLEVDE-LRTSSSTFFHEGENE- 91

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  IE +I++   +P  HGE   +L Y+IGQ+Y +H+D F+        + R+++ ++
Sbjct: 92  -IVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFFSSTSRAAS-NPRISTLVM 149

Query: 202 YLSDVEEGGETMFP 215
           YL+DVE+GGET FP
Sbjct: 150 YLNDVEQGGETYFP 163


>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 169

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/110 (39%), Positives = 71/110 (64%), Gaps = 3/110 (2%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKS--DVRTSSGMFV 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           ++ E K+ +++ IE +I+  + +P+ +GE   VLRYE  Q Y  H+D F+
Sbjct: 116 NSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFS 165


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 58/153 (37%), Positives = 80/153 (52%), Gaps = 15/153 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+   +F SA +C+ +IA A+ RL  S +     G  V    G R+S G F    E 
Sbjct: 101 RPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNV--VAGHRSSDGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
            T ++  +E +IA  T LP  +GE   +L YE G +   H D   A NPA  E   +  Q
Sbjct: 158 -TPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLII 227
           R+ + L+YL+DVE GGETMFP     Q  W ++
Sbjct: 217 RVGTLLMYLNDVEGGGETMFP-----QTGWSVV 244


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +++ S++      +  +    RTSSG F+  +E   
Sbjct: 77  PLIVVLANVLSDEECDELIELSKNKMERSKIG-----SSRNVNDIRTSSGAFLEENE--- 128

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
                IE +I+  T +P  HGE  ++L Y + Q+Y +HYD F  AE+     + R+++ +
Sbjct: 129 -FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYF--AEHSRSAANNRISTLV 185

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 186 MYLNDVEEGGETFFP 200


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score = 86.7 bits (213), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 81/148 (54%), Gaps = 9/148 (6%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F N  S E+C ++IA A+ R+  S L +      E     RTS G F    +  
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARS-LTVATKTGGEEINADRTSDGMFFQRGQ-- 171

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQR 195
           + +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     QR
Sbjct: 172 SPLIQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQR 231

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
           + + ++YL+  ++GG T FP +V L+++
Sbjct: 232 VGTLVMYLNTPDKGGGTTFP-DVHLEVA 258


>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
 gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
          Length = 216

 Score = 86.7 bits (213), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTSSG F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSSGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
            YL+DVEEGGET FP
Sbjct: 148 XYLNDVEEGGETFFP 162


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score = 86.7 bits (213), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 89/177 (50%), Gaps = 14/177 (7%)

Query: 52  VEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ 111
           V ++N++ L   + GD  ++ +   +L   PR +   +F S  +C ++IA A+ RL  S+
Sbjct: 71  VLQDNDASL--LDLGDRQVRVLVSLLL---PRVVVLGDFLSDAECDALIALAQPRLARSR 125

Query: 112 LALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
             +      +     RTS    +   +D   + + IE +IAR    P  HGE   VLRY 
Sbjct: 126 -TVDNDNGAQIVHAARTSDSMCLQLGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYA 182

Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            G +Y  HYD F+P   G  +      QRLAS ++YL+  E GG T FP +V L ++
Sbjct: 183 TGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERGGATRFP-DVHLDVA 238


>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
          Length = 458

 Score = 86.7 bits (213), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 82/143 (57%), Gaps = 8/143 (5%)

Query: 75  FQVLSW-RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGT 132
            Q++S   PRA  +  F + E+C  +I  +K R+  S+  +   ET  + K   RTS+G+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRM--SKSGVVDAETGGTAKSDIRTSTGS 234

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F+    +   +++ +E ++A  +MLP  H EA  VLRYE+ Q+Y +HYD F     G   
Sbjct: 235 FVGIGAND--LMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYF--FHKGGMA 290

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
           + R+ + L+YL + E GGET+FP
Sbjct: 291 NNRIVTILMYLHEPEFGGETVFP 313


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 55  PLIVVLGNVLSDEECDELIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 106

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 107 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 163

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 164 MYLNDVEEGGETFFP 178


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDKLIELSKNKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETYFP 162


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 75/134 (55%), Gaps = 9/134 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I+ +K R++ S+++ +      S    RTSS  F   +E+  
Sbjct: 44  PLIVLLGNVLSEEECDQLISLSKDRIERSKISNK------SVHDLRTSSSMFFDDAEND- 96

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  +E ++++   +P  HGE   +L Y IGQ+Y +HYD F+        + R+++ ++
Sbjct: 97  -VVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYFSSGNSKVN-NPRISTLVM 154

Query: 202 YLSDVEEGGETMFP 215
           YL+DVE GGET FP
Sbjct: 155 YLNDVEAGGETYFP 168


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score = 86.3 bits (212), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 57/166 (34%), Positives = 81/166 (48%), Gaps = 14/166 (8%)

Query: 55  ENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL 114
           EN + + H   GD     IP       PR + F +F S  +C  +IA  + RLK S + +
Sbjct: 75  ENSNAV-HTADGD-----IPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV-V 127

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
                 E+    RTS G      E    ++  IE +IA+A  +P  HGE F VL Y+ G 
Sbjct: 128 NPDTGEENLISARTSQGGMFQVGEHP--LIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGG 185

Query: 175 KYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
           +Y  H+D FNP   G         QR+A+ ++YL+ V+ GG T FP
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP 231


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score = 86.3 bits (212), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 52  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 103

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 104 -LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 160

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 161 MYLNDVEEGGETFFP 175


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E   
Sbjct: 39  PLIVVLGNVLSDEECDELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNE--- 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            +   IE +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 91  -LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
 gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
          Length = 216

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 12/135 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K +L  S++      +       RTS G F+  +E  T
Sbjct: 39  PLIVVLGNVLSDEECGELIELSKSKLARSKVG-----SSRDVNDIRTSKGAFLDDNELTT 93

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQMSQRLASFL 200
            I    E +I+    +P +HGE  ++L YE+ Q+Y +HYD F  AE+     + R+++ +
Sbjct: 94  KI----EKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYF--AEHSRSAANNRISTLV 147

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 148 MYLNDVEEGGETFFP 162


>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
 gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
          Length = 318

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 14/164 (8%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD S++ +   +    PR + F N  S E+C+++IA A  R+  S L +      E  
Sbjct: 116 DCGDRSVEVL---LTMAHPRVVVFGNLLSPEECEALIAAAAPRMARS-LTVATQTGGEEV 171

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F    E    +++ IE +IA     P  +GE   VL Y  G +Y  HYD F
Sbjct: 172 NDDRTSHGMFFQRGESP--LVQRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYF 229

Query: 184 NPAEYG-PQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           +PAE G P +     QR+ + ++YL+  E+GG T FP   D QI
Sbjct: 230 DPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFP---DAQI 270


>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Ectocarpus siliculosus]
          Length = 404

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 5/153 (3%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
           G  ++ SI  + LS  P      NF   E+C+ I   A   +KPS ++L   +  +    
Sbjct: 184 GLETLGSIDMKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKPDTN 243

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTS+  F+ ++ D   +L+ I+ ++   T +P++H E   VL+Y+ GQ+Y +H+D  + 
Sbjct: 244 WRTSTTYFMPSTRDP--LLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHDFLDE 301

Query: 186 ---AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                       R+ +   YLSDVEEGGET+FP
Sbjct: 302 RTMRNMDGGRKNRMITVFWYLSDVEEGGETIFP 334


>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 286

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 72/145 (49%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS G  + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDGMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/149 (35%), Positives = 82/149 (55%), Gaps = 11/149 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASED 139
           +PR + F N  S E+C ++IA A  R+  S  +A + G   E     RTS G F    + 
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGG--EEINDDRTSDGMFFQRGQ- 171

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQ 194
            + +++ IE +IAR    P  +GE   VL Y  G +Y  HYD F+PAE G P +     Q
Sbjct: 172 -SPLIQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQ 230

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           R+ + ++YL+  E+GG T FP +V ++++
Sbjct: 231 RVGTLVMYLNTPEKGGGTTFP-DVHVEVA 258


>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
           campestris pv. vesicatoria str. 85-10]
          Length = 296

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 72/145 (49%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S E+C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 159 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 216

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFP 241


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 73/135 (54%), Gaps = 10/135 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S  +C  +I  +++RL+ S++    GE   S    RTSSG F   +E  T
Sbjct: 36  PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-RSVNSIRTSSGVFCEQTETIT 90

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
                IE +I++   +P  HG+   VLRY  GQ+Y  HYD F         + R+++ ++
Sbjct: 91  ----RIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 145

Query: 202 YLSDVEEGGETMFPF 216
           YL+DVE+GGET+FP 
Sbjct: 146 YLNDVEQGGETVFPL 160


>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
 gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
          Length = 311

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 73/139 (52%), Gaps = 10/139 (7%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           V+SW+PRA    NF +  +C  I   A+  ++ S +    G +V      RTS GTFI+ 
Sbjct: 3   VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
            +  T ++  +E ++A  T  P  + E   VLRY +GQ Y  H D+          S R+
Sbjct: 61  YQ--TPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTDSLE------NDSPRM 112

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ LLYLS+ E GGET FP
Sbjct: 113 ATVLLYLSEPELGGETAFP 131


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 92/185 (49%), Gaps = 20/185 (10%)

Query: 53  EKENESGLPH---------GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
           E  N++ LP+         G   D   +S+   +   RP  +    F S E+C+ +I  +
Sbjct: 58  EPSNKAKLPYQYETSLVAAGNNIDLFDRSVKVSLRVSRPDIVVVDEFMSGEECEQLIEQS 117

Query: 104 KKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGE 163
           +++L PS +   Q    +     R+S GT+    E  + ++  ++ +I+     P+ HGE
Sbjct: 118 RRKLTPSAIVDPQTGKFQVI-ADRSSEGTYFQRGE--SPLISRLDRRISELMNWPEDHGE 174

Query: 164 AFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEV 218
              +L Y +G +Y  H+D F   E G      Q  QR+A+ ++YL++V EGGET+FP   
Sbjct: 175 GIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATLVMYLNEVTEGGETVFP--- 231

Query: 219 DLQIS 223
           D+ IS
Sbjct: 232 DVGIS 236


>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 281

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/123 (38%), Positives = 72/123 (58%), Gaps = 4/123 (3%)

Query: 93  AEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
           AE+   I+  +++RL+  +  +  G+    T   RTS G F+   ED+  I++ +E +IA
Sbjct: 7   AEEADHIVKVSERRLE--RSGVVGGDGGSETSNIRTSYGVFLDRGEDE--IVKRVEERIA 62

Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGET 212
             T++P  +GE   VLRY+  QKYD+H+D F   +       R A+ L+YL D EEGGET
Sbjct: 63  AWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGET 122

Query: 213 MFP 215
           +FP
Sbjct: 123 VFP 125


>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
 gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
          Length = 201

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 72/131 (54%), Gaps = 3/131 (2%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           L F    S ++C  +I  A  RL+ S +   +    + ++  RTS G F+    D   I+
Sbjct: 1   LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSR-NRTSWGAFLRRDHDN--IV 57

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
             IE +I+  T +P+ +GE+  V+RY+ GQK++ H D +   E       R+ + LLYL+
Sbjct: 58  SGIEDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLT 117

Query: 205 DVEEGGETMFP 215
           +VE GGET+FP
Sbjct: 118 NVENGGETVFP 128


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score = 84.0 bits (206), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 73/135 (54%), Gaps = 10/135 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S  +C  +I  +++RL+ S++    GE   S    RTSSG F   +E  T
Sbjct: 41  PLVVVLGNVLSDSECDELIEHSRERLQRSKI----GED-GSVNSIRTSSGVFCEQTETIT 95

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
                IE +I++   +P  HG+   VLRY  GQ+Y  HYD F         + R+++ ++
Sbjct: 96  ----RIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFAETSRAS-TNNRISTLVM 150

Query: 202 YLSDVEEGGETMFPF 216
           YL+DVE+GGET+FP 
Sbjct: 151 YLNDVEQGGETVFPL 165


>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 251

 Score = 84.0 bits (206), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 87/162 (53%), Gaps = 18/162 (11%)

Query: 68  ASIQSIPFQV-LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           +S  +IP  + +S +PR    P F + E+C+ +I T+K +LKP           E + G 
Sbjct: 49  SSTDNIPKLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCN---------EISSGV 99

Query: 127 -RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            R+  G F+   E+   + + I +++     L ++  E   V+RY  G++  +H+D FNP
Sbjct: 100 HRSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLTES-SEVMQVIRYNPGEETSAHFDYFNP 158

Query: 186 AEYGPQMS-----QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
                 M      QR+ + L+YL+DVEEGGET FP EV++++
Sbjct: 159 LTTNGAMKIGLYGQRICTILMYLADVEEGGETSFP-EVNVKV 199


>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 317

 Score = 84.0 bits (206), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 48/140 (34%), Positives = 76/140 (54%), Gaps = 8/140 (5%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + LSW PR     NF S E+C+ +I   +K+L+ S +         ST   RTS GTF++
Sbjct: 37  ETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNSDESGAVST--ARTSFGTFVT 94

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
               +T  L+ +E ++A+ + +P  H E   +LRY  GQ+Y +H+D       G    +R
Sbjct: 95  RRLTET--LQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISENGG----KR 148

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L++L +   GGET FP
Sbjct: 149 IATVLMFLREPTSGGETSFP 168


>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 254

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 21/149 (14%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
           LSW PRA    +  +  QC++++   + R+       R+   V+S  G       RTS  
Sbjct: 6   LSWYPRAFALRDALTEAQCEAVLRATRARV-------RRSTVVDSVTGESKVDPIRTSKQ 58

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH-----YDAFNPA 186
           TF++  E+   ++  I   ++  TMLP TH E   VL Y +G+KYD+H      D+ +  
Sbjct: 59  TFLNRDEE---VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGR 115

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           E      +R+A+ LLYL + E GGET FP
Sbjct: 116 ELSKDGGKRVATVLLYLEEPEAGGETAFP 144


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 75/135 (55%), Gaps = 11/135 (8%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  S E+C+ +I  +K ++  S++  +           RTSS TF+   +    +   IE
Sbjct: 41  NVVSEEECEELIFLSKNKMNRSKIGSQH-----EVSDIRTSSSTFLPEDD----LTNRIE 91

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEE 208
            ++A+   +P  HGE  ++L Y+ GQ+Y +HYD F         + R+++ +LYL+DVEE
Sbjct: 92  KRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLNDVEE 150

Query: 209 GGETMFPFEVDLQIS 223
           GGET FP  ++L IS
Sbjct: 151 GGETYFP-HMNLSIS 164


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 77/147 (52%), Gaps = 12/147 (8%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P  + F    + ++C+ +I  A  RL+ S+L  +           RTS G F   
Sbjct: 25  VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNK------VVSEIRTSRGMFFE- 77

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QR 195
            E++   +  IE +I+    +P  H E   VL Y  GQ+Y +HYD F P    P  S  R
Sbjct: 78  -EEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN--SPSASNNR 134

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
           +++ ++YL+DVE GGET+FP  +DL++
Sbjct: 135 ISTLIIYLNDVEAGGETVFPL-LDLEV 160


>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 216

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 26  PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHLVHAARTSDSMCLR 78

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 79  VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQ 136

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 137 AGGQRVASLVMYLNTPERGGATRFP 161


>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
 gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
          Length = 220

 Score = 83.2 bits (204), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 78/151 (51%), Gaps = 17/151 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
           LS +PR    P F + E+C  +I T+K +L+P           E + G  R+  G F+  
Sbjct: 28  LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            E++  + + I +K+     +  +  E   ++RY  G++  +HYD FNP      M    
Sbjct: 79  GEEEHPVTKNIFNKMKNFVNISDS-CEVMQIIRYNPGEETSAHYDYFNPLTTNGSMKIGL 137

Query: 194 --QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
             QR+ + L+YL DVEEGGET FP EV +++
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFP-EVGIKV 167


>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 259

 Score = 83.2 bits (204), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 55/156 (35%), Positives = 81/156 (51%), Gaps = 27/156 (17%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSG 131
           +SW PRA +  N  +  +C  ++  A+ R       +R+   V+ST G       RTS  
Sbjct: 4   ISWHPRAFHLHNIMTDAECDEVLELARTR-------VRRSTVVDSTTGESKVDPIRTSEQ 56

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFN-----VLRYEIGQKYDSHYDAFN-P 185
            F++       I+ +IE ++ R TMLP  +GE        VL+Y  GQKYD+H+D     
Sbjct: 57  CFLNRGH--FPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELD 114

Query: 186 AEYGPQMS----QRLASFLLYLSDVEE--GGETMFP 215
              G Q++     R+A+ LLYLSDV++  GGET FP
Sbjct: 115 TASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFP 150


>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
 gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
          Length = 286

 Score = 83.2 bits (204), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S E+C ++IA A+  L  S+       TV++  G       RTS    + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALARPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 53/152 (34%), Positives = 75/152 (49%), Gaps = 21/152 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR + F    S E+C  ++A A+ RL  S       ETV+++ G       RTS G F  
Sbjct: 92  PRVVVFGGLLSDEECDELVALARPRLARS-------ETVDNSTGGSEVNAARTSDGMFFE 144

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----- 190
             E    ++E IE +IA     P   GE   VLRY  G +Y  H+D F+PA  G      
Sbjct: 145 RGEKP--LIERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILR 202

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           +  QR+ + ++YL+    GG T FP EV L++
Sbjct: 203 RGGQRVGTVVMYLNTPAGGGATTFP-EVGLEV 233


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score = 82.8 bits (203), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 57/153 (37%), Positives = 80/153 (52%), Gaps = 15/153 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A++  NF SA++C+ +IA A+ RL  S +     G  V +T   R+S G F    E 
Sbjct: 101 RPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATH--RSSHGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
            T ++  IE +IA  T  P  +GE   +L YE G +   H D     N A  E   +  Q
Sbjct: 158 -TPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLII 227
           R+ + L+YL DVE GGET+FP     Q+ W I+
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP-----QVGWSIV 244


>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
          Length = 327

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 76/142 (53%), Gaps = 10/142 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVESTKGTRTSSGTFISASEDK 140
           PR  +FP F S E+C  +  TA+  L+PS  L    G  +      RTS G  I  + + 
Sbjct: 140 PRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHP--IRTSDGGAIGPTNEN 197

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             ++  I  +IA AT      GE+  VLRY  GQ+Y  H D    AE     +QR+A+F+
Sbjct: 198 L-VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAE-----NQRIATFI 251

Query: 201 LYLSDVEEGGETMFPFEVDLQI 222
           +YL+D  EGGET FP  +++Q+
Sbjct: 252 VYLNDGFEGGETHFPL-LNIQV 272


>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
          Length = 277

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 84/156 (53%), Gaps = 22/156 (14%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSSGTFISA 136
           +SWRPRA  +  F S  +C  +I+ AK+  K  +  +  GE+ ES T   RTSSG F+  
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQG-KMEKSTVVDGESGESVTSKVRTSSGMFLDK 103

Query: 137 SEDKTGILELIEHKIARATMLP-----------------QTHGEAFNVLRYEIGQKYDSH 179
            +D+  ++  IE +IA  TMLP                   +GE+  +LRY  G+KY+ H
Sbjct: 104 KQDE--VVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPH 161

Query: 180 YDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           +D  +  +   +   R+A+ L+YLS+V+  G+++ P
Sbjct: 162 FDYISGRQGSTREGDRVATVLMYLSNVKM-GDSLLP 196


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 11/157 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +    PR + F N  +AE+C ++IA A++++K S +        +  
Sbjct: 81  QAGDRQVQVLASLL---HPRVIVFGNLLAAEECDALIALARRQIKRSPV-FDPDTGQDQQ 136

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS G F     +   +   +E +IA     P  +GE   VLRY  G +Y+ HYD F
Sbjct: 137 HQARTSEGMFFGRGANP--LCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYF 194

Query: 184 NPAEYGPQMS-----QRLASFLLYLSDVEEGGETMFP 215
           +PA  G +++     QR+AS ++YL+   +GG T FP
Sbjct: 195 DPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFP 231


>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 286

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 46/142 (32%), Positives = 79/142 (55%), Gaps = 11/142 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+     N  S E+C+++I  +K ++  S++      +       RTSS  F+   E   
Sbjct: 34  PKIAILGNVVSEEECEALIRLSKDKVNRSKIG-----SDHDVSDIRTSSSAFLPDDE--- 85

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            +   IE ++A+   +P  HGE  ++L Y+ GQ+Y +H+D F       + + R+++ +L
Sbjct: 86  -LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVL 143

Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
           YL+DVEEGGET FP E++L +S
Sbjct: 144 YLNDVEEGGETYFP-EMNLTVS 164


>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 286

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S E+C ++IA A+  L  S+       TV++  G       RTS    + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALAQPHLARSR-------TVDNANGEHVVHAARTSDSMCLR 148

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 149 LGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQ 206

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 207 AGGQRVASLVMYLNTPERGGATRFP 231


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 75/134 (55%), Gaps = 9/134 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  A  ++K S++    G T E  +  RTSS  FI   +D+ 
Sbjct: 33  PLIVVLGNVLSDEECDELIQLAGDKVKRSKI----GTTREENE-LRTSSSMFIE--DDEN 85

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I+  ++ +I+    +P  HGE   +LRY  GQ+Y +H+D F+        + R+++ ++
Sbjct: 86  LIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSD--SKITNNRISTLVM 143

Query: 202 YLSDVEEGGETMFP 215
           YL+DVE+GGET FP
Sbjct: 144 YLNDVEQGGETFFP 157


>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 252

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/140 (37%), Positives = 71/140 (50%), Gaps = 10/140 (7%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           V+SW PRA    NF + ++   I   A+  ++ S +    G +V      RTS GTFI+ 
Sbjct: 3   VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVVADNGSSV--LDDYRTSYGTFINR 60

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
               T ++  +E ++A  T +P  + E   VLRY  GQ Y  H D+          S RL
Sbjct: 61  Y--ATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTDSLE------NDSPRL 112

Query: 197 ASFLLYLSDVEEGGETMFPF 216
           A+ LLYLSD E GGET FP 
Sbjct: 113 ATVLLYLSDPELGGETAFPL 132


>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 306

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSR-------TVDNANGEHMVHAARTSDSMCLR 168

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 169 VGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQ 226

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 227 AGGQRVASLVMYLNTPERGGATRFP 251


>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
          Length = 328

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/99 (44%), Positives = 63/99 (63%), Gaps = 4/99 (4%)

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           GE+ +S    RTSSG F++  +D   I+  +E K+A  T LP+ +GEA  +L YE GQKY
Sbjct: 9   GESEDSE--VRTSSGMFLTKRQDD--IVANVEAKLAAWTFLPEENGEALQILHYENGQKY 64

Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           D H+D F   +       R+A+ L+YLS+V +GGET+FP
Sbjct: 65  DPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFP 103


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 77/153 (50%), Gaps = 8/153 (5%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
           A  + +  +V   RP+ + F +  S ++C  +I  A+ RLK S   +      E     R
Sbjct: 102 AHDRDVAVRVRFERPQVIVFDDVLSRDECDELIERARHRLKRS-TTVNPESGREDVIQLR 160

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           TS G +    ED    +E ++ +I+     P  HGE   +L Y  G +Y  H+D F P++
Sbjct: 161 TSEGFWFQRCED--AFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQ 218

Query: 188 YGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G  +      QR+A+ ++YLSDV  GGET+FP
Sbjct: 219 SGSVLHTSRGGQRVATLIVYLSDVAGGGETVFP 251


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 71/134 (52%), Gaps = 10/134 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  L   N  S  +C  +I  A  R++ +++      +       RTSS  F   SE++ 
Sbjct: 32  PLILILDNVLSWAECDLLIDLASARMQRAKIG-----SSHDVSEVRTSSSMFFEESENEC 86

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
             +  +E ++A    +P +H E   VLRY+ G++Y  H+D F     G  M+ R+++ ++
Sbjct: 87  --IGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQ---GSSMNNRISTLVM 141

Query: 202 YLSDVEEGGETMFP 215
           YL+DVEEGGET FP
Sbjct: 142 YLNDVEEGGETYFP 155


>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
 gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
          Length = 221

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 17/151 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
           LS  PR    P F + E+C+ +I T+K +L+P           E + G  R+  G F+  
Sbjct: 28  LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCN---------EISSGVHRSGWGLFMKE 78

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            E+   I + I +K+     + ++  E   V+RY  G++  SH+D FNP      M    
Sbjct: 79  GEEDHQITKNIFNKMKSFVNISES-CEVMQVIRYNQGEETSSHFDYFNPLTTNGSMKIGL 137

Query: 194 --QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
             QR+ + L+YL DVEEGGET FP EV +++
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFP-EVGIKV 167


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 78/163 (47%), Gaps = 13/163 (7%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           + LP G+  D     +P       PR   F      ++C+++IA ++ RL  S + +   
Sbjct: 108 NALPAGDGRD-----VPVLFAIESPRIALFQRLLMPDECEALIALSRGRLARSPV-VNPD 161

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
              E+    RTS G      E    ++E +E +IA  T +P  HGE   +L Y+ G +Y 
Sbjct: 162 TGDENLIDARTSMGAMFQVGEHP--LIERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQ 219

Query: 178 SHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            HYD FNP   G         QR+A+ ++YL+DV  GG T FP
Sbjct: 220 PHYDFFNPQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP 262


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 21/153 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           PR + F  F S ++C +++A A+ RL  S       ETV++  G       RTS G F  
Sbjct: 100 PRVVVFGGFLSHDECDALVALAQPRLARS-------ETVDNDTGGSEVNEARTSQGMFFM 152

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM-- 192
             E +  ++  IE +IA     P  +GE   VL Y  G +Y  HYD F+PA+ G P +  
Sbjct: 153 RGEGE--LISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYFDPAQPGTPTILK 210

Query: 193 --SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
              QR+ + ++YL+  E GG T FP +V+L+++
Sbjct: 211 RGGQRVGTLVMYLNTPERGGGTTFP-DVNLEVA 242


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 72/135 (53%), Gaps = 10/135 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C  +I  +K+RL+ S++    GE   S    RTSSG F   +E   
Sbjct: 36  PLIVILGNVLSNEECDELIEHSKERLQRSKI----GEE-RSVNQIRTSSGVFCEENE--- 87

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
             +  IE +I++   +P  HG+   VL Y  GQ+Y  H+D F         + R+++ ++
Sbjct: 88  -TVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADTSRA-SANNRISTLVM 145

Query: 202 YLSDVEEGGETMFPF 216
           YL+DVEEGGET FP 
Sbjct: 146 YLNDVEEGGETTFPM 160


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 76/148 (51%), Gaps = 9/148 (6%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S E+C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 117 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEDVIRNRTSEGIWYQRGEDA 175

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
              +E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 176 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQR 233

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
           +A+ ++YL+DV +GGET+FP E  L ++
Sbjct: 234 VATLVIYLNDVPDGGETIFP-EAGLSVA 260


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 74/146 (50%), Gaps = 20/146 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFI 134
           RP+ + F +  SA +C  +I  ++ RLK S        TV    G       RTS G + 
Sbjct: 96  RPQLVVFADVLSAAECAELIERSRHRLKRST-------TVNPLTGREDVIRNRTSEGVWY 148

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP---- 190
              ED+  ++  +E +IA  T  P  +GE   VL Y    +Y  H+D F P + G     
Sbjct: 149 RRGEDQ--LIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHT 206

Query: 191 -QMSQRLASFLLYLSDVEEGGETMFP 215
            Q  QR+A+ ++YL+DV +GGET+FP
Sbjct: 207 TQGGQRVATLIIYLNDVADGGETVFP 232


>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 196

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 72/127 (56%), Gaps = 17/127 (13%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
             + E+C+ +I  AK  +  S +    G++V+++   RTSSGTFI+   DK  IL  IE 
Sbjct: 12  ITTKEECEHLINIAKPSMHKSTVDDETGKSVDNS--ARTSSGTFINRGHDK--ILRNIEQ 67

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
           +IA  T +P  +GE+ N+L YE+GQKY+ H D F       +++ +           E+G
Sbjct: 68  RIADFTFIPVENGESVNILHYEVGQKYEPHPDFFT-----DEINTKNGG--------EQG 114

Query: 210 GETMFPF 216
           GET+FPF
Sbjct: 115 GETVFPF 121


>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
 gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
          Length = 241

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/90 (43%), Positives = 57/90 (63%), Gaps = 2/90 (2%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+   +D+  ++  IE +IA  T LP  +GE+  +L Y+ G+KY+ HYD F+ 
Sbjct: 15  VRTSSGMFLEKKQDE--VVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHD 72

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                    R+A+ L+YLSDV +GGET+FP
Sbjct: 73  KNNQALGGHRIATVLMYLSDVGKGGETIFP 102


>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
 gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
          Length = 282

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 80/162 (49%), Gaps = 12/162 (7%)

Query: 67  DASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT 126
           D     IP  V +  P         S  +C  +I  A+ RL+ +      G+  +     
Sbjct: 80  DGRTIGIPLSVDA--PALRVLDGLLSERECADLIELARPRLQRALTVDSDGK--QQIDQR 135

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTS G F  A E  T ++  IE ++A+   +P +HGE   +L Y  GQ+Y+ HYD F+PA
Sbjct: 136 RTSEGMFFRAGE--TPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPA 193

Query: 187 EYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
             G      +  QR+AS ++YL+  E GG T FP E+ L ++
Sbjct: 194 LPGYDKLTARAGQRIASVVMYLNTPERGGGTAFP-EIGLTVT 234


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S E+C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGEDA 168

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
              +E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 169 --FIERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ ++YL+DV +GGET+FP
Sbjct: 227 VATLVVYLNDVADGGETIFP 246


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 75/134 (55%), Gaps = 8/134 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C+ +I  ++ +LK S++         +    RTSS  F    E++ 
Sbjct: 39  PLIVILGNVLSDEECEGLIRMSEDKLKRSKIG-----NTRTVDDIRTSSSMFFEEGENE- 92

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  IE ++++   +P  HGE   +L Y IGQ+Y +H+D F+ +      + R+++ ++
Sbjct: 93  -LVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSR-AASNPRISTLVM 150

Query: 202 YLSDVEEGGETMFP 215
           YL+DVEEGGET FP
Sbjct: 151 YLNDVEEGGETYFP 164


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S E+C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 110 RPQVIVFANVLSPEECDEVIERSRHRLKRSTI-VDPATGQEGVIRNRTSEGIWYQRGEDA 168

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
              +E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 169 --FIERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQR 226

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ ++YL+DV +GGET+FP
Sbjct: 227 VATLVVYLNDVADGGETIFP 246


>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
 gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
          Length = 286

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 81/165 (49%), Gaps = 12/165 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      S ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 81  DLGDRQVQVLVSLML---PRVVVLGGLLSDDECDALIALARPQLARSR-TVDNRDGSEIV 136

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F
Sbjct: 137 HAARTSHSMALQPGQD--ALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 194

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            P   G  +      QR+AS ++YL+  E GG T FP +V L ++
Sbjct: 195 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP-DVHLDVA 238


>gi|90022913|ref|YP_528740.1| hypothetical protein Sde_3273 [Saccharophagus degradans 2-40]
 gi|89952513|gb|ABD82528.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
          Length = 478

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 92/196 (46%), Gaps = 26/196 (13%)

Query: 39  VPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPN--------- 89
           V + RP +   +++     +  P     + ++Q+         PR ++ PN         
Sbjct: 43  VMNFRPTVPVDQLINPFKRAASPQQTVNNQNLQTAANHGAPTDPRKIFIPNALKLNSDKL 102

Query: 90  -------FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTG 142
                  F + E+C+ IIA  + +L+PS+L+     + ES K  RTS    +   +D   
Sbjct: 103 EMYALGEFLTTEECERIIANIRSKLRPSELS-----SQESDKTYRTSRTCDLGTIDDP-- 155

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQRLASF 199
            +  ++ +I +   +  ++ E      YE+GQ++ +H D F   E   +G  M QR  + 
Sbjct: 156 FIHYVDSRICKLVGIDPSYSEVIQGQLYEVGQEFKAHTDYFEIKEMPEHGAVMGQRTYTV 215

Query: 200 LLYLSDVEEGGETMFP 215
           ++YL+DVEEGGET FP
Sbjct: 216 MIYLNDVEEGGETDFP 231


>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
          Length = 352

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 50/140 (35%), Positives = 71/140 (50%), Gaps = 9/140 (6%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + LSW PRA  + NF S E+ + ++   + R+  S +   Q   V      RTS GTFI
Sbjct: 68  IEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSDI---RTSFGTFI 124

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               D+  +LE IE + A  + +P  H E   +LRY  GQKY  H D       G    +
Sbjct: 125 PKKYDE--VLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISENGG----K 178

Query: 195 RLASFLLYLSDVEEGGETMF 214
           R+A+ L++L +  EGGET F
Sbjct: 179 RIATILMFLHEPTEGGETSF 198


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 73/147 (49%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR L   N     +C +++A A+ RL+ S + +      E+    RTS G      E   
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +L+ IE +IA  T  P  HGE F VL Y+ G +Y  H+D FNP   G         QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL+    GG T FP  + L+++
Sbjct: 218 ATMVIYLNSPASGGATAFP-RIGLEVA 243


>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 324

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 75/155 (48%), Gaps = 14/155 (9%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
            +  + LS  P       F   ++   I+A + + LKPS + L  G    +    RTS+ 
Sbjct: 105 DVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTSTT 164

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
            F+S+S  K   L+ I+ ++A  T +P  H E   VLRYE  QKYD H D F P E+   
Sbjct: 165 YFLSSS--KHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYF-PVEHHKN 221

Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFP 215
            P +          R+ +   Y+SDV +GG T+FP
Sbjct: 222 SPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFP 256


>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
 gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
          Length = 274

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 84/156 (53%), Gaps = 13/156 (8%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRL-KPSQLALRQGETVESTK 124
           GDA +  I  + LSW PRA    N     + ++I+A A+ R+ + + +    G++V    
Sbjct: 2   GDARV--IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSV--VN 57

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS  TF+S ++    ++  +  +++  T LP  H E   VL Y  G+KYD+H D   
Sbjct: 58  PIRTSKQTFLSRNDP---VVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGE 114

Query: 185 PA-EYGPQMS----QRLASFLLYLSDVEEGGETMFP 215
              + G Q+S    +R+A+ LLYL + EEGGET FP
Sbjct: 115 EGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFP 150


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 73/147 (49%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR L   N     +C +++A A+ RL+ S + +      E+    RTS G      E   
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPV-VNPDTGDENLIDARTSMGAMFQVGEH-- 157

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +L+ IE +IA  T  P  HGE F VL Y+ G +Y  H+D FNP   G         QR+
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRV 217

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL+    GG T FP  + L+++
Sbjct: 218 ATMVIYLNSPASGGATAFP-RIGLEVA 243


>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 285

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 84/170 (49%), Gaps = 13/170 (7%)

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           PH   G+ S++ +   + +  P    F    S ++C ++I  AK RL+ ++     G   
Sbjct: 77  PHAVIGERSVRVM---LAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDG--A 131

Query: 121 ESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
           +     RTS G F    E    ++E IE +IA    +P  HGE   VL Y  GQ+Y+ H 
Sbjct: 132 QQIDEHRTSDGMFFGLGEQP--LIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQ 189

Query: 181 DAFNPAEYG-----PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQISWL 225
           D F+P + G         QR+AS ++YL+  + GG T FP E+ L ++ L
Sbjct: 190 DWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFP-EIGLTVTAL 238


>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 296

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           P  +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFP 241


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 72/147 (48%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +    F S  +C ++IA A+ RL  S+  +            RTS    +   +D  
Sbjct: 96  PRVMVLGGFLSDAECDAMIALAQPRLARSR-TVDNANGAHVVHAARTSDSMCLQLGQD-- 152

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            + + IE +IAR    P  +GE   VLRY  G +Y  HYD F+P   G  +      QR+
Sbjct: 153 ALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRV 212

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           AS ++YL+  + GG T FP +V L I+
Sbjct: 213 ASLVMYLNTPDRGGATRFP-DVHLDIA 238


>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 296

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 20/145 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFIS 135
           P  +    F S  +C ++IA A+ RL  S+       TV++  G       RTS    + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSR-------TVDNANGEHVVHAARTSDSMCLR 158

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM--- 192
             +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD F+P   G  +   
Sbjct: 159 VGQD--ALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQ 216

Query: 193 --SQRLASFLLYLSDVEEGGETMFP 215
              QR+AS ++YL+  E GG T FP
Sbjct: 217 AGGQRVASLVMYLNTPERGGATRFP 241


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score = 80.1 bits (196), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 71/139 (51%), Gaps = 8/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +   N  S ++C +I A ++ R   S   +     +     +RTS    I   E  T
Sbjct: 92  PRIVVLGNVLSDDECDAIAAMSRTRFARST-TIDNASGINRFDDSRTSESAHIQRGE--T 148

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-----QMSQRL 196
            ++  I+ ++A  +  P  HGE   + +Y+ G +Y  H+D F+PA  G      +  QRL
Sbjct: 149 ELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRL 208

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ +LYL+DVEEGG T FP
Sbjct: 209 ATIILYLTDVEEGGGTSFP 227


>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
 gi|194706408|gb|ACF87288.1| unknown [Zea mays]
 gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
 gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
          Length = 217

 Score = 79.7 bits (195), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 37/83 (44%), Positives = 54/83 (65%)

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
            +S  + K  I+  IE ++A  T LP+ + E+  VLRYE GQKYD+H+D F+        
Sbjct: 10  MLSPPQPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLG 69

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
            QR+A+ L+YL+DV +GGET+FP
Sbjct: 70  GQRVATVLMYLTDVNKGGETVFP 92


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score = 79.7 bits (195), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 71/141 (50%), Gaps = 10/141 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+    F SA +C+ +I  A+ RL  S +     G  +    G R+S G F    E 
Sbjct: 101 RPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNI--VAGHRSSDGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQ 194
            T ++  IE +IA  T  P  +GE   +L YE G +   H D   P     AE   +  Q
Sbjct: 158 -TPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L+YL+DVE GGET+FP
Sbjct: 217 RVGTLLMYLNDVESGGETLFP 237


>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 363

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 76/141 (53%), Gaps = 16/141 (11%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
           +  LSW PRA  + NF + ++C+ +IA  +K+L+ S +   +G+  +     RTS GTFI
Sbjct: 91  WTTLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGD-VHSARTSFGTFI 149

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
             +   T  L  +E ++A  + +P  H E   +LRYE GQ+Y +               +
Sbjct: 150 --TRRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGNG-------------EK 194

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A+ L++L + E GGET FP
Sbjct: 195 RIATVLMFLREPEFGGETHFP 215


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 56/168 (33%), Positives = 81/168 (48%), Gaps = 24/168 (14%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG 125
           GD  +  +    L   PR + F    S ++C  ++A A+ RL      LR  ETV+++ G
Sbjct: 24  GDREVHVLATMAL---PRVVVFGGLLSEQECDELVALAQPRL------LRS-ETVDNSTG 73

Query: 126 ------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
                  RTS G F    E  T ++E IE +IA     P   GE   VL Y  G +Y  H
Sbjct: 74  GSEVNAARTSDGMFFERGE--TPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPH 131

Query: 180 YDAFNPAEYGP-----QMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           +D F+PA  G      +  QR+ + ++YL+    GG T FP EV L++
Sbjct: 132 HDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFP-EVGLEV 178


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/152 (36%), Positives = 78/152 (51%), Gaps = 15/152 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A++  +F SA++C+ +IA A+ RL  S +     G  V    G R+S G F    E 
Sbjct: 101 RPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNV--VAGHRSSHGMFFRLGE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPA--EYGPQMSQ 194
            T ++  IE +IA  T  P  +GE   +L YE G +   H D     N A  E   +  Q
Sbjct: 158 -TPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFPFEVDLQISWLI 226
           R+ + L+YL DVE GGET+FP     QI W +
Sbjct: 217 RMGTLLMYLKDVEGGGETVFP-----QIGWSV 243


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/171 (30%), Positives = 84/171 (49%), Gaps = 9/171 (5%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           +G P     D   + +P       P+   F    S ++C +++A ++ RL  S + +   
Sbjct: 80  AGAPTVYAADGGDRRVPILFRLASPQVQLFQQLLSDDECDALVALSRGRLARSPV-VNPD 138

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
              E+    RTS G     +E    ++  IE +IA  T +P  HGE   +L Y+ G +Y 
Sbjct: 139 TGDENLIDARTSMGAMFQVAEHA--LIARIEARIAAVTGVPADHGEGLQILNYKPGGEYQ 196

Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            H+D FNP   G   Q+S   QR+A+ ++YL+  E GG T FP  V L+++
Sbjct: 197 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-RVGLEVA 246


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 73/140 (52%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F N  S ++C  +I  ++ RLK S + +      E     RTS G +    ED 
Sbjct: 123 RPQVIVFGNVLSPDECAEMIERSRHRLKRSTI-VDPATGREDVIRNRTSEGIWYQRGEDA 181

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQR 195
             ++E ++ +IA     P  +GE   +L Y    +Y  H+D F P + G  +      QR
Sbjct: 182 --LIERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQR 239

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ ++YL+DV +GGET+FP
Sbjct: 240 VATLVVYLNDVPDGGETIFP 259


>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
           fasciculatum]
          Length = 244

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 78/151 (51%), Gaps = 17/151 (11%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSGTFISA 136
           +S  PR    P+F S  +C+ +I  +K +L+P           E + G  R+  G F+  
Sbjct: 29  MSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCN---------EISSGVHRSGWGLFMKE 79

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS--- 193
            E+   +++ I  ++     L + + E   V+RY  G++  +HYD FNP      M    
Sbjct: 80  GEEDHDVVKKIFQRMKMLVNLTE-NCEVMQVIRYHPGEETSAHYDYFNPLTTNGAMKIGL 138

Query: 194 --QRLASFLLYLSDVEEGGETMFPFEVDLQI 222
             QR+ + L+YLS+VEEGGET FP EV +++
Sbjct: 139 YGQRVCTILMYLSEVEEGGETSFP-EVGVKV 168


>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
 gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
          Length = 304

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 76/138 (55%), Gaps = 8/138 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           ++W+PR   + NF +  + + +I  A  ++K S +    G++VE +  T  ++G  +   
Sbjct: 4   VAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDSYRTLYTAG--VRRY 61

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           +D   ++E IE+++A  T +   H E   +LRY IGQ+Y  H D     E G     R+A
Sbjct: 62  QDD--VVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDDEAG----VRVA 115

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YL++ E GGET FP
Sbjct: 116 TVLIYLNEPEAGGETAFP 133


>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
 gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
          Length = 295

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 77/153 (50%), Gaps = 10/153 (6%)

Query: 77  VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +LS R PR + F    S E+C +++  A+ RL  S+  +  G         RTS G F  
Sbjct: 102 LLSMRNPRVMVFGGLLSDEECDAMVDLARPRLARSE-TVHNGSGGSEVNAARTSDGMFFD 160

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----P 190
             E    +   IE +IA     P  +GE   VLRY  G +Y +H+D F+PA+ G      
Sbjct: 161 RGEFP--LCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYFDPAQPGTPTILK 218

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           +  QR+ + ++YL+    GG T FP +V L+++
Sbjct: 219 RGGQRVGTVVMYLNHPIRGGGTAFP-DVGLEVA 250


>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 409

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/184 (30%), Positives = 87/184 (47%), Gaps = 12/184 (6%)

Query: 38  DVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQ 97
           +VP       T E    ++ S    G T D  +     + LS  PRA  F  F + E+C 
Sbjct: 46  EVPRADATTTTREAPRADDASASSLGPTRDIGVGDARVEKLSDSPRAYLFREFLTKEECA 105

Query: 98  SIIATAKKRLKPSQLALRQ---GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
            +I  +   LK S +       GE        RTS+G F+    D   ++  +E ++   
Sbjct: 106 HLIEISTPHLKRSTVVGDDALLGEADGRRSDYRTSTGAFLPKLYDD--VVTRVERRVEAF 163

Query: 155 TMLP---QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGE 211
           + LP   Q   +A ++LRYE+GQ+Y  H D F     G    +R+A+ L++L++ EEGGE
Sbjct: 164 SRLPFENQEQLQARSLLRYELGQEYRDHVDGFATENGG----KRVATVLMFLAEPEEGGE 219

Query: 212 TMFP 215
           T FP
Sbjct: 220 TAFP 223


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 72/140 (51%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  S ++C  +I  ++ RLK S   +      E     RTS G +    ED 
Sbjct: 102 RPQMIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
              +E ++ +I+     P  +GE   +LRY    +Y  H+D F P + G      Q  QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQR 218

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ ++YL+DV +GGET+FP
Sbjct: 219 VATLVIYLNDVPDGGETIFP 238


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 84/171 (49%), Gaps = 9/171 (5%)

Query: 58  SGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG 117
           +G P     D   + +P       P+   F    + ++C +++A ++ RL  S + +   
Sbjct: 81  AGTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPD 139

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
              E+    RTS G     +E    ++  IE +IA  T +P  HGE   +L Y+ G +Y 
Sbjct: 140 TGDENLIDARTSMGAMFQVAEHA--LIARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQ 197

Query: 178 SHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            H+D FNP   G   Q+S   QR+A+ ++YL+  E GG T FP  V L+++
Sbjct: 198 PHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-RVGLEVA 247


>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
 gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
          Length = 212

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 69/134 (51%), Gaps = 8/134 (5%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           ++F    S E+C  +IA      KPS++     +    T G R+   T  S S DK  I+
Sbjct: 15  VHFSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRS---TVASPSADKYPII 71

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFLL 201
           + +  +I+    + + + E   VL Y  G +YD HYD+F   E  PQ+     R+ + LL
Sbjct: 72  KAVRRRISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSF--LEGSPQLENGGNRMLTVLL 129

Query: 202 YLSDVEEGGETMFP 215
           YL+DVE+GG T FP
Sbjct: 130 YLNDVEQGGWTQFP 143


>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 156

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 39/83 (46%), Positives = 53/83 (63%), Gaps = 2/83 (2%)

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F+   +DK  I++ IE +IA  T +P  +GE   VL Y +G+KY+ HYD F         
Sbjct: 2   FLKRGKDK--IIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNG 59

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
            QR+A+ L+YLSDVEEGGET+FP
Sbjct: 60  GQRVATVLMYLSDVEEGGETVFP 82


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 83/170 (48%), Gaps = 9/170 (5%)

Query: 59  GLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGE 118
           G P     D   + +P       P+   F    + ++C +++A ++ RL  S + +    
Sbjct: 82  GTPAVYAADGGDRQVPILFRLASPQVQLFQQLLTDDECDALVALSRGRLARSPV-VNPDT 140

Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
             E+    RTS G     +E    ++  IE +IA  T +P  HGE   +L Y+ G +Y  
Sbjct: 141 GDENLIDARTSMGAMFQVAEHP--LITRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQP 198

Query: 179 HYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           H+D FNP   G   Q+S   QR+A+ ++YL+  E GG T FP  V L+++
Sbjct: 199 HFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-RVGLEVA 247


>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
 gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
          Length = 284

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 10/140 (7%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  +AE+C+ +IA A+ RLK +      G      +  RTS G F + +E    ++  IE
Sbjct: 102 NLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR--RTSEGMFFTLNE--LPLVGRIE 157

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
            ++A    +P +HGE   +L Y  GQ+Y+ H+D F+P + G         QR+AS ++YL
Sbjct: 158 QRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASVVMYL 217

Query: 204 SDVEEGGETMFPFEVDLQIS 223
           +   +GG T FP E+ L ++
Sbjct: 218 NTPAQGGGTAFP-ELGLTVT 236


>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 156

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/143 (34%), Positives = 80/143 (55%), Gaps = 20/143 (13%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFIS 135
           + SW+ +      + S E+C+ +I   K  L+ S+++  R G+ +E+      + G F+ 
Sbjct: 17  LCSWKIKL-----YESKEECEHLIKLGKPYLERSRISDKRTGKGIENR--FAYACGGFV- 68

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY---GPQM 192
               K  I++ IE +I     +P  +GE   V+ Y +GQK+  HYD+ +   +   GP  
Sbjct: 69  ----KDKIIKNIEQRIPDIISIPVENGEGLQVIHYGVGQKFVPHYDSRSNESFWNGGP-- 122

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A+FL+YLSDVEEGGET+FP
Sbjct: 123 --RVATFLMYLSDVEEGGETVFP 143


>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 288

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 12/165 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 83  DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            P   G  +      QR+AS ++YL+  E GG T FP +V L ++
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP-DVHLDVA 240


>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
          Length = 208

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 49/76 (64%)

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
           K  I+  IE KIA  T LP+ +GE   VLRYE G+KYD H+D F       +   R+A+ 
Sbjct: 7   KDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGHRVATV 66

Query: 200 LLYLSDVEEGGETMFP 215
           L+YL+DV +GGET+FP
Sbjct: 67  LMYLTDVSKGGETVFP 82


>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 308

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 12/165 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            P   G  +      QR+AS ++YL+  E GG T FP +V L ++
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFP-DVHLDVA 260


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score = 77.0 bits (188), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 71/140 (50%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  S ++C  +I  ++ RLK S   +      E     RTS G +    ED 
Sbjct: 102 RPQVIVFADVLSPDECAEMIERSRHRLKRS-TTVNPATGKEDVIRNRTSEGIWYQRGEDP 160

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
              +E ++ +I+     P  +GE   +L Y    +Y  H+D F P + G      Q  QR
Sbjct: 161 --FIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 218

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ ++YL+DV +GGET+FP
Sbjct: 219 VATLVIYLNDVPDGGETIFP 238


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 76/135 (56%), Gaps = 10/135 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
           P  L   N  S E+C  +I  +K +++ S++ A R+  ++      RTSSG F   SE++
Sbjct: 39  PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAAREVNSI------RTSSGMFFEESENE 92

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             ++  IE ++++       + E   VL+Y   Q+Y +H+D F  A    + + R+++ +
Sbjct: 93  --LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 150 MYLNDVEEGGETYFP 164


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 71/140 (50%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P+ +   N  S E+C +IIA    R   S +      +    +G RTS   FI   E +
Sbjct: 82  QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEG-RTSEMAFIQRGEAE 140

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
             + E IE ++A     P    E F + +Y+  Q+Y  HYD  +P   G      +  QR
Sbjct: 141 --VAERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQR 198

Query: 196 LASFLLYLSDVEEGGETMFP 215
           LA+F+LYLSDVE+GG T+FP
Sbjct: 199 LATFILYLSDVEQGGGTVFP 218


>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
 gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
          Length = 284

 Score = 76.6 bits (187), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 69/132 (52%), Gaps = 9/132 (6%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  SA +C  +IA A+ RL+ +     +G   +     RTS G F +   D+  ++  IE
Sbjct: 102 NILSARECDELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTL--DEVPLVGRIE 157

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYL 203
            ++A    +P +HGE   +L Y  GQ Y+ H+D F+P + G +       QR+AS ++YL
Sbjct: 158 RRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYL 217

Query: 204 SDVEEGGETMFP 215
           +    GG T FP
Sbjct: 218 NTPARGGGTAFP 229


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score = 76.6 bits (187), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 81/161 (50%), Gaps = 8/161 (4%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP      AS +S P    S +P      +     +C  +I   ++ ++ S + +     
Sbjct: 72  LPQARVLTASDRSAPKVFSSEQPVIALVADVLDDTECDRLIEIGREHVQRSSV-VDPDSG 130

Query: 120 VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            E T   R S G F++AS D   ++E I+ +IA     P  +GE  ++LRY +G +Y  H
Sbjct: 131 KEITIEERRSEGAFVNASTD--ALVETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPH 188

Query: 180 YDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
           YD F   + G +       QR+A+ +LYL++VE+GG+T FP
Sbjct: 189 YDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQGGDTTFP 229


>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
 gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
          Length = 263

 Score = 76.6 bits (187), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 72/144 (50%), Gaps = 19/144 (13%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + +SW PRA  +  F +  +C  +I  A  +L+ S +     + ++     RTS    I 
Sbjct: 60  ETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDDI---RTSFSASIM 116

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-MSQ 194
             E  T I+  IE +IAR T           VLRY  GQKYD+H+D F+  E      S 
Sbjct: 117 YGE--TSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGSN 163

Query: 195 RLASFLLYLSDVE--EGGETMFPF 216
           R+A+ L+YLSDV+   GGET  P 
Sbjct: 164 RMATVLMYLSDVDPAAGGETALPL 187


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score = 76.3 bits (186), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 76/135 (56%), Gaps = 10/135 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDK 140
           P  L   N  S E+C  +I  +K +++ S++ A R+  ++      RTSSG F   SE++
Sbjct: 39  PLVLVLGNVLSNEECDELIRLSKDKMQRSKIGAAREVNSI------RTSSGMFFDESENE 92

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             ++  IE ++++       + E   +L+Y   Q+Y +H+D F  A    + + R+++ +
Sbjct: 93  --LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLV 149

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DVEEGGET FP
Sbjct: 150 MYLNDVEEGGETYFP 164


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 72/134 (53%), Gaps = 8/134 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  L   N  S E+C  +I  +K +++ S++   +          RTSSG F   SE++ 
Sbjct: 39  PLVLVLGNVLSNEECDELIQLSKDKMQRSKIGAER-----EVNSIRTSSGMFFEESENE- 92

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            ++  IE ++++       + E   +L+Y   Q+Y +H+D F  A    + + R+++ ++
Sbjct: 93  -LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSASKASK-NNRISTLVM 150

Query: 202 YLSDVEEGGETMFP 215
           YL+DVEEGGET FP
Sbjct: 151 YLNDVEEGGETYFP 164


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 71/140 (50%), Gaps = 8/140 (5%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP+ + F +  S ++C  +I  ++ RLK S   +      E     RTS G +    ED 
Sbjct: 105 RPQVIVFGDVLSPDECAEMIERSRHRLKRS-TTVNPETGKEDVIRNRTSEGIWYQRGED- 162

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-----PQMSQR 195
              +E ++ +I+     P  +GE   +L Y    +Y  H+D F P + G      Q  QR
Sbjct: 163 -AFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQR 221

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ ++YL+DV +GGET+FP
Sbjct: 222 VATLVIYLNDVPDGGETIFP 241


>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
 gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
          Length = 284

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 10/140 (7%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  S ++C+ +IA A+ RL+ +     +G   +     RTS G F + +E    ++  IE
Sbjct: 102 NILSTQECEELIALARPRLQRALTVDSEGR--QQVDRRRTSEGMFFTLNE--VPLVGRIE 157

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMS---QRLASFLLYL 203
            ++A    +P +HGE   +L Y  GQ+Y+ H+D F+P +  YG   +   QR+AS ++YL
Sbjct: 158 QRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYL 217

Query: 204 SDVEEGGETMFPFEVDLQIS 223
           +    GG T FP E+ L ++
Sbjct: 218 NTPARGGGTAFP-ELGLTVT 236


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 84/169 (49%), Gaps = 28/169 (16%)

Query: 71  QSIPFQVLSWRPR------------ALYFPNFA------SAEQCQSIIATAKKRLKPSQL 112
           Q+ P +++S  PR             +  PN A      S E+C  +I  ++ ++K SQ+
Sbjct: 92  QASPIRLISQLPRFTVADREVELAAVMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQV 151

Query: 113 ALRQ-GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
             R+ G + ES+   R S G+     E++  ++  IE +++    LP   GE   +L Y 
Sbjct: 152 VDRESGGSYESS--VRKSEGSHFERGENE--LVRRIEARLSALVDLPVNRGEPLQILHYG 207

Query: 172 IGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
            G +Y +H D F P + G  +      QR+ + ++YL+DV EGGET FP
Sbjct: 208 PGGEYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFP 256


>gi|410637601|ref|ZP_11348175.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
 gi|410142794|dbj|GAC15380.1| prolyl 4-hydroxylase [Glaciecola lipolytica E3]
          Length = 280

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 74/139 (53%), Gaps = 10/139 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           R + +   NF +A++C++++A  K +L+PS++  R+G+     KG RTSS   +  ++D 
Sbjct: 84  RVQMIKIDNFLTAQECEALVALTKSKLRPSEIPEREGDQY---KGFRTSSTCDLPFTKDP 140

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQR 195
             +   I+ KI  A  L     E      Y IGQ++ +H D F P       Y     QR
Sbjct: 141 --LAHEIDQKIVDALGLGVGEKEVIQAQHYAIGQEFKAHCDYFVPGSKDFKTYSKDGGQR 198

Query: 196 LASFLLYLSDVEEGGETMF 214
             +F++YL+++ EGGET F
Sbjct: 199 TWTFMIYLNELCEGGETEF 217


>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 490

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 74/148 (50%), Gaps = 15/148 (10%)

Query: 78  LSWRPRALYFPNFASAEQCQSII-----ATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +S  P  + F NF + E+C  +I     A  ++     ++ +        +KG RTS   
Sbjct: 284 MSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSVVSKG-RTSENA 342

Query: 133 FISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--- 187
           + S  +    T   +LI  +I+  T +P  H E F +L+YE GQ Y SH+D     E   
Sbjct: 343 WCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHHDYIEHQEKRR 402

Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFP 215
            GP    R+ +F LYLSDVEEGG+T FP
Sbjct: 403 CGP----RVLTFFLYLSDVEEGGDTNFP 426


>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 288

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 12/165 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 83  DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 138

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 139 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 196

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            P   G  +      QR+AS ++YL+  E GG T  P +V L ++
Sbjct: 197 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVP-DVHLDVA 240


>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
 gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
          Length = 235

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 78/155 (50%), Gaps = 11/155 (7%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P  +L   P    +    +  +C  +I  A+  L+PS++    G + + T   RTS    
Sbjct: 39  PSNLLHDNPFIGCYEKVVTQTECHQLIDLARHGLQPSKVI---GNSEQKTSAVRTSDT-- 93

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----Y 188
           I      T +   I  +IA    LP  + E   + RY++G K+++H+D FNP+      Y
Sbjct: 94  IGFQHHLTELTLQICKRIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMY 153

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
             +  QR+ + LLYL++V  GGET FP  +++Q++
Sbjct: 154 LSENGQRIITALLYLNNVSAGGETSFPL-LNIQVA 187


>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 308

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 12/165 (7%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      + ++C ++IA A+ +L  S+  +   +  E  
Sbjct: 103 DLGDRQVQVLVSLML---PRVVVLGGLLADDECDALIALARPQLARSR-TVDNRDGSEIV 158

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
              RTS    +   +D   + + IE +IA+    P  HGE   VLRY  G +Y  HYD F
Sbjct: 159 HAARTSHSMALQPGQD--ALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYF 216

Query: 184 NPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
            P   G  +      QR+AS ++YL+  E GG T  P +V L ++
Sbjct: 217 EPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVP-DVHLDVA 260


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/163 (31%), Positives = 76/163 (46%), Gaps = 23/163 (14%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           + GD  +Q +   +L   PR +      S E+C +++  ++ RL       R+  TV++ 
Sbjct: 87  DAGDRQVQLLASLML---PRVVVLGGLLSDEECDALVELSRPRL-------RRSTTVDAQ 136

Query: 124 KG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
            G       RTS GTF         +   IE +IAR    P  +GE   VL Y  G ++ 
Sbjct: 137 TGGSQVHADRTSRGTFFERGAHP--VCATIEARIARLLEWPVENGEGLQVLHYPPGAEFR 194

Query: 178 SHYDAFNPAEYGP-----QMSQRLASFLLYLSDVEEGGETMFP 215
            HYD F+P E G      Q  QR+A+ ++YL+    GG T FP
Sbjct: 195 PHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPARGGATTFP 237


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 77/146 (52%), Gaps = 13/146 (8%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           P  +   NF +AE+C  +IA A+ +++ + +     GE V+     RTS     + +E  
Sbjct: 91  PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQD--RTSMNAAFARAEHP 148

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QR 195
             ++  +E +IA A   P  +GE   VLRY  G +Y +H+D F+    G + +     QR
Sbjct: 149 --LIARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQR 206

Query: 196 LASFLLYLSDVEEGGETMFP---FEV 218
           + +FL+YL DV+ GG T FP   FE+
Sbjct: 207 VGTFLVYLCDVDAGGATRFPALNFEI 232


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 74/141 (52%), Gaps = 10/141 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASED 139
           RP A+    F +  +C  +IA A+ RL  S +     G  V +  G R+S GTF   +E 
Sbjct: 101 RPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAA--GHRSSDGTFFRLAE- 157

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNPA--EYGPQMSQ 194
            T ++  +E +IA  T L   +GE   +LRY+ G +   H D   A N    E   +  Q
Sbjct: 158 -TPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRESIARSGQ 216

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+ + L+YL+DVE GGET+FP
Sbjct: 217 RVGTLLMYLNDVEGGGETVFP 237


>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 248

 Score = 73.2 bits (178), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 70/135 (51%), Gaps = 9/135 (6%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
           + E CQ++IA  +  L+P+ +   Q    E   G R S   +     D   IL+ +   I
Sbjct: 73  TPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDYPILQSLAEGI 129

Query: 152 ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYLSDVEE 208
           A+ T +P    E   +L Y  G +Y  HYDAF  A   P + Q   R A+ +LYL+ VEE
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQATLILYLNAVEE 187

Query: 209 GGETMFPFEVDLQIS 223
           GGET FP E+ LQ+S
Sbjct: 188 GGETAFP-ELGLQVS 201


>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
           [Cucumis sativus]
          Length = 311

 Score = 72.8 bits (177), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 7/139 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATA-KKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           +SWRPR   +  F S E+C  +I+ A      PS+ +   G TV +      SSG  ++ 
Sbjct: 59  VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE+++A  T+LP+ H   F +++Y  G++    Y   N +   P     +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ +LYLSD   GGE +FP
Sbjct: 173 ATVVLYLSDSASGGEILFP 191


>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
          Length = 311

 Score = 72.8 bits (177), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 73/139 (52%), Gaps = 7/139 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATA-KKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           +SWRPR   +  F S E+C  +I+ A      PS+ +   G TV +      SSG  ++ 
Sbjct: 59  VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTE--LLNSSGVILNT 116

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
           ++D   I+  IE+++A  T+LP+ H   F +++Y  G++    Y   N +   P     +
Sbjct: 117 TDD---IVARIENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLM 172

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ +LYLSD   GGE +FP
Sbjct: 173 ATVVLYLSDSASGGEILFP 191


>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
 gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 68/129 (52%), Gaps = 14/129 (10%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            Q +SW+PRA+ + NF S ++ + II  A +++K S +   + E V      RTS GTF+
Sbjct: 41  IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGV--VDDIRTSYGTFL 98

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
             ++D   ++  IE ++A  + +P +H E   VLRY    KY  H D            +
Sbjct: 99  RRAQDP--VIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHIDGL----------E 146

Query: 195 RLASFLLYL 203
           R+A+ L+YL
Sbjct: 147 RVATVLMYL 155


>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 294

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 12/201 (5%)

Query: 14  LPTALLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI 73
           +P A+ +      +   G T LS D P I  +L  ++   K++   LP  +        +
Sbjct: 36  VPMAVRIVDAVLRSEGEGGT-LSYDPPHITRRLPAIDTSAKQSVLHLPGADA------VV 88

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
            F+ L+  PR +   NF S+E+C  +   A+    P+ +     + V +       S   
Sbjct: 89  TFEQLA--PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQL 146

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
            +A  +   ++  +E +I R T  P    E   + RY  GQ Y  HYD F       Q  
Sbjct: 147 PAAGSE---LVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGG 203

Query: 194 QRLASFLLYLSDVEEGGETMF 214
           QRLA+ +LYL   E GG T F
Sbjct: 204 QRLATLILYLRAPEAGGATYF 224


>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
          Length = 417

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 72/155 (46%), Gaps = 14/155 (9%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
            +  + LS  P       F   ++   I+  + + LKPS + L  G    +    RTS+ 
Sbjct: 198 DVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRTSTT 257

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
            F+ +  D    ++ I+ +++  T +P  H E   VLRYE  QKYD H D F P E+   
Sbjct: 258 YFLPS--DAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYF-PVEHHKN 314

Query: 189 GPQM--------SQRLASFLLYLSDVEEGGETMFP 215
            P +          R+ +   Y+SDV +GG T+FP
Sbjct: 315 APHILESIDYGYKNRMITVFWYMSDVAKGGHTIFP 349


>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
           sativus]
          Length = 164

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/89 (42%), Positives = 54/89 (60%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           RTSSG F+S  E    +++ IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+  
Sbjct: 7   RTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDT 66

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
               +  QR+A+ L+YLS+  EGGET FP
Sbjct: 67  FNLKRGGQRIATMLMYLSENIEGGETYFP 95


>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 196

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/140 (35%), Positives = 70/140 (50%), Gaps = 9/140 (6%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +    + E CQ++IA  +  L+P+ +   Q    E   G R S   +     D   IL+ 
Sbjct: 16  WAGLLTPENCQNLIAIGQSLLRPATVTDEQ-TGQEVAHGERVSEMAW--PKRDDHPILQS 72

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ---RLASFLLYL 203
           +   IA+ T +P    E   +L Y  G +Y  HYDAF  A   P + Q   R  + +LYL
Sbjct: 73  LAEGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAF--AADAPTLRQGGNRQGTLILYL 130

Query: 204 SDVEEGGETMFPFEVDLQIS 223
           + VEEGGET FP E+ LQ+S
Sbjct: 131 NAVEEGGETAFP-ELGLQVS 149


>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 210

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 48/73 (65%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+A+ L+Y
Sbjct: 12  VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 71

Query: 203 LSDVEEGGETMFP 215
           LS+VE+GGET+FP
Sbjct: 72  LSNVEKGGETIFP 84


>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
           roenbergensis virus BV-PW1]
 gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
           roenbergensis virus BV-PW1]
          Length = 210

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 68/146 (46%), Gaps = 13/146 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
            +LS  P   Y  N  + ++C  II     +LKP   AL  G +       RT +  ++S
Sbjct: 4   HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKP---ALVSGNSRGFLSTGRTGTNCWLS 60

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
              D+  I   I  KI      P  + E F VL Y   QKY+ HYDAF P +   +    
Sbjct: 61  HKNDE--ITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAF-PIDNSEKAKRC 117

Query: 194 -----QRLASFLLYLSDVEEGGETMF 214
                QRL + L+YL++V +GGET F
Sbjct: 118 LKKGGQRLLTALIYLNNVTKGGETEF 143


>gi|397644755|gb|EJK76534.1| hypothetical protein THAOC_01697 [Thalassiosira oceanica]
          Length = 475

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 73/140 (52%), Gaps = 7/140 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRL--KPSQLALRQGE-TVESTKGT-RTSSGTFISAS 137
           P  + F NF + ++C  +I   +K    +   +   Q + + +S + T RTS   + S  
Sbjct: 273 PWVITFENFLTEDECTHMIEQGRKAEYERSEDVGEVQADGSYDSVRSTGRTSENAWCSFR 332

Query: 138 ED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           +      I+EL+  +IA+ T +   H E F +L+YE GQ Y  H+D +   +   +   R
Sbjct: 333 DGCRNDTIVELVHDRIAKVTGIGANHSEDFQILKYEPGQFYRQHHD-YIEHQRDRRCGPR 391

Query: 196 LASFLLYLSDVEEGGETMFP 215
           + +F LYLSDVEEGG T FP
Sbjct: 392 VLTFFLYLSDVEEGGATNFP 411


>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 204

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/73 (43%), Positives = 48/73 (65%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           ++  IE +I+  T LP  +GEA  +L Y+ G+KY+ HYD F+          R+A+ L+Y
Sbjct: 6   VVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVLMY 65

Query: 203 LSDVEEGGETMFP 215
           LS+VE+GGET+FP
Sbjct: 66  LSNVEKGGETIFP 78


>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
          Length = 394

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 54/80 (67%), Gaps = 2/80 (2%)

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           A++D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLS+VE+GGET+FP
Sbjct: 249 IATVLMYLSNVEKGGETIFP 268


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 81/167 (48%), Gaps = 20/167 (11%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET 119
           LP G    A+ + +   +   +P         S E+C  +I  A  +L+ S +       
Sbjct: 73  LPPGNRIIAADREVQVALRVEQPVLAVLDGVLSHEECDELIRRAAAKLQRSTI------- 125

Query: 120 VESTKG------TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
           V+ T G       R+S GTF   + D    +  ++ +I+    LP  HGE   +L Y  G
Sbjct: 126 VDPTTGKHETIADRSSEGTFFEINADD--FIARLDRRISALMNLPVDHGEGLQILHYGPG 183

Query: 174 QKYDSHYDAFNPAEYGP--QMS---QRLASFLLYLSDVEEGGETMFP 215
            +Y  H+D F P + G   QM+   QR+++ ++YL++VE+GG T+FP
Sbjct: 184 GEYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNEVEDGGATIFP 230


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 69/139 (49%), Gaps = 8/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      +  S  +C  +I   ++R++ S + +      E     R S G F++ S D  
Sbjct: 91  PVVALLADVLSPRECDRLIEIGRERVRRSSV-VDPDSGGEVLIDARKSEGAFVNGSTDP- 148

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            ++  I+ +IA     P  +GE  ++LRY  G +Y  H+D F   + G +       QR+
Sbjct: 149 -LVATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRI 207

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ +LYL+ VEEGG+T FP
Sbjct: 208 ATLILYLNQVEEGGDTTFP 226


>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
          Length = 369

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 33/80 (41%), Positives = 54/80 (67%), Gaps = 2/80 (2%)

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           A++D+  ++  IE +I+  T LP  +GE+  +L Y+ G+KY+ HYD F+  +       R
Sbjct: 191 ATQDE--VVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHR 248

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L+YLS+VE+GGET+FP
Sbjct: 249 IATVLMYLSNVEKGGETIFP 268


>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
 gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
          Length = 147

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 34/73 (46%), Positives = 46/73 (63%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
           I+  IE +IA  T +P  +GE   VL Y +GQK++ H+D  +          R A+FL+Y
Sbjct: 10  IVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGPRKATFLMY 69

Query: 203 LSDVEEGGETMFP 215
           LSDVEEGGET+FP
Sbjct: 70  LSDVEEGGETVFP 82


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + F    S  +C  I+A A  RL  S   +            RTS G F +  E   
Sbjct: 102 PRVIVFSGLLSDAECDEIVALAGARLARSH-TVDTATGASEVNAARTSDGMFFTRGEHP- 159

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG-PQM----SQRL 196
            +    E +IA     P  +GE   VL Y  G +Y  HYD F+P + G P +     QR+
Sbjct: 160 -VCARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRV 218

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ + YL+    GG T FP ++ L+++
Sbjct: 219 ATLVTYLNTPTRGGGTTFP-DIGLEVT 244


>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 165

 Score = 70.5 bits (171), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/91 (43%), Positives = 55/91 (60%), Gaps = 2/91 (2%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
            RTSSG F+S+ E K+ +   IE +I+  + +P  +GE   VLRYE  Q Y  H+D F+ 
Sbjct: 11  VRTSSGMFLSSEERKSPMA--IEKRISVYSQVPIENGELVQVLRYEKSQFYRPHHDYFSD 68

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
                +  QR+A+ L+YLSD  EGGET FP 
Sbjct: 69  TFNLKRGGQRVATMLMYLSDNVEGGETYFPM 99


>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
          Length = 322

 Score = 70.5 bits (171), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 77/155 (49%), Gaps = 10/155 (6%)

Query: 66  GDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVEST 123
           G  + +S   + +S  PR     N  + E+C  +++ A ++   + L    G  + VEST
Sbjct: 66  GGMAKKSTWIETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVEST 125

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
             TRT+   ++   +D   +++ +E KIA+ T      GE   VL Y   Q++  H+D F
Sbjct: 126 --TRTNKQAWLDFQQDD--VVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYF 181

Query: 184 NPAEYGPQM----SQRLASFLLYLSDVEEGGETMF 214
           +PA   P+       RL + ++YL   EEGGET F
Sbjct: 182 DPATDPPENYEKGGNRLITVIVYLQAAEEGGETHF 216


>gi|323445926|gb|EGB02303.1| hypothetical protein AURANDRAFT_39521 [Aureococcus anophagefferens]
          Length = 239

 Score = 70.5 bits (171), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 7/141 (4%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P   +  +FA  + C+ +I  A+  L  +++  R+G    +    R +S  +++A 
Sbjct: 31  LSADPLVYFIDDFADEDSCEHLIRQARPSLGGAEVQTRRGSAART--AIRRASSCWLAAR 88

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYE--IGQKYDSHYDAFNPAEYG-PQMSQ 194
            D+   LE +E  I      P+   E F+V+RY    G++Y +H DAF        +  Q
Sbjct: 89  GDEA--LEHLEDAICAELGAPEERTEFFHVVRYRPSTGERYAAHADAFEAGNAELERGGQ 146

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           RL + LLYLSDV  GG T+FP
Sbjct: 147 RLTTALLYLSDVGAGGATVFP 167


>gi|224012759|ref|XP_002295032.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
           CCMP1335]
 gi|220969471|gb|EED87812.1| hypothetical protein THAPSDRAFT_264808 [Thalassiosira pseudonana
           CCMP1335]
          Length = 194

 Score = 70.5 bits (171), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 45/137 (32%), Positives = 71/137 (51%), Gaps = 9/137 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI--SASED 139
           P  +   NF + E+   +I   K+  +  QL+ ++ +++    GTRTS   +      +D
Sbjct: 2   PWLVSLENFLTDEEADYLIEVGKR--QQYQLSEQRKDSL----GTRTSYSAWCRRDCWKD 55

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASF 199
              +  +++ +IA+ T +         +LRYE GQK+  H D       G    QRL +F
Sbjct: 56  DATVSSVVD-RIAKVTKVETKQLSNLQLLRYEEGQKFKQHTDFAAMLSRGRAQGQRLMTF 114

Query: 200 LLYLSDVEEGGETMFPF 216
           L+YLSDVEEGGET FP+
Sbjct: 115 LIYLSDVEEGGETSFPY 131


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score = 70.1 bits (170), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 70/135 (51%), Gaps = 20/135 (14%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILE 145
           SAE+C+++IA A+ RL PS        +V+   G       R+S G F    E+    + 
Sbjct: 109 SAEECEALIALARPRLAPST-------SVDPLTGRNRLGAQRSSLGMFFRLREN--AFVA 159

Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFL 200
            ++ +++    LP  +GE   VL Y  G +   H+D   P+    Q S     QR+++ +
Sbjct: 160 RLDERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLV 219

Query: 201 LYLSDVEEGGETMFP 215
            YL++VEEGGET+FP
Sbjct: 220 AYLNEVEEGGETVFP 234


>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
 gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
          Length = 303

 Score = 69.7 bits (169), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 69/138 (50%), Gaps = 18/138 (13%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LSW PR   +  F S  +C  +I+ A  +        +Q   V        S G  I   
Sbjct: 62  LSWHPRVFLYEGFLSDMECDHLISMAHGK--------KQSSLVVGGSAGNNSQGASI--- 110

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   I+  IE +I+  + LP+  GE+  +L+YE+ +   ++Y++ + + +      RL 
Sbjct: 111 EDT--IVSTIEDRISVWSFLPKDFGESMQILKYEVNKSDYNNYESQSSSGH-----DRLV 163

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLSDV+ GGET FP
Sbjct: 164 TVLMYLSDVKRGGETAFP 181


>gi|397620797|gb|EJK65916.1| hypothetical protein THAOC_13183 [Thalassiosira oceanica]
          Length = 520

 Score = 69.7 bits (169), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 51/150 (34%), Positives = 70/150 (46%), Gaps = 9/150 (6%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            +++   P      +F SA++C  I+A A   LKP  +   +   VE     RTS+   +
Sbjct: 314 LKMIHQDPDIYSVDDFLSADECDRIVAKACPHLKPCLVNNERNGRVEQDPA-RTSTNANV 372

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQM 192
              E  T     +  KI   T       E   VLRY+ GQ++  H D F+   +  G   
Sbjct: 373 PQVEVPT-----VMRKITDLTNCSPDKVEILQVLRYKKGQEFVPHTDGFSGPYSSCGFVD 427

Query: 193 SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           S RL +   YL+DVEEGG T FP EVDL I
Sbjct: 428 STRLCTVFCYLNDVEEGGTTYFP-EVDLDI 456


>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
           [Brachypodium distachyon]
          Length = 314

 Score = 69.7 bits (169), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 71/138 (51%), Gaps = 8/138 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           L+W PR   +  F S  +C  ++  A+  ++ S L       +  T+ +  +   F   +
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNI--TQNSTDARFKF-QLA 119

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           + K  ++  IE +I+  + +P+ HGE+  +L+Y   Q  D + D    +  G     RL 
Sbjct: 120 DSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 174

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLSDV++GGET+FP
Sbjct: 175 TILMYLSDVKQGGETVFP 192


>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
 gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
          Length = 262

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 7/138 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P+A  +  F SAE+C  +I      LK S +   + +T       RTS GTF+   
Sbjct: 4   LSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDT-GVLDDVRTSFGTFLPKK 62

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
            D   +L  IE ++   + +   + E   +L+Y  GQ+Y  H D        P   +R+A
Sbjct: 63  YDD--VLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQDGLT----SPNGGRRIA 116

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L++L + E+GGET FP
Sbjct: 117 TVLMFLHEPEKGGETSFP 134


>gi|228999322|ref|ZP_04158902.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
 gi|229006877|ref|ZP_04164509.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
 gi|228754370|gb|EEM03783.1| hypothetical protein bmyco0002_37790 [Bacillus mycoides Rock1-4]
 gi|228760519|gb|EEM09485.1| hypothetical protein bmyco0003_38780 [Bacillus mycoides Rock3-17]
          Length = 195

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 67/147 (45%), Gaps = 21/147 (14%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P    +    +  +CQ +I  +KK ++P+Q     GE        R S  T++  
Sbjct: 7   VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
                G++  +   IA A  LP  H E     RYE+G K+D+H D +        N  E 
Sbjct: 59  YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFP 215
           G    QRL + +LYL+ V  GGET FP
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP 140


>gi|307106128|gb|EFN54375.1| hypothetical protein CHLNCDRAFT_135671 [Chlorella variabilis]
          Length = 486

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 30/165 (18%)

Query: 77  VLSWR-PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           V+SW  PR L   +F + ++ + +I  A      S++   + +  E+    RTS G++++
Sbjct: 181 VVSWSSPRVLLIRDFLTPDETEHLIRQATGGFARSEVVAEESKQHEA----RTSYGSWLN 236

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--AFNPAEYGPQMS 193
            ++    +LE I+++I R   +P+  GE+  VL+Y  GQKYD H D  A +PA    Q  
Sbjct: 237 GAKRDDKVLE-IQNRIHRLVGIPEAFGESIYVLQYSDGQKYDPHTDHCASSPAAASTQSC 295

Query: 194 Q----------------------RLASFLLYLSDVEEGGETMFPF 216
           Q                      RLA+F+LYL     GG T+FP 
Sbjct: 296 QDFLRRGGGPKCGPGGGGVTCGDRLATFILYLRSPTRGGATVFPL 340


>gi|228993272|ref|ZP_04153188.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
           12442]
 gi|228766340|gb|EEM14983.1| hypothetical protein bpmyx0001_40040 [Bacillus pseudomycoides DSM
           12442]
          Length = 195

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 67/147 (45%), Gaps = 21/147 (14%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL   P    +    +  +CQ +I  +KK ++P+Q     GE        R S  T++  
Sbjct: 7   VLHDEPFVAQYEQIITPAECQELIELSKKHIQPAQAYGHTGE--------RKSDFTWLPH 58

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------NPAEY 188
                G++  +   IA A  LP  H E     RYE+G K+D+H D +        N  E 
Sbjct: 59  YSH--GLVSQVSELIATAMPLPLNHAEPLQAARYEVGGKFDAHIDCYGTWHEDGRNRVEQ 116

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFP 215
           G    QRL + +LYL+ V  GGET FP
Sbjct: 117 G---GQRLYTAILYLNTVNAGGETFFP 140


>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
          Length = 315

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 62/225 (27%), Positives = 108/225 (48%), Gaps = 27/225 (12%)

Query: 15  PTALLLCSFFFLAGFYGS--TFLSRDVPSIRPKLRTLEVVEKENESGL---------PHG 63
           P A +LC  F L    G   TF++  V S R ++  LEV  ++  SG+         P  
Sbjct: 15  PPAAVLCGIFVLGIGSGVILTFIAAQVVSRRSEI--LEVKPQKLRSGMLNAPGTSLSPAD 72

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVE 121
           +  +  I++I        PR     N  + E+C+S+ +          L +  G  E VE
Sbjct: 73  KKNEFWIETISDLP---GPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVE 129

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
           S+  TRT++  ++   +    ++  +E+ +A+ T     +GE   +L Y+  Q++  H+D
Sbjct: 130 SS--TRTNTAAWLEYHQGP--VVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHD 185

Query: 182 AFNPAEYGPQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
            F+PA   P+       RLA+ ++YL + EEGGET F  ++D ++
Sbjct: 186 YFDPATDPPENFEPGGNRLATAIIYLQNAEEGGETDF-MKIDTKV 229


>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 354

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 77/155 (49%), Gaps = 10/155 (6%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSG 131
           P +VL        + +  S  +C  +I      L+PS +   L     V++    RTS  
Sbjct: 146 PTEVLDQTLPVELYVDVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNV---RTSYV 202

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
             I+ S     I   ++  I++ T  P+ +GEA N+LRY  GQ+Y  HYDA N    G  
Sbjct: 203 AIIAPSYCD-WITRKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSM 261

Query: 192 M---SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
                QR+ + L+YL+ V +GGET FP ++D+ +S
Sbjct: 262 YKDGKQRIKTALVYLNTVRQGGETRFP-KLDISVS 295


>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
           laterosporus GI-9]
 gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
           laterosporus GI-9]
          Length = 201

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 74/146 (50%), Gaps = 9/146 (6%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P Q+L+ +P    +P+  S+E CQS+I  A+ +L P+ +  + G  V      R S   +
Sbjct: 4   PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
              + ++  +++ I  +IA     P  + E   V  Y  G K+++H D ++  E    + 
Sbjct: 61  FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFL 118

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QRL + +LYL+DV  GGET FP
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFP 144


>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
 gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
          Length = 201

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 74/146 (50%), Gaps = 9/146 (6%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P Q+L+ +P    +P+  S+E CQS+I  A+ +L P+ +  + G  V      R S   +
Sbjct: 4   PTQLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVVGQSGLEVSH---VRISELAW 60

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE----YG 189
              + ++  +++ I  +IA     P  + E   V  Y  G K+++H D ++  E    + 
Sbjct: 61  FCHNYNE--VVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFL 118

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
               QRL + +LYL+DV  GGET FP
Sbjct: 119 EHSGQRLYTAILYLNDVVSGGETYFP 144


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 66/133 (49%), Gaps = 20/133 (15%)

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
           E+C  +I  +  +L+ S        TV+   G       R+S GTF   + D    +  +
Sbjct: 109 EECDELIRRSADKLQRST-------TVDPVNGGYEVIAARSSEGTFFPVNADD--FIARL 159

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLLY 202
           + +IA     P  +GE   VL Y  G +Y  H+D F+P + G +       QR+++ L+Y
Sbjct: 160 DRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTLLIY 219

Query: 203 LSDVEEGGETMFP 215
           L+DV +GG T+FP
Sbjct: 220 LNDVAQGGATVFP 232


>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
           [Brachypodium distachyon]
          Length = 303

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 19/138 (13%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           L+W PR   +  F S  +C  ++  A+  ++ S L            G R      I+ +
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLV---------NAGARN-----ITQN 108

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
                ++  IE +I+  + +P+ HGE+  +L+Y   Q  D + D    +  G     RL 
Sbjct: 109 STDDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-DHNKDGTQSSSGG----NRLV 163

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLSDV++GGET+FP
Sbjct: 164 TILMYLSDVKQGGETVFP 181


>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
           anophagefferens]
          Length = 182

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 67/131 (51%), Gaps = 13/131 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF + E+C ++I +AK  + P+ +    G        +RTSS  ++ A ED    L  + 
Sbjct: 8   NFLTEEECDALIDSAKDHMTPAPVV---GPGNGEVSVSRTSSTCYL-ARED----LPSVC 59

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
            K+   T  P  H E   V RY  G+ Y  HYDAF+ +      +     QR+A+ L+YL
Sbjct: 60  TKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATVLVYL 119

Query: 204 SDVEEGGETMF 214
           +DVE GGET F
Sbjct: 120 NDVERGGETSF 130


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 74/144 (51%), Gaps = 6/144 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++   P  + F +  S ++   +   A+  LK + + +  G+ V  ++  RTS G 
Sbjct: 308 LKMEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTVHV-NGKYV--SRRVRTSKGA 364

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-EYGPQ 191
           ++    D   +   IE ++   T L     EA+N++ Y +G  Y +HYD FN   +   +
Sbjct: 365 WLE--RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSE 422

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A+ L YLSDVE+GG T+FP
Sbjct: 423 TGDRIATVLFYLSDVEQGGATVFP 446


>gi|397644356|gb|EJK76358.1| hypothetical protein THAOC_01879, partial [Thalassiosira oceanica]
          Length = 539

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 77/157 (49%), Gaps = 14/157 (8%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQG 117
           HG T + + +  P+ V+        F NF + E+   ++   +    +R      A   G
Sbjct: 324 HGATWEGAEEGSPWVVV--------FDNFLTDEEVADLVKGGELEGYERSTDQGAANAYG 375

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYD 177
           E  +    TRTSS  +     ++   +     KI   T +PQ + E+F +L+Y+ GQ Y 
Sbjct: 376 EQEKVVSRTRTSSNAWCMHKCERLPGVRSASKKIEAVTGIPQVNYESFQLLKYDGGQFYR 435

Query: 178 SHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           SH+D+ +  +  P    R+ +F LYLSDVEEGGET F
Sbjct: 436 SHHDS-SSVDDSP-AGHRILTFFLYLSDVEEGGETYF 470


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 72/146 (49%), Gaps = 5/146 (3%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  +V   +P+   F +  +  + + +   A  +L  + +    GE + +T   R S   
Sbjct: 317 IKTEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATVHGENGELLHAT--YRISKSG 374

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 189
           ++S S+D  G ++ I+ +I   T L  +  E   V+ Y IG +Y+ HYD     E     
Sbjct: 375 WLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTS 434

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
                R+++ L+Y+SDVE+GG T+FP
Sbjct: 435 LGSGNRISTLLIYMSDVEKGGATVFP 460


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 5/148 (3%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ I  ++ S +PR + + N  + E+ ++    A+ RL+ S +        E TK  R +
Sbjct: 330 IQPIKMELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTK-YRIA 388

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              F+  SE    I+++   +I   T L  T  E   V  Y IG  Y+ HYD     E  
Sbjct: 389 KAAFLQNSEHD-HIVKMTR-RIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQ 446

Query: 190 PQMS--QRLASFLLYLSDVEEGGETMFP 215
                  R+A+++ Y+SDVE GG T+FP
Sbjct: 447 KDFGWGNRIATWMFYMSDVEAGGATVFP 474


>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
          Length = 264

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 53/161 (32%), Positives = 74/161 (45%), Gaps = 14/161 (8%)

Query: 63  GETGDASI----QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGE 118
           GE G A      Q     +LS  P  + F NF S E+  +I+  AK +   S   + +  
Sbjct: 46  GEWGYADADWLRQHYNITMLSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIER-- 103

Query: 119 TVESTKGTRTSSGTF----ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
                   RTSS  +    +  ++     L+ +E +IAR   LP  + E F VL+Y+  Q
Sbjct: 104 ---EVSNYRTSSTAWMLPDVLGNDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQ 160

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            Y  H D        P    R+A+F LYL+DVEEGG T FP
Sbjct: 161 YYKVHSDYIEEQRQQP-CGIRVATFFLYLNDVEEGGGTRFP 200


>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 152

 Score = 67.4 bits (163), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 49/98 (50%), Gaps = 7/98 (7%)

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
               RTS    +   +D   + + IE +IAR    P  HGE   VLRY  G +Y  HYD 
Sbjct: 2   VHAARTSDSMCLRVGQD--ALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDY 59

Query: 183 FNPAEYGPQM-----SQRLASFLLYLSDVEEGGETMFP 215
           F+P   G  +      QR+AS ++YL+  E GG T FP
Sbjct: 60  FDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFP 97


>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
 gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
          Length = 303

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 70/138 (50%), Gaps = 19/138 (13%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LSW PR   +  F S  +C  +++  +  ++ S LA         T G R SS   I   
Sbjct: 63  LSWHPRIFLYEGFLSDMECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI--- 110

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   ++  IE +I+  + LP+ +GE+  VL+Y + +       +          + RLA
Sbjct: 111 EDI--VVSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLA 163

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLSDV++GGET+FP
Sbjct: 164 TILMYLSDVKQGGETVFP 181


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 67/133 (50%), Gaps = 10/133 (7%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
           N   A +C+++I  AK RL PS L     G  V S K  R S G F    E+   ++  +
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDK--RASWGMFFRLCEND--LVARL 162

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
           + +++    LP  +GE  ++L Y  G   + H+D   P     + S     QR+++ + Y
Sbjct: 163 DRRLSALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTY 222

Query: 203 LSDVEEGGETMFP 215
           L+D  EGG+T+FP
Sbjct: 223 LNDAPEGGQTVFP 235


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score = 67.0 bits (162), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P    + +  S  +C +++  A+ RL  S + +      E+    RTS G      E   
Sbjct: 90  PSIRLYQHLLSDAECDALVELARGRLARSPV-INPDTGDENLIDARTSMGAMFQVGEHT- 147

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            +++ IE +IA    +P  HGE   +L Y+ G +Y  H+D FNP   G         QR 
Sbjct: 148 -LIQRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRT 206

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQIS 223
           A+ ++YL+  + GG T FP  + L+++
Sbjct: 207 ATLVIYLNTPQAGGATAFP-RIGLEVA 232


>gi|323456313|gb|EGB12180.1| hypothetical protein AURANDRAFT_61447 [Aureococcus anophagefferens]
          Length = 317

 Score = 66.6 bits (161), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 17/151 (11%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV-ESTKGTRTSSGT 132
           P +VLS  P A    +FA+  +C  IIA A  RL     AL  G+   E    +R++   
Sbjct: 101 PCEVLSTAPLAFCVRDFATGAECDRIIAEATPRL---SAALVAGDGAGEQAGSSRSAQVA 157

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--------- 183
           ++  S D       +  ++A    +P +H E+  V++Y  G +Y  H+DAF         
Sbjct: 158 WVPRSPDDP----WLARRVAELIDVPLSHAESLQVVKYGAGGEYKPHFDAFPLDAARGRR 213

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
                     QR  + +LYL+DVE+GG T F
Sbjct: 214 AAVRGRTYAGQRRVTAILYLNDVEKGGGTAF 244


>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
 gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
           ATCC 15444]
          Length = 298

 Score = 66.6 bits (161), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 73/140 (52%), Gaps = 14/140 (10%)

Query: 82  PRA-LY-FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED 139
           P+A LY +PNF + E C ++IA   +RL+ S           +    RTS  + I  +  
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRASTTT-----DAFADPKIRTSRSSDI-GTMG 153

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQ 194
              +++L E  IA A  +  ++ +A    RY++ Q+Y +HYD F P     Q+      Q
Sbjct: 154 HNLVMQLDE-LIAEALGIHWSYSDATQTQRYDVNQEYKAHYDYFTPGTRDYQVHCQFTGQ 212

Query: 195 RLASFLLYLSDVEEGGETMF 214
           R  +F++YL+DVEEGG T F
Sbjct: 213 RTWTFMIYLNDVEEGGGTRF 232


>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 215

 Score = 66.2 bits (160), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 9/135 (6%)

Query: 85  LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
           ++F    S ++C  +IA       KPS +     +    T G R S  T ++ S D   I
Sbjct: 17  VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 73

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
           +  I  +I   + + Q + E   +L Y  G KYD HYDAF  ++  PQ+     RL + L
Sbjct: 74  ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 131

Query: 201 LYLSDVEEGGETMFP 215
           LYL+DVE GG T FP
Sbjct: 132 LYLNDVEYGGWTQFP 146


>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 213

 Score = 66.2 bits (160), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 67/135 (49%), Gaps = 9/135 (6%)

Query: 85  LYFPNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI 143
           ++F    S ++C  +IA       KPS +     +    T G R S  T ++ S D   I
Sbjct: 15  VHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPG-RCS--TVVAPSVDAYPI 71

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFL 200
           +  I  +I   + + Q + E   +L Y  G KYD HYDAF  ++  PQ+     RL + L
Sbjct: 72  ILEIRRRIELFSGISQENQEPLQILHYTRGGKYDIHYDAF--SDGSPQLRNGGNRLLTVL 129

Query: 201 LYLSDVEEGGETMFP 215
           LYL+DVE GG T FP
Sbjct: 130 LYLNDVEYGGWTQFP 144


>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
          Length = 226

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 65/137 (47%), Gaps = 15/137 (10%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK-TGI 143
            Y P+F  A  C  ++A      + S +        ES +  RTS     S   D+ +  
Sbjct: 43  FYHPDFLDAATCDRLVALIDANRRRSTVLAE-----ESVQDFRTSD----SCDMDRWSPD 93

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLAS 198
           +   +  IA    +   HGE     RY +GQ + +H+D FN A+ Y P+M     QR  +
Sbjct: 94  VRPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWT 153

Query: 199 FLLYLSDVEEGGETMFP 215
            ++YL+DVEEGG T FP
Sbjct: 154 AMIYLNDVEEGGATWFP 170


>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
 gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 79/155 (50%), Gaps = 13/155 (8%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSII-ATAKKRLKPSQLALRQGETVE---STKGTRT 128
           +  +VLS  PRA    NF S  +   I+  T   +L  S  A     T +   ST+ TRT
Sbjct: 1   MTLKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRT 60

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTH-GEAFNVLRYEIGQKYDSHYDAFNP-- 185
           S  T++    +K+ I++ I  + A   ++ +    EA  ++ Y++GQ+Y +H+D  +P  
Sbjct: 61  SLNTWVY--REKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDI 118

Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVD 219
             EY P    R  + LLYL++  EGG T FP  V+
Sbjct: 119 DNEYQP---ARYCTLLLYLNEGMEGGATQFPRWVN 150


>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
 gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
          Length = 575

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 73/146 (50%), Gaps = 10/146 (6%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + LS  P  +Y   F    +C+++I  A+ R+K + ++L     V  ++G RT S  ++
Sbjct: 50  METLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALVSLDGSSGV--SQG-RTGSNCWL 106

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEYG 189
              E+   +   I  ++A+    P  + E   V+ Y   Q+Y  HYDA++          
Sbjct: 107 RYQEEP--LARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCT 164

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
            Q  QR+ + LLYL++VEEGG T FP
Sbjct: 165 RQGGQRMVTALLYLNEVEEGGATAFP 190


>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 309

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/181 (28%), Positives = 91/181 (50%), Gaps = 23/181 (12%)

Query: 41  SIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV--LSWRPRALYFPNFASAEQCQS 98
           SIR +LR  EV   ++E+ +  G +   +  S+  QV  LSWRPR   +  F + E+C  
Sbjct: 26  SIRKELRDKEV---KHETIIQLGSSVQTNRISL-LQVVQLSWRPRVFLYKGFLTDEECDR 81

Query: 99  IIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGI----LELIEHKIARA 154
           +I+ A              + +   KG  + +   +++SE ++ I    L  IE +I+  
Sbjct: 82  LISLA-----------HGAKEISKGKGDGSRNNIQLASSESRSHIYDDLLARIEERISAW 130

Query: 155 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           T +P+ + +   V+ Y I +  + H+D F+       +S  +A+ +LYLS+V  GGE +F
Sbjct: 131 TFIPKENSKPLQVMHYGIEEARE-HFDYFDNKTLISNVSL-MATLVLYLSNVTRGGEILF 188

Query: 215 P 215
           P
Sbjct: 189 P 189


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 68/150 (45%), Gaps = 13/150 (8%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           P +V  LS  P  + F +     +  +++  AK ++  + +    G         RTS  
Sbjct: 324 PLKVEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVT---GHNSSVVSNARTSQF 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
           TF+  +  K  +L  I+ ++A  T L   + E   +  Y IG  Y  H D F P  +   
Sbjct: 381 TFLPKTRHK--VLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETK 438

Query: 189 ---GPQMSQRLASFLLYLSDVEEGGETMFP 215
               P+M  R+ + L YLSDVE+GG T FP
Sbjct: 439 QVSNPEMGNRIGTVLFYLSDVEQGGATAFP 468


>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 532

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 77/159 (48%), Gaps = 10/159 (6%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           GETG   +Q I  +  + +P  +   +         +IA AK RL+ S+      +    
Sbjct: 307 GETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363

Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
              +RTSS T+++  +      + + ++  +   T+  +   E + +  Y IG  Y  H+
Sbjct: 364 GPPSRTSSNTWLNDEDAPVAARVNQYLQSLLGLGTLFSRDEAEKYQLANYGIGGHYVPHH 423

Query: 181 DAF----NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           D F     P++ G +   R+A+ ++Y+SDVEEGG T+FP
Sbjct: 424 DYFEEFQTPSK-GNRFGNRVATLMIYMSDVEEGGATVFP 461


>gi|85373596|ref|YP_457658.1| hypothetical protein ELI_03845 [Erythrobacter litoralis HTCC2594]
 gi|84786679|gb|ABC62861.1| hypothetical protein ELI_03845 [Erythrobacter litoralis HTCC2594]
          Length = 217

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 73/166 (43%), Gaps = 23/166 (13%)

Query: 60  LPHGE------TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA 113
           LP GE      T    +Q +P+  L    R     +F S + C  +IA  +   +PS +A
Sbjct: 2   LPPGESSADRLTHYGGVQRLPYAKLELFQRR----DFLSPDHCAELIALIETNRRPSTIA 57

Query: 114 LRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
              G+        RTS    +    D  G LE +   +   + +   HGE     RYE+G
Sbjct: 58  DHNGDEY-----FRTSETCDLDPDLDAVGALEAM---LEELSGIDSAHGEPVQGQRYEVG 109

Query: 174 QKYDSHYDAFNP-----AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           Q++  H D F P      +Y     QR  +F++YL++VE GG T F
Sbjct: 110 QEFKPHNDWFEPNGADYEKYCAVAGQRTWTFMIYLNEVEAGGATRF 155


>gi|359400227|ref|ZP_09193216.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
 gi|357598467|gb|EHJ60196.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
          Length = 193

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 63/131 (48%), Gaps = 13/131 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F    QC ++IA  +   +PS +A   G+ V      RTSS   +S      G +  + 
Sbjct: 16  DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPD---VGAVAALA 67

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
            K+   + +   H E     RYE+GQ++ +H D F P      +Y     QR  +F++YL
Sbjct: 68  RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 127

Query: 204 SDVEEGGETMF 214
           +DV+ GG T F
Sbjct: 128 NDVDAGGATRF 138


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 67/133 (50%), Gaps = 20/133 (15%)

Query: 94  EQCQSIIATAKKRLKPSQLALRQGETVESTKG------TRTSSGTFISASEDKTGILELI 147
           ++C+ +IA A+ RL PS        TV+   G       R+S G F    E+    +  +
Sbjct: 111 QECEELIALARPRLAPST-------TVDPLSGRDLVGEQRSSLGMFFRLREN--AFIARL 161

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRLASFLLY 202
           + +++    LP  +GE   VL Y  G +   H+D   P+    + S     QR+++ + Y
Sbjct: 162 DQRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSY 221

Query: 203 LSDVEEGGETMFP 215
           L++VEEGGET+FP
Sbjct: 222 LNEVEEGGETIFP 234


>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
          Length = 151

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 48/83 (57%)

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           F++  E K  ++  IE +I+  + +P  +GE   VLRYE  Q Y  H+D F       + 
Sbjct: 2   FLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRG 61

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
            QR+A+ L+YLSD  EGGET FP
Sbjct: 62  GQRIATMLMYLSDNVEGGETYFP 84


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 63/139 (45%), Gaps = 8/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR   F    +  +C +++A A+ RL  S + +      E+    RTS G      E   
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPV-INPDTGDENLIEARTSLGAMFQVGEHP- 189

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRL 196
            ++E IE  IA  T +    GE   +L Y+ G +Y  HYD FNP   G         QR+
Sbjct: 190 -LIERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRV 248

Query: 197 ASFLLYLSDVEEGGETMFP 215
            + ++YL+    GG T FP
Sbjct: 249 GTLVIYLNSPLAGGATAFP 267


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 17/135 (12%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT------RTSSGTFISASEDK 140
           F NFASA++C  +    +K+L            V  T G       R S+  ++    D 
Sbjct: 309 FRNFASAQECAHLREEGRKKL---------SRAVAWTDGAFRPVEFRISTAAWLQPDHDD 359

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             ++  +  +IA AT L     EA  V  Y IG  Y++HYD     E       R+A+F+
Sbjct: 360 --VVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIATFM 417

Query: 201 LYLSDVEEGGETMFP 215
           +YL+ VE+GG T FP
Sbjct: 418 IYLNQVEQGGYTAFP 432


>gi|399057802|ref|ZP_10744231.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
 gi|398041550|gb|EJL34606.1| 2OG-Fe(II) oxygenase superfamily enzyme [Novosphingobium sp. AP12]
          Length = 210

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 17/152 (11%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
           A +Q  P    S R       NF +AEQC  ++A  +   +PS +A   G+        R
Sbjct: 16  AGVQRFP----SSRLDLFILKNFVAAEQCAELMALIEDSHRPSTIADYNGD-----DAFR 66

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
           TSS   +S       ++  +   ++R + +   H E     RYE+GQ++ +H D F P  
Sbjct: 67  TSSTCDLSTD---VPVVANLAAALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPGN 123

Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMF 214
               +Y     QR  +F++YL++VE GG T F
Sbjct: 124 ADYDKYCAVPGQRTWTFMIYLNEVEAGGATRF 155


>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
          Length = 549

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 68/138 (49%), Gaps = 19/138 (13%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LSW PR   +  F S  +C  +++T +  +  S LA         T G R SS   I   
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMD-SSLAF--------TDGDRNSSYNNI--- 356

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
           ED   ++  IE +I+  + LP+ +GE   VL+Y + ++      +             LA
Sbjct: 357 EDI--VVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLA 409

Query: 198 SFLLYLSDVEEGGETMFP 215
           + L+YLSDV++GGET+FP
Sbjct: 410 TILIYLSDVKQGGETVFP 427


>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
 gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
          Length = 300

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 75/148 (50%), Gaps = 9/148 (6%)

Query: 73  IPFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTS 129
           +P+ +  +S  P  + + N  S  + +S+ A A K+L+P+ +         + +G TR +
Sbjct: 93  MPYAIEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIA 152

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              FI   ++++ +   I  ++   T L     E   V+ Y I  +Y  HYD F PA+ G
Sbjct: 153 KMAFIL--DEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTF-PAKSG 209

Query: 190 PQMS---QRLASFLLYLSDVEEGGETMF 214
            +      RLA+ +LYLSDVE GG T+F
Sbjct: 210 DRSHPSHDRLATAILYLSDVERGGATVF 237


>gi|393774561|ref|ZP_10362923.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
 gi|392720044|gb|EIZ77547.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. Rr 2-17]
          Length = 210

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 13/131 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F SA +C+ ++A  +   +PS +A   G+        RTSS   +SA   +   +  + 
Sbjct: 33  DFLSAPECEELVALIEAEHRPSTIADFTGD-----DAFRTSSTCDLSA---QVPAVADLA 84

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
            K+AR + +   H E     RYE+GQ++ +H D F P       Y     QR  +F++YL
Sbjct: 85  AKLARLSGIDPAHAEPLQGQRYEVGQQFKAHTDTFEPGTADYQRYCSASGQRTWTFMVYL 144

Query: 204 SDVEEGGETMF 214
           ++V+ GG T F
Sbjct: 145 NEVDAGGATRF 155


>gi|224011205|ref|XP_002295377.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209583408|gb|ACI64094.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 207

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 67/137 (48%), Gaps = 5/137 (3%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQ---LALRQGETVESTKGT-RTSSGTFISAS 137
           P  +    F S E+C   I     R + S      +    T +S + + RTS+ T+    
Sbjct: 9   PWVVAIEGFLSDEECNRFIELGGDRYERSTEYASTMNLDGTFDSKESSGRTSTNTWCGEG 68

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLA 197
                I++ +  ++   T +P  + E   ++RYEIGQ+Y+ H+D ++ +  G Q   R+ 
Sbjct: 69  CRDDPIIKKVIERMESLTGIPYANFEDLQLVRYEIGQRYEEHHD-YSSSHEGTQYGPRIL 127

Query: 198 SFLLYLSDVEEGGETMF 214
           +   YL+DVEEGG T F
Sbjct: 128 TVFFYLNDVEEGGGTQF 144


>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
 gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
          Length = 508

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 42/160 (26%), Positives = 86/160 (53%), Gaps = 17/160 (10%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  +S  P  + + +    +  Q +IA A+ RL+P+++   + +  E+    R++ G
Sbjct: 295 PFKMEEISLEPYIVVYHDILPDKDMQQLIALAEPRLRPTEVF--EEDKSEARTSDRSALG 352

Query: 132 TFISASE-DKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE- 187
           TF+   + + +G  +L+ +  ++   T +   H   FN+++Y  G +Y +++D FN    
Sbjct: 353 TFLPFKDMNPSGGPLLDRLTQRMRDITGIQIRHENTFNIIKYGFGSQYATNFDFFNGTNS 412

Query: 188 ----YGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
               YG     R+A+ L YL+D   GG T+FP  +D++++
Sbjct: 413 EMEGYG----DRMATVLFYLNDAPNGGATVFP-RIDVKVT 447


>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 55/179 (30%), Positives = 90/179 (50%), Gaps = 16/179 (8%)

Query: 42  IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
           IR +LR  +VV +E    L H    +    S   Q LSW+PRA  +  F S E+C  +I+
Sbjct: 25  IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83

Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
            A    K  +LA   G++  V   +  ++S G      E    +   IE +I+  T LP+
Sbjct: 84  LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137

Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            + E   V++Y+     QKY+ ++   + +++G  +   +A+ LL+LS+V  GGE  FP
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFP 192


>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 55/179 (30%), Positives = 90/179 (50%), Gaps = 16/179 (8%)

Query: 42  IRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIA 101
           IR +LR  +VV +E    L H    +    S   Q LSW+PRA  +  F S E+C  +I+
Sbjct: 25  IRKELRINKVVNQETTVQLGHSIEYNRVDPSRVIQ-LSWQPRAFLYRGFLSDEECDHLIS 83

Query: 102 TAKKRLKPSQLALRQGET--VESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQ 159
            A    K  +LA   G++  V   +  ++S G      E    +   IE +I+  T LP+
Sbjct: 84  LALG--KKEELATNGGDSGNVVLKRLLKSSEGPLYIDDE----VAARIEKRISAWTFLPK 137

Query: 160 THGEAFNVLRYEI---GQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            + E   V++Y+     QKY+ ++   + +++G  +   +A+ LL+LS+V  GGE  FP
Sbjct: 138 ENSEPLEVVQYQFENAKQKYN-YFSNKSTSKFGEPL---MATVLLHLSNVTRGGELFFP 192


>gi|224003081|ref|XP_002291212.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972988|gb|EED91319.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 467

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 70/146 (47%), Gaps = 7/146 (4%)

Query: 81  RPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
            P  + F NF S  +   +I   +    +R      A   GE  +    TRTSS  +   
Sbjct: 263 HPWVVVFNNFLSDNEVDDLIRGGEMEGFERSTDQGAANALGEQEKIVSQTRTSSNAWCMH 322

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL 196
             ++ G +     KI   T +P+ + E+F +L Y   Q Y SH+D+ +     P    R+
Sbjct: 323 KCERLGGVRSATTKIEDVTGIPRVNYESFQLLNYGQNQFYRSHHDSSSRDHTPP--GPRI 380

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQI 222
            +F LYLSDVEEGGET F  ++DL +
Sbjct: 381 LTFFLYLSDVEEGGETYFN-KLDLAV 405


>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 294

 Score = 63.9 bits (154), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 15/133 (11%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELI 147
           +F S  +C ++IA A   +  S +     GE  ES    RTSS  F+ A ED    L  +
Sbjct: 108 DFFSGPECDALIALAGNYMIVSPVVGAGAGEVSES----RTSSSCFL-ARED----LPTV 158

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLY 202
            HK+   T  P  H E   V RY   QKY +H+DAF     +   +     QR+ + L+Y
Sbjct: 159 CHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAFDLNTEDGRRFAQNGGQRVCTVLVY 218

Query: 203 LSDVEEGGETMFP 215
           L+DV  GG T FP
Sbjct: 219 LNDVPSGGCTAFP 231


>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
          Length = 190

 Score = 63.9 bits (154), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 110 SQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLR 169
           S +A    E        RTSS  ++S + D   ++  I  ++A    LP    E   VL 
Sbjct: 4   STIAEAGNEAKNGVGSARTSSTAWLSKTADP--LVAKIRTRVAELVKLPMELAEDMQVLH 61

Query: 170 YEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEGGETMFPF 216
           Y   Q Y +H+D F+P  Y   ++     R  +   YLSDVEEGGET+FPF
Sbjct: 62  YSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPF 112


>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
 gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
          Length = 570

 Score = 63.5 bits (153), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 70/135 (51%), Gaps = 16/135 (11%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           N  S  +C  +I  AK  +K + + L +G   + ++G RT S  ++   ED   +++ + 
Sbjct: 28  NVISPVECAYLIELAKPHIKRAGVVLDEG--YKESEG-RTGSNHWLKYDEDD--VVQSVG 82

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN--------PAEYGPQMSQRLASFL 200
            +I+    LP  + E+  ++ Y   Q+Y  H+DAFN         A++G    QRL + L
Sbjct: 83  QRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNLSLPKGQRAAKWG---GQRLVTAL 139

Query: 201 LYLSDVEEGGETMFP 215
           +YL+ VE GG T FP
Sbjct: 140 VYLNKVEAGGATQFP 154


>gi|219128111|ref|XP_002184265.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217404496|gb|EEC44443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 584

 Score = 63.5 bits (153), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 70/148 (47%), Gaps = 10/148 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKK----RLKPSQLALRQG--ETVESTKGTRTSSGTFIS 135
           P  +   NF + E+C  +I    K    R K        G  E V+ST+ T  ++     
Sbjct: 382 PWVITLDNFLTLEECTELINIGHKHGYNRSKDVGKVKVDGTHEAVQSTRRTSENAWCSNQ 441

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
           +      + +L+  ++A    +P  + E F +L+YE GQ Y +H+D F   +   Q   R
Sbjct: 442 SGCRDEALPQLLHERMATVMRIPAQNSEDFQLLKYEKGQFYRTHHD-FIQHQTKRQCGPR 500

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
           + +F LYLSDV  GG T FP   DL I+
Sbjct: 501 ILTFFLYLSDVTAGGGTNFP---DLDIT 525


>gi|311032645|ref|ZP_07710735.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           m3-13]
          Length = 137

 Score = 63.5 bits (153), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 59/102 (57%), Gaps = 7/102 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   N  S E+C ++I  +K ++K S+++   G  V++    RTSS TF    E++ 
Sbjct: 39  PLVVVLANVLSDEECDALIRLSKDKMKRSKVS--NGLEVDAI---RTSSSTFFHEGENE- 92

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
            ++  IE ++++   +P  HGE   +L Y++GQ+Y +H+D F
Sbjct: 93  -LVARIEKRVSQIMNVPVEHGEGLQILNYQVGQEYKAHFDFF 133


>gi|219126272|ref|XP_002183385.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405141|gb|EEC45085.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 474

 Score = 63.5 bits (153), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 80/166 (48%), Gaps = 10/166 (6%)

Query: 61  PHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAK----KRLKPSQLALRQ 116
           P  E  D ++++  +Q+  W    +   +F +  +  ++IA       +R       L  
Sbjct: 258 PDYEPND-TVETASYQIGPW---VVIIDDFLNETETSTLIALGADQGYERSTDVGEILED 313

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G   +    TRTS+  +     D   + ++I  ++   T +P  + E+  +LRYE GQ Y
Sbjct: 314 GSYEDDESETRTSTNAWCYNECDDHEVTQIIWERMTFLTQIPPENSESLQMLRYEPGQFY 373

Query: 177 DSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
             H+D +   ++   +  R+ +  LYL+DVEEGG T FP E++L +
Sbjct: 374 AVHHD-YIENDWNRAVGSRILTVFLYLNDVEEGGATNFP-ELELAV 417


>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 331

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 47/160 (29%), Positives = 73/160 (45%), Gaps = 11/160 (6%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTS 129
           + +  Q +S  PRA    +  S ++C ++I  A+ RL  S  +    G+ V +      S
Sbjct: 121 RQVSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCS 180

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----N 184
             +F    E+ + + + I  + AR    P  H E     RY  G+++  H D F     N
Sbjct: 181 CASF--PPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLN 238

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP---FEVDLQ 221
             +       R+A+ LLYL++VE GG T FP   FEV  Q
Sbjct: 239 NDKIMGSSGHRIATVLLYLNEVEAGGATFFPNPGFEVRPQ 278


>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
 gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
          Length = 535

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 70/148 (47%), Gaps = 7/148 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       A+  +S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  ++   +L+ H +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
           G     R+A+ + YLSDVE GG T FPF
Sbjct: 434 GDLHGNRIATGIYYLSDVEAGGGTAFPF 461


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 54/109 (49%), Gaps = 9/109 (8%)

Query: 115 RQGETVESTKGT---RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYE 171
           R G  + ST      RTS   FI+A+  K  +L  I+ ++A  T L   + E   +  Y 
Sbjct: 362 RAGVVINSTSTVSKKRTSQHIFIAATRHK--VLRTIDQRVADMTNLNMQYAEDHQLADYG 419

Query: 172 IGQKYDSHYDAFNPAEYG----PQMSQRLASFLLYLSDVEEGGETMFPF 216
           IG  Y  H+D F  ++       +M  R+A+ L YLSDV +GG T FP 
Sbjct: 420 IGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPI 468


>gi|223999645|ref|XP_002289495.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974703|gb|EED93032.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1578

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 65/147 (44%), Gaps = 9/147 (6%)

Query: 78   LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
            L   P      NF S E C  I+A A    +P  +     E VE     RTSS   +   
Sbjct: 1375 LHSNPDIFTVDNFLSDEDCDRIVAKADPHFRPCLVKNESTEAVEQDP-FRTSSDANLPQR 1433

Query: 138  EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA--EYGPQMSQR 195
            E  + + +L       A  L     E   +LRY+ GQ++  H D F+ A    G +   R
Sbjct: 1434 EAPSLVSKLTSLASCDANQL-----EIMQILRYDKGQQFQPHTDGFDGATTACGFEQGNR 1488

Query: 196  LASFLLYLSDVEEGGETMFPFEVDLQI 222
            L +   YL+ VE+GG T FP  +DL+I
Sbjct: 1489 LVTIFCYLNSVEKGGSTYFPG-IDLEI 1514


>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
          Length = 305

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 69/132 (52%), Gaps = 13/132 (9%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
           F  F + ++C  +I+  +  L+P+  +  R G  +      RTS G  F  A ED   ++
Sbjct: 126 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--VRTSDGGIFGPAREDL--VI 181

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
           + I  +IA A+    + GE   +LRY +GQ+Y  H+D        P + +QR  + L+YL
Sbjct: 182 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 235

Query: 204 SDVEEGGETMFP 215
           ++   GGET+FP
Sbjct: 236 NEGYAGGETIFP 247


>gi|223997846|ref|XP_002288596.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
           CCMP1335]
 gi|220975704|gb|EED94032.1| hypothetical protein THAPSDRAFT_261963 [Thalassiosira pseudonana
           CCMP1335]
          Length = 373

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 66/129 (51%), Gaps = 4/129 (3%)

Query: 88  PNFASAEQCQSIIATAK-KRLKPSQLALRQGETVESTKGTRTSSGTFI-SASEDKTGILE 145
           P +     CQS  A      L  S +    GE  +     RTSS  +     E+ TG+ +
Sbjct: 178 PGWMIVNCCQSCDAKEGFGVLLDSNVIGGSGEKEKVISTHRTSSNAWCRKECENLTGV-K 236

Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSD 205
            +  +I   T +PQ + E+F +L+Y+ G+ Y SH+D+ + A        R+ +F LYL+D
Sbjct: 237 GVSKRIEEMTGIPQNNYESFQILQYKPGEYYKSHHDSSD-ANKDKVTGHRVLTFFLYLND 295

Query: 206 VEEGGETMF 214
           VEEGGET F
Sbjct: 296 VEEGGETHF 304


>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
 gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
          Length = 376

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 79/162 (48%), Gaps = 27/162 (16%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
           +S+P +V         + +  S  +C+ +IA     LKPS +       V+   G     
Sbjct: 174 KSLPIEV---------YESILSEYECRYLIAKFSALLKPSMV-------VDPVTGRGKID 217

Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF- 183
             RTS    I  +     I   ++  I++ T   + +GEA N+LRY  GQ+Y  HYD   
Sbjct: 218 SVRTSYVAVIEPTHCD-WITRKLDKIISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276

Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
             N A       QR+ + L+YL+ + EGGET+FP ++D++I+
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTINEGGETLFP-KLDIRIA 317


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 69/148 (46%), Gaps = 7/148 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++  +S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +L+ H +   + L   + E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
           G     R+A+ + YLSDVE GG T FPF
Sbjct: 434 GDLHGNRIATAIYYLSDVEAGGGTAFPF 461


>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
          Length = 262

 Score = 62.8 bits (151), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 88/187 (47%), Gaps = 28/187 (14%)

Query: 44  PKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATA 103
           P LR LE ++    S LP+ E  +AS            PR     N  + ++C+ ++  A
Sbjct: 42  PSLRGLESLK----SNLPYLEQINAS------------PRVFRIRNLLTKQECEHLMLLA 85

Query: 104 KKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
            ++     + +  G  + VEST  TRT+ G ++   +D   ++  +E  + + T      
Sbjct: 86  FRKGLSKTMIMPYGTHKLVEST--TRTNDGAWLDFLQDD--VVRRLEETLGKLTKTTPQQ 141

Query: 162 GEAFNVLRYEIG-QKYDSHYDAFNPAEYGP----QMSQRLASFLLYLSDVEEGGETMFPF 216
           GE   VL Y  G Q +  HYD F+PA   P    Q   R  + ++YL    EGGET FP 
Sbjct: 142 GENLQVLHYSNGAQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFP- 200

Query: 217 EVDLQIS 223
           E+ L+++
Sbjct: 201 ELGLKLT 207


>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
 gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
          Length = 480

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 69/130 (53%), Gaps = 10/130 (7%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F   ++CQ++I   ++  +PS +      +    +  RTSS   +   +D   ++  I+
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTIT-----SENPDQQFRTSSTCHLGNMQDP--VIRKID 155

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSD 205
            +I +   +  ++ E      Y++GQ++  H D F P   A YG    QR  +F++YL++
Sbjct: 156 LQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNE 215

Query: 206 VEEGGETMFP 215
           VE+GG+T+FP
Sbjct: 216 VEQGGDTVFP 225


>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 322

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 69/132 (52%), Gaps = 13/132 (9%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQ-LALRQGETVESTKGTRTSSG-TFISASEDKTGIL 144
           F  F + ++C  +I+  +  L+P+  +  R G  +      RTS G  F  A ED   ++
Sbjct: 143 FRQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHP--IRTSDGGIFGPAREDL--VI 198

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYL 203
           + I  +IA A+    + GE   +LRY +GQ+Y  H+D        P + +QR  + L+YL
Sbjct: 199 QAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCL------PHVRNQRAWTMLIYL 252

Query: 204 SDVEEGGETMFP 215
           ++   GGET+FP
Sbjct: 253 NEGYAGGETIFP 264


>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 548

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 71/155 (45%), Gaps = 20/155 (12%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TR 127
           +  + LS  PR     NF   E+  SII  A   L  +Q A R   +   TKG     TR
Sbjct: 205 VVLETLSHSPRVFSLYNFMDMEEADSIIEDA---LGMTQEAYRLKRSSTGTKGKAISKTR 261

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTH---GEAFNVLRYEIGQKYDSHYDAFN 184
           TS   F++     T   + ++ +I +   + + H    +   VLRY   Q Y +H+D   
Sbjct: 262 TSDNAFVT----HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLE 317

Query: 185 PAEYGPQMSQ-----RLASFLLYLSDVEEGGETMF 214
            AE     S+     R A+ +LY +DV EGGET+F
Sbjct: 318 SAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVF 352


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/105 (34%), Positives = 49/105 (46%), Gaps = 8/105 (7%)

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G         RTS  TFI  +  K  +L  I+ ++A  T L     E   +  Y IG  Y
Sbjct: 362 GNNASVVSNARTSQFTFIPKTRHK--VLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHY 419

Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
             H D F+P  +        +M  R+A+ L YL+DVE+GG T FP
Sbjct: 420 AQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDVEQGGGTAFP 464


>gi|224012377|ref|XP_002294841.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969280|gb|EED87621.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1164

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 67/136 (49%), Gaps = 6/136 (4%)

Query: 82   PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
            P  +   NF + E+   +I   K++    Q      + ++    T +S+       +D  
Sbjct: 937  PWLVSLENFLTEEEAAYLIEVGKRQ----QYQRSDYKEIDPEHRTSSSAWCRRDCWKDDA 992

Query: 142  GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFL 200
             +  +++ +IA+ T           +LRYE GQK++ H D   P     ++  QRL +FL
Sbjct: 993  TVSSVVD-RIAKVTKSETKQLSNLQILRYEEGQKFNQHSDFSRPIRRLKRVQGQRLMTFL 1051

Query: 201  LYLSDVEEGGETMFPF 216
            +YLSDVEEGGET FP+
Sbjct: 1052 IYLSDVEEGGETSFPY 1067


>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
 gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
          Length = 535

 Score = 62.4 bits (150), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 68/149 (45%), Gaps = 9/149 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       +   +S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
              + S  +    +L+ H +   + L   + E   V  Y IG  Y+ H+D+F P  +   
Sbjct: 376 ASFNYS--RNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
            G     R+A+ + YLSDVE GG T FPF
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF 461


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 48/105 (45%), Gaps = 8/105 (7%)

Query: 117 GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKY 176
           G         RTS  TFI AS  K  +L  I+ ++A  T L   + E      Y IG  Y
Sbjct: 362 GHNESLVSNVRTSQFTFIPASAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHY 419

Query: 177 DSHYDAFNPAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
             H D F    +       P+M  R+A+ L YLSDV +GG T FP
Sbjct: 420 GQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFP 464


>gi|402824057|ref|ZP_10873445.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
 gi|402262407|gb|EJU12382.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. LH128]
          Length = 210

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 64/130 (49%), Gaps = 13/130 (10%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F SAE+C  ++   +   +PS +A   G+        RTSS   +S       +++ +  
Sbjct: 34  FLSAERCAQLVDLIETNNRPSTIADYNGD-----DAFRTSSTCDLSRD---YPVVDELAQ 85

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYLS 204
            ++R + +   H E     RYE+GQ++ +H D F P      +Y     QR  +F++YL+
Sbjct: 86  ALSRLSGIDLAHAEPLQGQRYEVGQEFKAHTDYFEPDSADFEKYCKVPGQRTWTFMIYLN 145

Query: 205 DVEEGGETMF 214
           DVE GG T F
Sbjct: 146 DVEAGGATRF 155


>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 188

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 69/151 (45%), Gaps = 14/151 (9%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
           VL+  P      NF +  +C+ +I  A+    P+ +    G+       +RTSS  ++S 
Sbjct: 1   VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVV---GKGAGEVSPSRTSSTCYLSR 57

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQ 191
            +     L  +  K++  T  P  H E   V RY   Q+Y  HYDAF+        +   
Sbjct: 58  ED-----LPDLMRKVSSLTGKPIEHCELPQVGRYFPSQQYLQHYDAFDLGTEDGLRFAAN 112

Query: 192 MSQRLASFLLYLSDVEEGGETMFP-FEVDLQ 221
             QR  + LLYL+DV  GG T FP   +D+Q
Sbjct: 113 GGQRTITVLLYLNDVARGGATRFPALNLDVQ 143


>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
           anophagefferens]
          Length = 148

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 7/94 (7%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---A 182
           +RTS   + + + +       +  +I   T +P+ + E+F VLRY  GQ+Y +H+D    
Sbjct: 3   SRTSENAWCTGACESNRATRAVMARIEEVTGVPKENYESFQVLRYTHGQQYRAHHDMSRG 62

Query: 183 FNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
            N    GP    R+ +F +Y SDVE+GGET FP 
Sbjct: 63  DNALACGP----RIYTFFMYFSDVEKGGETEFPM 92


>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
 gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
          Length = 376

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 78/162 (48%), Gaps = 27/162 (16%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG----- 125
           +S+P +V         + +  S  +C+ +I      LKPS +       V+   G     
Sbjct: 174 KSLPIEV---------YESILSEYECRYLITKFNALLKPSMV-------VDPVTGRGKID 217

Query: 126 -TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF- 183
             RTS    I  +     I   ++  I++ T   + +GEA N+LRY  GQ+Y  HYD   
Sbjct: 218 SVRTSYVAVIEPAHCD-WITRKLDKTISQITHTLRQNGEALNLLRYSPGQQYKPHYDGLN 276

Query: 184 --NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
             N A       QR+ + L+YL+ + EGGET+FP ++D++I+
Sbjct: 277 EINDALMFKDGKQRIKTALVYLNTISEGGETLFP-KLDIRIA 317


>gi|357542083|gb|AET84843.1| hypothetical protein MPXG_00045 [Micromonas pusilla virus SP1]
          Length = 196

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 65/129 (50%), Gaps = 12/129 (9%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           F NF ++ +   +I  AKK L PS ++  + +  ES + + T+  +F         I+  
Sbjct: 24  FRNFITSGERAHVIEEAKKNLTPSTVST-EHKLDESVRKSETAWLSF------DDPIIRG 76

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDV 206
           I  K  R T  P  + E   VLRYE G  Y  H D    A+     +QR+ +F+L L+D 
Sbjct: 77  IAEKCIRYTDRPLINCEKLQVLRYEEGGHYIPHQDILRNAK-----NQRMYTFILALNDD 131

Query: 207 EEGGETMFP 215
            EGGET+FP
Sbjct: 132 YEGGETVFP 140


>gi|334140935|ref|YP_004534141.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
 gi|333938965|emb|CCA92323.1| 2OG-Fe(II) oxygenase [Novosphingobium sp. PP1Y]
          Length = 209

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 61/131 (46%), Gaps = 13/131 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F    QC ++IA  +   +PS +A   G+ V      RTSS   +S        L    
Sbjct: 32  DFLDTAQCDALIALIEAEHRPSTVANYNGDDV-----FRTSSTCDLSPDVPAVAALA--- 83

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
            K+   + +   H E     RYE+GQ++ +H D F P      +Y     QR  +F++YL
Sbjct: 84  RKLCDISGIDPAHAEPLQGQRYEVGQEFKAHTDYFEPNNSDFEKYCSVSGQRTWTFMIYL 143

Query: 204 SDVEEGGETMF 214
           +DV+ GG T F
Sbjct: 144 NDVDAGGATRF 154


>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
 gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
           'English Channel 673']
 gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
 gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
           'English Channel 673']
          Length = 263

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 65/135 (48%), Gaps = 16/135 (11%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + +F S+++C  I+A  K +L PS+LA        S    RTSS   ++   +K  +++ 
Sbjct: 85  YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-------SQRLASF 199
           ++++I     L    GE      Y +G+ Y  HYD F P    PQ         QR  + 
Sbjct: 138 VDNRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195

Query: 200 LLYLSDVEEGGETMF 214
           ++YL+D  +GG T F
Sbjct: 196 MIYLNDECDGGHTRF 210


>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 215

 Score = 61.2 bits (147), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 16/137 (11%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F   ++   I+  +   L PS + L+ G         RTS+  ++ +S     +++ I+ 
Sbjct: 11  FLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHP--VVQTIDK 68

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------------EYGPQMSQRLA 197
           + A    +P +H E+  VLRYE  Q YD H D F+              EYG     R+ 
Sbjct: 69  RTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYG--YKNRMI 126

Query: 198 SFLLYLSDVEEGGETMF 214
           +   Y+SDV +GG T F
Sbjct: 127 TVFWYMSDVAKGGHTNF 143


>gi|302849869|ref|XP_002956463.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
           nagariensis]
 gi|300258161|gb|EFJ42400.1| hypothetical protein VOLCADRAFT_107241 [Volvox carteri f.
           nagariensis]
          Length = 965

 Score = 61.2 bits (147), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 68/142 (47%), Gaps = 22/142 (15%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGT 132
            +VL+  P  +    F SA +C  I+ +A     +K S + +  G  V+ T+  RTSS  
Sbjct: 744 LRVLNIDPPVITVEGFLSAPECDGIVRSAADSGLMKQSGVGV-SGYQVKDTENVRTSSTL 802

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
             +A   +T            A  LPQ       V RY+ GQ + +H DAF  A  G + 
Sbjct: 803 AATAEPGQT------------AFELPQ-------VARYQPGQHFLTHEDAFPAAVVGSKG 843

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
            QR A+ L+YL+D E+GG T F
Sbjct: 844 YQRRATLLVYLNDCEQGGATKF 865


>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 221

 Score = 61.2 bits (147), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 58/124 (46%), Gaps = 14/124 (11%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           + + NF S  +C+ II  A  ++K S +   +   V      RTS GTF+    D   ++
Sbjct: 1   MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGV--VDDIRTSYGTFLRRVPDP--VI 56

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLS 204
             IEH++A  + LP +H E   VLRY    KY  H D            +R+A+ L+YL 
Sbjct: 57  AAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHIDGL----------ERVATVLIYLG 106

Query: 205 DVEE 208
             E 
Sbjct: 107 QAER 110


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score = 61.2 bits (147), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 69/137 (50%), Gaps = 6/137 (4%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFISASEDK 140
           P  + + +  SA++ + +   A  R++ S +  L  G+  +S    R S   +++     
Sbjct: 331 PYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKS--AFRVSKNAWLAYESHP 388

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASF 199
           T  +E +   +  AT L  T+ E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ 
Sbjct: 389 T--MEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATA 446

Query: 200 LLYLSDVEEGGETMFPF 216
           + YLSDVE+GG T FPF
Sbjct: 447 IFYLSDVEQGGATAFPF 463


>gi|90023340|ref|YP_529167.1| response regulator receiver domain-containing protein
           [Saccharophagus degradans 2-40]
 gi|89952940|gb|ABD82955.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
          Length = 269

 Score = 60.8 bits (146), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 77/146 (52%), Gaps = 11/146 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      +F +  +   II  A  +++ ++++  + E +ES    RT S  +++   +K 
Sbjct: 63  PSVTICEDFLTQAEVFQIIKAAGDKMQRARVSSGK-EGIESA--GRTGSNCWVAHDHNK- 118

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRL 196
            +   +  +I++   +   + E+F V+ Y + Q+Y SH+DA+    E G +      QRL
Sbjct: 119 -VTHALAKRISKLVGISLQNAESFQVIHYGVSQEYSSHFDAWEFNTERGERCMARGGQRL 177

Query: 197 ASFLLYLSDVEEGGETMFPFEVDLQI 222
            + L+YL+DV  GG T FP E+DL++
Sbjct: 178 VTCLIYLNDVPAGGGTGFP-ELDLEV 202


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 60.8 bits (146), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 11/145 (7%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P  + + +     +   I      R+  + + L    TV +    RTS  TFI+ +
Sbjct: 324 LSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMVTLTNQSTVSNV---RTSQITFIAKT 380

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQ 191
           E +  +L+ I+ ++A  T L   + E      Y IG  Y  H D F    +        +
Sbjct: 381 EHE--VLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTE 438

Query: 192 MSQRLASFLLYLSDVEEGGETMFPF 216
           M  R+A+ L YLSDV +GG T FP+
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPY 463


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score = 60.8 bits (146), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 11/152 (7%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           I  +  + LS  P  + + +     +  +I      +LK + +     E+V S    RTS
Sbjct: 290 IAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATIT-STNESVVSN--VRTS 346

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY- 188
             TF+  +EDK  +L  I+ ++A  T     + E      Y IG  Y  H D F    + 
Sbjct: 347 QFTFLPVTEDK--VLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFD 404

Query: 189 -----GPQMSQRLASFLLYLSDVEEGGETMFP 215
                 P+M  R+A+ L YLSDV +GG T FP
Sbjct: 405 AGLVSSPEMGNRIATVLFYLSDVTQGGGTAFP 436


>gi|307103831|gb|EFN52088.1| hypothetical protein CHLNCDRAFT_139357 [Chlorella variabilis]
          Length = 1038

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 37/90 (41%), Positives = 52/90 (57%), Gaps = 4/90 (4%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLP-QTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           RTS GTF++ ++D+  ++  IEH++A  T LP +  G      R+  G  +D      NP
Sbjct: 11  RTSWGTFLTRAQDE--VVYAIEHRVANWTHLPVENAGGVLQGKRFHYGAHWDDLDLDENP 68

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
              G   S R+A+ L+YLSD EEGGET FP
Sbjct: 69  DGLG-GGSVRVATVLIYLSDAEEGGETAFP 97


>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 193

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 70/145 (48%), Gaps = 13/145 (8%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + LS  PRA    NF +  +   I+   +K+       +++  T      TRTSS T+++
Sbjct: 2   KALSCAPRAFQVENFLTDVEADHIVGLVQKKND-----MQRSSTNGHISETRTSSTTWLA 56

Query: 136 ASEDKTGILELIEHKIARA-----TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
              D   +++ I  ++A        ML +   E   ++ Y +GQ+Y +H+D F   +  P
Sbjct: 57  RHSDP--VIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHD-FGYPKGDP 113

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               R  +F +YL+DV  GG+T FP
Sbjct: 114 GSPSRSINFCMYLNDVPAGGQTSFP 138


>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 267

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 74/149 (49%), Gaps = 7/149 (4%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q    +VLS  PR + FP+F +  +C+   + ++++L  +++ L  G         RT+
Sbjct: 46  LQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYL-GGPPEGGFSLRRTN 104

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH--YDAFNPAE 187
              ++  S+D   +L  +  +IA AT L  T  E + V  Y +G  Y  H  Y  F  A+
Sbjct: 105 KVAWM--SDDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQ 162

Query: 188 YGPQMS--QRLASFLLYLSDVEEGGETMF 214
                S   RLA+ L+YL+DV  GG T F
Sbjct: 163 GDIYKSSGNRLATMLIYLADVAGGGATAF 191


>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
           'Balearic Sea AD45']
 gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
           'Balearic Sea AD45']
          Length = 263

 Score = 60.5 bits (145), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 64/135 (47%), Gaps = 16/135 (11%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + +F S+++C  I+A  K +L PS+LA        S    RTSS   ++   +K  +++ 
Sbjct: 85  YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
           ++ +I     L    GE      Y +G+ Y  HYD F P    PQ         QR  + 
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKAHCLSRGQRTWTC 195

Query: 200 LLYLSDVEEGGETMF 214
           ++YL+D  +GG T F
Sbjct: 196 MIYLNDECDGGHTRF 210


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score = 60.5 bits (145), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 16/133 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+ + + +  S  + +++   A+  L  SQ     G  V S    RTS   F+    D+ 
Sbjct: 53  PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVISE--IRTSQSVFL----DEV 102

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           G +  I  +IA  T L     E  +V  Y IG +Y  H+DA      G  +++R A+FL+
Sbjct: 103 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDA------GGDVNERTATFLI 156

Query: 202 YLSDVEEGGETMF 214
           Y+SDVE GG T+F
Sbjct: 157 YMSDVEVGGATVF 169


>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 263

 Score = 60.5 bits (145), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 64/135 (47%), Gaps = 16/135 (11%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           + +F S+++C  I+A  K +L PS+LA        S    RTSS   ++   +K  +++ 
Sbjct: 85  YDDFLSSQECDDIVALTKDKLAPSKLA-----GAASADDIRTSSTCELAFLGNK--LVKD 137

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-------QRLASF 199
           ++ +I     L    GE      Y +G+ Y  HYD F P    PQ         QR  + 
Sbjct: 138 VDSRIVSTLSLGVGEGEVIQAQHYNVGEYYKPHYDFFPPGS--PQYKTHCLSRGQRTWTC 195

Query: 200 LLYLSDVEEGGETMF 214
           ++YL+D  +GG T F
Sbjct: 196 MIYLNDECDGGHTRF 210


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score = 60.1 bits (144), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLS RP  + + +F S  + + I   A+  L+ S +A   G+  ++T   R S   ++ 
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVA--TGDK-QATAEYRISKSAWLK 392

Query: 136 ASEDKTGILELIEHKIARATML--PQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            S   T  +  ++ KI+  T L     HGE   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 393 GSAHST--VSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 450

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 451 KTGNRVATFMIYLSSVEAGGSTAFIY 476


>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
 gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
           adhaerens]
          Length = 495

 Score = 60.1 bits (144), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 69/145 (47%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSI--IATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           IP + +S  P  + + +  +  Q ++I  I+ +K    P+   L  G   E+T+ +    
Sbjct: 290 IPVEEISLDPFIVIYYDIINDHQIETIKKISPSKSNKSPNHAMLCSGIKSEATQVSIFCC 349

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
            T++  + D   ++E I       T L   + E   V  Y IG  Y  HYD+   A   P
Sbjct: 350 STWLEDAYDP--VVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDP 407

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
              QRLA+ + YLS+VE GG T+FP
Sbjct: 408 L--QRLATMMFYLSNVEIGGATIFP 430


>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
          Length = 487

 Score = 60.1 bits (144), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 12/150 (8%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            + +S  P       F   ++   ++  +   L PS + L+ G         RTS+  ++
Sbjct: 268 METISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWL 327

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF------NPAEY 188
            +S     +++ I+ + A    +P +H E+  VLRYE  Q YD H D F      N A+ 
Sbjct: 328 ESSSHP--VVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNSADV 385

Query: 189 GPQMSQ----RLASFLLYLSDVEEGGETMF 214
             ++      R+ +   Y+SDV +GG T F
Sbjct: 386 LKKIEHGYKNRMITVFWYMSDVAKGGHTNF 415


>gi|224013910|ref|XP_002296619.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968971|gb|EED87315.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 541

 Score = 59.7 bits (143), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 68/147 (46%), Gaps = 9/147 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKR--LKPSQLALRQ--GETVESTKGTRTSSGTFISA- 136
           P  + F NF S EQ  ++IA   K+   + + + +    G   +     RTS   +    
Sbjct: 340 PWIVVFENFVSDEQATALIAAGAKKGYERSADVGIENPDGSHEDDVNDDRTSHNAWCDDE 399

Query: 137 -SEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQR 195
              +   I  +IE +IA  T     + E   +L+YE GQ Y  H+D     E  P    R
Sbjct: 400 LCNNDPVIAPVIE-RIASVTKTSVNNSEFLQLLQYEPGQYYKQHHDYIEYQEDMP-CGVR 457

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQI 222
           + +  LYL+DVEEGG T FP  +D+ I
Sbjct: 458 MLTLFLYLNDVEEGGGTHFPL-LDITI 483


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score = 59.7 bits (143), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 8/143 (5%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  +VL   P  + F +  S+ +   +   A+  L+ S + ++    V+     R S+GT
Sbjct: 310 IKMEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERS-MVVKYRANVQGKH--RISAGT 366

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    +   +   IE +IA    L     E F V+ Y IG +Y +H+D F         
Sbjct: 367 WVERKYN--NLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVE--- 421

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             RLA+ L Y++DVE+GG T+FP
Sbjct: 422 DNRLATVLFYMNDVEQGGATVFP 444


>gi|356980146|gb|AET43625.1| hypothetical protein MPWG_00137 [Micromonas pusilla virus PL1]
          Length = 196

 Score = 59.7 bits (143), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 65/134 (48%), Gaps = 14/134 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR   F NF + ++   II  AK  LKPS ++  + +  E  + + T+   F        
Sbjct: 21  PRV--FHNFITPDERMHIIRKAKSELKPSTVSTER-KVDEKVRKSETAWLDF------DD 71

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            I++ +  +  R T  P  + E   VLRYE G  Y  H D    A      +QR+ +F+L
Sbjct: 72  PIVKGVADRCLRYTDRPLINCEKLQVLRYEEGGHYIPHQDIIANA-----TNQRMYTFIL 126

Query: 202 YLSDVEEGGETMFP 215
            L+D  EGGET+FP
Sbjct: 127 ALNDDYEGGETVFP 140


>gi|313768105|ref|YP_004061536.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
 gi|312599712|gb|ADQ91733.1| hypothetical protein BpV1_106c [Bathycoccus sp. RCC1105 virus BpV1]
          Length = 197

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 69/135 (51%), Gaps = 13/135 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR L   N  S ++C+ I   A K+L+ S +++ +    +  +  R S   ++ ASED 
Sbjct: 23  KPRVL--KNVLSEDECKHIQDIASKKLQTSTVSMSR----DIDEKIRKSETAWLKASED- 75

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             +++ +  K    T  P  + E   VL+Y+ G  Y  H D F   +     ++R+ +F+
Sbjct: 76  -PVVDKLIRKCVSMTDRPLHNCEDLQVLKYKPGGFYKPHQDCFKNDK-----NKRMYTFI 129

Query: 201 LYLSDVEEGGETMFP 215
           + L+D  EGGET FP
Sbjct: 130 IALNDEYEGGETEFP 144


>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
 gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
          Length = 535

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       +   +S+  +A+  +K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSNDSESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
              + S  K    +L+ H +   + L   + E   V  Y IG  Y+ H+D+F P  +   
Sbjct: 376 ASFNYS--KNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
            G     R+A+ + YLSDVE GG T FPF
Sbjct: 433 EGDLHGNRIATGIYYLSDVEAGGGTAFPF 461


>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
 gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
          Length = 314

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 64/126 (50%), Gaps = 8/126 (6%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F+SAE C  +   +  RL+PS + L            RTS G  +S  E+   ++ ++  
Sbjct: 138 FSSAE-CAYLQQMSAPRLRPSTI-LDPQTGARRPDPVRTSVGAALSPVEEDL-VVGMLNR 194

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
           +IA AT   +  GE  ++LRY   Q+Y  H+DA    E     +QR  + ++YL+   EG
Sbjct: 195 RIAAATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE-----NQRSHTLIVYLTADYEG 249

Query: 210 GETMFP 215
           GET FP
Sbjct: 250 GETAFP 255


>gi|296284739|ref|ZP_06862737.1| hypothetical protein CbatJ_14013 [Citromicrobium bathyomarinum
           JL354]
          Length = 210

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 67/150 (44%), Gaps = 17/150 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +PF  +      L    F   + C  +IA      +PS LA   G+        RTS
Sbjct: 17  MQRLPFPKVE----LLQMRQFLDPDFCGELIAMIDADRRPSTLADHDGDMY-----FRTS 67

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---- 185
               +   + +T  LE +   +A  + +   HGE     RY +GQ++ +H D FNP    
Sbjct: 68  ETCDLPMDDPRTQRLEAM---LAELSGIDPRHGEPLQGQRYAVGQEFKAHCDYFNPDGQD 124

Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
             +Y     QR  +F++YL++ E GG T F
Sbjct: 125 WEKYCSVAGQRTWTFMIYLNEPEAGGATRF 154


>gi|284035817|ref|YP_003385747.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
 gi|283815110|gb|ADB36948.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
          Length = 328

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/159 (25%), Positives = 79/159 (49%), Gaps = 14/159 (8%)

Query: 64  ETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST 123
           +T D SI +  ++    R      P+F SA++C  II  A+++   ++  L   +   + 
Sbjct: 136 DTDDESIAAAVYK----RECVQIHPHFFSADECAYIIQYAEEKTLFTRSQLEYDDNTVNE 191

Query: 124 KGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF 183
             TRTS   F+   + +  + + I  ++A +  +   + E    +RY  GQ++  H+D+ 
Sbjct: 192 SDTRTSYSAFL--KDRQHPVFQAIYERVAASLKVDLNYIEPLQCVRYGEGQQFKPHFDSM 249

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           +        + RL + L+YL+D   GGET FP E+++ +
Sbjct: 250 S-------ANHRLHTMLVYLNDDFVGGETYFP-ELNMNV 280


>gi|397571457|gb|EJK47807.1| hypothetical protein THAOC_33450 [Thalassiosira oceanica]
          Length = 861

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 40/146 (27%), Positives = 63/146 (43%), Gaps = 18/146 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----------TRTSS 130
           P  +   +F + E+C  +I         S+L   Q     +T             +RTS+
Sbjct: 661 PWVITIEDFLTGEECDVLIKHG------SELGYEQSTQYSNTADVGGSHEFIESESRTST 714

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
            TF     D   I+  +  +I   + +   H E   ++RYE+GQ Y  H+D  +  E  P
Sbjct: 715 NTFCQGDCDADPIVRGVIDRIVEMSGITANHYEKLQLVRYEVGQYYKEHHDYASAHEELP 774

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
               R+ +F LYL+D  EGG T F +
Sbjct: 775 -FGPRILTFFLYLNDDYEGGGTSFNY 799


>gi|294889729|ref|XP_002772943.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
           ATCC 50983]
 gi|239877523|gb|EER04759.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
           ATCC 50983]
          Length = 383

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 70/150 (46%), Gaps = 24/150 (16%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLAL--------RQGETVESTKGTRT 128
           V+   P+    P+F + E+C+ +I+ A+ + +PS +          +  + V      RT
Sbjct: 170 VICRSPKVRLVPDFLTPEECEYMISLAEGKWRPSTVGRSSSSISDGKSDKYVNKRSKGRT 229

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAE 187
           SS   +  S+D   ++  IE + A     P  H E  N+LRYE G+ +  H+D AF P  
Sbjct: 230 SSSFMLLHSQDD--VVAEIERRAASLVGFPADHVERLNMLRYESGEFFGQHHDGAFRP-- 285

Query: 188 YGPQMSQRLASFLLYLSDVEE--GGETMFP 215
                     +  + L+D+    GGET+FP
Sbjct: 286 ---------WTVFITLNDIPRGAGGETLFP 306


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 72/154 (46%), Gaps = 15/154 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ I  +    +P  + + +  S ++ +++   AK R K +   +R  +T E        
Sbjct: 324 IQPIKMEEALLKPMIVVYHDVMSDDEIETVKKMAKPRFKRA--TIRNSKTGELEPANYRI 381

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
           S +    SE+   IL+ +  ++   T L  +  E   V+ Y IG  Y+ H+D        
Sbjct: 382 SKSAWLKSEEHDHILK-VTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTE 440

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           AF    +G     R+A++L Y+SDVE GG T+FP
Sbjct: 441 AFKELGWG----NRIATWLFYMSDVEAGGATVFP 470


>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
 gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 69/143 (48%), Gaps = 9/143 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
           LS  P  + + N  S  +    IA  ++  +P   ++  GE   S K   RT+ G +I  
Sbjct: 322 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 377

Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
                    +++ I  +I   T L   HG+   +++Y  G  YD+H+D  N +    Q +
Sbjct: 378 KNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 437

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A+ L YL+DV+ GG T+FP
Sbjct: 438 GDRMATVLFYLNDVKHGGSTVFP 460


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 70/145 (48%), Gaps = 7/145 (4%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + + +  +  +  ++   +K  +K   +   + +        RTS+ 
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 371

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGP 190
            ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  H D F  P   G 
Sbjct: 372 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQHRGG 429

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               R+A+ L YLSDV +GG T+FP
Sbjct: 430 --GDRIATVLFYLSDVPQGGATLFP 452


>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
 gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
          Length = 473

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 6/143 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI   AK  L  +    + G   E     RT+ GT
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKDGSYEEDP--ARTTKGT 331

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    + + +++ +       T L     + F VL Y IG  Y +H+D     E G   
Sbjct: 332 WLV---ENSKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADTEMG-NF 387

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
           S R+A+ + YLSDV +GG T+FP
Sbjct: 388 SNRIATAVFYLSDVPQGGATIFP 410


>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
          Length = 546

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 46/97 (47%), Gaps = 8/97 (8%)

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS  TFI  +  K  +L  I+ ++A  T L   + E      Y IG  Y  H D F 
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430

Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
              +       P+M  R+A+ L YLSDV +GG T FP
Sbjct: 431 QTTFDAGLVSSPEMGNRIAAVLFYLSDVAQGGGTAFP 467


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/108 (36%), Positives = 52/108 (48%), Gaps = 13/108 (12%)

Query: 119 TVESTKGT-----RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIG 173
           TV   KG+     RTS  TFI  +  K  +L+ I+ ++A  + L   + E      Y IG
Sbjct: 359 TVIGAKGSEVSKVRTSQFTFIPKTRHK--VLQTIDQRVADMSNLNMDYAELHQFANYGIG 416

Query: 174 QKYDSHYDAF------NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
             Y  H D F      N     P+M  R+A+ L YLSDV +GG T FP
Sbjct: 417 GHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFP 464


>gi|116181314|ref|XP_001220506.1| hypothetical protein CHGG_01285 [Chaetomium globosum CBS 148.51]
 gi|88185582|gb|EAQ93050.1| hypothetical protein CHGG_01285 [Chaetomium globosum CBS 148.51]
          Length = 304

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 10/138 (7%)

Query: 85  LYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGIL 144
           +Y  N  +  +  S++ATA+ +  PSQ+  + G   ++   T +S+G      +D     
Sbjct: 78  IYIHNILTPSEITSLLATAEPKFAPSQVT-KHGRKQQTNDRTSSSAGL---PRDDPAVRC 133

Query: 145 ELIEHKIARATMLPQTHGEAF--NVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLAS 198
            L   +    TML     E     ++RY  GQ+++ H+D ++  ++ P  +     R+AS
Sbjct: 134 VLDRARTFLGTMLRDGWDEMGPPQLVRYSEGQRFNVHHDWYDVPQWAPDGTTRKWNRVAS 193

Query: 199 FLLYLSDVEEGGETMFPF 216
           F   L D  EGGET FPF
Sbjct: 194 FFAILEDQCEGGETHFPF 211


>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
 gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
          Length = 547

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 46/97 (47%), Gaps = 8/97 (8%)

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             RTS  TFI  +  K  +L  I+ ++A  T L   + E      Y IG  Y  H D F 
Sbjct: 373 NVRTSQFTFIPVTAHK--VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFY 430

Query: 185 PAEY------GPQMSQRLASFLLYLSDVEEGGETMFP 215
              +       P+M  R+A+ L YLSDV +GG T FP
Sbjct: 431 QTTFDAGLVSSPEMGNRIATVLFYLSDVAQGGGTAFP 467


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 12/148 (8%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVEST-KGTRTSS 130
           PF+V  LS  P   YF +  S ++ + II   K ++  S++    G+T  ST    RTS 
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSDIRTSQ 391

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
            T++    +    L  I+ ++   T L     E   ++ Y IG +Y+ H+D  + AE  +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
           G +   RL + L YL+DV  GG T FPF
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF 476


>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
          Length = 535

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++   S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
              + S  +    +L+   +   + L   + E   V  Y IG  Y+ H+D+F P  +   
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
            G     R+A+ + YLSDVE GG T FPF
Sbjct: 433 EGDLHGNRMATGIYYLSDVEAGGGTAFPF 461


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/140 (30%), Positives = 58/140 (41%), Gaps = 11/140 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + + +     +   I    + RLK + +    G         RTS  TFI  S  K 
Sbjct: 302 PLLVLYHDVIYQSEIDVIRKLTENRLKRATVT---GHNESVVSNVRTSQFTFIPVSAHK- 357

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY------GPQMSQR 195
            +L  I+ ++A  T L   + E      Y IG  Y  H D F            P+M  R
Sbjct: 358 -VLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNR 416

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L YLSDV +GG T FP
Sbjct: 417 IATVLFYLSDVSQGGGTAFP 436


>gi|357010489|ref|ZP_09075488.1| response regulator receiver domain-containing protein
           [Paenibacillus elgii B69]
          Length = 397

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 76/155 (49%), Gaps = 11/155 (7%)

Query: 74  PFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTF 133
           P  V    P    +    S  +C+ +I  A+ +L+P+++ + + E V S    R S   F
Sbjct: 4   PSVVFHEEPLVASYEQVVSGPECRQLIELARHQLEPAKV-IGEKEVVASE--FRKSE--F 58

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-----PAEY 188
                D   ++  +  ++A     P  + E+  + RY +G ++ +H+D ++        +
Sbjct: 59  AWFHHDSHPLVREVSERLAALAGRPLHYAESLQIARYVVGGRFGAHFDTYDLNTVDGKRF 118

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
             Q  QRL + LLYL+ V+ GGET FP E++L I+
Sbjct: 119 YDQGGQRLYTALLYLNTVDAGGETYFP-ELNLDIA 152


>gi|89092696|ref|ZP_01165649.1| hypothetical protein MED92_15358 [Neptuniibacter caesariensis]
 gi|89083208|gb|EAR62427.1| hypothetical protein MED92_15358 [Oceanospirillum sp. MED92]
          Length = 441

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 63/140 (45%), Gaps = 11/140 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +    Y  NF S E+C  +I   +   +PS      G      K  RTS    +S  E  
Sbjct: 69  KAEVFYVNNFLSPEECAQMIELIQHHQRPSTTTNETGHY----KHYRTSKTCDLSLLE-- 122

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQR 195
           +  +  I+ +I +   +  ++ E      Y+IG+++  H D F P      E+     QR
Sbjct: 123 STFVAEIDQRICKMLGIEPSYSEGIQGQWYDIGEEFKPHTDYFEPKSDEFLEHAEARGQR 182

Query: 196 LASFLLYLSDVEEGGETMFP 215
             +F++YL++ +EGG T FP
Sbjct: 183 TWTFMIYLNNTQEGGGTFFP 202


>gi|333894037|ref|YP_004467912.1| prolyl 4-hydroxylase subunit alpha [Alteromonas sp. SN2]
 gi|332994055|gb|AEF04110.1| Prolyl 4-hydroxylase subunit alpha [Alteromonas sp. SN2]
          Length = 373

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 61/135 (45%), Gaps = 11/135 (8%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLA---LRQGETVESTKGTRTSSGTFISASEDKTGI 143
           F +  S  +C  +I      L+PS +      QG         RTS    IS  E    +
Sbjct: 181 FESTLSGFECNYLITKFSALLQPSMVVDPITGQGRI----DKVRTSYVAIISP-EHCDWL 235

Query: 144 LELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYGPQMSQRLASFL 200
              I+  +A+AT      GE  N+LRY  GQ+Y  HYDA N    A+      QR  + +
Sbjct: 236 TRKIDKLVAKATKTRCCEGEVLNLLRYVPGQEYKPHYDALNRLHDAKTFEDGGQRTKTAI 295

Query: 201 LYLSDVEEGGETMFP 215
           +YL+ V EGG T FP
Sbjct: 296 IYLNTVNEGGNTTFP 310


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 69/139 (49%), Gaps = 10/139 (7%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P      NF S ++C++ +   K +++ +++ +   E+      +RT+   ++  S   +
Sbjct: 17  PIVYVVNNFLSDDECEAFVEMGKGKMERAKV-ISDDES--EFHASRTNDFCWLEHS--AS 71

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-----QRL 196
            ++  +  + +    +P  + E F ++ Y  G +Y  H+DAF+      Q +     QR+
Sbjct: 72  DVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRM 131

Query: 197 ASFLLYLSDVEEGGETMFP 215
            + L YL+DVEEGG T FP
Sbjct: 132 VTALAYLNDVEEGGATDFP 150


>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
 gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
          Length = 422

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 15/133 (11%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE-----L 146
           SA++C+ ++  A+  L+ SQ+      +   T   RTS G  +        ILE      
Sbjct: 242 SADECRLLMLLARPHLRASQVVDPNDASTHRTP-IRTSRGATLDP------ILEDFAARA 294

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYL 203
            + ++A    LP TH EA +VL Y  G+ Y +H D   P   A   P    RL +  +YL
Sbjct: 295 AQARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYL 354

Query: 204 SDVEEGGETMFPF 216
           +DV+ GGET FP 
Sbjct: 355 NDVDAGGETEFPV 367


>gi|219126074|ref|XP_002183290.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405046|gb|EEC44990.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 509

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 75/159 (47%), Gaps = 6/159 (3%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQ-- 116
           + H E+ D ++ +   +V    P  +   +F + E+C  +I     +  + S+    Q  
Sbjct: 298 MEHWESRD-TLTNTTTEVHDDGPWVVSLEDFLTPEECAVMIQLGGDQGYEQSKDVGEQKF 356

Query: 117 -GETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
            G     T   RTS+  +   + D+  +++ I  ++     +P  + E   +LRYE  Q 
Sbjct: 357 DGTYAAVTSNERTSTNAWCVGACDEHPVVQTIHQRMESLLNIPAVNYEHLQLLRYEETQF 416

Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           Y +H+D F P   G +   R+ +  LYL+DVEEGG T F
Sbjct: 417 YGNHHD-FIPFHVGRRQGPRILTVFLYLNDVEEGGGTHF 454


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 11/141 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + + +  S E+   +   AK RL+ + ++      +E+ +  R +   ++S  ED 
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQ-YRITKSAWLSGYEDP 384

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ------MSQ 194
             ++  +  +I   T L  +  E   V  Y IG +Y+ H+D     +Y P          
Sbjct: 385 --VVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR--KYEPDAFKKLGTGN 440

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A++L Y+SDVE GG T+FP
Sbjct: 441 RVATWLFYMSDVEAGGATVFP 461


>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
 gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
          Length = 539

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/177 (27%), Positives = 77/177 (43%), Gaps = 6/177 (3%)

Query: 43  RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
           R +LR      +E    L  G    ++      + L   P  +   +  SA     +   
Sbjct: 292 REELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDTAELQHL 351

Query: 103 AKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHG 162
           A+  L+ SQ+  R G     +   RTS GT    ++    I++ + H +A  + L     
Sbjct: 352 ARPELQRSQVYSRTGHE-HISANFRTSQGTTFEYTDHP--IMQKMSHHVAEISGLDMRSA 408

Query: 163 EAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM--SQRLASFLLYLSDVEEGGETMFPF 216
           E   +  Y IG  Y+ H D+F +  +Y   M  + RLA+ + YLS+VE GG T FPF
Sbjct: 409 EPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPF 465


>gi|334343683|ref|YP_004552235.1| procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
 gi|334100305|gb|AEG47729.1| Procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
          Length = 225

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 13/131 (9%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F SAE+C  +        +PS L      +  +    RTS+   +S  +    ++  +  
Sbjct: 48  FLSAEECAGLRDLIDAGAQPSSLF-----SGSANADYRTSASCNLSPWDP---LVSAVSD 99

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYLS 204
           +I   T +   HGE     RY  GQ+Y  H+D F   A Y P M     QR  + ++YLS
Sbjct: 100 RICALTGIAADHGETLQGQRYHPGQEYKPHWDYFPVTANYWPAMLKTGGQRSWTAMIYLS 159

Query: 205 DVEEGGETMFP 215
            VE GGET FP
Sbjct: 160 PVEAGGETHFP 170


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 69/138 (50%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + N  S ++ + I   AK RL  ++  +R  +T V +T   R S   ++   +D 
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRL--ARATVRDPKTGVLTTAPYRVSKSAWLEGEDDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             +++ +  +I   T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 393 --VIDRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFD-FSRRPFDSNLKVDGNRLA 449

Query: 198 SFLLYLSDVEEGGETMFP 215
           +FL Y+SDVE GG T+FP
Sbjct: 450 TFLNYMSDVEAGGATVFP 467


>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
 gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
          Length = 487

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 78/154 (50%), Gaps = 9/154 (5%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + F +     + + +   +K  L+ + + + +  + ++    RT++G
Sbjct: 295 PFKMELLSEDPYMVVFHDVIYESEIEHLNRISKPFLQRATVVV-EDNSEDTLIKFRTANG 353

Query: 132 TFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
            F+   +     ++L+E    R   +   Q + +AF  L+Y+ G  YD H D FN  +  
Sbjct: 354 AFLYRDKISPKDVQLVERIFQRMRDMSDLQINDDAFEYLKYDFGGHYDIHADYFNYTD-D 412

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
                R A+F++YL+DV  GG T+FP   D++I+
Sbjct: 413 QFTDDRFATFVIYLNDVARGGATVFP---DVEIA 443


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 69/146 (47%), Gaps = 6/146 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++   P  + + +  SA +   +   A   LK + +    G   E  K TRTS   
Sbjct: 321 LKMELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVK-TRTSKVA 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGP 190
           +   + ++  + E +  +IA  T       E    + Y +G  YD HYD FN   A    
Sbjct: 380 WFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLT 437

Query: 191 QMS-QRLASFLLYLSDVEEGGETMFP 215
           QM+  R+A+ L YL+DVE+GG T+FP
Sbjct: 438 QMNGDRIATVLFYLTDVEQGGATVFP 463


>gi|255641158|gb|ACU20856.1| unknown [Glycine max]
          Length = 195

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 53/94 (56%), Gaps = 3/94 (3%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +VL+W PR +   NF S E+C  + A A  RL  S +   + G+ ++S    RTSSG F+
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKS--DVRTSSGMFL 139

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVL 168
           +  E K  +++ IE +I+  + +P  +GE   VL
Sbjct: 140 NPQERKYPMVQAIEKRISVYSQIPIENGELMQVL 173


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 12/148 (8%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSS 130
           PF+V  LS  P   YF +  S ++ + II   K ++  S++    G+T  ST    RTS 
Sbjct: 336 PFKVEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI----GQTGNSTVSEIRTSQ 391

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--Y 188
            T++    +    L  I+ ++   T L     E   ++ Y IG +Y+ H+D  + AE  +
Sbjct: 392 NTWLWY--ENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNF 449

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
           G +   RL + L YL+DV  GG T FPF
Sbjct: 450 GWK-GNRLLTALFYLNDVPLGGATAFPF 476


>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
          Length = 197

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 70/135 (51%), Gaps = 13/135 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR L   N  S ++C+ I   A K+L+ S ++    ++ +  +  R S   ++ ASED 
Sbjct: 23  KPRVL--KNVLSEDECKHIQNIASKKLQTSTVS----KSRDIDESIRKSETAWLKASED- 75

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
             +++ +  K    T  P  + E   VL+Y+ G  Y  H D F P +     ++R+ +F+
Sbjct: 76  -PVVDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDTF-PDD----KNKRMYTFI 129

Query: 201 LYLSDVEEGGETMFP 215
           + L+D  EGGET FP
Sbjct: 130 IALNDEYEGGETEFP 144


>gi|383640592|ref|ZP_09952998.1| 2OG-Fe(II) oxygenase [Sphingomonas elodea ATCC 31461]
          Length = 221

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 60/131 (45%), Gaps = 13/131 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF  + QC +++A   +  +PS LA    +    T  T   +     A + K  ILELI 
Sbjct: 43  NFLDSGQCAALMARIDEHRRPSTLANAGDDYAFRTSETCDLAAEDPLAIDLKARILELIG 102

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYL 203
                   L   H E     RY +GQ++ +H D F+P      +Y     QR  + +LYL
Sbjct: 103 --------LDPDHAEPMQGQRYAVGQEFKAHTDYFDPGSVDFDKYCSVAGQRTWTVMLYL 154

Query: 204 SDVEEGGETMF 214
           ++VE GG T F
Sbjct: 155 NEVEAGGATRF 165


>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
          Length = 187

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/55 (47%), Positives = 37/55 (67%)

Query: 161 HGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           +GE+  +L YE G+KY+ HYD F+          R+A+ L+YLSDV +GGET+FP
Sbjct: 8   NGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFP 62


>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
 gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
          Length = 485

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 72/145 (49%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  + F +  S  +   I +++K ++ PS+  +      E  K   + S  
Sbjct: 302 IKTEILSIDPFVILFHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVW 360

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 361 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 417

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 418 GYIDRLATTLFYLNDVPQGGATHFP 442


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 14/150 (9%)

Query: 73  IPFQVLSWRPRALYFPNFASA---EQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +  +++   P  + + +  SA    Q Q +     KR    + + R+ E V+    TRTS
Sbjct: 286 LKMELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVK----TRTS 341

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              +   + ++  + E +  +IA  T       E    + Y +G  YD HYD FN A   
Sbjct: 342 KVAWFPDTFNE--LTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFN-ASTA 398

Query: 190 PQMSQ----RLASFLLYLSDVEEGGETMFP 215
             ++Q    R+A+ L YL+DVE+GG T+FP
Sbjct: 399 ANLTQMNGDRIATVLFYLTDVEQGGATVFP 428


>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
 gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
          Length = 310

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 71/141 (50%), Gaps = 11/141 (7%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRT---SSGTFI 134
           +SW+PR   +  F + E+C  +I+ A+   + S     +G+  +S +  R    +S T +
Sbjct: 61  VSWQPRVFVYKGFLTDEECDHLISLAQGTKETS-----EGKDDDSGRIERNRLFASSTSL 115

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
              +D   IL  IE +++  T+LP+ + +   V+ Y I +   +++D F           
Sbjct: 116 LNMDD--NILSRIEERVSAWTLLPKENSKPLQVMHYGI-EDAKNYFDYFGNKSAIISSEP 172

Query: 195 RLASFLLYLSDVEEGGETMFP 215
            +A+ + YLS+V +GGE  FP
Sbjct: 173 LMATLVFYLSNVTQGGEIFFP 193


>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 533

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 72/159 (45%), Gaps = 9/159 (5%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           GETG   +Q I  +  + +P  +   +         +IA AK RL+ S+      +    
Sbjct: 307 GETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRDLNDMIAFAKPRLEQSKTLCAADK---D 363

Query: 123 TKGTRTSSGTFISASEDKTG--ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHY 180
               RTSS T++   +      + + ++  +   T+  +   E + +  Y IG  Y  H+
Sbjct: 364 GPPPRTSSNTWLDDDDAPVAARVNQYLQSLLGLGTLYGKDEAEKYQLANYGIGGHYVPHH 423

Query: 181 D----AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           D    +   ++       R+A+ ++Y+SDVEEGG T+FP
Sbjct: 424 DYLEESLTSSKKHRLFGDRVATLMIYMSDVEEGGATVFP 462


>gi|219126281|ref|XP_002183389.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
           CCAP 1055/1]
 gi|217405145|gb|EEC45089.1| hypothetical protein PHATRDRAFT_48891 [Phaeodactylum tricornutum
           CCAP 1055/1]
          Length = 427

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 50/101 (49%), Gaps = 2/101 (1%)

Query: 115 RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQ 174
           R  +  E+  G RTS+  +      K      +  ++A  T +P+ + E   +L+YE  Q
Sbjct: 266 RSADVAETNSG-RTSTNAWCQHDCYKDPTARAVMDRVANITSIPEVNSEYLQMLQYEKSQ 324

Query: 175 KYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            Y +H D + P +       R+ +F  YLSDVEEGG T FP
Sbjct: 325 FYQTHSD-YIPYQVNRPTGVRILTFYFYLSDVEEGGGTNFP 364


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 72/144 (50%), Gaps = 10/144 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFI 134
           +VL+  P    + + AS  +   +I  AK R+  S+  +R  GE        RTS   ++
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRI--SRATIRDDGEP--QVSNARTSQNAWL 358

Query: 135 SASEDKTGILELIEHKIARATM-LPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEY-GPQ 191
            A +D+  ++  ++ ++   T  L Q   E   V  Y +G  Y +H+D A     Y G +
Sbjct: 359 DAGDDR--VVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLR 416

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
           +  R+A+ + YLSDVE GG T+FP
Sbjct: 417 VGNRIATVMFYLSDVEIGGATVFP 440


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 69/150 (46%), Gaps = 7/150 (4%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q    +V+  +P+ + F +     + + + A A  RL+ + +       +E  +  R S
Sbjct: 321 LQPAKEEVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAE-YRIS 379

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              ++S  ED   ++  + H+I + T L     E   V  Y +G  Y+ H+D     E  
Sbjct: 380 KSAWLS--EDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEIN 437

Query: 190 P----QMSQRLASFLLYLSDVEEGGETMFP 215
                    R+A+FL Y+SDVE GG T+FP
Sbjct: 438 AFKSLNTGNRIATFLFYMSDVEAGGATVFP 467


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 9/135 (6%)

Query: 84  ALYFPNFASAE--QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
            +Y    +SAE  + + +   + KR    + +L + E V+    TRTS   +   S +  
Sbjct: 330 VIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVK----TRTSKVAWFPDSYNSL 385

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFL 200
            +   +  +I   T    +  E   ++ Y +G  YD HYD FN  E    ++  R+A+ L
Sbjct: 386 TLR--LNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIATVL 443

Query: 201 LYLSDVEEGGETMFP 215
            Y+SDVE+GG T+FP
Sbjct: 444 FYMSDVEQGGATVFP 458


>gi|224011483|ref|XP_002295516.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209583547|gb|ACI64233.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 451

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/104 (34%), Positives = 52/104 (50%), Gaps = 5/104 (4%)

Query: 116 QGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQK 175
            G   E    TRTSS ++      K  + + +  +I   TM+PQT+ E+  +LRYE GQ 
Sbjct: 285 DGSYTEDVHSTRTSSNSWCLDKCMKDPVAKDVVDRIEHMTMIPQTNSESLQLLRYEEGQY 344

Query: 176 YDSHYDAFNPAEYGPQMSQRLASFLLYLSDVE----EGGETMFP 215
           Y  H+D     +  P    R+ +F +YL+  E    EGG T FP
Sbjct: 345 YGVHHDLIEHQKDRPP-GVRILTFYMYLNGNEDSGLEGGGTKFP 387


>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
          Length = 409

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++   S+  TA+ R+K S +    G    +    RTS G
Sbjct: 190 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 249

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
              + S  +    +L+   +   + L   + E   V  Y IG  Y+ H+D+F P  +   
Sbjct: 250 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 306

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
            G     R+A+ + YL+DVE GG T FPF
Sbjct: 307 EGDLHGNRMATGIYYLADVEAGGGTAFPF 335


>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
          Length = 535

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 9/149 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  L   P  +       ++   S+  TA+ R+K S +    G    +    RTS G
Sbjct: 316 PFKLEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY--- 188
              + S  +    +L+   +   + L   + E   V  Y IG  Y+ H+D+F P  +   
Sbjct: 376 ASFNYS--RNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSF-PENHIYQ 432

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFPF 216
            G     R+A+ + YL+DVE GG T FPF
Sbjct: 433 EGDLHGNRMATGIYYLADVEAGGGTAFPF 461


>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
 gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
          Length = 448

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 65/143 (45%), Gaps = 6/143 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI   AK RL  +    + G   E     RT+ GT
Sbjct: 274 LKMELLSLDPYMVLFHDVVSDKDIVSIRNMAKGRLARAVTVSKDGNYTEDPD--RTTKGT 331

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    + + +++ +       T       + F VL Y IG  Y  H D    AE     
Sbjct: 332 WLV---ENSKLIQRLSQLTQDMTNFEIHDADPFQVLNYGIGGFYGIHLDFLGEAELD-NF 387

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
           S R+A+ + YLSDV +GG T+FP
Sbjct: 388 SDRIATAVFYLSDVPQGGATIFP 410


>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
 gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 68/143 (47%), Gaps = 16/143 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPS----QLALRQGETVESTKGTRTSSGTFISAS 137
           PR  Y  NF SA++   ++A +   + PS      A  QG +  +   TRTS   F   +
Sbjct: 2   PRVFYVHNFLSADEADELVAFS---MAPSTGGTHKAWNQGGS-NAKLTTRTSMNAF-DIT 56

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQM 192
              +  ++    ++ R     +   +   +LRYE+GQ Y +H+D F     N   + P  
Sbjct: 57  TKLSFRIKRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDPSK 116

Query: 193 --SQRLASFLLYLSDVEEGGETM 213
             S R A+  LYLSDVE GG+T+
Sbjct: 117 GGSNRFATIFLYLSDVEVGGQTL 139


>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
          Length = 480

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 68/152 (44%), Gaps = 22/152 (14%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKG-------TRTSSGTF 133
           PR  Y  NF SA +    +  +     P ++A   G T ++  +G       TRTS   F
Sbjct: 205 PRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSENAF 264

Query: 134 ISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---------- 183
              ++    + +    ++ R     +   +   +LRY++GQ Y +H+D F          
Sbjct: 265 DITTKQSFDVKKRA-FRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSKDFNW 323

Query: 184 NPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           +P   G   S R A+  LYLSDV  GG+T+FP
Sbjct: 324 DPLSGG---SNRFATIFLYLSDVSYGGQTVFP 352


>gi|398385057|ref|ZP_10543083.1| 2OG-Fe(II) oxygenase superfamily enzyme [Sphingobium sp. AP49]
 gi|397721148|gb|EJK81697.1| 2OG-Fe(II) oxygenase superfamily enzyme [Sphingobium sp. AP49]
          Length = 225

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 61/133 (45%), Gaps = 15/133 (11%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL-I 147
           +F SAE+C ++ +      +PS L    G         RTS    +    D+   L L I
Sbjct: 47  DFLSAEECAALRSLIDDGAQPSTLFSGSGNAE-----YRTSHSCHL----DRHDPLVLAI 97

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-AEYGPQM----SQRLASFLLY 202
             +I   T L   +GE     RY  GQ+Y  H D F P A Y P M     QR  + ++Y
Sbjct: 98  SDRICALTGLDSDNGETLQGQRYTSGQEYKVHCDYFPPNASYWPDMLKTGGQRSWTAMIY 157

Query: 203 LSDVEEGGETMFP 215
           LS V+ GGET FP
Sbjct: 158 LSSVDAGGETHFP 170


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 19/141 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQG--ETVESTKGTRTSSGTFISASE 138
           R R   F  FAS E+C+ +    K+RL+ + +A   G  + VE     R S+  ++    
Sbjct: 337 RQRLQVFRQFASPEECRHLQHAGKRRLERA-VAWTDGRFQPVE----FRISTAAWLQPDH 391

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD----AFNPAEYGPQMSQ 194
           D   I++ I  +I  AT +   + EA  +  Y +G  Y+ H+D      NP        +
Sbjct: 392 D--AIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDHSSRGTNPD------GE 443

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           RLA+F++YL+ V++GG T FP
Sbjct: 444 RLATFMIYLNPVKQGGFTAFP 464


>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 750

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 75/169 (44%), Gaps = 45/169 (26%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           F +F SA +C  ++A A   L+ S++    G+  E     RTSS TF++  + +  ++  
Sbjct: 534 FDHFLSAVECDDLVAIAAPDLRRSRVT--DGKLSEG----RTSSSTFLTGCKQEEPLVRA 587

Query: 147 IEHKIARA----TMLP---------QTHG--------------------EAFNVLRYEIG 173
           IE ++ RA    T++          + HG                    E   V+RY  G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
           Q Y +HYD            +R A+F++YL+DV  GG T FP  V + +
Sbjct: 648 QMYTAHYDNKQGC------LRRTATFMMYLTDVHSGGATHFPRAVPVSM 690


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 66/133 (49%), Gaps = 16/133 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+ + + +  S  + +++   A+  L  SQ     G  V S    RTS   F+    ++ 
Sbjct: 308 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 357

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           G +  I  +IA  T L     E  +V  Y IG +Y  H+D       G ++++R A+FL+
Sbjct: 358 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 411

Query: 202 YLSDVEEGGETMF 214
           Y+SDVE GG T+F
Sbjct: 412 YMSDVEVGGATVF 424


>gi|404253277|ref|ZP_10957245.1| putative oxygenase [Sphingomonas sp. PAMC 26621]
          Length = 226

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 5/74 (6%)

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFLLY 202
           +  IA    +   HGE     RY +GQ + +H D FN A+ Y P+M     QR  + ++Y
Sbjct: 98  DEAIATLLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157

Query: 203 LSDVEEGGETMFPF 216
           L+DVEEGG T FP 
Sbjct: 158 LNDVEEGGATWFPL 171


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/149 (24%), Positives = 71/149 (47%), Gaps = 13/149 (8%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + + +  +  +  ++   +K  +K   +   + +        RTS+ 
Sbjct: 375 PFKTELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNS 434

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  H D F      PQ
Sbjct: 435 VWLTSHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFE----TPQ 488

Query: 192 MSQR-----LASFLLYLSDVEEGGETMFP 215
           +  R     +A+ L YLSDV +GG T+FP
Sbjct: 489 LEHRGGGDRIATVLFYLSDVPQGGATLFP 517


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 66/133 (49%), Gaps = 16/133 (12%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P+ + + +  S  + +++   A+  L  SQ     G  V S    RTS   F+    ++ 
Sbjct: 326 PKIIRYHDVISDTEIETLKDIARPELTRSQ----TGWGVIS--DIRTSQSVFL----EEV 375

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           G +  I  +IA  T L     E  +V  Y IG +Y  H+D       G ++++R A+FL+
Sbjct: 376 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT------GDEVNERTATFLI 429

Query: 202 YLSDVEEGGETMF 214
           Y+SDVE GG T+F
Sbjct: 430 YMSDVEVGGATVF 442


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 70/137 (51%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  + H++   T L     E   V  Y +G +Y+ H+D +  P + G +    R+A+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVAT 456

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 457 FLNYMSDVEAGGATVFP 473


>gi|226503309|ref|NP_001141668.1| uncharacterized protein LOC100273794 [Zea mays]
 gi|194705482|gb|ACF86825.1| unknown [Zea mays]
 gi|414587757|tpg|DAA38328.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
 gi|414587758|tpg|DAA38329.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 165

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 49/73 (67%), Gaps = 3/73 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRTSSGTFI 134
           +V+SW PR + F NF S+E+C  ++A A+ RL+ S +  +  G+ V+S    RTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSD--VRTSSGMFV 115

Query: 135 SASEDKTGILELI 147
           ++ E K+ +++L+
Sbjct: 116 NSEERKSPVVQLV 128


>gi|395492951|ref|ZP_10424530.1| putative oxygenase [Sphingomonas sp. PAMC 26617]
          Length = 226

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 5/74 (6%)

Query: 148 EHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFLLY 202
           +  IA    +   HGE     RY +GQ + +H D FN A+ Y P+M     QR  + ++Y
Sbjct: 98  DEAIAALLGIDPVHGETMQGQRYAVGQHFRAHNDYFNEAQPYWPKMIESGGQRTWTAMIY 157

Query: 203 LSDVEEGGETMFPF 216
           L+DVEEGG T FP 
Sbjct: 158 LNDVEEGGATWFPL 171


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 10/140 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKP--SQLALRQGETVESTKGTRTSSGTFISASE 138
           +P+     N  S  + + I   A+ RL+P  +Q     G  + S    R S   ++   E
Sbjct: 320 KPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSY---RISKNAWLYYWE 376

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQR 195
            +  ++  ++ ++  AT L     E   V+ Y IG  Y+ H+D     E     P    R
Sbjct: 377 HR--LINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDR 434

Query: 196 LASFLLYLSDVEEGGETMFP 215
           +A+ L Y+SDVE GG T+FP
Sbjct: 435 IATMLFYMSDVEAGGATVFP 454


>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 508

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 75/162 (46%), Gaps = 19/162 (11%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKR-LKPSQLALRQGETVESTKGTRTSS 130
           ++   VLS  PR     +F S  + + ++  A KR LK S +         +   TRTS+
Sbjct: 278 NMTMTVLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTST 337

Query: 131 GTFISASED---------KTGILELIEH--KIARATMLPQ---TH---GEAFNVLRYEIG 173
             +I   +D            +L++ E   +  R + +P+   +H    E   ++ Y++G
Sbjct: 338 NDWIPRHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVG 397

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           Q+Y  H+D   P     Q S R A+ L YL+D  +GGET FP
Sbjct: 398 QQYTPHHDFTMPGLVNMQPS-RFATLLFYLNDDMDGGETAFP 438


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 90/199 (45%), Gaps = 22/199 (11%)

Query: 35  LSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI------PF--------QVLSW 80
           L R+ P+I+PK     V + E           DA +  I      PF        +++  
Sbjct: 227 LIRNEPNIKPKPFNKSVGDFERGCRGEFPALTDAKLYCIYNTTSSPFLRLAPLKMELIGL 286

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
            P  + + +  S  +   +   AK +LK +++      T + +K TRT+   +   + ++
Sbjct: 287 DPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSK-TRTAKLAWFLDTFNQ 345

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----RL 196
             + E +  +I   T       E   V+ Y +G  Y  H+D FN  + GP ++Q    R+
Sbjct: 346 --LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTK-GPHITQINGDRI 402

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L YL+DVE+GG T+FP
Sbjct: 403 ATVLFYLNDVEQGGATVFP 421


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/144 (27%), Positives = 71/144 (49%), Gaps = 7/144 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  +  +P+ + F +  S  + + +   AK  L+ + +A +Q    E +K   + S 
Sbjct: 319 PFKLEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSS 378

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            F    ++    +  I  ++A  T L     E   V+ Y +G +YD H+D F+  +   +
Sbjct: 379 WF---PDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKL--K 433

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A+ L Y+SDV  GG T+FP
Sbjct: 434 EVNRIATVLFYMSDVSIGGATVFP 457


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 69/146 (47%), Gaps = 6/146 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++S  P  + + +  S  +   + + A   LK + +  +Q       K TRTS  T
Sbjct: 321 LKMELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVK-TRTSKVT 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYG 189
           ++  + ++  I   +  +I   T       E   V+ Y +G  YD HYD FN    A+  
Sbjct: 380 WLLDTLNQLTIR--LNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLT 437

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
                R+A+ L YL+DVE+GG T+FP
Sbjct: 438 RLNGDRIATVLFYLTDVEQGGATVFP 463


>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
 gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
 gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
 gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
          Length = 386

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 64/143 (44%), Gaps = 6/143 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI    K +L  +    + G   E     RT+ GT
Sbjct: 187 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 244

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    +   +++ +       T       + F VL Y IG  Y  H+D    AE     
Sbjct: 245 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 300

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
           S R+A+ + YLSDV +GG T+FP
Sbjct: 301 SDRIATAVFYLSDVPQGGATIFP 323


>gi|332187533|ref|ZP_08389270.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
 gi|332012462|gb|EGI54530.1| 2OG-Fe(II) oxygenase superfamily protein [Sphingomonas sp. S17]
          Length = 228

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 13/135 (9%)

Query: 86  YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILE 145
           Y  +F +  QC ++IA      +PS L      +     G RTS    ++    +   ++
Sbjct: 47  YQADFLTPAQCDALIAMIDANRRPSTLL-----SDRPDYGFRTSESCDMNRWSPE---VQ 98

Query: 146 LIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASFL 200
            I+  IA+   +P   GE     RY  GQ++ +H+D F+ +E Y  ++     QR  + +
Sbjct: 99  PIDESIAQLLGIPPEQGETMQGQRYAPGQQFRAHHDYFHESESYWEKVKVHGGQRTWTAM 158

Query: 201 LYLSDVEEGGETMFP 215
           +YL+DV EGG T FP
Sbjct: 159 IYLNDVPEGGATWFP 173


>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
 gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
          Length = 509

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 65/143 (45%), Gaps = 6/143 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI   AK  L  +    + G   E     RT+ GT
Sbjct: 310 LKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQDGNDKEDP--ARTTKGT 367

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    + + +++ +       T       + F VL Y IG  Y +H+D     E G   
Sbjct: 368 WLV---ENSKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFDFLEDTEMG-HF 423

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
           S R+A+ + YLSDV +GG T FP
Sbjct: 424 SDRIATAVFYLSDVPQGGATTFP 446


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 66/142 (46%), Gaps = 14/142 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I+ A + +K +++     ET   T   RT+ G ++    ++ 
Sbjct: 327 PYVVLYHEVLSAREISMLISKAAQNMKNTRV---HRETKPKTNRGRTAKGHWLKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ---- 194
            +   I  +I   T       E F V+ Y IG  Y  H D F+ A     GP+  Q    
Sbjct: 383 -LTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVL 441

Query: 195 --RLASFLLYLSDVEEGGETMF 214
             R+A+ L YLSDVE+GG T+F
Sbjct: 442 GDRIATVLFYLSDVEQGGATVF 463


>gi|219116348|ref|XP_002178969.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409736|gb|EEC49667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 302

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 9/142 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
            V+S  P  +   +F S   C+++I TA    K     +R     E    T  +S T   
Sbjct: 90  HVVSSEPPLVLIHDFLSTSMCKNLIDTATSTDK----MIRSTTGSEQETSTIRTSTTVWL 145

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ- 194
             E       +I  KI+  +  P  H E   V+RYE GQ +  H D  +      +M + 
Sbjct: 146 NDEQVPETSRIIAEKISSISGFPANHMENLQVVRYETGQSFKLHTDTIDAYN---EMDKR 202

Query: 195 -RLASFLLYLSDVEEGGETMFP 215
            R+A+ L+YL+    GGET+FP
Sbjct: 203 GRVATCLIYLAAPTIGGETLFP 224


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 68/141 (48%), Gaps = 13/141 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK RL  ++  +R  +T V +T   R S   ++   ED 
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             +++ +  +I   T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 395 --VIDRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 450

Query: 197 --ASFLLYLSDVEEGGETMFP 215
             A+FL Y+SDVE GG T+FP
Sbjct: 451 RVATFLNYMSDVEAGGATVFP 471


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 67/143 (46%), Gaps = 17/143 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + N  S  + + I   AK RL  ++  +R  +T V +T   R S   ++   ED 
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRL--ARATVRDPKTGVLTTANYRVSKSAWLEGEEDP 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E +  +I   T L     E   +  Y +G +Y+ H+D        AF     G   
Sbjct: 397 --VIERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTG--- 451

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A+FL Y+SDVE GG T+FP
Sbjct: 452 -NRVATFLNYMSDVEAGGATVFP 473


>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
          Length = 557

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 24/151 (15%)

Query: 77  VLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISA 136
            +S  P    F NF    +C+ +   A K LK S++      T       RTSS  F+  
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRV------TDGKLSNGRTSSSCFLIG 371

Query: 137 SEDKTGILELIEHKI------------ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           ++ K  +++ IE ++             R   L     E   ++RY   +KY SH+D  N
Sbjct: 372 AKGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD--N 429

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            A       +R+A+F+ YLSD  EGG T FP
Sbjct: 430 KA----GSFRRVATFMCYLSDQCEGGCTNFP 456


>gi|313768324|ref|YP_004062004.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
 gi|312599020|gb|ADQ91044.1| hypothetical protein MpV1_121c [Micromonas sp. RCC1109 virus MpV1]
          Length = 219

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 18/137 (13%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ--GETVESTKGTRTSSGTFISASE 138
           RPR   F +F + ++ + I+  A K LKPS ++  +   E+V      R S   ++   +
Sbjct: 45  RPRV--FHDFITPQERKHIMEMASKELKPSTVSTDRILNESV------RKSETAWLGRED 96

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLAS 198
               +++ + H+  +    P  + E   VLRY+ G  Y+ H D+ N        + RL +
Sbjct: 97  ---PVVDAVIHRCLKYIDRPIKNCEKLQVLRYKPGGYYNPHQDSINDGS-----NPRLYT 148

Query: 199 FLLYLSDVEEGGETMFP 215
           F+L L+D  EGGET FP
Sbjct: 149 FILALNDEYEGGETEFP 165


>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
          Length = 538

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP 461


>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
          Length = 538

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP 461


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           ++ AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 454 GATAFPF 460


>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
 gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
          Length = 535

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 318 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 376

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 377 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 433

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 434 GYIDRLATTLFYLNDVPQGGATHFP 458


>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
          Length = 538

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 321 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 380 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 436

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 437 GYIDRLATTLFYLNDVPQGGATHFP 461


>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
 gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
          Length = 536

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 72/145 (49%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K ++ PS+  +      E  K   + S  
Sbjct: 319 IKTEILSVDPFVILLHDMVSHKEGALIRSSSKNQILPSE-TVNAANEFEIAKFRTSKSVW 377

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 378 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 434

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 435 GYIDRLATTLFYLNDVPQGGATHFP 459


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           ++ AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLSDVE+G
Sbjct: 394 LSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFYLSDVEQG 453

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 454 GATAFPF 460


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 77/146 (52%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +VLS +P  + + NF +  + + I   A+  L+ S +A   GE  ++T   R S   ++ 
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVA--SGEN-QATVEYRISKSAWLK 369

Query: 136 ASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            SE  + I+  ++ +I+  T L     + E   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 370 GSE--SCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 428 KTGNRVATFMIYLSSVEAGGSTAFIY 453


>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
 gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
 gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
 gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
          Length = 536

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 71/148 (47%), Gaps = 7/148 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  LS  P  +++ N  S  + + +    K  L+ +++  R  +  +    +R++ G
Sbjct: 313 PFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKV-FRVEKGSDEIDPSRSADG 371

Query: 132 TFI---SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
            ++   +   D   +L  I  +I   T L    G     L+Y  G  +  HYD FN   +
Sbjct: 372 AWLPHQNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFNSKTF 431

Query: 189 GPQ-MSQRLASFLLYLSDVEEGGETMFP 215
             + +  R+A+ L YL++V+ GG T+FP
Sbjct: 432 SLETVGDRIATVLFYLNNVDHGGATVFP 459


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 75/152 (49%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 362 LQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVA--SGEKQLQVE-YRIS 418

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L   H   E   V+ Y IG  Y+ H+D A +P+
Sbjct: 419 KSAWLKDTADP--VLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPS 476

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 477 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 508


>gi|448924767|gb|AGE48348.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus AN69C]
 gi|448933638|gb|AGE57193.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus NE-JV-4]
          Length = 239

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 67/134 (50%), Gaps = 10/134 (7%)

Query: 89  NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +F S  +C  +I  A K+  +K         + ++    +R S  T+ +  E K  I++ 
Sbjct: 57  DFLSDAECDILINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114

Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           I++K      ++   + + + E   V RY+ GQ Y  HYD  +  +  P+  QRLA+ ++
Sbjct: 115 IQNKTRELLDSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173

Query: 202 YLSDVEEGGETMFP 215
           YL   EEGGET FP
Sbjct: 174 YLKAPEEGGETDFP 187


>gi|381200505|ref|ZP_09907642.1| procollagen-proline dioxygenase [Sphingobium yanoikuyae XLDN2-5]
          Length = 221

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 59/132 (44%), Gaps = 13/132 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F  AE+C  + A      +PS L      T       RTS    +   ED   ++  I 
Sbjct: 43  DFLGAEECAVMRAIIDAGAQPSTLF-----TGRENADYRTSHSCNLD-REDP--LVHAIS 94

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYL 203
            +I   T L   HGE     RY  GQ+Y  H D F   A Y P+M     QR  + ++YL
Sbjct: 95  ARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPVNASYWPEMRKTGGQRNWTAMIYL 154

Query: 204 SDVEEGGETMFP 215
           S VE GGET FP
Sbjct: 155 SPVEGGGETHFP 166


>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
          Length = 478

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 84/212 (39%), Gaps = 46/212 (21%)

Query: 39  VPS----IRPKLRTLEVVEKENESGLPHGETG-DASIQSIPFQVLSWRPRALYFPNFASA 93
           VPS    + P +RT      +    L   ETG DA  +S+    LS RP+      F   
Sbjct: 114 VPSGVHFVWPTVRTGHATVTD----LADAETGVDAGHRSV-VTTLSMRPQVFRISQFMMG 168

Query: 94  EQCQSIIATAKKRLKPSQLAL--RQGETVESTKGTRTSSGTFISASEDKTGILELIEHKI 151
            + + +I   K R+KPS++ L  R G+       TRTS+  + +AS        +    I
Sbjct: 169 HETEKLIERNKPRIKPSEVGLVGRSGDK------TRTSTNAWDTAS-------PVARDVI 215

Query: 152 ARATMLPQTHG-----EAFNVLRYEIGQKYDSHYDAFNPAEYG----------------P 190
            RA  L +        +   VL YE  Q Y  H D F     G                 
Sbjct: 216 GRAFRLLKIDAHRKLEDGLQVLHYERPQWYKPHVDYFTSRNAGGGGASEDAFSNAIPTAN 275

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPFEVDLQI 222
             + R A+  LYL++   GGET+FP     +I
Sbjct: 276 NGTNRFATVFLYLNNAGSGGETVFPLSTTHEI 307


>gi|400602974|gb|EJP70572.1| 2OG-Fe(II) oxygenase family Oxidoreductase [Beauveria bassiana
           ARSEF 2860]
          Length = 269

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 70/154 (45%), Gaps = 12/154 (7%)

Query: 71  QSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS 130
           Q+   ++LS  P A+Y  NF +  + + ++A  +   KPS++A   G  V +T   R+S 
Sbjct: 43  QNYTVEILSIDPLAIYLNNFLNDAEIRYLLALGENIYKPSEVASHSGIIVNTT--VRSSE 100

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEI-GQKYDSHYDAFNPAEYG 189
             F+   ED      LI    +    +   H E+  +++Y   G +Y  H D    A+  
Sbjct: 101 SAFL--LEDDAVCNCLISRMKSLLGNVQHEHVESLQMVKYAAGGDRYRLHTDWSVAAKNN 158

Query: 190 P-------QMSQRLASFLLYLSDVEEGGETMFPF 216
                   + S+RL +  +YL D   GGET FP 
Sbjct: 159 TDEASGKLRQSRRLGTIFVYLEDSCAGGETYFPL 192


>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
 gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 70/148 (47%), Gaps = 11/148 (7%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSS 130
           +  ++ S  P  + + +     + Q +I + ++R+  S  Q  +RQ E  E     RTS 
Sbjct: 141 LKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSK 196

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH---YDAFNPAE 187
               +   D   +L+ I  ++   T       E  ++L Y+ G  +D H   +D +   E
Sbjct: 197 EAPFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYE 255

Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFP 215
           Y P    R AS + YL+DVE+GGET+FP
Sbjct: 256 YHP-FGDRQASVVFYLNDVEDGGETVFP 282


>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
          Length = 538

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 71/148 (47%), Gaps = 7/148 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  LS  P  +++ N  S  + + +    K  L+ +++  R  +  +    +R++ G
Sbjct: 315 PFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKV-FRVEKGSDEIDPSRSADG 373

Query: 132 TFI---SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY 188
            ++   +   D   +L  I  +I   T L    G     L+Y  G  +  HYD FN   +
Sbjct: 374 AWLPHQNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFNSKTF 433

Query: 189 GPQ-MSQRLASFLLYLSDVEEGGETMFP 215
             + +  R+A+ L YL++V+ GG T+FP
Sbjct: 434 SLETVGDRIATVLFYLNNVDHGGATVFP 461


>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 418

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 63/139 (45%), Gaps = 7/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
           PR   +    SA++C+ ++  A+  L+ S++ +   +        RTS G T     ED 
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLA 197
                  + ++A    LP  H E  +VL Y  G++Y +H D   P   A   P    R  
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344

Query: 198 SFLLYLSDVEEGGETMFPF 216
           +  +YL+DV  GGET FP 
Sbjct: 345 TVCVYLNDVGAGGETEFPV 363


>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 418

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 63/139 (45%), Gaps = 7/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDK 140
           PR   +    SA++C+ ++  A+  L+ S++ +   +        RTS G T     ED 
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKV-IDPNDASTGRAPVRTSHGATLDPIIEDF 286

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLA 197
                  + ++A    LP  H E  +VL Y  G++Y +H D   P   A   P    R  
Sbjct: 287 AA--RAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQR 344

Query: 198 SFLLYLSDVEEGGETMFPF 216
           +  +YL+DV  GGET FP 
Sbjct: 345 TVCVYLNDVGAGGETEFPV 363


>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
 gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
          Length = 509

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 64/143 (44%), Gaps = 6/143 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++LS  P  + F +  S +   SI    K +L  +    + G   E     RT+ GT
Sbjct: 310 LKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD--RTTKGT 367

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++    +   +++ +       T       + F VL Y IG  Y  H+D    AE     
Sbjct: 368 WLV---ENNALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFDFLEDAELD-NF 423

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
           S R+A+ + YLSDV +GG T+FP
Sbjct: 424 SDRIATAVFYLSDVPQGGATIFP 446


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 65/143 (45%), Gaps = 14/143 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P  + +    SA +   ++  A + +K +++   Q E   +T   RT+ G ++    ++
Sbjct: 325 KPYVVLYHEVLSAREISMLMGKAAQNMKNTRV---QSEKAVNTNRERTAKGYWLKKESNE 381

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQ 191
             +   I  +I   T       E F V+ Y IG  Y  H+D F  A          +   
Sbjct: 382 --MTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIV 439

Query: 192 MSQRLASFLLYLSDVEEGGETMF 214
           +  R+A+ L YL+DVE+GG T+F
Sbjct: 440 LGDRIATVLFYLTDVEQGGATVF 462


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score = 56.6 bits (135), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 70/145 (48%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S ++   I +++K  + PS+      + V S    RTS   
Sbjct: 322 IKTEILSIDPFVVLLHDMVSPKEAALIRSSSKSTIFPSETVNAANDFVVSK--FRTSKSV 379

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NPAEYGP 190
           ++    ++  +   +  ++A AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 380 WLDRDANEATVK--LTQRLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVG 437

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               R+A+ L YL+DV +GG T FP
Sbjct: 438 GFIDRIATTLFYLNDVPQGGATHFP 462


>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
 gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
          Length = 498

 Score = 56.2 bits (134), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 74/150 (49%), Gaps = 16/150 (10%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  ++LS  P  + F +     + + +  TA+  L  S +     E+V S    RT+ G
Sbjct: 281 PFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVKKSNNESVVSK--VRTAKG 338

Query: 132 TFISA---SEDKTGILELIEHKIARATMLPQTHGEAFN---VLRYEIGQKYDSHYDAFNP 185
            F+ A   S +   +++ ++ ++   + L     E +N    L Y+ G  Y  H D FN 
Sbjct: 339 AFMHADRLSPESAQVVQRLKQRMGDLSDL-NIKREGYNEMQYLNYDFGDHYLLHMDYFNI 397

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           +     M+ R+A+FL+YL+DV  GG T+FP
Sbjct: 398 S-----MNDRIATFLIYLNDVTRGGGTIFP 422


>gi|410665706|ref|YP_006918077.1| 2OG-Fe(II) oxygenase [Simiduia agarivorans SA1 = DSM 21679]
 gi|409028063|gb|AFV00348.1| 2OG-Fe(II) oxygenase [Simiduia agarivorans SA1 = DSM 21679]
          Length = 270

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 20/135 (14%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSS----GTFISASEDKTGIL 144
           +F SA +C+ ++A    +L+PS +      T     G RTSS    G  +S       ++
Sbjct: 88  DFLSAAECEGVMALMDSKLRPSTVT-----TGNLHYGYRTSSTCDLGRMVSP------LI 136

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEY---GPQMSQRLASF 199
             I+ KIAR   +     E     +Y +GQ++ +H D F P  AEY        QR  +F
Sbjct: 137 RDIDQKIARTLGVNVAWSEEIQGQKYLVGQEFKAHTDYFQPQSAEYEKFAGARGQRSWTF 196

Query: 200 LLYLSDVEEGGETMF 214
           ++YL+  E GG T F
Sbjct: 197 MIYLNTTEAGGATRF 211


>gi|87199403|ref|YP_496660.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
 gi|87135084|gb|ABD25826.1| 2OG-Fe(II) oxygenase [Novosphingobium aromaticivorans DSM 12444]
          Length = 211

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 61/131 (46%), Gaps = 13/131 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F S  +C  +IA  ++  +PS +A   G+    T  T            D   I+ L E
Sbjct: 34  DFLSQAECNGLIARIERDRRPSTIADANGDHYFRTSET-------CDLPMDDPEIVALDE 86

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLLYL 203
            K+   + + +  GE     RYE GQ++ +H D F+P       +     QR  +F++YL
Sbjct: 87  -KLCALSGIGRPFGEPIQGQRYESGQEFKAHTDYFDPHGADFQRFCSVAGQRTWTFMVYL 145

Query: 204 SDVEEGGETMF 214
           +DVE GG T F
Sbjct: 146 NDVEAGGATRF 156


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V      P  L F N  S  + +++   A+ RL  +       + +E     R S  
Sbjct: 317 PFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKV 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +   L ++  ++A  T L  +  E F V+ Y IG  Y+ H+D  +  +  P 
Sbjct: 376 AWLEDQEHQH--LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PA 431

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           +  R+ + L YLSDVE+GG T+FP   ++Q+S
Sbjct: 432 IGSRIETVLFYLSDVEQGGATVFP---EIQVS 460


>gi|393725345|ref|ZP_10345272.1| putative oxygenase [Sphingomonas sp. PAMC 26605]
          Length = 226

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 62/136 (45%), Gaps = 15/136 (11%)

Query: 86  YFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS-SGTFISASEDKTGIL 144
           + P+F  A  C +++A      + S +        E+ +  RTS S      S D     
Sbjct: 44  HHPDFLDAATCDTLVAMIDANKRRSTVLAE-----ENVQEFRTSESCDMDRWSPDVRPTD 98

Query: 145 ELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM----SQRLASF 199
           E I H +     +   +GE     RY +GQ + +H+D FN  + Y PQM     QR  + 
Sbjct: 99  EAIAHLLG----IDPVYGETMQGQRYAVGQHFRAHFDYFNEKQAYWPQMIETGGQRTWTA 154

Query: 200 LLYLSDVEEGGETMFP 215
           ++YL+ V EGG T FP
Sbjct: 155 MIYLNHVAEGGATWFP 170


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V      P  L F N  S  + +++   A+ RL  +       + +E     R S  
Sbjct: 311 PFKVEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFP-FRISKV 369

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +   L ++  ++A  T L  +  E F V+ Y IG  Y+ H+D  +  +  P 
Sbjct: 370 AWLEDQEHQH--LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDFQSTVD--PA 425

Query: 192 MSQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           +  R+ + L YLSDVE+GG T+FP   ++Q+S
Sbjct: 426 IGSRIETVLFYLSDVEQGGATVFP---EIQVS 454


>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
          Length = 150

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 50/88 (56%), Gaps = 14/88 (15%)

Query: 135 SASEDKTGILELIEHKIARATML-PQTHGEAFNVLRYEIGQKYDSHYDAFNPAEY----- 188
           SAS DK      +  +++ AT L  + + E F V  Y IG  Y+ H+D F+  +Y     
Sbjct: 8   SASADK------LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFD-FSKVKYFTNPV 60

Query: 189 -GPQMSQRLASFLLYLSDVEEGGETMFP 215
              QM  R+A+F++YL+DVE GG T+FP
Sbjct: 61  LNEQMGDRIATFMIYLNDVEAGGRTVFP 88


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 69/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           RPR + F    S E+ +++   AK RL  S+  +   +T + +T   R S   ++S  E+
Sbjct: 342 RPRIVRFVEIISDEEIETVKELAKPRL--SRATVHDPQTGKLTTAHYRVSKSAWLSGYEN 399

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              I+  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 400 P--IVARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 455

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 456 --NRIATWLFYMSDVSAGGATVFP 477


>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
 gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
          Length = 518

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 71/145 (48%), Gaps = 6/145 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           I  ++LS  P  +   +  S  +   I +++K ++ PS+  +      E  K   + S  
Sbjct: 301 IKTEILSVDPFVILLHDMVSPTEGALIRSSSKNQILPSE-TVNAANEFEVAKFRTSKSVW 359

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA--FNPAEYGP 190
           F S + + T  L+L + ++  AT L   H E F V+ Y IG  ++SH+D    +   +  
Sbjct: 360 FDSDANEAT--LKLTQ-RLGEATGLDMKHSEPFQVINYGIGGVFESHFDTSLADEDRFVN 416

Query: 191 QMSQRLASFLLYLSDVEEGGETMFP 215
               RLA+ L YL+DV +GG T FP
Sbjct: 417 GYIDRLATTLFYLNDVPQGGATHFP 441


>gi|103487007|ref|YP_616568.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
 gi|98977084|gb|ABF53235.1| 2OG-Fe(II) oxygenase [Sphingopyxis alaskensis RB2256]
          Length = 218

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 61/133 (45%), Gaps = 13/133 (9%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
            P F  AE C ++I       +PS +A   G+        RTSS   +   + +  I+  
Sbjct: 32  LPRFLDAETCAALIDLIDSDARPSTIADANGDAA-----FRTSSTCDL---DHRLPIVIA 83

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLL 201
           + +K+   T +P  +GE     RY+IG+++ +H D F+P       Y     QR  + ++
Sbjct: 84  VNNKLHDLTGIPLAYGEPLQGQRYDIGEEFKAHTDYFDPHGADWDTYCAVPGQRSWTLMI 143

Query: 202 YLSDVEEGGETMF 214
           YL+    GG T F
Sbjct: 144 YLNQPAAGGATRF 156


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 68/143 (47%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + F +  S E+ + +   +K RL+ + ++      +E T   R S   ++S  E+ 
Sbjct: 343 RPRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYENP 401

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 402 --VVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 456

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 457 -NRIATWLFYMSDVAAGGATVFP 478


>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
          Length = 535

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 49/85 (57%), Gaps = 6/85 (7%)

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-----AFNPAEYGPQMSQRLA 197
           +++ ++++I   T L     +A  V  Y IG  YD HYD       + +E   +   R+A
Sbjct: 393 VVDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIA 452

Query: 198 SFLLYLSDVEEGGETMFPFEVDLQI 222
           +FLLY++DV+ GG T+FP  +D+++
Sbjct: 453 TFLLYMTDVDAGGATVFPI-IDVRV 476


>gi|427410040|ref|ZP_18900242.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425712173|gb|EKU75188.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 225

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 58/132 (43%), Gaps = 13/132 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F  AE+C  + A      +PS L      T       RTS    +   ED   ++  I 
Sbjct: 47  DFLGAEECAVMRAIIDAGAQPSTLF-----TGRENADYRTSHSCNLD-REDP--LVHAIS 98

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYL 203
            +I   T L   HGE     RY  GQ+Y  H D F   A Y P M     QR  + ++YL
Sbjct: 99  ARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPVNASYWPDMRKTGGQRNWTAMIYL 158

Query: 204 SDVEEGGETMFP 215
           S VE GGET FP
Sbjct: 159 SPVEGGGETHFP 170


>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
           CCMP1335]
 gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
           CCMP1335]
          Length = 248

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 66/162 (40%), Gaps = 20/162 (12%)

Query: 72  SIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
            I  + +S  PR     +F S  +   I+    +  +  + +            TRTS  
Sbjct: 34  DITLRTVSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMN 93

Query: 132 TFISASEDKTGILELIEHKIARATML---------PQTH---------GEAFNVLRYEIG 173
           T+I   E  T I++ I  ++A    +         P  H          E   ++ Y+ G
Sbjct: 94  TWIYREE--TAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPG 151

Query: 174 QKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           ++Y +H+D        P    R  + LLYL+DVEEGGET FP
Sbjct: 152 EEYTAHHDFGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP 193


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q+I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y      R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|156406532|ref|XP_001641099.1| predicted protein [Nematostella vectensis]
 gi|156228236|gb|EDO49036.1| predicted protein [Nematostella vectensis]
          Length = 410

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/134 (32%), Positives = 59/134 (44%), Gaps = 15/134 (11%)

Query: 96  CQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASED-------------KTG 142
           C   + T +K LK  +LA +  E  + T GT   + TF     D             +  
Sbjct: 173 CFDGVDTLRKNLKELKLAYKVSER-DFTMGTTCLNETFSGKLRDHFKWSHSMAFYTGENK 231

Query: 143 ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLY 202
              +   +I  AT L + +G  F +  Y IG  Y SH D     E  P+   R A+ L+Y
Sbjct: 232 FSTMYAKRIQAATGLREENGGKFQITGYPIGVGYKSHTDCV-VYEGEPEKRDRYATILVY 290

Query: 203 LSDVEEGGETMFPF 216
           L DVEEGGET FP 
Sbjct: 291 LQDVEEGGETDFPL 304


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 75/152 (49%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +VL   P  + + +F S  + Q I   A   L+ S +A   GE  +  +  R S
Sbjct: 349 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 405

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 406 KSAWLKDTVDP--MLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 463

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y      R+A+F++YLS VE GG T F +
Sbjct: 464 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIY 495


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 68/142 (47%), Gaps = 13/142 (9%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----- 194
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D     E  P   Q     
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTG 447

Query: 195 -RLASFLLYLSDVEEGGETMFP 215
            R+A++L Y+SDV  GG T+FP
Sbjct: 448 NRIATWLFYMSDVSAGGATVFP 469


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 5/148 (3%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           IQ I  +    +PR + + +  S E+ ++I   A+ R + + +  ++    E ++  R +
Sbjct: 334 IQPIKMEEALLKPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSR-YRIA 392

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
              ++   E     +  I  ++   T L     E   V  Y IG  Y+ HYD     E  
Sbjct: 393 KSAWLKHEEHD--YVSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQ 450

Query: 190 PQMSQ--RLASFLLYLSDVEEGGETMFP 215
                  R+A++L Y+SDVE GG T+FP
Sbjct: 451 QDFGWGGRIATWLFYMSDVEAGGATVFP 478


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 392

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 393 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 449

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 450 --NRIATWLFYMSDVSAGGATVFP 471


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 66/142 (46%), Gaps = 7/142 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++L   P  + + +  S  +   I+  A++R+  +    +   T   +  TRT+ G 
Sbjct: 310 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTAMGA 366

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++  S +   +   I  ++   + L     E   V+ Y IG  Y  H D F   ++   M
Sbjct: 367 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 422

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             RLA+ L YL+DVE+GG TMF
Sbjct: 423 GNRLATVLFYLTDVEQGGATMF 444


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 68/141 (48%), Gaps = 13/141 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +F S E+ + I   AK +L  ++  +R  ++ V +T   R S   ++   ED 
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKL--ARATVRDPKSGVLTTASYRVSKSAWLEGEEDP 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRL---- 196
             I+  +  +I   T L     E   V  Y +G +Y+ H+D     E  P   +RL    
Sbjct: 397 --IIARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDE--PDAFKRLGTGN 452

Query: 197 --ASFLLYLSDVEEGGETMFP 215
             A+FL Y+SDVE GG T+FP
Sbjct: 453 RVATFLNYMSDVEAGGATVFP 473


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S E+ Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIY 478


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q+I   A+  L+ S +A   GE     +  R S
Sbjct: 268 LQPVRKEVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 325 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 382

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y      R+A+F++YLS VE GG T F +
Sbjct: 383 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIY 414


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 372

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 373 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 429

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 430 --NRIATWLFYMSDVSAGGATVFP 451


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S E+ Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 394

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 395 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 452

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 453 KSGNRVATFMIYLSSVEAGGATAFIY 478


>gi|390166011|ref|ZP_10218284.1| putative oxygenase [Sphingobium indicum B90A]
 gi|390168995|ref|ZP_10220942.1| putative oxygenase [Sphingobium indicum B90A]
 gi|389588453|gb|EIM66501.1| putative oxygenase [Sphingobium indicum B90A]
 gi|389591170|gb|EIM69145.1| putative oxygenase [Sphingobium indicum B90A]
          Length = 225

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 13/131 (9%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F SAE+C  +        +PS L      +  +    RTS+   +S     + ++  +  
Sbjct: 48  FLSAEECAGLRELIDAGAQPSTLF-----SGSANADYRTSASCHLSP---WSPLVMAVSD 99

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYLS 204
           +I     L   HGE     RY+ GQ+Y +H D F   A Y P M     QR  + ++YLS
Sbjct: 100 RICALMGLAGEHGETLQGQRYQAGQEYKAHCDYFPVTASYWPAMLKTGGQRSWTAMIYLS 159

Query: 205 DVEEGGETMFP 215
            VE GGET FP
Sbjct: 160 PVEAGGETHFP 170


>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
 gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
          Length = 296

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 70/150 (46%), Gaps = 12/150 (8%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS--QLALRQGETVESTKGTRTSS 130
           +  ++ S  P  + + +     + Q +I + ++R+  S  Q  +RQ E  E     RTS 
Sbjct: 73  LKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQYEIRQIEISEQ----RTSK 128

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA----FNPA 186
               +   D   +L+ I  ++   T       E  ++L Y+ G  +D H D     ++P 
Sbjct: 129 EAPFTEKNDPQ-LLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWHPQ 187

Query: 187 EYGPQ-MSQRLASFLLYLSDVEEGGETMFP 215
           EY       R AS + YL+DVE+GGET+FP
Sbjct: 188 EYEYHPFGDRQASVVFYLNDVEDGGETVFP 217


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 71/148 (47%), Gaps = 9/148 (6%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES-TKGTRTSS 130
           PF  ++LS  P  + + +  +  +  ++   +K  +K   + +     V       RTS+
Sbjct: 312 PFKTEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSN 371

Query: 131 GTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP 190
             ++++ E+   ++E +E ++   T     + E + ++ Y IG  Y  H D F   +  P
Sbjct: 372 SVWLASHEN--AVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQ-AP 428

Query: 191 QM---SQRLASFLLYLSDVEEGGETMFP 215
           +      R+A+ L YLSDV +GG T+FP
Sbjct: 429 EHRGGGDRIATVLFYLSDVPQGGATLFP 456


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 433

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 434 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 493

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 494 FLNYMSDVEAGGATVFP 510


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 70/144 (48%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S E+ +++   AK RL  S+  +   ET + +T   R S   ++S  E 
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRL--SRATVHDPETGKLTTAHYRVSKSAWLSGYE- 345

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
            + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 346 -SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 402

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 403 --NRIATWLFYMSDVSAGGATVFP 424


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 66/142 (46%), Gaps = 7/142 (4%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  ++L   P  + + +  S  +   I+  A++R+  +    +   T   +  TRT+ G 
Sbjct: 335 LKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRT---SSPTRTALGA 391

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM 192
           ++  S +   +   I  ++   + L     E   V+ Y IG  Y  H D F   ++   M
Sbjct: 392 WLKRSSN--ALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFT--QHPEVM 447

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             RLA+ L YL+DVE+GG TMF
Sbjct: 448 GNRLATVLFYLTDVEQGGATMF 469


>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
           bacterium SAR86E]
 gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
           bacterium SAR86E]
          Length = 214

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 12/143 (8%)

Query: 79  SWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASE 138
           S  P      NF S  +C + I  A+ RL+ S +    G   E   G RTS   +I    
Sbjct: 18  SVNPIVYLVKNFLSDLECDAFINEAEGRLQDSTVI---GANDEIKLGARTSQNCWIE--H 72

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN------PAEYGPQM 192
           D   ++  +  +++    +P  + E + +  YE  ++Y   +D+F+         + P  
Sbjct: 73  DANELVHEVSKRLSILAQIPIRNAEQYQLACYEKDEEYKPRFDSFDFDTLEGKKNWEPG- 131

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
            QR+ + ++YL+DV+ GG T FP
Sbjct: 132 GQRMLTIIVYLNDVQSGGGTDFP 154


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 74/146 (50%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S E+ Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRISKSAWLK 256

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 257 DTVDP--MLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 314

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 315 KSGNRVATFMIYLSSVEAGGATAFIY 340


>gi|340924033|gb|EGS18936.1| hypothetical protein CTHT_0055490 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 311

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 73/151 (48%), Gaps = 18/151 (11%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++S  P  +Y  +F +  +  +++   + R KPSQ+A + G  + +   T +S+G    
Sbjct: 78  ELVSLDPVIIYIHDFITKPEIDALLEVGEPRFKPSQVA-KYGVLMNTNDRTSSSAGL--- 133

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGE--AFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
             +D+  +  +   +    TM+     +  A  ++RY  G+K++ H+D F      PQM+
Sbjct: 134 PRDDQAVMCVMRRARNFLGTMIRDGWDDMGAPQLVRYREGEKFNHHHDWFE----NPQMA 189

Query: 194 --------QRLASFLLYLSDVEEGGETMFPF 216
                    R+ASF   L D   GGET FP+
Sbjct: 190 YDGTSRKWNRVASFFAILQDNCTGGETYFPY 220


>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
          Length = 561

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 72/157 (45%), Gaps = 15/157 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +  S  P  +   +  +  Q + +    + +L  S     +G+ V S    RTS
Sbjct: 337 LQPIKVEEQSLDPMIVVLHDLITERQTEILRQLGEPKLATSLHRGGEGKFVRSM--IRTS 394

Query: 130 SGTFISASEDKTGILELIEHKIARATML---PQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
              ++   E+ +  L  I H++  AT L   P+T  E F +  Y IG  Y +H D     
Sbjct: 395 KNAWLQEHENAS--LPAIRHRMELATGLIYGPETASEYFQIANYGIGGLYKTHTDNVIHP 452

Query: 187 EYGPQ--------MSQRLASFLLYLSDVEEGGETMFP 215
           +  P+        +  R+A+ ++YLSDVE GG T+FP
Sbjct: 453 DVRPEDQDPWNLYVGDRIATLMVYLSDVEAGGATVFP 489


>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
 gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
          Length = 536

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 68/143 (47%), Gaps = 9/143 (6%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISA 136
           LS  P  + + N  S  +    IA  ++  +P   ++  GE   S K   RT+ G +I  
Sbjct: 328 LSLDPYIVVYHNVLSDAE----IAKVERVAEPLLKSIGVGEMDNSKKSKVRTALGAWIPD 383

Query: 137 SEDKTG---ILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-M 192
                    +++ I  +I   T L    G+   +++Y  G  YD+H+D  N +    Q +
Sbjct: 384 ENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 443

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A+ L YL+DV+ GG T+FP
Sbjct: 444 GDRMATVLFYLNDVKHGGSTVFP 466


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 484 FLNYMSDVEAGGATVFP 500


>gi|241598365|ref|XP_002404734.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215500465|gb|EEC09959.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 524

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 70/160 (43%), Gaps = 9/160 (5%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVES 122
           GE+G   +Q I  +  + +P  +   +         +IA AK R++  QL+ R    V S
Sbjct: 301 GESGFFKLQPIKLEEYNLKPYVVVLRDLLQDRDLADMIAFAKPRVRKLQLSRRI--LVYS 358

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
                TS+      +     + + ++  +   T+  +   E + +  Y IG  Y  H+D 
Sbjct: 359 KHYCDTSTWLNDDDAPVAARVNQYLQSLLGLGTLYSKDEAEKYQLANYGIGGHYVPHHDY 418

Query: 183 FNPAEYGPQMS-------QRLASFLLYLSDVEEGGETMFP 215
                    +S        R+A+ ++Y+SDVEEGG T+FP
Sbjct: 419 LEETLTSRHVSIVTRLFGDRVATLMIYMSDVEEGGATVFP 458


>gi|294013234|ref|YP_003546694.1| putative oxygenase [Sphingobium japonicum UT26S]
 gi|292676564|dbj|BAI98082.1| putative oxygenase [Sphingobium japonicum UT26S]
          Length = 225

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 60/131 (45%), Gaps = 13/131 (9%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F SAE+C  +        +PS L      +  +    RTS+   +S     + ++  +  
Sbjct: 48  FLSAEECAGLRELIDAGAQPSTLF-----SGSANADYRTSASCHLSP---WSPLVMAVSD 99

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGPQM----SQRLASFLLYLS 204
           +I     L   HGE     RY+ GQ+Y +H D F   A Y P M     QR  + ++YLS
Sbjct: 100 RICALMGLAGEHGETLQGQRYQAGQEYKAHCDYFPVTASYWPAMLKTGGQRSWTAMIYLS 159

Query: 205 DVEEGGETMFP 215
            VE GGET FP
Sbjct: 160 AVEAGGETHFP 170


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 423

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 424 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 483

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 484 FLNYMSDVEAGGATVFP 500


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-SQRLASFLLYLSDVEEG 209
           +  AT L  T+ E   V  Y +G  Y+ H+D F  + + P     R+A+ + YLSDVE+G
Sbjct: 393 VGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 452

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 453 GATAFPF 459


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  I  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 454 FLNYMSDVEAGGATVFP 470


>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
           pv. vasculorum NCPPB 702]
          Length = 301

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 61/129 (47%), Gaps = 7/129 (5%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG-TFISASEDKTGILELIEHK 150
           SA++C+ ++  A+  L+ SQ+ +   +        RTS G T     ED      + + +
Sbjct: 121 SADECRLLMLLARPHLRDSQV-IDPNDASTQRAPVRTSRGATLDPIIEDFAA--RVAQAR 177

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLASFLLYLSDVE 207
           +A    L  TH E  +VL Y  G++Y +H D   P   A   P    R  +  +YL+ V+
Sbjct: 178 LAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNVVD 237

Query: 208 EGGETMFPF 216
            GGET FP 
Sbjct: 238 AGGETEFPL 246


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 346

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 347 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 406

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 407 FLNYMSDVEAGGATVFP 423


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 414

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 415 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 474

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 475 FLNYMSDVEAGGATVFP 491


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 13/142 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   QG  V      RT+ G +     ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 362

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 363 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 422

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 423 FLNYMSDVEAGGATVFP 439


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|390358555|ref|XP_003729286.1| PREDICTED: transmembrane prolyl 4-hydroxylase-like
           [Strongylocentrotus purpuratus]
          Length = 508

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 63/126 (50%), Gaps = 26/126 (20%)

Query: 116 QGETVESTKG-TRTSSGTFISASEDKTGILELIEHKIARATMLP----QTHGEAFNVLRY 170
           +G  V+  KG TR S  T+IS++      L+ I+++I   T L     QT  E+  V+RY
Sbjct: 222 EGRMVKKWKGRTRVSDQTWISSNTVNGHPLQKIQNRIIALTRLSPEIVQT-SESMQVVRY 280

Query: 171 EIGQKYDSHYDA---------------FNPAEYGPQMSQR-----LASFLLYLSDVEEGG 210
             G  Y +H+D+               FN +E  P   +R     + + L YL+DV++GG
Sbjct: 281 RPGGHYHAHFDSSEVTSDLECRHTAITFNQSEETPFHEERCRYCRMITILYYLNDVDQGG 340

Query: 211 ETMFPF 216
           ET FPF
Sbjct: 341 ETAFPF 346


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 75/152 (49%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 304 LQPVRKEVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 360

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 361 KSAWLKDTVDP--VLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 418

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y      R+A+F++YLS VE GG T F +
Sbjct: 419 SPLYRMNSGNRVATFMIYLSSVEAGGATAFIY 450


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           ++  T L  T+ E   V  Y +G  Y+ H+D F NP  Y  +   R+A+ + YLS+VE+G
Sbjct: 396 LSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYYLSEVEQG 455

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 456 GATAFPF 462


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 13/142 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   QG  V      RT+ G +     ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKATQNMKNTRVHKEQG--VPKKNRGRTAKGFWFKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---------YGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP-QMSQRLASFLLYLSDVEEG 209
           ++ AT L  T+ E   V  Y +G  Y+ H+D F  +++ P +   R+A+ + YLSDVE+G
Sbjct: 398 LSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQG 457

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 458 GATAFPF 464


>gi|255085194|ref|XP_002505028.1| predicted protein [Micromonas sp. RCC299]
 gi|226520297|gb|ACO66286.1| predicted protein [Micromonas sp. RCC299]
          Length = 439

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 61/155 (39%), Gaps = 19/155 (12%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           +S  PR     NF S E+  +I+  A   L PS +   Q      T G  T+     +A 
Sbjct: 205 VSRHPRLAVIHNFISKEEAAAIVDVAAPELHPSLVVRHQTAKRGDTAGGDTAVHGEATAG 264

Query: 138 EDKTGILELIEHKIARATM--------LPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
                      H I RA +        L  +H E   V+RY   Q+Y  H+D F+ A   
Sbjct: 265 RTSHNCRVSSSHPIVRAAIQRAAYLCGLEPSHAEPAQVVRYLPSQEYKPHHDWFDRAH-- 322

Query: 190 PQM---------SQRLASFLLYLSDVEEGGETMFP 215
           P+           QR  + L YL + E GG T FP
Sbjct: 323 PESFRAKTEGRGGQRAVTCLAYLVEPERGGRTYFP 357


>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
          Length = 525

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 92/201 (45%), Gaps = 30/201 (14%)

Query: 43  RPKLRTLEVVE---KENESGLPHGETGDASIQS------IPF-----QVLSWRPRALYFP 88
           RPKL++ +  E   +  +  LP  ++   +++       +PF     ++L+ +P  + F 
Sbjct: 263 RPKLKSTKAYEALCRGEQLKLPDVDSEQQALKCRYKPGILPFVRYKEEMLNRKPHIVLFH 322

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +  S  + +++   A  +L+ + +A  + +   S    R S  +++        I +L  
Sbjct: 323 DVMSDAEAKTMKMEAMHKLERAHVADNENKHGHSASAKRISQVSWLWDDHANKTIHQL-S 381

Query: 149 HKIARATMLPQT-------HGEAFNVLRYEIGQKYDSHYDAF-------NPAEYGPQMSQ 194
            ++A  T L QT         E F +L Y IG +Y+ H D F       +  E+      
Sbjct: 382 RRVADITGL-QTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHSSLPEHVRASGN 440

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           RLA+F+ YL+DV  GG T+FP
Sbjct: 441 RLATFMFYLNDVHAGGATVFP 461


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 311

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 312 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 367

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 368 --NRIATWLFYMSDVSAGGATVFP 389


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|149186836|ref|ZP_01865146.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
 gi|148829503|gb|EDL47944.1| 2OG-Fe(II) oxygenase [Erythrobacter sp. SD-21]
          Length = 211

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 68/152 (44%), Gaps = 21/152 (13%)

Query: 70  IQSIPFQVLSWRPRALYFP--NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
           IQ +P       P+A  F   +FAS   C+ ++A  +K  +PS LA        +    R
Sbjct: 18  IQRVP------TPKAELFQLRDFASPAMCEQLVALIEKDRRPSTLA-----DAGNDHYFR 66

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA- 186
           TS    +   E    ++  IE  +A    +    GE     RY++GQ++  H D FN   
Sbjct: 67  TSETCDLDPDEP---VVCEIEALLAALNGIDPKFGEPLQGQRYDVGQEFKPHCDYFNRGG 123

Query: 187 ----EYGPQMSQRLASFLLYLSDVEEGGETMF 214
               +Y     QR  +F++YL+ VE GG T F
Sbjct: 124 QDWEKYCSVAGQRTWTFMVYLNAVEAGGATRF 155


>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
 gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
          Length = 535

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 7/148 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  LS  P         S++  + I   A+ ++K S +    G         RTS G
Sbjct: 316 PFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQG 375

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF---NPAEY 188
              + S  +    +++   +   + L     E   V  Y IG  Y+ H+D+F   +  + 
Sbjct: 376 ASFNYS--RNAATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSFPENHIYDE 433

Query: 189 GPQMSQRLASFLLYLSDVEEGGETMFPF 216
           G     R+A+ + YLSDVE GG T FPF
Sbjct: 434 GDDRGNRIATGIYYLSDVEAGGGTAFPF 461


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEG 209
           +  AT L  T+ E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+G
Sbjct: 397 LKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAIFYLSEVEQG 456

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 457 GATAFPF 463


>gi|448930897|gb|AGE54460.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus KS1B]
          Length = 239

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 67/134 (50%), Gaps = 10/134 (7%)

Query: 89  NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +F S  +C  +I  A K+  +K         + ++    +R S  T+ +  E K  I++ 
Sbjct: 57  DFLSDAECDILINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114

Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           I++K      ++   + + + E   V RY+ GQ Y  HYD  +  +  P+  QRLA+ ++
Sbjct: 115 IQNKTRELLDSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173

Query: 202 YLSDVEEGGETMFP 215
           YL   +EGGET FP
Sbjct: 174 YLKAPKEGGETDFP 187


>gi|341614920|ref|ZP_08701789.1| hypothetical protein CJLT1_08183 [Citromicrobium sp. JLT1363]
          Length = 210

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 69/150 (46%), Gaps = 17/150 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +P+  L      L    F  AE C  +I   + + +PS LA   G+        RTS
Sbjct: 17  MQRLPYDKLE----LLQKRGFLDAEFCAELITMIEAKRRPSTLADFDGDEF-----FRTS 67

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---- 185
               +   + ++  L+ +   +A  + +   +GE     RY +GQ++ +H D FNP    
Sbjct: 68  ETCDLPMDDPRSQRLDAM---LADLSGIDPAYGEPLQGQRYAVGQEFKAHCDYFNPDGQD 124

Query: 186 -AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
             +Y     QR  +F++YL++ E GG T F
Sbjct: 125 WEKYCSVAGQRTWTFMIYLNEPEAGGVTRF 154


>gi|323444954|gb|EGB01828.1| tentative prolyl-4-hydroxylase [Aureococcus anophagefferens]
          Length = 230

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 63/150 (42%), Gaps = 13/150 (8%)

Query: 68  ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTR 127
           AS+      VLSW PR           +   +IA +  R   S+L       V +    R
Sbjct: 11  ASVSRAHITVLSWEPRIHLHHGMLDENEIAHLIALSSSRFAASEL------EVGANDQRR 64

Query: 128 TSSGTFI-SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA 186
           TS    +  A E    ++     ++  A  +P  +GEA  V RY   + Y      +N  
Sbjct: 65  TSESWMVPRAVERDDAVVAGAVKRMHAAVGIPPRYGEALQVARYNSTRNY------YNFH 118

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
               + + R+A+ L+ L   +EGGET+FPF
Sbjct: 119 ADSAKSTARVATVLISLERADEGGETIFPF 148


>gi|94495931|ref|ZP_01302510.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
 gi|94424623|gb|EAT09645.1| 2OG-Fe(II) oxygenase [Sphingomonas sp. SKA58]
          Length = 229

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 61/132 (46%), Gaps = 13/132 (9%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           +F S ++C  +        +PS L      +  +    RTS    +S  +    ++E I 
Sbjct: 52  DFLSPDECAELRRLIDANAQPSTLF-----SGSANADYRTSHSGNLSPRDP---LVERIT 103

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM----SQRLASFLLYL 203
            +I   T LP  +GE     RY  GQ+Y  H D F   A+Y  +M     QR  + ++YL
Sbjct: 104 QRICALTGLPAINGETLQGQRYTPGQEYKVHCDYFPATADYWQRMRGTGGQRTWTAMIYL 163

Query: 204 SDVEEGGETMFP 215
           S VE GGET FP
Sbjct: 164 SAVEAGGETHFP 175


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 71/137 (51%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +T   R S  +++  ++D 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTTASYRVSKSSWLEETDDP 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 --VVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|85708137|ref|ZP_01039203.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
 gi|85689671|gb|EAQ29674.1| Prolyl 4-hydroxylase, alpha subunit [Erythrobacter sp. NAP1]
          Length = 211

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 13/130 (10%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F S E C  +I       +PS +A   G+        RTS    + A+E  T + EL E 
Sbjct: 34  FISPELCAELIRLIDLGRRPSTIADANGDDY-----FRTSETCDLDANE--TAVKEL-EA 85

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA-----EYGPQMSQRLASFLLYLS 204
           +      +   +GE     RY++GQ++ +H D F P      ++  +  QR  +F++YL+
Sbjct: 86  RFFSLNGIDPKYGEPVQGQRYDVGQEFKAHTDYFTPGGADFEKFCAESGQRTWTFMIYLN 145

Query: 205 DVEEGGETMF 214
           +VE GG T F
Sbjct: 146 EVEAGGATRF 155


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 451

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 452 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 511

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 512 FLNYMSDVEAGGATVFP 528


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S E+ +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP 471


>gi|341893923|gb|EGT49858.1| CBN-PHY-3 protein [Caenorhabditis brenneri]
          Length = 275

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 69/151 (45%), Gaps = 13/151 (8%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++SW P  + + N  +  Q    +   +KR    Q     G+++E+T   R ++G+
Sbjct: 41  VDMEIVSWSPTLVIYRNLLTPRQASEFLDFIEKRDMVMQKTSDNGKSIETTH--RRANGS 98

Query: 133 FIS--ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE--- 187
           FI   A+E  + +   ++ +I     L  T+ E F+ L Y  G  Y  HYD  +      
Sbjct: 99  FIDHGATEVTSEVHNWVQKRIP---ALNFTNAELFSALSYLPGGHYSVHYDYLSYRSEKE 155

Query: 188 ---YGPQMSQRLASFLLYLSDVEEGGETMFP 215
              +   M  R+ + +  L   E+GG T+FP
Sbjct: 156 HDWWMKNMRNRIGTLIFVLKTAEKGGGTVFP 186


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 66/139 (47%), Gaps = 9/139 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 396

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGP----QMSQRL 196
             ++  + H++   T L     E   V  Y +G +Y+ H+D     E           R+
Sbjct: 397 DPVVARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRV 456

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+FL Y+SDVE GG T+FP
Sbjct: 457 ATFLNYMSDVEAGGATVFP 475


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S E+ +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP 471


>gi|224006045|ref|XP_002291983.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972502|gb|EED90834.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 168

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 14/139 (10%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF +  +C  +I  +   L P+ +    G+       +RTSS  ++ A ED    L    
Sbjct: 8   NFLTPAECDFLIHASNDALGPAPVV---GKGAGEVSPSRTSSTCYL-AREDLPEYLR--- 60

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE-----YGPQMSQRLASFLLYL 203
            K+   T  P  H E   V RY   Q+Y  H+DAF+ +      +     QR  + L YL
Sbjct: 61  -KVGVLTGKPAEHCELPQVGRYLPSQQYLQHFDAFDLSNEDGRRFAANGGQRTITVLTYL 119

Query: 204 SDVEEGGETMFPFEVDLQI 222
           +DV  GG T FP  +++Q+
Sbjct: 120 NDVPRGGATSFP-NLNIQV 137


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 393 DPVVAQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 453 FLNYMSDVEAGGATVFP 469


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S E+ +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALE-TAHYRISKSAWLSGYE-- 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           + ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 SPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 450 -NRIATWLFYMSDVSAGGATVFP 471


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 454 FLNYMSDVEAGGATVFP 470


>gi|156405954|ref|XP_001640996.1| predicted protein [Nematostella vectensis]
 gi|156228133|gb|EDO48933.1| predicted protein [Nematostella vectensis]
          Length = 182

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 11/97 (11%)

Query: 129 SSGTFISASED-KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE 187
           SS  ++   ED K  IL  I     + +       E   + +Y++GQKY  HYD+     
Sbjct: 9   SSSLYLKNKEDSKITILRDIAQLAGKLSNTQWRFAEPVALTKYKVGQKYSLHYDS----- 63

Query: 188 YGPQMSQR----LASFLLYLSDVEEGGETMFPFEVDL 220
            G  M+QR     A+FL+YL+DV+ GGET+FP   ++
Sbjct: 64  -GFLMNQRRVKRTATFLVYLNDVKSGGETIFPLATNI 99


>gi|448930198|gb|AGE53763.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-3A]
 gi|448931603|gb|AGE55164.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus MA-1E]
          Length = 239

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 10/134 (7%)

Query: 89  NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +F S  +C  +I  A K+  +K         + ++    +R S  T+ +  E K  I++ 
Sbjct: 57  DFLSDAECDVLINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114

Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           I+ K      ++   + + + E   V RY+ GQ Y  HYD  +  +  P+  QRLA+ ++
Sbjct: 115 IQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173

Query: 202 YLSDVEEGGETMFP 215
           YL   +EGGET FP
Sbjct: 174 YLKAPKEGGETDFP 187


>gi|406859917|gb|EKD12978.1| hypothetical protein MBM_08740 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 466

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/204 (24%), Positives = 84/204 (41%), Gaps = 30/204 (14%)

Query: 18  LLLCSFFFLAGFYGSTFLSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSIPFQV 77
           L +C +  L+     T     VP++   L         N S +   E            +
Sbjct: 11  LYICVYLPLSSIILGTSHKPAVPAVTSML---------NNSLIATNEALSCPSHKYNTHI 61

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           LS  P  +Y  +F SAE+ + ++  ++ + + S +          + G RTS    +  S
Sbjct: 62  LSLEPLVIYIEDFLSAEESRHLLEISEDKFEASTV----------STGARTSIQRDVRDS 111

Query: 138 E----DKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
           E    D+  ++  IEH+ AR      P  H E     RY +G  Y  HYD    +    +
Sbjct: 112 EVALVDRDDVVRCIEHR-ARDFQGWRPDLHIERLRTQRYGVGGHYAHHYDWSGAS----R 166

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
            + R+++F++Y+    EGG T FP
Sbjct: 167 DADRVSTFMVYVDADCEGGGTEFP 190


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 11/141 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D     E  P   Q      
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 448

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A++L Y+SDV  GG T+FP
Sbjct: 449 RIATWLFYMSDVSAGGATVFP 469


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 75/152 (49%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 338 LQPVRKEVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 394

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 395 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 452

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 453 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 484


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 55/102 (53%), Gaps = 13/102 (12%)

Query: 120 VESTKGTRTSSGTFISASEDKTGI--LELIEHKIARATMLPQT--HGEAFNVLRYEIGQK 175
           ++     RTS+  F+    ++TGI  LE I  + A  T L  T    E   V+ Y +G +
Sbjct: 361 IDQADVDRTSNSVFM----EETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQ 416

Query: 176 YDSHYDAFNP-AEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           Y  H D F+  AE G     RLA+ L YL+DV++GG T+FPF
Sbjct: 417 YTPHCDYFDENAENG----DRLATVLFYLTDVQQGGATVFPF 454


>gi|448927821|gb|AGE51393.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CviKI]
          Length = 239

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 66/134 (49%), Gaps = 10/134 (7%)

Query: 89  NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +F S  +C  +I  A K+  +K         + ++    +R S  T+ +  E K  I++ 
Sbjct: 57  DFLSDAECDVLINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHK--IIDK 114

Query: 147 IEHKI-----ARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
           I+ K      ++   + + + E   V RY+ GQ Y  HYD  +  +  P+  QRLA+ ++
Sbjct: 115 IQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLMV 173

Query: 202 YLSDVEEGGETMFP 215
           YL   +EGGET FP
Sbjct: 174 YLKAPKEGGETDFP 187


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 383

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 384 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG-- 439

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 440 --NRIATWLFYMSDVSAGGATVFP 461


>gi|392546611|ref|ZP_10293748.1| 2OG-Fe(II) oxygenase [Pseudoalteromonas rubra ATCC 29570]
          Length = 932

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 72/148 (48%), Gaps = 11/148 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           R +   F    S ++CQS+I   +   +PS +  +QG   +S +G RTS    +     +
Sbjct: 743 RAQIFRFEGALSEQECQSLITRLEGHFQPSTVVDQQG---DSNQGYRTSCTACLEQISHQ 799

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-----MSQR 195
             I   I++KI+    +   + E+  + RY  GQ+Y  H D F+ +    Q     + QR
Sbjct: 800 ENI--KIKNKISALMGIDSAYLESLQIQRYGTGQEYKVHCDFFHESSPSYQKCIDSVGQR 857

Query: 196 LASFLLYLSDVEEGGETMFPFEVDLQIS 223
             + ++YL+D   GG T F  ++D+ ++
Sbjct: 858 TWTCVVYLNDDFSGGNTHFT-QLDITVT 884


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 85/199 (42%), Gaps = 22/199 (11%)

Query: 35  LSRDVPSIRPKLRTLEVVEKENESGLPHGETGDASIQSI------PF--------QVLSW 80
           L R+ P+I+PK     V + E           DA +  I      PF        +++  
Sbjct: 264 LIRNEPNIKPKPFNKSVGDFERGCRGEFPALTDAKLYCIYNTTSSPFLRLAPLKMELIGL 323

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
            P  + + +  S  +   +   AK  LK + +      T +  K TRT+   +   + ++
Sbjct: 324 DPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVK-TRTAKVAWFLDTFNQ 382

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ----RL 196
             + E +  +I   T       E   V+ Y +G  Y  H+D FN     P +SQ    R+
Sbjct: 383 --LTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTT-NPHISQINGDRI 439

Query: 197 ASFLLYLSDVEEGGETMFP 215
           A+ L YL+DVE+GG T+FP
Sbjct: 440 ATVLFYLNDVEQGGATVFP 458


>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 531

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 64/142 (45%), Gaps = 16/142 (11%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLA--LRQGETVESTKGTRTSSGTFIS-ASE 138
           P  L F    + E  + I   A  RL+PS++   + Q      T   R S   F   A E
Sbjct: 342 PDVLVFHEMITEEVAEKIRDVANPRLRPSEVIDPIIQKHV---TASYRVSKNVFFDDAFE 398

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA------EYGPQM 192
           ++  I   +   +  AT L     E   V  Y +G +Y+ H D  +P       E+G   
Sbjct: 399 EELEISRKLRPLVEDATDLNDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHG--- 455

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A+ L+YLSDVE GG+T+F
Sbjct: 456 -NRIATLLIYLSDVERGGDTVF 476


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 68/138 (49%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           PR + + N  S E+   I   AK +L  ++  +R  +T V S    R S   ++  ++D 
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKL--ARATVRDPKTGVLSVANYRVSKSAWLEENDDP 395

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++   T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 396 --VIARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTDGNRLA 452

Query: 198 SFLLYLSDVEEGGETMFP 215
           +FL Y+SDVE GG T+FP
Sbjct: 453 TFLNYMSDVEAGGATVFP 470


>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
 gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
          Length = 536

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 70/148 (47%), Gaps = 7/148 (4%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF++  LS  P  + + N  S  + + +   +K  L+ +++  R  +  +    +R++ G
Sbjct: 313 PFRMEELSLDPYVVLYHNVLSDPEIEKLKPMSKPFLERAKV-FRVEKGSDEIAPSRSADG 371

Query: 132 TFISASE---DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAE 187
            ++   +   D   +L  I  +I   T L    G     L+Y  G  +  HYD FN    
Sbjct: 372 AWLPHQDTDPDDLEVLRRIGRRIKDLTGLNTRSGSQMQFLKYGFGGHFVPHYDYFNSKTS 431

Query: 188 YGPQMSQRLASFLLYLSDVEEGGETMFP 215
           Y  ++  R+A+ L YL++V+ GG T FP
Sbjct: 432 YLERVGDRIATVLFYLNNVDHGGATAFP 459


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 273 LQPVRKEVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 329

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 330 KSAWLKDTADP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 387

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 388 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 419


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 68/143 (47%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S ++ +++   AK RL+ + ++      +E T   R S   +++  E  
Sbjct: 348 RPYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLE-TASYRISKSAWLTGYEHP 406

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E+I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 407 --VIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 461

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 462 -NRIATWLFYMSDVAAGGATVFP 483


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
           T   R S   ++   ED+  ++E +  + A  T L     E   V+ Y IG  Y+ H+D 
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443

Query: 183 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF 214
               E     G  +  R+A+ L Y+SDVE+GG T+F
Sbjct: 444 ARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVF 479


>gi|428184271|gb|EKX53127.1| hypothetical protein GUITHDRAFT_53615, partial [Guillardia theta
           CCMP2712]
          Length = 110

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/64 (48%), Positives = 39/64 (60%), Gaps = 5/64 (7%)

Query: 163 EAFNVLRYEIGQKYDSHYDAFNPAE--YGPQMSQRLASFLLYLSDVEE--GGETMFPFEV 218
           EA  V+ YE GQ+YD H D F P    Y  ++ QR+ SF  YLS+V E  GG T FP E+
Sbjct: 34  EAVQVVHYERGQRYDVHNDWFQPGTPYYHDRVWQRIISFFCYLSEVPEGQGGCTFFP-EL 92

Query: 219 DLQI 222
           DL+ 
Sbjct: 93  DLRF 96


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
           T   R S   ++   ED+  ++E +  + A  T L     E   V+ Y IG  Y+ H+D 
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443

Query: 183 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF 214
               E     G  +  R+A+ L Y+SDVE+GG T+F
Sbjct: 444 ARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVF 479


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 68/138 (49%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK RL  ++  +R  +T V +    R S  +++   ED 
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 397

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++   T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 398 DPVVAKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 456

Query: 198 SFLLYLSDVEEGGETMFP 215
           +FL Y+SDVE GG T+FP
Sbjct: 457 TFLNYMSDVEAGGATVFP 474


>gi|163955099|ref|YP_001648203.1| hypothetical protein OsV5_126r [Ostreococcus virus OsV5]
 gi|163638548|gb|ABY27907.1| hypothetical protein OsV5_126r [Ostreococcus virus OsV5]
          Length = 206

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/126 (30%), Positives = 63/126 (50%), Gaps = 11/126 (8%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
           F + E+ + II  A+K+L+ S +A    E     K  R S   ++ AS+    +++ +  
Sbjct: 28  FITEEERKHIIRKAQKKLEVSTVA----ENRVVDKKIRDSETAWLDASD---PVVKRVME 80

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEG 209
           K    T  P  + E   VLRY+ G  Y  H D F+      + ++R+ + +L L+D  EG
Sbjct: 81  KCVSLTDRPLVNCEHIQVLRYKPGGHYSPHQDTFSDT----KGNKRMYTVILALNDDYEG 136

Query: 210 GETMFP 215
           GET FP
Sbjct: 137 GETEFP 142


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + +    S  + +++   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 350

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 351 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 406

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 407 --NRIATWLFYMSDVSAGGATVFP 428


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 67/141 (47%), Gaps = 11/141 (7%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVH-YRISKSAWLSGYEDP 365

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ------ 194
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D     E  P   Q      
Sbjct: 366 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDE--PDAFQELGTGN 421

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R+A++L Y+SDV  GG T+FP
Sbjct: 422 RIATWLFYMSDVSAGGATVFP 442


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 47/96 (48%), Gaps = 6/96 (6%)

Query: 123 TKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA 182
           T   R S   ++   ED+  ++E +  + A  T L     E   V+ Y IG  Y+ H+D 
Sbjct: 386 TANYRISKSAWLKTQEDR--VIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDF 443

Query: 183 FNPAEY----GPQMSQRLASFLLYLSDVEEGGETMF 214
               E     G  +  R+A+ L Y+SDVE+GG T+F
Sbjct: 444 ARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVF 479


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           PR + + +  S E+ + I   AK RL  ++  +R  +T V +    R S   ++   +D 
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRL--ARATVRDPKTGVLTVANYRVSKSAWLEEYDDP 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++   T L +   E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 394 --VIGRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFD-FSRRPFDSNLKTEGNRLA 450

Query: 198 SFLLYLSDVEEGGETMFP 215
           ++L Y+SDVE GG T+FP
Sbjct: 451 TYLNYMSDVEAGGATVFP 468


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  ED 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVH-YRISKSAWLSGYEDP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +VL   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 324 LQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 380

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 381 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 438

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 439 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 470


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 68/142 (47%), Gaps = 15/142 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + + +  S  + + +   AK RL+ + ++      +E T   R S   +++A ED 
Sbjct: 345 RPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLE-TAHYRISKSAWLTAYEDP 403

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             +++ I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 404 --VVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 458

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A++L+Y+SDV  GG T+F
Sbjct: 459 -NRIATWLIYMSDVPSGGATVF 479


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +VL   P  + + +F S  + Q I   A   L+ S +A   GE  +  +  R S
Sbjct: 311 LQPVRKEVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVA--SGEKQQQVE-YRIS 367

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   IL  ++ +IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 368 KSAWLKDTVDP--ILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPS 425

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y      R+A+F++YLS VE GG T F +
Sbjct: 426 SPLYRMNSGNRVATFMIYLSSVEAGGSTAFIY 457


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 76/146 (52%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +++S +P  + + +F +  + + I + A   L+ S +A   GE  ++T   R S   ++ 
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVA--AGEK-QATADYRISKSAWLK 369

Query: 136 ASEDKTGILELIEHKIARATMLPQTH--GEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            S     I+  ++ +I+  T L   H  GE   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 370 GS--AQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 427

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 428 KTGNRVATFMIYLSPVEAGGSTAFIY 453


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 44  KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 101

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 102 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 157

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 158 --NRIATWLFYMSDVSAGGATVFP 179


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 69/146 (47%), Gaps = 8/146 (5%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L+  P   YF    ++ + + II      ++ S++   Q  T   T   RTS+ 
Sbjct: 329 PFKVEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNAT---TSEIRTSAN 385

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN-PAEYGP 190
           T++  +E+    L  I+ ++   T L     E   ++ Y IG +Y+ H+D    P +   
Sbjct: 386 TWLWYNENPW--LSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEPHFDFVEEPQKVFG 443

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
               R+ + L Y++DV  GG T FPF
Sbjct: 444 WKGNRMLTALFYINDVALGGATAFPF 469


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 1/67 (1%)

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-QRLASFLLYLSDVEEG 209
           ++ AT L  T  E   V  Y +G  Y+ H+D F  + + P     R+A+ + YLSDVE+G
Sbjct: 394 VSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFYLSDVEQG 453

Query: 210 GETMFPF 216
           G T FPF
Sbjct: 454 GATAFPF 460


>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
          Length = 187

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 50  ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 109

Query: 213 MFPF 216
            FPF
Sbjct: 110 AFPF 113


>gi|224013902|ref|XP_002296615.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968967|gb|EED87311.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 490

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 51/98 (52%), Gaps = 6/98 (6%)

Query: 122 STKGTRTSSGTFI---SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
           S   +RTS  T+    S  ED   +  +IE +IA  T     + E   +L+YE GQ Y  
Sbjct: 334 SVSESRTSHNTWCQEPSCYEDPL-VAPIIE-RIANVTGTKVENSEYLQMLQYEPGQYYRQ 391

Query: 179 HYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFPF 216
           H+D + P  +G     R+ +  +YL+DVEEGG T FP 
Sbjct: 392 HHD-YIPHHFGMPCGVRILTLFIYLNDVEEGGGTSFPL 428


>gi|363543307|ref|NP_001241869.1| prolyl 4-hydroxylase 6-4 precursor [Zea mays]
 gi|347978826|gb|AEP37755.1| prolyl 4-hydroxylase 6-4 [Zea mays]
          Length = 145

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/89 (38%), Positives = 52/89 (58%), Gaps = 5/89 (5%)

Query: 78  LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQ-GETVESTKGTRTSSGTFISA 136
           LS RPRA  +  F S  +C  I++ AK  ++ S +A    G++V S    RTSSGTF++ 
Sbjct: 38  LSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVAS--QARTSSGTFLAK 95

Query: 137 SEDKTGILELIEHKIARATMLPQTHGEAF 165
            ED+  I+  IE ++A  T LP+ + E+ 
Sbjct: 96  REDE--IVSAIEKRVAAWTFLPEENAESL 122


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 13/142 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   +   V      RT+ G ++    ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 70/137 (51%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLEEEDDP 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 --VVARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 13/142 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + +    SA +   +I  A + +K +++   +   V      RT+ G ++    ++ 
Sbjct: 326 PYVVLYHEVLSAREISMLIGKAAQNMKNTKI--HKERAVPKKNRGRTAKGFWLKKESNE- 382

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPA---------EYGPQM 192
            + + I  +I   T       E F V+ Y IG  Y  H D F+ A          Y   +
Sbjct: 383 -LTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDL 441

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A+ L YL+DVE+GG T+F
Sbjct: 442 GDRIATVLFYLTDVEQGGATVF 463


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + +    S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEEDDL 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  + H++ + T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 394 --VVARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFD-FSRRPFDITLKTEGNRLA 450

Query: 198 SFLLYLSDVEEGGETMFP 215
           +FL Y+SDVE GG T+FP
Sbjct: 451 TFLNYMSDVEAGGATVFP 468


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 66/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RP  + + +  S  +   I   AK RL+ + ++      +E T   R S   +++A ED 
Sbjct: 350 RPYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLE-TAPYRISKSAWLTAYEDP 408

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++E I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 409 --VVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 463

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 464 -NRIATWLFYMSDVSAGGATVFP 485


>gi|72000637|ref|NP_507251.2| Protein PHY-3 [Caenorhabditis elegans]
 gi|30145726|emb|CAB04736.2| Protein PHY-3 [Caenorhabditis elegans]
          Length = 318

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 9/149 (6%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGT 132
           +  +++SW P  + + N  S  Q  S +   ++R    Q     G ++E+T   R ++G+
Sbjct: 83  VDMEIISWAPTLVIYRNLMSPRQTASFLNFIEQRDLEIQKTSDFGTSIETTH--RRANGS 140

Query: 133 FISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN---PAEYG 189
           FI   ED    +E+      R   L  T  E F+ L Y  G  Y  HYD  +     +Y 
Sbjct: 141 FIPP-EDSNVTVEIKMQAQKRIPGLNLTVAEHFSALSYLPGGHYAVHYDYLDYRSKQDYD 199

Query: 190 PQMSQ---RLASFLLYLSDVEEGGETMFP 215
             M++   R+ + +  L   E+GG T+FP
Sbjct: 200 WWMNKTGNRIGTLIFVLKPAEKGGGTVFP 228


>gi|424864983|ref|ZP_18288870.1| oxidoreductase, 2OG-Fe(II) oxygenase family [SAR86 cluster
           bacterium SAR86B]
 gi|400759083|gb|EJP73273.1| oxidoreductase, 2OG-Fe(II) oxygenase family [SAR86 cluster
           bacterium SAR86B]
          Length = 418

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 57/132 (43%), Gaps = 7/132 (5%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
            PNF + E+C  +I         S + +  G         RTSS + +  +     I++ 
Sbjct: 4   IPNFITHEECDELIKMIDAHHTRSSVVV-GGTDRSDITDHRTSSTSNLDPN---NVIIKS 59

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM---SQRLASFLLYL 203
           I  KIA    L  + GE+     YE GQ +  H+D F    Y         R  + ++YL
Sbjct: 60  IHKKIADHLNLSISKGESLQGQLYEPGQYFKPHHDFFTGPAYDMHCLASGNRTHTLMIYL 119

Query: 204 SDVEEGGETMFP 215
           +D  EGGET FP
Sbjct: 120 NDDFEGGETNFP 131


>gi|149185530|ref|ZP_01863846.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
 gi|148830750|gb|EDL49185.1| hypothetical protein ED21_20934 [Erythrobacter sp. SD-21]
          Length = 241

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 63/142 (44%), Gaps = 14/142 (9%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
            P F +  +C  +I   +  L PS L      +  +  G RTS   + SA E +   L  
Sbjct: 53  MPGFLAPAECTRLIELIESNLLPSPLF-----SDPTGTGARTSQTHYFSAEEPEVAALGA 107

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLL 201
              K+     L +   E     RY++GQ+Y  H D F       Q+      QR  S ++
Sbjct: 108 ---KLDDLLGLERRQAETVQGQRYDVGQEYRHHRDFFRVEREHWQLERRRGGQRTWSAMV 164

Query: 202 YLSDVEEGGETMFPFEVDLQIS 223
           YL+ +E GGET FP  +DL ++
Sbjct: 165 YLNAIEAGGETDFPV-LDLSVA 185


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
 gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
          Length = 539

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 6/152 (3%)

Query: 69  SIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-R 127
           S + +  + L   P  +   N  S +    +   A+  ++ SQ+  +     E+     R
Sbjct: 316 SYEVLELEELHQDPFVVQVHNIVSQKDMNLLQKIARPNIQRSQVYAQDHNANETVAAAYR 375

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPA 186
           TS G      E ++  +EL+   +A  + L     E   +  Y IG  Y+ H+D F +  
Sbjct: 376 TSKGATFEYFEHRS--MELLSRHVADLSGLDMNSAELLQIANYGIGGHYEPHWDCFPDHH 433

Query: 187 EYGP--QMSQRLASFLLYLSDVEEGGETMFPF 216
            Y P  +   R+A+ + YLS+VE GG T FPF
Sbjct: 434 VYLPDDRDGNRIATGIYYLSEVEAGGGTAFPF 465


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 394 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 453

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 454 FLNYMSDVEAGGATVFP 470


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPF 216
            FPF
Sbjct: 460 AFPF 463


>gi|448928822|gb|AGE52391.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CvsA1]
          Length = 239

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 66/135 (48%), Gaps = 12/135 (8%)

Query: 89  NFASAEQCQSIIATAKKR--LKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
           +F S  +C  +I  A K+  +K         + ++    +R S  T+ +  E +  +++ 
Sbjct: 57  DFLSDAECDVLINAAIKKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHE--VIDK 114

Query: 147 IEHKIARATMLPQTHG------EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFL 200
           I++K  R  +  + H       E   V RY+ GQ Y  HYD  +  +  P+  QRLA+ +
Sbjct: 115 IQNK-TRELLNNKKHCIDKYIFEDVQVARYKPGQYYYHHYDGDDCDDACPK-DQRLATLM 172

Query: 201 LYLSDVEEGGETMFP 215
           +YL   EEGGET FP
Sbjct: 173 VYLKAPEEGGETDFP 187


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 391

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQMS-QRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 392 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 451

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 452 FLNYMSDVEAGGATVFP 468


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPF 216
            FPF
Sbjct: 460 AFPF 463


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPF 216
            FPF
Sbjct: 460 AFPF 463


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 406 FLNYMSDVEAGGATVFP 422


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 345

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 346 DPVVARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 405

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 406 FLNYMSDVEAGGATVFP 422


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 75/152 (49%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE  +     R S
Sbjct: 334 LQPIRKEVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEK-QLQVDYRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 359

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 360 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 415

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 416 --NRIATWLFYMSDVSAGGATVFP 437


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F    S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 394

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 395 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 450

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 451 --NRIATWLFYMSDVSAGGATVFP 472


>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
 gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
          Length = 281

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 6/93 (6%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYDAFN 184
           R S   ++   +D+  I+  +  +I   T L  T    E   VL Y +G +Y+ H+D   
Sbjct: 126 RISQQAWLHDKDDE--IVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHDYMT 183

Query: 185 PAE--YGPQMSQRLASFLLYLSDVEEGGETMFP 215
             E  +G  +  R+A+FL+YLSDV  GG T+FP
Sbjct: 184 AEEKMWGTILGNRMATFLMYLSDVTAGGATVFP 216


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 149 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 208

Query: 213 MFPF 216
            FPF
Sbjct: 209 AFPF 212


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P  Y  +   R+A+ + YLS+VE+GG T
Sbjct: 330 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGAT 389

Query: 213 MFPF 216
            FPF
Sbjct: 390 AFPF 393


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 345

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 346 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 401

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 402 --NRIATWLFYMSDVSAGGATVFP 423


>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
          Length = 421

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 74/179 (41%), Gaps = 30/179 (16%)

Query: 60  LPHGETGD--------ASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ 111
            P  E GD        A+  SI  +V S  PR L   +F + E+C  +I++AK  +  S 
Sbjct: 172 FPTCEVGDTFDIPLPSANRNSIA-KVRSISPRVLEVEDFLTPEECHELISSAKPLMSRST 230

Query: 112 LALRQGETVESTKGTRTSSGTFISASE--------DKTGILELI-----EHKIARATMLP 158
           ++      V   + +RTSS  ++            D+   L  I     EH +       
Sbjct: 231 VSAEGDSAVSLQESSRTSSTAWLPPHSHTLANKLYDRVSSLVGIDFRKHEHVVVEDLQAI 290

Query: 159 QTHG----EAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLASFLLYLSDVEEG 209
              G     A+ VLRYE+ Q Y  H+D F+P  +   +      R  +   YL+DVE G
Sbjct: 291 DKRGGSSVTAWQVLRYEVNQHYHIHHDYFDPVLHRGFLQGDGRNRFITAFFYLTDVERG 349


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 154 ATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQMSQRLASFLLYLSDVEEGGET 212
           AT L  T  E   V  Y +G  Y+ H+D F +P+ Y      R+A+ + YLS+VE+GG T
Sbjct: 400 ATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNRIATAIFYLSEVEQGGAT 459

Query: 213 MFPF 216
            FPF
Sbjct: 460 AFPF 463


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 393

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 394 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 452

Query: 198 SFLLYLSDVEEGGETMFP 215
           +FL Y+SDVE GG T+FP
Sbjct: 453 TFLNYMSDVEAGGATVFP 470


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score = 53.5 bits (127), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 46/89 (51%), Gaps = 3/89 (3%)

Query: 127 RTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NP 185
           RTS GT+I    D   + + IE +I     L   + E F V+ Y +G  Y +H D   + 
Sbjct: 345 RTSKGTWIE--RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDT 402

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
                +   R+A+ L YL+DVE+GG T+F
Sbjct: 403 WADKKEEDDRIATVLFYLTDVEQGGATVF 431


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score = 53.1 bits (126), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score = 53.1 bits (126), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 66/147 (44%), Gaps = 9/147 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A++F +  + E+   I   A  RL+ + +       +E T   RTS  
Sbjct: 322 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 380

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +  I+  I  +I   T L Q   E   V  Y IG  YD H+D     E    
Sbjct: 381 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 438

Query: 192 MS----QRLASFLLYLSDVEEGGETMF 214
            S     RLA+ L Y++  E GG T+F
Sbjct: 439 QSLNTGNRLATLLFYMTQPESGGATVF 465


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F    S  + + +   AK RL  S+  +   ET + +T   R S   ++S  ED
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYED 242

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 243 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG-- 298

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 299 --NRIATWLFYMSDVSAGGATVFP 320


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 66/147 (44%), Gaps = 9/147 (6%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A++F +  + E+   I   A  RL+ + +       +E T   RTS  
Sbjct: 323 PFKVEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELE-TASYRTSKS 381

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   E +  I+  I  +I   T L Q   E   V  Y IG  YD H+D     E    
Sbjct: 382 AWLKDEEHE--IVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAF 439

Query: 192 MS----QRLASFLLYLSDVEEGGETMF 214
            S     RLA+ L Y++  E GG T+F
Sbjct: 440 QSLNTGNRLATLLFYMTQPESGGATVF 466


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 39/138 (28%), Positives = 69/138 (50%), Gaps = 9/138 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED 
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDD 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS---QRLA 197
             ++  +  ++ + T L     E   V  Y +G +Y+ H+D F+   +   +     RLA
Sbjct: 393 DPVVAKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFD-FSRRPFDSTLKSEGNRLA 451

Query: 198 SFLLYLSDVEEGGETMFP 215
           +FL Y+SDVE GG T+FP
Sbjct: 452 TFLNYMSDVEAGGATVFP 469


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
 gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
          Length = 546

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 71/150 (47%), Gaps = 9/150 (6%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGE---TVESTKGT-R 127
           I  ++LS  P  +   +  S E+   +   +K  + PS+ A L   E     E   G+ R
Sbjct: 322 IKTEILSIDPFIVLLHDMVSVEEGALLRTFSKNMISPSETAELSDSEEKSIFEFEVGSFR 381

Query: 128 TSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF--NP 185
           TS   ++    ++  +   +  ++  AT L  +H E F V+ Y IG  ++SH+D    + 
Sbjct: 382 TSKSVWLDNDANEATLK--LTQRLGDATGLDISHSEPFQVINYGIGGIFESHFDTSLQDE 439

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
             +      RLA+ L YL+DV +GG T FP
Sbjct: 440 NRFLDGYMDRLATTLFYLNDVPQGGATHFP 469


>gi|219113801|ref|XP_002186484.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583334|gb|ACI65954.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 431

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 75/159 (47%), Gaps = 13/159 (8%)

Query: 65  TGDASIQSIP----FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETV 120
           + D  + ++P    F++L   P  L   +F +A +C   +A A +  K + L  R     
Sbjct: 185 SADLPLDALPPFEKFRILHVDPLVLGIDDFFTAAECDRYVALATQNQKHNILESRSPTVG 244

Query: 121 E--STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
           +  + K  RTS+ T+ +  E   G+ EL+        +      E    +RY   +K+  
Sbjct: 245 KDAAAKAQRTST-TYYNHFE---GVPELMAKASRLLGLQSIDRWEEPQTVRYRKNEKFTW 300

Query: 179 HYDAFNPAEYGPQM-SQRLASFLLYLSDV--EEGGETMF 214
           H DA  P E  P +  QRLA+ L+Y +D+  +EGG TMF
Sbjct: 301 HLDALGPNENNPGLGGQRLATLLVYHTDLQSDEGGATMF 339


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 51/94 (54%), Gaps = 7/94 (7%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           TRTS  T+++ S +   +   +  +I+  T       E   V+ Y +G  YD H+D FN 
Sbjct: 372 TRTSKVTWLTDSLNPLTVR--LNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFN- 428

Query: 186 AEYGPQMSQ----RLASFLLYLSDVEEGGETMFP 215
           A     +++    R+A+ L YL+DVE+GG T+FP
Sbjct: 429 ATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFP 462


>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 71/160 (44%), Gaps = 12/160 (7%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           H  T    I  +  Q ++  P  + + +  S ++  +II+ +K  +  S +     + V 
Sbjct: 296 HNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMVGDDHEKAVS 355

Query: 122 STKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD 181
            T   RTSS  ++   +    ++  +  +    T L  T  E   V  Y IG  Y  HYD
Sbjct: 356 KT---RTSSNAWLD--DVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYD 410

Query: 182 AFNPAEYGPQM------SQRLASFLLYLSDVEEGGETMFP 215
            +  AE G ++        R+A+ + YLSDV  GG T+FP
Sbjct: 411 -YAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP 449


>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
          Length = 507

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|260665980|ref|YP_003212934.1| hypothetical protein H665_p111 [Ostreococcus tauri virus 1]
 gi|260160998|emb|CAY39699.1| hypothetical protein OTV1_111 [Ostreococcus tauri virus 1]
          Length = 185

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 65/134 (48%), Gaps = 13/134 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR +    F + E+ + II  A+K+L+ S +A    E     K  R S   ++   +D  
Sbjct: 10  PRVI--KEFITEEERKHIIRKAQKKLEVSTVA----ENRVVDKKVRDSETAWL---DDSD 60

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLL 201
            +++ +  K    T  P  + E   VLRY+ G  Y  H D F+      + ++R+ + +L
Sbjct: 61  PVVKRVMEKCVSLTDRPLVNCEHIQVLRYKPGGHYSPHQDTFSDT----KGNKRMYTVIL 116

Query: 202 YLSDVEEGGETMFP 215
            L+D  EGGET FP
Sbjct: 117 GLNDDYEGGETEFP 130


>gi|384252051|gb|EIE25528.1| hypothetical protein COCSUDRAFT_40745 [Coccomyxa subellipsoidea
           C-169]
          Length = 456

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 45/141 (31%), Positives = 64/141 (45%), Gaps = 32/141 (22%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
             ++S  PR L   +F S  +CQ  +A A+  L+ S +A   G  V S    RTSS  F+
Sbjct: 283 ISLVSTSPRMLMIKDFLSPRECQEFMAVAEPLLQNSLVAC--GSCVPS----RTSSSMFL 336

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQ 194
               ++   +  IE+                   R E GQ YD+H+D         +  +
Sbjct: 337 QGRHEREPCV--IEYD------------------RKE-GQFYDAHFD-----NKSNECWR 370

Query: 195 RLASFLLYLSDVEEGGETMFP 215
           R A+ + YLSDVEEGG T FP
Sbjct: 371 RAATLITYLSDVEEGGATFFP 391


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 66/154 (42%), Gaps = 17/154 (11%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           I  I  ++L + P A+ F N  S  + + I   A  +LK +        TV+++K     
Sbjct: 313 IAPIKVEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRA--------TVQNSKTGELE 364

Query: 130 SGTF-ISASE----DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
             T+ IS S     D   +++ +  +I   T L Q   E   V  Y +G  YD H+D   
Sbjct: 365 HATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFAR 424

Query: 185 PAEYGP----QMSQRLASFLLYLSDVEEGGETMF 214
             E           R+A+ L Y+S  E GG T+F
Sbjct: 425 KEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF 458


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 71/154 (46%), Gaps = 15/154 (9%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q    +V+  RP    F +  S ++ Q++I  +  RLK + +   +   +E     R S
Sbjct: 314 LQPSKLEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELE-VANYRIS 372

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-------- 181
              ++   + +  ++E +  +    T L     E   V+ Y IG  Y++H+D        
Sbjct: 373 KSAWLKNHDHE--VVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKD 430

Query: 182 AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           AF     G     R+A+++ Y+SDV+ GG T+FP
Sbjct: 431 AFKQLGTG----NRIATWINYMSDVKAGGATVFP 460


>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 225

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 62/121 (51%), Gaps = 19/121 (15%)

Query: 95  QCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARA 154
           +C  +++  +  ++ S LA         T G R SS   I   ED   ++  IE +I+  
Sbjct: 2   ECDHLVSMGRGNME-SSLAF--------TDGDRNSSYNNI---EDI--VVSKIEDRISLW 47

Query: 155 TMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           + LP+ +GE+  VL+Y + +       +          + RLA+ L+YLSDV++GGET+F
Sbjct: 48  SFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVF 102

Query: 215 P 215
           P
Sbjct: 103 P 103


>gi|224001508|ref|XP_002290426.1| prolyl 4-hydroxylase alpha subunit-like protein [Thalassiosira
           pseudonana CCMP1335]
 gi|220973848|gb|EED92178.1| prolyl 4-hydroxylase alpha subunit-like protein, partial
           [Thalassiosira pseudonana CCMP1335]
          Length = 180

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 37/137 (27%), Positives = 62/137 (45%), Gaps = 13/137 (9%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  +   NF S+ + + ++         S     + +  ++   TRTSS  +        
Sbjct: 1   PWVITLENFVSSTEMEILMEWG------SHFGYDRSQAGDTISETRTSSQAWCFNDCYDD 54

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP---AEYGPQMSQRLAS 198
            I++ +  +I   T +P+ + EA  +L Y+ GQ Y  H D         +GP    RL +
Sbjct: 55  PIVKGLRQRIHEVTGIPEKNYEALQLLHYDEGQFYKPHDDFIRKHVVQGHGP----RLLT 110

Query: 199 FLLYLSDVEEGGETMFP 215
           F +Y ++VEEGG T FP
Sbjct: 111 FFMYFNEVEEGGGTRFP 127


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 75/150 (50%), Gaps = 12/150 (8%)

Query: 74  PF--QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF  +V+   P  + + +F +  + Q I   A+  L+ S +A   GE  +     R S  
Sbjct: 336 PFRKEVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVA--SGEK-QLQVDYRISKS 392

Query: 132 TFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE- 187
            ++  S D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+  
Sbjct: 393 AWLKDSVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSP 450

Query: 188 -YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
            Y  +   R+A+F++YLS VE GG T F +
Sbjct: 451 LYRMKSGNRVATFMIYLSAVEAGGATAFIY 480


>gi|323448532|gb|EGB04429.1| hypothetical protein AURANDRAFT_67177 [Aureococcus anophagefferens]
          Length = 327

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 46/92 (50%), Gaps = 3/92 (3%)

Query: 125 GTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN 184
           G RTS    ++ + D      L+    A     P+   EA  V RYE G K+ +H DA  
Sbjct: 172 GGRTSEAVALNGAGDAALAAPLVARLAAALGASPRDF-EAPVVTRYERGAKFATHNDASA 230

Query: 185 PAE--YGPQMSQRLASFLLYLSDVEEGGETMF 214
             E  +G    QRLA+ +LYL+DV  GGET F
Sbjct: 231 DPERDWGDAGGQRLATVILYLNDVAAGGETSF 262


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 41/142 (28%), Positives = 65/142 (45%), Gaps = 15/142 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           RPR + +    S  + +++   AK RL+ + ++      +E T   R S   ++S  E  
Sbjct: 336 RPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLE-TAPYRISKSAWLSGYEHS 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
           T  +E I  +I   T L     E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 395 T--IERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 449

Query: 193 SQRLASFLLYLSDVEEGGETMF 214
             R+A++L Y+SDV  GG T+F
Sbjct: 450 -NRIATWLFYMSDVSAGGATVF 470


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 236

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 237 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 292

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 293 --NRIATWLFYMSDVSAGGATVFP 314


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  ++L   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 65/127 (51%), Gaps = 7/127 (5%)

Query: 92  SAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHK 150
           S E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED   ++  +  +
Sbjct: 2   SDEEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRR 57

Query: 151 IARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLASFLLYLSDVEE 208
           +   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+FL Y+SDVE 
Sbjct: 58  MQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEA 117

Query: 209 GGETMFP 215
           GG T+FP
Sbjct: 118 GGATVFP 124


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score = 53.1 bits (126), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 159

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 160 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 215

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 216 --NRIATWLFYMSDVSAGGATVFP 237


>gi|345324764|ref|XP_001505668.2| PREDICTED: LOW QUALITY PROTEIN: transmembrane prolyl 4-hydroxylase
           [Ornithorhynchus anatinus]
          Length = 495

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 53/115 (46%), Gaps = 17/115 (14%)

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQT---HGEAFNVLRYEIGQ 174
           + V+ +   R S  T++   E    ++  I+ ++ R T LPQ    H E   V+RY+ G 
Sbjct: 259 QKVKMSDLVRNSQHTWLYQGEGAHQVMRSIQQRVLRLTRLPQEIVEHSEPLQVVRYDQGG 318

Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFP 215
            Y +H D+             F   E  P + S R  + L YL++V  GGET FP
Sbjct: 319 HYHAHMDSGPVFPETACSHTKFITNETAPFETSCRYVTVLFYLNNVTGGGETTFP 373


>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
          Length = 586

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 321 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 377

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 378 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 435

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 436 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 467


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 3/58 (5%)

Query: 161 HG-EAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
           HG E   ++ Y +G  YD HYD FN   +        R+A+ L YL+DVE+GG T+FP
Sbjct: 407 HGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRIATVLFYLTDVEQGGATVFP 464


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 66/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F    S  + + +   AK RL+ + ++      +E T   R S   ++S  ED 
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLE-TAHYRISKSAWLSGYEDP 395

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 396 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTG--- 450

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 451 -NRIATWLFYMSDVSAGGATVFP 472


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 61/146 (41%), Gaps = 22/146 (15%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +P+     N  +  + + I   A+ RL+ ++        VES     T  G   S    K
Sbjct: 321 KPKLWVLHNILTDPEMEVIKKLAQPRLRRAR--------VESPT---TGEGELASYRISK 369

Query: 141 TGILELIEHKIAR--------ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YG 189
           +  L   EH++ R         T L     E   V+ Y IG  Y+ H+D     E     
Sbjct: 370 SAWLYDWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALD 429

Query: 190 PQMSQRLASFLLYLSDVEEGGETMFP 215
           P    R+A+ L Y+SDVE GG T+FP
Sbjct: 430 PNEGDRIATMLFYMSDVEAGGATVFP 455


>gi|241044301|ref|XP_002407178.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215492128|gb|EEC01769.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 554

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 72/165 (43%), Gaps = 16/165 (9%)

Query: 63  GETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPS-QLALRQGETVE 121
           G+ G   +Q I  +  + +P  +   N       + ++A AK RL+ S    +R  E  +
Sbjct: 316 GQHGFFKLQPIKVEEANLKPYIVVMHNVIQDRDIEDLMAFAKPRLQRSTHYGVRGMEASQ 375

Query: 122 STKGTRTSSGTFISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
                RTSS  +++  +    T +   +   +   T       E + +  Y IG +Y SH
Sbjct: 376 ----VRTSSNAWLNDLDAPVATRLNRFLRSLLGLGTTYLGGEAEQYQLANYGIGGQYMSH 431

Query: 180 YDAFNPAEYGPQM---------SQRLASFLLYLSDVEEGGETMFP 215
           +D      + P             R+A+ ++Y+SDVEEGG T+FP
Sbjct: 432 HDYLQDTYHIPNRVTDDFEKTSGDRIATLMVYMSDVEEGGATVFP 476


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 73/146 (50%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V+  RP    + +F S  + Q I   A+  L+ S +A   GE     +  R S   ++ 
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVA--SGEKQLPVE-YRISKSAWLK 361

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+   Y  
Sbjct: 362 DTVDP--MLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRM 419

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+F++YLS VE GG T F +
Sbjct: 420 KSGNRVATFMIYLSAVEAGGATAFIY 445


>gi|347527209|ref|YP_004833956.1| putative oxygenase [Sphingobium sp. SYK-6]
 gi|345135890|dbj|BAK65499.1| putative oxygenase [Sphingobium sp. SYK-6]
          Length = 229

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 54/131 (41%), Gaps = 14/131 (10%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH 149
             S  +C  +I       KPS+L         +  G RTSS   +   +     +  I  
Sbjct: 49  LVSPAECAKLIDKIDAGCKPSKL------FSGTASGYRTSSSCNLDIYDP---FVIAITK 99

Query: 150 KIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-NPAEYGPQM----SQRLASFLLYLS 204
           +I     +   HGE     RYE  Q+Y  H D F     Y P M     QR  + + YL 
Sbjct: 100 RIDALMGMEGDHGELLQGQRYEQTQQYHLHCDYFPGNVHYWPAMRVSGGQRCWTAMAYLC 159

Query: 205 DVEEGGETMFP 215
           DVEEGGET FP
Sbjct: 160 DVEEGGETHFP 170


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP 469


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 71/146 (48%), Gaps = 10/146 (6%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           + LS +P  + + +F S  + + I   A+  L+ S +A R  +    T   R S   ++ 
Sbjct: 81  ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQV---TAEYRISKSAWLK 137

Query: 136 ASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPAE--YGP 190
            S      +  ++ +I+  T L     HGE   V+ Y IG  Y+ H+D A +P+   +  
Sbjct: 138 GSAQSA--VSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKL 195

Query: 191 QMSQRLASFLLYLSDVEEGGETMFPF 216
           +   R+A+ ++YLS VE GG T F +
Sbjct: 196 KTGNRVATVMIYLSSVEAGGSTAFIY 221


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL+  +  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLR--RATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVSAGGATVFP 469


>gi|372272594|ref|ZP_09508642.1| Procollagen-proline dioxygenase [Marinobacterium stanieri S30]
          Length = 217

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/149 (24%), Positives = 74/149 (49%), Gaps = 16/149 (10%)

Query: 75  FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFI 134
            ++L+  P  +   +  + ++ + +I  A+++L  ++++      V  T G R+    ++
Sbjct: 1   MEILNHHPLVMTLDHILTPKEAKGLIDLAREKLDRAKVSFDDAYGV--TDG-RSGQNCWL 57

Query: 135 SASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFN--------PA 186
             ++    + + +  +IA+   +P  + E+  VL Y   Q+Y +HYDA++          
Sbjct: 58  RYAD--YPLAKQVGDRIAKLAGIPLENAESLQVLHYGPEQEYRAHYDAYDLSTARGQRCC 115

Query: 187 EYGPQMSQRLASFLLYLSDVEEGGETMFP 215
            YG    QRL + L+YL+ VE GG T FP
Sbjct: 116 RYG---GQRLVTALVYLNAVEAGGGTAFP 141


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 69/137 (50%), Gaps = 7/137 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDK 140
           P  + + +  S E+ + I   AK +L  ++  +R  +T V +    R S  +++   +D 
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKL--ARATVRDPKTGVLTVANYRVSKSSWLEEGDDP 394

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLAS 198
             ++  +  ++   T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+
Sbjct: 395 --VIAQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLAT 452

Query: 199 FLLYLSDVEEGGETMFP 215
           FL Y+SDVE GG T+FP
Sbjct: 453 FLNYMSDVEAGGATVFP 469


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 66/142 (46%), Gaps = 6/142 (4%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
           +V++  P    + + AS  +   +I   + ++  S +     + V  +   RTS  ++++
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMVGDAAKKEVSKS---RTSQNSWLT 370

Query: 136 ASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS-- 193
              D   +  L       A  L +T  E+  V  Y IG  Y  HYD        P+++  
Sbjct: 371 -DYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTG 429

Query: 194 QRLASFLLYLSDVEEGGETMFP 215
            R+A+ + YLSDVEEGG T+FP
Sbjct: 430 NRIATLMFYLSDVEEGGATVFP 451


>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
           pulchellus]
          Length = 548

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 68/165 (41%), Gaps = 14/165 (8%)

Query: 62  HGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE 121
           +G  G   +Q +  +  + +P  + F +         ++A A  RL  S      GE   
Sbjct: 316 YGRNGFLRLQPVKIEEANLKPYIITFHDIIGDRDINDLLAYATPRLFRST---HYGEHGT 372

Query: 122 STKGTRTSSGTFISASED--KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            T   RTSS  ++   +    T +   +E  +   +   +   E + +  Y +G +Y +H
Sbjct: 373 ETSLIRTSSTAWLGDQDAPVATRLNRFVESLLGLGSQYLKGEAEYYQLANYGVGGQYIAH 432

Query: 180 YDAFNPAEYGPQ---------MSQRLASFLLYLSDVEEGGETMFP 215
           +D        P             R+A+ + YLSDVEEGG T+FP
Sbjct: 433 HDFLADIYADPNRKLDDFERSAGDRIATLMFYLSDVEEGGATVFP 477


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 325 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 381

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 382 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 439

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 440 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 471


>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
           ANT32C12]
          Length = 186

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/103 (32%), Positives = 55/103 (53%), Gaps = 10/103 (9%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           +RT+S  +I    D + I+  +  + +    +P  + E F ++ Y  G +Y  H+DAF+ 
Sbjct: 39  SRTNSYAWIQ--HDASEIIHEVSKRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDK 96

Query: 186 A-EYGPQM----SQRLASFLLYLSDVEEGGETMFPFEVDLQIS 223
           + E G        QR+ + L YL+DVE+GG T FP   D+ +S
Sbjct: 97  STEEGRNNWFPGGQRMVTALAYLNDVEDGGATDFP---DIHVS 136


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 268 LQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 324

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+I   T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 325 KSAWLKDTVDP--LLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 382

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 383 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 414


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 69/149 (46%), Gaps = 8/149 (5%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLA-LRQGETVESTKGTRT 128
           ++ +  +VL   P    +    + ++ + II  AK  L+ + +  +  G+ + +    R 
Sbjct: 321 LKPLKMEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYA--DYRV 378

Query: 129 SSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD---AFNP 185
           S  T+I+  ED   I   I  ++   T L   + E   V  Y I  +Y+ H+D      P
Sbjct: 379 SKNTWIA--EDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRP 436

Query: 186 AEYGPQMSQRLASFLLYLSDVEEGGETMF 214
             +      R+A+ LLYLSDV+ GG T+F
Sbjct: 437 KHFDRWGGNRIATMLLYLSDVDWGGRTVF 465


>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
          Length = 583

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 318 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVA--SGEKQLQVE-YRIS 374

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 375 KSAWLKDTVDP--MLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 432

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 433 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 464


>gi|401397395|ref|XP_003880042.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
 gi|325114451|emb|CBZ50007.1| hypothetical protein NCLIV_004830 [Neospora caninum Liverpool]
          Length = 308

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 67/156 (42%), Gaps = 19/156 (12%)

Query: 65  TGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQ----LALRQGETV 120
           TG    + I    +   P     P   +   C  ++   + R + S+     A  +    
Sbjct: 102 TGQLEERVIQILAIHENPGVYLIPELLTDSDCAHLLQLCEGRWERSKTSTGYATAEPRDY 161

Query: 121 ESTKG-TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSH 179
            S K  +RTS    ++  E  TG++E IE  ++    +P  H E   V+RYE GQ +  H
Sbjct: 162 MSKKSPSRTSWSVPLAIGE--TGVVENIERIVSAFAGMPVEHLEPLAVVRYEEGQYFKQH 219

Query: 180 YD-AFNPAEYGPQMSQRLASFLLYLSDVEEGGETMF 214
           +D  F P            + LLYL+DVE GGET F
Sbjct: 220 HDGGFRP-----------KTVLLYLNDVEAGGETSF 244


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 74/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q +  +V+   P  + + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 329 LQPVRKEVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVA--SGEKQLPVE-YRIS 385

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+I   T L     + E   V+ Y IG  Y+ H+D A +P 
Sbjct: 386 KSAWLKDTVDP--LLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPT 443

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 444 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 475


>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
 gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
          Length = 540

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 78/179 (43%), Gaps = 9/179 (5%)

Query: 43  RPKLRTLEVVEKENESGLPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIAT 102
           R +L+     +++    L  G    +S Q    + L   P  +   +  SAE+   +   
Sbjct: 292 REELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQL 351

Query: 103 AKKRLKPSQL-ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTH 161
           A+  L+ S + +L   E +  +   R S GTF    E    I++ +   +   + L    
Sbjct: 352 ARPELQRSMVYSLSNSEHI--STNFRISQGTFFEYHEHP--IMQRMSQHLENISGLDMRS 407

Query: 162 GEAFNVLRYEIGQKYDSHYDAFNPAE-YGPQM---SQRLASFLLYLSDVEEGGETMFPF 216
            E   V  Y IG  Y+ H D+F+    YG      + R+A+ + YLS+VE GG T FPF
Sbjct: 408 AEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF 466


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 69/151 (45%), Gaps = 16/151 (10%)

Query: 73  IPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGT-RTSSG 131
           I  ++LS  P  L F +  S ++   I +++K+ + PS          E    T RTS  
Sbjct: 328 IKTEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKS 387

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDA-------FN 184
            + S++ + T   + I  ++  AT L     E F V+ Y +G  +++H D        FN
Sbjct: 388 VWYSSTSNDTT--KRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFN 445

Query: 185 PAEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
                     RLA+ L YL++V +GG T FP
Sbjct: 446 GTR------DRLATTLFYLNEVRQGGGTHFP 470


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 76/152 (50%), Gaps = 9/152 (5%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P  + + +F +  + Q I   A+  +  +++ +  GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWVT-AEILVASGEKQLPVE-YRIS 391

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D   +L  ++H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 392 KSAWLKDTVDP--MLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPS 449

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 450 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 481


>gi|254513749|ref|ZP_05125810.1| oxidoreductase, 2OG-Fe(II) oxygenase family [gamma proteobacterium
           NOR5-3]
 gi|219675992|gb|EED32357.1| oxidoreductase, 2OG-Fe(II) oxygenase family [gamma proteobacterium
           NOR5-3]
          Length = 375

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 55/133 (41%), Gaps = 12/133 (9%)

Query: 87  FPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILEL 146
            P F  AEQC  +I   +    PSQ+     +  +     R+S    +   E     +  
Sbjct: 190 LPEFLKAEQCDRLIDIIRSNAHPSQV-----DGYQQQSDMRSSRTCNLDVREHP--YIAE 242

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP-----AEYGPQMSQRLASFLL 201
           I+  I+RA  +     E      YE GQ+Y  H D F P     A +     QR  +F++
Sbjct: 243 IDDAISRALGISLGWSEINQGQWYEPGQQYKPHPDYFPPGTPEYARFAATSGQRTWTFMI 302

Query: 202 YLSDVEEGGETMF 214
           YL+  E GG T F
Sbjct: 303 YLNKTERGGGTHF 315


>gi|126736198|ref|ZP_01751941.1| response regulator receiver domain protein (CheY-like) [Roseobacter
           sp. CCS2]
 gi|126714364|gb|EBA11232.1| response regulator receiver domain protein (CheY-like) [Roseobacter
           sp. CCS2]
          Length = 217

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 67/150 (44%), Gaps = 20/150 (13%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-----TRTS- 129
            VL   P      +    +  Q +I+         Q AL +   V+S  G     +RT+ 
Sbjct: 7   NVLCLNPLVAVIDDVFDEDLAQHVISLG-------QEALVRATVVDSAGGGKLDESRTND 59

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYG 189
           SGT    S+ K   L  +   I+    LP  + E   +LRYE  QK+D H DAF+    G
Sbjct: 60  SGTIDQWSDPK---LASLVTTISDLVRLPPENSEPSQLLRYEGEQKFDPHTDAFDNTVGG 116

Query: 190 PQM----SQRLASFLLYLSDVEEGGETMFP 215
                   QRL + + YL++V +GGET FP
Sbjct: 117 RDFISRGGQRLFTTICYLNNVGKGGETEFP 146


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 68/144 (47%), Gaps = 17/144 (11%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVE-STKGTRTSSGTFISASED 139
           +PR + F +  S  + + +   AK RL  S+  +   ET + +T   R S   ++S  E+
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRL--SRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 140 KTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQ 191
              ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G  
Sbjct: 392 P--VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG-- 447

Query: 192 MSQRLASFLLYLSDVEEGGETMFP 215
              R+A++L Y+SDV  GG T+FP
Sbjct: 448 --NRIATWLFYMSDVLAGGATVFP 469


>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
 gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
          Length = 242

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 71/162 (43%), Gaps = 9/162 (5%)

Query: 60  LPHGETGDASIQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL-ALRQGE 118
           L  G    +S Q    + L   P  +   +  SAE+   +   A+  L+ S + +L   E
Sbjct: 11  LHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSE 70

Query: 119 TVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDS 178
            + +    R S GTF    E    I++ +   +   + L     E   V  Y IG  Y+ 
Sbjct: 71  HISTN--FRISQGTFFEYHEHP--IMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEP 126

Query: 179 HYDAFNPAE-YGPQM---SQRLASFLLYLSDVEEGGETMFPF 216
           H D+F+    YG      + R+A+ + YLS+VE GG T FPF
Sbjct: 127 HMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPF 168


>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
 gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
          Length = 498

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 1/79 (1%)

Query: 139 DKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ-MSQRLA 197
           + T +++ +  ++   T L     +A  ++ Y +G  YD HYD+ N +E     +  R+A
Sbjct: 351 NDTAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIA 410

Query: 198 SFLLYLSDVEEGGETMFPF 216
           + L Y+ +V+ GG T FP+
Sbjct: 411 TVLFYVGEVDSGGATTFPY 429


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|326928035|ref|XP_003210190.1| PREDICTED: WD repeat-containing protein 6-like [Meleagris
           gallopavo]
          Length = 900

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 52/116 (44%), Gaps = 17/116 (14%)

Query: 118 ETVESTKGTRTSSGTFISASEDKTGILELIEHKIARATMLP---QTHGEAFNVLRYEIGQ 174
           + V+++   R S  T++   E    ++  I  ++ R T LP     H E   V+RY+ G 
Sbjct: 134 QKVKTSDAVRNSQHTWLYQGEGAHQVMRAIRQRVMRLTRLPPEIVEHSEPLQVVRYDQGG 193

Query: 175 KYDSHYDA-------------FNPAEYGP-QMSQRLASFLLYLSDVEEGGETMFPF 216
            Y +H D+                 E  P + S R  + L YL++V  GGET+FP 
Sbjct: 194 HYHAHMDSGPVFPETACSHTKLVANESAPFETSCRYVTVLFYLNNVTGGGETVFPI 249


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 4/92 (4%)

Query: 126 TRTSSGTFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP 185
           TRTS   +   S +   +   +  +IA  T       E   ++ Y +G  YD HYD FN 
Sbjct: 372 TRTSKVAWFPDSYNPLTVR--LNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNT 429

Query: 186 --AEYGPQMSQRLASFLLYLSDVEEGGETMFP 215
             +        R+A+ L YL+DVE+GG T+FP
Sbjct: 430 INSNLTAMSGDRIATVLFYLTDVEQGGATVFP 461


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
 gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
 gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
          Length = 239

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 14/131 (10%)

Query: 89  NFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKTGILELIE 148
           NF + E+C+ I+   + +L  S++           K  R S   ++S  +    +++ + 
Sbjct: 61  NFINKEKCKEIMNNTQNKLFDSEV------ISGKNKAIRNSQQCWVSKYDP---MVKSMF 111

Query: 149 HKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAF-----NPAEYGPQMSQRLASFLLYL 203
            KI++   +P  + E   V+RY  GQ Y+ H+DA         E+  +  QR  + L+YL
Sbjct: 112 QKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVYL 171

Query: 204 SDVEEGGETMF 214
           ++  EGG T F
Sbjct: 172 NNEFEGGHTFF 182


>gi|341615768|ref|ZP_08702637.1| hypothetical protein CJLT1_12463 [Citromicrobium sp. JLT1363]
          Length = 223

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 68/141 (48%), Gaps = 20/141 (14%)

Query: 90  FASAEQCQSIIATAKKRLKPSQLALRQGETVEST--KGTRTS-SGTFISASEDKTGILEL 146
           F ++ +C+ +        +PS L        E T   G RTS SG F S       ++++
Sbjct: 47  FLTSNECRKLAGMIDMVARPSTL-------YEGTYKDGFRTSYSGNFDSQDP----MVKM 95

Query: 147 IEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQM-----SQRLASFLL 201
           I  ++  A  LP   GE     RY  GQ++  H+D F P E   +M      QR  + +L
Sbjct: 96  ISRRMDDALGLPGKIGETMQGQRYLPGQQFKDHHDYFYPEEEYWKMERKNGGQRSWTAML 155

Query: 202 YLSDVEEGGETMFPFEVDLQI 222
           +L++V+ GG T FP E+ L+I
Sbjct: 156 FLNNVKAGGATSFP-ELGLRI 175


>gi|429856435|gb|ELA31343.1| 2og-fe oxygenase family protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 279

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 78/172 (45%), Gaps = 24/172 (13%)

Query: 60  LPHGETGD---ASIQSIP----FQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQL 112
           LPH +T +    S++ +P     Q++S  P  +Y  NF S     +++A  +   KPS +
Sbjct: 22  LPHLQTLNLPSPSLKCLPQNYTTQIVSIDPLVIYIHNFLSESDIPALLAAGEPAFKPSYV 81

Query: 113 ALRQGETVESTKGTRTSSGTFISASEDKTGILELIEH----KIARATMLPQTHGEAFNVL 168
             + G    +   T +S+G  +  +  +  +L   E     ++ R  M P        ++
Sbjct: 82  -YKDGRRQGTPDRTSSSAGLPLDDAAVRC-VLARAERFMGARLGRDEMGPP------QLV 133

Query: 169 RYEIGQKYDSH---YDAFNPAEYG--PQMSQRLASFLLYLSDVEEGGETMFP 215
           RY  GQ+++ H   YD F P   G   +   R+ASF   L D   GGET FP
Sbjct: 134 RYTAGQRFNVHHDWYDTFQPDRAGGRDRRWNRVASFFAILEDECVGGETWFP 185


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/149 (28%), Positives = 69/149 (46%), Gaps = 13/149 (8%)

Query: 74  PFQV--LSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSG 131
           PF+V  L + P A+ F +  S E+ + I   A  RLK + +   +   +E T   R S  
Sbjct: 319 PFKVEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELE-TAAYRISKS 377

Query: 132 TFISASEDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQ 191
            ++   + +  +++ I  +I   T L Q   E   +  Y +G  YD H+D     E  P+
Sbjct: 378 AWLKGGDHE--LIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEE--PK 433

Query: 192 ------MSQRLASFLLYLSDVEEGGETMF 214
                    RLA+ L YL++ E GG T+F
Sbjct: 434 AFESLGTGNRLATVLFYLTEPEIGGGTVF 462


>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_a
           [Homo sapiens]
          Length = 657

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 72/150 (48%), Gaps = 10/150 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 366 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 422

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 423 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 480

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMF 214
              Y  +   R+A+F++YLS VE GG T F
Sbjct: 481 SPLYRMKSGNRVATFMIYLSSVEAGGATAF 510


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 307 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 363

Query: 130 SGTFISASEDKTGILELIEHKIARATMLPQT--HGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 364 KSAWLKDTVDPK--LVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 421

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 422 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 453


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|307108823|gb|EFN57062.1| hypothetical protein CHLNCDRAFT_143806 [Chlorella variabilis]
          Length = 514

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 62/142 (43%), Gaps = 19/142 (13%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  L    F S E+C  + A     +K S++      +       RTS GTF++    + 
Sbjct: 287 PCLLLADAFLSPEECGEVRALGAPHMKRSKV------SAGDETPLRTSWGTFLTGPLAQQ 340

Query: 142 GILELIEHKIARATMLPQTH--------GEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
            +   +E ++ +   L            GEA  ++RY+ GQ Y  H D           S
Sbjct: 341 PVAARLEGRVRQLAALACEAEGRRALQLGEATQIVRYDPGQFYALHLD-----NRAGDSS 395

Query: 194 QRLASFLLYLSDVEEGGETMFP 215
           +R A+ ++Y+SDVE GG T FP
Sbjct: 396 RRAATVMIYISDVEAGGATHFP 417


>gi|219109949|ref|XP_002176727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411262|gb|EEC51190.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 137

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 60/136 (44%), Gaps = 9/136 (6%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKG-TRTSSGTFISASEDK 140
           P     P F S  +C ++   A +R+  +    +  +T    K  +RTS+ T +   E  
Sbjct: 8   PVVWIVPGFLSESECDALQRKASRRM--TSCVTKNADTSRVYKDLSRTSTNTNVPRREVP 65

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNP--AEYGPQMSQRLAS 198
           T + +L    +       +   E   VLRY+ GQ +  H D F+      G   S RL +
Sbjct: 66  TVVSKLKNLTLCD----DEARFEILQVLRYQAGQHFSPHTDGFSGPITACGFWDSGRLVT 121

Query: 199 FLLYLSDVEEGGETMF 214
              YL+DVE+GG T F
Sbjct: 122 VFCYLNDVEKGGTTRF 137


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 64/125 (51%), Gaps = 7/125 (5%)

Query: 94  EQCQSIIATAKKRLKPSQLALRQGET-VESTKGTRTSSGTFISASEDKTGILELIEHKIA 152
           E+ + I   AK +L  ++  +R  +T V +    R S  +++   ED   ++  +  ++ 
Sbjct: 2   EEIERIKEIAKPKL--ARATVRDPKTGVLTVASYRVSKSSWLE--EDDDPVVARVNRRMQ 57

Query: 153 RATMLPQTHGEAFNVLRYEIGQKYDSHYD-AFNPAEYGPQM-SQRLASFLLYLSDVEEGG 210
             T L     E   V  Y +G +Y+ H+D +  P + G +    RLA+FL Y+SDVE GG
Sbjct: 58  HITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGG 117

Query: 211 ETMFP 215
            T+FP
Sbjct: 118 ATVFP 122


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 15/143 (10%)

Query: 81  RPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDK 140
           +PR + F +  S  + + +   AK RL+ + ++      +E+    R S   ++S  E+ 
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVH-YRISKSAWLSGYENP 392

Query: 141 TGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQM 192
             ++  I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G   
Sbjct: 393 --VVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTG--- 447

Query: 193 SQRLASFLLYLSDVEEGGETMFP 215
             R+A++L Y+SDV  GG T+FP
Sbjct: 448 -NRIATWLFYMSDVSAGGATVFP 469


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 318 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 374

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 375 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 432

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 433 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 464


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 62/139 (44%), Gaps = 7/139 (5%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           PR + + +     + + I   A+ RLK + +   +   +E     R S   ++   ED  
Sbjct: 44  PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFAD-YRISKSAWLKEHEDV- 101

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS----QRLA 197
            ++  +  ++   T L     E   V+ Y +G  YD HYD     E     S     R+A
Sbjct: 102 -VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIA 160

Query: 198 SFLLYLSDVEEGGETMFPF 216
           + L Y+SDV +GG T+FP+
Sbjct: 161 TVLFYMSDVAQGGATVFPW 179


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 15/141 (10%)

Query: 82  PRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISASEDKT 141
           P  + + + AS ++ +++   AK RL+ + +   Q   + +T   R S   ++ + E   
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKL-TTAQYRVSKSAWLGSHEHP- 381

Query: 142 GILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYD--------AFNPAEYGPQMS 193
            I++ I  +I   T L  +  E   V  Y +G +Y+ H+D        AF     G    
Sbjct: 382 -IVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTG---- 436

Query: 194 QRLASFLLYLSDVEEGGETMF 214
            R+A++LLY+SDV+ GG T+F
Sbjct: 437 NRIATWLLYMSDVQAGGNTVF 457


>gi|407917017|gb|EKG10344.1| Oxoglutarate/iron-dependent oxygenase [Macrophomina phaseolina MS6]
          Length = 289

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 67/145 (46%), Gaps = 19/145 (13%)

Query: 76  QVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTSSGTFIS 135
            +LS  P  +Y P+F SA++ + ++  ++ + KPS +     E ++ +   R S      
Sbjct: 67  HILSRDPLVIYIPDFLSADEAKHMVDISEGKFKPSTVWTGGEERLDES--VRVSE----K 120

Query: 136 ASEDKTGILELIEHKIAR--ATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAEYGPQMS 193
           A  D+  +++ IE + AR      P T  E     RY  G  Y  H+D       GP  S
Sbjct: 121 AEIDRDTVVQCIEER-ARLFQGWRPYTFIEKLWTQRYGPGGHYVHHFD-------GPTTS 172

Query: 194 Q---RLASFLLYLSDVEEGGETMFP 215
               R++SF++YL     GG T FP
Sbjct: 173 HQGGRISSFMVYLDANCTGGGTNFP 197


>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
          Length = 604

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 10/152 (6%)

Query: 70  IQSIPFQVLSWRPRALYFPNFASAEQCQSIIATAKKRLKPSQLALRQGETVESTKGTRTS 129
           +Q I  +V+   P    + +F S  + Q I   A+  L+ S +A   GE     +  R S
Sbjct: 334 LQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVA--SGEKQLQVE-YRIS 390

Query: 130 SGTFISASEDKTGILELIEHKIARATMLP--QTHGEAFNVLRYEIGQKYDSHYD-AFNPA 186
              ++  + D    L  + H+IA  T L     + E   V+ Y IG  Y+ H+D A +P+
Sbjct: 391 KSAWLKDTVDPK--LVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPS 448

Query: 187 E--YGPQMSQRLASFLLYLSDVEEGGETMFPF 216
              Y  +   R+A+F++YLS VE GG T F +
Sbjct: 449 SPLYRMKSGNRVATFMIYLSSVEAGGATAFIY 480


>gi|428183403|gb|EKX52261.1| hypothetical protein GUITHDRAFT_84780 [Guillardia theta CCMP2712]
          Length = 199

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 19/140 (13%)

Query: 82  PRALYFPNFASAEQCQSI----IATAKKRLKPSQLALRQGETVESTKGTRTSSGTFISAS 137
           PR     N  S E+C+++    IA   KR   S +            G         +  
Sbjct: 42  PRIFVLHNLLSKEECENLRDLGIARGMKRNAQSPVL-----------GDDPRKHEVATLD 90

Query: 138 EDKTGILELIEHKIARATMLPQTHGEAFNVLRYEIGQKYDSHYDAFNPAE---YGPQMSQ 194
            ++   +  +E K+A  T    +HGEAF ++RY     Y  H D  +PA+    G   S+
Sbjct: 91  FNENDFVRRLEDKLANLTRTSSSHGEAFQIIRYAQSDFYPEHVDYIDPAKSDLLGKGKSE 150

Query: 195 RLASFLLYLSDVEEGGETMF 214
            +A+ ++YL   + GGET F
Sbjct: 151 -IATVIIYLKSADSGGETFF 169


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.133    0.385 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,495,929,268
Number of Sequences: 23463169
Number of extensions: 135441501
Number of successful extensions: 306749
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 598
Number of HSP's successfully gapped in prelim test: 1048
Number of HSP's that attempted gapping in prelim test: 304506
Number of HSP's gapped (non-prelim): 1790
length of query: 227
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 90
effective length of database: 9,144,741,214
effective search space: 823026709260
effective search space used: 823026709260
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 74 (33.1 bits)